
Personal data retrieval and disambiguation in web person search

Research output: Contribution to journal › Article › peer-review

Abstract

Web person search often returns web pages related to several distinct namesakes. This paper proposes a new web page model for template-free extraction of personal data and uses a Dirichlet Process Mixture model to disambiguate names. The results show that our method works best on web pages with complex structure.
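
As a rough illustration of why a Dirichlet process mixture suits name disambiguation, the sketch below clusters pages without fixing the number of namesakes in advance. It is not the paper's method: it uses DP-means, a hard-assignment simplification of Dirichlet process mixture inference, and the feature vectors, the `dp_means` function, and the penalty `lam` are all hypothetical names and values chosen for the example.

```python
from typing import List, Sequence, Tuple


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def dp_means(
    points: Sequence[Sequence[float]], lam: float, max_iter: int = 100
) -> Tuple[List[int], List[List[float]]]:
    """Cluster points with DP-means (assumed stand-in for a DP mixture).

    A point opens a new cluster when its squared distance to the nearest
    centroid exceeds lam, so the number of clusters is inferred from data.
    """
    centroids = [list(points[0])]
    labels = [0] * len(points)
    for _ in range(max_iter):
        changed = False
        for i, p in enumerate(points):
            dists = [sq_dist(p, c) for c in centroids]
            k = min(range(len(dists)), key=dists.__getitem__)
            if dists[k] > lam:
                # No existing cluster is close enough: start a new one.
                centroids.append(list(p))
                k = len(centroids) - 1
            if labels[i] != k:
                labels[i] = k
                changed = True
        # Recompute centroids as member means and drop empty clusters.
        members = {}
        for i, k in enumerate(labels):
            members.setdefault(k, []).append(points[i])
        kept = sorted(members)
        remap = {old: new for new, old in enumerate(kept)}
        labels = [remap[k] for k in labels]
        centroids = [
            [sum(col) / len(members[old]) for col in zip(*members[old])]
            for old in kept
        ]
        if not changed:
            break
    return labels, centroids


# Hypothetical 2-D features for five pages returned for one queried name.
pages = [(1.0, 0.0), (0.9, 0.1), (0.0, 1.0), (0.1, 0.9), (5.0, 5.0)]
labels, centroids = dp_means(pages, lam=1.0)
print(labels, len(centroids))
```

Each resulting cluster stands for one inferred namesake. Raising `lam` merges pages into fewer persons and lowering it splits them, which plays a role loosely analogous to the concentration parameter in a full Dirichlet process mixture.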

Original language: English
Pages (from-to): 392-395
Number of pages: 4
Journal: IEICE Transactions on Information and Systems
Issue number: 2
State: Published - Feb 2019
Externally published: Yes

Keywords

  • Deep learning
  • Name disambiguation
  • Sequential block model
  • Web extraction
