Skip to main navigation Skip to search Skip to main content

Web topic detection using a ranked clustering-like pattern across similarity cascades

  • Fei Jia
  • , Junbiao Pang
  • , Weigang Zhang
  • , Guorong Li
  • , Chunjie Zhang
  • , Qingming Huang
  • , Yugui Liu
  • University of Chinese Academy of Sciences
  • Beijing University of Technology
  • School of Computer Science and Technology, Harbin Institute of Technology
  • Chinese Academy of Sciences

Research output: Contribution to journalConference articlepeer-review

Abstract

In multi-media and social media communities, web topic detection poses two main difficulties that conventional approaches can barely handle: 1) there are large inter-topic variations among web topics; 2) supervised information is rare to identify the real topics. In this paper, we address these problems from the similarity diffusion perspective among objects on web, and present a clustering-like pattern across similarity cascades (SCs). SCs are a series of subgraphs generated by truncating a weighted graph with a set of thresholds, and then maximal cliques are used to describe the topic candidates. Poisson deconvolution is adopted to efficiently identify the real topics from these topic candidates. Experiments demonstrate that our approach outperforms the state-of-the-arts on two datasets. In addition, we report accuracy v.s. false positives per topic (FPPT) curves for performance evaluation. To our knowledge, this is the first complete evaluation of web topic detection at the topic-wise level, and it establishes a new benchmark for this problem.

Original languageEnglish
Article number6890261
JournalProceedings - IEEE International Conference on Multimedia and Expo
Volume2014-September
Issue numberSeptmber
DOIs
StatePublished - 3 Sep 2014
Externally publishedYes
Event2014 IEEE International Conference on Multimedia and Expo, ICME 2014 - Chengdu, China
Duration: 14 Jul 201418 Jul 2014

Keywords

  • Poisson process
  • Web Topic detection
  • maximal cliques
  • similarity cascade
  • unsupervised ranking

Fingerprint

Dive into the research topics of 'Web topic detection using a ranked clustering-like pattern across similarity cascades'. Together they form a unique fingerprint.

Cite this