Skip to main navigation Skip to search Skip to main content

Topic tracking based on keywords dependency profile

  • Wei Zheng*
  • , Yu Zhang
  • , Yu Hong
  • , Jili Fan
  • , Ting Liu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Topic tracking is an important task of Topic Detection and Tracking (TDT). Its purpose is to detect stories, from a stream of news, related to known topics. Each topic is "known" by its association with several sample stories that discuss it. In this paper, we propose a new method to build the keywords dependency profile (KDP) of each story and track topic basing on similarity between the profiles of topic and story. In this method, keywords of a story are selected by document summarization technology. The KDP is built by keywords co-occurrence frequency in the same sentences of the story. We demonstrate this profile can describe the core events in a story accurately. Experiments on the mandarin resource of TDT4 and TDT5 show topic tracking system basing on KDP improves the performance by 13.25% on training dataset and 7.49% on testing dataset comparing to baseline.

Original languageEnglish
Title of host publicationInformation Retrieval Technology - 4th Asia Information Retrieval Symposium, AIRS 2008, Revised Selected Papers
Pages129-140
Number of pages12
DOIs
StatePublished - 2008
Externally publishedYes
Event4th Asia Information Retrieval Symposium, AIRS 2008 - Harbin, China
Duration: 15 Jan 200818 Jan 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4993 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference4th Asia Information Retrieval Symposium, AIRS 2008
Country/TerritoryChina
CityHarbin
Period15/01/0818/01/08

Keywords

  • Keywords dependency profile
  • Topic detection and tracking
  • Topic tracking
  • Word co-occurrence

Fingerprint

Dive into the research topics of 'Topic tracking based on keywords dependency profile'. Together they form a unique fingerprint.

Cite this