Skip to main navigation Skip to search Skip to main content

Online topic detection and tracking of financial news based on hierarchical clustering

  • Xiang Ying Dai*
  • , Qing Cai Chen
  • , Xiao Long Wang
  • , Jun Xu
  • *Corresponding author for this work
  • Harbin Institute of Technology Shenzhen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we apply TDT technology to the vertical search engine in the financial field. The returned results are grouped into several topics with the stock as the unit. Then we show the topics to the users in time series order. As a result, users can easily learn about the important events which belong to a stock. Moreover, the causes and the effects of these events can also be found out easily. We improve the common agglomerative hierarchical clustering algorithm based on average-link method, which is then used to implement the retrospective topic detection and the online topic detection of news stories of the stocks. Additionally, the improved single pass clustering algorithm is employed to accomplish topic tracking. We consider that the feature terms which occur in the title of a news story contribute more during the similarity calculation and increase their corresponding weights. Experiments are performed on two datasets which are annotated by human judgment. The results show that the proposed method can effectively detect and track the online financial topics.

Original languageEnglish
Title of host publication2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
Pages3341-3346
Number of pages6
DOIs
StatePublished - 2010
Externally publishedYes
Event2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010 - Qingdao, China
Duration: 11 Jul 201014 Jul 2010

Publication series

Name2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
Volume6

Conference

Conference2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
Country/TerritoryChina
CityQingdao
Period11/07/1014/07/10

Keywords

  • Agglomerative hierarchical clustering
  • Topic detection and tracking
  • Vector space model

Fingerprint

Dive into the research topics of 'Online topic detection and tracking of financial news based on hierarchical clustering'. Together they form a unique fingerprint.

Cite this