Skip to main navigation Skip to search Skip to main content

MFCRank: A Web ranking algorithm based on correlation of multiple features

  • Harbin Institute of Technology Shenzhen
  • The University of Hong Kong

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents a new ranking algorithm MFCRank for topic-specific Web search systems. The basic idea is to correlate two types of similarity information into a unified link analysis model so that the rich content and link features in Web collections can be exploited efficiently to improve the ranking performance. First, a new surfer model JBC is proposed, under which the topic similarity information among neighborhood pages is used to weigh the jumping probability of the surfer and to direct the surfing activities. Secondly, as JBC surfer model is still query-independent, a correlation between the query and JBC is essential. This is implemented by the definition of MFCRank score, which is the linear combination of JBC score and the similarity value between the query and the matched pages. Through the two correlation steps, the features contained in the plain text, link structure, anchor text and user query can be smoothly correlated in one single ranking model. Ranking experiments have been carried out on a set of topic-specific Web page collections. Experimental results showed that our algorithm gained great improvement with regard to the ranking precision.

Original languageEnglish
Title of host publicationComputational Linguistics and Intelligent Text Processing - 7th International Conference, CICLing 2006, Proceedings
PublisherSpringer Verlag
Pages378-388
Number of pages11
ISBN (Print)3540322051, 9783540322054
DOIs
StatePublished - 2006
Externally publishedYes
Event7th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2006 - Mexico City, Mexico
Duration: 19 Feb 200625 Feb 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3878 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2006
Country/TerritoryMexico
CityMexico City
Period19/02/0625/02/06

Keywords

  • Link Analysis
  • PageRank
  • Ranking
  • Search Engine
  • Web

Fingerprint

Dive into the research topics of 'MFCRank: A Web ranking algorithm based on correlation of multiple features'. Together they form a unique fingerprint.

Cite this