Chinese query reformulation and variation: A case study in Sogou log

  • Xiaochun Wang*
  • Muyun Yang
  • Daren Li
  • Sheng Li
  • Haoliang Qi
  • *Corresponding author for this work
  • School of Computer Science and Technology, Harbin Institute of Technology
  • Heilongjiang Institute of Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Query reformulation and variation are two of the most difficult and essential aspects of information seeking and retrieval. The existing literature investigates only changes in the number of terms in successive queries; it does not analyze the modified positions or the reformulation patterns that arise when users reformulate queries. In this paper, we first propose 8 reformulation classes, based on the modified position and the change in the number of query terms, to analyze query reformulation. Experimental results show that Chinese users prefer to partially reformulate queries into shorter ones. Second, we define the notion of a reformulation pattern and look for the pattern most frequently used by Chinese users. We find that a user who reformulates a query only once in a session is most likely to add terms at the two ends of the query. Finally, we define query variation as the differences among queries that share the same search intent. According to the Sogou log, differences in the level of abstraction are the variation most likely to lead different queries to the same click-through set. These findings benefit the study of information retrieval modeling and can help improve preference prediction accuracy.

Original language: English
Pages (from-to): 251-257
Number of pages: 7
Journal: Journal of Information and Computational Science
Volume: 7
Issue number: 1
State: Published - Jan 2010
Externally published: Yes

Keywords

  • Chinese users
  • Log analysis
  • Query differences
  • Query reformulation
