Skip to main navigation Skip to search Skip to main content

Query rewriting using statistical machine translation

  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the area of Information Retrieval, user queries often mismatch the documents users exactly want. We regard this problem as a Query Rewriting task from user queries to document space. Using query logs containing query-keywords-CTR pairs, we trained a state-of-the-art statistical machine translation model to translate the user query to keywords of a web document. Using this method we successfully built the "lecical gap" between user queries and document keywords, and got the keywords as rewritings of the queries. We separately use BLUE and CTR-Recall as optimization target to complete eight comparable experiments. CTR-Recall is presented by us as an optimization target and evaluation indicator. It shows that if forcing the same word to be aligned in word alignment and using BLEU as optimization target we get both the best CTR-Recall and BLEU. At the same time using CTR-Recall as optimization target we get both the best CTR-Recall and BLEU too.

Original languageEnglish
Title of host publicationProceedings - International Conference on Machine Learning and Cybernetics
PublisherIEEE Computer Society
Pages814-819
Number of pages6
ISBN (Electronic)9781479902576
DOIs
StatePublished - 2013
Externally publishedYes
Event12th International Conference on Machine Learning and Cybernetics, ICMLC 2013 - Tianjin, China
Duration: 14 Jul 201317 Jul 2013

Publication series

NameProceedings - International Conference on Machine Learning and Cybernetics
Volume2
ISSN (Print)2160-133X
ISSN (Electronic)2160-1348

Conference

Conference12th International Conference on Machine Learning and Cybernetics, ICMLC 2013
Country/TerritoryChina
CityTianjin
Period14/07/1317/07/13

Keywords

  • BLEU
  • CTR-Recall
  • Information Retrieval
  • Query Rewriting
  • Statistic Machine Translation

Fingerprint

Dive into the research topics of 'Query rewriting using statistical machine translation'. Together they form a unique fingerprint.

Cite this