Skip to main navigation Skip to search Skip to main content

Predicting the quality of user-generated answers using co-training in community-based question answering portals

  • Bingquan Liu*
  • , Jian Feng
  • , Ming Liu
  • , Haifeng Hu
  • , Xiaolong Wang
  • *Corresponding author for this work
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Predicting the quality of user-generated answers is definitely of great importance for community-based question answering (CQA) due to the frequent occurrence of low-quality answers. Most existing answer quality prediction works combine non-textual features of user-generated answers directly without considering the diversity of non-textual features. In this paper, we propose two co-training approaches: random subspace split-based co-training (RSS-CoT) and content and social split-based co-training (CS-CoT) to predict the quality of answers by mining the relationships of non-textual features and unlabeled data in CQA. Our results demonstrate that both appropriate combination of non-textual features and unlabeled data can promote the prediction performance of answer quality.

Original languageEnglish
Pages (from-to)29-34
Number of pages6
JournalPattern Recognition Letters
Volume58
DOIs
StatePublished - 1 Jun 2015
Externally publishedYes

Keywords

  • Answer quality predicting
  • Co-training
  • Semi-supervised methods
  • Social features
  • Surface linguistic features

Fingerprint

Dive into the research topics of 'Predicting the quality of user-generated answers using co-training in community-based question answering portals'. Together they form a unique fingerprint.

Cite this