Skip to main navigation Skip to search Skip to main content

Emerging topic identification from app reviews via adaptive online biterm topic modeling

  • Wan Zhou
  • , Yong Wang*
  • , Cuiyun Gao
  • , Fei Yang
  • *Corresponding author for this work
  • Anhui Polytechnic University
  • Nanjing University
  • Harbin Institute of Technology
  • Zhejiang Lab

Research output: Contribution to journalArticlepeer-review

Abstract

Emerging topics in app reviews highlight the topics (e.g., software bugs) with which users are concerned during certain periods. Identifying emerging topics accurately, and in a timely manner, could help developers more effectively update apps. Methods for identifying emerging topics in app reviews based on topic models or clustering methods have been proposed in the literature. However, the accuracy of emerging topic identification is reduced because reviews are short in length and offer limited information. To solve this problem, an improved emerging topic identification (IETI) approach is proposed in this work. Specifically, we adopt natural language processing techniques to reduce noisy data, and identify emerging topics in app reviews using the adaptive online biterm topic model. Then we interpret the implicature of emerging topics through relevant phrases and sentences. We adopt the official app changelogs as ground truth, and evaluate IETI in six common apps. The experimental results indicate that IETI is more accurate than the baseline in identifying emerging topics, with improvements in the F1 score of 0.126 for phrase labels and 0.061 for sentence labels. Finally, we release the codes of IETI on Github (https://github.com/wanizhou/IETI).

Translated title of the contribution基于自适应在线双词主题模型的应用程序评论新兴主题识别
Original languageEnglish
Pages (from-to)678-691
Number of pages14
JournalFrontiers of Information Technology and Electronic Engineering
Volume23
Issue number5
DOIs
StatePublished - May 2022
Externally publishedYes

Keywords

  • App reviews
  • Emerging topic identification
  • Natural language processing
  • TP311.5
  • Topic model

Fingerprint

Dive into the research topics of 'Emerging topic identification from app reviews via adaptive online biterm topic modeling'. Together they form a unique fingerprint.

Cite this