TY - GEN
T1 - How to Answer Comparison Questions
AU - Tang, Hongxuan
AU - Hong, Yu
AU - Chen, Xin
AU - Wu, Kaili
AU - Zhang, Min
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - 'Which city has the larger population, Tokyo or New York?'. To answer the question, in general, we necessarily obtain the prior knowledge about the populations of both cities, and accordingly determine the answer by numeric comparison. Using Machine Reading Comprehension (MRC) to answer such a question has become a popular research topic, which is referred to as a task of Comparison Question Answering (CQA). In this paper, we propose a novel neural CQA model which is trained to answer comparison question. The model is designed as a sophisticated neural network which performs inference in a step-by-step pipeline, including the steps of attentive entity detection (e.g., 'city'), alignment of comparable attributes (e.g., 'population' of the target 'cities'), contrast calculation (larger or smaller), as well as binary classification of positive and negative answers. The experimentation on HotpotQA illustrates that the proposed method achieves an average F1 score of 63.09%, outperforming the baseline with about 10% F1 scores. In addition, it performs better than a series of competitive models, including DecompRC, BERT.
AB - 'Which city has the larger population, Tokyo or New York?'. To answer the question, in general, we necessarily obtain the prior knowledge about the populations of both cities, and accordingly determine the answer by numeric comparison. Using Machine Reading Comprehension (MRC) to answer such a question has become a popular research topic, which is referred to as a task of Comparison Question Answering (CQA). In this paper, we propose a novel neural CQA model which is trained to answer comparison question. The model is designed as a sophisticated neural network which performs inference in a step-by-step pipeline, including the steps of attentive entity detection (e.g., 'city'), alignment of comparable attributes (e.g., 'population' of the target 'cities'), contrast calculation (larger or smaller), as well as binary classification of positive and negative answers. The experimentation on HotpotQA illustrates that the proposed method achieves an average F1 score of 63.09%, outperforming the baseline with about 10% F1 scores. In addition, it performs better than a series of competitive models, including DecompRC, BERT.
KW - Calculation
KW - Comparison Question Answering
KW - Machine Reading Comprehension
UR - https://www.scopus.com/pages/publications/85083168232
U2 - 10.1109/IALP48816.2019.9037729
DO - 10.1109/IALP48816.2019.9037729
M3 - 会议稿件
AN - SCOPUS:85083168232
T3 - Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019
SP - 337
EP - 342
BT - Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019
A2 - Lan, Man
A2 - Wu, Yuanbin
A2 - Dong, Minghui
A2 - Lu, Yanfeng
A2 - Yang, Yan
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 23rd International Conference on Asian Language Processing, IALP 2019
Y2 - 15 November 2019 through 17 November 2019
ER -