TY - GEN
T1 - Auto-ACE
T2 - 10th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2024
AU - Bai, Zhixin
AU - Wang, Bingbing
AU - Liang, Bin
AU - Xu, Ruifeng
N1 - Publisher Copyright:
© 2024 Association for Computational Linguistics
PY - 2024
Y1 - 2024
N2 - Conversational question answering aims to respond to questions based on relevant contexts and previous question-answer history. Existing studies typically use ground-truth answers in history, leading to the inconsistency between the training and inference phases. However, in real-world scenarios, progress in question answering can only be made using predicted answers. Since not all predicted answers are correct, indiscriminately using all predicted answers for training introduces noise into the model. To tackle these challenges, we propose an automatic answer correctness evaluation method named Auto-ACE. Specifically, we first construct an Att-BERT model which employs attention weight to the BERT model, so as to bridge the relation between the current question and the question-answer pair in history. Furthermore, to reduce the interference of the irrelevant information in the predicted answer, A-Scorer, an answer scorer is designed to evaluate the confidence of the predicted answer. We conduct a series of experiments on QuAC and CoQA datasets, and the results demonstrate the effectiveness and practicality of our proposed Auto-ACE framework.
AB - Conversational question answering aims to respond to questions based on relevant contexts and previous question-answer history. Existing studies typically use ground-truth answers in history, leading to the inconsistency between the training and inference phases. However, in real-world scenarios, progress in question answering can only be made using predicted answers. Since not all predicted answers are correct, indiscriminately using all predicted answers for training introduces noise into the model. To tackle these challenges, we propose an automatic answer correctness evaluation method named Auto-ACE. Specifically, we first construct an Att-BERT model which employs attention weight to the BERT model, so as to bridge the relation between the current question and the question-answer pair in history. Furthermore, to reduce the interference of the irrelevant information in the predicted answer, A-Scorer, an answer scorer is designed to evaluate the confidence of the predicted answer. We conduct a series of experiments on QuAC and CoQA datasets, and the results demonstrate the effectiveness and practicality of our proposed Auto-ACE framework.
UR - https://www.scopus.com/pages/publications/85204880773
U2 - 10.18653/v1/2024.sighan-1.9
DO - 10.18653/v1/2024.sighan-1.9
M3 - 会议稿件
AN - SCOPUS:85204880773
T3 - SIGHAN 2024 - 10th SIGHAN Workshop on Chinese Language Processing, Proceedings of the Workshop
SP - 80
EP - 87
BT - SIGHAN 2024 - 10th SIGHAN Workshop on Chinese Language Processing, Proceedings of the Workshop
A2 - Wong, Kam-Fai
A2 - Zhang, Min
A2 - Xu, Ruifeng
A2 - Li, Jing
A2 - Wei, Zhongyu
A2 - Gui, Lin
A2 - Liang, Bin
A2 - Zhao, Runcong
PB - Association for Computational Linguistics (ACL)
Y2 - 16 August 2024
ER -