TY - GEN
T1 - Ask and Retrieve Knowledge
T2 - 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2025
AU - Zhang, Bolin
AU - Wang, Shengwei
AU - Jiang, Yangqin
AU - Sui, Dianbo
AU - Tu, Zhiying
AU - Chu, Dianhui
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/7/13
Y1 - 2025/7/13
N2 - Large language models (LLMs) cannot effectively collaborate with humans who provide imperfect information at the initial stage of the dialogue, unless they learn to proactively ask questions. Our core idea is to enable LLMs to decide whether to take the action of "ask" or "tell" at each turn by self-reasoning, with the belief of the decisions enhanced by retrieving knowledge related to the user input. Thus, we propose the ask and retrieve knowledge framework (Ark), where LLMs think through what to retrieve, when to stop retrieving, and then take actions accordingly. Ark is used to produce the action paths for model training. To mitigate the collapse of models trained on synthetic data, we propose a progressive training strategy: self-reason learning by supervised fine-tuning on produced paths and knowledge alignment through direct preference optimization on doctor response. To evaluate the information gain brought by the ask action, we design a method to calculate the ask utility value (AUV) based on the expected value of perfect information (EVPI) theory. Although MedArk is trained using synthetic data from GPT-4o-mini, it highly outperforms GPT-4o and other medical LLMs in six aspects: helpfulness, hallucination, action selection, BERTScore, AUV, and asking correctness. MedArk also achieves SOTA results in the perfect information scenario, i.e., medical examinations. We release our code, data and models at https://github.com/Bolin97/MedArk.
AB - Large language models (LLMs) cannot effectively collaborate with humans who provide imperfect information at the initial stage of the dialogue, unless they learn to proactively ask questions. Our core idea is to enable LLMs to decide whether to take the action of "ask" or "tell" at each turn by self-reasoning, with the belief of the decisions enhanced by retrieving knowledge related to the user input. Thus, we propose the ask and retrieve knowledge framework (Ark), where LLMs think through what to retrieve, when to stop retrieving, and then take actions accordingly. Ark is used to produce the action paths for model training. To mitigate the collapse of models trained on synthetic data, we propose a progressive training strategy: self-reason learning by supervised fine-tuning on produced paths and knowledge alignment through direct preference optimization on doctor response. To evaluate the information gain brought by the ask action, we design a method to calculate the ask utility value (AUV) based on the expected value of perfect information (EVPI) theory. Although MedArk is trained using synthetic data from GPT-4o-mini, it highly outperforms GPT-4o and other medical LLMs in six aspects: helpfulness, hallucination, action selection, BERTScore, AUV, and asking correctness. MedArk also achieves SOTA results in the perfect information scenario, i.e., medical examinations. We release our code, data and models at https://github.com/Bolin97/MedArk.
KW - Active Inquiry
KW - Conversational IR
KW - Medical Consultation
KW - RAG
UR - https://www.scopus.com/pages/publications/105011827992
U2 - 10.1145/3726302.3729898
DO - 10.1145/3726302.3729898
M3 - 会议稿件
AN - SCOPUS:105011827992
T3 - SIGIR 2025 - Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval
SP - 1055
EP - 1065
BT - SIGIR 2025 - Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval
PB - Association for Computing Machinery, Inc
Y2 - 13 July 2025 through 18 July 2025
ER -