TY - GEN
T1 - DS2-ABSA
T2 - 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025
AU - Xu, Hongling
AU - Zhang, Yice
AU - Wang, Qianlong
AU - Xu, Ruifeng
N1 - Publisher Copyright:
© 2025 Association for Computational Linguistics.
PY - 2025
Y1 - 2025
N2 - Recently developed large language models (LLMs) have presented promising new avenues to address data scarcity in low-resource scenarios. In few-shot aspect-based sentiment analysis (ABSA), previous efforts have explored data augmentation techniques, which prompt LLMs to generate new samples by modifying existing ones. However, these methods fail to produce adequately diverse data, impairing their effectiveness. Additionally, some studies apply in-context learning for ABSA by using specific instructions and a few selected examples as prompts. Though promising, LLMs often yield labels that deviate from task requirements. To overcome these limitations, we propose DS2ABSA, a dual-stream data synthesis framework targeted for few-shot ABSA. It leverages LLMs to synthesize data from two complementary perspectives: key-point-driven and instance-driven, which effectively generate diverse and high-quality ABSA samples in low-resource settings. Furthermore, a label refinement module is integrated to improve the synthetic labels. Extensive experiments demonstrate that DS2ABSA significantly outperforms previous few-shot ABSA solutions and other LLM-oriented data generation methods. Our code and synthetic data are available at https://github.com/HITSZ-HLT/DS2-ABSA.
AB - Recently developed large language models (LLMs) have presented promising new avenues to address data scarcity in low-resource scenarios. In few-shot aspect-based sentiment analysis (ABSA), previous efforts have explored data augmentation techniques, which prompt LLMs to generate new samples by modifying existing ones. However, these methods fail to produce adequately diverse data, impairing their effectiveness. Additionally, some studies apply in-context learning for ABSA by using specific instructions and a few selected examples as prompts. Though promising, LLMs often yield labels that deviate from task requirements. To overcome these limitations, we propose DS2ABSA, a dual-stream data synthesis framework targeted for few-shot ABSA. It leverages LLMs to synthesize data from two complementary perspectives: key-point-driven and instance-driven, which effectively generate diverse and high-quality ABSA samples in low-resource settings. Furthermore, a label refinement module is integrated to improve the synthetic labels. Extensive experiments demonstrate that DS2ABSA significantly outperforms previous few-shot ABSA solutions and other LLM-oriented data generation methods. Our code and synthetic data are available at https://github.com/HITSZ-HLT/DS2-ABSA.
UR - https://www.scopus.com/pages/publications/105021031527
U2 - 10.18653/v1/2025.acl-long.752
DO - 10.18653/v1/2025.acl-long.752
M3 - 会议稿件
AN - SCOPUS:105021031527
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 15460
EP - 15478
BT - Long Papers
A2 - Che, Wanxiang
A2 - Nabende, Joyce
A2 - Shutova, Ekaterina
A2 - Pilehvar, Mohammad Taher
PB - Association for Computational Linguistics (ACL)
Y2 - 27 July 2025 through 1 August 2025
ER -