TY - GEN
T1 - Multi-stage Chinese collocation extraction
AU - Xu, Rui Feng
AU - Lu, Qin
PY - 2005
Y1 - 2005
N2 - A collocation is a recurrent and conventional natural language expression. In this research, Chinese collocations are categorized into four types. Based on statistical analysis of typical collocations of each type, a multi-stage window-based collocation extraction system is designed, in which lexical statistics, synonym information, syntactic information, and dependency knowledge are used to extract n-gram collocations and different types of bi-gram collocations separately. Experimental results show that this system achieves better precision and recall than existing statistical collocation extraction techniques.
AB - A collocation is a recurrent and conventional natural language expression. In this research, Chinese collocations are categorized into four types. Based on statistical analysis of typical collocations of each type, a multi-stage window-based collocation extraction system is designed, in which lexical statistics, synonym information, syntactic information, and dependency knowledge are used to extract n-gram collocations and different types of bi-gram collocations separately. Experimental results show that this system achieves better precision and recall than existing statistical collocation extraction techniques.
KW - Collocation extraction
KW - Multi-stage extraction
KW - Natural language processing
UR - https://www.scopus.com/pages/publications/28444487992
M3 - Conference contribution
AN - SCOPUS:28444487992
SN - 078039092X
SN - 9780780390928
T3 - 2005 International Conference on Machine Learning and Cybernetics, ICMLC 2005
SP - 3254
EP - 3259
BT - 2005 International Conference on Machine Learning and Cybernetics, ICMLC 2005
T2 - International Conference on Machine Learning and Cybernetics, ICMLC 2005
Y2 - 18 August 2005 through 21 August 2005
ER -