Skip to main navigation Skip to search Skip to main content

Opinion annotation in on-line Chinese product reviews

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents the design and construction of a Chinese opinion corpus. Based on the observation on the characteristics of opinion expression in Chinese online product reviews, which is quite different from in the formal texts such as news, an annotation framework is proposed to guide the construction of an opinion corpus based on online product reviews. The opinionated sentences are manually identified from the review text. Furthermore, for each comment in the opinionated sentences, its 13 describing elements are annotated including the expressions related to the target product attributes and user opinion expressions as well as the polarity and degree of the opinions. Currently, 12,724 comments are annotated in 10,935 sentences from product reviews. Through statistical observation on the opinion corpus, some interesting characteristics of Chinese opinion expression are presented. This corpus is helpful to support systematic research on Chinese opinion analysis.

Original languageEnglish
Title of host publicationProceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008
PublisherEuropean Language Resources Association (ELRA)
Pages1625-1632
Number of pages8
ISBN (Electronic)2951740840, 9782951740846
StatePublished - 2008
Externally publishedYes
Event6th International Conference on Language Resources and Evaluation, LREC 2008 - Marrakech, Morocco
Duration: 28 May 200830 May 2008

Publication series

NameProceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008

Conference

Conference6th International Conference on Language Resources and Evaluation, LREC 2008
Country/TerritoryMorocco
CityMarrakech
Period28/05/0830/05/08

Cite this