Skip to main navigation Skip to search Skip to main content

Semi-supervised learning framework for cross-lingual projection

  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Cross-lingual projection encounters two major challenges, the noise from word-alignment error and the syntactic divergences between two languages. To solve these two problems, a semi-supervised learning framework of cross-lingual projection is proposed to get better annotations using parallel data. Moreover, a projection model is introduced to model the projection process of labeling from the resource-rich language to the resource-scarce language. The projection model, together with the traditional target model of cross-lingual projection, can be seen as two views of parallel data. Utilizing these two views, an extension of co-training algorithm to structured predictions is designed to boost the result of the two models. Experiments show that the proposed cross-lingual projection method improves the accuracy in the task of POS-tagging projection. And using only one-to-one alignments proves to lead to more accurate results than using all kinds of alignment information.

Original languageEnglish
Title of host publicationProceedings - 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2011
Pages213-216
Number of pages4
DOIs
StatePublished - 2011
Externally publishedYes
Event2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2011 - Lyon, France
Duration: 22 Aug 201127 Aug 2011

Publication series

NameProceedings - 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2011
Volume3

Conference

Conference2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2011
Country/TerritoryFrance
CityLyon
Period22/08/1127/08/11

Keywords

  • Co-training
  • Pos tagging
  • Semi-supervised learning
  • Structured predictions

Fingerprint

Dive into the research topics of 'Semi-supervised learning framework for cross-lingual projection'. Together they form a unique fingerprint.

Cite this