Skip to main navigation Skip to search Skip to main content

Radical-enhanced chinese character embedding

  • Yaming Sun
  • , Lei Lin*
  • , Nan Yang
  • , Zhenzhou Ji
  • , Xiaolong Wang
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Microsoft USA

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we present a method to leverage radical for learning Chinese character embedding. Radical is a semantic and phonetic component of Chinese character. It plays an important role for modelling character semantics as characters with the same radical usually have similar semantic meaning and grammatical usage. However, most existing character (or word) embedding learning algorithms typically only model the syntactic contexts but ignore the radical information. As a result, they do not explicitly capture the inner semantic connections of characters via radical into the embedding space of characters. To solve this problem, we propose to incorporate the radical information for enhancing the Chinese character embedding. We present a dedicated neural architecture with a hybrid loss function, and integrate the radical information through softmax upon each character. To verify the effectiveness of the learned character embedding, we apply it on Chinese word segmentation. Experiment results on two benchmark datasets show that, our radical-enhanced method outperforms two widely-used context-based embedding learning algorithms.

Original languageEnglish
Title of host publicationNeural Information Processing - 21st International Conference, ICONIP 2014, Proceedings
EditorsChu Kiong Loo, Keem Siah Yap, Kok Wai Wong, Andrew Teoh, Kaizhu Huang
PublisherSpringer Verlag
Pages279-286
Number of pages8
ISBN (Electronic)9783319126395
DOIs
StatePublished - 2014
Event21st International Conference on Neural Information Processing, ICONIP 2014 - Kuching, Malaysia
Duration: 3 Nov 20146 Nov 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8835
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Neural Information Processing, ICONIP 2014
Country/TerritoryMalaysia
CityKuching
Period3/11/146/11/14

Keywords

  • Chinese character embedding
  • Neural network
  • Radical

Fingerprint

Dive into the research topics of 'Radical-enhanced chinese character embedding'. Together they form a unique fingerprint.

Cite this