Skip to main navigation Skip to search Skip to main content

Character-Level Syntax Infusion in Pre-Trained Models for Chinese Semantic Role Labeling

  • Yuxuan Wang
  • , Zhilin Lei
  • , Wanxiang Che*
  • *Corresponding author for this work
  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Semantic role labeling (SRL) aims at identifying the predicate-argument structure of a sentence. Recent work has significantly improved SRL performance by incorporating syntactic information and exploiting pre-trained models like BERT. Most of them use pre-trained models as isolated encoders to obtain word embeddings and enhance them with word-level syntax. Unlike many other languages, Chinese pre-trained models normally use Chinese characters instead of subwords as the basic input units, making the many-units-in-one-word phenomena more frequent and the relationship between characters more important. However, this character-level information is often ignored by previous research. In this paper, we propose the Character-Level Syntax-Infused network for Chinese SRL, which effectively incorporates the syntactic information between Chinese characters into pre-trained models. Experiments on the Chinese benchmarks of CoNLL-2009 and Universal Proposition Bank (UPB) show that the proposed approach achieves state-of-the-art results.

Original languageEnglish
Pages (from-to)3503-3515
Number of pages13
JournalInternational Journal of Machine Learning and Cybernetics
Volume12
Issue number12
DOIs
StatePublished - Dec 2021

Keywords

  • Pre-Trained Models
  • Semantic Role Labeling
  • Syntax Infusion

Fingerprint

Dive into the research topics of 'Character-Level Syntax Infusion in Pre-Trained Models for Chinese Semantic Role Labeling'. Together they form a unique fingerprint.

Cite this