Skip to main navigation Skip to search Skip to main content

HIT at SemEval-2022 Task 2: Pre-trained Language Model for Idioms Detection

  • Zheng Chu
  • , Ziqing Yang
  • , Yiming Cui
  • , Zhigang Chen
  • , Ming Liu
  • Harbin Institute of Technology
  • IFLYTEK Co., Ltd.

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The same multi-word expressions may have different meanings in different sentences. They can be mainly divided into two categories, which are literal meaning and idiomatic meaning. Non-contextual-based methods perform poorly on this problem, and we need contextual embedding to understand the idiomatic meaning of multi-word expressions correctly. We use a pre-trained language model, which can provide a context-aware sentence embedding, to detect whether multi-word expression in the sentence is idiomatic usage.

Original languageEnglish
Title of host publicationSemEval 2022 - 16th International Workshop on Semantic Evaluation, Proceedings of the Workshop
EditorsGuy Emerson, Natalie Schluter, Gabriel Stanovsky, Ritesh Kumar, Alexis Palmer, Nathan Schneider, Siddharth Singh, Shyam Ratan
PublisherAssociation for Computational Linguistics (ACL)
Pages221-227
Number of pages7
ISBN (Electronic)9781955917803
DOIs
StatePublished - 2022
Event16th International Workshop on Semantic Evaluation, SemEval 2022 - Seattle, United States
Duration: 14 Jul 202215 Jul 2022

Publication series

NameSemEval 2022 - 16th International Workshop on Semantic Evaluation, Proceedings of the Workshop

Conference

Conference16th International Workshop on Semantic Evaluation, SemEval 2022
Country/TerritoryUnited States
CitySeattle
Period14/07/2215/07/22

Fingerprint

Dive into the research topics of 'HIT at SemEval-2022 Task 2: Pre-trained Language Model for Idioms Detection'. Together they form a unique fingerprint.

Cite this