Skip to main navigation Skip to search Skip to main content

Clinical acronym/abbreviation normalization using a hybrid approach

  • Yonghui Wu
  • , Buzhou Tang
  • , Min Jiang
  • , Sungrim Moon
  • , Joshua C. Denny
  • , Hua Xu*
  • *Corresponding author for this work
  • University of Texas Health Science Center at Houston
  • Vanderbilt University

Research output: Contribution to journalConference articlepeer-review

Abstract

A unique characteristic of clinical text is the pervasive use of acronyms and abbreviations, which are often ambiguous. The ShARe/CLEF eHealth Evaluation Lab organized three shared tasks on clinical natural language processing (NLP) and information retrieval (IR) in 2013 and one of them was to normalize acronyms/abbreviations to UMLS concept unique identifiers (CUIs). This paper describes a hybrid system, which combines different Word Sense Disambiguation (WSD) methods and existing knowledge bases to normalize and encode clinical abbreviations. Our system achieved the best accuracy of 0.719 on the independent test set, which was ranked first in the challenge.

Original languageEnglish
JournalCEUR Workshop Proceedings
Volume1179
StatePublished - 2013
Externally publishedYes
Event2013 Cross Language Evaluation Forum Conference, CLEF 2013 - Valencia, Spain
Duration: 23 Sep 201326 Sep 2013

Keywords

  • Clinical abbreviation
  • Support vector machines
  • Vector space model
  • Word sense disambiguation

Fingerprint

Dive into the research topics of 'Clinical acronym/abbreviation normalization using a hybrid approach'. Together they form a unique fingerprint.

Cite this