Abstract
A unique characteristic of clinical text is the pervasive use of acronyms and abbreviations, which are often ambiguous. The ShARe/CLEF eHealth Evaluation Lab organized three shared tasks on clinical natural language processing (NLP) and information retrieval (IR) in 2013 and one of them was to normalize acronyms/abbreviations to UMLS concept unique identifiers (CUIs). This paper describes a hybrid system, which combines different Word Sense Disambiguation (WSD) methods and existing knowledge bases to normalize and encode clinical abbreviations. Our system achieved the best accuracy of 0.719 on the independent test set, which was ranked first in the challenge.
| Original language | English |
|---|---|
| Journal | CEUR Workshop Proceedings |
| Volume | 1179 |
| State | Published - 2013 |
| Externally published | Yes |
| Event | 2013 Cross Language Evaluation Forum Conference, CLEF 2013 - Valencia, Spain Duration: 23 Sep 2013 → 26 Sep 2013 |
Keywords
- Clinical abbreviation
- Support vector machines
- Vector space model
- Word sense disambiguation
Fingerprint
Dive into the research topics of 'Clinical acronym/abbreviation normalization using a hybrid approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver