Abstract
The flexible nomenclature of gene name results in severe semantic ambiguity, which is an obstacle for deep biomedical text mining. Gene name normalization (GN) is an effective way to resolve this problem. In this work, a multi-level disambiguation framework was proposed to solve gene name normalization problem. Aiming at different ambiguity situations during the procedure of GN, three different strategies were included in the framework. They were dictionary-based gene name detection, machine-learning-based candidate selection, and semantic-based disambiguation. Experimental results showed that the proposed method could achieve 0.746 F-measure on the BioCre-AtIvE2006 GN task test data set.
| Original language | English |
|---|---|
| Pages (from-to) | 193-197 |
| Number of pages | 5 |
| Journal | Zidonghua Xuebao/Acta Automatica Sinica |
| Volume | 35 |
| Issue number | 2 |
| DOIs | |
| State | Published - Feb 2009 |
| Externally published | Yes |
Keywords
- Gene name normalization (GN)
- Maximum entropy model
- Semantic similarity
Fingerprint
Dive into the research topics of 'Multi-level disambiguation framework for gene name normalization'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver