Skip to main navigation Skip to search Skip to main content

Biomedical Named Entity Recognition Model Based on Knowledge Distillation

  • Rong Han
  • , Dequan Zheng*
  • , Feng Yu
  • , Yannan Li
  • , Jia Hu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

As the literature in the biomedical field continues to grow, the identification of biomedical named entities becomes increasingly important. A pre-trained language model called BioBERT was created exclusively for identifying biological named entities, and the performance of using the BioBERT model in extracting biomedical named entity identification has been dramatically improved compared to the previously proposed BERT model. However, due to its large model and many parameters, even more than 110 million. Its drawback is that it is time-consuming and requires significant resources. Consequently, we suggest a knowledge distillation strategy in this study, in which knowledge from a teacher model with a complicated structure is learned by a student model with a simple structure to increase recognition performance. In this paper, the BioBERT model is used as the teacher model and the BiLSTM model is used as the student model, and the best distillation effect is finally found when the weighting factor a = 0.3 by experimentally comparing different weighting factors. At this time, the F1 value of the refined student model is improved by 0.29% compared with the original model.

Original languageEnglish
Title of host publicationBusiness Intelligence and Information Technology - Proceedings of BIIT 2023
EditorsAboul Ella Hassanien, Dequan Zheng, Zhijie Zhao, Zhipeng Fan
PublisherSpringer Science and Business Media Deutschland GmbH
Pages443-451
Number of pages9
ISBN (Print)9789819739790
DOIs
StatePublished - 2023
Externally publishedYes
EventInternational Conference on Business Intelligence and Information Technology, BIIT 2023 - Harbin, China
Duration: 17 Dec 202318 Dec 2023

Publication series

NameSmart Innovation, Systems and Technologies
Volume394 SIST
ISSN (Print)2190-3018
ISSN (Electronic)2190-3026

Conference

ConferenceInternational Conference on Business Intelligence and Information Technology, BIIT 2023
Country/TerritoryChina
CityHarbin
Period17/12/2318/12/23

Keywords

  • Knowledge distillation
  • Named entity recognition
  • Pre-trained language models

Fingerprint

Dive into the research topics of 'Biomedical Named Entity Recognition Model Based on Knowledge Distillation'. Together they form a unique fingerprint.

Cite this