
Extraction of risk factors for cardiovascular diseases from Chinese electronic medical records

  • Jia Su
  • Jinpeng Hu
  • Jingchi Jiang
  • Jing Xie
  • Yang Yang
  • Bin He
  • Jinfeng Yang
  • Yi Guan*
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Harbin University of Science and Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Background and objective: Early prevention of cardiovascular diseases (CVDs) can effectively prevent later loss of health, and detecting CVD risk factors is a simple way to achieve early prevention. Personal health records play a prominent role in health information extraction because of their factuality and reliability. This study describes how to extract risk factors for CVDs from Chinese electronic medical records (CEMRs). Methods: The extraction process involves two tasks: (a) CVD risk factor recognition and (b) risk factor time and assertion classification. We treated risk factor recognition as a named entity recognition (NER) task and time and assertion classification as a text classification task. We developed an information extraction pipeline consisting of NER and text classification modules built on machine learning models. For the risk factor recognition module, we built a bidirectional long short-term memory (BLSTM) network with additional risk factor textual features as input. For time and assertion classification, we built convolutional neural networks (CNNs) that take the risk factor type and section label as input, as well as a support vector machine (SVM). Results: The best performance was an F1 score of 0.9609 for risk factor recognition, and F1 scores of 0.9812 and 0.9612 for time and assertion classification, respectively. The experimental results show that our system achieves high performance and can extract risk factors from CEMRs efficiently. Conclusions: The proposed system is the first system for extracting CVD risk factors from CEMRs and is competitive with risk factor extraction systems developed on English EMRs. Further, its good performance should have a strong influence on CVD prevention.
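The two-stage pipeline described in the Methods can be sketched roughly as follows. This is only an illustration of the control flow, not the authors' implementation: the BIO tagging scheme, the function names (decode_bio, extract_risk_factors), the label strings ("HTN", "before", "present", "past_history"), and the stub models are all assumptions made for the example. In the paper the tagger is a BLSTM and the classifiers are CNN and SVM models; here they are plain callables.

```python
from typing import Callable, Dict, List, Tuple

Span = Tuple[int, int, str]  # (start, end_exclusive, risk_factor_type)


def decode_bio(tags: List[str]) -> List[Span]:
    """Turn per-token BIO tags (an assumed scheme) into typed spans."""
    spans: List[Span] = []
    start, kind = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing span
        continues = tag.startswith("I-") and kind == tag[2:]
        if start is not None and not continues:
            spans.append((start, i, kind))
            start, kind = None, None
        if tag.startswith("B-"):
            start, kind = i, tag[2:]
    return spans


def extract_risk_factors(
    tokens: List[str],
    section: str,
    tagger: Callable[[List[str]], List[str]],
    time_clf: Callable[[List[str], Span, str], str],
    assertion_clf: Callable[[List[str], Span, str], str],
) -> List[Dict[str, str]]:
    """Stage 1: recognize risk factor spans. Stage 2: classify each span."""
    results = []
    for span in decode_bio(tagger(tokens)):
        start, end, kind = span
        results.append({
            "text": "".join(tokens[start:end]),  # Chinese text: no spaces
            "type": kind,
            # The classifiers see the risk factor type (inside span) and
            # the section label, mirroring the inputs named in the abstract.
            "time": time_clf(tokens, span, section),
            "assertion": assertion_clf(tokens, span, section),
        })
    return results


# Stub models standing in for the trained BLSTM / CNN / SVM (hypothetical).
def stub_tagger(tokens: List[str]) -> List[str]:
    return ["O", "O", "O", "B-HTN", "I-HTN", "I-HTN", "O", "O"]


def stub_time(tokens: List[str], span: Span, section: str) -> str:
    return "before"


def stub_assertion(tokens: List[str], span: Span, section: str) -> str:
    return "present"


tokens = list("患者有高血压病史")  # character-level tokens
found = extract_risk_factors(
    tokens, "past_history", stub_tagger, stub_time, stub_assertion
)
```

With the stub models above, found holds a single record for the span "高血压" (hypertension). Keeping the models behind plain callables means any sequence tagger or classifier can be swapped in without touching the pipeline logic.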

Original language: English
Pages (from-to): 1-10
Number of pages: 10
Journal: Computer Methods and Programs in Biomedicine
Volume: 172
State: Published - Apr 2019

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being

Keywords

  • Cardiovascular diseases
  • Chinese electronic medical records
  • Information extraction
  • Machine learning
  • Risk factor
