Skip to main navigation Skip to search Skip to main content

A contrastive triplet network for automatic chest X-ray reporting

  • Yan Yang
  • , Jun Yu*
  • , Hanliang Jiang
  • , Weidong Han
  • , Jian Zhang
  • , Wei Jiang
  • *Corresponding author for this work
  • Hangzhou Dianzi University
  • Sir Run Run Shaw Hospital
  • Zhejiang International Studies University

Research output: Contribution to journalArticlepeer-review

Abstract

Chest X-ray reporting aims at generating linguistic descriptions automatically for chest X-ray images, in which accurate detection and description of abnormalities are essential. However, the seriously biased data distribution (e.g., the normal cases usually dominate the whole dataset over abnormal cases) causes huge challenges for the data-driven neural models to generate satisfied abnormality descriptions. To this end, we propose a contrastive triplet network (CTN) built on the Transformer architecture for automatic chest X-ray reporting to alleviate the data-bias problem. Our CTN effectively enhances abnormalities by comparing visual and semantic information between normal and abnormal cases using a triplet network. Specifically, triplets including normal and abnormal cases are first constructed. Then, visual tokens of the chest X-ray are extracted and fed to the Transformer to generate an associated report. During training, comparisons between normal and abnormal cases are conducted via contrasting: 1) the visual embedding of the chest X-ray image encoded by the Transformer encoder, and 2) the semantic embedding of the generated report encoded by a pre-trained textual encoder. Comprehensive experiments on two publicly-available databases have shown the good performance of our method.

Original languageEnglish
Pages (from-to)71-83
Number of pages13
JournalNeurocomputing
Volume502
DOIs
StatePublished - 1 Sep 2022
Externally publishedYes

Keywords

  • Chest X-ray report generation
  • Contrastive learning
  • Data bias
  • Encoder-decoder architecture
  • Textual encoder
  • Transformer
  • Triplet network

Fingerprint

Dive into the research topics of 'A contrastive triplet network for automatic chest X-ray reporting'. Together they form a unique fingerprint.

Cite this