Skip to main navigation Skip to search Skip to main content

A Unified Visual Information Preservation Framework for Self-supervised Pre-Training in Medical Image Analysis

  • Hong Yu Zhou
  • , Chixiang Lu
  • , Chaoqi Chen
  • , Sibei Yang
  • , Yizhou Yu*
  • *Corresponding author for this work
  • The University of Hong Kong
  • ShanghaiTech University

Research output: Contribution to journalArticlepeer-review

Abstract

Recent advances in self-supervised learning (SSL) in computer vision are primarily comparative, whose goal is to preserve invariant and discriminative semantics in latent representations by comparing siamese image views. However, the preserved high-level semantics do not contain enough local information, which is vital in medical image analysis (e.g., image-based diagnosis and tumor segmentation). To mitigate the locality problem of comparative SSL, we propose to incorporate the task of pixel restoration for explicitly encoding more pixel-level information into high-level semantics. We also address the preservation of scale information, a powerful tool in aiding image understanding but has not drawn much attention in SSL. The resulting framework can be formulated as a multi-task optimization problem on the feature pyramid. Specifically, we conduct multi-scale pixel restoration and siamese feature comparison in the pyramid. In addition, we propose non-skip U-Net to build the feature pyramid and develop sub-crop to replace multi-crop in 3D medical imaging. The proposed unified SSL framework (PCRLv2) surpasses its self-supervised counterparts on various tasks, including brain tumor segmentation (BraTS 2018), chest pathology identification (ChestX-ray, CheXpert), pulmonary nodule detection (LUNA), and abdominal organ segmentation (LiTS), sometimes outperforming them by large margins with limited annotations. Codes and models are available at https://github.com/RL4M/PCRLv2.

Original languageEnglish
Pages (from-to)8020-8035
Number of pages16
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume45
Issue number7
DOIs
StatePublished - 1 Jul 2023
Externally publishedYes

Keywords

  • Context restoration
  • feature pyramid
  • medical image analysis
  • self-supervised learning
  • transfer learning

Fingerprint

Dive into the research topics of 'A Unified Visual Information Preservation Framework for Self-supervised Pre-Training in Medical Image Analysis'. Together they form a unique fingerprint.

Cite this