Skip to main navigation Skip to search Skip to main content

Pathformer: a biological pathway informed transformer for disease diagnosis and prognosis using multi-omics data

  • Xiaofan Liu
  • , Yuhuan Tao
  • , Zilin Cai
  • , Pengfei Bao
  • , Hongli Ma
  • , Kexing Li
  • , Mengtao Li*
  • , Yunping Zhu*
  • , Zhi John Lu*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Motivation: Multi-omics data provide a comprehensive view of gene regulation at multiple levels, which is helpful in achieving accurate diagnosis of complex diseases like cancer. However, conventional integration methods rarely utilize prior biological knowledge and lack interpretability. Results: To integrate various multi-omics data of tissue and liquid biopsies for disease diagnosis and prognosis, we developed a biological pathway informed Transformer, Pathformer. It embeds multi-omics input with a compacted multi-modal vector and a pathway-based sparse neural network. Pathformer also leverages criss-cross attention mechanism to capture the crosstalk between different pathways and modalities. We first benchmarked Pathformer with 18 comparable methods on multiple cancer datasets, where Pathformer outperformed all the other methods, with an average improvement of 6.3%–14.7% in F1 score for cancer survival prediction, 5.1%–12% for cancer stage prediction, and 8.1%–13.6% for cancer drug response prediction. Subsequently, for cancer prognosis prediction based on tissue multi-omics data, we used a case study to demonstrate the biological interpretability of Pathformer by identifying key pathways and their biological crosstalk. Then, for cancer early diagnosis based on liquid biopsy data, we used plasma and platelet datasets to demonstrate Pathformer’s potential of clinical applications in cancer screening. Moreover, we revealed deregulation of interesting pathways (e.g. scavenger receptor pathway) and their crosstalk in cancer patients’ blood, providing potential candidate targets for cancer microenvironment study.

Original languageEnglish
Article numberbtae316
JournalBioinformatics
Volume40
Issue number5
DOIs
StatePublished - 1 May 2024
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Fingerprint

Dive into the research topics of 'Pathformer: a biological pathway informed transformer for disease diagnosis and prognosis using multi-omics data'. Together they form a unique fingerprint.

Cite this