SVDF: enhancing structural variation detect from long-read sequencing via automatic filtering strategies

  • Heng Hu
  • , Runtian Gao
  • , Wentao Gao
  • , Bo Gao
  • , Zhongjun Jiang
  • , Murong Zhou
  • , Guohua Wang*
  • , Tao Jiang*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Structural variation (SV) is an important form of genomic variation that inf luences gene function and expression by altering the structure of the genome. Although long-read data have been proven to better characterize SVs, SVs detected from noisy long-read data still include a considerable portion of false-positive calls. To accurately detect SVs in long-read data, we present SVDF, a method that employs a learning-based noise filtering strategy and an SV signature-adaptive clustering algorithm, for effectively reducing the likelihood of false-positive events. Benchmarking results from multiple orthogonal experiments demonstrate that, across different sequencing platforms and depths, SVDF achieves higher calling accuracy for each sample compared to several existing general SV calling tools. We believe that, with its meticulous and sensitive SV detection capability, SVDF can bring new opportunities and advancements to cutting-edge genomic research.

Original languageEnglish
Article numberbbae336
JournalBriefings in Bioinformatics
Volume25
Issue number4
DOIs
StatePublished - 1 Jul 2024
Externally publishedYes

Keywords

  • deep learning
  • false-positives
  • long-read sequencing
  • structural variation detection

Fingerprint

Dive into the research topics of 'SVDF: enhancing structural variation detect from long-read sequencing via automatic filtering strategies'. Together they form a unique fingerprint.

Cite this