An Infrared-Visible Image Fusion Network with Multi-Scale Convolutional Attention and Transformer

  • Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Infrared-visible image fusion (IVIF) produces an information-rich fused image by combining the intensity information in infrared images with the texture information in visible-light images. However, existing methods often struggle to balance global and edge feature extraction while maintaining cross-modal consistency, which leads to unsatisfactory fusion quality. This paper proposes MTFuse, a cross-modal, multi-scale, dual-branch feature fusion network that addresses the key challenges of IVIF: global and edge feature extraction, cross-modal information fusion, and highlighting important features. The method comprises a Transformer-CNN framework for integrated feature extraction, a multi-scale convolutional attention fusion block (MCAFB) for improved detail preservation, and a novel loss function inspired by focal loss that highlights key areas in the fused image. Experimental results on benchmark datasets show that the method performs well on a range of metrics and significantly improves fusion quality.

Original language: English
Title of host publication: 2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798350376739
State: Published - 2024
Externally published: Yes
Event: 2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024 - Doha, Qatar
Duration: 8 Nov 2024 to 12 Nov 2024

Keywords

  • Transformer-CNN framework
  • infrared and visible images fusion
  • multi-scale convolutional attention fusion block
