TY - GEN
T1 - An Infrared-Visible Image Fusion Network with Multi-Scale Convolutional Attention and Transformer
AU - Meng, Qi
AU - Tian, Chunwei
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Infrared-visible image fusion (IVIF) obtains an information-rich fused image by fusing the intensity information in infrared images and the texture information in visible light images. However, existing methods usually have difficulty in effectively balancing global and edge feature extraction while maintaining cross-modal consistency, resulting in unsatisfactory fusion quality. In this paper, a cross-modal multi-scale dual-branch feature fusion network named MTFuse is proposed to address the key challenges of IVIF, including global and edge feature extraction, cross-modal information fusion, and highlighting important features. The features of the proposed method include a Transformer-CNN framework for integrated feature extraction, a multi-scale convolutional attention fusion block (MCAFB) for improved detail preservation, and a novel loss function inspired by focal loss for highlighting key areas in the fused image. Experimental results on benchmark datasets show that our method performs well on various metrics and significantly improves the fusion quality and effectiveness.
AB - Infrared-visible image fusion (IVIF) obtains an information-rich fused image by fusing the intensity information in infrared images and the texture information in visible light images. However, existing methods usually have difficulty in effectively balancing global and edge feature extraction while maintaining cross-modal consistency, resulting in unsatisfactory fusion quality. In this paper, a cross-modal multi-scale dual-branch feature fusion network named MTFuse is proposed to address the key challenges of IVIF, including global and edge feature extraction, cross-modal information fusion, and highlighting important features. The features of the proposed method include a Transformer-CNN framework for integrated feature extraction, a multi-scale convolutional attention fusion block (MCAFB) for improved detail preservation, and a novel loss function inspired by focal loss for highlighting key areas in the fused image. Experimental results on benchmark datasets show that our method performs well on various metrics and significantly improves the fusion quality and effectiveness.
KW - Transformer-CNN framework
KW - infrared and visible images fusion
KW - multi-scale convolutional attention fusion block
UR - https://www.scopus.com/pages/publications/85216541175
U2 - 10.1109/ICCSI62669.2024.10799490
DO - 10.1109/ICCSI62669.2024.10799490
M3 - 会议稿件
AN - SCOPUS:85216541175
T3 - 2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024
BT - 2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024
Y2 - 8 November 2024 through 12 November 2024
ER -