TY - GEN
T1 - SPM-MIM
T2 - 2nd International Conference on Computer, Vision and Intelligent Technology, ICCVIT 2024
AU - Qiu, Yujie
AU - Jin, Pengcheng
AU - Gao, Guoming
AU - Wang, Qingwang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - In this paper, we propose a super-pixel based multitemporal masked image modeling (SPM-MIM) method for multitemporal remote sensing land cover classification. Compared with the previous self-supervised masked image modeling methods, SPM-MIM masks and reconstructs images based on super-pixels, effectively avoiding the problem that tiny remote sensing scenes are completely covered, while better retaining the structural information of remote sensing scenes, so that the model can be trained with stronger feature extraction capabilities. In order to better combine multi-temporal information to achieve land cover classification, we propose a cross-temporal information complementary module (CTICM). CTICM interacts with shallow multi-temporal features in spatial dimension to promote cross-phase information complementarity in local scenes, and interacts with deep multi-temporal features in channel dimension to promote cross-phase information complementarity in global scenes. Experimental results on remote sensing images of Gaofen-1 Harbin area and Gaofen2 Dalian Lushun area show that our proposed SPM-MIM method has excellent land cover classification capability.
AB - In this paper, we propose a super-pixel based multitemporal masked image modeling (SPM-MIM) method for multitemporal remote sensing land cover classification. Compared with the previous self-supervised masked image modeling methods, SPM-MIM masks and reconstructs images based on super-pixels, effectively avoiding the problem that tiny remote sensing scenes are completely covered, while better retaining the structural information of remote sensing scenes, so that the model can be trained with stronger feature extraction capabilities. In order to better combine multi-temporal information to achieve land cover classification, we propose a cross-temporal information complementary module (CTICM). CTICM interacts with shallow multi-temporal features in spatial dimension to promote cross-phase information complementarity in local scenes, and interacts with deep multi-temporal features in channel dimension to promote cross-phase information complementarity in global scenes. Experimental results on remote sensing images of Gaofen-1 Harbin area and Gaofen2 Dalian Lushun area show that our proposed SPM-MIM method has excellent land cover classification capability.
KW - multitemporal remote sensing
KW - remote sensing classification
KW - self-supervised learning
KW - super-pixel masking
UR - https://www.scopus.com/pages/publications/85219619185
U2 - 10.1109/ICCVIT63928.2024.10872507
DO - 10.1109/ICCVIT63928.2024.10872507
M3 - 会议稿件
AN - SCOPUS:85219619185
T3 - 2024 2nd International Conference on Computer, Vision and Intelligent Technology, ICCVIT 2024 - Proceedings
BT - 2024 2nd International Conference on Computer, Vision and Intelligent Technology, ICCVIT 2024 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 November 2024 through 27 November 2024
ER -