TY - GEN
T1 - Object Detection for Optical Remote Sensing Images with Self-supervised Feature Representation
AU - Shao, Wenyi
AU - Yu, Jinxiang
AU - Huang, Chaowei
AU - Yang, Jingyi
AU - Peng, Yu
AU - Liu, Liansheng
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Automatically detecting objects of interest on remote sensing images is crucial for earth observations. Existing remote sensing object detectors mainly rely on supervised methods, and the quality and quantity of annotated samples determine the detection performance. However, obtaining large-scale labeled images is labor-intensive and requires domain expertise, which hinders the advancement of remote sensing object detection. To solve the problem, a method based on self-supervised feature representation is presented, with the goal of investigating how to utilize a large number of unlabeled remote-sensing images to enhance detection performance. The method contains three steps. Firstly, the presented method collects many unlabeled remote-sensing photos and reconstructs them to suppress the ineffective expression of background information while preserving the key features of the object. Then, object-level contrastive learning is used to acquire the generalized feature representation. Finally, the extracted feature expression is transferred to downstream task to improve the final detection performance. Experiment results show that with only a few training epochs on the NWPU VHR-10.v2 dataset, the proposed method outperforms supervised-only methods.
AB - Automatically detecting objects of interest on remote sensing images is crucial for earth observations. Existing remote sensing object detectors mainly rely on supervised methods, and the quality and quantity of annotated samples determine the detection performance. However, obtaining large-scale labeled images is labor-intensive and requires domain expertise, which hinders the advancement of remote sensing object detection. To solve the problem, a method based on self-supervised feature representation is presented, with the goal of investigating how to utilize a large number of unlabeled remote-sensing images to enhance detection performance. The method contains three steps. Firstly, the presented method collects many unlabeled remote-sensing photos and reconstructs them to suppress the ineffective expression of background information while preserving the key features of the object. Then, object-level contrastive learning is used to acquire the generalized feature representation. Finally, the extracted feature expression is transferred to downstream task to improve the final detection performance. Experiment results show that with only a few training epochs on the NWPU VHR-10.v2 dataset, the proposed method outperforms supervised-only methods.
KW - contrastive learning
KW - earth observation
KW - feature representation
KW - object detection
KW - remote sensing images
UR - https://www.scopus.com/pages/publications/85207063866
U2 - 10.1109/IPEC61310.2024.00039
DO - 10.1109/IPEC61310.2024.00039
M3 - 会议稿件
AN - SCOPUS:85207063866
T3 - Proceedings - 2024 Asia-Pacific Conference on Image Processing, Electronics and Computers, IPEC 2024
SP - 171
EP - 176
BT - Proceedings - 2024 Asia-Pacific Conference on Image Processing, Electronics and Computers, IPEC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th Asia-Pacific Conference on Image Processing, Electronics and Computers, IPEC 2024
Y2 - 12 April 2024 through 14 April 2024
ER -