Abstract
Object imaging and recognition under difficult visual conditions is extremely challenging due to the captured low-quality images, and traditional optical-based recognition methods always fail in this task. In this article, we propose to utilize the visual-microwave image pairs captured by both visual cameras and microwave sensors for imaging and recognition. To address the heavy noises in the low-quality optical images, we retrieve the physically quantitative images from associated scattered field data and enhance visual features by both optical and retrieval images. We develop a cross-modal enhanced attentive visual-microwave fusion (EAVMF) object recognition model to jointly learn the cross-modal generator and multimodal recognizer. In addition, an attention module for the visual subnetwork is utilized to highlight the regions of interest. Two multimodal datasets with synthetic visual-microwave image pairs are built to simulate the difficult visual condition. The numerical results on these datasets demonstrate that: 1) the multimodal fusion, cross-modal enhancement, and visual attention module can enhance the performance; and 2) compared with the existing methods, the proposed EAVMF not only performs better in terms of accuracy, but also has good scalability and one-shot learning ability.
| Original language | English |
|---|---|
| Article number | 2001812 |
| Journal | IEEE Transactions on Geoscience and Remote Sensing |
| Volume | 61 |
| DOIs | |
| State | Published - 2023 |
| Externally published | Yes |
Keywords
- Adversarial learning
- electromagnetic (EM) inversion
- multimodal fusion
- object recognition
- optical imaging
Fingerprint
Dive into the research topics of 'Electromagnetic Imaging Boosted Visual Object Recognition Under Difficult Visual Conditions'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver