Skip to main navigation Skip to search Skip to main content

A Multi-Phase Camera-LiDAR Fusion Network for 3D Semantic Segmentation With Weak Supervision

  • Harbin Institute of Technology
  • Ningbo Institute of Intelligent Equipment Technology Company Ltd
  • Yongjiang Laboratory

Research output: Contribution to journalArticlepeer-review

Abstract

Camera and LiDAR are indispensable perception units in autonomous driving, providing complementary environmental information for 3D semantic segmentation. It is the key point that fuses the information of two modalities to accurate and robust semantic segmentation. However, three major factors will restrict the performance of fusion-based methods, i.e., the reliability of image features, the contribution of different image features, and the trade-off between results of image and point cloud. This paper proposes a novel multi-phase fusion network for 3D semantic segmentation. For the first factor, this paper takes the lead in regarding the problem that image features may be wrong due to the lack of dense annotations in the common datasets as a weak supervision problem and introduces the weakly supervised loss. Second, the proposed attention based feature fusion module can filter and reweight the image features effectively. Third, the results of the two modalities are further fused by self-confidence based late fusion module at pixel-level to complement their advantages. The proposed scheme has been evaluated on nuScenes and SemanticKITTI benchmarks, and the results show the competitiveness with state-of-the-art methods. The ablation studies demonstrate the superiority of the method in sparse classes segmentation. In addition, the robustness is also evaluated, and the results of the proposed method can keep relatively accurate even when faults in one of the sensors.

Original languageEnglish
Pages (from-to)3737-3746
Number of pages10
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume33
Issue number8
DOIs
StatePublished - 1 Aug 2023

Keywords

  • 3D semantic segmentation
  • Autonomous driving
  • multi-modal fusion
  • weak supervision

Fingerprint

Dive into the research topics of 'A Multi-Phase Camera-LiDAR Fusion Network for 3D Semantic Segmentation With Weak Supervision'. Together they form a unique fingerprint.

Cite this