Skip to main navigation Skip to search Skip to main content

PSVMLP: Point and Shifted Voxel MLP for 3D deep learning

  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

We propose a high-performance 3D feature extraction deep learning network based on point cloud and shifted voxel, named Point and Shifted Voxel MLP (PSVMLP). The main component of PSVMLP is simple Multi-Layer Perceptron (MLP) structure. PSVMLP achieves effective extraction of multi-scale features from 3D data. Specifically, we combine point cloud and voxel-based feature extraction methods. In voxel representation learning, we propose a wide-range geometric feature extraction method based on axial shifting operations and simple MLP structure. The axial shifting operations allow shifting voxels in the depth, height, and width directions, capturing more geometric information. In point cloud representation learning, we use simple MLP structure to extract local features, and we also extract global features by combining transformer structure. By combining point cloud and voxel feature extraction methods, we obtain rich feature representations from different scales, enhancing the model's expressive power and generalization performance. Applying our designed model to basic geometric feature learning tasks, we achieve excellent results. Despite being built primarily on a simple MLP framework, our model demonstrates remarkable performance on both shape classification and shape part segmentation tasks. Our code is available at https://github.com/hitxraz/psvmlp.

Original languageEnglish
Pages (from-to)1-7
Number of pages7
JournalPattern Recognition Letters
Volume185
DOIs
StatePublished - Sep 2024

Keywords

  • Deep learning
  • Point clouds
  • Shape classification
  • Shape part segmentation

Fingerprint

Dive into the research topics of 'PSVMLP: Point and Shifted Voxel MLP for 3D deep learning'. Together they form a unique fingerprint.

Cite this