Skip to main navigation Skip to search Skip to main content

A lightweight convolutional neural network-based feature extractor for visible images

  • Xujie He
  • , Jing Jin*
  • , Yu Jiang
  • , Dandan Li
  • *Corresponding author for this work
  • School of Astronautics, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Feature extraction networks (FENs), as the first stage in many computer vision tasks, play critical roles. Previous studies regarding FENs employed deeper and wider networks to attain higher accuracy, but their approaches were memory-inefficient and computationally intensive. Here, we present an accurate and lightweight feature extractor (RoShuNet) for visible images based on ShuffleNetV2. The provided improvements are threefold. To make ShuffleNetV2 compact without degrading its feature extraction ability, we propose an aggregated dual group convolutional module; to better aid the channel interflow process, we propose a γ-weighted shuffling module; to further reduce the complexity and size of the model, we introduce slimming strategies. Classification experiments demonstrate the state-of-the-art (SOTA) performance of RoShuNet, which yields an increase in accuracy and reduces the complexity and size of the model compared to those of ShuffleNetV2. Generalization experiments verify that the proposed method is also applicable to feature extraction tasks in semantic segmentation and multiple-object tracking scenarios, achieving comparable accuracy to that of other approaches with more memory and greater computational efficiency. Our method provides a novel perspective for designing lightweight models.

Original languageEnglish
Article number104157
JournalComputer Vision and Image Understanding
Volume249
DOIs
StatePublished - Dec 2024
Externally publishedYes

Keywords

  • Computer vision
  • Feature extraction
  • Feature extraction network
  • Lightweight
  • Memory efficiency

Fingerprint

Dive into the research topics of 'A lightweight convolutional neural network-based feature extractor for visible images'. Together they form a unique fingerprint.

Cite this