Skip to main navigation Skip to search Skip to main content

Active Visual Perception Enhancement Method Based on Deep Reinforcement Learning

  • Zhonglin Yang
  • , Hao Fang
  • , Huanyu Liu*
  • , Junbao Li
  • , Yutong Jiang
  • , Mengqi Zhu
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • China North Vehicle Research Institute

Research output: Contribution to journalArticlepeer-review

Abstract

Traditional object detection methods using static cameras are constrained by their limited perspectives, hampering the effective detection of low-confidence targets. To address this challenge, this study introduces a deep reinforcement learning-based visual perception enhancement technique. This approach leverages pan–tilt–zoom (PTZ) cameras to achieve active vision, enabling them to autonomously make decisions and actions tailored to the current scene and object detection outcomes. This optimization enhances both the object detection process and information acquisition, significantly boosting the intelligent perception capabilities of PTZ cameras. Experimental findings demonstrate the robust generalization capabilities of this method across various object detection algorithms, resulting in an average confidence level improvement of 23.80%.

Original languageEnglish
Article number1654
JournalElectronics (Switzerland)
Volume13
Issue number9
DOIs
StatePublished - May 2024

Keywords

  • PTZ camera
  • active vision
  • deep reinforcement learning
  • object detection

Fingerprint

Dive into the research topics of 'Active Visual Perception Enhancement Method Based on Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this