Abstract
Neural network hardware accelerators have become essential for deploying convolutional neural networks (CNNs) on edge devices. However, due to hardware and software constraints, these accelerators are often optimized for specific network architectures, limiting their ability to accelerate general-purpose neural network models such as YOLO (You Only Look Once). In contrast, in this paper, the integration of the NVIDIA Deep Learning Accelerator (NVDLA) into a RISC-V architecture-based System on Chip (SoC) to accelerate the YOLOv5s network model is presented. While simple network acceleration can be achieved through register-level configurations of NVDLA, like LeNet-5, optimizing YOLOv5s performance requires a more advanced approach. We interfaced the Tengine edge computing framework with the NVDLA software, enabling the registration of specific operators to leverage NVDLA for intensive computations while delegating unsupported tasks to the CPU. This hybrid acceleration strategy ensures efficient workload distribution, maximizing the performance benefits of both the accelerator and the processor. The proposed system was validated on a 250MHz Field Programmable Gate Array (FPGA) platform, demonstrating significant improvements in inference speed and energy efficiency for the YOLOv5s model compared to traditional CPU core implementations. Experimental results indicate that the integrated RISC-V SoC with NVDLA achieves processing efficiency 17x and 25x greater than the standard ARM Cortex-A53 and Intel Core i7-12700H. This work establishes a foundation for deploying more complex deep learning models on resource-constrained edge devices, highlighting its potential for real-time object detection applications across various domains.
| Original language | English |
|---|---|
| Title of host publication | Fourth International Conference on Electronic Information Engineering and Data Processing, EIEDP 2025 |
| Editors | Lei Chen, Azlan Bin Mohd Zain |
| Publisher | SPIE |
| ISBN (Electronic) | 9781510690561 |
| DOIs | |
| State | Published - 2025 |
| Externally published | Yes |
| Event | 4th International Conference on Electronic Information Engineering and Data Processing, EIEDP 2025 - Kuala Lumpur, Malaysia Duration: 17 Jan 2025 → 19 Jan 2025 |
Publication series
| Name | Proceedings of SPIE - The International Society for Optical Engineering |
|---|---|
| Volume | 13574 |
| ISSN (Print) | 0277-786X |
| ISSN (Electronic) | 1996-756X |
Conference
| Conference | 4th International Conference on Electronic Information Engineering and Data Processing, EIEDP 2025 |
|---|---|
| Country/Territory | Malaysia |
| City | Kuala Lumpur |
| Period | 17/01/25 → 19/01/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- Accelerator
- Edge Computing
- FPGA
- NVDLA
- RISC-V
- YOLOv5
Fingerprint
Dive into the research topics of 'The acceleration of YOLOv5s through NVDLA on RISC-V SoC'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver