TY - GEN
T1 - MDNET
T2 - 5th IEEE International Conference on Electronic Technology, Communication and Information, ICETCI 2025
AU - Gao, Jinghan
AU - Liu, Sijia
AU - Xie, Tao
AU - Li, Ruifeng
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - In recent years, deep neural networks have revolutionized image classification by enabling automatic feature learning. However, this performance gain comes at the cost of increased model complexity and computational demands. In many real-world application scenarios, constraints related to memory usage and response speed necessitate the development of efficient lightweight models. To address this, we propose MDNET, a highly efficient multi-scale differential lightweight network without additional model deep compression for image classification. MDNET incorporates a Multi-scale Differential Transformer (MDIFFormer) architecture that integrates diverse receptive fields along the channel dimension, rather than the spatial dimension, enabling parallel extraction of multi-scale information and facilitating global context understanding. A differential attention mechanism inspired by the differential amplifier circuit in physics is introduced to mitigate attention noise from irrelevant contexts, enhancing model robustness. Furthermore, label smoothing is applied to the probability distribution, introducing uniform noise to reduce over-reliance on true labels. Extensive experiments on benchmark datasets, including CIFAR100, CIFAR10, and a newly released dataset focused on new energy battery packs (NEBP), demonstrate that MDNET achieves state-of-the-art performance while maintaining low computational complexity. Here, we show that MDNET achieves a Top-1 accuracy of 73.34% on CIFAR100 with only 2.38M parameters, outperforming existing lightweight models.
AB - In recent years, deep neural networks have revolutionized image classification by enabling automatic feature learning. However, this performance gain comes at the cost of increased model complexity and computational demands. In many real-world application scenarios, constraints related to memory usage and response speed necessitate the development of efficient lightweight models. To address this, we propose MDNET, a highly efficient multi-scale differential lightweight network without additional model deep compression for image classification. MDNET incorporates a Multi-scale Differential Transformer (MDIFFormer) architecture that integrates diverse receptive fields along the channel dimension, rather than the spatial dimension, enabling parallel extraction of multi-scale information and facilitating global context understanding. A differential attention mechanism inspired by the differential amplifier circuit in physics is introduced to mitigate attention noise from irrelevant contexts, enhancing model robustness. Furthermore, label smoothing is applied to the probability distribution, introducing uniform noise to reduce over-reliance on true labels. Extensive experiments on benchmark datasets, including CIFAR100, CIFAR10, and a newly released dataset focused on new energy battery packs (NEBP), demonstrate that MDNET achieves state-of-the-art performance while maintaining low computational complexity. Here, we show that MDNET achieves a Top-1 accuracy of 73.34% on CIFAR100 with only 2.38M parameters, outperforming existing lightweight models.
KW - deep learning
KW - image classification
KW - lightweight
KW - transformer
UR - https://www.scopus.com/pages/publications/105013072118
U2 - 10.1109/ICETCI64844.2025.11084157
DO - 10.1109/ICETCI64844.2025.11084157
M3 - 会议稿件
AN - SCOPUS:105013072118
T3 - 2025 IEEE 5th International Conference on Electronic Technology, Communication and Information, ICETCI 2025
SP - 47
EP - 56
BT - 2025 IEEE 5th International Conference on Electronic Technology, Communication and Information, ICETCI 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 23 May 2025 through 25 May 2025
ER -