TY - GEN
T1 - Comparative Performance Analysis of Lightweight BERT-derived Models for Cybersecurity Corpus in Cloud Networks
AU - Wu, Yue
AU - Ye, Lin
AU - Li, Baisong
AU - Zhang, Hongli
AU - Cao, Jing
N1 - Publisher Copyright: © 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - The rapid development of cloud networks has created a complex cybersecurity landscape, with new challenges and threats arising from the shift to cloud-based systems. Efficient on-device processing of network security reports has become essential in distributed, edge-oriented scenarios with substantial data volumes. Pre-trained models such as BERT have been used extensively in network security, but their high computational demands and limited input length (512 tokens) can hinder their performance in cloud network environments, particularly when classifying network security reports, which often exceed this limit. This paper therefore presents a comparative analysis of a series of lightweight pre-trained models derived from BERT on a cybersecurity corpus. To overcome the input-length limitation of BERT-derived models, we draw on recent advances in Natural Language Processing (NLP) and adopt an approach that combines abstract extraction, application of pre-trained models, and fine-tuning to classify network security reports on edge devices. Comparative experiments on a real-world dataset show that the BRIO-TinyBERT-Fine-tuning architecture achieves an accuracy of 82% on network security report classification while using only half the parameters of the standard BERT model.
KW - BERT
KW - cyber security text-classification
KW - lightweight pre-trained models
UR - https://www.scopus.com/pages/publications/85191229970
U2 - 10.1109/CloudNet59005.2023.10490080
DO - 10.1109/CloudNet59005.2023.10490080
M3 - Conference contribution
AN - SCOPUS:85191229970
T3 - 2023 IEEE 12th International Conference on Cloud Networking, CloudNet 2023
SP - 443
EP - 447
BT - 2023 IEEE 12th International Conference on Cloud Networking, CloudNet 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 12th IEEE International Conference on Cloud Networking, CloudNet 2023
Y2 - 1 November 2023 through 3 November 2023
ER -