Research on Automated Detection of Sensitive Information Based on BERT

  • Meng Ding*
  • , Xing Wang
  • , Changming Wu
  • , Kaixuan Wang
  • , Xue Yang
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

With the booming of the Internet, Web public opinion plays an increasingly important role in the stability of the network community. Therefore, the sensitive information hidden on the Internet is likely to lead to unpredictable social impact. This paper focuses on the detection of Chinese sensitive information. First, we build a corpus to train the detection model. Secondly, we apply the Bert method to the detection problem. Then, many popular NLP methods are applied to this problem to show the progress of Bert in a sensitive information detection task. Finally, we got a sensitive information detection model based on BERT with a high F1 score of 97.31.

Original languageEnglish
Article number012088
JournalJournal of Physics: Conference Series
Volume1757
Issue number1
DOIs
StatePublished - 3 Feb 2021
Externally publishedYes
Event2020 International Conference on Computer Big Data and Artificial Intelligence, ICCBDAI 2020 - Changsha, China
Duration: 24 Oct 202025 Oct 2020

Keywords

  • component
  • formatting
  • style
  • styling

Fingerprint

Dive into the research topics of 'Research on Automated Detection of Sensitive Information Based on BERT'. Together they form a unique fingerprint.

Cite this