Abstract
With the rapid growth of the Internet, online public opinion plays an increasingly important role in the stability of the network community, and sensitive information hidden on the Internet can have an unpredictable social impact. This paper focuses on the detection of Chinese sensitive information. First, we build a corpus to train the detection model. Second, we apply BERT to the detection problem. Then, we apply several popular NLP methods to the same problem to show the improvement BERT brings to the sensitive information detection task. Finally, we obtain a BERT-based sensitive information detection model with a high F1 score of 97.31.
| Original language | English |
|---|---|
| Article number | 012088 |
| Journal | Journal of Physics: Conference Series |
| Volume | 1757 |
| Issue number | 1 |
| DOIs | |
| State | Published - 3 Feb 2021 |
| Externally published | Yes |
| Event | 2020 International Conference on Computer Big Data and Artificial Intelligence, ICCBDAI 2020 - Changsha, China; Duration: 24 Oct 2020 → 25 Oct 2020 |
Keywords
- component
- formatting
- style
- styling
Title: Research on Automated Detection of Sensitive Information Based on BERT