Skip to main navigation Skip to search Skip to main content

Sensitive information detection based on convolution neural network and bi-directional LSTM

  • Yan Lin
  • , Guosheng Xu
  • , Guoai Xu
  • , Yudong Chen
  • , Dawei Sun

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Electronic documents can carry lots of information and are widely used in daily lives. It will cause substantial economic losses to individual users, enterprises, and governments when the documents containing sensitive information are leaked. How to detect sensitive information to prevent data leakage is still a challenge in the field of information security. This paper mainly focuses on the detection of unstructured documents containing sensitive information. Governments, military, and other institutions can actively mark whether the electronic documents contain sensitive information according to the detection results. We propose a reliable method to detect sensitive electronic documents automatically and compare it with other basic methods. The algorithm structure can extract the characteristics of the data more comprehensively to obtain better detection results. Our model outperformed the other models with 93.44 % accuracy. Our model can also reduce the time cost, which is beneficial for realistic production.

Original languageEnglish
Title of host publicationProceedings - 2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2020
EditorsGuojun Wang, Ryan Ko, Md Zakirul Alam Bhuiyan, Yi Pan
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1614-1621
Number of pages8
ISBN (Electronic)9781665403924
DOIs
StatePublished - Dec 2020
Externally publishedYes
Event19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2020 - Guangzhou, China
Duration: 29 Dec 20201 Jan 2021

Publication series

NameProceedings - 2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2020

Conference

Conference19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2020
Country/TerritoryChina
CityGuangzhou
Period29/12/201/01/21

Keywords

  • Convolutional neural network
  • Data leak
  • Information security
  • Sensitive information prevention
  • Unstructured documents

Fingerprint

Dive into the research topics of 'Sensitive information detection based on convolution neural network and bi-directional LSTM'. Together they form a unique fingerprint.

Cite this