TIDF-DLPM: Term and Inverse Document Frequency based Data Leakage Prevention Model

Ishu Gupta,Sloni Mittal,Ankit Tiwari,Priya Agarwal,Ashutosh Kumar Singh
DOI: https://doi.org/10.48550/arXiv.2203.05367
2022-03-10
Cryptography and Security
Abstract:Confidentiality of the data is being endangered as it has been categorized into false categories which might get leaked to an unauthorized party. For this reason, various organizations are mainly implementing data leakage prevention systems (DLPs). Firewalls and intrusion detection systems are being outdated versions of security mechanisms. The data which are being used, in sending state or are rest are being monitored by DLPs. The confidential data is prevented with the help of neighboring contexts and contents of DLPs. In this paper, a semantic-based approach is used to classify data based on the statistical data leakage prevention model. To detect involved private data, statistical analysis is being used to contribute secure mechanisms in the environment of data leakage. The favored Frequency-Inverse Document Frequency (TF-IDF) is the facts and details recapture function to arrange documents under particular topics. The results showcase that a similar statistical DLP approach could appropriately classify documents in case of extent alteration as well as interchanged documents.
What problem does this paper attempt to address?