Security to text (S2T): multi-layered based security approaches for secret text content

Shamal Kashid, Lalit K. Awasthi, Krishan Berwal
DOI: https://doi.org/10.1007/s11042-024-19669-9
IF: 2.577
2024-06-20
Multimedia Tools and Applications
Abstract: In the digital world, text data is produced in an unstructured manner across various communication channels. Securely extracting valuable information from such data is crucial and requires techniques from text mining, information retrieval, and natural language processing (NLP). To address this, we introduce two novel approaches: keyword extraction (KE) and a multi-layered secret sharing scheme (MLSS) that secures the extracted keywords rather than entire text documents. The KE approach comprises a sequence of text pre-processing procedures, including tokenization, stopword removal, stemming, and bag-of-words representation, followed by indexing. This methodology aims to extract keywords from text datasets efficiently. For this research work, we have proposed three datasets, covering text messages, WhatsApp messages, and electronic mail. MLSS enhances the security of the extracted textual data by leveraging the text pre-processing steps. The scheme ensures better confidentiality and non-disclosure of sensitive information. Additionally, we evaluate our KE model on our own datasets as well as on standard datasets. Experimental results demonstrate the effectiveness of our proposed security to text (S2T) model, which outperforms existing state-of-the-art approaches. The model obtains a 100% correlation between the reconstructed text and the original text.
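The pre-processing sequence named in the abstract (tokenization, stopword removal, stemming, bag-of-words, indexing) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the stopword list, the crude suffix-stripping `naive_stem` helper (a stand-in for a real stemmer such as Porter's), the function names, and the sample messages are all assumptions made for illustration.

```python
import re
from collections import Counter, defaultdict

# Tiny illustrative stopword list (an assumption, not the paper's list).
STOPWORDS = {"a", "an", "the", "is", "are", "at", "to", "of", "and", "in", "on", "for"}

def tokenize(text):
    # Lowercase the text and split it into alphanumeric tokens.
    return re.findall(r"[a-z0-9]+", text.lower())

def naive_stem(token):
    # Crude suffix stripping; a real pipeline would use a proper stemmer.
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token

def preprocess(text):
    # Tokenize, drop stopwords, then stem what remains.
    return [naive_stem(t) for t in tokenize(text) if t not in STOPWORDS]

def bag_of_words(tokens):
    # Term-frequency representation of one document.
    return Counter(tokens)

def build_index(docs):
    # Inverted index: stemmed term -> set of document ids containing it.
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in preprocess(text):
            index[term].add(doc_id)
    return index

# Hypothetical sample messages, for illustration only.
docs = {
    "sms1": "Meeting the agents at the station",
    "mail1": "The agents are meeting tonight",
}
index = build_index(docs)
```

In this sketch, candidate keywords for a message could be read off with `bag_of_words(preprocess(text)).most_common(k)`, and `index` records which messages contain each stemmed term. How the paper actually ranks keywords, and how MLSS then protects them, is not specified in the abstract.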
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering