A deep learning framework for multi-document summarization using LSTM with improved Dingo Optimizer (IDO)

Geetanjali Singh, Namita Mittal, Satyendra Singh Chouhan
DOI: https://doi.org/10.1007/s11042-024-18248-2
IF: 2.577
2024-02-03
Multimedia Tools and Applications
Abstract: Multi-document summarization (MDS) has attracted considerable attention across many knowledge domains. Extractive MDS techniques aim to condense a collection of documents by retaining essential content and minimizing unnecessary material. MDS is more challenging than single-document summarization and suffers from several weaknesses, including inaccurate selection of important sentences, low coverage, and redundancy among sentences. To address these issues, the proposed system introduces an automated extractive MDS approach. The process begins with pre-processing of the original documents, followed by the extraction of features such as modified TF-IDF, Bag of Words (BOW), and concept similarity (CS) features. These features are then fed into a Long Short-Term Memory (LSTM) framework. The model's weights are fine-tuned using the Improved Dingo Optimization (IDO) technique. The proposed model is evaluated on the Amazon Review and DUC-2002 datasets, and its performance is compared with various existing algorithms. The results demonstrate significant improvements over baseline models, with an accuracy of 0.922862 on the Amazon Review dataset and 0.899730 on the DUC-2002 dataset. These findings underscore the effectiveness of the proposed technique in improving the accuracy of extractive multi-document summarization.
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering
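
The feature-extraction step named in the abstract can be illustrated with a minimal sketch. It assumes standard smoothed TF-IDF rather than the paper's modified TF-IDF (whose definition is not given here), and it uses cosine similarity only as a simple redundancy check, not as the paper's concept similarity feature. The helper names (`tokenize`, `bow_vector`, `tfidf_vectors`, `cosine`) and the sample sentences are hypothetical; the LSTM and the IDO weight tuning are not shown.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase a sentence and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", sentence.lower())


def bow_vector(tokens, vocab):
    """Bag-of-words counts in a fixed vocabulary order."""
    counts = Counter(tokens)
    return [counts[w] for w in vocab]


def tfidf_vectors(token_lists, vocab):
    """Standard smoothed TF-IDF per sentence (assumption: NOT the paper's modified variant)."""
    n = len(token_lists)
    # Document frequency: number of sentences containing each word.
    df = Counter(w for tokens in token_lists for w in set(tokens))
    idf = {w: math.log((1 + n) / (1 + df[w])) + 1.0 for w in vocab}
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty sentences
        vectors.append([counts[w] / total * idf[w] for w in vocab])
    return vectors


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Hypothetical review sentences standing in for a document collection.
sentences = [
    "The battery life is excellent.",
    "Battery life is really excellent overall.",
    "Shipping was slow and the box arrived damaged.",
]

token_lists = [tokenize(s) for s in sentences]
vocab = sorted({w for tokens in token_lists for w in tokens})

bow = [bow_vector(tokens, vocab) for tokens in token_lists]
tfidf = tfidf_vectors(token_lists, vocab)

# One feature vector per sentence: TF-IDF weights followed by BOW counts.
features = [t + b for t, b in zip(tfidf, bow)]

# Near-duplicate sentences score higher than unrelated ones.
sim_redundant = cosine(tfidf[0], tfidf[1])
sim_unrelated = cosine(tfidf[0], tfidf[2])
```

In a pipeline like the one the abstract describes, per-sentence vectors such as `features` would be the input to the sequence model, and a similarity score like `sim_redundant` is one way to flag overlapping sentences before selection.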