Comparative Study and Framework for Automated Summariser Evaluation: LangChain and Hybrid Algorithms

Bagiya Lakshmi S,Sanjjushri Varshini R,Rohith Mahadevan,Raja CSP Raman
2023-10-04
Abstract:Automated Essay Score (AES) is proven to be one of the cutting-edge technologies. Scoring techniques are used for various purposes. Reliable scores are calculated based on influential variables. Such variables can be computed by different methods based on the domain. The research is concentrated on the user's understanding of a given topic. The analysis is based on a scoring index by using Large Language Models. The user can then compare and contrast the understanding of a topic that they recently learned. The results are then contributed towards learning analytics and progression is made for enhancing the learning ability. In this research, the focus is on summarizing a PDF document and gauging a user's understanding of its content. The process involves utilizing a Langchain tool to summarize the PDF and extract the essential information. By employing this technique, the research aims to determine how well the user comprehends the summarized content.
Machine Learning,Computation and Language
What problem does this paper attempt to address?
The problem this paper attempts to address is the assessment of users' understanding of PDF document content. Specifically, the study focuses on using scoring indices generated by large language models (such as BERT) to evaluate users' comprehension of specific topics, surpassing traditional assessment methods. The research not only focuses on users' overall understanding levels but also aims to provide specific feedback by comparing users' understanding of document summaries and the original PDF content, to help users improve their learning outcomes. The research methods include: 1. Using the LangChain tool to summarize PDF documents and extract key information. 2. Evaluating users' understanding of the summary content and the original PDF content through various similarity measures (such as cosine similarity, Sorensen similarity, Jaccard similarity, and BERT embedding similarity). 3. Calculating the average of these similarity measures to derive a comprehensive score that reflects users' understanding of the document content. Ultimately, the study provides a quantitative indicator through this method to assess users' understanding levels and offers new tools for personalized learning and employee evaluation in the fields of education and information technology.