Combining Hierarchical VAEs with LLMs for clinically meaningful timeline summarisation in social media

Jiayu Song, Jenny Chim, Adam Tsakalidis, Julia Ive, Dana Atzil-Slonim, Maria Liakata
2024-02-16
Abstract: We introduce a hybrid abstractive summarisation approach combining hierarchical VAE with LLMs (LLaMA-2) to produce clinically meaningful summaries from social media user timelines, appropriate for mental health monitoring. The summaries combine two different narrative points of view: clinical insights in third person useful for a clinician are generated by feeding into an LLM specialised clinical prompts, and importantly, a temporally sensitive abstractive summary of the user's timeline in first person, generated by a novel hierarchical variational autoencoder, TH-VAE. We assess the generated summaries via automatic evaluation against expert summaries and via human evaluation with clinical experts, showing that timeline summarisation by TH-VAE results in more factual and logically coherent summaries rich in clinical utility and superior to LLM-only approaches in capturing changes over time.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

This paper aims to generate clinically meaningful summaries of social media user timelines to assist mental health monitoring, by combining a hierarchical variational autoencoder (hierarchical VAE) with a large language model (LLM). Specifically, it addresses the following issues:

1. **Longitudinal Assessment of Individual Mental Health Changes**: Previous studies have used social media data to identify mental health conditions such as depression and bipolar disorder, but few have examined how an individual's mental health state changes over time. The paper proposes a method that captures changes in the content an individual posts on social media over time, allowing a better assessment of their mental health status.
2. **Generation of Clinically Meaningful Summaries**: Clinicians currently rely primarily on patients' self-reports to assess mental health status. These reports are limited by the individual's level of self-awareness, their willingness to fill out questionnaires, and the restricted range of answer choices. The proposed method generates concise, clinically informative summaries that can complement traditional self-reports, with the aim of making mental health monitoring more efficient and accurate.
3. **Combining Different Narrative Perspectives**: The generated summaries combine two narrative perspectives: third-person clinical insights intended for clinicians, and a first-person, temporally sensitive abstractive summary of the user's own timeline. Together they give a more comprehensive picture of an individual's mental health status.

### Main Contributions

1. **Developed a New Abstractive Timeline Summarization Method (TH-VAE)**: This method is based on the hierarchical variational autoencoder model NVAE and is adapted to longitudinal social media data (user timelines).
2. **Proposed a New Task of Generating Clinically Meaningful Summaries from Social Media Data**: The summaries are generated with a hybrid approach that combines high-level third-person information (diagnosis, interpersonal and personal relationships, moments of change) with first-person evidence from the user's timeline.
3. **Created a Dataset of Mental Health Summaries Written by Experts**: This dataset is used for modeling and evaluation.
4. **Provided a Detailed Summary Evaluation Method**: The evaluation covers the retention of clinical information, the consistency of the summary, and its usefulness to clinicians, using semantic similarity metrics, natural language inference (NLI), and expert human evaluation.
5. **Experimental Results Show that TH-VAE Performs Well Among Unsupervised Summarization Methods**: Compared with LLM-only methods, TH-VAE is more effective at capturing changes over time.
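
The evaluation described in contribution 4 compares generated summaries with expert-written ones. The sketch below only illustrates the general shape of such a comparison and is not the paper's metric. It scores a generated summary against a reference with cosine similarity over bag-of-words counts, which is a crude lexical stand-in for the semantic similarity metrics the paper mentions. The function names (`bag_of_words`, `cosine_similarity`, `score_summary`) and the example sentences are hypothetical.

```python
import math
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercase the text and count word tokens (a crude lexical representation)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors; 0.0 if either is empty."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def score_summary(generated: str, expert: str) -> float:
    """Score a generated summary against an expert summary (lexical proxy only)."""
    return cosine_similarity(bag_of_words(generated), bag_of_words(expert))


if __name__ == "__main__":
    # Hypothetical example texts, not taken from the paper's dataset.
    expert = "I felt isolated at first but slowly started reaching out to friends."
    generated = "At first I felt isolated, then I started reaching out to friends."
    print(round(score_summary(generated, expert), 3))
```

A real evaluation would likely replace the bag-of-words vectors with sentence embeddings and add an NLI model to check consistency. Both are beyond this sketch.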