Combining Hierachical VAEs with LLMs for clinically meaningful timeline summarisation in social media

Jiayu Song,Jenny Chim,Adam Tsakalidis,Julia Ive,Dana Atzil-Slonim,Maria Liakata

2024-02-16

Abstract:We introduce a hybrid abstractive summarisation approach combining hierarchical VAE with LLMs (LlaMA-2) to produce clinically meaningful summaries from social media user timelines, appropriate for mental health monitoring. The summaries combine two different narrative points of view: clinical insights in third person useful for a clinician are generated by feeding into an LLM specialised clinical prompts, and importantly, a temporally sensitive abstractive summary of the user's timeline in first person, generated by a novel hierarchical variational autoencoder, TH-VAE. We assess the generated summaries via automatic evaluation against expert summaries and via human evaluation with clinical experts, showing that timeline summarisation by TH-VAE results in more factual and logically coherent summaries rich in clinical utility and superior to LLM-only approaches in capturing changes over time.

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve This paper aims to generate clinically meaningful summaries of social media user timelines to assist in mental health monitoring by combining hierarchical Variational Autoencoders (hierarchical VAE) with large language models (LLM). Specifically, the paper attempts to address the following issues: 1. **Longitudinal Assessment of Individual Mental Health Changes**: Although previous studies have utilized social media data to identify mental health issues such as depression and bipolar disorder, few have focused on the changes in an individual's mental health state over time. The paper proposes a method to capture the changes in the content posted by individuals on social media over time, thereby better assessing their mental health status. 2. **Generation of Clinically Meaningful Summaries**: Currently, clinicians primarily rely on patients' self-reports to assess their mental health status. These reports have many limitations, such as the individual's level of self-awareness, willingness to fill out questionnaires, and limited range of choices. The proposed method can generate concise and clinically informative summaries that can complement traditional self-reports, improving the efficiency and accuracy of mental health monitoring. 3. **Combining Different Narrative Perspectives**: The method proposed in the paper generates summaries that include two different narrative perspectives: one is third-person clinical insights suitable for clinicians, and the other is first-person time-sensitive abstract summaries reflecting the user's own timeline. This combination helps to more comprehensively understand an individual's mental health status. ### Main Contributions 1. **Developed a New Abstract Timeline Summarization Method (TH-VAE)**: This method is based on the hierarchical Variational Autoencoder model (NVAE) and is adapted to longitudinal social media data (user timelines). 2. **Proposed a New Task of Generating Clinically Meaningful Summaries from Social Media Data**: These summaries are generated using a hybrid approach, including high-level third-person information (diagnosis, interpersonal and personal relationships, moments of change) and first-person evidence from the user's timeline. 3. **Created a Dataset of Mental Health Summaries Written by Experts**: This dataset is used for modeling and evaluation. 4. **Provided a Detailed Summary Evaluation Method**: Based on the retention of clinical information, the consistency of the summary, and its usefulness to clinicians, using semantic similarity metrics, natural language inference (NLI), and expert human evaluation. 5. **Experimental Results Show that TH-VAE Performs Well in Unsupervised Summarization Methods**: Compared to methods that only use LLM, TH-VAE is more effective in capturing temporal changes.

Combining Hierachical VAEs with LLMs for clinically meaningful timeline summarisation in social media

Hierarchical organization for medical video summarization using latent visual and semantic analysis

Evolutionary Hierarchical Dirichlet Process for Timeline Summarization.

SA-HAVE: A Self-Attention based Hierarchical VAEs Network for Abstractive Summarization

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning

Realizing Video Summarization from the Path of Language-based Semantic Understanding

Adapting Large Language Models for Automated Summarisation of Electronic Medical Records in Clinical Coding

A Dataset and Benchmark for Hospital Course Summarization with Adapted Large Language Models

Personalized Video Summarization by Multimodal Video Understanding

Sexual hormone fluctuation in chinchillas.

Enhanced Electronic Health Records Text Summarization Using Large Language Models

TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: Benchmark Study

Harnessing LLMs for Automated Video Content Analysis: An Exploratory Workflow of Short Videos on Depression

On Learning to Summarize with Large Language Models as References

A Framework to Assess Clinical Safety and Hallucination Rates of LLMs for Medical Text Summarisation

Filling the gaps: leveraging large language models for temporal harmonization of clinical text across multiple medical visits for clinical prediction

Harmonising the Clinical Melody: Tuning Large Language Models for Hospital Course Summarisation in Clinical Coding

Large Language Models Can Enable Inductive Thematic Analysis of a Social Media Corpus in a Single Prompt: Human Validation Study

A Comparative Study of Recent Large Language Models on Generating Hospital Discharge Summaries for Lung Cancer Patients

Align vision-language semantics by multi-task learning for multi-modal summarization