Abstract: The impressive capabilities of large language models (LLMs) have attracted extensive interest in applying them to the medical field. However, the complex nature of clinical environments presents significant hallucination challenges for LLMs, hindering their widespread adoption. In this paper, we address these hallucination issues in the context of Medical Information Extraction (MIE) tasks by introducing ALternate Contrastive Decoding (ALCD). We begin by redefining MIE tasks as an identify-and-classify process. We then separate the identification and classification functions of LLMs by selectively masking the optimization of tokens during fine-tuning. During the inference stage, we alternately contrast output distributions derived from the sub-task models. This approach aims to selectively enhance the identification and classification capabilities while minimizing the influence of other inherent abilities in LLMs. Additionally, we propose an alternate adaptive constraint strategy to more effectively adjust the scale and scope of contrastive tokens. Through comprehensive experiments on two different backbones and six diverse medical information extraction tasks, ALCD demonstrates significant improvements in resolving hallucination issues compared to conventional decoding methods.
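The abstract's "selectively masking the optimization of tokens during fine-tuning" can be pictured as building a separate label sequence for each sub-task model. The sketch below is only an illustration under assumptions: the role tags, the token ids, and the helper `mask_labels` are hypothetical, and the paper's actual masking scheme may differ. The value `-100` mirrors the default `ignore_index` of PyTorch's `CrossEntropyLoss`, so positions carrying it would contribute nothing to the loss.

```python
IGNORE_INDEX = -100  # positions with this label are skipped by the loss


def mask_labels(token_ids, roles, keep_role):
    """Return labels in which only tokens tagged `keep_role` are optimized.

    `roles` is a hypothetical per-token tag ("identify", "classify", "other")
    assumed to be produced when the identify-and-classify target is built.
    """
    if len(token_ids) != len(roles):
        raise ValueError("token_ids and roles must have the same length")
    return [
        tok if role == keep_role else IGNORE_INDEX
        for tok, role in zip(token_ids, roles)
    ]


# Made-up target sequence of the form: <entity> <sep> <type> <sep> <entity> <sep> <type>
token_ids = [101, 7, 202, 9, 103, 7, 204]
roles = ["identify", "other", "classify", "other", "identify", "other", "classify"]

# Labels for the model that should only learn to identify entity spans.
identification_labels = mask_labels(token_ids, roles, "identify")
# Labels for the model that should only learn to classify entities.
classification_labels = mask_labels(token_ids, roles, "classify")
```

Fine-tuning one copy of the backbone on each label sequence would yield the two sub-task models whose output distributions are contrasted at inference time.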

What problem does this paper attempt to address?

### Problems the Paper Aims to Solve

This paper addresses the hallucinations that large language models (LLMs) produce during medical information extraction (MIE) tasks. Two kinds of hallucination are considered:

1. **Identifying non-existent medical entities**: LLMs may generate medical entities that do not appear in the original text, thereby creating false information.
2. **Classification inference errors**: Due to statistical biases in the pre-training corpus, LLMs may make inference errors when classifying medical entities.

These issues are particularly severe in the medical field because the accuracy of medical information directly affects patient health and safety. Reducing them is therefore crucial for applying LLMs in the medical domain.

To address these problems, the authors propose ALternate Contrastive Decoding (ALCD). The method separates the identification and classification functions of the LLM during fine-tuning and, during inference, alternately contrasts the output distributions of the resulting sub-task models. The goal is to enhance the identification and classification capabilities of LLMs while minimizing the influence of their other inherent abilities. The authors also propose an alternate adaptive constraint strategy to adjust the scale and scope of contrastive tokens more effectively.

### Main Contributions

1. **First application of contrastive decoding**: To the best of the authors' knowledge, this is the first time contrastive decoding has been applied as a strategy to reduce hallucination issues in LLMs for MIE tasks.
2. **Extensive applicability validation**: The broad applicability of ALCD is validated through experiments on two different LLM backbones, covering six diverse medical information extraction tasks.
3. **Significant performance improvement**: Experimental results show that ALCD significantly outperforms eight established decoding methods across multiple tasks.
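To make the inference-time idea of alternately contrasting sub-task output distributions more concrete, here is a minimal sketch of a generic contrastive decoding step with a plausibility cutoff. It is not the paper's exact formulation: the scoring rule, the parameters `scale` and `alpha`, and the function names are assumptions chosen for illustration. `scale` stands in for the "scale" of the contrast and `alpha` for the "scope" of tokens allowed to take part in it.

```python
import math


def softmax(logits):
    """Convert a list of logits into probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def contrastive_step(enhance_logits, suppress_logits, scale=1.0, alpha=0.1):
    """Pick the next token by contrasting two next-token distributions.

    Tokens whose probability under the enhanced model falls below
    alpha * max-probability are excluded (scope); the remaining tokens are
    scored as (1 + scale) * log p_enhance - scale * log p_suppress.
    """
    p_enh = softmax(enhance_logits)
    p_sup = softmax(suppress_logits)
    cutoff = alpha * max(p_enh)
    best_token, best_score = None, -math.inf
    for token, (pe, ps) in enumerate(zip(p_enh, p_sup)):
        if pe < cutoff:
            continue  # outside the plausible set, never selected
        score = (1 + scale) * math.log(pe) - scale * math.log(ps)
        if score > best_score:
            best_token, best_score = token, score
    return best_token


def alternate_step(phase, ident_logits, cls_logits, **kwargs):
    """Assumed alternation: enhance the model that matches the current phase."""
    if phase == "identify":
        return contrastive_step(ident_logits, cls_logits, **kwargs)
    return contrastive_step(cls_logits, ident_logits, **kwargs)


# Toy three-token vocabulary with made-up logits from the two sub-task models.
ident_logits = [2.0, 1.9, -5.0]
cls_logits = [2.0, 0.0, 3.0]

token_when_identifying = alternate_step("identify", ident_logits, cls_logits)
token_when_classifying = alternate_step("classify", ident_logits, cls_logits)
```

With `scale=0.0` the step reduces to greedy decoding on the enhanced model, which makes it easy to compare the contrasted choice against the conventional one on the same logits.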

Mitigating Hallucinations of Large Language Models in Medical Information Extraction via Contrastive Decoding
