Abstract: During pretraining, large language models (LLMs) acquire vast amounts of knowledge from extensive text corpora. In later stages such as fine-tuning and inference, however, the model may encounter knowledge not covered in the initial training, which can lead to hallucinations and degraded performance. This issue strongly affects the model's capabilities, because the model will inevitably face out-of-scope knowledge after pretraining. Fine-tuning is often required to adapt LLMs to domain-specific tasks, yet the same issue limits the model's ability to learn and integrate new information during fine-tuning, and the effectiveness of fine-tuning depends largely on the type of knowledge involved. Existing research suggests that fine-tuning the model on partially mastered knowledge (for instance, question-answer pairs that the model has a chance of answering correctly under non-greedy decoding) can enable the model to acquire new knowledge while mitigating hallucination. However, this approach can still lead to the forgetting of fully mastered knowledge, and it constrains the fine-tuning dataset to a narrower range, limiting the model's overall potential for improvement. Given the model's intrinsic reasoning abilities and the interconnectedness of different knowledge areas, we hypothesize that as the model's capacity to use existing knowledge improves during fine-tuning, previously unmastered knowledge may become easier for the model to learn. To explore this hypothesis, we conducted experiments and, based on the results, propose a two-stage fine-tuning strategy. This approach improves the model's overall test accuracy and knowledge retention while preserving its accuracy on previously mastered content. When fine-tuning on the WikiQA dataset, our method increases the amount of knowledge acquired by the model in this stage by 24%.
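The notion of "partially mastered" knowledge in the abstract can be illustrated with a small sketch. This is not the paper's procedure: the function names, the three bucket labels, and the exact decision rule are assumptions made for illustration, and the abstract gives no thresholds or trial counts. The sketch assumes each question has already been answered several times under greedy decoding (for example with different prompts) and several times under sampling, and that each answer has been marked correct or incorrect.

```python
from typing import Dict, List, Sequence


def mastery_level(greedy_correct: Sequence[bool],
                  sampled_correct: Sequence[bool]) -> str:
    """Bucket one question-answer pair by how reliably the model answers it.

    Assumed rule (not taken from the paper):
      - fully mastered: every greedy attempt is correct
      - partially mastered: at least one attempt, greedy or sampled, is correct
      - unmastered: no attempt is correct
    """
    if greedy_correct and all(greedy_correct):
        return "fully_mastered"
    if any(greedy_correct) or any(sampled_correct):
        return "partially_mastered"
    return "unmastered"


def select_partially_mastered(records: List[Dict]) -> List[str]:
    """Return the questions that fall into the partially mastered bucket.

    Each record is a hypothetical dict with keys "question", "greedy",
    and "sampled"; the last two hold per-attempt correctness flags.
    """
    return [
        r["question"]
        for r in records
        if mastery_level(r["greedy"], r["sampled"]) == "partially_mastered"
    ]


if __name__ == "__main__":
    records = [
        {"question": "q1", "greedy": [True, True], "sampled": [True, False]},
        {"question": "q2", "greedy": [False, False], "sampled": [False, True]},
        {"question": "q3", "greedy": [False, False], "sampled": [False, False]},
    ]
    print(select_partially_mastered(records))
```

Under these assumptions, a two-stage scheme could re-run the bucketing after a first round of fine-tuning to see which previously unmastered questions have moved into the partially mastered bucket; how the paper actually builds its stages is not stated in the abstract.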

60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering

Enhancing Large Language Model Performance To Answer Questions and Extract Information More Accurately

KnowTuning: Knowledge-aware Fine-tuning for Large Language Models

Learning from "Silly" Questions Improves Large Language Models, But Only Slightly

Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models

Maybe Only 0.5 Training Data Instruction Tuning

Evaluating Fine-Tuning Efficiency of Human-Inspired Learning Strategies in Medical Question Answering

LaFFi: Leveraging Hybrid Natural Language Feedback for Fine-tuning Language Models

Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning

From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses

Fine-Tuning LLMs for Reliable Medical Question-Answering Services

Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering

Dial-insight: Fine-tuning Large Language Models with High-Quality Domain-Specific Data Preventing Capability Collapse

Instruction-tuned Language Models are Better Knowledge Learners

Fine-Tuning Medical Language Models for Enhanced Long-Contextual Understanding and Domain Expertise

Fine-tuning can Help Detect Pretraining Data from Large Language Models

Optimizing Language Model's Reasoning Abilities with Weak Supervision

Towards Building a Robust Knowledge Intensive Question Answering Model with Large Language Models

Large Language Models with Controllable Working Memory