LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Yaowei Zheng,Richong Zhang,Junhao Zhang,Yanhan Ye,Zheyan Luo,Zhangchi Feng,Yongqiang Ma

2024-06-28

Abstract:Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks. However, it requires non-trivial efforts to implement these methods on different models. We present LlamaFactory, a unified framework that integrates a suite of cutting-edge efficient training methods. It provides a solution for flexibly customizing the fine-tuning of 100+ LLMs without the need for coding through the built-in web UI LlamaBoard. We empirically validate the efficiency and effectiveness of our framework on language modeling and text generation tasks. It has been released at <a class="link-external link-https" href="https://github.com/hiyouga/LLaMA-Factory" rel="external noopener nofollow">this https URL</a> and received over 25,000 stars and 3,000 forks.

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

The paper attempts to address the challenge of efficient fine-tuning faced by large language models (LLMs) when adapting to downstream tasks. Specifically, the paper points out: 1. **Fine-tuning challenges under resource constraints**: Due to the large number of parameters in LLMs, direct fine-tuning requires a significant amount of computational resources, which is a major obstacle for many researchers and developers. 2. **Lack of a systematic framework**: Although the community has proposed various efficient fine-tuning methods, there is a lack of uniformity and systematicity among these methods, making it difficult for users to flexibly choose and apply them. To address these issues, the paper proposes a unified framework called **LLAMA FACTORY**, which integrates multiple advanced efficient training methods, aiming to simplify and optimize the fine-tuning process of LLMs through the following points: - **Modular design**: By adopting a modular design, it reduces the dependencies between different models, datasets, and training methods, allowing the framework to flexibly adapt to hundreds of different LLMs. - **User-friendly interface**: Provides a web-based user interface **LLAMA BOARD**, through which users can easily configure and launch fine-tuning tasks without writing code. - **Efficient fine-tuning techniques**: Integrates various efficient fine-tuning techniques, such as Freeze-tuning, GaLore (low-rank projection), and LoRA (low-rank adapters), significantly improving the efficiency and effectiveness of fine-tuning. Through these designs, **LLAMA FACTORY** aims to lower the threshold for fine-tuning LLMs, enabling more researchers and developers to conveniently utilize these powerful models for various downstream tasks.

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

LMTuner: An user-friendly and highly-integrable Training Framework for fine-tuning Large Language Models

Fine-tuning Large Language Models for Domain-specific Machine Translation

Llama 2: Open Foundation and Fine-Tuned Chat Models

LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use

Fine-grained LLM Agent: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback

TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

TinyLlama: An Open-Source Small Language Model

Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca

Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning

YuLan: An Open-source Large Language Model

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild

FederatedScope-LLM: A Comprehensive Package for Fine-tuning Large Language Models in Federated Learning

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs

FedCoLLM: A Parameter-Efficient Federated Co-tuning Framework for Large and Small Language Models

A Framework to Implement 1+N Multi-task Fine-tuning Pattern in LLMs Using the CGC-LORA Algorithm

LoGFiLM: Fine-Tuning A Large Language Model for Automated Generation of Log Statements

A Framework for Fine-Tuning LLMs using Heterogeneous Feedback