OpenAssistant Conversations -- Democratizing Large Language Model Alignment

Andreas Köpf,Yannic Kilcher,Dimitri von Rütte,Sotiris Anagnostidis,Zhi-Rui Tam,Keith Stevens,Abdullah Barhoum,Nguyen Minh Duc,Oliver Stanley,Richárd Nagyfi,Shahul ES,Sameer Suri,David Glushkov,Arnav Dantuluri,Andrew Maguire,Christoph Schuhmann,Huu Nguyen,Alexander Mattick

2023-10-31

Abstract:Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their accessibility and utility across various domains. However, state-of-the-art alignment techniques like RLHF rely on high-quality human feedback data, which is expensive to create and often remains proprietary. In an effort to democratize research on large-scale alignment, we release OpenAssistant Conversations, a human-generated, human-annotated assistant-style conversation corpus consisting of 161,443 messages in 35 different languages, annotated with 461,292 quality ratings, resulting in over 10,000 complete and fully annotated conversation trees. The corpus is a product of a worldwide crowd-sourcing effort involving over 13,500 volunteers. Models trained on OpenAssistant Conversations show consistent improvements on standard benchmarks over respective base models. We release our code and data under a fully permissive licence.

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

The paper aims to address the issue of aligning large language models (LLMs) with human preferences to improve their usability and safety. Specifically: 1. **Model Alignment Issue**: The paper points out that although current large language models have a massive number of parameters and rich training data, there is still a discrepancy between model predictions and their intended use in practical applications. This inconsistency mainly manifests in the content generated by the models, which may not align with human values, intentions, or preferences. 2. **Scarcity of High-Quality Feedback Data**: The most advanced alignment techniques, such as Reinforcement Learning from Human Feedback (RLHF), rely on high-quality human feedback data. However, creating such datasets is costly and often proprietary, limiting the research community's progress in large-scale alignment. To address these issues, the authors released the OpenAssistant Conversations dataset, a large-scale dialogue tree dataset containing 161,443 messages, covering 35 languages, and annotated with 461,292 quality ratings. This dataset was collaboratively built by over 13,500 volunteers worldwide, aiming to promote more diverse and inclusive research through open access. This allows researchers to explore complex human language and interaction patterns, thereby better aligning large language models. Additionally, the paper discusses the composition of the dataset, the characteristics of the contributors, and the results of their satisfaction survey. Experiments were also conducted, demonstrating that models fine-tuned using this dataset outperformed baseline models in multiple benchmark tests.

OpenAssistant Conversations -- Democratizing Large Language Model Alignment

Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

Aligning Large Language Models with Human: A Survey

UltraFeedback: Boosting Language Models with Scaled AI Feedback

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

Assistive Large Language Model Agents for Socially-Aware Negotiation Dialogues

Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

TencentLLMEval: A Hierarchical Evaluation of Real-World Capabilities for Human-Aligned LLMs

AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators

The Real, the Better: Aligning Large Language Models with Online Human Behaviors

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Emphasising Structured Information: Integrating Abstract Meaning Representation into LLMs for Enhanced Open-Domain Dialogue Evaluation

SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation

GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants

Supporting Human-AI Collaboration in Auditing LLMs with LLMs

LaMDA: Language Models for Dialog Applications