Abstract: Large language models (LLMs), typically designed as a function of next-word prediction, have excelled across extensive NLP tasks. Despite this generality, next-word prediction is often not an efficient formulation for many of these tasks, demanding an extreme scale of model parameters (tens or hundreds of billions) and sometimes yielding suboptimal performance. In practice, it is often desirable to build more efficient models -- despite being less versatile, they still apply to a substantial subset of problems, delivering on-par or even superior performance with much smaller model sizes. In this paper, we propose text alignment as an efficient unified model for a wide range of crucial tasks involving text entailment, similarity, question answering (and answerability), factual consistency, and so forth. Given a pair of texts, the model measures the degree of alignment between their information. We instantiate an alignment model (Align) through lightweight finetuning of RoBERTa (355M parameters) using 5.9M examples from 28 datasets. Despite its compact size, extensive experiments show the model's efficiency and strong performance: (1) On over 20 datasets covering the aforementioned diverse tasks, the model matches or surpasses FLAN-T5 models that have around 2x or 10x more parameters; the single unified model also outperforms task-specific models finetuned on individual datasets; (2) When applied to evaluate the factual consistency of language generation on 23 datasets, our model improves over various baselines, including the much larger GPT-3.5 (ChatGPT) and sometimes even GPT-4; (3) The lightweight model can also serve as an add-on component for LLMs such as GPT-3.5 in question answering tasks, improving the average exact match (EM) score by 17.94 and the F1 score by 15.05 by identifying unanswerable questions.

Value Alignment from Unstructured Text

Heterogeneous Value Alignment Evaluation for Large Language Models

Strong and weak alignment of large language models with human values

Large Language Model Alignment: A Survey

Contextual Moral Value Alignment Through Context-Based Aggregation

Exploring Multilingual Concepts of Human Value in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?

Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches

Text Alignment Is An Efficient Unified Model for Massive NLP Tasks

High-Dimension Human Value Representation in Large Language Models

Do LLMs have Consistent Values?

Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values

Towards an End-to-End Personal Fine-Tuning Framework for AI Value Alignment

From Instructions to Intrinsic Human Values -- A Survey of Alignment Goals for Big Models

Self-Alignment: Improving Alignment of Cultural Values in LLMs via In-Context Learning

ABC Align: Large Language Model Alignment for Safety & Accuracy

A Moral Imperative: The Need for Continual Superalignment of Large Language Models

Value FULCRA: Mapping Large Language Models to the Multidimensional Spectrum of Basic Human Values

Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations