Abstract:In Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated high text generation quality. However, in real-world applications, LLMs must meet increasingly complex requirements. Beyond avoiding misleading or inappropriate content, LLMs are also expected to cater to specific user needs, such as imitating particular writing styles or generating text with poetic richness. These varied demands have driven the development of Controllable Text Generation (CTG) techniques, which ensure that outputs adhere to predefined control conditions--such as safety, sentiment, thematic consistency, and linguistic style--while maintaining high standards of helpfulness, fluency, and diversity. This paper systematically reviews the latest advancements in CTG for LLMs, offering a comprehensive definition of its core concepts and clarifying the requirements for control conditions and text quality. We categorize CTG tasks into two primary types: content control and attribute control. The key methods are discussed, including model retraining, fine-tuning, reinforcement learning, prompt engineering, latent space manipulation, and decoding-time intervention. We analyze each method's characteristics, advantages, and limitations, providing nuanced insights for achieving generation control. Additionally, we review CTG evaluation methods, summarize its applications across domains, and address key challenges in current research, including reduced fluency and practicality. We also propose several appeals, such as placing greater emphasis on real-world applications in future research. This paper aims to offer valuable guidance to researchers and developers in the field. Our reference list and Chinese version are open-sourced at <a class="link-external link-https" href="https://github.com/IAAR-Shanghai/CTGSurvey" rel="external noopener nofollow">this https URL</a>.

Gamma Sampling: Fine-grained Controlling Language Models without Training

Efficient and Training-Free Control of Language Generation

Controllable Generation via Locally Constrained Resampling

Turning Up the Heat: Min-p Sampling for Creative and Coherent LLM Outputs

A Systematic Characterization of Sampling Algorithms for Open-ended Language Generation

Diffusion-LM Improves Controllable Text Generation

Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

MacLaSa: Multi-Aspect Controllable Text Generation via Efficient Sampling from Compact Latent Space

Informed Sampling for Diversity in Concept-to-Text NLG

Critic-Guided Decoding for Controlled Text Generation

Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding

Plug and Play Language Models: A Simple Approach to Controlled Text Generation

Priority Sampling of Large Language Models for Compilers

Balancing Diversity and Risk in LLM Sampling: How to Select Your Method and Parameter for Open-Ended Text Generation

Penalizing the High-likelihood: A Novel Sampling Method for Open-ended Neural Text Generation via Inverse Probability Weighting

EDT: Improving Large Language Models' Generation by Entropy-based Dynamic Temperature Sampling

Controllable Text Generation for Open-Domain Creativity and Fairness

Controllable Text Generation for Large Language Models: A Survey

Flaming-hot Initiation with Regular Execution Sampling for Large Language Models

LLM can Achieve Self-Regulation via Hyperparameter Aware Generation