Towards Trustworthy Large Language Models.

Sanmi Koyejo,Bo Li
DOI: https://doi.org/10.1145/3616855.3636454
2024-01-01
Abstract:Large Language models are among the most exciting technologies developed in the last few years. While the model's capabilities continue to improve, researchers, practitioners, and the general public are increasingly aware of some of its shortcomings. What will it take to build trustworthy large language models? This tutorial will present a range of recent findings, discussions, questions, and partial answers in the space of trustworthiness in large language models. While this tutorial will not attempt a comprehensive overview of this rich area, we aim to provide the participants with some tools and insights and to understand both the conceptual foundations of trustworthiness and a broad range of ongoing research efforts. We will tackle some of the hard questions that you may have about trustworthy large language models and hopefully address some misconceptions that have become pervasive.
What problem does this paper attempt to address?