Developing a framework for auditing large language models using human-in-the-loop
Maryam Amirizaniani, Jihan Yao, Adrian Lavergne, Elizabeth Snell Okada, Aman Chadha, Tanya Roosta, Chirag Shah
2024-02-01
Abstract:* Work does not relate to position at Amazon. Authors’ addresses: Maryam Amirizaniani, amaryam@ uw. edu, University of Washington, Seattle, WA, USA; Jihan Yao, jihany2@ uw. edu, University of Washington, Seattle, WA, USA; Adrian Lavergne, alavergn@ uw. edu, University of Washington, Seattle, WA, USA; Elizabeth Snell Okada, esokada@ uw. edu, University of Washington, Seattle, WA, USA; Aman Chadha,, Stanford University, Amazon AI, Palo Alto, CA, USA*; Tanya Roosta,, UC Berkeley, Amazon, Saratoga, CA, USA*; Chirag Shah, University of Washington, Seattle, WA, USA. 1 arXiv: 2402.09346 v1 [cs. AI] 14 Feb 2024