Imagen 3
Imagen-Team-Google,Jason Baldridge,Jakob Bauer,Mukul Bhutani,Nicole Brichtova,Andrew Bunner,Kelvin Chan,Yichang Chen,Sander Dieleman,Yuqing Du,Zach Eaton-Rosen,Hongliang Fei,Nando de Freitas,Yilin Gao,Evgeny Gladchenko,Sergio Gómez Colmenarejo,Mandy Guo,Alex Haig,Will Hawkins,Hexiang Hu,Huilian Huang,Tobenna Peter Igwe,Christos Kaplanis,Siavash Khodadadeh,Yelin Kim,Ksenia Konyushkova,Karol Langner,Eric Lau,Shixin Luo,Soňa Mokrá,Henna Nandwani,Yasumasa Onoe,Aäron van den Oord,Zarana Parekh,Jordi Pont-Tuset,Hang Qi,Rui Qian,Deepak Ramachandran,Poorva Rane,Abdullah Rashwan,Ali Razavi,Robert Riachi,Hansa Srinivasan,Srivatsan Srinivasan,Robin Strudel,Benigno Uria,Oliver Wang,Su Wang,Austin Waters,Chris Wolff,Auriel Wright,Zhisheng Xiao,Hao Xiong,Keyang Xu,Marc van Zee,Junlin Zhang,Katie Zhang,Wenlei Zhou,Konrad Zolna,Ola Aboubakar,Canfer Akbulut,Oscar Akerlund,Isabela Albuquerque,Nina Anderson,Marco Andreetto,Lora Aroyo,Ben Bariach,David Barker,Sherry Ben,Dana Berman,Courtney Biles,Irina Blok,Pankil Botadra,Jenny Brennan,Karla Brown,John Buckley,Rudy Bunel,Elie Bursztein,Christina Butterfield,Ben Caine,Viral Carpenter,Norman Casagrande,Ming-Wei Chang,Solomon Chang,Shamik Chaudhuri,Tony Chen,John Choi,Dmitry Churbanau,Nathan Clement,Matan Cohen,Forrester Cole,Mikhail Dektiarev,Vincent Du,Praneet Dutta,Tom Eccles,Ndidi Elue,Ashley Feden,Shlomi Fruchter,Frankie Garcia,Roopal Garg,et al. (151 additional authors not shown)
2024-08-14
Abstract:We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models.
Computer Vision and Pattern Recognition