Francis Real,Gabriel Goh,Janko Altenschmidt,Kenny Hsu,Vik Goel,Evan Morikawa,Heewoo Jun,Niko Felix,Toki Sherbakov,Greg Brockman,Carroll L. Wainwright,N. Keskar,N. Tezak,Jack W. Rae,Barret Zoph,Giambattista Parascandolo,Benjamin D. Sokolowsky,David Medina,Lilian Weng,Hyeonwoo Noh,Denny Jin,Logan Kilpatrick,Vishal Kuo,Jake McNeil,Eric Sigler,Clemens Winter,Derek Chen,Girish Sastry,Trevor Cai,Joe Palermo,Valerie Balcom,Yaniv Markovski,O. Murk,Che Chang,Fotis Chantzis,Jordan Sitkin,Red Avila,Peter Hoeschele,Matthew Knight,Alvin Wang,Yongjik Kim,Carl Ross,Atty Eleti,J. Leike,Hannah Wong,Marvin Zhang,Jake Berdine,P. Welinder,Igor Babuschkin,Xin Hu,Kai Xiao,Adam Perelman,Ruby Chen,Haim-ing Bao,Madelaine Boyd,CJ Weinmann,Tao Xu,Joshua Gross,Lauren Workman,Lenny Bogdonoff,David Schnurr,J. Kiros,Michael Lampe,Aris Konstantinidis,Sarah Shoker,Mike Heaton,Tong Mu,Justin Jay Wang,S. Balaji,Jong Wook Kim,Kyla Sheppard,Joel Parish,Akila Welihinda,Kendra Rimbach,Cory Decareaux,Teddy Lee,Katarina Slama,Maddie Simens,Raphael Gontijo-Lopes,Jade Leung,Madeleine Thompson,Matt Wiethoff,Chong Zhang,Yunxing Dai,Ashvin Nair,Natalie Staudacher,Rosie Campbell,Adrien Ecoffet,A. Makanju,Shyamal Anadkat,Bob McGrew,Andrew Peng,Lama Ahmad,Cameron Raymond,Bob Rotsted,Shibani Santurkar,Michelle Pokrass,Chris Hesse,Mikhail Pavlov,Lukasz Kaiser,Daniel Levy,Shantanu Jain,Rajeev Nayak,Yang Song,Dave Willner,Florencia Leoni Aleman,Michael Pokorny,Jason Chen,Jerry Tworek,Qim-ing Yuan,Tianhao Zheng,Wade Hickey,Ikai Lan,Tyna Eloundou,Joanne Jang,Arvind Neelakantan,M. Saltarelli,Shengli Hu,Noah Deutsch,Is-abella Fulford,Mark Chen,Christopher Berner,Molly Lin,Ma-teusz Litwin,Theresa Lopez,Alec Radford,Jason Wei,Tabarak Khan,Aalok Mehta,Jacob Menick,Juston Forte,J. Pachocki,Jeff Belgum,Shino Jomoto,Aditya Ramesh,A. Kondrich,B. Chess,Andrew Mayne,Ashley Pantuliano,Samuel Wolrich,Sandhini Agarwal,Richard Ngo,Alethea Power,Nick Ryder,Arka Dhar,Elizabeth Proehl,Alex Paino,Ted Sanders,Heather Schmidt,Jie Tang,Leo Gao,Thomas Degry,B. Jonn,Roger Jiang,Amin Tootoonchian,F. Such,Gabriel Bernadett-Shapiro,Bianca Martin,Vinnie Monaco,Raul Puri,Natalie Summers,Yuchen He,Sheila Dunning,Jiayi Weng,Shengjia Zhao,Elie Georges,Shawn Jain,Anna-Luisa Brakman,Emy Parparita,Arun Vijayvergiya,Morgan Grafstein,Damien Deville,Henri Roussez,C. McLeavey,Andrea Vallone,Haozhun Jin,Alexandre Passos,Sherwin Wu,Christina Kim,Brittany Carey,Pamela Mishkin,Ouyang Long,I. Kanitscheider,Kevin Yu,OpenAI Josh Achiam,S. Gu,Jessica Shieh,John Schulman,Paul McMillan,I. Sutskever,Dave Cummings,Mo Bavarian,Chelsea Carlson,Sully Chen,David Farhi,Mira Murati,Kevin Button,Casey Chu,Sam Altman,Ali Kamali,Gretchen Krueger,Ilge Akkaya,Kim Malfacini,Boris Power,Chester Cho,Michael Wu,Johannes Heidecke,Wojciech Zaremba,Oleg Boiko,Jeremiah Currier,Hyung Won Chung,Chak Ming Li,Luke Metz,Filipe de Avila Belbute Peres,Henrique Pondé de Oliveira Pinto,Brandon Houghton,Preston Tuggle,Brooke Chan,Chelsea Voss,Angela Jiang,Ryan Greene,Lukasz Kondraciuk,Diogo Almeida,Steven Adler,Jonathan Ward,Hendrik Kirchner,Miles Brundage,Katie Mayer,Daniel Selsam,Yufei Guo,Chris Hallacy,Pranav Shyam,Tomer Kaftan,Juan Felipe Cer'on Uribe,Jonathan Gordon,Phil Tillet,Sim'on Posada Fishman,Reiichiro Nakano,C. Gibson,Rowan Zellers,Daniel P. Mossing,Kyle Kosic,Todor Markov,Vitchyr H. Pong,Jesse Han,Ben Wang,Ryan Lowe,Juntang Zhuang,Szymon Sidor,Rory Carmichael,Stephanie Lin,Nick Turley,David M'ely,Irwan Bello,Andrew Cann,Sam Manning,David Dohan,Tolly Powell,Joost Huizinga,Elizabeth Tseng,Sarah Yoo,Liam Fedus,Daniel Kokotajlo,S. McKinney,Cullen O'Keefe,Steve Dowling,Jeff Harris,Rachel Lim,Tim Brooks,Tarun Gogineni,Patricia Lue,Paul Baltescu,Alan Hickey,Michael Petrov,Ian Sohl,Jeff Wu,Andrey Mishchenko,Scott Gray,William Zhuk

Abstract:We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4's performance based on models trained with no more than 1/1,000th the compute of GPT-4.

Phi-4 Technical Report

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle

Textbooks Are All You Need II: phi-1.5 technical report

GPT-4 Technical Report

Textbooks Are All You Need

Apple Intelligence Foundation Language Models

RadPhi-3: Small Language Models for Radiology

PaLM 2 Technical Report

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Qwen Technical Report

SC-Phi2: A Fine-tuned Small Language Model for StarCraft II Macromanagement Tasks

Putting GPT-4o to the Sword: A Comprehensive Evaluation of Language, Vision, Speech, and Multimodal Proficiency

Nemotron-4 340B Technical Report

PHUDGE: Phi-3 as Scalable Judge

The Llama 3 Herd of Models

GPT-4o System Card

RAD-PHI2: Instruction Tuning PHI-2 for Radiology

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models