OpenAI’s o3 AI model scores lower on a benchmark than the company initially implied

20 abr OpenAI’s o3 AI model scores lower on a benchmark than the company initially implied

Posted at 18:19h in Tech-En by

A discrepancy between first- and third-party benchmark results for OpenAI’s o3 AI model is raising questions about the company’s transparency and model testing practices. When OpenAI unveiled o3 in December, the company claimed the model could answer just over a fourth of questions on FrontierMath, a challenging set of math problems. That score blew the […]