Open ai research

Why Tejal Patwardhan stopped underestimating the models - Episode 21

Why Tejal Patwardhan stopped underestimating the models - Episode 21

Tejal Patwardhan, head of OpenAI's frontier evals team, discusses the critical evolution of AI evaluations. She explains why traditional benchmarks fail as models become more capable, how OpenAI develops realistic, long-horizon tests (including groundbreaking wet lab experiments), and the implications of rapidly advancing multimodal and reasoning models for scientific discovery and the future of human work.

How a reasoning model cracked an 80-year-old math problem — the OpenAI Podcast Ep. 20

How a reasoning model cracked an 80-year-old math problem — the OpenAI Podcast Ep. 20

OpenAI's reasoning researchers discuss how a general-purpose AI model disproved an 80-year-old conjecture from mathematician Paul Erdős. They detail the journey from initial IMO/IOI breakthroughs to the verification of the proof, highlighting the model's creative application of advanced number theory. The episode explores the profound implications for the future of mathematics, AI-human collaboration, and the broader scientific landscape, offering advice for researchers seeking to leverage AI for groundbreaking discoveries.