AI benchmark discrepancy reveals gaps in performance claims

FrontierMath accuracy of OpenAI's o3 and o4-mini compared with leading models. Image: Epoch AI

The latest FrontierMath results,…