GPT-5.2 vs Gemini 3.1 Pro: Frontier AI Benchmarks 2026
OpenAI’s GPT-5.2 achieved a perfect 100% on AIME 2025 math, while Google’s Gemini 3.1 Pro scored 77.1% on ARC-AGI-2 — more than double GPT-5.2’s 52.9% on that test. These results measure different capabilities, and choosing the right frontier model for your workload requires understanding exactly what each benchmark is and isn’t telling you.