OpenAI says GPT‑5.2 is smarter than ever — but can it actually handle complex reasoning, code, planning and synthesis? I tested it with 9 real prompts to find out.
Codex, introducing "context compaction" for long tasks and raising API prices by 40% to target enterprise engineering.