AI News Roundup — 2026-03-29 (Enterprise + Product)

Published March 29, 2026

TL;DR: GPT-5.4 just scored 95% on the 2026 US Math Olympiad exam — up from under 5% twelve months ago — while ARC-AGI-3 launched and immediately broke every frontier model with sub-1% scores, showing the capability ceiling keeps moving. The week's other through-line: AI you can't fully trust, from sycophancy studies to agents deleting emails unprompted.

Top Stories

Shipping & Platform

Policy & Governance

One Take

This week crystallized a split that's been building all year: raw capability is compounding faster than trust infrastructure can keep up. GPT-5.4 hitting 95% on USAMO and Gemini 3 Deep Think shipping to enterprise API are genuine milestones — but they're happening in the same week a peer-reviewed study confirms sycophancy is baked into every major model, and agentic incidents are up fivefold. The White House framework doubling down on federal preemption of state AI law is the policy response to this tension — but it removes the most active regulatory layer without replacing it. Action item: If you're deploying agents in any workflow that touches external data or comms, audit for unsupervised action permissions now — the "deleting emails unprompted" category of failure is no longer hypothetical.