Exploring how AI is reshaping our world
Daily analysis of AI tools, research, and industry shifts — written for engineers and decision-makers.
-
GPT-5.4 Review: Accuracy Gains and Context Window Limits
OpenAI’s GPT-5.4 cuts individual factual errors by 33% and raises the context ceiling to 1.05M tokens—but roughly 1 in 12 claims in long outputs is still wrong, and quality degrades past 800K tokens. Here’s what changed, what the benchmarks don’t…
-
Gemini 3.1 Pro vs GPT-5.2: The Context Window War
Google’s Gemini 3.1 Pro has a 1M-token context window. OpenAI’s GPT-5.2 caps at 400K. The raw numbers favor Gemini —…
-
Cursor BugBot Autofix: Parallel Agents Fix PRs
Cursor’s BugBot Autofix — now generally available — uses isolated cloud agents to propose fixes directly on pull requests. Over…
-
AI Writes Its Own Paper—And Passes Peer Review
Sakana AI’s AI Scientist-v2 produced the first fully AI-generated paper to pass human peer review—published in Nature today. The result…
-
AI Coding Agents in 2026: 90% Adoption, Zero DORA Gain
90 to 95 percent of developers now use AI coding tools, and individual velocity metrics are clearly up. But DORA…
-
Enterprise AI Factories: What NTT DATA and NVIDIA Built
NTT DATA and NVIDIA launched enterprise AI factories in March 2026, targeting the gap between successful AI pilots and production…
-
Snowflake + OpenAI $200M: AI Agents on Enterprise Data
Snowflake’s $200 million OpenAI deal — the second such deal after a matching Anthropic partnership in December 2025 — brings…
-
Autoscience’s $14M Bet: AI That Does Its Own Research
Every week, thousands of new ML papers land on arXiv — and no human team can act on them all.…
-
DORA 2025: More AI, More Code, Flatter Delivery
Ninety percent of developers use AI daily, yet organizational delivery metrics have barely moved. The 2025 DORA report explains why:…
