Chain of News Digest

Chain of News 17/03/2026

OpenAI is accelerating the “tiny‑but‑mighty” trend in large language models with the launch of GPT‑5.4 mini and nano, lightweight siblings to the recently released GPT‑5.4. Both are billed as faster, lower‑latency options tuned for code generation, tool use, multimodal reasoning, and high‑volume API traffic, and both support sub‑agent orchestration. The move mirrors OpenAI’s broader push toward modular AI pipelines, a direction reinforced by the general‑availability rollout of Codex subagents and custom agents, which let developers stitch together specialized tool‑calling components without hitting the core model’s context ceiling. For engineers, the practical upshot is a new tier of models that can be embedded in CI/CD bots, IDE extensions, or edge services where response time and token budget are at a premium. At the same time,

Today's Stories

Cloudflare Blog

Introducing Custom Regions for precision data control

We are expanding Regional Services with new pre-defined regions and the launch of Custom Regions. Customers can now define precise geographical boundaries for data processing, tailored to meet their compliance and performance needs.

18/03/2026
Latent Space

Why Anthropic Thinks AI Should Have Its Own Computer — Felix Rieseberg of Claude Cowork & Claude Code Desktop

Claude Cowork came out of an accident.

17/03/2026
GNews: Claude AI

Anthropic's Claude down: Users report issues - Latest news from Azerbaijan

17/03/2026
Simon Willison

GPT-5.4 mini and GPT-5.4 nano, which can describe 76,000 photos for $52

OpenAI today: Introducing GPT‑5.4 mini and nano. These models join GPT-5.4, which was released two weeks ago. OpenAI's self-reported benchmarks show the new 5.4-nano out-performing their previous GPT-5 mini model when run at maximum reasoning effort. The new mini is also 2x faster than the previous mini. Here's how the pricing looks - all prices are per million tokens. gpt-5.4-nano is notably even cheaper than Google's Gemini 3.1 Flash-Lite:

Model | Input | Cached input | Output
gpt-5.4 | $2.50 | $0.25 | $
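As a rough check on the headline figure (76,000 photos described for $52), the per-photo cost can be worked out directly. This is only a sketch of the arithmetic: the two numbers come from the headline, while the function name `cost_per_item` is a hypothetical helper and nothing here reflects per-token pricing.

```python
def cost_per_item(total_cost_usd: float, item_count: int) -> float:
    """Average cost of one item given a total bill (hypothetical helper)."""
    if item_count <= 0:
        raise ValueError("item_count must be positive")
    return total_cost_usd / item_count


# Figures taken from the headline: 76,000 photos for $52.
per_photo = cost_per_item(52.0, 76_000)
photos_per_dollar = 76_000 / 52.0

print(f"${per_photo:.5f} per photo")            # $0.00068 per photo
print(f"{photos_per_dollar:.0f} photos per $1")  # 1462 photos per $1
```

In other words, the headline works out to well under a tenth of a cent per photo.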

17/03/2026
Hugging Face Blog

State of Open Source on Hugging Face: Spring 2026

17/03/2026
GitHub Blog

Investing in the people shaping open source and securing the future together

See how GitHub is investing in open source security: funding maintainers, partnering with Alpha-Omega, and expanding access to help reduce burden and strengthen software supply chains.

17/03/2026
Google AI Blog

Our latest investment in open source security for the AI era

17/03/2026
Google AI Blog

Bringing the power of Personal Intelligence to more people

17/03/2026
Hugging Face Blog

Holotron-12B - High Throughput Computer Use Agent

17/03/2026
Simon Willison

Subagents

Agentic Engineering Patterns: LLMs are restricted by their context limit - how many tokens they can fit in their working memory at any given time. These limits have not increased much over the past two years even as the LLMs themselves have seen dramatic improvements in their abilities - they generally top out at around 1,000,000 tokens, and benchmarks frequently report better quality results below 200,000. Carefully managing the context such that it fits within those limits is critical to getting gre
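One way to picture the context management described in this excerpt is to group work so that each unit stays under a token budget before handing it to a separate subagent. The sketch below is an illustration only, not taken from the article: the 200,000 figure is the quality threshold mentioned above, while `split_into_batches`, the greedy grouping strategy, and the sample token counts are all assumptions.

```python
from typing import List

# Assumption: use the ~200,000-token quality threshold from the excerpt
# as the per-subagent budget.
QUALITY_BUDGET = 200_000


def split_into_batches(token_counts: List[int],
                       budget: int = QUALITY_BUDGET) -> List[List[int]]:
    """Greedily group per-document token counts so each batch fits the budget.

    Hypothetical helper: each returned batch could be handed to one subagent.
    """
    batches: List[List[int]] = []
    current: List[int] = []
    used = 0
    for count in token_counts:
        if count > budget:
            raise ValueError(f"document of {count} tokens exceeds budget {budget}")
        if used + count > budget:
            # Current batch is full; start a new one for this document.
            batches.append(current)
            current = []
            used = 0
        current.append(count)
        used += count
    if current:
        batches.append(current)
    return batches


# Made-up token counts for four documents.
batches = split_into_batches([120_000, 90_000, 60_000, 150_000])
print(batches)  # [[120000], [90000, 60000], [150000]]
```

Greedy grouping keeps the sketch short; it preserves document order but does not try to minimise the number of batches.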

17/03/2026
OpenAI Blog

Introducing GPT-5.4 mini and nano

GPT-5.4 mini and nano are smaller, faster versions of GPT-5.4 optimized for coding, tool use, multimodal reasoning, and high-volume API and sub-agent workloads.

17/03/2026
OpenAI Blog

Equipping workers with insights about compensation

New research shows Americans send nearly 3 million daily messages to ChatGPT asking about compensation and earnings, helping close the wage information gap.

17/03/2026
Simon Willison

Introducing Mistral Small 4

Introducing Mistral Small 4 Big new release from Mistral today (despite the name) - a new Apache 2 licensed 119B parameter (Mixture-of-Experts, 6B active) model which they describe like this: Mistral Small 4 is the first Mistral model to unify the capabilities of our flagship models, Magistral for reasoning, Pixtral for multimodal, and Devstral for agentic coding, into a single, versatile model. It supports reasoning_effort="none" or reasoning_effort="high" , with the latter providing "equivalen

16/03/2026
Simon Willison

Use subagents and custom agents in Codex

Use subagents and custom agents in Codex Subagents were announced in general availability today for OpenAI Codex, after several weeks of preview behind a feature flag. They're very similar to the Claude Code implementation, with default subagents for "explorer", "worker" and "default". It's unclear to me what the difference between "worker" and "default" is but based on their CSV example I think "worker" is intended for running large numbers of small tasks in parallel. Codex also lets you define

16/03/2026