Chain of News Digest

Chain of News 26/03/2026

26/03/2026
**Top Story** OpenAI's rumored "Spud" model and Anthropic's government-stirring AI benchmark breakthrough signal a new competitive phase in frontier AI development. According to exclusive reports, OpenAI is developing Spud as a next-generation model while Anthropic believes its latest system will force governments to accelerate AI regulation and policy frameworks. This comes alongside the launch of ARC-AGI-3, an extremely difficult benchmark that's already exposing fundamental limitations in current AI architectures. The timing suggests both companies are positioning for what could be a decisive year in AI capabilities, with implications for everything from national security to enterprise adoption timelines. **AI Models & Research** Google's Gemini 3.1 Flash Live has launched with enhanced audio capabilities, making AI conversations more natural and reliable through improved real-time processing. The update focuses on reducing latency and improving voice synthesis quality for applications like live translation and voice assistants. Cohere released a new open-source voice model specifically for transcription, weighing in at just 2 billion parameters and supporting 14 languages. Designed for consumer-grade GPUs, it offers developers a lightweight alternative to cloud-based transcription services. Meanwhile, Anthropic's research team published findings on their latest benchmark system that they claim will "stir government urgency," though specific technical details remain under wraps pending official announcements. **Developer Tools & Frameworks** LangGraph continues gaining traction in enterprise AI development, with Kensho (S&P Global's AI innovation arm) publishing a detailed case study on building their multi-agent Grounding framework. The framework solves fragmented financial data retrieval at enterprise scale by creating a unified agentic access layer. The LangGraph team also released updates to their middleware system, allowing developers to customize agent harnesses that connect LLMs to their environments. This "Agent Middleware" approach enables building application-specific harnesses without starting from scratch. Additionally, the datasette-llm 0.1a1 release introduces a new register_llm_purposes() plugin hook and get_purposes() function, making it easier for developers to integrate various LLM models into their Datasette applications through standardized interfaces. **Industry & Business** Conntour secured $7 million in funding from General Catalyst and Y Combinator to build an AI search engine for security video systems. The platform allows security teams to query camera feeds using natural language to find specific objects, people, or situations—potentially disrupting the $50 billion physical security market. Xero and Anthropic announced a partnership to bring small business finances into Claude, integrating accounting data directly into the AI assistant for streamlined financial management. This marks one of the first major enterprise integrations of Claude for business operations. Meanwhile, GitHub published its 2026 security roadmap for GitHub Actions, outlining plans for secure defaults, policy controls, and CI/CD observability to harden the software supply chain end-to-end. **Worth Watching** Google Translate's Live Translate feature is expanding globally, now officially available on iOS with support for more countries on both iOS and Android platforms. The feature transforms headphones into live personal translators, potentially disrupting the language learning and travel assistance markets. Security researchers reported a surge in malware advisories alongside a four-year low in CVEs, suggesting attackers are shifting tactics toward more sophisticated, targeted campaigns. The open-source community is also seeing increased CNA (CVE Numbering Authority) publishing activity, which could impact how organizations prioritize vulnerability triage and response in the coming months. Finally, Mario Zechner's critique of current agentic engineering trends—calling for developers to "slow the fuck down"—is gaining traction among senior engineers concerned about the sustainability of current development practices.

Today's Stories

Today's articles

GNews: AI Italia

Se l’intelligenza artificiale è in crisi di idenità - Il Manifesto

Se l’intelligenza artificiale è in crisi di idenità Il Manifesto

26/03/2026
GNews: AI España

Usar IA daña la reputación de artistas y empresas, advierte un estudio realizado en Estados Unidos - La Voz de Galicia

Usar IA daña la reputación de artistas y empresas, advierte un estudio realizado en Estados Unidos La Voz de Galicia

26/03/2026
GNews: AI España

¿Puede la inteligencia artificial mejorar la montaña sin cambiar su esencia? - Lugares de Aventura

¿Puede la inteligencia artificial mejorar la montaña sin cambiar su esencia? Lugares de Aventura

26/03/2026
AI Explained

Two AI Models Set to “stir government urgency”, But Will This Challenge Undo Them?

First look at exclusive reports about OpenAI's new Spud model, and the model Anthropic think will stir governments to urgency, all in the context of the newly-launched ARC-AGI-3. What does the extreme difficulty of that benchmarks, and its quirky scoring metrics, mean for AI in 2026? https://assemblyai.com/aiexplained Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcouncil.ai AI Insiders ($9!): https://www.patreon.com/AIExplained Chapters: 00:00 - Int

26/03/2026
LangChain Blog

How Kensho built a multi-agent framework with LangGraph to solve trusted financial data retrieval

Discover how Kensho, S&P Global’s AI innovation engine, leveraged LangGraph to create its Grounding framework–a unified agentic access layer solving fragmented financial data retrieval at enterprise scale.

26/03/2026
GNews: AI Italia

Una carriera a prova di intelligenza artificiale: i consigli degli esperti - Agenda Digitale

Una carriera a prova di intelligenza artificiale: i consigli degli esperti Agenda Digitale

26/03/2026
GNews: Claude AI

Xero and Anthropic partner to bring small business finances into Claude - The Next Web

Xero and Anthropic partner to bring small business finances into Claude The Next Web

26/03/2026
Google AI Blog

Watch James Manyika talk AI and creativity with LL COOL J.

In the latest episode of our Dialogues on Technology and Society series, LL COOL J sits down with James Manyika.

26/03/2026
GitHub Blog

What’s coming to our GitHub Actions 2026 security roadmap

A look at GitHub Actions’ 2026 roadmap, outlining how secure defaults, policy controls, and CI/CD observability harden the software supply chain end to end. The post What’s coming to our GitHub Actions 2026 security roadmap appeared first on The GitHub Blog .

26/03/2026
Simon Willison

Quantization from the ground up

Quantization from the ground up Sam Rose continues his streak of publishing spectacularly informative interactive essays, this time explaining how quantization of Large Language Models works. Also included is the best visual explanation I've ever seen of how floating point numbers are represented using binary digits. I hadn't heard about outlier values in quantization - rare float values that exist outside of the normal tiny-value distribution - but apparently they're very important: Why do thes

26/03/2026
GitHub Blog

A year of open source vulnerability trends: CVEs, advisories, and malware

Reviewed advisories hit a four-year low, malware advisories surged, and CNA publishing grew—here’s what changed and what it means for your triage and response. The post A year of open source vulnerability trends: CVEs, advisories, and malware appeared first on The GitHub Blog .

26/03/2026
Google AI Blog

Transform your headphones into a live personal translator on iOS.

Google Translate’s Live translate with headphones is officially arriving on iOS! And we're expanding the capability for both iOS and Android users to even more countries…

26/03/2026
Google AI Blog

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

The Gemini emblem sits next to text reading 'Gemini 3.1 Flash Live'. The background has blue, multicolored dots making up a microphone icon

26/03/2026
LangChain Blog

How we build evals for Deep Agents

💡 TLDR: The best agent evals directly measure an agent behavior we care about. Here's how we source data, create metrics, and run well-scoped, targeted experiments over time to make agents more accurate and reliable. Evals shape agent behavior We’ve been curating evaluations to measure and

26/03/2026
Google AI Blog

Search Live is expanding globally

A graphic with the words Search Live shown underneath a waveform icon. To the right, a phone shows the Google app with Search Live open. The camera is pointing at trees in a forest.

26/03/2026
LangChain Blog

How Middleware Lets You Customize Your Agent Harness

Agent harnesses are what help build an agent, they connect an LLM to its environment and let it do things. When you’re building an agent, it’s likely you’ll want build an application specific agent harness. “Agent Middleware” empowers you to build on

26/03/2026
TechCrunch AI

Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems

Conntour uses AI models to let security teams query camera feeds using natural language to find any object, person, or situation.

26/03/2026
TechCrunch AI

Cohere launches an open-source voice model specifically for transcription

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It currently supports 14 languages.

26/03/2026
Simon Willison

Thoughts on slowing the fuck down

Thoughts on slowing the fuck down Mario Zechner created the Pi agent framework used by OpenClaw, giving considerable credibility to his opinions on current trends in agentic engineering. He's not impressed: We have basically given up all discipline and agency for a sort of addiction, where your highest goal is to produce the largest amount of code in the shortest amount of time. Consequences be damned. Agents and humans both make mistakes, but agent mistakes accumulate much faster: A human is a

25/03/2026
Simon Willison

datasette-llm 0.1a1

Release: datasette-llm 0.1a1 New release of the base plugin that makes models from LLM available for use by other Datasette plugins such as datasette-enrichments-llm . New register_llm_purposes() plugin hook and get_purposes() function for retrieving registered purpose strings. #1 One of the responsibilities of this plugin is to configure which models are used for which purposes, so you can say in one place "data enrichment uses GPT-5.4-nano but SQL query assistance happens using Sonnet 4.6", fo

25/03/2026