Chain of News Digest

Chain of News 26/05/2026

26/05/2026
**Top Story** The Vatican has released an encyclical on artificial intelligence, titled "Magnifica Humanitas," which focuses on safeguarding the human person in the time of artificial intelligence. This document is significant because it marks one of the first times a major religious institution has weighed in on the ethics of AI. The encyclical emphasizes the need to "disarm" artificial intelligence, suggesting that its development and deployment must be carefully considered to avoid harming humanity. This development matters to AI developers because it highlights the growing concern about the impact of AI on society and the need for responsible development and use of AI technologies. As AI becomes increasingly pervasive, developers must consider the ethical implications of their work and ensure that their creations align with human values. The encyclical's release also underscores the importance of ongoing dialogue between technologists, policymakers, and societal leaders to ensure that AI is developed and used in ways that benefit humanity. **AI Models & Research** The paper "Context: Proactive Goal-Directed Intelligence via Composable Sandboxed Programs, Declarative Wiring, and Structured Interaction" presents a new architecture for building proactive goal-directed agents that can advance shared tasks without waiting for user prompts. This research is significant because it has the potential to replace reactive query-response chatbots with more intelligent and interactive systems. Developers should care about this work because it could enable the creation of more sophisticated and engaging AI-powered interfaces. Another notable paper is "When Correct Beliefs Collapse: Epistemic Resilience of LLMs under Clinical Pressure," which investigates the limitations of large language models in clinical dialogue. This study is important because it highlights the need for more robust and resilient AI systems that can maintain their performance under pressure. The "Confidence Calibration in Large Language Models" paper is also worth mentioning, as it explores the calibration of LLMs' confidence across diverse tasks and finds that current models are often overconfident. This research has implications for developers who need to understand the limitations of LLMs and develop strategies to mitigate their overconfidence. **Developer Tools & Frameworks** The BODHI project has introduced a new approach to precise OS kernel specification inference, which has the potential to improve the formal verification of operating system kernels. Developers can now use BODHI to generate precise specifications for system calls, which can help ensure the correctness and security of their code. Another notable development is the release of new large language models that can be used for a variety of tasks, including natural language processing and code generation. These models can be integrated into existing development workflows to improve the efficiency and accuracy of coding tasks. Additionally, the "In Search of the Ingredients of Open-Endedness" paper explores the use of large vision-language models for automating scientific, technological, and creative production, which could lead to new tools and frameworks for developers working in these areas. **Industry & Business** The Spanish government has approved a new law on artificial intelligence, which includes provisions for multimillion-euro fines for companies that misuse AI technologies. This development is significant because it highlights the growing regulatory scrutiny of AI and the need for companies to ensure that their AI systems are transparent, accountable, and fair. The law also emphasizes the importance of responsible AI development and deployment, which is a key concern for developers and companies working in this space. The Vatican's encyclical on AI also has implications for the industry, as it emphasizes the need for a more nuanced and thoughtful approach to AI development and deployment. As the AI landscape continues to evolve, companies and developers must be aware of these emerging regulatory and societal trends and adapt their strategies accordingly. **Worth Watching** The "How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning" paper is an interesting exploration of the limitations of large language models and the need for more efficient and effective reasoning mechanisms. This research has implications for developers who need to optimize the performance of their AI systems and reduce latency, GPU time, and energy consumption. The "Notes on Pope Leo XIV's encyclical on AI" article provides a thoughtful analysis of the Vatican's encyclical and its significance for the AI community. This piece is worth reading because it offers a unique perspective on the ethical and societal implications of AI and the need for a more nuanced and multidisciplinary approach to AI development and deployment. Overall, these developments highlight the need for ongoing dialogue and collaboration between technologists, policymakers, and societal leaders to ensure that AI is developed and used in ways that benefit humanity.

Today's Stories

Today's articles

GNews: AI España

Así es la nueva Ley de Inteligencia Artificial en España: multas millonarias y límites al "uso perverso" de la tecnología - El Independiente

Así es la nueva Ley de Inteligencia Artificial en España: multas millonarias y límites al "uso perverso" de la tecnología El Independiente

26/05/2026
GNews: AI España

El Gobierno aprueba este martes la Ley de inteligencia artificial, con sanciones de hasta 35 millones - El Periódico

El Gobierno aprueba este martes la Ley de inteligencia artificial, con sanciones de hasta 35 millones El Periódico

26/05/2026
ArXiv cs.AI

When Correct Beliefs Collapse: Epistemic Resilience of LLMs under Clinical Pressure

Despite strong medical benchmark accuracy, LLMs can exhibit severe multi-turn sycophancy in clinical dialogue, abandoning initial correct diagnosis under escalating pressure. We propose \textbf{\textsc{Med-Stress}}, a targeted stress test framework that evaluates belief stability under escalating pressure.

26/05/2026
ArXiv cs.AI

In Search of the Ingredients of Open-Endedness: Replicating Picbreeder with Large Vision-Language Models

We are in the midst of large-scale industrial and academic efforts to automate the processes of scientific, technological and creative production through AI-driven assistants. Historically, a fundamental property of these processes in their human form has been their open-endedness: their capacity for generating a seemingly endless supply of novel and meaningful new forms. Do artificial agents have any capacity for such fruitful unguided discovery?

26/05/2026
ArXiv cs.AI

Confidence Calibration in Large Language Models

We investigate the calibration of large language models' (LLMs') confidence across diverse tasks. The results of our preregistered study show that the current crop of LLMs are, like people, too sure they are right: confidence exceeds accuracy, on average. Importantly, however, this tendency is moderated by a powerful hard-easy effect, wherein overconfidence is greatest on difficult tests; by contrast, easy tests actually show substantial underconfidence.

26/05/2026
ArXiv cs.AI

How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning

Reasoning-capable large language models solve hard problems by emitting long chains of thought, paying heavily in latency, GPU time, and energy. Casual inspection of their traces reveals extensive reformulation, verification, and circular self-reflection, yet how much of this deliberation is actually necessary has never been measured at scale or explained from first principles. This paper closes both gaps.

26/05/2026
ArXiv cs.AI

BODHI: Precise OS Kernel Specification Inference

The formal verification of operating system kernels requires precise specifications that capture the intended behavior of system calls. Writing these specifications manually demands deep domain expertise, motivating the use of large language models (LLMs) to automate the process. However, in OSV-Bench, a benchmark of 245 specification generation tasks derived from the Hyperkernel OS kernel, the best reported Pass@1 is 55.10%.

26/05/2026
ArXiv cs.AI

Context: Proactive Goal-Directed Intelligence via Composable Sandboxed Programs, Declarative Wiring, and Structured Interaction

We present Context, the intelligence layer of the Magarshak Architecture, which replaces reactive query-response chatbots with proactive goal-directed agents that advance shared tasks without waiting for user prompts. The architecture rests on three mutually reinforcing mechanisms.

26/05/2026
Simon Willison

Notes on Pope Leo XIV's encyclical on AI

Dropped this morning by the Vatican: Magnifica Humanitas of His Holiness Pope Leo XIV on Safeguarding the Human Person in the Time of Artificial Intelligence . This is a very interesting document. It's some of the clearest writing I've seen on the ethics of integrating AI into modern society. Pope Leo XIV chose the name Leo in honor of Pope Leo XIII, who is known for his 1891 Rerum novarum encyclical on "Rights and Duties of Capital and Labor".

25/05/2026
GNews: AI Italia

«Magnifica Humanitas»: il testo integrale dell'enciclica di papa Leone XIV «sulla custodia della persona umana nel tempo dell'Intelligenza artificiale» - Corriere Roma

«Magnifica Humanitas»: il testo integrale dell'enciclica di papa Leone XIV «sulla custodia della persona umana nel tempo dell'Intelligenza artificiale» Corriere Roma “Disarmare l’intelligenza artificiale”, cosa dice la prima enciclica Magnifica humanitas di Leone XIV sul potere degli algoritmi Wired Papa Leone XIV presenta la sua prima enciclica: “L’intelligenza artificiale deve essere disarmata” Il Fatto Quotidiano

25/05/2026