Latest weekly digest

May 18–24, 2026

Gemini Unveils New Agentic AI Models, Claude Bolsters Enterprise Security

Google I/O 2026 ushered in the 'agentic Gemini era,' with the introduction of new models like Gemini 3.5 Flash and Gemini Omni, alongside AI-powered tools for science, design, and world simulation. Anthropic heavily focused on enterprise AI security, enhancing Claude's integrations and successfully identifying thousands of vulnerabilities in critical software. Concurrently, Cursor was recognized as a leader in AI coding agents and released an upgraded version of its Composer tool.

  • Google launched Gemini 3.5 Flash and Gemini Omni, emphasizing an 'agentic AI' future across its products and scientific research initiatives.
  • Anthropic introduced 28 new security and compliance integrations for Claude and initiated 'Project Glasswing,' identifying over 10,000 software vulnerabilities.
28 posts · 5 sources
Open weekly recap

Showing 112 of 599 posts

Google Labs

Catch up on 12 major I/O 2026 moments

Google I/O 2026 showcased significant AI advancements, headlined by the new Gemini Omni model, which can create content from any input, starting with video, with Gemini Omni Flash rolling out to subscribers and YouTube. Google also launched Gemini 3.5 Flash, a powerful new model for agents and coding, generally available across its platforms. Furthermore, Search is being transformed with AI-powered information agents for personalized updates and Google Antigravity-driven capabilities to generate custom interactive experiences and mini-apps. These updates aim to enhance daily productivity and intelligence across Google's ecosystem.

Read original
Claude

Introducing dynamic workflows in Claude Code

Claude Code has introduced dynamic workflows, enabling it to tackle complex, end-to-end programming tasks by dynamically orchestrating multiple parallel subagents. This new capability allows work that previously took quarters to be completed in days, addressing challenges like large-scale bug hunts, codebase migrations, and critical work requiring extensive verification. Currently in research preview, dynamic workflows are accessible via Claude Code CLI, Desktop, VS Code extension, and the Claude API across various platforms. The feature significantly boosts Claude's problem-solving capacity, as demonstrated by its use in porting the Bun runtime from Zig to Rust in eleven days.

Read original
Anthropic

Coding agents in the social sciences

A recent survey of 1,260 social scientists in early 2026 revealed that 81% have utilized AI chatbots for tasks like writing code and editing prose. However, only 20% have adopted advanced "coding agents" capable of autonomously writing and executing analysis code. Disparities in coding agent use were noted, with higher adoption among researchers with typically male names and those at top universities. While users of these agents post more working papers and grant proposals, this could reflect pre-existing differences among early adopters.

Read original
Google Labs

Dive deeper into I/O 2026 with NotebookLM.

Google has launched a comprehensive "notebook" using its NotebookLM tool to help users explore and catch up on all announcements, demos, and launches from I/O 2026. This resource aggregates YouTube videos of keynotes and product demonstrations, along with various blog posts. Users can engage with the notebook by listening to audio overviews, reading slide decks, exploring infographics, watching video overviews, or asking specific questions about new products or launches. While grounded in provided sources, Google notes that, like all AI, NotebookLM may generate inaccuracies.

Read original
Claude

How CodeRabbit used Claude to build an agent orchestration system

CodeRabbit, an AI code review platform, developed an agent orchestration system using Claude to address a common issue where AI-generated code compiles but fails to meet intended requirements due to implicit developer assumptions. This system inserts a crucial planning phase *before* code generation, coordinating multiple Claude models to analyze requirements and explicitly surface assumptions. By creating a structured execution plan that defines what needs to be built and its constraints, CodeRabbit aims to improve the quality of AI-generated code and prevent costly late-stage corrections. This approach prioritizes planning quality to ensure AI agents move in the correct direction from the outset.

Read original
Claude

Using LLMs to secure source code

This blog post details how large language models, specifically Claude Opus, can be effectively utilized to secure source code by building threat models, discovering vulnerabilities, and then verifying, triaging, and patching them. The authors emphasize that LLMs significantly accelerate vulnerability discovery, shifting the primary bottleneck to the subsequent crucial stages of verification, triage, and patching. It outlines a comprehensive six-step process starting with threat modeling and sandboxing, followed by a continuous loop of discovery, verification, triage, and patching. This approach aims to help security teams efficiently identify and remediate software flaws in their codebases.

Read original
Claude

Zero Trust for AI agents

This post introduces a Zero Trust security framework for deploying autonomous AI agents in the enterprise, addressing a new threat landscape where AI accelerates vulnerabilities and exploits. Traditional security controls are insufficient for agent autonomy, which can misuse legitimate permissions or launch persistent attacks. The proposed framework outlines a three-tier Zero Trust architecture and an eight-phase implementation workflow, covering identity, access scoping, sandboxing, input/output controls, and memory safeguards. It aims to protect against unique agentic threats like prompt injection and memory poisoning, emphasizing the need for defensive operations fast enough to contend with AI-accelerated attackers.

Read original
Claude

Code w/ Claude London 2026: Rethinking how we build

The Code w/ Claude London 2026 event brought together developers and builders to explore simplifying programming with AI, emphasizing a return to the "magic" of coding. Key announcements focused on new features for Claude Managed Agents, including self-hosted sandboxes, now in public beta, which allow tool execution within user-configured environments. Additionally, MCP tunnels, in research preview, enable agents to securely access private network servers without public exposure. These updates ensure agents can operate within established enterprise security boundaries, enhancing control and data privacy.

Read original
Cursor

Faire doubles PR throughput with Cursor Cloud Agents

Faire, an e-commerce company, significantly boosted its development efficiency by adopting Cursor Cloud Agents and Automations. This led to a doubling of weekly PR throughput and collapsed an 18-month migration into a task managed by a single engineer and a fleet of agents. Cursor's cloud platform enabled scaled parallelization and provided autonomous agents with dedicated development environments, facilitating over 2,000 automated runs weekly for tasks like bug fixes and CI investigations. This move replaced Faire's in-house solution, saving substantial infrastructure investment and engineer time.

Read original
Cursor

Cursor named a Leader in the 2026 Gartner® Magic Quadrant™ for Enterprise AI Coding Agents

Cursor has been named a Leader in the 2026 Gartner Magic Quadrant for Enterprise AI Coding Agents, earning the furthest placement on Completeness of Vision. Over 70% of the Fortune 500 currently leverage Cursor to deploy and manage coding agents across their software development lifecycle. The company plans to further enhance its platform by focusing on frontier intelligence, including a partnership with SpaceXAI for model training, expanding agent automation across the SDLC, and deepening enterprise controls, collaboration, and flexibility.

Read original
Claude

How Anthropic's finance team uses Claude to shape the narrative behind the numbers

Anthropic's corporate finance team, led by Alice Fong, utilizes Claude to ensure a consistent and accurate financial narrative for the CFO and board. By integrating Claude Cowork and Claude for Excel into their workflows, the team automates the validation of numbers and claims against a single source of truth, even as figures are constantly updated. This dramatically reduces the time spent on manual reconciliation, freeing up 10 to 20 hours per week for higher-impact tasks like strategic collaboration and creative analysis. Claude also assists in drafting initial reports and flagging inconsistencies, significantly enhancing efficiency and the reliability of financial communications.

Read original
Anthropic

Project Glasswing: An initial update

Project Glasswing, a collaborative effort to secure critical software before AI models can exploit it, has made significant progress in its first month. Using Claude Mythos Preview, the project and its 50 partners have identified over ten thousand high- or critical-severity vulnerabilities across systemically important software. This AI model has drastically accelerated bug-finding rates, with partners like Cloudflare and Mozilla reporting tenfold increases and superior performance to human testers. The primary challenge has now shifted from discovering vulnerabilities to efficiently verifying, disclosing, and patching the large volume found by AI.

Read original