Koda Intelligence | 31 March 2026

The Signal

Your daily intelligence briefing

AI

Google launched Gemini 3 Deep Think, its most advanced reasoning model to date, on the same day NVIDIA's Nemotron 3 Super posted a 0.8 score on the GPQA benchmark.

World

U.S. bunker busters struck Iran's Isfahan uranium enrichment site as the conflict enters its fifth week with no ceasefire in sight.

Markets

Sentiment sits at Extreme Fear as gold approaches $4,554 per ounce yet heads for its worst monthly loss since 2008, whipsawed by war hedging and forced liquidations.

Wild Card

Amazon and OpenAI quietly unveiled a stateful AI runtime, a foundational shift that could let AI agents maintain persistent memory across sessions, arriving with almost zero public attention amid the geopolitical chaos.

Markets

S&P 500: 6,343.72 (-0.39%)
NASDAQ: 20,794.64 (-0.73%)
BTC: $66,591.41 (-0.15%)
ETH: $2,036.55 (+0.61%)
Oil: $107.52 (+0.16%)
Sentiment: 11 (Extreme Fear)

Deep Dive

Tools 6 min read

The Amazon-OpenAI Stateful Runtime partnership signals that memory and tool-use

What just shipped in AI

Model Release

Google Launches Gemini 3 Deep Think Model

Google released Gemini 3 Deep Think on March 26, a frontier model designed for scientific and engineering tasks such as spotting logical flaws in math papers and optimizing fabrication methods. The model is live in the Gemini app for Ultra subscribers, with early API access now available. It represents Google's push into specialized reasoning models alongside its broader Gemini family.

Policy

OpenAI Expands Bug Bounties to AI Safety

OpenAI announced a Safety Bug Bounty Program on March 26, extending its existing bounty framework to cover AI misuse and safety risks. The company also detailed its Model Spec framework, which outlines principles for model behavior, instruction-following, and conflict resolution. The move signals a more structured approach to external safety auditing as frontier models grow more capable.

Enterprise

Amazon and OpenAI Build Stateful AI Runtime

Amazon partnered with OpenAI on March 26 to co-create a Stateful Runtime Environment on Amazon Bedrock, focused on memory and tool-use infrastructure for AI applications. The environment is designed to support persistent agent state across sessions, a key requirement for complex agentic workflows. Availability is planned for the coming months.

Model Release

Google Ships Lyria 3 Music Generation Models

Alongside Gemini 3 Deep Think, Google released Lyria 3 and Lyria 3 Pro on March 26, its latest music generation models. The dual release reflects Google's strategy of shipping specialized models across modalities rather than concentrating on a single general-purpose system. Details on pricing and availability beyond the Gemini ecosystem have not yet been disclosed.

Benchmark

NVIDIA Nemotron 3 Super Hits 0.8 GPQA

NVIDIA's Nemotron 3 Super, a 120-billion parameter model, posted a GPQA score of 0.8 following its release around March 11. The score places it among the strongest performers on the graduate-level reasoning benchmark this quarter. The model adds to NVIDIA's growing presence in the foundation model space beyond its dominant hardware business.

Model Release

Mistral Small 4 Posts 0.7 GPQA Score

Mistral released Small 4 around March 17, achieving a GPQA score of 0.7 on the graduate-level reasoning benchmark. The model continues Mistral's strategy of shipping competitive smaller models that can run efficiently on constrained infrastructure. It arrived during a record-setting Q1 that saw 255 model releases across the industry.

Policy

OpenAI Model Spec Details Behavior Principles

OpenAI's newly published Model Spec framework lays out explicit principles for how its models should handle instruction-following, behavioral boundaries, and conflict resolution between user requests and safety constraints. Released alongside the Safety Bug Bounty Program on March 26, the document represents one of the most detailed public accounts of how a frontier lab governs model behavior. The framework could set a precedent for industry-wide transparency on alignment decisions.

Trend

Late March AI Release Pipeline Goes Quiet

No major new model releases were announced between March 27 and 31, following a burst of activity earlier in the month that included GPT-5.4, DeepSeek V4, and multiple Qwen3.5 variants. The pause comes after Q1 2026 hit a record 255 model releases. The lull may reflect labs shifting resources toward agentic frameworks and infrastructure rather than raw model launches.

Geopolitics, economics, power shifts

Conflict

Isfahan Uranium Site Hit by US Bunker Busters

The U.S. struck an ammunition depot in Isfahan, Iran, believed to house enriched uranium, using bunker buster munitions. President Trump posted footage on Truth Social showing 2,000-pound bombs detonating at the site. An American official confirmed the strikes to The Wall Street Journal, though no casualty figures or Iranian government response have been released.

Conflict

Iranian Drone Strikes Oil Tanker Near Hormuz

An Iranian drone hit a massive oil tanker near the Strait of Hormuz on March 31, coinciding with the U.S. strikes on Isfahan. The attack on the tanker underscores Iran's ability to threaten the chokepoint through which roughly 20% of global oil transits daily. No details on crew casualties or the tanker's flag state were immediately available.

Economy

Gold Nears $4,554 but Posts Historic Monthly Loss

Gold traded up 1% to approximately $4,553.69 per ounce early Tuesday, yet the metal is on track for its worst monthly decline since 2008. Front-month futures rose 0.6% by 3:30 a.m. ET. The paradox of rising daily prices alongside a steep monthly drawdown reflects whiplash-level volatility driven by the U.S.-Iran war and shifting risk sentiment.

Policy

Trump Posts Unverified Strike Footage on Truth Social

President Trump shared video on Truth Social showing explosions from the Isfahan bunker buster strikes before any official Pentagon briefing. The post included no casualty data, damage assessment, or operational context. The move continues a pattern of the president using personal social media to disclose sensitive military operations in real time.

Conflict

US-Iran War at Five Weeks With No Ceasefire

The conflict between the United States and Iran has now entered its fifth week with no diplomatic off-ramp in sight. Simultaneous escalation on both sides (U.S. strikes on nuclear-linked sites and Iranian drone attacks on commercial shipping) signals a widening war rather than a stabilizing one. Global energy markets, shipping insurance rates, and commodity prices remain in turmoil.

Infrastructure

Hormuz Shipping Risk Intensifies After Tanker Attack

The drone strike on an oil tanker near the Strait of Hormuz raises the threat level for commercial vessels transiting the waterway. Shipping insurers have already hiked war-risk premiums in recent weeks, and a direct hit on a large tanker could accelerate rerouting around the Cape of Good Hope. Energy analysts warn that sustained disruption at Hormuz would push oil well above current levels and strain global supply chains further.

Curated from newsletters we trust

We read and recommend these newsletters. Here is what they are covering today. Subscribe to them directly for the full experience.

Headlines

Anthropic's 'Claude Mythos' leaked as a new model tier larger and more intelligent than Opus, with dramatically higher scores on coding, academic reasoning, and cybersecurity benchmarks. The model is compute-intensive and expensive; Anthropic is working on efficiency before a general release.
Meta's Avocado 9B model pushed back to at least May as it still falls short of leading competitors. Meta is running parallel experiments with multiple Avocado variants and has reportedly discussed temporarily licensing Google's Gemini technology, with some Meta AI requests already routed through Gemini.
Anthropic's Claude paid subscriptions have more than doubled this year, with the majority of new subscribers joining the lowest tier. OpenAI remains the largest consumer AI platform but Claude's growth rate is accelerating.

Quick Hits

The last original xAI cofounder has exited the company.
Black Duck Signal launched as an agentic AppSec tool combining LLM-powered code analysis with 20+ years of human-vetted security intelligence to find and fix vulnerabilities in AI-generated code.
The newsletter's subject line highlighted lessons from OpenAI, but the details were truncated in the content.

Trending Tools

Black Duck Signal: agentic application security tool for AI-native development that analyzes and fixes vulnerabilities in AI-generated code using natural language prompts in IDEs
AutoBe: open-source AI agent that generates complete backends from a single natural language conversation, with a function calling harness that dramatically improves model reliability

Tested tools and daily discoveries

Zapier Central: Chain AI Agents Across 8,000+ Apps

Zapier's AI orchestration layer now connects over 8,000 apps with natural language instructions, letting you build multi-step automations without code. Start by describing a workflow in plain English, such as 'when a lead fills out a Typeform, enrich it with Clearbit, score it, and route it to the right Slack channel.' The platform handles the agent logic, error handling, and retries automatically.

Manus: Deploy End-to-End Research Agents Without Infrastructure

Manus is an AI agent platform designed for autonomous task completion, from deep web research to structured report generation. Assign it a complex brief like competitive analysis or regulatory review, and it will plan sub-tasks, gather sources, and compile a deliverable. Best used for research-heavy workflows where you need cited, structured output rather than raw chat responses.

Cursor: Accelerate Development With Inline AI Pair Programming

Cursor integrates frontier-level code generation directly into your editor, offering tab-complete suggestions that understand your full codebase context. Use its chat panel to refactor entire files or generate tests by referencing specific functions. Developers report 30-50% faster iteration cycles when combining Cursor's autocomplete with its codebase-aware Q&A mode.

Comet Browser: Run Agentic Tasks From Your Browser for Free

Comet Browser offers a free agentic browsing mode that can autonomously navigate websites, extract data, and complete multi-step web tasks on your behalf. Point it at a comparison shopping task, a form-filling sequence, or a data scraping job and let it execute. It is particularly useful for repetitive browser-based workflows that do not justify building a full automation pipeline.

Claude Artifacts: Generate and Iterate on Documents and Code in Real Time

Claude's Artifacts feature renders live previews of code, documents, and visualizations directly in the chat window, turning conversations into working prototypes. Use it to draft an interactive SVG diagram, a React component, or a formatted report, then iterate with follow-up prompts. The key workflow tip: ask Claude to produce an Artifact first, then refine it in-context rather than starting from scratch each time.

Notion AI as a Second Brain: Consolidate Knowledge Before Automating

Before layering agentic tools on top of your workflow, centralize your notes, docs, and project data in Notion AI so its built-in assistant can surface relevant context automatically. The highest-leverage move is connecting meeting notes, project specs, and reference docs into linked databases, then querying Notion AI to synthesize status updates or draft briefs. Automation works best when it draws from a single organized source of truth.
