May 23, 2026

Agentic Brew Daily

Your daily shot of what's brewing in AI

Fresh Batch

Distilled trend

Sacks, Musk, and Zuckerberg killed Trump's AI executive order, and Newsom signed California's worker-displacement order the same day Meta cut 8,000 jobs.
Anthropic now runs Claude across AWS Trainium, Google TPUs, Nvidia GPUs, and SpaceX Colossus at $1.25B per month, diversifying onto four silicon stacks at once.
Code-as-agent-harness became a product category in one week, as OpenAI shipped Goal Mode GA, Cursor opened Composer 2.5 as an SDK, and Anthropic made memory an API primitive.

Bold Shots

Today's biggest AI stories, no chaser

Trump postpones AI executive order on federal pre-release model review

A draft executive order asking frontier labs to voluntarily submit advanced models to federal national-security agencies for 14-90 days of pre-release review was on Trump's desk Thursday morning. By the time the news cycle ended, David Sacks, Elon Musk, and Mark Zuckerberg had each phoned the White House to argue the framework would become a de facto licensing regime, and Trump tabled the signing. The reversal leaves OpenAI, Anthropic, Google, and Microsoft with no federal floor for frontier-model safety review, even as Treasury, NSA, and CISA had concluded these models can find production-system vulnerabilities. The political split is not Republican vs Democrat: 79% of GOP voters want pre-release testing, and Steve Bannon plus 60+ MAGA signatories signed a Humans First letter demanding mandatory testing.

Why it matters: A morning of phone calls killed a year of interagency work and ceded the safety-review agenda to state legislatures and to Beijing, which is advancing comprehensive AI legislation in parallel. The donor faction of MAGA beat the base faction on the call that mattered.

Some crazy developments in AI in the last 24 hrs: Trump postpones/scraps AI executive order after pressure from Musk and Zuckerberg; OpenAI ...

@ammohitchaprana

Trump Kills AI Executive Order at the Last Minute: 'I Didn't Like It'

New York Post·115.4K views

"He just hates regulation" Trump delays AI executive order that might hinder progress, says, "it's just something doomers wanted"

r/accelerate·104 upvotes

Anthropic's $45B SpaceX/xAI Colossus compute deal

Anthropic will pay SpaceX roughly $45B over three years, or $1.25B per month through May 2029, for 300+ MW of compute at xAI's Colossus data centers in Memphis. The deal covers 222,000+ Nvidia GPUs across H100, H200, and GB200, stacks on top of Anthropic's $100B+ AWS Trainium commitment and its multi-gigawatt Google TPU deal, and was disclosed inside SpaceX's S-1 IPO filing targeting a $1.75T valuation. Either party can terminate on 90 days notice, and Musk personally retained a discretionary clause to reclaim compute if Anthropic's AI is judged to harm humanity.

Why it matters: This is the public price tag for frontier-scale infrastructure: $15B per year, single customer, single supplier, paid to a direct competitor. It validates the SpaceX/xAI IPO thesis and makes Anthropic's reliance on a Musk-controlled supplier with a unilateral kill switch a board-level question.

Checking the math behind OpenAI and Anthropic's latest headlines

Garymarcus Substack

SpaceXAI will provide @AnthropicAI with access to Colossus 1, one of the world's largest and fastest-deployed AI supercomputers

@xai·26.5K engagements

Anthropic Partners With SpaceX AI, Leopold's $5.5B Bet, and the Singularity Economy | EP #255

Peter H. Diamandis·205.7K views

Anthropic is paying SpaceX $15 billion per year

r/technology·2.4K upvotes

OpenAI reasoning model disproves Erdos unit-distance conjecture

On May 20, OpenAI announced that an internal general-purpose reasoning model produced a disproof of the 1946 Erdos planar unit-distance conjecture, constructing an infinite family of n-point configurations with n^(1+delta) unit-distance pairs that polynomially beats the long-assumed near-linear bound. Princeton's Will Sawin sharpened delta to at least 0.014 the same day. Fields Medalist Tim Gowers and Oxford's Thomas Bloom co-authored the 19-page companion paper, with Gowers saying that if a human had submitted this to the Annals of Mathematics he would have no doubt it was a milestone.

Why it matters: This is the first time a general-purpose AI has autonomously produced a frontier mathematical result that would have cleared peer review on its own. The methodology bridges plane geometry to algebraic number theory via class field towers, which is a new constructive branch, not a heuristic. Daniel Litt called it the unique interesting result produced autonomously by AI so far.

OpenAI solved an 80-year math problem by... disproving it

The Neuron AI

AI solving math problems is a good thing... (Quoting OpenAI: "Today, we share a breakthrough on the planar unit distance problem...")

@mister_shroom_2·674 engagements

The Erdős Breakthrough

OpenAI·58.8K views

AI just disproved the biggest math conjecture so far

Dr. Trefor Bazett·18.4K views

OpenAI general purpose model had a breakthrough on famous 80 year old Erdos problem.

r/singularity·623 upvotes

OpenAI claims a general-purpose reasoning model found a counterexample to Erdos's unit-distance bound

r/MachineLearning·97 upvotes

Google I/O 2026: Gemini 3.5, Antigravity 2.0, AI for Science

Pichai opened I/O declaring the agentic era and reframing Search as an agent manager, the most consequential structural shift in Google's core product in two decades. Gemini 3.5 Flash is now the default model in the Gemini app and AI Mode in Search worldwide, with AI Mode exceeding 1 billion monthly users. Demis Hassabis unveiled Gemini for Science: AI Co-Scientist plus AlphaEvolve plus Science Skills connecting agentic platforms to 30+ life-science databases, with BASF, Klarna, Daiichi Sankyo, Bayer Crop Science, Stanford Medicine, and U.S. National Labs as partners. Antigravity 2.0 shipped as a desktop app, CLI, and SDK orchestrating parallel autonomous coding agents at 12x the public API speed.

Why it matters: The full-stack play (8th-gen TPUs, Gemini 3.5, Search/Android/Workspace distribution, ad inventory) lets Google monetize the agentic transition in ways pure-play labs can't. The unstated losers are Booking, Expedia, DoorDash, Zillow, and Instacart, marketplaces that get bypassed when agentic Search completes the transaction in the SERP.

The Pulse: Antigravity 2.0 takes 'IDE' out of its new IDE

Pragmatic Engineer

"Google AI Pro" users get almost no YouTube ads — "Premium Lite" granted for free.

@itmedia_news·28.3K engagements

The Commerce Department announced letters of intent with nine quantum companies on May 21, totaling $2.013B in CHIPS-and-Science-Act funding, with the federal government taking a minority equity stake in each recipient. IBM gets $1B (plus a $1B match) to launch Anderon, America's first pure-play 300mm quantum wafer foundry in Albany. GlobalFoundries gets $375M. Atom Computing, D-Wave, Infleqtion, PsiQuantum, Quantinuum, and Rigetti each receive up to $100M, with Diraq getting $38M. D-Wave jumped 33%, Rigetti 31%, and IBM 12% the same day.

Why it matters: The government just took equity in nine quantum companies and underwrote two new foundries, modeled on the Intel CHIPS deal. Jefferies reads it as a direct response to China's state-backed quantum push. The multi-modality bet hedges across superconducting, trapped-ion, neutral-atom, silicon-spin, photonic, and topological hardware, which means no winner has been picked yet.

U.S. TO AWARD $2B TO 9 QUANTUM COMPANIES AND TAKE EQUITY STAKES. $IBM: $1B, $GFS: $375M. Other recipients ($100M each): D-Wave...

@wallstengine·945 engagements

The U.S. Just Went All In on Quantum Computing

The Quantum Bull·5.4K views

US to award Quantum Computing Firms 2 Billion and take Equity Stakes

r/IonQ·114 upvotes

Slow Drip

Blog reads worth savoring

Analysis · Semianalysis SubstackEDA Market Primer - Market Dynamics, Cadence, Synopsys, Siemens, China EDA Rise

Maps the $18B EDA market with hard numbers: token licensing yields ~20% revenue uplift on flat headcount, foundry-mandated tool flows lock in 95%+ retention, and China's share climbs as Synopsys' China revenue slips from 16% to 12%.

Analysis · Aiweekender SubstackHow Coding Agents Actually Work Under the Hood (and Why They Go Wrong)

Names ten concrete failure modes in Cursor and Claude Code (doom loops from stale observations, plan-mode that still writes, tool outputs eating 70-80% of context) with a four-layer architecture model to debug them.

Research · Alibaba Cloud BlogSGLang Hierarchical Sparse Attention

Stores the full KV cache on CPU and keeps only a Top-k LRU buffer on GPU, cutting per-request GPU memory from 8GB to 200MB at 128K context and 5x-ing batch throughput.

News · Latent SpaceNew AI Infra unicorns: Exa, Modal, TurboPuffer

Three fresh infra unicorns in one day: Exa ($250M at $2.2B), Modal ($355M at $4.7B), and TurboPuffer hitting $100M ARR profitably 19 months after first $1M while raising under $1M.

The Grind

Research papers, decoded

Reasoning Models8,626 upvotes · arxiv · X

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Apple researchers stress-tested frontier thinking models (Claude 3.7 Sonnet Thinking, DeepSeek-R1, o3-mini) inside controllable puzzles where they could dial complexity step by step. They found three sharp regimes: on easy problems, vanilla LLMs beat reasoning models for the same compute; on medium problems, reasoning models pull ahead; on hard problems, both collapse to near-zero accuracy. Counterintuitively, the reasoning models cut their thinking tokens as problems get harder, even with budget remaining. Don't pay the thinking tax for low-complexity tasks, and set a hard fallback policy past your domain's complexity ceiling.

Recursive Reasoning Architectures27 upvotes · alphaxiv

Probabilistic Tiny Recursive Model (PTRM)

Inference-time-only trick for Tiny Recursive Models: inject Gaussian noise at each recursion step to run K parallel trajectories, then pick the winner using the Q-head that TRM already trained but normally throws away. No retraining, no task-specific augmentation. Sudoku-Extreme jumps from 87.3% to 98.75%, Pencil Puzzle Bench from 62.6% to 91.2%, beating an ensemble of seven frontier LLMs at roughly $0.001 per attempt with a 7M-parameter model. Width scaling via parallel noisy rollouts and the internal verifier head is a free accuracy lever before retraining.

Recursive Reasoning Architectures104 upvotes · alphaxiv

Generative Recursive Reasoning Models (GRAM)

Converts deterministic Recursive Reasoning Models into probabilistic ones via amortized variational inference, modeling reasoning as a stochastic latent trajectory so the model can pursue multiple hypotheses in parallel and scale inference-time compute through depth and trajectory sampling. 97.0% on Sudoku-Extreme vs TRM's 87.4%, 99.7% on 8x8 N-Queens with 90.3% solution coverage, plus an unconditional-generation mode (99.05% valid Sudoku boards from empty inputs) that deterministic baselines can't do. The trained-from-scratch counterpart to PTRM.

The Mill

Builder tools ground for action

16K stars, +3.7K today

colbymchenry/codegraph

Pre-indexed code knowledge graph for Claude Code, Codex, Cursor, and OpenCode. Fewer tokens, fewer tool calls, 100% local. The drop-in knowledge layer in front of your coding agent.

TypeScript

24K stars, +2.6K today

anthropics/claude-plugins-official

Anthropic's official, managed directory of high-quality Claude Code plugins. Brand-new repo, instantly the canonical install source for Claude Code extensions.

Python

18K stars, +1.4K today

Lum1104/Understand-Anything

Turns any codebase into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, and Gemini CLI.

TypeScript

11K stars, +988 today

rohitg00/ai-engineering-from-scratch

Learn it. Build it. Ship it for others. A free, from-first-principles AI engineering curriculum that keeps climbing as bootcamp alternatives stay popular.

Python

448 votesProduct Hunt

Tycoon AI

Run one-person companies entirely with AI agents. Astra (an AI CEO) manages 10+ pre-built agents (CMO, CTO, etc.) and orchestrates Claude Code and Hermes under the hood. Hand it a KPI like 10x traffic this month and it plans, assigns, and reports back.

107 comments

307 votesProduct Hunt

Mintlify Workflows

Self-updating knowledge bases. Pre-built automations that keep docs, changelogs, and translations current automatically whenever the product changes.

39 comments

275 votesProduct Hunt

WeWeb 3.0

Vibe-code apps with the safety net of a no-code editor. Prompt AI to generate an app, then refine screens, workflows, and DB in a visible no-code editor. No more black box.

85 comments

197 votesProduct Hunt

Slideshot

Product demo videos, recorded by your AI agent. Drives your web app over MCP and returns a polished demo video plus GIF with zooms, cursor motion, and intro animation.

34 comments

The Counter

Voices from the AI bar today

5.1K views

Building signals that trade themselves

How Man Group put AI-generated trading signals into live production under a skills framework plus core data layer, with compliance oversight, and scaled it across 750+ developers. A real template for deploying compliant agents in regulated industries.

insight 10

2.2K views

Gemini 3.5 Flash Is Good. That's Not the Story

Benchmarks Gemini 3.5 Flash on the CARE benchmark (~75% planning, 46% intent recovery) and stress-tests Antigravity 2.0's agent-first, no-IDE workflow. Honest take on what works and what's still painful.

insight 9

55K engagements

SOMEONE RAN STATE-OF-THE-ART AI ON A 26-YEAR-OLD iMAC WITH ZERO INTERNET AND GOT NEAR-INSTANT RESPONSES.

Not a demo. Not a benchmark. The future already runs locally on dead hardware from 1999. Same topic surfaces a $1,472 1B-param model matching 7x peers and 66M-param TTS beating ElevenLabs on a Raspberry Pi.

topic engagement 57,478

48K engagements

THESE AI MODELS ARE RUMORED TO BE RELEASING IN JUNE: GPT-6, Claude, Llama 4, Gemini 3 Pro, Claude 'Fennec', Grok. Which would you want the most?

The June release rumor mill, spotlighting Musk's Macrohard, a purely-AI software company under xAI positioned against Microsoft.

topic engagement 48,582

1.8K upvotes · 125 comments

I'm a software engineer with a decade of experience. I vibe code all of my side projects from my phone using Claude Code and don't read any of the code. It's so fun. Here are the rules I follow:

A senior engineer lays out a vibe-coding playbook: plan mode, iterative validation, version control, and forced test generation. The rules that make hands-off AI builds actually ship.

r/ClaudeAI

1.8K upvotes · 145 comments

11 Claude things I wish someone had told me 12 months ago

Field-tested Claude tricks across Projects, Custom Styles, and subagents. A tactical cheat sheet for power users that hit the front page this morning.

r/ClaudeAI

Roast Calendar

Your AI week, day by day

Sat23

May 23 - May 24•Mountain View, CA

Eazo AI 2026 Global Hackathon: Silicon Valley

2:00 PM PT•San Francisco, CA

Demo Day: Managing Context for Agents

3:30 PM PT•San Francisco, CA

Learn Claude Code For Beginners (Women's Workshop)

Sun24

11:00 AM PT•San Francisco, CA

Inference Mode #2: DeepSeek-V4

3:00 PM PT•Mountain View, CA

HackStorm 2.0: Demo Show

3:00 PM PT•Los Altos Hills, CA

The Age of Agents: How AI x Web3 Is Reshaping Payments and Wealth

Mon25

5:30 PM PT•Sunnyvale, CA

How an Agent-Native Language Can Make Agents More Reliable in Production

7:00 PM PT•San Francisco, CA

90/30 Club (ML reading) #54: TPU Performance

7:00 PM PT•San Francisco, CA

Robots, AMRs & Autonomous Systems Night

Tue26

5:00 PM PT•San Francisco, CA

Codex Community Hackathon - San Francisco #5

5:30 PM PT•San Francisco, CA

Operationalizing Agents with speakers from Google DeepMind, Snowflake, and Google Research

5:00 PM PT•San Francisco, CA

Build Night: Web Research Agents That Don't Break in Prod x HackerSquad

Wed27

May 27 - May 28•Hackathon

Claude Builders Club Hackathon

12:00 PM PT•Stanford, CA

Stanford OpenLab Seminar with Guido Appenzeller, GP a16z AI Infrastructure

5:30 PM PT•San Francisco, CA

Agents & APIs SF Developer Meetup

Thu28

5:30 PM PT•San Francisco, CA

Founder's Hour @ OpenAI

5:30 PM PT•San Francisco, CA

Continual Learning Circle Meetup & Dinner

5:00 PM PT•San Francisco, CA

AI Cluster: Space Data Centers

Fri29

May 29•Hackathon

Kane CLI Hack Day

5:00 PM PT•Mountain View, CA

Gemini Meetup

9:30 AM PT•San Francisco, CA

AI and Sustainability Global Summit

Last Sip

Parting thoughts

A single morning of phone calls killed a year of federal interagency work, a chat model wrote a paper that cleared a Fields Medalist's smell test, and a wafer foundry is being built in Albany on a government equity check. The throughline is that the people who decide what AI does next now sit in three rooms: a White House where one call moves policy, a Mountain View stage where Search becomes an agent manager, and a Mila lab where 7M-parameter models beat ensembles of frontier LLMs for a tenth of a cent per attempt. Pick which room you're building for, and pick on purpose.

Agentic Brew Daily

Fresh Batch

Bold Shots

Some crazy developments in AI in the last 24 hrs: Trump postpones/scraps AI executive order after pressure from Musk and Zuckerberg; OpenAI ...

Trump Kills AI Executive Order at the Last Minute: 'I Didn't Like It'

"He just hates regulation" Trump delays AI executive order that might hinder progress, says, "it's just something doomers wanted"

Checking the math behind OpenAI and Anthropic's latest headlines

SpaceXAI will provide @AnthropicAI with access to Colossus 1, one of the world's largest and fastest-deployed AI supercomputers

Anthropic Partners With SpaceX AI, Leopold's $5.5B Bet, and the Singularity Economy | EP #255

Anthropic is paying SpaceX $15 billion per year

OpenAI solved an 80-year math problem by... disproving it

AI solving math problems is a good thing... (Quoting OpenAI: "Today, we share a breakthrough on the planar unit distance problem...")

The Erdős Breakthrough

AI just disproved the biggest math conjecture so far

OpenAI general purpose model had a breakthrough on famous 80 year old Erdos problem.

OpenAI claims a general-purpose reasoning model found a counterexample to Erdos's unit-distance bound

The Pulse: Antigravity 2.0 takes 'IDE' out of its new IDE

"Google AI Pro" users get almost no YouTube ads — "Premium Lite" granted for free.

Our 8th-generation TPUs made a splash on the #GoogleIO stage: TPU 8t & TPU 8i!

I/O '26 Recap: Everything You Need to Know

Google I/O 2026 keynote in 35 minutes

Everything announced at Google I/O 2026... Makes me want to sell my phone.

U.S. TO AWARD $2B TO 9 QUANTUM COMPANIES AND TAKE EQUITY STAKES. $IBM: $1B, $GFS: $375M. Other recipients ($100M each): D-Wave...

The U.S. Just Went All In on Quantum Computing

US to award Quantum Computing Firms 2 Billion and take Equity Stakes

Slow Drip

The Grind

The Mill

The Counter

Roast Calendar

Last Sip