Jun 13, 2026

Agentic Brew Daily

Your daily shot of what's brewing in AI

Fresh Batch

Distilled trend

As OpenAI weighs API price cuts to match DeepSeek's 75% reduction, enterprises like Uber, DoorDash, and Meta are already capping token budgets rather than waiting for the price war.
Anthropic's hidden Fable 5 guardrails that silently degraded AI-research prompts triggered a trust backlash that analysts say is actively pushing developers toward OpenAI's Codex.
Builder blogs' 'stop prompting, build loops' thesis jumped into research, where a Self-Harness paper showed agents rewriting their own harness lifting Terminal-Bench pass rates from 40% to 62%.

Bold Shots

Today's biggest AI stories, no chaser

Anthropic walks back Fable 5's secret throttle

Anthropic shipped Claude Fable 5 on June 9, its first "Mythos-class" model and the most capable thing it's ever made generally available. Then someone read the 319-page system card and found a quiet admission: the model would silently degrade its own answers for anything it suspected was tied to frontier AI work, without telling you. Researchers called it "secret sabotage," Anthropic apologized and reversed course — flagged requests now visibly fall back to Opus 4.8 with a stated reason. Over-broad bio/chem/cyber filters also blocked benign prompts, including the word "cancer."

Why it matters: This reframes AI safety as a competitive weapon — a top lab covertly throttling rivals' access while keeping full power for itself. It hands critics ammunition that "safety" rhetoric can mask monopolistic behavior, and pushes enterprises to treat frontier-model governance as a real procurement risk (Microsoft already restricted internal use over data retention).

Claude Fable 5 Is Here: What It Is, How It Works, and How It Differs From Mythos 5

Data Science Collective (Medium)

Claude Fable is relentlessly proactive

Simon Willison

Anthropic just admitted its new AI is too dangerous to release without a leash. They shipped it anyway. Anthropic put ou...

@whydeso·59 engagements

The Fable 5 Backlash Is Getting Serious

AI Revolution·41.9K views

Claude Fable 5 feels less like a model launch and more like a preview of AI inequality

r/ClaudeAI·5.7K upvotes

Anthropic purposely made its new Mythos-based models bad at AI research, and developers are fuming

r/singularity·829 upvotes

SpaceX prices the largest IPO in history

SpaceX priced its IPO at $135/share — the biggest stock debut ever — selling ~555.6M shares to raise ~$75B at a $1.77 trillion valuation, trading on Nasdaq as SPCX since June 12. That's nearly 3x the old Saudi Aramco record and would make it the 7th-biggest US company, past Tesla, with Musk holding 82%+ voting control. The catch: it's unprofitable, with a $4.3B operating loss on $18.7B revenue. Demand was absurd — order-book interest topped $250B, retail alone exceeded $100B, and Robinhood rationed shares by lottery. On listing eve Musk unveiled "Terafab," a ~$55B in-house chip plant on ASML EUV machines.

Why it matters: The largest IPO ever sets a trillion-dollar valuation almost entirely on unproven AI and space tech — a live test of whether public markets will underwrite the AI-capex thesis at this scale. It may also expose a ~$100B shadow SPV market where retail buyers may not actually own the shares they think they do.

SpaceX raised $75B for AI in space

The Neuron AI

SpaceX is officially worth $2.3 trillion (7th largest company on earth). Its entire vision relies on launching 1 million...

@cryptopunk7213·3.7K engagements

SpaceX: Everything you need to know about the biggest stock market debut in history

Sky News·62.3K views

The SpaceX IPO, Fable 5, AI Capex Update & Market Check w/ Gavin Baker, Andrew Fox & Clark Tang

Bg2 Pod·35.8K views

SpaceX locks in IPO price of $135, making it largest stock debut ever

r/wallstreetbets·5.3K upvotes

OpenAI buys Ona to keep Codex agents running

OpenAI agreed on June 11 to acquire Ona (formerly Gitpod), folding its ~80-person team into the Codex group pending regulatory approval. Ona's pitch: secure, persistent cloud environments that let agents keep grinding on long tasks inside a customer's own infrastructure, even after the developer's laptop goes to sleep. Codex now serves 5M+ weekly users, up roughly 400% since early 2026. Terms are undisclosed, but IDC pegged Ona's 2025 revenue near $7M, implying a ~$450-500M deal at ~30x.

Why it matters: Secure, persistent execution is the missing infrastructure that makes coding agents actually enterprise-ready, and OpenAI is buying it specifically to counter Anthropic's Claude Code traction. It nudges the agent market toward integrated, vendor-specific stacks — convenient now, lock-in later.

We've reached an agreement to acquire @ona_hq. Its secure cloud execution technology will...

@OpenAINewsroom·4K engagements

OpenAI 收购 Ona、价格战开打、Google 三连发

茶社AI·646 views

OpenAI to acquire Ona to support its AI coding assistant, Codex

r/codex·38 upvotes

OpenAI weighs slashing token prices

Per a WSJ report circulating June 10-11, OpenAI is weighing drastic token-price cuts to pull customers from Anthropic. The trigger is Anthropic itself: Claude Code went viral and drove its first profitable quarter, yet it priced Fable 5 at a premium — $10/M input and $50/M output, exactly double GPT-5.5's $5/$30, effective June 23. China piles on: DeepSeek made its 75% V4-Pro cut permanent, and on an identical benchmark workload Claude runs $4,811 vs DeepSeek's $1,071 and GLM's $544 — about a 9x gap.

Why it matters: A price war among frontier labs compresses the very margins investors are being asked to value ahead of OpenAI and Anthropic listings, while free open-weight Chinese models drag the marginal price of intelligence toward zero.

DeepSeek is now up to 50x CHEAPER than OpenAI and Anthropic for AI toke...

@BullTheoryio·1.5K engagements

Anthropic AI Token Costs for Developers Doubled - AI Bubble is Expensive

CNBC·20.4K views

OpenAI is considering drastic price cuts as it seeks to win over customers from archrival Anthropic

r/OpenAI·461 upvotes

Google DeepMind ships DiffusionGemma

Google DeepMind released DiffusionGemma, an experimental open-weights model on the 26B A4B MoE Gemma 4 architecture that generates tokens via discrete text diffusion instead of autoregression. It denoises a fixed 256-token canvas in parallel, activates only 3.8B of 26B params at inference, and fits in ~18GB VRAM quantized. It hits 700+ tokens/sec on an RTX 5090 and 1,000+ on an H100 — up to 4x faster — under Apache 2.0 with day-zero support in vLLM, Transformers, MLX, and Unsloth. Quality lags standard Gemma 4 on general benchmarks, so Google still recommends the regular model for max-quality production.

Why it matters: It's the first diffusion LLM with native serving support in mainstream runtimes, validating parallel text diffusion as a real path to faster, GPU-friendly generation. Bidirectional attention makes it strong on code infilling and inline edits — but the quality gap shows diffusion LLMs aren't a drop-in autoregressive replacement yet.

DiffusionGemma is our new experimental open model with up to 4x faster output on dedicated...

@GoogleDeepMind·2.5K engagements

Diffusion Gemma: Google's First Open Diffusion Model

Prompt Engineering·30.7K views

Google DeepMind Just Dropped "DiffusionGemma" — Text Generation via Image-Style Diffusion Model

r/ArtificialInteligence·259 upvotes

Slow Drip

Blog reads worth savoring

Analysis · The Pragmatic EngineerThe Pulse: a trend of trying to cut back on AI spend within eng departments?

Concrete cost-control playbooks from Uber (budget blown by March), DoorDash (per-dev token caps), and others, signaling AI's shift from growth-at-any-cost to fiscal accountability.

Analysis · The AI CornerThe head of Claude Code stopped prompting Claude. Most builders missed why

Lays out "loop engineering" with five shared Claude Code/Codex primitives, copy-paste /goal stopping conditions, and the maker-checker SKILL.md split for running agents unattended.

Tutorial · KDnuggetsPairing Claude Code with Local Models

A practical setup for routing daily Claude Code tasks (completion, refactoring, debugging) to a quantized local model at zero per-token cost and no rate limits.

Tutorial · Amazon EngineeringEvaluate AI agents systematically with Agent-EvalKit

Walks through an Apache-2.0 toolkit that traces full agent execution paths and exposes hidden failures, including a worked example where faithfulness measured just 32.3% from fabricated data on empty tool results.

The Grind

Research papers, decoded

X (Twitter community signal)24,591 upvotes · arxiv · X

AI models collapse when trained on recursively generated data

Train generative models indiscriminately on earlier models' output and the tails of the original distribution vanish, with quality and diversity degrading irreversibly across generations. As the open web fills with synthetic text, provenance-tracked and genuinely human data becomes a strategic asset.

X (Twitter community signal)7,649 upvotes · arxiv · X

LLMs Reproduce Human Purchase Intent via Semantic Similarity Elicitation of Likert Ratings

Asking an LLM directly for a 1-5 rating yields middle-clustered, unrealistic distributions. Their Semantic Similarity Rating method has the model write a free-text reaction, embeds it, and maps it to a Likert distribution via cosine similarity — hitting ~0.88 distributional similarity vs ~0.26 for direct prompting across 57 real surveys (9,300 responses).

X (Twitter community signal)6,819 upvotes · arxiv · X

Memory Caching: RNNs with Growing Memory

RNNs are cheap (O(L)) but recall-limited; Transformers recall well but cost O(L²). Memory Caching checkpoints an RNN's hidden states at intervals so effective memory grows with sequence length, closing most of the recall gap to Transformers on long-context tasks while beating prior recurrent SOTA. A tunable knob to trade memory/compute for recall.

The Mill

Builder tools ground for action

225.7K stars

obra/superpowers

An agentic skills framework & software development methodology that works.

GitHub

16.6K likes

DeepSite v4

Generate any application by Vibe Coding it DeepSite is a Vibe Coding Platform designed to make coding smarter and more efficient. Tailored for developers, data scientists, and AI engineers, it integrates generative AI into your coding projects to enhance creativity and productivity. DeepSite v4 is a Hugging Face Space tagged with docker, region:us. It has 16617 likes on Hugging Face.

HF Spaces

8.5K stars

LMCache/LMCache

LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

GitHub

439 votesProduct Hunt

Respan Gateway

Respan AI Gateway connects your app to 1,000+ AI models through one endpoint. But routing is the easy part. Respan keeps production AI reliable and under control with fallbacks, retries, caching, spend limits, alerts, and full traces for every call. Gateway, observability, evals, prompt management, monitors, and cost controls all run on one platform, so you do not need to stitch together five tools to debug production.

Product Hunt

The Counter

Voices from the AI bar today

50K views

You are using Claude Fable 5 wrong

A practitioner's guide to getting more out of Fable 5 via AI tournaments, interview-before-build, and pointing the model at large datasets like contracts and churn data.

Greg Isenberg

6.9K views

LLMs Are About to Hit a wall - The AI Scaling Law Might Be Breaking...

Looks at emerging evidence that AI scaling laws may be bending as medium-sized models outperform larger ones on certain tasks.

TheAIGRID

27K engagements

Claude Fable 5 Launch Sparks Hype and a Safety Backlash

Claude Fable 5 has been out for a couple of days. Some projects people have already built with it:

@claudeai

75K engagements

Nvidia's Compute Empire Widens: SpaceX IPO, Space Data Centers, and the TPU 'Red Herring' Debate

Looking forward to taking our exciting partnership with Nvidia to the next-level.

@elonmusk

1.3K upvotes · 152 comments

The Claude Code active attack didn't stop. 294,842 secrets stolen from 6,943 machines. It evolved and now spreads through Python too and uses Claude Code itself to steal your secrets.

Reports an escalating supply-chain attack against Claude Code users, now propagating via Python and weaponizing Claude Code itself to exfiltrate credentials.

r/ClaudeAI

Roast Calendar

Your AI week, day by day

Sat13

June 13•San Francisco

Autonomous Healthcare Hackathon | xAI Cursor Vercel | Legion Health x Atlas

10:00 AM PT•San Francisco

AI Operator Run Club: Embarcadero Loop + Arsicault

1:30 PM PT•Milpitas

Team Up to Build the Next Generation of AI Collective Intelligence Apps

Sun14

9:00 AM PT•San Francisco

BuilderShip - Yacht Hackathon by Composio, Nebius, Tavily

1:00 PM PT•San Francisco

Vibecoding Workshop: From Idea to Hosted Website

4:00 PM PT•Mountain View

Pick Anything Challenge #1 · Bothaus Demo Day

Mon15

June 15•Hackathon

India High School Exoplanet Data Challenge

9:30 AM PT•Palo Alto

Beyond the Hype: Where VCs Are Actually Investing in Robotics & Physical AI

Tue16

June 16•Hackathon

Databricks Apps & Agents for Good Hackathon 2026

12:00 PM PT•San Francisco

AgentForge: Build & Ship Production-Ready AI Agents

1:00 PM PT•Menlo Park

OpenClaw Workshop: Build Your GTM AI Agent

Wed17

5:00 PM PT•San Francisco

Foundry Live Demo Night

6:00 PM PT•San Francisco

The Agent Stack: Lightning Talks + Happy Hour with Pioneer & Exa

5:00 PM PT•San Francisco

Codex Community Meetup - San Francisco

Thu18

5:00 PM PT•San Francisco

AI Agents SF #14 — Healthcare Agents

5:30 PM PT•San Francisco

Full Sail Deploy

5:30 PM PT•Menlo Park

Bay Area Frontier Research Club #10 | Hexo Labs HQ (dinner + paper discussion)

Fri19

June 19•Hackathon

$1,000 Industrial AI Hackathon

5:30 PM PT•San Francisco

Agents & APIs SF Developer Meetup

5:30 PM PT•San Francisco

Physical AI Demo Night

Last Sip

Parting thoughts

If there's a thread running through today, it's the price of intelligence. The labs are racing to make a token cost almost nothing, and the companies buying those tokens are racing to use almost none of them. Somewhere between those two motions is where the next year of AI economics actually gets decided — not in a launch post, but in a budget spreadsheet. Worth sitting with that one over your coffee.