Jun 24, 2026

Agentic Brew Daily

Your daily shot of what's brewing in AI

Fresh Batch

Distilled trend

Washington's order forcing Anthropic to disable Fable and Mythos abroad is fueling demand for the open-weight and orchestration alternatives it aimed to contain.
Anthropic is both the most regulation-exposed lab and the most supply-locked, banned globally the same week Micron and SpaceX deepened its compute and memory pipeline.
The AI cybersecurity race turned two-directional, with OpenAI's GPT-5.5-Cyber writing working exploits while builders pack offensive-security demo nights and red-team Mythos.

Bold Shots

Today's biggest AI stories, no chaser

Sakana AI ships Fugu, a router model that hires frontier LLMs instead of beating them

On June 22, Tokyo's Sakana AI launched Fugu and Fugu Ultra — a multi-agent orchestration system delivered as a single ~7B router model that assigns Thinker, Worker, and Verifier roles and farms work out to a swappable pool of frontier LLMs through one OpenAI-compatible API. Sakana claims Fugu Ultra matches or beats Claude Opus 4.8, GPT-5.5, and Gemini 3.1 Pro on coding and agentic benchmarks, though every figure is self-reported. It ships in subscription and pay-as-you-go tiers, but isn't available in the EU/EEA at launch pending GDPR compliance.

Why it matters: Fugu inverts the scaling playbook — a tiny router that "hires" frontier models rather than training one, letting a compute-poor lab claim frontier-class results. It's explicitly pitched as an export-control hedge. The open question is whether swappable orchestration is real sovereignty or just relocated vendor lock-in.

Sakana Fugu surprisingly performed near GLM 5.2 level but 17x more expensive! We gave the same prompt to 4 models: build a complete live Trader Desk with both frontend and backend components, real-time market data fetched from external APIs for 8 symbols, and a custom dark-theme

@atomic_chat_hq·2.2K engagements

I Battle Tested Sakana Fugu's Fable Killer

Nate Herk | AI Automation·66.7K views

Sakana in Japan just dropped a mythos competitor and it looks great

r/ArtificialInteligence·429 upvotes

Anthropic puts Claude in Slack as a tagged teammate

Anthropic launched Claude Tag, a Slack-native AI teammate you summon by typing @Claude, in beta for Team and Enterprise customers. One shared Claude per channel builds persistent memory from channel history, uses admin-granted tools, and works asynchronously over hours or days, with an optional ambient mode that lets it act proactively. It runs on Opus 4.8, is framed as the multiplayer evolution of Claude Code, and replaces the legacy Claude in Slack app on August 3. Anthropic says 65% of its product team's code is now written by its internal version — including most of the code that built Claude Tag itself.

Why it matters: This turns a private assistant into a shared coworker living inside the channel — the contested layer of enterprise AI. The pitch is form factor, not a new model, and the recursive "the tool wrote the tool" 65% stat is the dogfooding proof for buyers. It's also a land grab against Salesforce's Slackbot, with a concentrated surveillance concern attached.

Introducing Claude Tag, a new way for teams to work with Claude. In Slack, Claude joins as a team member with access to the channels and tools you choose. Tag Claude in and delega

@claudeai·17K engagements

This is a new paradigm for interacting with Claude that is significantly more "inline" with all the other human activity org-wide. Once you do all of the under the hood engineering

@karpathy·4.2K engagements

Tag Claude in, right where you already work

Claude·30.4K views

Anthropic's Claude Tag is learning your company, one Slack message at a time

r/technology·34 upvotes

OpenAI's GPT-5.5-Cyber wants to own the patch, not just find the bug

On June 22, OpenAI expanded its Daybreak cybersecurity initiative, launching the full GPT-5.5-Cyber, a Codex Security plugin, a Cyber Partner Program, and the open-source Patch the Planet effort. GPT-5.5-Cyber is a specialized offensive/defensive model that traces attack paths, validates exploitability, generates targeted patches, and produces remediation evidence — distributed only to verified "trusted defenders," not for general use. Patch the Planet (with Trail of Bits, HackerOne, CALIF) shipped 64 PRs and 37 merged patches across 19 projects in week one, with human review before any finding reaches a maintainer.

Why it matters: OpenAI's thesis is that the security bottleneck moved from finding bugs to fixing them — so it's building the whole remediation layer and positioning itself as the intermediary who finds the bug and sells the fix while defining who counts as a "trusted defender." That dual-use gating is a safety story and a commercial moat at once, sharpened by an AISI-found universal jailbreak.

We're expanding OpenAI Daybreak to help democratize patching vulnerable software at machine speed: - Codex Security plugin: find, validate, and fix vulnerabilities right inside Co

@OpenAI·4.9K engagements

NEW Mythos, GPT-5.5 Cyber & AI Security: Hype or Reality?

RoboShadow·202.6K views

Google DeepMind stakes $75M in A24, and the fans revolt

Google is investing roughly $75M in independent studio A24, paired with a multi-year research partnership between A24 and DeepMind, announced June 22. The deal is non-exclusive and explicitly does not give Google access to A24's film and TV library or other data. An early project is an AI-assisted storyboarding tool, still a prototype, built on DeepMind's Veo video generator plus custom models. It's reported as the first time Google has taken a stake in a film studio — and the r/A24 reaction ("Why A24, why did you do this," 1,900 upvotes) was the most intense Reddit response of any story today.

Why it matters: A24's entire value is a logo audiences trust as a promise of human craft, so an equity-plus-R&D deal reads as a brand wagering its own identity. Google conceding all data access shows it's buying proximity and proof — a credible reference customer for Veo, a Hollywood beachhead — not a training corpus. The fan revolt is the actual product risk.

Google is investing $75M into A24 as part of an AI research partnership between the 2 companies. They will look at using AI to create new tools for movie production & distribution.

@DiscussingFilm·16.3K engagements

Google's $75M A24 Deal Isn't About Cutting Costs

The Best One Yet·471 views

Why A24, why did you do this

r/A24·1.9K upvotes

Meta drops the Ray-Ban name and launches its own AI smart glasses

Meta launched its first AI smart glasses under its own brand — dropping the Ray-Ban name — with EssilorLuxottica, starting at $299. The lineup spans the $299 Adventurer and Fury and the $399 Kylie Jenner / Starfire edition, across 26 styles, all prescription-compatible. It's powered by Meta AI via Muse Spark, the first multimodal model from Meta Superintelligence Labs, and carries over Ray-Ban Meta Gen 2 hardware (12MP camera, 3K video, 8+ hour battery). It went on sale June 23 across 17 countries, amid allegations of an embedded "NameTag" facial-recognition capability.

Why it matters: Dropping Ray-Ban is a price-driven land grab — same hardware, $100-200 cheaper — to widen Meta/EssilorLuxottica's ~82% category share before Apple enters. But shedding the trusted name strips a reputational buffer just as surveillance allegations escalate. The recurring verdict: the hardware and price are strong, but Muse Spark is the weak, unproven link.

Our new Meta Glasses are launching with 26 styles across a range of colors, lenses, and frames, so there's a pair of Meta Glasses for just about any look.

@MetaNewsroom·254 engagements

New Meta Glasses Hands-On: New Designs Start at $299, and There's a Kylie Jenner Model?

CNET·16.2K views

Slow Drip

Blog reads worth savoring

Analysis · InterconnectsGLM-5.2 is the step change for open agents

Why the first open-weight model that actually "feels right" as a coding agent rivals Opus/Fable on benchmarks, and what that does to closed-lab pricing and the US-China gap.

Analysis · Latent SpaceRed-Teaming after Mythos — Zico Kolter & Matt Fredrikson

Why AI security is not "cybersecurity with AI": indirect prompt injection, agent attack surface, and why bigger models aren't more robust.

Tutorial · simonwillison.netPorting the Moebius 0.2B image inpainting model to run in the browser with Claude Code

A lived-experience walkthrough of taking a CUDA/PyTorch model to WebGPU in the browser, including how to run a parallel Claude Code side-project while your main agent grinds.

Tutorial · Lenny's NewsletterHow I AI: How to write AI agent loops in Claude Code and Codex

Concrete recipes for self-triggering goal loops and subagents, plus how Mozilla's AI harness surfaced a 15-year-old Firefox memory-safety bug.

Analysis (engineering) · CloudflareHow we found a bug in the hyper HTTP library

A six-week debugging story ending in a four-line poll_shutdown flush fix for a backpressure race that silently truncated large HTTP responses across multiple hyper versions.

The Grind

Research papers, decoded

World Models & Reasoning Architectures213 upvotes · alphaXiv

Looped World Models (LoopWM)

Instead of stacking layers, LoopWM reuses a single parameter-shared transformer block in a loop, refining the latent state of a simulated environment with spectrally-constrained residual dynamics for stable long rollouts. A 1B LoopWM beat Claude-Opus (~100x larger) on ScienceWorld (68.4% vs 47.2% exact match) and stayed stable over thousands of steps. Trade parameter count for loop iterations and run reliable long-horizon simulation on modest hardware.

World Models & Reasoning Architectures99 upvotes · alphaXiv

Fixed-Point Reasoners (FPRM)

A looped transformer that keeps iterating until the hidden state hits a fixed point, using convergence as a learned halting signal and doing more work on hard inputs, less on easy ones. A 7M-parameter FPRM hit 94.2% on Sudoku-Extreme (vs 74.7% for the larger Tiny Reasoning Model) and also handles Maze, state-tracking, and ARC-AGI. Adaptive compute with a built-in stopping rule.

Agents100 upvotes · alphaXiv

Tmax: A Simple Recipe for Terminal Agents

Pairs a new dataset (TMAX-15K, ~14,600 difficulty-controlled terminal tasks) with a stabilized RL recipe to train open-weight command-line agents. TMAX-9B scores 27.2% on Terminal-Bench 2.0, the best open-weight result under 10B, with gains that generalize to SWE-Bench and AIME. Dataset, models, and code planned for release.

Spatial Intelligence107 upvotes · alphaXiv

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Treats spatial reasoning as accumulating spatio-temporal evidence across frames: a VLM planner calls a hierarchy of tools (2D detection, 3D lifting, counts/measurements) backed by Scene and Agent Memory. It improves both open and closed VLMs training-free (46.4% on MMSI-Bench, 60% on ViewSpatial-Bench), and a distilled S-Agent-8B rivals advanced closed systems.

The Mill

Builder tools ground for action

16.6K likes

DeepSite v4

Generate any application by Vibe Coding it DeepSite is a Vibe Coding Platform designed to make coding smarter and more efficient. Tailored for developers, data scientists, and AI engineers, it integrates generative AI into your coding projects to enhance creativity and productivity. DeepSite v4 is a Hugging Face Space tagged with docker, region:us. It has 16617 likes on Hugging Face.

HF Spaces

73.7K stars

bytedance/deer-flow

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.

GitHub

32.9K stars

jamiepine/voicebox

The open-source AI voice studio. Clone, dictate, create.

GitHub

8.2K stars

palmier-io/palmier-pro

macOS video editor built for AI

GitHub

177 votesProduct Hunt

Cloudflare Temporary Accounts

The moment an agent needs to deploy something, it slams face-first into a wall built for humans. Today we're rolling out Temporary Accounts on Cloudflare Workers. Any agent can now run wrangler deploy — temporary and get a live Worker in seconds.

Product Hunt

The Counter

Voices from the AI bar today

10K views

He won a Nobel here for AlphaFold. Then he left.

AlphaFold co-creator and 2024 Nobel laureate John Jumper walks through the actual architecture (Evoformer, invariant point attention, FAPE loss) — the rare deep technical retrospective from the person who built it.

Machine Learning Street Talk

7.1K views

Bye, Bye OpenAI & Anthropic?..

Breaks down Subquadratic's "SSA" architecture claim: 12-million-token context at ~1,000x less compute than Transformers, with potential to gut RAG/vector-DB infrastructure if it holds up.

Your AI Guy

8.8K views

Google's new AI agent stack from I/O 2026

A tour of Google's four-layer agent stack (Gemini 3.5 Flash, Agent Development Kit 2.0, the A2A protocol, and a Managed Agents API) — production-ready agent tooling developers can build on now.

Google Cloud Tech

1.2K upvotes · 299 comments

GLM-5.2 is a win for local AI

The community rallies around GLM-5.2 as proof an open-weight model can run locally and trade blows with frontier closed models — the single most-discussed item of the cycle.

r/LocalLLaMA

1.1K upvotes · 91 comments

Claude Opus caught malware hidden in my repo, then reverse engineered the whole thing

A viral firsthand account of Claude Opus autonomously detecting and unpacking hidden malware, fueling debate over LLMs as practical security tooling.

r/ClaudeAI

Roast Calendar

Your AI week, day by day

Wed24

4:30 PM PT•San Francisco

AWS Agentic AI Showcase: Accelerate Agentic AI in Production

5:00 PM PT•San Francisco

Skills & Agents — YC Founder Night

6:00 PM PT•San Francisco

How We Built It: LangSmith Engine

Thu25

5:00 PM PT•Hackathon

$1,000 Industrial AI Hackathon

5:15 PM PT•Palo Alto

Surreal Dinner Series: Multi-Agent Orchestration with E2B, Factory & WorkOS

6:00 PM PT•Menlo Park

Self-Improving Agents | ODSC AI and Snowflake

Fri26

8:30 AM PT•San Francisco

Flutter SF Gen UI Hackathon — Powered by Very Good Ventures

5:00 PM PT•Mountain View

Gemini Meetup

5:30 PM PT•San Francisco

AI Memory for Agents & Cognee Launch Party

Sat27

2:00 PM PT•Santa Clara

AI-Native Entrepreneurship Demo Day

4:30 PM PT•San Francisco

Local AI Workshop

6:00 PM PT•San Francisco

World Cup [Codex] Build Day: SF x Seoul

Sun28

2:00 PM PT•San Francisco

A Floot Founders Hackathon @ Corgi Cafe

5:00 PM PT•San Francisco

AI Engineer World's Fair — New Engineer Orientation (IRL)

7:00 PM PT•Los Altos

Build Your First AI Project in 2 Hours: A Hands-On Workshop

Mon29

5:00 PM PT•San Francisco

Retrieval for Agents: Building Retrieval Infrastructure

5:30 PM PT•San Francisco

Harness Engineering: State of the Art in Agent Harnesses

6:00 PM PT•San Francisco

Artificial Analysis Intelligence Index

Tue30

9:00 AM PT•San Francisco

Runpod Flash Hack Day

3:30 PM PT•Menlo Park

Building AI Ecosystem: Talks + AI Agent Workshops (ServiceNow, Glean, SAP)

5:30 PM PT•San Francisco

Agents in Production: The Full Stack

Last Sip

Parting thoughts

Funny how a ban meant to slow people down ended up being the day's best advertisement for everyone else. Sakana's Fugu, GLM-5.2 running on someone's home rig, a 1B world model out-reasoning a model 100x its size — the common thread is that capability keeps leaking around whatever fence you build. Meanwhile Claude moved into the Slack channel and OpenAI decided it wants to write the patch too. Pour another cup and enjoy the read.