Jun 5, 2026

Agentic Brew Daily

Your daily shot of what's brewing in AI

Fresh Batch

Distilled trend

Alphabet raising $85B to fund a $185B compute bill on the same day Broadcom's in-line AI guide erased $300B of chip-sector market cap signals the market is no longer pricing AI capex as one-way upside.
Microsoft shipped MAI-Thinking-1 trained without OpenAI distillation, replaced Qualcomm and Intel with NVIDIA RTX Spark on Windows, and pulled trillion-parameter inference onto the desk via DGX Station — three vendor decouplings disguised as one keynote.
Trump's voluntary AI cyber order, traced back to Anthropic's Mythos demo that auto-chained a 27-year-old OpenBSD exploit, lands the same week Anthropic's own red team reports malicious-actor risk jumped from 33% to 56%.

Bold Shots

Today's biggest AI stories, no chaser

Microsoft and NVIDIA unveil joint agentic PC stack at Build/Computex 2026

NVIDIA and Microsoft rolled out a joint full-stack agentic architecture spanning Windows AI PCs, DGX Station, Azure, and Microsoft Foundry. RTX Spark — an Arm superchip with 20 Grace cores, a Blackwell GPU, and 128GB unified memory at ~300 GB/s — ships this fall across Surface, ASUS, Dell, HP, Lenovo, and MSI. The DGX Station for Windows puts a Grace Blackwell Ultra with 748GB of coherent memory under your desk, capable of running trillion-parameter models locally. Microsoft Scout, the always-on M365 personal agent, runs on OpenClaw with scoped Entra identities per agent.

Why it matters: Triple decoupling in a single keynote — Microsoft is reducing dependence on OpenAI (via in-house MAI-Thinking-1), on x86 (RTX Spark replaces Qualcomm/Intel/AMD), and on the cloud (DGX Station puts frontier inference on the desk). The OpenShell + Entra governance layer is what makes always-on agents enterprise-shippable.

Introducing NVIDIA DGX Station for Windows, the world's most powerful deskside AI supercomputer with Windows powered by NVIDIA GB300. Run frontier AI models with up to 1 trillion parameters locally. Build and run secure AI agents on Windows with NVIDIA OpenShell.

@nvidianewsroom·3K engagements

Microsoft Scout is a new AI personal assistant built on OpenClaw. Scout is Microsoft's 'first real personal assistant,' and you can download the desktop app today.

@tomwarren·482 engagements

Google dropped Gemma 4 12B on June 3 — a unified, encoder-free multimodal model that processes vision and audio directly into the LLM backbone, ships Apache 2.0, and runs on a 16GB laptop. The 550M vision tower is gone, replaced by a 35M-parameter vision embedder; the audio encoder is gone entirely, with raw 48x48 patches and 40ms audio frames feeding straight to the model. 256K context, Multi-Token Prediction for ~3x inference, day-one support across Hugging Face, Kaggle, LM Studio, Ollama, llama.cpp, MLX, and vLLM. Unsloth Dynamic GGUFs push it down to 8GB RAM at 4-bit.

Why it matters: Frontier multimodal AI now lives on a laptop, not in a data center. Benchmarks (GPQA 78.8%, MMLU Pro 77.2%, LiveCodeBench v6 72%) plus Apache 2.0 + day-one tooling put real pressure on Llama 4, Qwen, and Mistral in the 10-20B local tier.

Google Ditched the Encoders in Gemma 4 12B, and It Runs Multimodal AI on a 16GB Laptop

Towards AI (Medium)

Today we're introducing Gemma 4 12B — our latest open model that brings advanced agentic reasoning, vision and audio directly to your laptop.

@Google·8.2K engagements

Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs. Google's new model, Gemma 4 12B Unified supports image, audio and 256K context.

@UnslothAI·2.6K engagements

Google just dropped Gemma 4... (WOAH)

Matthew Berman·88.8K views

Gemma 4 12B Is INSANE - Is THIS the BEST Local Coding Model Yet?

Bijan Bowen·31.1K views

google/gemma-4-12B · Hugging Face

r/LocalLLaMA·940 upvotes

New Google Gemma 4 12B Claims Near-26B Performance - We Tested Both!

r/LocalLLaMA·603 upvotes

Alphabet prices record $85B equity offering to fund AI buildout

Alphabet priced an upsized ~$84.75B equity offering on June 3 — the largest single equity offering in history, eclipsing Petrobras 2010's $70B. Four layers: ~$18B Class A/C common, $16.75B mandatory convertible preferred, $10B private placement to Berkshire Hathaway, and a $40B ATM program. The proceeds fund Alphabet's 2026 AI/cloud capex of $180-190B (~6x 2022's $31B). Buffett bought 14.2M Class A at $351.81 and 14.4M Class C at $348.20, pushing Berkshire's Alphabet exposure past $26B — its 7th-largest US holding.

Why it matters: Big Tech's low-capex, high-earnings era ends here. Microsoft, Meta, Amazon, and Oracle now face an implicit benchmark. Berkshire — famously capex-averse — anchoring an AI infra raise is the strongest validation signal Big Tech has received.

Alphabet completed an equity offering raising ~$45B, with an additional $40B to be raised via an 'at the market' program, totaling ~$85B, including a $10B investment from Berkshire Hathaway.

@sundarpichai

'Fast Money' traders react to Alphabet's proposed $80 billion equity capital raise for AI

CNBC Television·52.6K views

Alphabet Will Raise $80 Billion to Fund AI Spending

Bloomberg Television·28.5K views

Alphabet's $80 billion stock sale leaves Wall Street in 'unprecedented territory,' says Goldman's Gutman

r/technology·4.9K upvotes

Google owner Alphabet to sell $80bn in stock to fund AI spending spree

r/technology·1.9K upvotes

Broadcom's Q2 earnings trigger $300B sell-off despite record AI revenue

Broadcom posted Q2 FY2026 revenue of $22.19B (+48% YoY) with AI semi revenue of $10.8B (+143% YoY) — and the stock still fell 12-15%. Q3 AI semi guide of ~$16B vs Visible Alpha consensus ~$17.2B opened a $1.2B gap. CEO Hock Tan reiterated rather than raised the $100B+ FY2027 AI semi target. VMware-led software hit $7.18B vs $7.32B expected. The drawdown erased $300B+ in market cap and dragged AMD, Intel, Qualcomm down 4%+, with Micron and Super Micro near -7%. Tan also acknowledged that Google is diversifying its chip supply.

Why it matters: Markets are reading the guide as a hyperscaler-capex tell. If the most plugged-in custom-silicon vendor with six anchor customers only sees $16B next quarter, the implicit ceiling on AI infrastructure spend just got lower for everyone.

Broadcom beat earnings and the stock fell about 13% anyway. I called the beat on the record, so here is the grade. The prediction landed. Earnings came in at $2.44 against the $2.40 consensus, and AI chip revenue hit $10.8 billion.

@theaiportfolios·484 engagements

Broadcom AI Chip Outlook Disappoints Investors

Bloomberg Television·7.2K views

Broadcom Inc Earnings LIVE AVGO Stock Results Q2 FY2026!

Trade The Pool·8.4K views

Broadcom stock sinks after hours as AI chip sales forecast disappoints

r/wallstreetbets·2.1K upvotes

Trump signs voluntary AI executive order on frontier model cybersecurity testing

On June 2, Trump signed "Promoting Advanced Artificial Intelligence Innovation and Security" — creating a voluntary 30-day pre-release model access window for federal review. The order explicitly forecloses mandatory licensing and limits vetting to three capabilities: software-exploit discovery, chemical-weapon design assistance, and autonomous cyberattacks. NSA, Treasury, and CISA must produce a classified benchmark and a "covered frontier models" definition within 60 days; Treasury stands up an AI cybersecurity clearinghouse within 30. The 30-day window was negotiated down from a 90-day May draft after OpenAI, Anthropic, and Google pushed back. The whole thing was triggered by Anthropic's April Mythos demo, which autonomously chained exploits across major OSes including a 27-year-old OpenBSD flaw.

Why it matters: "Voluntary" is doing the heavy lifting. The opt-in is the public-facing layer, but the classified threshold defining "covered frontier models" — written by the NSA inside 60 days — is the real lever.

Many have reached out to me regarding the new Cyber EO. A few thoughts: First, President Trump is the most pro-innovation president we've ever had. He's made it clear that the U.S. has to win the AI race and that a pro-innovation, pro-energy, and pro-infrastructure policy is the [path].

@DavidSacks·1.7K engagements

Trump signs new artificial intelligence executive order

NBC News·94.9K views

Trump signs AI executive order to give government early look at new models

CBS News·114.4K views

Trump signs AI executive order asking companies to give government early access to models

r/news·2K upvotes

Slow Drip

Blog reads worth savoring

Analysis · ByteByteGoHow OpenAI Built Its Data Agent

Concrete blueprint for taming 90,000 tables with a single un-fine-tuned LLM plus six context layers and three-step query verification — and how Codex pulled off a 600-petabyte cross-cloud migration in two months.

Analysis · a16z NewsA Functional Taxonomy of World Models

Cuts through the "world model" buzzword by mapping today's video generators, physics sims, and planners onto the POMDP loop so you can argue precisely about what each system actually does.

Analysis · Latent SpaceScaling Past Informal AI - Carina Hong, Axiom Math

Makes the case that Lean-verified proof generation (12/12 Putnam, 99% on Verina vs OpenAI's 4.9%) is a fundamentally stronger training signal than RLHF/GRPO for compounding reasoning.

Tutorial · Hugging Face / NVIDIAHow to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Walks through replay-mixed fine-tuning of NVIDIA's 600M multilingual ASR with att_context_size to trade 80ms-vs-1s latency, showing 32% / 31% WER cuts on Greek and Bulgarian.

News · The Neuron AIGoogle Gemini got hijacked via WhatsApp

SafeBreach's zero-click "Fake Context Alignment" technique smuggles malicious instructions through Android notifications, weaponizing Gemini for data exfil, phishing relay, and silent surveillance across WhatsApp, Slack, Signal, and Instagram.

The Grind

Research papers, decoded

AI & Economics19,469 upvotes · arxiv · X

The AI Layoff Trap

Wharton and BU economists model why rational firms keep automating jobs even when they can see the demand cliff coming: each firm pockets the full cost savings of automation but only eats 1/N of the aggregate demand loss (the rest hits competitors), so competition itself traps the industry in an over-automation arms race. UBI, capital taxes, worker equity, upskilling, and Coasean bargaining all fail to fix it — only a Pigouvian automation tax set at τ* = ℓ(1−1/N) closes the wedge. Predicted empirical signature: profit erosion coinciding with mass layoffs in fragmented AI-deploying industries.

Embodied AI / Robotics229 upvotes · alphaxiv

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

The Qwen team bolts a 1.15B-param flow-matching Diffusion-Transformer action expert onto a Qwen3.5-4B VLM backbone and trains one model to do manipulation, navigation, and trajectory prediction across multiple robot bodies, conditioning on textual embodiment prompts (platform, control frequency, morphology). Hits 97.9% on LIBERO, 86.1/87.2% on RoboTwin-Easy/Hard, 76.9% real-world ALOHA OOD, and 26.6% zero-shot DOMINO. Takeaway: embodiment-aware text conditioning + a 4-stage curriculum (text-to-action pretrain → joint CPT → SFT → RL) transfers skills across morphologies without per-robot architectural changes.

World Models45 upvotes · alphaxiv

Cosmos 3: Omnimodal World Models for Physical AI

NVIDIA's open-weight foundation model unifies language, image, video, audio, and action in one Mixture-of-Transformers with two pathways — an autoregressive "Reasoner" for discrete tokens and a diffusion "Generator" for video/audio/action — joined by 3D Multimodal RoPE for cross-rate alignment. Post-trained versions ranked best open-source Text-to-Image and Image-to-Video on Artificial Analysis and #1 on RoboArena's policy leaderboard, with 63.4% on Physics-IQ V2V. Weights, code, curated synthetic datasets (SDG-PhyxSim, SDG-RobotSim, SDG-DriveSim), and the eval benchmark are all released under OpenMDW-1.1.

Fine-Tuning / Personalization33 upvotes · alphaxiv

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Reframes LoRA as persistent per-user "local state" on top of a shared trillion-parameter base. Ships hard empirical results across three axes: trillion-param LoRA-RL at ~10% the compute of full-param RL (with "Router Replay R3" fixing MoE train/serve mismatch), 216 PPO runs showing ranks 16-32 are the sweet spot, OLoRA-tail init beating standard LoRA at r=16, and a δ-mem online associative-memory adapter that lifts Qwen3-4B-Instruct from 46.79 → 51.66 at <0.5% param overhead. MinT infra moves 1.7 GB adapters instead of 61 GB merged checkpoints and cold-starts MoE LoRA loads 8.5–8.7× faster. Takeaway: use rank-16-32 LoRA with √r alpha scaling and serve adapters separately — don't merge.

The Mill

Builder tools ground for action

180.9K stars

NousResearch/hermes-agent

The agent that grows with you

GitHub

108.5K stars

github/spec-kit

💫 Toolkit to help you get started with Spec-Driven Development

GitHub

79.8K stars

PaddlePaddle/PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

GitHub

25K stars

lfnovo/open-notebook

An Open Source implementation of Notebook LM with more flexibility and features

GitHub

8.9K stars

github/copilot-sdk

Multi-platform SDK for integrating GitHub Copilot Agent into apps and services

GitHub

The Counter

Voices from the AI bar today

2.1K views

OpenClaw Security Risks: 6 Dangers of Autonomous AI Agents

Examines critical security vulnerabilities in autonomous AI agents, using the OpenClaw incident to highlight prompt injection, credential exposure, and memory-related risks.

11K views

The AI Data Center Boom Doesn't Survive Basic Physics

Technical and environmental critique arguing that hard constraints in power generation, water cooling, and thermal management could cap the AI data center buildout.

10K views

Beyond Components: Designing Generative UI for MCP Apps — Ruben Casas, Postman

Maps the UI spectrum from static components to fully generative interfaces that produce HTML, CSS, and JavaScript on demand inside AI agent applications.

32K engagements

Exclusive: Meta Platforms has delayed plans to release its newest artificial intelligence model to developers multiple times and as of Tuesday didn't have a planned date to release it. Internal teams cite benchmark regressions and unresolved safety evals as blockers.

WSJ scoops that Meta has repeatedly delayed its newest AI model release with internal teams citing benchmark regressions and unresolved safety evals as blockers.

17K engagements

RESEARCHERS TRAINED AN AI TO WRITE GPU CODE.. IT NOW BEATS THE INDUSTRY-STANDARD COMPILER ON 100% OF TESTS.. CRUSHES CLAUDE AND GEMINI ON THE SAME BENCHMARK. The pipeline learns kernel-level optimizations a human author would miss and ships them as drop-in replacements.

Evan Luthra flags a research pipeline that trained an AI to write GPU code, claiming it beats the industry-standard compiler on 100% of tests and outpaces Claude and Gemini.

2.9K upvotes · 285 comments

I had Opus 4.8 build Temu League of Legends in under a day - I call it LMAO

A builder shows off a one-day clone of League of Legends built end-to-end with Opus 4.8 — the comments are where the real notes on agent-driven game scaffolding live.

r/ClaudeAI

2.6K upvotes · 817 comments

Introducing Claude Opus 4.8

Anthropic's Opus 4.8 launch thread, with the most upvoted early reactions on real coding/agent work — useful before you decide where to slot it in your stack.

r/ClaudeAI

Roast Calendar

Your AI week, day by day

Fri5

9:30 AM PT•San Francisco

Agentic AI Hackathon - SF

9:00 AM PT•Sunnyvale, CA

AI-tonomy Summit: Models and Agents

5:30 PM PT•Stanford, CA

Stanford AI and Investment Series #8 (a16z, Plug and Play, ASG, Main Character Capital)

Sat6

Jun 6 - Jun 7•San Francisco

WeaveHacks 4: Multi-Agent Orchestration Hackathon with Weights & Biases

12:00 PM PT•San Francisco

AI Agents + Crypto Buildathon

1:30 PM PT•Milpitas, CA

From Lab to Enterprise: Building Trustworthy GenAI

Sun7

11:00 AM PT•Pleasanton, CA

AI Hackathon with The AI Collective Tri-Valley | Humans In AI Week

Jun 7 - Jun 8•Hillsborough, CA

Flip the Script: WWDC Watch Party + Mac Dev Conference on Building, Distributing & Monetizing AI Apps by MacPaw

3:00 PM PT•San Francisco

"Call my Agent": Vibe Coding with Friends @ Interact

Mon8

5:00 PM PT•Menlo Park, CA

Beyond Text: Future of Voice AI

5:00 PM PT•Stanford, CA

Stanford OpenLab Seminar with Matt White, Global CTO of AI of Linux Foundation and CTO of PyTorch Foundation

5:30 PM PT•San Francisco

⌘ + AI + Design: How to prompt like a designer

Tue9

6:00 PM PT•San Francisco

GTM Eng SF: Claude Code + GTM Lightning Talks with leaders from Exa, LangChain, AssemblyAI & more!

5:15 PM PT•San Francisco

Fork it, Branch it, Bop it: New Primitives for Agents

6:00 PM PT•San Francisco

Formal Verification + AI: Midspiral's Practical Approach to Bug-Free Code

Wed10

9:30 AM PT•San Francisco

AI Inference Hack Day

2:00 PM PT•San Francisco

Humans in the Loop 2026

5:00 PM PT•San Francisco

Fixing Broken Retrieval: Building Self-Correcting Agent Loops

Thu11

9:30 AM PT•San Francisco

ClickHouse + Hex AI hackathon

8:30 AM PT•San Francisco

Vector Space Day San Francisco

6:00 PM PT•San Francisco

Artificial Analysis Coding Agent Benchmarks

Last Sip

Parting thoughts

There's a quiet symmetry to today's tape: the same week the cloud bill for AI broke an all-time record, the model itself got small enough to live on the laptop you're reading this on. Pricey on one end, free on the other — and the gap between those two numbers is where the next year of product decisions will get made.

Agentic Brew Daily

Fresh Batch

Bold Shots

Introducing NVIDIA DGX Station for Windows, the world's most powerful deskside AI supercomputer with Windows powered by NVIDIA GB300. Run frontier AI models with up to 1 trillion parameters locally. Build and run secure AI agents on Windows with NVIDIA OpenShell.

Microsoft Scout is a new AI personal assistant built on OpenClaw. Scout is Microsoft's 'first real personal assistant,' and you can download the desktop app today.

Microsoft Just Shocked The Entire AI World: 7 New AI Models

Agentic AI PC's from NVIDIA and Microsoft - NVIDIA RTX Spark Killing OpenAI

NVIDIA just announced the RTX Spark CPU, developed with Microsoft, at Computex.

Would you consider getting an NVIDIA RTX Spark laptop?

Google Ditched the Encoders in Gemma 4 12B, and It Runs Multimodal AI on a 16GB Laptop

Today we're introducing Gemma 4 12B — our latest open model that brings advanced agentic reasoning, vision and audio directly to your laptop.

Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs. Google's new model, Gemma 4 12B Unified supports image, audio and 256K context.

Google just dropped Gemma 4... (WOAH)

Gemma 4 12B Is INSANE - Is THIS the BEST Local Coding Model Yet?

google/gemma-4-12B · Hugging Face

New Google Gemma 4 12B Claims Near-26B Performance - We Tested Both!

Alphabet completed an equity offering raising ~$45B, with an additional $40B to be raised via an 'at the market' program, totaling ~$85B, including a $10B investment from Berkshire Hathaway.

'Fast Money' traders react to Alphabet's proposed $80 billion equity capital raise for AI

Alphabet Will Raise $80 Billion to Fund AI Spending

Alphabet's $80 billion stock sale leaves Wall Street in 'unprecedented territory,' says Goldman's Gutman

Google owner Alphabet to sell $80bn in stock to fund AI spending spree

Broadcom beat earnings and the stock fell about 13% anyway. I called the beat on the record, so here is the grade. The prediction landed. Earnings came in at $2.44 against the $2.40 consensus, and AI chip revenue hit $10.8 billion.

Broadcom AI Chip Outlook Disappoints Investors

Broadcom Inc Earnings LIVE AVGO Stock Results Q2 FY2026!

Broadcom stock sinks after hours as AI chip sales forecast disappoints

Many have reached out to me regarding the new Cyber EO. A few thoughts: First, President Trump is the most pro-innovation president we've ever had. He's made it clear that the U.S. has to win the AI race and that a pro-innovation, pro-energy, and pro-infrastructure policy is the [path].

Trump signs new artificial intelligence executive order

Trump signs AI executive order to give government early look at new models

Trump signs AI executive order asking companies to give government early access to models

Slow Drip

The Grind

The Mill

The Counter

Roast Calendar

Last Sip