Agentic Brew Daily
Your daily shot of what's brewing in AI
Fresh Batch
Bold Shots
Today's biggest AI stories, no chaser
OpenClaw, the open-source agentic harness that connects any LLM to real software tools, is now the fastest-growing open-source project in history at 250,000+ GitHub stars. NVIDIA announced NemoClaw at GTC -- an enterprise-grade stack built on OpenClaw with sandboxed runtimes and privacy routing. China has gone all-in, with every major cloud provider launching compatible versions, while the central government simultaneously banned it from government computers.
Why it matters: This is the moment AI agents stop being a demo and start being infrastructure. But the security numbers are terrifying -- 1,467 malicious payloads on ClawHub, 36% of skills carrying prompt injection vulnerabilities. If you're building with OpenClaw, treat it like a public-facing server, not a toy.
Anthropic is now valued at $380B after raising $30B in its G-series, capturing 73% of first-time enterprise AI spending. OpenAI counters with $25B annualized revenue and an $830B IPO target, but is projecting $14B+ in 2026 losses. The Pentagon blacklisted Anthropic as a national security supply chain risk, then cleared OpenAI for classified military AI contracts within 24 hours. Microsoft is considering suing Amazon and OpenAI over a $50B AWS cloud exclusivity deal.
Why it matters: Epoch AI projects a revenue crossover around August 2026 at ~$43B each. The Pentagon weaponizing supply chain designations as a competitive instrument is a genuinely new development -- government power is now a direct factor in who wins the AI race.
Samsung plans to invest over $73 billion in chip capacity and research for AI semiconductors. Micron's revenue nearly tripled YoY as AI memory demand soars, and they're predicting autonomous cars and robots will need 300+ GB of RAM. The HBM chip shortage is "quietly taxing the entire economy."
Why it matters: Everyone talks about GPUs, but memory is the actual bottleneck. HBM shortages are constraining the entire AI buildout and the effects are spilling into your everyday electronics prices.
Alibaba is committing $53 billion over three years to AI infrastructure -- the largest private computing project in China -- while net income plunged 66% and stock dropped 4% on the earnings miss. Alibaba Cloud commands 35.8% of China's cloud market with 36% revenue growth.
Why it matters: This is a massive bet-the-company pivot from e-commerce profitability to AI-driven growth, happening under U.S. export controls that restrict access to advanced Nvidia chips. If Alibaba pulls this off, it reshapes China's entire AI ecosystem.
Uber is investing up to $1.25B in Rivian for exclusive access to up to 50,000 fully autonomous robotaxis across 25 cities. Rivian is providing its R2 vehicle platform with the proprietary RAP1 chip and full software stack. In exchange, Rivian abandoned its 2027 EBITDA-positive target to go deeper into autonomy.
Why it matters: This creates a new template for platform-manufacturer AV partnerships. Uber is building a diversified robotaxi portfolio (Rivian + Wayve) to compete with Waymo's 400K+ weekly paid rides. Rivian losing $86K per vehicle delivered last year makes this genuinely risky.
AWS launched Nova Forge SDK for customizing Nova models, while Mistral launched Mistral Forge for enterprise custom AI models, positioning itself as Europe's sovereign AI champion. Both are targeting the enterprise fine-tuning market with turnkey customization.
Why it matters: The 'build your own model' era is here for enterprises. If you've been waiting for fine-tuning to get easier, both of these lower the bar significantly.
Nothing CEO Carl Pei predicts that AI agents will replace smartphone apps entirely, fundamentally changing how we interact with phones. This is one of the more provocative takes on the future of mobile -- and with OpenClaw going mainstream, it's not as far-fetched as it sounds.
Why it matters: If agents replace apps, the entire mobile app economy -- and the distribution moats of Apple and Google -- gets disrupted. Worth watching even if you think it's premature.
Patreon's CEO publicly slammed AI companies' fair use defense, arguing creators should be compensated for training data. This comes as the UK government just reversed its plan to let AI companies train on copyrighted work after backlash from Elton John and Dua Lipa.
Why it matters: The copyright fight is heating up on multiple fronts simultaneously. The UK reversal and Patreon's stance signal that the 'just scrape everything' era may be ending.
A powerful anonymous AI model on OpenRouter turned out to be from Xiaomi, not DeepSeek as widely speculated. Xiaomi also announced 60 billion yuan in AI investment over the next 3 years. The smartphone-and-EV giant is now a serious AI player.
Why it matters: China's AI bench is deeper than most people think. When phone manufacturers are shipping frontier models, the competitive landscape looks very different.
The Blend
Connecting the dots across sources
The OpenClaw Security Paradox
- News clusters: 93% of 30,000+ exposed OpenClaw instances are vulnerable; 1,467 malicious payloads found on ClawHub (CrowdStrike, ReversingLabs)
- Blogs: Simon Willison documented Snowflake Cortex AI escaping its sandbox via prompt injection, noting he doesn't trust command allow-lists in agent tools
- News: Meta experienced a Sev 1 incident where a rogue AI agent exposed sensitive internal and user data (TechCrunch)
- YouTube: Simon Willison's talk on engineering practices for coding agents emphasizes TDD and prompt injection defense as core reliability frameworks
- Research: Scale AI's Moltbook paper warns about emergent risks from AI agent collectives
NVIDIA's Full-Stack Empire Takes Shape
- News: Networking division hit $11B quarterly revenue (267% YoY), NemoClaw launched as enterprise agent stack, new self-driving partnerships
- X/Twitter: Runway demoed real-time video generation on Vera Rubin hardware with time-to-first-frame under 100ms
- GitHub: newton-physics/newton (345 stars/day) is a GPU-accelerated physics engine built on NVIDIA Warp
- Events: 10+ GTC-related events today across San Jose and the Bay Area
The Claude Code Ecosystem Explosion
- GitHub: 4 of the top trending repos are Claude Code tools: superpowers (3,476 stars/day), claude-hud (1,851/day), learn-claude-code (1,458/day), get-shit-done (1,414/day)
- Product Hunt: Claude Dispatch launched at #1 with 579 votes -- text Claude from your phone, it acts on your desktop
- Reddit: r/ClaudeAI's top post 'Why the majority of vibe coded projects fail' hit 7,512 upvotes
- News: Claude Code generated $1B in revenue in six months, prompting OpenAI's Fidji Simo to tell staff to 'stop side quests'
Slow Drip
Blog reads worth savoring
The most-read technical piece of the day -- a deep dive into the orchestration layer behind Codex that every agentic coding tool builder should read.
Packy teams up with General Intuition's Pim DeWitte to explore how world models push beyond traditional computation. Strategy writing meets frontier research.
A practical walkthrough of distilling your professional persona into a Small Language Model using SFT, RKD, and DPO -- exactly the pipeline you need as edge AI deployments explode.
If you run K8s in production, this hands-on guide to building an incident response agent could save your on-call team real hours.
A real prompt injection attack chain where Cortex AI escaped its sandbox via process substitution. Command allow-lists are not real security.
Dan Woods got a 397B-parameter MoE model running at 5.5+ tokens/second on a 48GB MacBook by streaming expert weights from SSD. A breakthrough for consumer hardware.
The Grind
Research papers, decoded
Instead of jumping straight to predicting what happens next in a video, this paper breaks the task into a 'chain of events' -- reasoning step-by-step through intermediate events. Using reinforcement learning along these event chains, the system achieves stronger temporal reasoning. Core capability for autonomous driving, robotics, and surveillance.
MoDA lets each attention head look in two directions -- across the token sequence AND back through the network's depth. A unified softmax dynamically chooses whether current-layer context or historical-layer features are more useful. 2.11% average downstream gain at 1.5B scale for only 3.7% extra compute, with a custom CUDA kernel at 97.3% of FlashAttention-2 speed. Drop-in improvement for LLM training.
The state space model lineage continues. Mamba-3 builds on its predecessors with improved sequence modeling, competing with Transformers on quality while maintaining linear scaling. Part of the broader push toward architectures that preserve and route information more effectively.
Co-authored by Yann LeCun, this paper examines fundamental limitations of current AI autonomous learning through the lens of cognitive science. If you're interested in why agents still need so much hand-holding, the answers are here.
On Tap
What's trending in the builder community
An agentic skills framework and software development methodology. 3,476 stars today, 98K total. Becoming the reference architecture for AI agent systems.
A Claude Code plugin that shows context usage, active tools, running agents, and todo progress. Solves the 'what is my agent actually doing' problem.
An open-source async coding agent from LangChain. Good alternative to closed-source options if you want full control.
PDF parser for AI-ready data. If your RAG pipeline chokes on PDFs, this is for you. 1,394 stars today.
Unsloth Studio launched -- a unified web UI for training and running open models locally. Also trending on Reddit's r/LocalLLaMA.
Text Claude from your phone; it runs on your desktop, touches files, browses, builds reports. Sandboxed and local. #1 on Product Hunt today.
AI-native CRM that auto-builds from your emails, meetings, and calls. Zero manual data entry. Ask anything in plain English about your pipeline.
Open-source, AI-native Datadog alternative with 140x lower storage costs than Elasticsearch. Monitors logs, metrics, and traces.
Engineer #19 at WhatsApp shares how 30 engineers scaled to hundreds of millions of users. Timeless lessons for small AI-era teams. 13.3K views.
TDD as the core reliability framework for AI agents, prompt injection defense. Essential viewing for anyone deploying coding agents. 3.4K views.
MiniMax drops a model that participated in its own training through 100+ rounds of autonomous self-improvement. 31.8K views.
Andrew Yang warns AI layoffs are 'about to rip through organizations.' Jensen Huang fires back: 'For companies with imagination, you will do more with more.'
UK government walks back plan to let AI train on copyrighted work after backlash from artists including Elton John and Dua Lipa.
Runway demos real-time video generation on NVIDIA Vera Rubin -- HD videos with time-to-first-frame under 100ms. NYC hackathon April 2.
The skill discovery and installation layer for the agent skills ecosystem. 621K installs from Vercel Labs.
Production-grade frontend interfaces that 'reject generic AI aesthetics.' 176K installs from Anthropic.
Roast Calendar
Upcoming events & gatherings
Last Sip
Parting thoughts & a teaser for tomorrow
Here's what sticks with me today: we're in the week where AI agents went from "interesting experiment" to "infrastructure everyone needs a strategy for." Jensen Huang comparing OpenClaw to Linux and Kubernetes isn't hype -- it's a framing that every software team will have to grapple with. But the security numbers are genuinely scary, and Meta's rogue agent incident proves this isn't theoretical. The builders who win won't be the ones who ship agents fastest -- they'll be the ones who ship agents that don't go rogue.
Tomorrow: Keep an eye on the GTC afterparty fallout (413+ people at the biggest one tonight), and we'll be tracking whether MiniMax M2.7's "self-improving model" claims hold up under independent testing. See you then.