TECH

Meituan trains LongCat-2.0, a 1.6T model, entirely on Chinese chips

32+

Signals

Strategic Overview

01.
Meituan open-sourced LongCat-2.0 on June 30, 2026 - a 1.6-trillion-parameter mixture-of-experts model (~48B active per token) with a 1-million-token context window, released under the MIT license.
02.
The model was trained from scratch on a 50,000-card cluster of domestic Chinese AI ASICs with no Nvidia GPUs, which Meituan calls the first trillion-parameter model to complete both full training and inference on domestic hardware.
03.
Before the reveal, LongCat-2.0 ran anonymously as the stealth model 'Owl Alpha' on OpenRouter and rose to the top of global usage.
04.
Meituan reports a 59.5 on SWE-bench Pro - edging GPT-5.5's 58.6 - plus 70+ scores across several agentic-coding benchmarks, though the figures are self-reported.

Beijing Cracked the One Wall That Actually Mattered

Meituan says LongCat-2.0 is the first trillion-parameter model to complete both full training and inference on a 50,000-card domestic Chinese compute cluster ^[1]. That phrase - full training - is the whole story. Until now, Chinese labs could run finished models on home-grown silicon, but the punishing pre-training run, the part that needs tens of thousands of chips talking to each other for weeks, was assumed to require Nvidia hardware ^[2]. To coordinate that many accelerators without Nvidia's NCCL networking layer, Meituan swapped in Huawei's HCCL communication library ^[3], and community teardowns peg the hardware as Huawei Ascend-class ASIC superpods.

The architecture is what makes the scale affordable. LongCat-2.0 is a sparse mixture-of-experts design: 1.6 trillion total parameters, but only about 48 billion fire on any given token, paired with a new 'LongCat Sparse Attention' that keeps the 1-million-token context window from exploding in cost ^[4]. The point is not that the chips beat Nvidia's - they do not - it is that 'good enough' domestic silicon, wired together cleverly, was enough to train a frontier-scale model start to finish. That is the exact assumption the entire export-control strategy was built to prevent.

The Model That Won Before Anyone Knew Its Name

The most disarming detail is that LongCat-2.0 had already been judged - anonymously - and won. For roughly two months it ran on OpenRouter as a stealth model called 'Owl Alpha,' with no company name attached, and climbed to the top of global usage while processing trillions of tokens a month ^[5]. Developers were routing real production traffic to it inside coding tools before anyone knew a food-delivery company had built it.

That sequence matters because it inverts the usual order of AI hype. When Meituan finally pulled the mask off on June 30 ^[6], the reveal doubled as a credibility test the model had already passed: its ranking came from developers actually choosing it, not from a launch-day benchmark sheet, and it predated the geopolitics that now frame every headline. The community reaction reflected that - practitioners who had quietly liked 'Owl Alpha' were mostly surprised by the maker, not the quality, and the roleplay and local-model crowds treated the domestic-chip training run as the genuinely novel part.

Why a Noodle-Delivery App Out-Trained the Labs

The obvious question - why is China's DoorDash training frontier models at all - has a boring answer and an interesting one. The boring answer: Meituan needs a reasoning engine for the in-app agents that recommend restaurants and book hotels, so building rather than renting keeps that capability in-house ^[7]. Vertical integration, not moonshot ambition.

The interesting answer, argued across the coverage and echoed in developer discussion, is data. Meituan has spent years watching trillions of real-world orders move through cities - food, couriers, shops, logistics - the kind of physical-world signal no lab training purely on scraped web text possesses. Chinese tech conglomerates also sprawl across sectors in a way that makes 'a delivery company with a 1.6T model' far less strange than it sounds from the outside. Either way, the release lands as a marker that in China, frontier model-building is no longer confined to dedicated AI labs; the capability has diffused into the broader tech economy.

What the Benchmark Sheet Isn't Telling You

Meituan's own numbers are striking - a 59.5 on SWE-bench Pro that edges GPT-5.5's 58.6, plus 70-plus scores across agentic-coding suites ^[4]- but they are self-reported, and the skeptics have a real case. Independent hands-on testing has been mixed: reviewers running LongCat-2.0 through agentic coding harnesses found it capable but not a daily driver over rivals, and one-shot chat tests landed well below the headline benchmark scores. A plausible read is that it is tuned to shine as an iterative agent rather than a single-shot responder, but that is a hypothesis, not a verified result.

Access is the other catch. The API and coding plan are gated behind Chinese payment apps like Alipay and WeChat Pay, and full open weights were still listed as 'coming soon' on Hugging Face at launch ^[8]. Analysts also caution the export-control story is more nuanced than 'the wall fell' - restrictions still raise cost, slow access, and force harder engineering trade-offs even when they do not stop a model from shipping ^[9]. The honest read: a real milestone in Chinese chip independence, wrapped in benchmark claims that still deserve third-party verification.

Historical Context

2023

Meituan founded its LongCat model team, launching its first model the following year.

2025-08

Meituan released LongCat-Flash, a 560-billion-parameter model that served as the precursor to LongCat-2.0.

2026-04-24

LongCat-2.0-Preview entered public testing as the anonymous 'Owl Alpha', appearing on OpenRouter days later.

2026-06-30

Meituan officially unveiled and open-sourced LongCat-2.0, unmasking it as the engine behind Owl Alpha.

Power Map

Key Players

Subject

Meituan trains LongCat-2.0, a 1.6T model, entirely on Chinese chips

Meituan

Chinese food-delivery and local-services giant that developed and open-sourced LongCat-2.0, demonstrating a non-AI-lab can train a trillion-parameter model without Nvidia GPUs.

Nvidia

US chipmaker whose export-controlled GPUs LongCat-2.0 pointedly bypasses; Bernstein estimated Nvidia held roughly 40% of China's AI-chip market in 2025 and forecast an 8% share loss in 2026.

Huawei / Enflame

Domestic Chinese chipmakers filling the gap left by US export controls; Meituan used Huawei's HCCL communication library, and the ASIC superpods are linked to Huawei's Atlas-950 line.

OpenRouter

Model-routing platform where LongCat-2.0 ran anonymously as 'Owl Alpha' and topped global usage before its identity was disclosed, giving the model a usage-based credibility test.

DeepSeek, Alibaba, ByteDance

Other major Chinese AI players pursuing the same reduction in US-chip dependence; SCMP places LongCat-2.0 on par with DeepSeek's V4-pro flagship.

Fact Check

9 cited

Source Articles

Top 5

THE SIGNAL.

Analysts

"Argues Nvidia export controls will backfire, saying they will just accelerate the development of AI that runs on Chinese chips."

Yuchen Jin

AI commentator and analyst

"Reads LongCat-2.0 as evidence the compute arms race is widening: if China can scale frontier training on local silicon at this level, the race is wider open than ever."

Alvin Foo

Venture partner

"Frames the model as a first - the first model ever trained to near-frontier performance on 50,000 Chinese domestic accelerators."

Hanchi Sun

Researcher, Lehigh University

"Cautions that export controls still impose real costs even as the model shifts the debate, since restrictions still raise cost, slow access, complicate scaling, and force harder engineering trade-offs."

Geopolitechs

Geopolitics analysis publication

The Crowd

"Introducing LongCat-2.0 🐱 1.6T parameters · MoE with ~48B active · 1M context The full model behind Owl Alpha on @OpenRouter — now available. Built for agentic coding from the ground up: ◆ LongCat Sparse Attention (LSA) — scales efficiently for 1M-context tokens"

@@Meituan_LongCat3489

"Meituan's LongCat-2.0 reportedly lands near GPT-5.5 on SWE-bench. So I threw 5 HTML canvas animation prompts at both. 🥷 Paper sliced fruit-ninja style. 💧 An ink drop diffusing in water. 🔥 A letter burning. 🗑️ Paper crumpling into a ball. ✂️ A strip-cut shredder. Here's how"

@@stevibe321

"a chinese model, longcat-2.0, is here and it's beating gpt-5.5 at coding.. china's meituan (yes, the food delivery company) trained it without a single nvidia chip. meet longcat-2.0, from meituan, china's doordash, basically. their AI team is barely two years old and just"

@@heyrobinai31

"Introducing LongCat-2.0 - , a large-scale MoE language model with 1.6 trillion total parameters and ~48 billion activated per token. This was the stealth model that was on Openrouter under the name 'owl-alpha'."

@u/AnticitizenPrime441

Broadcast

Meituan LongCat 2.0 (Tested): China's 1.6T OPEN MODEL looks CRAZY!

Meituan LongCat 2.0 is HERE (Real Tests and Review)

LongCat (560B): Bye Sonnet! Why is NO ONE Talking about this!? This OPEN Model is AWESOME!