The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Brew Time: Jun 18, 2026, 7:35 AM PT

AlphaXiv Trending

AlphaXiv
240

GLM-5.2: Built for Long-Horizon Tasks

Z.ai

Z.ai

From AGI to ASI
AlphaXiv
200

From AGI to ASI

Tim Genewein, Matija Franklin, Alexander Lerchner

Google DeepMind, University of Waterloo (work conducted while at Google DeepMind, Australian National University, University College London, Google DeepMind University of Waterloo (work conducted while at Google DeepMind

Computer Vision
You Don't Need Strong Assumptions: Visual Representation Learning via Temporal Differences
AlphaXiv
115

You Don't Need Strong Assumptions: Visual Representation Learning via Temporal Differences

Ninad Daithankar, Alexi Gladstone, Yann LeCun

New York University

NLP
VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models
AlphaXiv
69

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Sen Xu, Shixi Liu, Wei Wang

Computer Vision
Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation
AlphaXiv
66

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Jie Zhang, Xiaoyue Chen, Anzhe Chen

HuggingFace Daily Papers

Computer Vision
MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction
HuggingFace
6

MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction

Jianing Zhang, Chenhao Zheng, Yajun Yang, Max Argus, Rustin Soraki

Ai2

NLP
ViT-Up: Faithful Feature Upsampling for Vision Transformers
HuggingFace
2

ViT-Up: Faithful Feature Upsampling for Vision Transformers

Krispin Wandel, Jingchuan Wang, Hesheng Wang

Shanghai Jiao Tong University

Machine Learning
Bag of Dims: Training-Free Mechanistic Interpretability via Dimension-Level Sign Patterns
HuggingFace
1

Bag of Dims: Training-Free Mechanistic Interpretability via Dimension-Level Sign Patterns

Varun Reddy Nalagatla

Reinforcement Learning
Seeing Before Reasoning: Decoupling Perception and Reasoning for Shortcut-Resilient Multimodal On-Policy Self-Distillation
HuggingFace
1

Seeing Before Reasoning: Decoupling Perception and Reasoning for Shortcut-Resilient Multimodal On-Policy Self-Distillation

Sihan Wang, Xiyao Liu, Lianqing Liu, Zhi Han

Reinforcement Learning
iOSWorld: A Benchmark for Personally Intelligent Phone Agents
HuggingFace
1

iOSWorld: A Benchmark for Personally Intelligent Phone Agents

Lawrence Keunho Jang, Mareks Woodside, Geronimo Carom, Andrew Keunwoo Jang, Jing Yu Koh

Carnegie Mellon University

Reinforcement Learning
MyPCBench: A Benchmark for Personally Intelligent Computer-Use Agents
HuggingFace
1

MyPCBench: A Benchmark for Personally Intelligent Computer-Use Agents

Lawrence Keunho Jang, Andrew Keunwoo Jang, Jing Yu Koh, Ruslan Salakhutdinov

Carnegie Mellon University