The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Brew Time: Jun 13, 2026, 7:23 AM PT

AlphaXiv Trending

Cosmos 3: Omnimodal World Models for Physical AI
AlphaXiv
239

Cosmos 3: Omnimodal World Models for Physical AI

Aditi, Niket Agarwal

NVIDIA

Self-Harness: Harnesses That Improve Themselves
AlphaXiv
121

Self-Harness: Harnesses That Improve Themselves

Hangfan Zhang, Shao Zhang, Kangcong Li

Shanghai Artificial Intelligence Laboratory

Computer Vision
Latent Spatial Memory for Video World Models
AlphaXiv
117

Latent Spatial Memory for Video World Models

Weijie Wang, Haoyu Zhao, Yifan Yang

Monash University, Zhejiang University

Retrieval
AlphaXiv
61

First Steps Toward Automated AI Research

Recursive Superintelligence

Peking University

Machine Learning
ICA Lens: Interpreting Language Models Without Training Another Dictionary
AlphaXiv
47

ICA Lens: Interpreting Language Models Without Training Another Dictionary

Sida Liu, Feijiang Han

University of Maryland

HuggingFace Daily Papers

NLP
Rethinking Psychometric Evaluation of LLMs: When and Why Self-Reports Predict Behavior
HuggingFace
5

Rethinking Psychometric Evaluation of LLMs: When and Why Self-Reports Predict Behavior

Rafal Kocielnik, Pengrui Han, Peiyang Song, Myrl G. Marmarelis, Ramit Debnath

Reinforcement Learning
See What I See, Know What I Think: Dense Latent Communication Across Heterogeneous Agents
HuggingFace
3

See What I See, Know What I Think: Dense Latent Communication Across Heterogeneous Agents

Siyi Chen, Xiaoyan Zhang, Meng Wu, Jonathan Tremblay, Valts Blukis

University of Michigan

Reinforcement Learning
WebChallenger: A Reliable and Efficient Generalist Web Agent
HuggingFace
2

WebChallenger: A Reliable and Efficient Generalist Web Agent

Jayoo Hwang, Xiaowen Zhang, Vedant Padwal

Reinforcement Learning
Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents
HuggingFace
2

Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

Yujun Zhou, Kehan Guo, Haomin Zhuang, Xiangqi Wang, Yue Huang

NLP
The Cold-Start Safety Gap in LLM Agents
HuggingFace
2

The Cold-Start Safety Gap in LLM Agents

Chung-En Sun, Linbo Liu, Tsui-Wei Weng

NLP
On the Limits of LLM Adaptability: Impact of Model-Internalized Priors on Annotation Task Performance
HuggingFace
0

On the Limits of LLM Adaptability: Impact of Model-Internalized Priors on Annotation Task Performance

Etienne Casanova, Rafal Kocielnik, R. Michael Alvarez

California institute of technology