The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Updated: Apr 10, 2026, 11:48 AM PT

AlphaXiv Trending

Self-Distilled RLVR
AlphaXiv
171

Self-Distilled RLVR

Chenxu Yang, Chuanyu Qin, Qingyi Si

#efficiency#alphaxiv
In-Place Test-Time Training
AlphaXiv
107

In-Place Test-Time Training

Guhao Feng, Shengjie Luo, Kai Hua

#machine-learning#alphaxiv
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
AlphaXiv
70

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Weian Mao, Xi Lin, Wei Huang

#efficiency#reasoning#alphaxiv
SkillX: Automatically Constructing Skill Knowledge Bases for Agents
AlphaXiv
63

SkillX: Automatically Constructing Skill Knowledge Bases for Agents

Chenxi Wang, Zhuoyun Yu, Xin Xie

#reinforcement-learning#retrieval#alphaxiv
Vero: An Open RL Recipe for General Visual Reasoning
AlphaXiv
62

Vero: An Open RL Recipe for General Visual Reasoning

Gabriel Sarch, Linrong Cai, Qunzhong Wang

#computer-vision#reinforcement-learning#reasoning#alphaxiv
PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing
AlphaXiv
58

PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing

Yiwen Song, Tomas Pfister

#reinforcement-learning#retrieval#alphaxiv

HuggingFace Daily Papers

ClawBench: Can AI Agents Complete Everyday Online Tasks?
HuggingFace
65

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Yuxuan Zhang, Yubo Wang, Yipeng Zhu +2 more

#reinforcement-learning#reacher-z
Small Vision-Language Models are Smart Compressors for Long Video Understanding
HuggingFace
3

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Junjie Fei, Jun Chen, Zechun Liu +2 more

#nlp#computer-vision#multimodal#FeiElysia
Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization
HuggingFace
3

Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization

Sai Srinivas Kancheti, Aditya Kanade, Rohit Sinha +2 more

#nlp#computer-vision#reinforcement-learning
The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment
HuggingFace
3

The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment

Rishab Balasubramanian, Pin-Jie Lin, Rituraj Sharma +2 more

#safety-alignment
CylinderDepth: Cylindrical Spatial Attention for Multi-View Consistent Self-Supervised Surround Depth Estimation
HuggingFace
0

CylinderDepth: Cylindrical Spatial Attention for Multi-View Consistent Self-Supervised Surround Depth Estimation

Samer Abualhanud, Christian Grannemann, Max Mehltretter

#abualhanud
Training a Student Expert via Semi-Supervised Foundation Model Distillation
HuggingFace
0

Training a Student Expert via Semi-Supervised Foundation Model Distillation

Pardis Taghavi, Tian Liu, Renjie Li +2 more

#machine-learning#efficiency