The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Brew Time: Apr 10, 2026, 11:48 AM PT

AlphaXiv Trending

Efficiency
Self-Distilled RLVR
AlphaXiv
171

Self-Distilled RLVR

Chenxu Yang, Chuanyu Qin, Qingyi Si

Machine Learning
In-Place Test-Time Training
AlphaXiv
107

In-Place Test-Time Training

Guhao Feng, Shengjie Luo, Kai Hua

Efficiency
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
AlphaXiv
70

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Weian Mao, Xi Lin, Wei Huang

Reinforcement Learning
SkillX: Automatically Constructing Skill Knowledge Bases for Agents
AlphaXiv
63

SkillX: Automatically Constructing Skill Knowledge Bases for Agents

Chenxi Wang, Zhuoyun Yu, Xin Xie

Computer Vision
Vero: An Open RL Recipe for General Visual Reasoning
AlphaXiv
62

Vero: An Open RL Recipe for General Visual Reasoning

Gabriel Sarch, Linrong Cai, Qunzhong Wang

Reinforcement Learning
PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing
AlphaXiv
58

PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing

Yiwen Song, Tomas Pfister

HuggingFace Daily Papers

Reinforcement Learning
ClawBench: Can AI Agents Complete Everyday Online Tasks?
HuggingFace
65

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Yuxuan Zhang, Yubo Wang, Yipeng Zhu, Penghui Du, Junwen Miao

Safety Alignment
The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment
HuggingFace
3

The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment

Rishab Balasubramanian, Pin-Jie Lin, Rituraj Sharma, Anjie Fang, Fardin Abdi

NLP
Small Vision-Language Models are Smart Compressors for Long Video Understanding
HuggingFace
3

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Junjie Fei, Jun Chen, Zechun Liu, Yunyang Xiong, Chong Zhou

NLP
Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization
HuggingFace
3

Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization

Sai Srinivas Kancheti, Aditya Kanade, Rohit Sinha, Vineeth N Balasubramanian, Tanuja Ganu

Machine Learning
Training a Student Expert via Semi-Supervised Foundation Model Distillation
HuggingFace
0

Training a Student Expert via Semi-Supervised Foundation Model Distillation

Pardis Taghavi, Tian Liu, Renjie Li, Reza Langari, Zhengzhong Tu

Abualhanud
CylinderDepth: Cylindrical Spatial Attention for Multi-View Consistent Self-Supervised Surround Depth Estimation
HuggingFace
0

CylinderDepth: Cylindrical Spatial Attention for Multi-View Consistent Self-Supervised Surround Depth Estimation

Samer Abualhanud, Christian Grannemann, Max Mehltretter