The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Updated: May 8, 2026, 9:34 AM PT

X.com Research Buzz

Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest
X.com
24561

Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest

Addison J. Wu, Ryan Liu, Shuyue Stella Li +2 more

#nlp
Who's in Charge? Disempowerment Patterns in Real-World LLM Usage
X.com
4752

Who's in Charge? Disempowerment Patterns in Real-World LLM Usage

Mrinank Sharma, Miles McCain, Raymond Douglas +1 more

#nlp
Agents of Chaos
X.com
4117

Agents of Chaos

Natalie Shapira, Chris Wendler, Avery Yen +35 more

#reinforcement-learning

AlphaXiv Trending

MolmoAct2: Action Reasoning Models for Real-world Deployment
AlphaXiv
94

MolmoAct2: Action Reasoning Models for Real-world Deployment

Haoquan Fang, Jiafei Duan, Donovan Clay

#reasoning#alphaxiv
On-Policy Distillation
AlphaXiv
84

On-Policy Distillation

Thinking Machines, Kevin Lu

#reinforcement-learning#efficiency#alphaxiv
Model Spec Midtraining: Improving How Alignment Training Generalizes
AlphaXiv
74

Model Spec Midtraining: Improving How Alignment Training Generalizes

Chloe Li, Sara Price, Samuel Marks

#machine-learning#safety-alignment#alphaxiv
Mamoda2.5: Enhancing Unified Multimodal Model with DiT-MoE
AlphaXiv
55

Mamoda2.5: Enhancing Unified Multimodal Model with DiT-MoE

ByteDance, Yangming Shi, Shixiang Zhu +1 more

#multimodal#alphaxiv
RLDX-1 Technical Report
AlphaXiv
53

RLDX-1 Technical Report

Dongyoung Kim, Huiwon Jang, Myungkyu Koo

#alphaxiv
ProgramBench: Can Language Models Rebuild Programs From Scratch?
AlphaXiv
52

ProgramBench: Can Language Models Rebuild Programs From Scratch?

John Yang, Kilian Lieret, Jeffrey Ma

#nlp#alphaxiv

HuggingFace Daily Papers

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction
HuggingFace
26

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Zhuofeng Li, Haoxiang Zhang, Cong Wei +2 more

#reinforcement-learning#retrieval#DCI-Agent
Audio-Visual Intelligence in Large Foundation Models
HuggingFace
13

Audio-Visual Intelligence in Large Foundation Models

You Qin, Kai Liu, Shengqiong Wu +2 more

#computer-vision#speech-audio#JavisVerse
StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
HuggingFace
10

StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Xiangyuan Xue, Yifan Zhou, Zidong Wang +2 more

#reinforcement-learning#xxyQwQ
Prescriptive Scaling Laws for Data Constrained Training
HuggingFace
2

Prescriptive Scaling Laws for Data Constrained Training

Justin Lovelace, Christian Belardi, Srivatsa Kundurthy +2 more

#machine-learning
Generative Quantum-inspired Kolmogorov-Arnold Eigensolver
HuggingFace
1

Generative Quantum-inspired Kolmogorov-Arnold Eigensolver

Yu-Cheng Lin, Yu-Chao Hsu, I-Shan Tsai +2 more

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs
HuggingFace
1

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs

Pranav Mantini, Shishir K. Shah

#multimodal#QuantitativeImagingLaboratory