The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Updated: May 9, 2026, 9:53 AM PT

X.com Research Buzz

Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest
X.com
33195

Addison J. Wu, Ryan Liu, Shuyue Stella Li +2 more

#nlp

Who's in Charge? Disempowerment Patterns in Real-World LLM Usage
X.com
4749

Mrinank Sharma, Miles McCain, Raymond Douglas +1 more

#nlp

AlphaXiv Trending

Recursive Multi-Agent Systems
AlphaXiv
209

Xiyuan Yang, Jiaru Zou, Rui Pan

#reinforcement-learning

MolmoAct2: Action Reasoning Models for Real-world Deployment
AlphaXiv
118

Haoquan Fang, Jiafei Duan, Donovan Clay

#reasoning

On-Policy Distillation
AlphaXiv
116

Thinking Machines, Kevin Lu

#reinforcement-learning #efficiency

ProgramBench: Can Language Models Rebuild Programs From Scratch?
AlphaXiv
68

John Yang, Kilian Lieret, Jeffrey Ma

#nlp

RLDX-1 Technical Report
AlphaXiv
66

Dongyoung Kim, Huiwon Jang, Myungkyu Koo

OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
AlphaXiv
57

Yuwen Du, Rui Ye, Shuo Tang

#reinforcement-learning #retrieval

HuggingFace Daily Papers

StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
HuggingFace
16

Xiangyuan Xue, Yifan Zhou, Zidong Wang +2 more

#reinforcement-learning #xxyQwQ

EMO: Pretraining Mixture of Experts for Emergent Modularity
HuggingFace
5

Ryan Wang, Akshita Bhagia, Sewon Min

#machine-learning #allenai

Prescriptive Scaling Laws for Data Constrained Training
HuggingFace
4

Justin Lovelace, Christian Belardi, Srivatsa Kundurthy +2 more

#machine-learning

PianoCoRe: Combined and Refined Piano MIDI Dataset
HuggingFace
3

Ilya Borovik

#ilya16

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs
HuggingFace
2

Pranav Mantini, Shishir K. Shah

#multimodal #QuantitativeImagingLaboratory

Generative Quantum-inspired Kolmogorov-Arnold Eigensolver
HuggingFace
2

Yu-Cheng Lin, Yu-Chao Hsu, I-Shan Tsai +2 more