The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Brew Time: May 9, 2026, 9:53 AM PT

X.com Research Buzz

NLP
Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest
X.com
33195

Addison J. Wu, Ryan Liu, Shuyue Stella Li, Yulia Tsvetkov, Thomas L. Griffiths

Princeton University, University of Washington

NLP
Who's in Charge? Disempowerment Patterns in Real-World LLM Usage
X.com
4749

Mrinank Sharma, Miles McCain, Raymond Douglas, David Duvenaud

Anthropic, ACS Research Group, University of Toronto

AlphaXiv Trending

Reinforcement Learning
Recursive Multi-Agent Systems
AlphaXiv
209

Xiyuan Yang, Jiaru Zou, Rui Pan

University of Illinois at Urbana-Champaign, Stanford University, NVIDIA, MIT

Reasoning
MolmoAct2: Action Reasoning Models for Real-world Deployment
AlphaXiv
118

Haoquan Fang, Jiafei Duan, Donovan Clay

Allen Institute for AI, University of Washington, National University of Singapore, University of Pennsylvania, Johns Hopkins University, Amazon, Cortex AI, University of Michigan, University of North Carolina at Chapel Hill

Reinforcement Learning
On-Policy Distillation
AlphaXiv
116

Thinking Machines, Kevin Lu

Thinking Machines Lab

NLP
ProgramBench: Can Language Models Rebuild Programs From Scratch?
AlphaXiv
68

John Yang, Kilian Lieret, Jeffrey Ma

Meta FAIR, Meta TBD, Stanford University, Harvard University

RLDX-1 Technical Report
AlphaXiv
66

Dongyoung Kim, Huiwon Jang, Myungkyu Koo

RLWRLD, KAIST

Reinforcement Learning
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
AlphaXiv
57

Yuwen Du, Rui Ye, Shuo Tang

Shanghai Jiao Tong University

HuggingFace Daily Papers

Reinforcement Learning
StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
HuggingFace
16

Xiangyuan Xue, Yifan Zhou, Zidong Wang, Shengji Tang, Philip Torr

The Chinese University of Hong Kong, Shanghai Artificial Intelligence Laboratory, University of Georgia, University of Oxford, Shenzhen Loop Area Institute

Machine Learning
EMO: Pretraining Mixture of Experts for Emergent Modularity
HuggingFace
5

Ryan Wang, Akshita Bhagia, Sewon Min

University of California, Berkeley, Allen Institute for AI

Machine Learning
Prescriptive Scaling Laws for Data Constrained Training
HuggingFace
4

Justin Lovelace, Christian Belardi, Srivatsa Kundurthy, Shriya Sudhakar, Kilian Q. Weinberger

Cornell University

PianoCoRe: Combined and Refined Piano MIDI Dataset
HuggingFace
3

Ilya Borovik

Skolkovo Institute of Science and Technology

Multimodal
GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs
HuggingFace
2

Pranav Mantini, Shishir K. Shah

University of Houston, The University of Oklahoma

Generative Quantum-inspired Kolmogorov-Arnold Eigensolver
HuggingFace
2

Yu-Cheng Lin, Yu-Chao Hsu, I-Shan Tsai, Chun-Hua Lin, Kuo-Chung Peng

National Yang Ming Chiao Tung University, National Center for High-Performance Computing, National Institutes of Applied Research, National Cheng Kung University, University of California, San Diego, National Taiwan University, NVIDIA AI Technology Center