The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Brew Time: May 14, 2026, 7:32 AM PT

X.com Research Buzz

Reinforcement Learning
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
X.com
34177

τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Yao

Princeton University

Safety Alignment
Marked Pedagogies: Examining Linguistic Biases in Personalized Automated Writing Feedback
X.com
4223

Marked Pedagogies: Examining Linguistic Biases in Personalized Automated Writing Feedback

Mei Tan

Stanford University

AlphaXiv Trending

ELF: Embedded Language Flows
AlphaXiv
151

ELF: Embedded Language Flows

Keya Hu, Linlu Qiu, Yiyang Lu

Computer Vision
Qwen-Image-2.0 Technical Report
AlphaXiv
80

Qwen-Image-2.0 Technical Report

Bing Zhao, Chenfei Wu, Deqing Li

Reasoning
Solve the Loop: Attractor Models for Language and Reasoning
AlphaXiv
51

Solve the Loop: Attractor Models for Language and Reasoning

Jacob Fein-Ashley, Paria Rashidinejad

Robotics
World Action Models: The Next Frontier in Embodied AI
AlphaXiv
45

World Action Models: The Next Frontier in Embodied AI

Siyin Wang, Junhao Shi, Zhaoyang Fu

Reinforcement Learning
Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents
AlphaXiv
45

Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents

Zhiyuan Fan, Wenwei Jin, Feng Zhang

The Truth Lies Somewhere in the Middle (of the Generated Tokens)
AlphaXiv
42

The Truth Lies Somewhere in the Middle (of the Generated Tokens)

Sophie L. Wang, Phillip Isola, Brian Cheung

HuggingFace Daily Papers

Egangu
FeatCal: Feature Calibration for Post-Merging Models
HuggingFace
4

FeatCal: Feature Calibration for Post-Merging Models

Yanggan Gu, Shuo Cai, Zihao Wang, Wenjun Wang, Yuanyi Wang

The Hong Kong Polytechnic University

NLP
RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation
HuggingFace
3

RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation

Chengzhi Shen, Weixiang Shen, Tobias Susetzky, Chen

Technical University of Munich

Baoguangsheng
FAAST: Forward-Only Associative Learning via Closed-Form Fast Weights for Test-Time Supervised Adaptation
HuggingFace
1

FAAST: Forward-Only Associative Learning via Closed-Form Fast Weights for Test-Time Supervised Adaptation

Guangsheng Bao, Hongbo Zhang, Han Cui, Ke Sun, Yanbin Zhao

Text Intelligence Lab of Westlake University

NLP
PersonalAI 2.0: Enhancing knowledge graph traversal/retrieval with planning mechanism for Personalized LLM Agents
HuggingFace
1

PersonalAI 2.0: Enhancing knowledge graph traversal/retrieval with planning mechanism for Personalized LLM Agents

Mikhail Menschikov, Matvey Iskornev, Alexander Kharitonov, Alina Bogdanova, Mikhail Belkin

Skoltech

Retrieval
Retrieval from Within: An Intrinsic Capability of Attention-Based Models
HuggingFace
0

Retrieval from Within: An Intrinsic Capability of Attention-Based Models

Elad Hoffer, Yochai Blau, Edan Kinderman, Ron Banner, Daniel Soudry

NVIDIA

Computer Vision
M2Retinexformer: Multi-Modal Retinexformer for Low-Light Image Enhancement
HuggingFace
0

M2Retinexformer: Multi-Modal Retinexformer for Low-Light Image Enhancement

Youssef Aboelwafa, Hicham G. Elmongui, Marwan Torki

Alexandria University