The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Brew Time: May 27, 2026, 7:31 AM PT

X.com Research Buzz

Reasoning
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
X.com
8727

Parshin Shojaee, Iman Mirzadeh, Keivan Alizadeh, Maxwell Horton, Samy Bengio, Mehrdad Farajtabar

Apple, Virginia Tech

AlphaXiv Trending

Computer Vision
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion
AlphaXiv
82

Yifan Lu, Qi Wu, Jay Zhangjie Wu

Reinforcement Learning
SkillOpt: Executive Strategy for Self-Evolving Agent Skills
AlphaXiv
74

Yifan Yang, Ziyang Gong, Weiquan Huang

Microsoft, Shanghai Jiao Tong University, Tongji University, Fudan University

NLP
Language Models Need Sleep
AlphaXiv
51

Sangyun Lee, Sean McLeish, Tom Goldstein

Carnegie Mellon University, University of Maryland

Machine Learning
Training-Free Looped Transformers
AlphaXiv
44

Lizhang Chen, Jonathan Li, Chen Liang

Computer Vision
HumanEgo: Zero-Shot Robot Learning from Minutes of Human Egocentric Videos
AlphaXiv
24

Wang, Botao He, Kelin Yu

University of Maryland

Computer Vision
TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction
AlphaXiv
22

Weijie Wang, Zimu Li, Jinchuan Shi

Zhejiang University, ETH AI Center, Microsoft, Monash University

HuggingFace Daily Papers

NLP
D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing
HuggingFace
31

Aoxi Liu, Yupeng Chen, James Oldfield, Guanzhe Hong, Junchi Yu

University of Oxford

Retrieval
Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini
HuggingFace
4

Madhuri Shanbhogue, Zhe Li, Shanfeng Zhang, Gustavo Hernández Ábrego, Shih-Cheng Huang

Google

NLP
ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models
HuggingFace
2

Yujie Lin, Chengyi Yang, Zhishang Xiang, Yiping Song, Jinsong Su

NLP
Agentic CLEAR: Automating Multi-Level Evaluation of LLM Agents
HuggingFace
1

Asaf Yehudai, Lilach Eden, Michal Shmueli-Scheuer

IBM Research

Jjzha
CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations
HuggingFace
1

Mike Zhang, Ali Basirat, Desmond Elliott

Hitxueliang
STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media
HuggingFace
0

Liang Xue, Haoyu Liu, Cheng Wang, Pengyu Chen, Haozhuo Zheng

Byering