The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Updated: Apr 30, 2026, 9:32 AM PT

X.com Research Buzz

The AI Layoff Trap
X.com
17230

The AI Layoff Trap

Brett Hemenway Falk, Gerry Tsoukalas

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
X.com
6521

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Parshin Shojaee, Iman Mirzadeh, Rishabh Agarwal +5 more

#reasoning

AlphaXiv Trending

DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence
AlphaXiv
329

DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence

DeepSeek-AI

#efficiency#alphaxiv
There Will Be a Scientific Theory of Deep Learning
AlphaXiv
200

There Will Be a Scientific Theory of Deep Learning

Jamie Simon, Daniel Kunin, Alexander Atanasov

#machine-learning#alphaxiv
Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity
AlphaXiv
80

Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity

Bojie Li

#nlp#alphaxiv
The Recurrent Transformer: Greater Effective Depth and Efficient Decoding
AlphaXiv
79

The Recurrent Transformer: Greater Effective Depth and Efficient Decoding

Costin-Andrei Oncescu, Depen Morwani, Samy Jelassi

#nlp#efficiency#reasoning#alphaxiv
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
AlphaXiv
72

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Zhiheng Liu, Weiming Ren, Xiaoke Huang

#computer-vision#retrieval#multimodal#alphaxiv
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
AlphaXiv
50

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Weijie Wang, Xiaoxuan He, Youping Gu

#computer-vision#alphaxiv

HuggingFace Daily Papers

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
HuggingFace
71

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

V Team, Wenyi Hong, Xiaotao Gu +2 more

#reinforcement-learning#multimodal#zai-org
Large Language Models Explore by Latent Distilling
HuggingFace
48

Large Language Models Explore by Latent Distilling

Yuanhao Zeng, Ao Lu, Lufei Li +2 more

#nlp#efficiency#LinesHogan
RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments
HuggingFace
32

RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments

Zaid Nasser, Mikhail Iumanov, Tianhao Li +2 more

#multimodal#be2rlab
A Survey on LLM-based Conversational User Simulation
HuggingFace
2

A Survey on LLM-based Conversational User Simulation

Bo Ni, Leyao Wang, Yu Wang +2 more

#nlp
FASH-iCNN: Making Editorial Fashion Identity Inspectable Through Multimodal CNN Probing
HuggingFace
1

FASH-iCNN: Making Editorial Fashion Identity Inspectable Through Multimodal CNN Probing

Morayo Danielle Adeyemi, Ryan A. Rossi, Franck Dernoncourt

#multimodal
Probing Visual Planning in Image Editing Models
HuggingFace
0

Probing Visual Planning in Image Editing Models

Zhimu Zhou, Yanpeng Zhao, Qiuyu Liao +2 more

#computer-vision#reinforcement-learning#spatigen