The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Updated: Apr 23, 2026, 9:42 AM PT

X.com Research Buzz

Agents of Chaos
X.com
3461

Natalie Shapira, Chris Wendler, Avery Yen +35 more

#reinforcement-learning #openclaw

AI Agent Traps
X.com
2080

Matija Franklin, Nenad Tomašev, Julian Jacobs +2 more

#reinforcement-learning

AlphaXiv Trending

Qwen3.5-Omni Technical Report
AlphaXiv
132

Qwen Team

#alphaxiv

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence
AlphaXiv
112

Guanting Dong, Junting Lu, Junjie Huang

#reinforcement-learning #alphaxiv

Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems
AlphaXiv
90

Jiacheng Liu, Xiaohan Zhao, Xinyi Shang

#reinforcement-learning #alphaxiv

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
AlphaXiv
45

Jinghui Lu, Jiayi Guan, Zhijian Huang

#computer-vision #reinforcement-learning #reasoning #alphaxiv

Neural Garbage Collection: Learning to Forget while Learning to Reason
AlphaXiv
40

Michael Y. Li, Jubayer Ibn Hamid, Emily B. Fox

#alphaxiv

Image Generators are Generalist Vision Learners
AlphaXiv
37

Valentin Gabeur, Shangbang Long, Songyou Peng

#computer-vision #alphaxiv

HuggingFace Daily Papers

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis
HuggingFace
19

Kanzhi Cheng, Zehao Li, Zheng Ma +2 more

#reinforcement-learning #njucckevin

Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL
HuggingFace
2

Skylar Zhai, Jingcheng Liang, Dongyeop Kang

Image Generators are Generalist Vision Learners
HuggingFace
2

Valentin Gabeur, Shangbang Long, Songyou Peng +2 more

#computer-vision

Streaming Structured Inference with Flash-SemiCRF
HuggingFace
1

Benjamin K. Johnson, Thomas Goralski, Ayush Semwal +2 more

#efficiency #biobenkj

COMPASS: COntinual Multilingual PEFT with Adaptive Semantic Sampling
HuggingFace
0

Noah Flynn

Benign Fine-Tuning Breaks Safety Alignment in Audio LLMs
HuggingFace
0

Jaechul Roh, Amir Houmansadr

#machine-learning #nlp #speech-audio