The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Brew Time: Jun 5, 2026, 7:39 AM PT

X.com Research Buzz

The AI Layoff Trap
X.com
19455

The AI Layoff Trap

Brett Hemenway Falk, Gerry Tsoukalas

University of Pennsylvania, Boston University

AlphaXiv Trending

Cosmos 3: Omnimodal World Models for Physical AI
AlphaXiv
88

Cosmos 3: Omnimodal World Models for Physical AI

Aditi, Niket Agarwal, Arslan Ali

Reinforcement Learning
Trust Region On-Policy Distillation
AlphaXiv
52

Trust Region On-Policy Distillation

Xingrun Xing, Haoqing Wang, Boyan Gao

Samsung Research, Beijing, China, University of Oxford, Peking University

Computer Vision
Qwen-Image-Flash: Beyond Objective Design
AlphaXiv
42

Qwen-Image-Flash: Beyond Objective Design

Tianhe Wu, Kun Yan, Zikai Zhou

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters
AlphaXiv
41

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Vin Bo, Song Cao

Mind Lab

Speech Audio
WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling
AlphaXiv
39

WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling

Wenxi Chen, Dongya Jia, Yushen Chen

Shanghai Jiao Tong University, Shanghai Innovation Institute, ByteDance Seed

Computer Vision
VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization
AlphaXiv
35

VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization

Junhao Cheng, Liang Hou, Tianxiong Zhong

City University of Hong Kong

HuggingFace Daily Papers

Computer Vision
Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation?
HuggingFace
12

Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation?

Rui Zhao, Kaiming Yang, Jifeng Zhu, Siyang Chen, Ziqi Wang

Towards One-to-Many Temporal Grounding
HuggingFace
4

Towards One-to-Many Temporal Grounding

Qi Xu, Yue Tan, Shihao Chen, Jiahao Meng, Anna Wang

ByteDance

NLP
Revising Context, Shifting Simulated Stance: Auditing LLM-Based Stance Simulation in Online Discussions
HuggingFace
1

Revising Context, Shifting Simulated Stance: Auditing LLM-Based Stance Simulation in Online Discussions

Xinnong Zhang, Wanting Shan, Hanjia Lyu, Zhongyu Wei, Jiebo Luo

NLP
Absorbing Complexity: An Interaction-Native Knowledge Harness for Financial LLM Agents
HuggingFace
1

Absorbing Complexity: An Interaction-Native Knowledge Harness for Financial LLM Agents

Ailiya Borjigin, Igor Stadnyk, Ben Bilski, Maksym Chikita, Dmytro Kyrylenko

INC4

Efficiency
SEAOTTER: Sensor Embedded Autoencoding with One-Time Transcode for Efficient Reconstruction
HuggingFace
1

SEAOTTER: Sensor Embedded Autoencoding with One-Time Transcode for Efficient Reconstruction

Dan Jacobellis, Neeraja J. Yadwadkar

NLP
ForeSci: Evaluating LLM Agents for Forward-Looking AI Research Judgment
HuggingFace
0

ForeSci: Evaluating LLM Agents for Forward-Looking AI Research Judgment

Qiuyu Tian, Haojie Yin, Yingce Xia, Youyong Kong, Zequn Liu