The Research Desk.

The most upvoted and starred AI research crossing the community today.

Last Brew Time: Jun 1, 2026, 7:30 AM PT

X.com Research Buzz

The AI Layoff Trap
X.com
18695

Brett Hemenway Falk, Gerry Tsoukalas

University of Pennsylvania, Boston University

StoryScope: Investigating idiosyncrasies in AI fiction
X.com
4104

Jenna Russell, Rishanth Rajendhran, Chau Minh Pham, Mohit Iyyer, John Wieting

University of Maryland, College Park, Google DeepMind

Computer Vision
LocateAnything: Fast and High-Quality Vision-Language Grounding and Detection
X.com
2511

NVIDIA

AlphaXiv Trending

NLP
AlphaXiv
195

Do Language Models Need Sleep? Offline Recurrence for Improved Online Inference

Sangyun Lee, Sean McLeish, Tom Goldstein

Carnegie Mellon University, University of Maryland

AlphaXiv
149

When Does LeJEPA Learn a World Model?

David Klindt, Yann LeCun, Randall Balestriero

Cold Spring Harbor Laboratory, New York University, Brown University

Computer Vision
AlphaXiv
132

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Qiuyue Wang, Mingsheng Li, Jian Guan

AlphaXiv
60

Learn from your own latents and not from tokens: A sample-complexity theory

Daniel J. Korchinski, Alessandro Favero, Matthieu Wyart

Institute of Physics, Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Department of Physics & Institute of Physics, Johns Hopkins University & EPFL

NLP
AlphaXiv
56

Self-Improving Language Models with Bidirectional Evolutionary Search

Guowei Xu, Zhenting Qi, Huangyuan Su

Harvard University

Reinforcement Learning
AlphaXiv
55

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Fangfu Liu, Kai He, Tianchang Shen

HuggingFace Daily Papers

Mellum2 Technical Report
HuggingFace
25

Marko Kojic, Ivan Bondyrev, Aral de Moor, Joseph Shtok, Petr Borovlev

JetBrains

Retrieval
How can embedding models bind concepts?
HuggingFace
3

Arnas Uselis, Darina Koishigarina, Seong Joon Oh

Computer Vision
VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies
HuggingFace
2

Mingjian Gao, Wenqiao Zhang, Yuqian Yuan, Yang Dai, Binhe Yu

Zhejiang University

Machine Learning
One Click per Cell Type Suffices: Training-free Group Interaction for Cell Instance Segmentation
HuggingFace
1

Sanghyun Jo, Seo Jin Lee, Seohyung Hong, Yoorim Gang, Hyeongsub Kim

Seoul National University

NLP
Memory-Bound but Not Bandwidth-Limited: The Physical AI Inference Gap in Batch-1 LLM Decode
HuggingFace
0

Josef Chen

JYe9
A Topology-Aware Spatiotemporal Handover Framework for Continuous Multi-UAV Tracking
HuggingFace
0

Jianlin Ye, Christos Kyrkou, Panayiotis Kolios