
Self-Distilled Agentic Reinforcement Learning
Zhengxi Lu, Zhiyuan Yao, Zhuowen Han
Zhejiang University, Tsinghua University
The most upvoted and starred AI research crossing the community today.
Last Brew Time: May 19, 2026, 7:31 AM PT

Zhengxi Lu, Zhiyuan Yao, Zhuowen Han
Zhejiang University, Tsinghua University

Jianyuan Wang, Minghao Chen, Shangzhan Zhang

Jingdi Lei, Di Zhang, Junxian Li

Haoyi Zhu, Haozhe Liu, Yuyang Zhao
NVIDIA

Shashwat Goel, Nikhil Chandak, Arvindh Arun

Sahil Sen, Akhil Kasturi, Elias Lumer
Haolin Chen, Deon Metelski, Leon Qi, Tao Xia, Joonyul Lee
actAVA AI
Zhiqiang Liu, Wenhui Dong, Yilang Tan, Yuwen Qu, Haochen Yin
Pi3AI
Yifan Shen, Jiawen Zhang, Jian Xu, Junho Kim, Ismini Lourentzou
PediaMed AI
Ziyun Zeng, Hang Hua, Bocheng Zou, Mu Cai, Rogerio Feris
Yiming Zhao, Yu Zeng, Wenxuan Huang, Zhen Fang, Qing Miao
Wenjun Wang, Yanggan Gu, Shuo Cai, Yuanyi Wang, Pengkai Wang
The Hong Kong Polytechnic University
System online.