news
| Dec 2, 2025 | A blog post sharing my perspective on KL estimators in reinforcement learning. |
|---|---|
| Nov 23, 2025 | A blog post sharing my perspective on training–inference mismatch in reinforcement learning for large language models. |
| May 16, 2025 | Our paper Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration has been accepted to ACL 2025! |
| Sep 26, 2024 | Our work on zero-shot coordination evaluation, ZSC-Eval, was accepted to the NeurIPS 2024 Datasets and Benchmarks Track! |
| Aug 8, 2023 | Gave a talk on cooperative multi-agent reinforcement learning (Coordinate Agents via Policy Optimization) at RLChina. |
| Mar 25, 2023 | Our work on policy optimization in cooperative multi-agent scenarios, Order Matters: Agent-by-agent Policy Optimization, was accepted to ICLR 2023! |