News
What's happened, in chronological order.
2026 1 entry
-
Milestone
I have successfully defended my Ph.D. thesis and graduated from Shanghai Jiao Tong University! 🎉
2025 3 entries
-
Writing
A blog post sharing my perspective on KL estimators in reinforcement learning.
- English Version | Chinese Version | Zhihu | Qingke AI (青稞 AI) WeChat Official Account
-
Writing
A blog post sharing my perspective on training–inference mismatch in reinforcement learning for large language models.
- English Version | Chinese Version | Zhihu | Qingke AI (青稞 AI) WeChat Official Account
-
Paper
Our paper Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration has been accepted to ACL 2025!
2024 1 entry
-
Paper
Our work on zero-shot coordination evaluation, ZSC-Eval, has been accepted to the NeurIPS 2024 Datasets and Benchmarks Track!
2023 2 entries
-
Talk
Gave a talk on cooperative multi-agent reinforcement learning (Coordinate Agents via Policy Optimization) at RLChina.
-
Paper
Our work on policy optimization in cooperative multi-agent scenarios, Order Matters: Agent-by-agent Policy Optimization, has been accepted to ICLR 2023!