Skip to content

News

News

What's happened, in chronological order.

2026 1 entry

  1. Milestone
    Ph.D. Graduation I have successfully defended my Ph.D. thesis and graduated from Shanghai Jiao Tong University! 🎉

2025 3 entries

  1. Writing
    A blog post sharing my perspective on KL estimators in reinforcement learning.
  2. Writing
    A blog post sharing my perspective on training–inference mismatch in reinforcement learning for large language models.
  3. Paper

2024 1 entry

  1. Paper
    Our work about zero-shot coordination evaluation ZSC-Eval is accepted by NeurIPS 2024 Dataset and Benchmark Track!

2023 2 entries

  1. Talk
    Give a talk about cooperative multi-agent reinforcement learning (Coordinate Agents vis Policy Optimization) at RLChina
  2. Paper
    Our work about policy optimization in cooperative multi-agent scenarios Order Matters: Agent-by-agent Policy Optimization is accepted by ICLR 2023!