Xihuai Wang's Page

APEX Lab, Shanghai Jiao Tong University.


Shanghai, China

Xihuai Wang is currently a Ph.D. candidate at Shanghai Jiao Tong University, supervised by Prof. Weinan Zhang and Prof. Ying Wen. He is a member of the SJTU-Apex Group and of the SJTU-MARL Group led by Prof. Ying Wen, and was selected for the Wen-Tsun Wu AI Honorary Doctoral Class in 2020. Xihuai earned his B.Eng. in Computer Science and Technology from the School of Computer Science and Engineering, Sun Yat-sen University, in 2020.

Xihuai’s research interests include decision-making and multi-agent systems. Specifically, he is now focusing on:

  • Multi-agent Decision-making in Cooperative Scenarios, especially
    • Efficiency of Cooperative Multi-agent Reinforcement Learning;
    • Zero-shot Generalization Ability in Cooperative Multi-agent Systems.
  • Large Language Models for Decision-making, specifically human-AI collaboration.

news

Sep 26, 2024 Our work on zero-shot coordination evaluation, ZSC-Eval, was accepted to the NeurIPS 2024 Datasets and Benchmarks Track!
Aug 8, 2023 Gave a talk on cooperative multi-agent reinforcement learning (Coordinate Agents via Policy Optimization) at RLChina.
Mar 25, 2023 Our work on policy optimization in cooperative multi-agent scenarios, Order Matters: Agent-by-agent Policy Optimization, was accepted to ICLR 2023!

selected publications

  1. Language Agent
    Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration
    Shao Zhang*, Xihuai Wang*, Wenhao Zhang, Chaoran Li, Junru Song, Tingyu Li, Lin Qiu, Xuezhi Cao, Xunliang Cai, Wen Yao, Weinan Zhang, Xinbing Wang, and Ying Wen
    Preprint Under Review, 2025
  2. MARL Generalization
    ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination
    Xihuai Wang, Shao Zhang, Wenhao Zhang, Wentao Dong, Jingxiao Chen, Ying Wen, and Weinan Zhang
    38th NeurIPS Datasets and Benchmarks Track, 2024
  3. MARL Efficiency
    Order Matters: Agent-by-agent Policy Optimization
    Xihuai Wang, Zheng Tian, Ziyu Wan, Ying Wen, Jun Wang, and Weinan Zhang
    11th ICLR, 2023
  4. MARL Efficiency
    Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
    Weinan Zhang, Xihuai Wang, Jian Shen, and Ming Zhou
    30th IJCAI, 2021