Xihuai Wang's Page

APEX Lab, Shanghai Jiao Tong University.

portrait.png

Shanghai, China

Xihuai Wang is currently a Ph.D. candidate at Shanghai Jiao Tong University, supervised by Prof. Weinan Zhang and Prof. Ying Wen. He is a member of SJTU-Apex Group and a member of SJTU-MARL Group leaded by Prof. Ying Wen, and is also selected into Wen-Tsun Wu AI Honorary Doctoral Class in 2020. Xihuai earned his B.Eng. in Computer Science and Technology at School of Computer Science and Engineering, Sun Yat-sen University in 2020.

Xihuai’s research interests include Decision-making and Multi-agent System. Specifically, he is now focusing on

  • Multi-agent Decision-making in Cooperative Scenarios, especially
    • Efficiency of Cooperative Multi-agent Reinforcement Learning;
    • Zero-shot Generalization Ability in Cooperative Multi-agent Systems.
  • Large Language Models for Decision-making, specifically human-ai collaboration.

news

Sep 26, 2024 Our work about zero-shot coordination evaluation ZSC-Eval is accepted by NeurIPS 2024 Dataset and Benchmark Track!
Aug 8, 2023 Give a talk about cooperative multi-agent reinforcement learning (Coordinate Agents vis Policy Optimization) at RLChina.
Mar 25, 2023 Our work about policy optimization in cooperative multi-agent scenarios Order Matters: Agent-by-agent Policy Optimization is accepted by ICLR 2023!

selected publications

  1. Human-AI Collaboration
    Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task
    Shao Zhang*Xihuai Wang*, Wenhao Zhang, Yongshan Chen, Landi Gao, Dakuo Wang, Weinan Zhang, Xinbing Wang, and Ying Wen
    Under Review., 2024
  2. MARL Generalization
    ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination
    Xihuai WangShao Zhang, Wenhao Zhang, Wentao Dong, Jingxiao Chen, Ying Wen, and Weinan Zhang
    38th NeurIPS Dataset and Benchmark Track, 2024
  3. MARL Efficiency
    Order Matters: Agent-by-agent Policy Optimization
    Xihuai Wang, Zheng Tian, Ziyu Wan, Ying WenJun Wang, and Weinan Zhang
    11th ICLR, 2023
  4. MARL Efficiency
    Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
    Weinan ZhangXihuai Wang, Jian Shen, and Ming Zhou
    30th IJCAI, 2021