2023

Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners (ZSC Evaluation)
Xihuai Wang, Shao Zhang, Wenhao Zhang, Wentao Dong, Jingxiao Chen, Ying Wen, and Weinan Zhang
Preprint, 2023

Order Matters: Agent-by-agent Policy Optimization (A2PO)
Xihuai Wang, Zheng Tian, Ziyu Wan, Ying Wen, Jun Wang, and Weinan Zhang
In The Eleventh International Conference on Learning Representations, 2023

2022

Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects
Xihuai Wang, Zhicheng Zhang, and Weinan Zhang
arXiv, 2022

2021

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts (AORPO)
Weinan Zhang, Xihuai Wang, Jian Shen, and Ming Zhou
IJCAI, 2021