Xihuai Wang's Page
  • about
  • Xihuai's Blog
  • publications
  • cv

LLM-RL Training–Inference Mismatch Blog

November 23, 2025

📅 2025

A blog post sharing my perspective on training–inference mismatch in reinforcement learning for large language models.

  • English Version | 中文版本 | 知乎版本 Zhihu | 青稞 AI 公众号 WeChat
© Copyright 2025 Xihuai Leo Wang. Last updated: December 05, 2025.