Xihuai Wang's Page
  • About
  • Blog
  • Publications
  • CV

LLM-RL Training–Inference Mismatch Blog

November 23, 2025
📅 2025

A blog post sharing my perspective on training–inference mismatch in reinforcement learning for large language models.

  • English Version | 中文版本 | 知乎 Zhihu | 青稞 AI 公众号 WeChat
Contents
© Copyright 2026 Xihuai Leo Wang. Last updated: February 22, 2026.