强化学习数学基础（开源教材）

Summary

西湖大学 Shiyu Zhao 教授编写的开源强化学习教材，10k+ stars，Springer Nature + 清华大学出版社联合出版（2024-2025）。"数学严谨但友好"，10章涵盖 Bellman 方程、值迭代到策略梯度、Actor-Critic，配套 54讲 YouTube 系列（210万+播放）和 Python/MATLAB 代码。

Key Concepts

Reinforcement Learning — 系统性数学基础教材
Bellman Equation — 值迭代与策略迭代的数学核心
Policy Gradient — 策略梯度方法
Actor Critic — 值函数+策略网络结合方法

Part	主题
Part 1	Bellman 方程、值迭代、策略迭代
Part 2	Monte Carlo、TD 学习、值函数近似、策略梯度、Actor-Critic

Resources

完整 PDF + LaTeX 讲义
Python/MATLAB/R/C++ 代码
YouTube 54讲英文系列

CMU 10-202: Introduction to Modern AI - 现代AI入门课程
大语言模型LLM学习路径和资料汇总（ninehills）
How to Think About GPUs - 大模型训练GPU架构深度指南

Summary​

Key Concepts​

Contents​

Resources​

Related Topics​

Summary

Key Concepts

Contents

Resources

Related Topics