RL(2): Sample-based Learning Methods

雖然以 Coursera 課程 (Sample-based Learning Methods) 來當 RL 的第二份筆記

Textbook [Sutton&Barto]; Reference solution: [ref sol]; Course Quiz and Assignment: [quiz sol]

但個人完全偏好赵世钰老師的 RL 課程和課本: 强化学习的数学原理 [YouTube] 的內容, 因此會以趙老師內容來記錄.

第二份 RL 筆記對應到趙老師的內容為 Chapter 5 to 7

Stochastic approximation 很重要的 chapter