[Hands-On ML] 18. Reinforcement Learning - 3

Temporal Difference Learning, Q-Learning, Deep Q-Network

[Statistics 110] 32. Markov Chains Continued

Irreducible Markov chains, Reversible Marcov chains, Random walk on network

[Hands-On ML] 18. Reinforcement Learning - 2

Credit Assignment Problem, Policy Gradient, MDP

[Statistics 110] 31. Markov Chains

Markov Chains

[Hands-On ML] 18. Reinforcement Learning - 1

Basic concept of RL, Policy, OpenAI Gym, NN policy