Variation of Q-Learning, Other RL Algorithms
Markov Chains, Google PageRank
Temporal Difference Learning, Q-Learning, Deep Q-Network
Irreducible Markov chains, Reversible Marcov chains, Random walk on network
Credit Assignment Problem, Policy Gradient, MDP