User:Antonov86/Books/Reinforcement learning


Reinforcement learning

edit
Markov decision process
Supervised learning
Unsupervised learning
Monte Carlo method
Temporal difference learning
Multi-armed bandit
Q-learning
Reinforcement learning