This course aims to describe some of the recent progresses made in AI thanks to Deep Reinforcement Learning techniques.
We will learn - time permitting - to describe real-life problems as Markov Decision Processes (MDP), and to deal with them using dynamic programming or Reinforcement Learning, according whether a full distribution model, or only real experience or a sample model, is available.
1) The reinforcement learning problem: interation between agent-environment.
2) The reinforcement learning setup: Markov decision processes.
3) Prediction and control in dynamic programming.
4) Prediction and control in the model-free case: TD and MC methods.
5) Prediction and control with approximation.
6) Policy gradient methods.
7) Exploration versus exploitation: multi-armed bandits.
8) Reinforcement learning in perfect information two-player zero-sum games.
SEDE DI CHIETI
Via dei Vestini,31
Centralino 0871.3551
SEDE DI PESCARA
Viale Pindaro,42
Centralino 085.45371
email: info@unich.it
PEC: ateneo@pec.unich.it
Partita IVA 01335970693