Reinforcement Learning

Content

Reinforcement Learning (RL) is a sub-field of machine learning in which an artificial agent has to interact with its environment and learn how to improve its behaviour by trial and error. For doing so, the agent is provided with an evaluative feedback signal, called reward, that he perceives for each action performed in its environment. RL is one of the hardest machine learning problems, as, in contrast to standard supervised learning, we do not know the targets (i.e. the optimal actions) for our inputs (i.e. the state of the environment) and we also need to consider the long-term effects of the agent’s actions on the state of the environment. Due to recent successes, RL has gained a lot of popularity with applications in robotics, automation, health care, trading and finance, natural language processing, autonomous driving and computer games. This lecture will introduce the concepts and theory of RL and review current state of the art methods with a particular focus on RL applications in robotics. An exemplary list of topics is given below:

  • Primer in Machine Learning and Deep Learning
  • Supervised Learning of Behaviour
  • Introduction in Reinforcement Learning
  • Dynamic Programming
  • Value Based Methods
  • Policy Optimization and Trust Regions
  • Episodic Reinforcement Learning and Skill Learning
  • Bayesian Optimization
  • Variational Inference, Max-Entropy RL and Versatility
  • Model-based Reinforcement Learning
  • Offline Reinforcement Learning
  • Inverse Reinforcement Learning
  • Hierarchical Reinforcement Learning
  • Exploration and Artificial Curiosity
  • Meta Reinforcement Learning

Lernziele:

- Students are able to understand the RL problem and challenges.

- Students can differentiate between different RL algorithm and understand their underlying theory

- Students will know the mathematical tools necessary to understand RL algorithms

- Students can implement RL algorithms for various tasks

- Students understand current research questions in RL

Empfehlungen:

  • Der Vorlesungsinhalt von Maschinelles Lernen – Grundverfahren wird vorausgesetzt
  • Gute Python Kenntnisse erforderlich
  • Gute mathematische Grundkenntnisse

 

 Erfolgskontrolle: Siehe Modulhandbuch!

Arbeitsaufwand:

180h, aufgeteilt in: 

  • ca 45h Vorlesungsbesuch
  • ca 15h Übungsbesuch
  • ca 90h Nachbearbeitung und Bearbeitung der Übungsblätter

ca 30h Prüfungsvorbereitung

 

Language of instructionEnglish
Organisational issues

6 ECTS 

Vorlesungs-und Übungsturnus: Siehe ILIAS