IRL - Lectures - Reinforcement Learning

Reinforcement Learning

Type: Lecture / Practice (VÜ)
Chair: KIT-Fakultäten - KIT-Fakultät für Informatik - Institut für Anthropomatik und Robotik - IAR Neumann
Semester: WS 23/24
Time: Thu 2023-10-26
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)
more...

Fri 2023-10-27
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2023-11-02
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2023-11-03
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2023-11-09
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2023-11-10
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2023-11-16
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2023-11-17
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2023-11-23
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2023-11-24
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2023-11-30
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2023-12-01
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2023-12-07
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2023-12-08
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2023-12-14
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2023-12-15
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2023-12-21
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2023-12-22
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2024-01-11
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2024-01-12
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2024-01-18
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2024-01-19
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2024-01-25
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2024-01-26
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2024-02-01
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2024-02-02
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2024-02-08
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2024-02-09
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)

Thu 2024-02-15
09:45 - 11:15, weekly
11.10 Engelbert-Arnold-Hörsaal (EAS)
11.10 Elektrotechnisches Institut (ETI) (1. OG)

Fri 2024-02-16
11:30 - 13:00, weekly
30.46 Chemie, Neuer Hörsaal
30.46 Chemie-Hörsaalgebäude (EG)
Lecturer: Prof. Dr. Gerhard Neumann
TT-Prof. Dr. Rudolf Lioutikov
Mevlüt Onur Celik
SWS: SWS: 4 / ECTS: 6
Lv-No.: 2400163
Information: On-Site

Links

ILIAS-Course

Content

Reinforcement Learning (RL) is a sub-field of machine learning in which an artificial agent has to interact with its environment and learn how to improve its behaviour by trial and error. For doing so, the agent is provided with an evaluative feedback signal, called reward, that he perceives for each action performed in its environment. RL is one of the hardest machine learning problems, as, in contrast to standard supervised learning, we do not know the targets (i.e. the optimal actions) for our inputs (i.e. the state of the environment) and we also need to consider the long-term effects of the agent’s actions on the state of the environment. Due to recent successes, RL has gained a lot of popularity with applications in robotics, automation, health care, trading and finance, natural language processing, autonomous driving and computer games. This lecture will introduce the concepts and theory of RL and review current state of the art methods with a particular focus on RL applications in robotics. An exemplary list of topics is given below:

Primer in Machine Learning and Deep Learning
Supervised Learning of Behaviour
Introduction in Reinforcement Learning
Dynamic Programming
Value Based Methods
Policy Optimization and Trust Regions
Episodic Reinforcement Learning and Skill Learning
Bayesian Optimization
Variational Inference, Max-Entropy RL and Versatility
Model-based Reinforcement Learning
Offline Reinforcement Learning
Inverse Reinforcement Learning
Hierarchical Reinforcement Learning
Exploration and Artificial Curiosity
Meta Reinforcement Learning

Lernziele:

- Students are able to understand the RL problem and challenges.

- Students can differentiate between different RL algorithm and understand their underlying theory

- Students will know the mathematical tools necessary to understand RL algorithms

- Students can implement RL algorithms for various tasks

- Students understand current research questions in RL

Empfehlungen:

Der Vorlesungsinhalt von Maschinelles Lernen – Grundverfahren wird vorausgesetzt
Gute Python Kenntnisse erforderlich
Gute mathematische Grundkenntnisse

Erfolgskontrolle: Siehe Modulhandbuch!

Arbeitsaufwand:

180h, aufgeteilt in:

ca 45h Vorlesungsbesuch
ca 15h Übungsbesuch
ca 90h Nachbearbeitung und Bearbeitung der Übungsblätter

ca 30h Prüfungsvorbereitung

Language of instruction

English

Organisational issues

ECTS von 5 auf 6 erhöht

Vorlesungs-und Übungsturnus: Siehe ILIAS