Course: Reinforcement Learning

Reinforcement Learning

0 comments

Introduction

Welcome to Reinforcement Learning

Description:

An intelligent system is expected to generate policies autonomously to achieve a goal, which is mostly to maximize a given reward function. Reinforcement learning is a set of methods in machine learning that can produce such policies. To learn optimal actions in an environment that is not fully comprehensible to itself, an intelligent system can use reinforcement algorithms to leverage its experience to figure out optimal policies. Nowadays, reinforcement learning techniques are successfully applied in various engineering fields, including robotics (DeepMind’s walking robot) and computer-games (AlphaGo and TD-Gammon).

Developed independently from reinforcement learning, dynamic programming and related stochastic optimisation is a set of algorithms that generate policies assuming that the environment is fully comprehensible to the intelligent system. Therefore, dynamic programming provides an essential base to learn reinforcement learning. The course aims at building a fundamental understanding of both methods based on their relations to each other and on their applications to similar problems.

The course consists of the following topics:

Markov decision processes, dynamic programming for infinite time and stopping time, reinforcement learning, safe learning and verification tools for reinforcement learning.

Prerequisites: Basic knowledge of mathematics: calculus and probability

Learning objectives:

- General Introduction to machine learning herein Reinforcement Learning

- Markov Decision Processes and Dynamic Programming

- Reinforcement Learning with Temporal-Difference Learning

- Policy prediction with approximation

- Verification tool UPPAAL for model-based reinforcement learning

Key literature: Richard S. Sutton, Andrew G. Barto, Reinforcement Learning

Organizer: Rafal Wisniewski

Lecturers: Kim Guldstrand Larsen, Zheng-Hua Tan, Rafal Wisniewski, Marius Mikucionis, Abhijit Mazumdar, Rahul Misra

ECTS: 3

Date: 23, 24, 25, 26, 27 March 2026

Place: TBA

City: Aalborg

Number of seats: 50

Deadline: 2 March 2026

Important information concerning PhD courses:

There is a no-show fee of DKK 3,000 for each course where the student does not show up. Cancellations are accepted no later than 2 weeks before the start of the course. Registered illness is of course an acceptable reason for not showing up on those days. Furthermore, all courses open for registration approximately four months before start of the course.

We cannot ensure any seats before the deadline for enrolment, all participants will be informed after the deadline, approximately 3 weeks before the start of the course.

For inquiries regarding registration, cancellation or waiting list, please contact the PhD administration at phdcourses@adm.aau.dk When contacting us please state the course title and course period. Thank you.

To participate in the course, you must register here.

Enroll

Introduction

Share the course