Reinforcement Learning (2025)
Enrolment options
Welcome to Reinforcement Learning (2025)
Description: An intelligent system is expected to ...
Welcome to Reinforcement Learning (2025)
Description: An intelligent system is expected to generate policies autonomously to achieve a goal, which is mostly to maximize a given reward function or minimize a given cost function.
Reinforcement learning is a set of methods in machine learning that can produce such policies. To learn optimal actions in an environment that is not fully comprehensible to itself, an intelligent system can use reinforcement algorithms to leverage its experience to figure out optimal policies. Nowadays, reinforcement learning techniques are successfully applied in various engineering fields, including robotics (DeepMind’s walking robot) and computers playing games (AlphaGo and TD-Gammon).
Developed independently from reinforcement learning, dynamic programming is a set of algorithms in optimal control theory that generate policies assuming that the environment is fully comprehensible to the intelligent system. Therefore, dynamic programming provides an essential base to learn reinforcement learning. The course aims at building a fundamental understanding of both methods based on their relations to each other and on their applications to similar problems.
The course consists of the following topics:
Markov decision processes, dynamic programming for infinite time and stopping time, reinforcement learning, and verification tools for reinforcement learning.
Prerequisites: Basic knowledge of mathematics: calculus and probability
Learning objectives:
- General Introduction to machine learning herein Reinforcement Learning
- Markov Decision Processes and Dynamic Programming
- Reinforcement Learning with Temporal-Difference Learning
- Policy prediction with approximation
- Verification tool UPPAAL for model-based reinforcement learning
Key literature: TBA
Organizer: Rafal Wisniewski
Lecturers: Kim Guldstrand Larsen, Zheng-Hua Tan, Rafal Wisniewski, Marius Mikucionis, Abhijit Mazumdar
ECTS: 2.0
Time: 31 March to 4th April 2025
Place: Aalborg University
From March 31st to April 2nd: Fredrik Bajers Vej 7, C3-204
April 3rd: Kroghstræde 3, room 3.136
April 4th: Fredrik Bajers Vej 7, C3-204
Maximal number of participants: 40
Deadline: 10 March 2025
Important information concerning PhD courses:
There is a no-show fee of DKK 3,000 for each course where the student does not show up. Cancellations are accepted no later than 2 weeks before the start of the course. Registered illness is of course an acceptable reason for not showing up on those days. Furthermore, all courses open for registration approximately four months before start of the course.
We cannot ensure any seats before the deadline for enrolment, all participants will be informed after the deadline, approximately 3 weeks before the start of the course.
For inquiries regarding registration, cancellation or waiting list, please contact the PhD administration at phdcourses@adm.aau.dk When contacting us please state the course title and course period. Thank you.