Checking date: 20/05/2024


Course: 2024/2025

Reinforcement Learning
(19209)
Master in Applied Artificial Intelligence (Plan: 475 - Estudio: 378)
EPI


Coordinating teacher: FERNANDEZ REBOLLO, FERNANDO

Department assigned to the subject: Computer Science and Engineering Department

Type: Electives
ECTS Credits: 3.0 ECTS

Course:
Semester:




Requirements (Subjects that are assumed to be known)
Machine Learning contents are recommended for the Reinforcement Learning course
Description of contents: programme
Introduction to Reinforcement Learning - Introduction to reinforcement learning - Markov Decision Processes - Policies and optimality: discounted infinite horizon - Value Functions Dynamic Programming - Problem Solving on MDP: model-free, model-based and dynamic programming methods - Policy Iteration Algorithm - Value Iteration Algorithm Direct reinforcement learning - Monte Carlo methods: and Monte Carlo with exploratory start - Model-free methods: Q-Learning - Example of execution of Q-Learning - On-policy methods vs. off-policy: SARSA - Exploration and exploitation: e-greedy and softmax Model-Based Methods - Model Learning - Dyna-Q Representation in Reinforcement Learning - Representation of the space of states, actions and Q - State space discretization: uniform and adaptive methods - Approximate methods to represent the function Q: Batch Q-Learning Generalization Through Function Approximation - Approximation through neural networks - Deep reinforcement learning Policy Search Methods - Policy Approximation - Actor-critic methods - Proximal Policy Optimization (PPO) Other Reinforcement Learning topics - Hierarchical Reinforcement Learning - Transfer of learning learned - Multi-agent Reinforcement Learning - Safe reinforcement learning - Offline Reinforcement Learning - Multi-objective Reinforcement Learning - Partially observable Reinforcement Learning Reinforcement Learning in the real world: - Applications of reinforcement learning - Reinforcement learning frameworks and software
Learning activities and methodology
Training Activities: -------------------------- AF1 - Theoretical class AF3 - Theoretical-practical classes AF5 - Individual and group tutorials AF6 - Group work AF7 - Individual student work Teaching methodology: ------------------------ MD1: Class lectures by the lecturer with the support of computer and audiovisual media, in which the main concepts of the subject are developed and the bibliography is provided to complement the students' learning. MD2: Critical reading of texts recommended by the subject teacher: press articles, reports, manuals and/or academic articles, either for subsequent discussion in class or to expand and consolidate knowledge of the subject. MD3: Resolution of practical cases, problems, etc... posed by the teacher individually or in groups. MD4: Presentation and discussion in class, under the moderation of the teacher, of topics related to the content of the subject, as well as practical cases. MD5: Preparation of individual or group work and reports.
Assessment System
  • % end-of-term-examination 30
  • % of continuous assessment (assigments, laboratory, practicals...) 70

Calendar of Continuous assessment


Basic Bibliography
  • Richard Sutton and Andrew Barto. Reinforcement Learning: an Introduction. The MIT Press.
Recursos electrónicosElectronic Resources *
(*) Access to some electronic resources may be restricted to members of the university community and require validation through Campus Global. If you try to connect from outside of the University you will need to set up a VPN


The course syllabus may change due academic events or other reasons.