I am a AI researcher at Microsoft Research with the goal of creating autonomous agents that can efficiently learn to solve complex decision-making tasks in the real world. As part of the Game Intelligence team, I am working towards autonomous agents that can enable novel experiences and tools in video games. I am broadly excited about technology and AI but most interested in its application for decision-making problems.
Prior to joining Microsoft Research, I received my PhD and MSc from the University of Edinburgh where I was supervised by Stefano Albrecht and Amos Storkey. My research focused on reinforcement learning, in particular in the context of multi-agent systems that require multiple agents to cooperate with each other.
The introductory textbook on multi-agent reinforcement learning by Stefano, Filippos, and myself is coming out soon! You can order a (physical) copy of the book here and find the PDF for free at www.marl-book.com!
Dec 19, 2024
📃 Our work Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning has been accepted at the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS) 2025!
Nov 27, 2024
📃 Our work ‘Using Offline Data to Speed Up Reinforcement Learning in Procedurally Generated Environments’ has been accepted at the Neurocomputing journal!
Oct 10, 2024
🏆 I am very excited to have passed my PhD viva today with no corrections! Huge thanks to my examiners Prof. Subramanian Ramamoorthy and Prof. Karl Tuyls for taking the time to evaluate my thesis, their thoughtful feedback, and for making the viva experience as comfortable as possible. My thesis is now accessible at this link.
Oct 01, 2024
📢 I am very excited to be re-joining the Game Intelligence team at Microsoft Research as a Researcher after interning with them last year! As part of the team, I will be working on creating autonomous agents that can enable novel experiences and tools in video games.
Jul 24, 2024
🏆 Honored to receive the best reviewer award for the International Conference on Machine Learning (ICML) 2024!
Dec 06, 2023
📢 New preprint on Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games is now available on arXiv.
Nov 01, 2023
📢 It is DONE! Our textbook “Multi-Agent Reinforcement Learning: Foundations and Modern Approaches” is now with MIT Press and available on the official webpage www.marl-book.com! The print release is scheduled for late 2024.
Oct 30, 2023
📃 Our work Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning will be presented at the Workshop on Generalization in Planning at the Conference on Neural Information Processing Systems (NeurIPS) 2023!
Oct 2024 - Oct 2023
Cambridge
Apr 2023 - Oct 2023
Oct 2024 - Present
Sep 2022 - Sep 2022
Heidelberg
The Heidelberg Laureate Forum brings together the most exceptional mathematicians and computer scientists of their generations. Each year, the recipients of the most prestigious awards in mathematics and computer science, the Abel Prize, ACM A.M. Turing Award, ACM Prize in Computing, Fields Medal, IMU Abacus Medal and Nevanlinna Prize, meet 200 selected young researchers from all over the world. Participants spend a week interacting and networking in a relaxed atmosphere designed to encourage scientific exchange.
Sep 2022 - Sep 2022
Jul 2022 - Dec 2022
London
The Noah’s Ark Lab is the AI research center for Huawei Technologies, working towards significant contributions to both the company and society by innovating in artificial intelligence, data mining and related fields.
Jul 2022 - Dec 2022
Nov 2020 - Mar 2021
Remote
Dematic is global player focused on design and implementation of automated system solutions for warehouses, distribution centres and production facilities.
Nov 2020 - Mar 2021
Sep 2018 - Aug 2020
Edinburgh
HYPED is a team of students at the University of Edinburgh dedicated to developing the Hyperloop concept and inspiring future generations about engineering. HYPED has received awards from SpaceX, Virgin Hyperloop One and Institution of Civil Engineers.
Sep 2019 - Aug 2020
Sep 2018 - Aug 2019
![]() 2019-Present Ph.D in Data Science and Artificial IntelligenceUK PhD: Pass with no corrections out ofThesis:Efficient Exploration in Single-Agent and Multi-Agent Deep Reinforcement Learning Supervisors:Stefano V. Albrecht (primary) and Amos Storkey (secondary) Funding:Principal’s Career Development Scholarship from the University of Edinburgh Keywords:Reinforcement Learning, Multi-Agent Systems, Generalisation, Exploration, Intrinsic Rewards | ||||||||||||||
![]() 2018-2019 M.Sc. in InformaticsCGPA: 77.28% out ofTaken Courses:
Extracurricular Activities:
Funding:DAAD (German Academic Exchange Service) graduate scholarship & Stevenson Exchange Scholarship | ||||||||||||||
![]() 2015-2018 B.Sc. in InformaticsGerman scale: 1.2 out ofTaken Courses:
Extracurricular Activities:
|
Oct 2019 - June 2022, School of Informatics, University of Edinburgh
Teaching assistant, demonstrator and marker for three iterations of the Reinforcement Learning lecture at the University of Edinburgh under Dr. Stefano V. Albrecht
Jun 2022 - Present, School of Informatics, University of Edinburgh
Co-supervised visiting PhD student project at the University of Edinburgh
Feb 2021 - Aug 2021, School of Informatics, University of Edinburgh
Co-supervised final Masters students’ projects at the University of Edinburgh
Sep 2017 - Oct 2017, Mathematics Preparation Course, Saarland University
Voluntary lecturer and coach for the mathematics preparation course preparing upcoming computer science undergraduate students for their studies
Oct 2016 - Mar 2017, Dependable Systems and Software Chair, Saarland University
Tutor for the Programming 1 lecture about functional programming at the Dependable Systems and Software Group chair of Saarland University under Prof. Dr. Holger Hermanns