Aim
Develop novel algorithms for multi-agent reinforcement learning (MARL), enabling multiple autonomous assets to learn how to interact and collaborate towards a specified task.
Objectives
- Explore the current state-of-the-art in MARL research, identify key limitations.
- Develop novel MARL algorithms to address identified limitations.
- Implement and evaluate the algorithms in challenging environments and learning tasks.
Description
Multi-agent reinforcement learning (MARL) algorithms are designed to train the decision policies of multiple autonomous agents acting in a shared environment to accomplish specified tasks. MARL can be applied to a broad range of applications, including many tasks related to defence and security scenarios, including: multiple drones or mobile assets patrolling in a specified area, distributed sensing and intelligence gathering in areas of interest, and human-agent teamwork scenarios. This project will identify key limitations in current MARL capabilities, such as relating to scalability, learning stability, and communication capabilities; and will develop novel algorithms to address such limitations.
References
- Stefano V. Albrecht, Filippos Christianos, and Lukas Schäfer. Multi-Agent Reinforcement Learning: Foundations and Modern Approaches. MIT Press, 2024. https://www.marl-book.com
Research theme:
Principal supervisor:
Dr Stefano Albrecht
University of Edinburgh, School of Informatics
s.albrecht@ed.ac.uk