Towards Deployment of Robust Cooperative AI Agents: An Algorithmic Framework for Learning Adaptive Policies
We study the problem of designing an AI agent that can robustlycooperate with agents of unknown type (i.e., previously unobservedbehavior) in multi-agent scenarios. Our work is inspired by real-world applications in which an AI agent, e.g., a virtual assistant, hasto cooperate with new types of agents/users after its deployment.We model this problem via parametric Markov Decision Processeswhere the parameters correspond to a user’s type and characterizeher behavior. In the test phase, the AI agent has to interact with auser of an unknown type. We develop an algorithmic frameworkfor learning adaptive policies: our approach relies on observing theuser’s actions to make inferences about the user’s type and adaptingthe policy to facilitate efficient cooperation. We show that withoutbeing adaptive, an AI agent can end up performing arbitrarily badin the test phase. Using our framework, we propose two concretealgorithms for computing policies that automatically adapt to theuser in the test phase. We demonstrate the effectiveness of ouralgorithms in a cooperative gathering game environment for twoagents.KEYWORDSLearning agent-to-agent interactions; Machine learning; Reinforce-ment learning
Top- Ghosh, Ahana
- Tschiatschek, Sebastian
- Mahdavi, Hamed
- Singla, Adish
Category |
Paper in Conference Proceedings or in Workshop Proceedings (Poster) |
Event Title |
19th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS) |
Divisions |
Data Mining and Machine Learning |
Subjects |
Kuenstliche Intelligenz |
Event Location |
Auckland (NZ); Virtual (Covid) |
Event Type |
Conference |
Event Dates |
09.-13.05.2020 |
Series Name |
Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems |
ISSN/ISBN |
978-1-4503-7518-4 |
Page Range |
pp. 447-455 |
Date |
May 2020 |
Export |