Detecting Environment Drift in Reinforcement Learning Using a Gaussian Process
This study introduces a novel two-stage method, GPAction, for detecting environment drift in reinforcement learning settings. We first train a Gaussian process predicting the reinforcement learning agents' actions and then detect environment drifts by monitoring the mean squared error between the predicted actions of a Gaussian process action predictor and actual actions. Our proposed method is evaluated against three baselines across four environments with continuous action spaces. Results demonstrate the superior performance of GPAction in detecting environment drift. In an ablation study, by analyzing the plots and AUC values of the MSEs, we show that our method GPAction can provide more distinguishable monitoring metrics than the Gaussian process state predictor.
Top- Fang, Zhizhou
- Zdun, Uwe
Category |
Paper in Conference Proceedings or in Workshop Proceedings (Paper) |
Event Title |
The 36th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2024) |
Divisions |
Software Architecture |
Subjects |
Software Engineering Kuenstliche Intelligenz Angewandte Informatik |
Event Location |
Herndon, VA, USA |
Event Type |
Conference |
Event Dates |
28 October - 30 October 2024 |
Date |
28 October 2024 |
Export |