For my study I did an experimentation project on learning in agent environments. The idea is to build action selection learning into the 2APL agent platform. It uses different reinforcement learning techniques to allow software agents to learn what actions give the best result while trying to reach a goal.
![]()
The full report can be downloaded and describes how it was implemented and what RL methods worked best in a random gridworld environment searching for bombs.