Reinforcement Learning’s Generalization Problem
▻https://hackernoon.com/reinforcement-learnings-generalization-problem-414d276c4000?source=rss--
… or why your reinforcement learning agent behaves oddly on unseen game levels.This article details the testing of a PPO-trained A2C agent’s generalization ability. Code available at ▻https://github.com/davidleejy/ai-safety-gridworldsTests of GeneralizationRecently, Deepmind & OpenAI released environments meant for gauging agents’ ability to generalize — a fundamental challenge even for modern deep reinforcement learning.The need for generalization is ubiquitous — for instance, when an agent is trained in a simulator but is then deployed in the real world (this difference is also known as the reality gap). However, common benchmarks today use the same environments for both training and testing — a practice that offers relatively little insight into an agent’s ability to generalize.The following (...)
#deep-learning #reinforcement-learning #artificial-intelligence #machine-learning #neural-networks