‘On Centralized Critics in Multi-Agent Reinforcement Learning’

“Centralized Training for Decentralized Execution, where agents are trained offline in a centralized fashion and execute online in a decentralized manner, has become a popular approach in Multi-Agent Reinforcement Learning (MARL). In particular, it has become popular to develop actor-critic methods that train decentralized actors with a centralized critic … [however,] using a centralized critic in this context has yet to be sufficiently analyzed theoretically or empirically. In this paper, we therefore formally analyze centralized and decentralized critic approaches.”

Find the paper and full list of authors at the Journal of Artificial Intelligence Research.

View on Site: ‘On Centralized Critics in Multi-Agent Reinforcement Learning’