Social diversity and social preferences in mixed-motive reinforcement learning

Kevin R. McKee, Ian Gemp, Brian McWilliams, Edgar A. Duéñez-Guzmán, Edward Hughes, & Joel Z. Leibo

Abstract

Recent research on reinforcement learning in pure-conflict and pure-common-interest games has emphasized the importance of population heterogeneity. In contrast, studies of reinforcement learning in mixed-motive games have primarily leveraged homogeneous approaches. Given the defining characteristic of mixed-motive games (the imperfect correlation of incentives between group members), we study the effect of population heterogeneity on mixed-motive reinforcement learning. We draw on interdependence theory from social psychology and imbue reinforcement learning agents with Social Value Orientation (SVO), a flexible formalization of preferences over group outcome distributions. We subsequently explore the effects of diversity in SVO on populations of reinforcement learning agents in two mixed-motive Markov games. We demonstrate that heterogeneity in SVO generates meaningful and complex behavioral variation among agents similar to that suggested by interdependence theory. Empirical results in these mixed-motive dilemmas suggest agents trained in heterogeneous populations develop particularly generalized, high-performing policies relative to those trained in homogeneous populations.


Venue

Proc. of the 19th International Conference on Autonomous Agents and MultiAgent Systems

Year

2020

Links

arXiv

ACM Digital Library