New Algorithms for Collaborative and Adversarial Decision Making in Partially Observable Stochastic Games

Sponsored by the Air Force Office of Scientific Research
Shlomo Zilberstein, PI


The performance of decision makers in many practical settings can be improved significantly by using new computational tools for coordination, prediction and planning. We are developing such tools for situations involving multiple decision makers operating over an extended period of time in either collaborative or adversarial domains. To address this challenge, we employ two formal models. A Decentralized Partially Observable Markov Decision Process (DEC-POMDP) is designed for collaborative systems in which all the decision makers share the same objective or utility function. A Partially Observable Stochastic Game (POSG) is a proper extension of a DEC-POMDP that is designed for competitive systems in which decision makers have separate, possibly conflicting, objectives. We have recently produced important new results regarding these two models. We have published the first complexity analysis of DEC-POMDPs and identified a taxonomy of sub-classes whose complexity ranges between polynomial and NEXP-hard. We are developing new exact dynamic programming algorithm for solving DEC-POMDPs and POSGs as well as a range of approximation schemes. We are also working on effective decision-theoretic techniques to model and exploit bounded rationality and opponent models in decentralized settings.

Related Publications


shlomo@cs.umass.edu