Influence Diagrams with Memory States: Representation and Algorithms
Xiaojian Wu, Akshat Kumar, and Shlomo Zilberstein. Influence Diagrams with Memory States: Representation and Algorithms. Proceedings of the Second International Conference on Algorithmic Decision Theory (ADT), Rutgers University, 2011.
Abstract
Influence diagrams (IDs) offer a powerful framework for decision making under uncertainty, but their applicability has been hindered by the exponential growth of runtime and memory usage--largely due to the no-forgetting assumption. We present a novel way to maintain a limited amount of memory to inform each decision and still obtain near-optimal policies. The approach is based on augmenting the graphical model with memory states that represent key aspects of previous observations--a method that has proved useful in POMDP solvers. We also derive an efficient EM-based message-passing algorithm to compute the policy. Experimental results show that this approach produces high-quality approximate polices and offers better scalability than existing methods.
Bibtex entry:
@inproceedings{WKZadt11, author = {Xiaojian Wu and Akshat Kumar and Shlomo Zilberstein}, title = {Influence Diagrams with Memory States: Representation and Algorithms}, booktitle = {Proceedings of the Second International Conference on Algorithmic Decision Theory}, year = {2011}, pages = {}, address = {Rutgers University, USA}, url = {http://rbr.cs.umass.edu/shlomo/papers/WKZadt11.html} }shlomo@cs.umass.edu