Symbolic Generalization for On-line Planning

Zhengzhu Feng, Eric A. Hansen, and Shlomo Zilberstein. Symbolic Generalization for On-line Planning. Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI), 209-216, Acapulco, Mexico, 2003.

Abstract

Symbolic representations have been used successfully in off-line planning algorithms for Markov decision processes. We show that they can also improve the performance of on-line planners. In addition to reducing computation time, symbolic generalization can reduce the amount of costly real-world interactions required for convergence. We introduce Symbolic Real-Time Dynamic Programming (or sRTDP), an extension of RTDP. After each step of on-line interaction with an environment, sRTDP uses symbolic model-checking techniques to generalizes its experience by updating a group of states rather than a single state. We examine two heuristic approaches to dynamic grouping of states and show that they accelerate the planning process significantly in terms of both CPU time and the number of steps of interaction with the environment.

Bibtex entry:

@inproceedings{FHZuai03,
  author	= {Zhengzhu Feng and Eric A. Hansen and Shlomo Zilberstein},
  title		= {Symbolic Generalization for On-line Planning},
  booktitle     = {Proceedings of the Nineteenth Conference on Uncertainty in
                   Artificial Intelligence},
  year		= {2003},
  pages		= {209-216},
  address       = {Acapulco, Mexico},
  url		= {http://rbr.cs.umass.edu/shlomo/papers/FHZuai03.html}
}

shlomo@cs.umass.edu
UMass Amherst