dc.contributor.author | Santana, Pedro | |
dc.contributor.author | Thiebaux, Sylvie | |
dc.contributor.author | Williams, Brian Charles | |
dc.date.accessioned | 2016-03-02T23:12:33Z | |
dc.date.available | 2016-03-02T23:12:33Z | |
dc.date.issued | 2016-02 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/101416 | |
dc.description.abstract | Autonomous agents operating in partially observable stochastic environments often face the problem of optimizing expected performance while bounding the risk of violating safety constraints. Such problems can be modeled as chance-constrained POMDP’s (CC-POMDP’s). Our first contribution is a systematic derivation of execution risk in POMDP domains, which improves upon how chance constraints are handled in the constrained POMDP literature. Second, we present RAO*, a heuristic forward search algorithm producing optimal, deterministic, finite-horizon policies for CC-POMDP’s. In addition to the utility heuristic, RAO* leverages an admissible execution risk heuristic to quickly detect and prune overly-risky policy branches. Third, we demonstrate the usefulness of RAO* in two challenging domains of practical interest: power supply restoration and autonomous science agents | en_US |
dc.description.sponsorship | United States. Air Force Office of Scientific Research (Grant FA95501210348) | en_US |
dc.description.sponsorship | United States. Air Force Office of Scientific Research (Grant FA2386-15-1-4015) | en_US |
dc.description.sponsorship | SUTD-MIT Graduate Fellows Program | en_US |
dc.description.sponsorship | NICTA | en_US |
dc.language.iso | en_US | |
dc.publisher | Association for the Advancement of Artificial Intelligence | en_US |
dc.relation.isversionof | http://www.aaai.org/Conferences/AAAI/2016/aaai16accepted-papers.pdf | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | MIT web domain | en_US |
dc.title | RAO*: an Algorithm for Chance-Constrained POMDP’s | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Santana, Pedro, Sylvie Thiebaux, and Brian Williams. "RAO*: an Algorithm for Chance-Constrained POMDP’s." Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16) (February 2016). | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Aeronautics and Astronautics | en_US |
dc.contributor.mitauthor | Santana, Pedro | en_US |
dc.contributor.mitauthor | Williams, Brian Charles | en_US |
dc.relation.journal | Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16) | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dspace.orderedauthors | Santana, Pedro; Thiebaux, Sylvie; Williams, Brian | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-1057-3940 | |
dc.identifier.orcid | https://orcid.org/0000-0001-8959-0059 | |
mit.license | OPEN_ACCESS_POLICY | en_US |