Planning under uncertainty in safety-critical systems

Placeholder Show Content

Abstract/Contents

Abstract
From warehouses and manufacturing lines to homes and offices, from roads and seas, to skies and space, autonomous systems promise to improve efficiency, unlock human potential, and explore new frontiers. Many autonomous systems already make decisions that impact our everyday lives. As technology continues to develop and the cost of compute continues to decrease, autonomous systems will continue integrating into society. However, a unifying necessity for safety-critical systems to deploy autonomously in the real world is the need to be able to reason about their environments and make good decisions to satisfy their objectives. For autonomous systems to be deployed successfully, it often does not suffice to plan deterministically, that is, assuming that everything will `go as planned` against a single string of outcomes. Rather, agents must reason about the uncertainty that can arise, either from inexact actuation or sensing, imperfect information, unclear objectives, unknown motives of other participants, or complex environments. These sources of uncertainty can significantly complicate autonomous decision-making and can ultimately lead to catastrophic errors. By explicitly reasoning about these sources of uncertainty, this thesis introduces new methods for planning safely against them. First, this thesis investigates methods that use data to overcome uncertainty in action outcomes and agent objectives. Specifically, we consider using human driving demonstrations alongside simulators to overcome objective uncertainty for autonomous driving in complex urban environments. Previous approaches that used simulators to help imitate human driving were typically limited to relatively simple scenarios. We introduce Safety-Aware Hierarchical Adversarial Imitation Learning (SHAIL), a method that scales safety-critical data-driven decision-making to complex problems through reliance on hierarchical decomposition and safety predictions. After building a simulator to test counterfactuals of real-world driving decisions, we demonstrate empirically that SHAIL can improve safety compared to other data-driven decision-making methods, especially in unseen driving scenarios. Next, we turn to safe planning under outcome and state uncertainty when models for those uncertainties are known a priori. Here, we impose safety through constraints on agent plans, modeling problems as constrained partially observable Markov decision processes (CPOMDPs). Approximate CPOMDP solutions are typically limited to small, discrete actions and observation spaces. We introduce algorithms that extend online search-based planning in CPOMDPs to domains with large or continuous state, action, and observation spaces by using methods that artificially limit the width of a search tree in unpromising areas and satisfy constraints using dual ascent. We empirically compare the effectiveness of our proposed algorithms on continuous CPOMDPs that model both toy and real-world safety-critical problems. In doing so, we demonstrate that CPOMDP planning can be effective in continuous domains. Unfortunately, the algorithms we introduce for safe online planning in continuous CPOMDPs are still restricted to relatively small problems. Fortunately, as noted for urban driving, many large planning problems can be decomposed hierarchically. In our final contribution, we introduce Constrained Options Belief Tree Search (COBeTS) to scale continuous CPOMDP planning to much larger problems with favorable hierarchical decompositions by planning over macro-actions (i.e. low-level controller options). We demonstrate COBeTS in several large, safety-critical, uncertain domains, showing that it can plan successfully while non-hierarchical baselines cannot. Importantly, we show that with constraint-satisfying macro-actions, COBeTS can guarantee safety regardless of planning time. In summary, our contributions improve planning safety in domains with quantifiable outcome, state, and/or objective uncertainty through novel applications of hierarchies and/or constraints.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2024; ©2024
Publication date 2024; 2024
Issuance monographic
Language English

Creators/Contributors

Author Jamgochian, Arec Levon
Degree supervisor Kochenderfer, Mykel
Thesis advisor Kochenderfer, Mykel
Thesis advisor Pavone, Marco
Thesis advisor Schwager, Mac
Degree committee member Pavone, Marco
Degree committee member Schwager, Mac
Associated with Stanford University, School of Engineering
Associated with Stanford University, Department of Aeronautics and Astronautics

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Arec Jamgochian.
Note Submitted to the Department of Aeronautics and Astronautics.
Thesis Thesis Ph.D. Stanford University 2024.
Location https://purl.stanford.edu/mj052kx8392

Access conditions

Copyright
© 2024 by Arec Levon Jamgochian
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...