Deep learning and probabilistic methods for robotic perception from streaming data

Placeholder Show Content

Abstract/Contents

Abstract
Many robots today are confined to operate in relatively simple, controlled environments. One reason for this is that current methods for processing visual data tend to break down when faced with occlusions, viewpoint changes, poor lighting, and other challenging but common situations that occur when robots are placed in the real world. I will show that we can train robots to handle these challenges by modeling the causes behind visual appearance changes. If we model how the world changes over time, we can be robust to the types of transitions that objects often undergo. I show how we can use this idea to improve performance on four different tasks: segmentation, tracking, velocity estimation, and object recognition. Many of the methods in this dissertation are demonstrated in the context of autonomous driving, although they are generally applicable to other robotic applications for dynamic environments. By modeling the causes of appearance variations over time, we can make our methods more robust to a variety of challenging situations that commonly occur in the real world.

Description

Type of resource text
Form electronic; electronic resource; remote
Extent 1 online resource.
Publication date 2016
Issuance monographic
Language English

Creators/Contributors

Associated with Held, David
Associated with Stanford University, Department of Computer Science.
Primary advisor Savarese, Silvio
Thesis advisor Savarese, Silvio
Thesis advisor Ng, Andrew Y, 1976-
Thesis advisor Thrun, Sebastian, 1967-
Advisor Ng, Andrew Y, 1976-
Advisor Thrun, Sebastian, 1967-

Subjects

Genre Theses

Bibliographic information

Statement of responsibility David Held.
Note Submitted to the Department of Computer Science.
Thesis Thesis (Ph.D.)--Stanford University, 2016.
Location electronic resource

Access conditions

Copyright
© 2016 by David Avi Held
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...