Deep learning and probabilistic methods for robotic perception from streaming data

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fcf036kx6751" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: Many robots today are confined to operate in relatively simple, controlled environments. One reason for this is that current methods for processing visual data tend to break down when faced with occlusions, viewpoint changes, poor lighting, and other challenging but common situations that occur when robots are placed in the real world. I will show that we can train robots to handle these challenges by modeling the causes behind visual appearance changes. If we model how the world changes over time, we can be robust to the types of transitions that objects often undergo. I show how we can use this idea to improve performance on four different tasks: segmentation, tracking, velocity estimation, and object recognition. Many of the methods in this dissertation are demonstrated in the context of autonomous driving, although they are generally applicable to other robotic applications for dynamic environments. By modeling the causes of appearance variations over time, we can make our methods more robust to a variety of challenging situations that commonly occur in the real world.

Type of resource	text
Form	electronic; electronic resource; remote
Extent	1 online resource.
Publication date	2016
Issuance	monographic
Language	English

Associated with	Held, David
Associated with	Stanford University, Department of Computer Science.
Primary advisor	Savarese, Silvio
Thesis advisor	Savarese, Silvio
Thesis advisor	Ng, Andrew Y, 1976-
Thesis advisor	Thrun, Sebastian, 1967-
Advisor	Ng, Andrew Y, 1976-
Advisor	Thrun, Sebastian, 1967-

Genre	Theses

Statement of responsibility	David Held.
Note	Submitted to the Department of Computer Science.
Thesis	Thesis (Ph.D.)--Stanford University, 2016.
Location	electronic resource

License: This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

View in SearchWorks

Loading usage metrics...