Closing the perception-action loop : towards general-purpose robot autonomy

Zhu, Yuke

Closing the perception-action loop : towards general-purpose robot autonomy

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fjg446vg2066" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: Robots and autonomous systems have been playing a significant role in the modern economy. Custom-built robots have remarkably improved productivity, operational safety, and product quality. However, these robots are usually programmed for specific tasks in narrow domains, unable to quickly adapt to new tasks and novel situations. The advent of affordable, lightweight, and flexible robot hardware has opened up opportunities for scaling up robot autonomy to an unprecedented level. A major challenge for the new robot hardware to operate in everyday settings is to handle the constant variability and uncertainty of the real world. To tackle this challenge, we have to address the synergy between perception and action: on the one hand, the robot's perception guides its action adaptively, and on the other hand, its action gives rise to new perceptual information for decision making. I argue that a vital step towards a general-purpose robot autonomy is to integrate perception and action in a tight loop. Emerging computational tools in artificial intelligence have demonstrated promising successes and constitute ideal candidates to enhance robots' perception and control in unstructured environments. The embodied nature of robotics compels us to move beyond the existing paradigm of learning from disembodied datasets and inspires us to develop novel algorithms that take into account the physical hardware and the complex system dynamics. This dissertation demonstrates our research that builds methods and mechanisms for generalizable robot perception and control. Our work illustrates that the tight coupling of perception and action facilitates robots to interact with the unstructured world through their senses, to flexibly perform a wide range of tasks, and to adaptively learn new tasks. Our findings show that dissecting the perception-action loop at three levels of abstraction, from the low-level motor skills to high-level task understanding, effectively prompts the robustness and generalization of robot behaviors. Laying out our research work that attends to tasks with growing complexity unfolds our roadmap towards the holy-grail goal: building long-term, general-purpose robot autonomy in the real world.

Description

Type of resource	text
Form	electronic resource; remote; computer; online resource
Extent	1 online resource.
Place	California
Place	[Stanford, California]
Publisher	[Stanford University]
Copyright date	2019; ©2019
Publication date	2019; 2019
Issuance	monographic
Language	English

Creators/Contributors

Author	Zhu, Yuke
Degree supervisor	Li, Fei Fei, 1976-
Thesis advisor	Li, Fei Fei, 1976-
Thesis advisor	Bohg, Jeannette, 1981-
Thesis advisor	Brunskill, Emma
Degree committee member	Bohg, Jeannette, 1981-
Degree committee member	Brunskill, Emma
Associated with	Stanford University, Computer Science Department.

Subjects

Genre	Theses
Genre	Text

Bibliographic information

Statement of responsibility	Yuke Zhu.
Note	Submitted to the Computer Science Department.
Thesis	Thesis Ph.D. Stanford University 2019.
Location	electronic resource

Access conditions

License: This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

View in SearchWorks

Loading usage metrics...