Deep optics : optimizing integrated imaging and computer vision systems

Placeholder Show Content

Abstract/Contents

Abstract
Over the past decade, computational imaging has transformed the expectations we have for our cameras. At the fingertips of anyone with an iPhone X, for example, are panoramas, high dynamic range (HDR) photography, and Portrait mode---all classic examples of computational imaging. This integration of optical design and image processing has also enabled us to see around corners and behind walls, observe molecular dynamics during chemical reactions, and even to piece together the first image of a black hole. For each application though, the system needs to be reformulated and designed, and it is difficult to claim that the final solution is actually optimal. Concurrently, a tremendous amount of progress has been made in the field of deep learning. Convolutional neural networks (CNNs) have found success in a wide variety of computer vision tasks, including image classification, object detection, image generation, and more. However, this high performance comes at a high computational cost, and it remains difficult to deploy CNNs in embedded systems due to tight power budgets. Through earlier projects discussed in Section I, we gain an understanding of the ways computational imaging and deep learning could potentially enhance and benefit from the other. In section II, we present the concept of deep optics, in which the optical model is jointly optimized with the task-specific algorithm. Deep optics capitalizes on the recent successes of deep learning for computer vision and uses the same data-driven approach to optimize optical parameters together with subsequent neural network parameters. Not only could this improve computer vision performance over basic or heuristically designed cameras, but it could also relieve some of the computational cost by partially pre-processing the data in optics. We begin by building a hybrid optical-electronic convolutional neural network for image classification, specifically focusing on the concept of an optimizable optical convolutional layer to replace the standard electronic convolutional layer. Finally, we apply deep optics to the problems of monocular depth estimation and 3D object detection, demonstrating how these common computer vision tasks that typically ignore the image capture process can be improved by proper modeling and optimization of the optics.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2019; ©2019
Publication date 2019; 2019
Issuance monographic
Language English

Creators/Contributors

Author Chang, Julie
Degree supervisor Wetzstein, Gordon
Thesis advisor Wetzstein, Gordon
Thesis advisor Lee, Jin Hyung
Thesis advisor Liphardt, Jan
Degree committee member Lee, Jin Hyung
Degree committee member Liphardt, Jan
Associated with Stanford University, Department of Bioengineering.

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Julie Chang.
Note Submitted to the Department of Bioengineering.
Thesis Thesis Ph.D. Stanford University 2019.
Location electronic resource

Access conditions

Copyright
© 2019 by Julie Chang
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...