Deep optics : optimizing integrated imaging and computer vision systems

Chang, Julie

Deep optics : optimizing integrated imaging and computer vision systems

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fdk931fs5485" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: Over the past decade, computational imaging has transformed the expectations we have for our cameras. At the fingertips of anyone with an iPhone X, for example, are panoramas, high dynamic range (HDR) photography, and Portrait mode---all classic examples of computational imaging. This integration of optical design and image processing has also enabled us to see around corners and behind walls, observe molecular dynamics during chemical reactions, and even to piece together the first image of a black hole. For each application though, the system needs to be reformulated and designed, and it is difficult to claim that the final solution is actually optimal. Concurrently, a tremendous amount of progress has been made in the field of deep learning. Convolutional neural networks (CNNs) have found success in a wide variety of computer vision tasks, including image classification, object detection, image generation, and more. However, this high performance comes at a high computational cost, and it remains difficult to deploy CNNs in embedded systems due to tight power budgets. Through earlier projects discussed in Section I, we gain an understanding of the ways computational imaging and deep learning could potentially enhance and benefit from the other. In section II, we present the concept of deep optics, in which the optical model is jointly optimized with the task-specific algorithm. Deep optics capitalizes on the recent successes of deep learning for computer vision and uses the same data-driven approach to optimize optical parameters together with subsequent neural network parameters. Not only could this improve computer vision performance over basic or heuristically designed cameras, but it could also relieve some of the computational cost by partially pre-processing the data in optics. We begin by building a hybrid optical-electronic convolutional neural network for image classification, specifically focusing on the concept of an optimizable optical convolutional layer to replace the standard electronic convolutional layer. Finally, we apply deep optics to the problems of monocular depth estimation and 3D object detection, demonstrating how these common computer vision tasks that typically ignore the image capture process can be improved by proper modeling and optimization of the optics.

Description

Type of resource	text
Form	electronic resource; remote; computer; online resource
Extent	1 online resource.
Place	California
Place	[Stanford, California]
Publisher	[Stanford University]
Copyright date	2019; ©2019
Publication date	2019; 2019
Issuance	monographic
Language	English

Creators/Contributors

Author	Chang, Julie
Degree supervisor	Wetzstein, Gordon
Thesis advisor	Wetzstein, Gordon
Thesis advisor	Lee, Jin Hyung
Thesis advisor	Liphardt, Jan
Degree committee member	Lee, Jin Hyung
Degree committee member	Liphardt, Jan
Associated with	Stanford University, Department of Bioengineering.

Subjects

Genre	Theses
Genre	Text

Bibliographic information

Statement of responsibility	Julie Chang.
Note	Submitted to the Department of Bioengineering.
Thesis	Thesis Ph.D. Stanford University 2019.
Location	electronic resource

Access conditions

License: This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

View in SearchWorks

Loading usage metrics...