CMOS image sensor design for always-on object detection

Young, Christopher Jordan

CMOS image sensor design for always-on object detection

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fsc618yr5054" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: Object detection is an important vision task in embedded systems like smartphones, real-time monitoring devices, and augmented reality. A standard approach is to embed a CMOS image sensor with a backend detection algorithm. Deep neural networks (DNNs) are currently the best-performing algorithms for such applications and there has been extensive work on improving the efficiency of algorithms and hardware. Despite these efforts, customized DNN ASICs are relatively energy-hungry due their large computational footprint. Therefore, to achieve efficient object detection in an embedded device, it is attractive to consider less complex, prior-art algorithms that use low-complexity hand-crafted features as a wakeup for a more powerful DNN. For instance, histograms of oriented gradients (HOGs) present a good tradeoff between computational requirements and detection accuracy. To optimize the wakeup detector for always-on functionality, we can build the feature-extraction directly into the CMOS image sensor. Towards this end, this dissertation presents an application-optimized QVGA image sensor for low-power, always-on object detection using HOGs. In contrast to conventional CMOS imagers that feature linear and high-resolution ADCs, this readout scheme extracts logarithmic intensity gradients at 1.5 or 2.75 bits of resolution. This eliminates unnecessary illumination-related data and allows the HOG feature descriptors to be compressed by up to 25x relative to a conventional 8-bit readout. As a result, the digital backend detector, which typically limits system efficiency, incurs less data movement and computation, leading to an estimated 3.3x energy reduction. The imager employs a column-parallel readout with analog cyclic-row buffers that also perform arbitrary-sized pixel-binning for multi-scale object detection. The log-digitization of pixel gradients is realized by using a ratio-to-digital converter, which performs successive capacitive divisions to its input voltages. The prototype IC was fabricated in a 0.13 um CIS process with standard 4-T, 5 um pixels and consumes 99 pJ/pixel. The power consumption is comparable to conventional low-power designs showing there is little energy overhead in the novel readout. Experiments using a deformable parts model detector for three object classes (persons, bicycles and cars) indicate detection accuracies that are on par with conventional systems.

Description

Type of resource	text
Form	electronic resource; remote; computer; online resource
Extent	1 online resource.
Place	California
Place	[Stanford, California]
Publisher	[Stanford University]
Copyright date	2019; ©2019
Publication date	2019; 2019
Issuance	monographic
Language	English

Creators/Contributors

Author	Young, Christopher Jordan
Degree supervisor	Murmann, Boris
Thesis advisor	Murmann, Boris
Thesis advisor	Arbabian, Amin
Thesis advisor	Raina, Priyanka, (Assistant Professor of Electrical Engineering)
Degree committee member	Arbabian, Amin
Degree committee member	Raina, Priyanka, (Assistant Professor of Electrical Engineering)
Associated with	Stanford University, Department of Electrical Engineering.

Subjects

Genre	Theses
Genre	Text

Bibliographic information

Statement of responsibility	Christopher Young.
Note	Submitted to the Department of Electrical Engineering.
Thesis	Thesis Ph.D. Stanford University 2019.
Location	electronic resource

Access conditions

License: This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

View in SearchWorks

Loading usage metrics...