CMOS image sensor design for always-on object detection
Abstract/Contents
- Abstract
- Object detection is an important vision task in embedded systems like smartphones, real-time monitoring devices, and augmented reality. A standard approach is to embed a CMOS image sensor with a backend detection algorithm. Deep neural networks (DNNs) are currently the best-performing algorithms for such applications and there has been extensive work on improving the efficiency of algorithms and hardware. Despite these efforts, customized DNN ASICs are relatively energy-hungry due their large computational footprint. Therefore, to achieve efficient object detection in an embedded device, it is attractive to consider less complex, prior-art algorithms that use low-complexity hand-crafted features as a wakeup for a more powerful DNN. For instance, histograms of oriented gradients (HOGs) present a good tradeoff between computational requirements and detection accuracy. To optimize the wakeup detector for always-on functionality, we can build the feature-extraction directly into the CMOS image sensor. Towards this end, this dissertation presents an application-optimized QVGA image sensor for low-power, always-on object detection using HOGs. In contrast to conventional CMOS imagers that feature linear and high-resolution ADCs, this readout scheme extracts logarithmic intensity gradients at 1.5 or 2.75 bits of resolution. This eliminates unnecessary illumination-related data and allows the HOG feature descriptors to be compressed by up to 25x relative to a conventional 8-bit readout. As a result, the digital backend detector, which typically limits system efficiency, incurs less data movement and computation, leading to an estimated 3.3x energy reduction. The imager employs a column-parallel readout with analog cyclic-row buffers that also perform arbitrary-sized pixel-binning for multi-scale object detection. The log-digitization of pixel gradients is realized by using a ratio-to-digital converter, which performs successive capacitive divisions to its input voltages. The prototype IC was fabricated in a 0.13 um CIS process with standard 4-T, 5 um pixels and consumes 99 pJ/pixel. The power consumption is comparable to conventional low-power designs showing there is little energy overhead in the novel readout. Experiments using a deformable parts model detector for three object classes (persons, bicycles and cars) indicate detection accuracies that are on par with conventional systems.
Description
Type of resource | text |
---|---|
Form | electronic resource; remote; computer; online resource |
Extent | 1 online resource. |
Place | California |
Place | [Stanford, California] |
Publisher | [Stanford University] |
Copyright date | 2019; ©2019 |
Publication date | 2019; 2019 |
Issuance | monographic |
Language | English |
Creators/Contributors
Author | Young, Christopher Jordan | |
---|---|---|
Degree supervisor | Murmann, Boris | |
Thesis advisor | Murmann, Boris | |
Thesis advisor | Arbabian, Amin | |
Thesis advisor | Raina, Priyanka, (Assistant Professor of Electrical Engineering) | |
Degree committee member | Arbabian, Amin | |
Degree committee member | Raina, Priyanka, (Assistant Professor of Electrical Engineering) | |
Associated with | Stanford University, Department of Electrical Engineering. |
Subjects
Genre | Theses |
---|---|
Genre | Text |
Bibliographic information
Statement of responsibility | Christopher Young. |
---|---|
Note | Submitted to the Department of Electrical Engineering. |
Thesis | Thesis Ph.D. Stanford University 2019. |
Location | electronic resource |
Access conditions
- Copyright
- © 2019 by Christopher Jordan Young
- License
- This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).
Also listed in
Loading usage metrics...