Analysis of quantization and normalization effects in deep neural networks

Placeholder Show Content

Abstract/Contents

Abstract
There is great interest in the machine learning community to reduce Deep Neural Network (DNN) model sizes. Decreasing the memory and compute requirements expands the range of resource-constrained mobile applications where DNNs can be deployed. By far, the most popular method of compressing model size is uniform quantization. In this work, we illustrate how quantization performance was fortuitously advanced by Batch Normalization (BatchNorm), a technique originally developed to aid training convergence. This improvement is due to BatchNorm's reshaping of the network's activation distributions. Additionally, due to the limited consensus on why BatchNorm is effective, this work uses concepts from the traditional adaptive filter domain to provide insights into its dynamics and inner workings. First, we show that the convolution weight updates have natural modes whose stability and convergence speed are tied to the eigenvalues of the input autocorrelation matrices. Furthermore, our experiments demonstrate that the speed and stability benefits are distinct effects. At low learning rates, it is BatchNorm's amplification of the smallest eigenvalues that improves convergence speed. In contrast, at high learning rates, it is BatchNorm's suppression of the largest eigenvalues that ensures stability. Next, we prove that in the first training step, when normalization is needed most, BatchNorm satisfies the same optimization as Normalized Least Mean Square (NLMS), while it continues to approximate this condition in subsequent steps. The analyses provided lay the groundwork for gaining further insight into the operation of modern neural network structures using adaptive filter theory. Finally, we highlight contributions made to a real-world application of DNNs in the Smart Hospital space.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2021; ©2021
Publication date 2021; 2021
Issuance monographic
Language English

Creators/Contributors

Author Chai, Elaina Teresa
Degree supervisor Murmann, Boris
Thesis advisor Murmann, Boris
Thesis advisor Mujica, Fernando
Thesis advisor Pilanci, Mert
Degree committee member Mujica, Fernando
Degree committee member Pilanci, Mert
Associated with Stanford University, Department of Electrical Engineering

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Elaina Teresa Chai.
Note Submitted to the Department of Electrical Engineering.
Thesis Thesis Ph.D. Stanford University 2021.
Location https://purl.stanford.edu/gb995rt5179

Access conditions

Copyright
© 2021 by Elaina Teresa Chai
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...