Analysis of quantization and normalization effects in deep neural networks

Chai, Elaina Teresa

Analysis of quantization and normalization effects in deep neural networks

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fgb995rt5179" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: There is great interest in the machine learning community to reduce Deep Neural Network (DNN) model sizes. Decreasing the memory and compute requirements expands the range of resource-constrained mobile applications where DNNs can be deployed. By far, the most popular method of compressing model size is uniform quantization. In this work, we illustrate how quantization performance was fortuitously advanced by Batch Normalization (BatchNorm), a technique originally developed to aid training convergence. This improvement is due to BatchNorm's reshaping of the network's activation distributions. Additionally, due to the limited consensus on why BatchNorm is effective, this work uses concepts from the traditional adaptive filter domain to provide insights into its dynamics and inner workings. First, we show that the convolution weight updates have natural modes whose stability and convergence speed are tied to the eigenvalues of the input autocorrelation matrices. Furthermore, our experiments demonstrate that the speed and stability benefits are distinct effects. At low learning rates, it is BatchNorm's amplification of the smallest eigenvalues that improves convergence speed. In contrast, at high learning rates, it is BatchNorm's suppression of the largest eigenvalues that ensures stability. Next, we prove that in the first training step, when normalization is needed most, BatchNorm satisfies the same optimization as Normalized Least Mean Square (NLMS), while it continues to approximate this condition in subsequent steps. The analyses provided lay the groundwork for gaining further insight into the operation of modern neural network structures using adaptive filter theory. Finally, we highlight contributions made to a real-world application of DNNs in the Smart Hospital space.

Description

Type of resource	text
Form	electronic resource; remote; computer; online resource
Extent	1 online resource.
Place	California
Place	[Stanford, California]
Publisher	[Stanford University]
Copyright date	2021; ©2021
Publication date	2021; 2021
Issuance	monographic
Language	English

Creators/Contributors

Author	Chai, Elaina Teresa
Degree supervisor	Murmann, Boris
Thesis advisor	Murmann, Boris
Thesis advisor	Mujica, Fernando
Thesis advisor	Pilanci, Mert
Degree committee member	Mujica, Fernando
Degree committee member	Pilanci, Mert
Associated with	Stanford University, Department of Electrical Engineering

Subjects

Genre	Theses
Genre	Text

Bibliographic information

Statement of responsibility	Elaina Teresa Chai.
Note	Submitted to the Department of Electrical Engineering.
Thesis	Thesis Ph.D. Stanford University 2021.
Location	https://purl.stanford.edu/gb995rt5179

Access conditions

License: This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

View in SearchWorks

Loading usage metrics...