Validation and generalization of pixel-wise relevance in CNNs trained for face classification

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fwc388fb3957" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: The increased use of convolutional neural networks for face recognition in science,
governance, and broader society has create an acute need for methods that can
codify how these ’black box’ decisions are made. To be interpretable and useful
to humans, such a method should convey a model’s learned classification strategy
in a way that is robust to random initializations or spurious correlations in input
data. In this work, we applied the decompositional pixel-wise attribution method
of layer-wise relevance propagation (LRP) to this end, quantifying pixel-wise
information use of several classes of VGG-16 models trained for face recognition.
We evaluated the models on relevance-based occluded images to analyze how
each model’s relevance distributions vary with and generalize across the type of
pretraining dataset (ImageNet or VGGFace), the finetuning task (gender or identity
classification), and random initializations. We found that relevance distributions
produced by these various types of VGG-16 models prove generally stable across
random initializations and can generalize across finetuning tasks. However, there
is much less generalization across types of pretraining data. So, we find that
LRP relevance distributions generalize asymmetrically and to varying degrees
across these parameters, suggesting it may be possible to find an underlying set of
important facial image pixels that are relevant to most CNNs and tasks.

Type of resource	text
Date created	2020

Subject	convolution neural networks
Subject	CNNs
Subject	facial recognition
Subject	visual perception
Subject	explainability
Genre	Thesis

Location	https://purl.stanford.edu/wc388fb3957

Use and reproduction: User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
License: This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Undergraduate Honors Theses, Symbolic Systems Program, Stanford University

Loading usage metrics...