Validation and generalization of pixel-wise relevance in CNNs trained for face classification
Abstract/Contents
- Abstract
The increased use of convolutional neural networks for face recognition in science,
governance, and broader society has create an acute need for methods that can
codify how these ’black box’ decisions are made. To be interpretable and useful
to humans, such a method should convey a model’s learned classification strategy
in a way that is robust to random initializations or spurious correlations in input
data. In this work, we applied the decompositional pixel-wise attribution method
of layer-wise relevance propagation (LRP) to this end, quantifying pixel-wise
information use of several classes of VGG-16 models trained for face recognition.
We evaluated the models on relevance-based occluded images to analyze how
each model’s relevance distributions vary with and generalize across the type of
pretraining dataset (ImageNet or VGGFace), the finetuning task (gender or identity
classification), and random initializations. We found that relevance distributions
produced by these various types of VGG-16 models prove generally stable across
random initializations and can generalize across finetuning tasks. However, there
is much less generalization across types of pretraining data. So, we find that
LRP relevance distributions generalize asymmetrically and to varying degrees
across these parameters, suggesting it may be possible to find an underlying set of
important facial image pixels that are relevant to most CNNs and tasks.
Description
Type of resource | text |
---|---|
Date created | 2020 |
Creators/Contributors
Author | Crawford, Jñani | |
---|---|---|
Primary advisor | Grill-Spector, Kalanit | |
Advisor | Poltoralski, Sonia |
Subjects
Subject | convolution neural networks |
---|---|
Subject | CNNs |
Subject | facial recognition |
Subject | visual perception |
Subject | explainability |
Genre | Thesis |
Bibliographic information
Access conditions
- Use and reproduction
- User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
- License
- This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).
Collection
Undergraduate Honors Theses, Symbolic Systems Program, Stanford University
View other items in this collection in SearchWorksContact information
- Contact
- jnani@stanford.edu
Also listed in
Loading usage metrics...