Uncertainty quantification in complex networks with applications to brain connectomics

Placeholder Show Content

Abstract/Contents

Abstract
From social networks to neurosciences, researchers across all disciplines are now faced with the challenge of adapting statistical methods to vast, high-dimensional arrays of data: how can we extract patterns and make sense of such large arrays of numbers? In this setting, graphs have become ubiquitous by offering a versatile modeling framework in which data points are represented as nodes, while edges capture various aspects of the underlying organization of the data. These edges typically denote some flexible notion of proximity, ranging from affinities between users and products in recommendation networks, to causal links in directed acyclic graphs or neuronal coactivation patterns in brain datasets. Whereas most of the current literature has focused on using graphs for learning nodes' properties at an atomic level (community detection, link prediction, etc.), in a variety of applications, the object of interest is the graph itself. In brain connectomics for instance --- one of the areas of application at the center of this thesis ---, the focus is on understanding the functional and anatomical ``wiring'' of the brain and its association with cognitive processes and psychiatric diseases. This process requires the extension of traditional statistical notions (mean, variance, etc.) to the graph setting, which currently appear as the missing, albeit crucial building blocks for principled inference on complex systems. This PhD thesis focuses on providing some methodological tools for extending statistical inference and uncertainty quantification to graph-structured data --- whether these graphs are observed or latent. In particular, we motivate this problem by its application to the analysis of fMRI data. Chapter 1 describes some of the properties of this extremely rich data source, as well as the many interesting challenges (low signal-to-noise ratio, scalability, multi-resolution behavior, non-stationarity, etc.) posed by its analysis \cite{nature} and its representation as a graph. Building upon this overview of the multiple facets of the challenges which arise when working with real-life graph-structured data, we organize this thesis around the three subsequent themes. The first main block of our work (Chapter 2) concentrates around the definition of an appropriate distance for contrasting and comparing aligned networks. In some instances, the comparison of coarsened representations of graphs can be more informative than the comparison of the original ones. We thus turn in Chapter 3 to the extraction of robust multiscale representations of graphs, which we address here by an adaptation of convex clustering. The second main part of our work tackles the case where the graphs are unobserved, and need to be simultaneously inferred and contrasted. In particular, Chapter 4 is centered around the extraction of reliable brain connectome networks through the lens of Bayesian Independent Component Analysis --- an approach which allows the flexible integration of multiple sources of information while providing Bayesian uncertainty estimates. Finally, Chapter 5 opens our discussion to the analysis of data and signals on graphs --rather than the graphs themselves. Indeed, in a number of settings, the underlying organization of a complex system as a graph is crucial in understanding its behavior. In epidemiological studies for instance, social networks have been shown to influence the outcome of an epidemic, its propagation speed, or the variability in the transmission rate. In this context, it becomes essential to try to impute and integrate characteristics of the network structure in the analysis. We focus here on accounting for the potential heterogeneity of the contact network on predictive scenarios for epidemics

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2020; ©2020
Publication date 2020; 2020
Issuance monographic
Language English

Creators/Contributors

Author Donnat, Claire Louise
Degree supervisor Holmes, Susan, 1954-
Thesis advisor Holmes, Susan, 1954-
Thesis advisor Friedman, J. H. (Jerome H.)
Thesis advisor Leskovec, Jurij
Degree committee member Friedman, J. H. (Jerome H.)
Degree committee member Leskovec, Jurij
Associated with Stanford University, Department of Statistics.

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Claire Donnat
Note Submitted to the Department of Statistics
Thesis Thesis Ph.D. Stanford University 2020
Location electronic resource

Access conditions

Copyright
© 2020 by Claire Louise Donnat
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...