On the predictive power of graph neural networks

Placeholder Show Content

Abstract/Contents

Abstract
Graph Neural Networks (GNNs) are a class of deep learning models for making predictions on graph-structured data. Many different GNN models have been proposed to achieve promising empirical performance. However, their architectural designs were ad-hoc, and their theoretical understanding remained limited. Moreover, these models were developed on small graph benchmark datasets, which altogether limit the development of powerful GNNs for real-world prediction tasks over graphs. In this thesis, we aim to build powerful predictive GNNs by understanding, improving, and benchmarking the predictive power of GNNs---the ability of GNNs to make accurate predictions over graphs. This thesis consists of three parts. In Part I, we develop a theoretical framework for understanding the predictive power of GNNs. We specifically focus on the expressive power, asking whether GNNs can express desired functions over graphs. We use our theoretical framework to provide insight into whether a given GNN is powerful enough to model the ground-truth target function that underlies the data. We also propose a maximally-expressive GNN model that can provably model most functions over graphs. Equipped with the framework to design expressive GNN models, in Part II, we move on to improve their predictive power on unseen/unlabeled data, i.e., improve the generalization power of GNNs. Motivated by real-world applications, we develop methods for improving the generalization power of GNNs under two common limited data scenarios: limited labeled data and limited edge connectivity. Finally, in Part III, we create new graph benchmark datasets to resolve the issues with the existing benchmarks and to engage the community toward improving the predictive power of GNNs. We present the Open Graph Benchmark (OGB) and OGB-LSC, a collection of challenging, realistic, and large-scale benchmark datasets for machine learning on graphs. We discuss the impact our benchmarks have had in advancing the predictive power of GNNs and conclude with future challenges of applying GNNs to real-world prediction tasks.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2022; ©2022
Publication date 2022; 2022
Issuance monographic
Language English

Creators/Contributors

Author Hu, Weihua
Degree supervisor Leskovec, Jurij
Thesis advisor Leskovec, Jurij
Thesis advisor Liang, Percy
Thesis advisor Ré, Christopher
Degree committee member Liang, Percy
Degree committee member Ré, Christopher
Associated with Stanford University, Computer Science Department

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Weihua Hu.
Note Submitted to the Computer Science Department.
Thesis Thesis Ph.D. Stanford University 2022.
Location https://purl.stanford.edu/xh015dp2304

Access conditions

Copyright
© 2022 by Weihua Hu
License
This work is licensed under a Creative Commons Attribution 3.0 Unported license (CC BY).

Also listed in

Loading usage metrics...