Deep learning in computational biology : from predictive modeling to knowledge extraction
Abstract/Contents
- Abstract
- The rapid development of deep learning methods has transformed concepts and pipelines in the analysis of large-scale data cohorts. In parallel, datasets of unprecedented size and diversity stemming from novel biological experimental techniques have largely exceeded the capacity of conventional human-engineered tools. Driven by the versatility and expressive power of deep neural networks, the past few years have witnessed a burst in efforts to incorporate deep learning-based techniques to model the rich information from experimental data. In addition to the need for accurate predictive modeling, biological research problems place great emphasis on model interpretability, aiming to unravel the underlying mechanism by extracting model-learned knowledge. With these challenges posed by the new techniques and datasets in mind, I present three works in this thesis that developed deep learning-based tools to model, analyze and understand various types of molecular and cellular data. In the first project, we summarized methods and datasets for molecular machine learning and proposed a large-scale benchmark MoleculeNet to facilitate the comparison of model efficacy. MoleculeNet curates multiple public datasets, establishes metrics for evaluation, and offers high-quality open-source implementations of multiple molecular featurization and learning algorithms. MoleculeNet benchmarks demonstrate that learnable representations are powerful tools for molecular machine learning and broadly offer the best performance, though learnable representations still struggle to deal with complex tasks under data scarcity and highly imbalanced classification. We further recognized that for quantum mechanical and biophysical datasets, the use of physics-aware featurization can be more important than the choice of modeling algorithm. In the second project, we proposed an automated analysis tool: DynaMorph for quantitative live-cell imaging. DynaMorph is composed of multiple modules sequentially applied to perform cell segmentation, tracking, and self-supervised morphology encoding. We employed DynaMorph to learn the cellular morphodynamics of live microglia through label-free measurements of optical density and anisotropy. These cells show complex behavior and have varied responses to disease-relevant perturbations. DynaMorph generates quantitative morphodynamic representations that can be used to compare the effects of the perturbations. Furthermore, by analyzing DynaMorph representations we identify distinct morphodynamic states of microglia polarization and detect rare transition events between states. In the third project, I studied spatial cellular community structures based on multiplex immunofluorescence imaging. By parsing high-resolution immunofluorescence images as graphical representations of cellular communities, we developed SPAtial CEllular Graphical Modeling (SPACE-GM), a geometric deep learning framework that models tumor microenvironments as cellular graphs. We applied SPACE-GM to human head-and-neck and colorectal cancer samples assayed with 40-plex immunofluorescence imaging to identify spatial motifs associated with patient survival and recurrence outcomes after immunotherapy. SPACE-GM achieves substantially higher accuracy in predicting patient outcomes than previous approaches based on neighborhood cell-type compositions. Computational interpretation of the disease-relevant microenvironments identified by SPACE-GM generates insights into the effect of spatial dispersion of tumor cells and granulocytes on patient prognosis.
Description
Type of resource | text |
---|---|
Form | electronic resource; remote; computer; online resource |
Extent | 1 online resource. |
Place | California |
Place | [Stanford, California] |
Publisher | [Stanford University] |
Copyright date | 2022; ©2022 |
Publication date | 2022; 2022 |
Issuance | monographic |
Language | English |
Creators/Contributors
Author | Wu, Zhenqin |
---|---|
Degree supervisor | Zou, James |
Thesis advisor | Zou, James |
Thesis advisor | Markland, Thomas E |
Thesis advisor | Rotskoff, Grant |
Degree committee member | Markland, Thomas E |
Degree committee member | Rotskoff, Grant |
Associated with | Stanford University, Department of Chemistry |
Subjects
Genre | Theses |
---|---|
Genre | Text |
Bibliographic information
Statement of responsibility | Zhenqin Wu. |
---|---|
Note | Submitted to the Department of Chemistry. |
Thesis | Thesis Ph.D. Stanford University 2022. |
Location | https://purl.stanford.edu/jf524fw3276 |
Access conditions
- Copyright
- © 2022 by Zhenqin Wu
- License
- This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).
Also listed in
Loading usage metrics...