Decoding transcription regulatory patterns in health and disease

Placeholder Show Content

Abstract/Contents

Abstract
Transcriptional control of gene expression is known to be a cell-type specific process that depends on many different factors: including the underlying DNA sequence, combinatorial patterns of transcription factors (TFs) and their associated motifs, chromatin accessibility, and DNA-DNA looping contacts. However, modeling the relationship between DNA and its regulatory elements while learning how these regulatory complexes lead to cell-type specific function within a 3-dimensional context remains difficult. Additionally, sequence variants within DNA are known to modulate risk for many polygenic diseases--with oncogenic, neuropsychiatric, and inflammatory etiologies, among others. As these variants are mostly in non-coding regions of the genome, assessing the functional impact of variants within the cell and how they might contribute to disease pathogenesis remains a computational and experimental challenge. In this dissertation, I present data representation and modeling strategies to understand cell-type specific transcription regulatory mechanisms in healthy cells and analyze how these mechanisms are disrupted in complex polygenic diseases. First, I use an epigenomic and transcriptomic resource of 15 different epithelial cell types to extract features of transcription regulation from the dataset in an unbiased manner and to build a data representation of transcription regulation that accurately predicts cell type. I then show that different patterns of transcription factor motif cooperativity exist in healthy cells and that this synergistic behavior is altered in the context of cancer. Next, as part of a collaborative effort, I generate a compendium of non-coding single nucleotide variants (SNVs) for neuropsychiatric disorders that alter transcription-regulatory activity in a developing neural cell system. The analytical framework built from this dataset links regulatory variants to disease-relevant genes, pathomechanisms, therapeutics, and clinical manifestations of disease using expression-gene mapping, network analyses, chromatin looping, and statistical enrichment methods. Additionally, I develop a network-based approach to identify genes relevant for psychiatric disease diagnosis and to prioritize treatment options. Finally, I add a perspective on the strengths and limitations of modern network methods--including network building strategies, machine learning techniques, and inference applications--for biological applications. In summary, I hope that these resources and methods--which provide further insight into transcription regulation--have translatable uses in the clinical setting for complex disease prognosis, diagnosis, and therapy selection.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2021; ©2021
Publication date 2021; 2021
Issuance monographic
Language English

Creators/Contributors

Author Guo, Margaret Gan
Degree supervisor Altman, Russ
Degree supervisor Khavari, Paul A
Thesis advisor Altman, Russ
Thesis advisor Khavari, Paul A
Thesis advisor Montgomery, Stephen, 1979-
Thesis advisor Salzman, Julia
Degree committee member Montgomery, Stephen, 1979-
Degree committee member Salzman, Julia
Associated with Stanford University, Department of Biomedical Informatics

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Margaret Gan Guo.
Note Submitted to the Department of Biomedical Informatics.
Thesis Thesis Ph.D. Stanford University 2021.
Location https://purl.stanford.edu/gh371fv2168

Access conditions

Copyright
© 2021 by Margaret Gan Guo
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...