Decoding transcription regulatory patterns in health and disease
Abstract/Contents
- Abstract
- Transcriptional control of gene expression is known to be a cell-type specific process that depends on many different factors: including the underlying DNA sequence, combinatorial patterns of transcription factors (TFs) and their associated motifs, chromatin accessibility, and DNA-DNA looping contacts. However, modeling the relationship between DNA and its regulatory elements while learning how these regulatory complexes lead to cell-type specific function within a 3-dimensional context remains difficult. Additionally, sequence variants within DNA are known to modulate risk for many polygenic diseases--with oncogenic, neuropsychiatric, and inflammatory etiologies, among others. As these variants are mostly in non-coding regions of the genome, assessing the functional impact of variants within the cell and how they might contribute to disease pathogenesis remains a computational and experimental challenge. In this dissertation, I present data representation and modeling strategies to understand cell-type specific transcription regulatory mechanisms in healthy cells and analyze how these mechanisms are disrupted in complex polygenic diseases. First, I use an epigenomic and transcriptomic resource of 15 different epithelial cell types to extract features of transcription regulation from the dataset in an unbiased manner and to build a data representation of transcription regulation that accurately predicts cell type. I then show that different patterns of transcription factor motif cooperativity exist in healthy cells and that this synergistic behavior is altered in the context of cancer. Next, as part of a collaborative effort, I generate a compendium of non-coding single nucleotide variants (SNVs) for neuropsychiatric disorders that alter transcription-regulatory activity in a developing neural cell system. The analytical framework built from this dataset links regulatory variants to disease-relevant genes, pathomechanisms, therapeutics, and clinical manifestations of disease using expression-gene mapping, network analyses, chromatin looping, and statistical enrichment methods. Additionally, I develop a network-based approach to identify genes relevant for psychiatric disease diagnosis and to prioritize treatment options. Finally, I add a perspective on the strengths and limitations of modern network methods--including network building strategies, machine learning techniques, and inference applications--for biological applications. In summary, I hope that these resources and methods--which provide further insight into transcription regulation--have translatable uses in the clinical setting for complex disease prognosis, diagnosis, and therapy selection.
Description
Type of resource | text |
---|---|
Form | electronic resource; remote; computer; online resource |
Extent | 1 online resource. |
Place | California |
Place | [Stanford, California] |
Publisher | [Stanford University] |
Copyright date | 2021; ©2021 |
Publication date | 2021; 2021 |
Issuance | monographic |
Language | English |
Creators/Contributors
Author | Guo, Margaret Gan |
---|---|
Degree supervisor | Altman, Russ |
Degree supervisor | Khavari, Paul A |
Thesis advisor | Altman, Russ |
Thesis advisor | Khavari, Paul A |
Thesis advisor | Montgomery, Stephen, 1979- |
Thesis advisor | Salzman, Julia |
Degree committee member | Montgomery, Stephen, 1979- |
Degree committee member | Salzman, Julia |
Associated with | Stanford University, Department of Biomedical Informatics |
Subjects
Genre | Theses |
---|---|
Genre | Text |
Bibliographic information
Statement of responsibility | Margaret Gan Guo. |
---|---|
Note | Submitted to the Department of Biomedical Informatics. |
Thesis | Thesis Ph.D. Stanford University 2021. |
Location | https://purl.stanford.edu/gh371fv2168 |
Access conditions
- Copyright
- © 2021 by Margaret Gan Guo
- License
- This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).
Also listed in
Loading usage metrics...