Complex Genomic Patterns Characterize Variants in Transcription Factor Binding Associated with Gene Expression


Abstract/Contents

Abstract
Although allele-specific transcription factor (TF) binding has been identified at a large number of human genetic variants, known as TF binding quantitative trait loci (bQTLs), very few bQTLs are also linked to detectable changes in gene expression. Classifying the bQTLs that are also gene expression QTLs (eQTLs) is a challenging problem with the potential to illuminate broad patterns underlying TF involvement in gene regulation and the implications for downstream phenotype. We used a machine learning classifier to predict, with high performance and from a relatively small number of functional genomics features, which bQTLs are also eQTLs. We show that linear and three-dimensional proximity between a bQTL and its target gene are most indicative of an association with expression, followed by complex interactions involving genomic structure, multiple TFs, and epigenetic modification. We also show that bQTLs can mediate gene co-expression and long-range effects on transcription. Our results suggest that bQTLs are marked by diverse yet consistent genomic signatures that invite future inquiry into the broader regulatory and functional significance of TFs.

Description

Type of resource text
Date created June 10, 2016

Creators/Contributors

Author Hie, Brian
Primary advisor Kundaje, Anshul
Advisor Fraser, Hunter
Advisor Kaplow, Irene
Degree granting institution Stanford University, Department of Computer Science

Subjects

Subject computer science
Subject computational biology
Subject machine learning
Subject genetics
Subject genomics
Subject gene expression
Subject transcription factor
Genre Thesis

Bibliographic information

Access conditions

Use and reproduction
User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
License
This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported license (CC BY-NC).

Preferred citation

Preferred Citation
Hie, Brian. (2016). Complex Genomic Patterns Characterize Variants in Transcription Factor Binding Associated with Gene Expression. Stanford Digital Repository. Available at: http://purl.stanford.edu/zn578yv9941

Collection

Undergraduate Theses, School of Engineering
