| Literature DB >> 25473428 |
Abstract
The diversity of regulatory genetic variants and their mechanisms of action reflect the complexity and context-specificity of gene regulation. Regulatory variants are important in human disease and defining such variants and establishing mechanism is crucial to the interpretation of disease-association studies. This review describes approaches for identifying and functionally characterizing regulatory variants, illustrated using examples from common diseases. Insights from recent advances in resolving the functional epigenomic regulatory landscape in which variants act are highlighted, showing how this has enabled functional annotation of variants and the generation of hypotheses about mechanism of action. The utility of quantitative trait mapping at the transcript, protein and metabolite level to define association of specific genes with particular variants and further inform disease associations are reviewed. Establishing mechanism of action is an essential step in resolving functional regulatory variants, and this review describes how this is being facilitated by new methods for analyzing allele-specific expression, mapping chromatin interactions and advances in genome editing. Finally, integrative approaches are discussed together with examples highlighting how defining the mechanism of action of regulatory variants and identifying specific modulated genes can maximize the translational utility of genome-wide association studies to understand the pathogenesis of diseases and discover new drug targets or opportunities to repurpose existing drugs to treat them.Entities:
Year: 2014 PMID: 25473428 PMCID: PMC4254439 DOI: 10.1186/s13073-014-0092-4
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Examples of online data resources and tools for analysis of putative regulatory variants
|
|
|
|
|---|---|---|
| ENCODE | Encyclopedia of DNA Elements Project |
|
| FANTOM | Functional Annotation of the Mammalian Genome project |
|
| International Human Epigenome Consortium | International Human Epigenome Consortium Data Portal |
|
| Roadmap Epigenomics Project | NIH Roadmap Epigenomics Mapping Consortium, including links to data |
|
| BLUEPRINT | European hematopoietic epigenome project |
|
| Variant Annotation Integrator (UCSC) | Tool for predicting functional effects of variants on transcripts |
|
| Variant Effect Predictor (Ensembl) | Integrated tool resolving effects of variant on regulatory regions, genes, transcripts and protein |
|
| RegulomeDB | Tool for functional annotation of SNVs including known and predicted regulatory elements and eQTLs |
|
| SNPnexus | Integrated functional annotation of SNVs |
|
| JASPAR | Transcription factor binding profile database |
|
| PROMO | Transcription factor binding site analysis |
|
| MAPPER2 | Identification of transcription factor binding sites in multiple genomes |
|
| HaploReg | Functional annotation of variants on haplotype blocks such as at GWAS loci |
|
| GWAS3D | Integrated annotation of variants including chromatin interactions |
|
| ORegAnno | Regulatory annotation database |
|
| ConSite | Transcription factor binding site detection using phylogenetic footprinting |
|
| HGMD | Human Gene Mutation Database, including regulatory mutations |
|
| Genevar | eQTL database integration, search and visualization |
|
| eQTL Browser | NCBI hosted browser to interrogate eQTL datasets |
|
| OMICStools | Links to a large number of multi-omics tools |
|
eQTL, expression quantitative trait locus; GWAS, genome-wide association study; SNV, single-nucleotide variant.
Figure 1Examples of local and distant effects of regulatory variants. (a) A local cis-acting variant (red star, top) in a regulatory element (red line) affects allele-specific transcription factor binding affinity and is associated with differential expression of gene A (as shown by the chart, bottom), with possession of a copy of the A allele associated with higher expression than the G allele (hence AA homozygotes having higher expression than AG heterozygotes, with lowest expression in GG homozygotes). (b) The same variant can modulate expression of gene D at a distance through DNA looping that brings the regulatory enhancer element close to the promoter of gene D (gray line) on the same chromosome. (c) An example of a local cis-acting variant modulating expression of a transcription factor encoding gene, Gene E, differential expression of which modulates a set of target genes. Expression of these target genes is found to be associated in trans with the variant upstream of gene E. (d) A local cis-acting variant on chromosome 12 modulates expression of a cytokine gene and is also associated in trans with a set of genes whose expression is regulated through a signaling cascade determined by that cytokine. Such trans associations can be shown on a circos plot (chromosomes labeled 1-22 with arrows pointing to location of gene on a given chromosome).
Figure 2Overview of the Capture-C approach. Capture-C [36] enables mapping of chromatin interactions, in this example between a regulatory element (within the region denoted by a red line) and a gene promoter (gray line). Crosslinking and high-efficiency restriction digestion followed by proximity ligation (in which close proximity will favor ligation taking place, in this example generating red-gray lines in contrast to black lines representing other ligation events) allows such interactions to be defined. A 3C library is generated, sonicated and end repair performed with ligation of adaptors (dark gray boxes). Capture of target regions of interest (in this example target is region denoted by red line) involves oligonucleotide capture technology (capture probes denoted by red hexagons with yellow centers). Sequencing using end-ligated adapters allows genome-wide sites of interaction to be revealed. The approach can be multiplexed to several hundred targets.
Figure 3Overview of the CRISPR-Cas9 system. Cas-9 is a nuclease that makes a double-strand break at a location defined by a guide RNA [108]. The latter comprises a scaffold (red) and a 20-nucleotide guide sequence (blue) that pairs with the DNA target immediately upstream of a 5’-NGG motif (this motif varies depending on the exact bacterial species of origin of the CRISPR used). There are two main approaches that can be followed. (Left) Repair of the double-strand break by nonhomologous end joining can be used to knock out gene function though incorporation of random indels at junction sites, where these occur within coding exons, leading to frameshift mutations and premature stop codons. (Right) Homology-directed repair can enable precise genome editing through the use of dsDNA-targeting constructs flanking insertion sequences or single-stranded DNA oligonucleotides to introduce single-nucleotide changes. Adapted with permission from [108].