| Literature DB >> 28525990 |
Cong Guo1,2, Ian C McDowell1,3, Michael Nodzenski4, Denise M Scholtens4, Andrew S Allen5,6, William L Lowe7, Timothy E Reddy8,9,10.
Abstract
BACKGROUND: Transversions (Tv's) are more likely to alter the amino acid sequence of proteins than transitions (Ts's), and local deviations in the Ts:Tv ratio are indicative of evolutionary selection on genes. Whether the two different types of mutations have different effects in non-protein-coding sequences remains unknown. Genetic variants primarily impact gene expression by disrupting the binding of transcription factors (TFs) and other DNA-binding proteins. Because Tv's cause larger changes in the shape of a DNA backbone, we hypothesized that Tv's would have larger impacts on TF binding and gene expression.Entities:
Keywords: Massively parallel reporter assay; Regulatory variation; SNPs; Transitions; Transversions
Mesh:
Substances:
Year: 2017 PMID: 28525990 PMCID: PMC5438547 DOI: 10.1186/s12864-017-3785-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Tv’s have greater effects on DNA shape. Each plot shows the estimated effect of Ts’s and Tv’s placed in the center of random 500 bp nucleotide sequences on (a) minor groove width, (b) roll, (c), propeller twist, and (d) helical twist, as estimated by DNAshapeR. Each plot shows data for all four possible variants of 100,000 random sequences, for a total of 400,000 random sequences analyzed
Fig. 2Tv’s are depleted at TF binding sites. a, b Changes to PWM scores for JASPAR TFs caused by Ts’s (grey boxplots) and Tv’s (white boxplots) ordered by normalized position (above) and information content (below). For each position in each consensus sequence, we calculated the position specific scoring matrix (PSSM) score of having every possible nucleotide at that position and, subsequently, the change in PSSM score when mutating any nucleotide to any other nucleotide at that position. Black squares are changes >1.5× above the interquartile range. P-values for parameter estimates were calculated using a t test, and are reported without adjustment for multiple hypothesis testing. We tested 11 and 20 hypotheses in Fig. 2a and b, respectively. Applying a Bonferroni correction to a nominal p-value threshold of α = 0.05 gives a significance threshold of α = 0.005 and α = 0.0025 for 2A and 2B, respectively
Fig. 3Tv’s are enriched in allele-specific transcription factor binding sites across the genome. The percentage of Tv’s in allele-specific CTCF across six LCL lines (left), and for allele-specific binding across seven TFs (right) in a publically available database [30]. P-values and parameter estimates were calculated using a two-tailed Z-test. Error bars show the s.e.m
Fig. 4Tv’s have greater regulatory effects than Ts’s. a Schematic of POP-STARR experimental design. b Changes in haplotype effect magnitudes due to the presence of a Ts or Tv within a haplotype. Black squares are effects 1.5× of the interquartile range above the upper quartile or below the lower quartile. P-values were calculated using a t test. c Correlation between change in haplotype effect magnitude and the number of Ts/Tv differences between haplotypes. The red line is the regression line. P-values were calculated using a t test. d Correlation between change in haplotype effect magnitude and the number of Ts/Tv differences between haplotypes which overlap DHS centers. The red line is the regression line. P-values were calculated using a t test. e Boxplots of the effect of transitions and transversions on regulatory element activity as measured in Patwardhan et al., [19]. Each plot shows data from a different enhancer. Each plot shows the absolute log2(fold change) in regulatory element activity for every Ts and Tv assayed in that enhancer. P-values are for a one-sided t test from a linear regression model