| Literature DB >> 23275886 |
Martha H Elmore1, John G Gibbons, Antonis Rokas.
Abstract
Copy number polymorphisms of nucleotide tandem repeat (TR) regions, such as microsatellites and minisatellites, are mutationally reversible and highly abundant in eukaryotic genomes. Studies linking TR polymorphism to phenotypic variation have led some to suggest that TR variation modulates and majorly contributes to phenotypic variation; however, studies in which the authors assess the genome-wide impact of TR variation on phenotype are lacking. To address this question, we quantified relationships between polymorphism levels in 143 genome-wide promoter region TRs across 16 isolates of the filamentous fungus Aspergillus flavus and its ecotype Aspergillus oryzae with expression levels of their downstream genes. We found that only 4.3% of relationships tested were significant; these findings were consistent with models in which TRs act as "tuning," "volume," or "optimality" "knobs" of phenotype but not with "switch" models. Furthermore, the promoter regions of differentially expressed genes between A. oryzae and A. flavus did not show TR enrichment, suggesting that genome-wide differences in molecular phenotype between the two species are not significantly associated with TRs. Although in some cases TR polymorphisms do contribute to transcript abundance variation, these results argue that at least in this case, TRs might not be major modulators of variation in phenotype.Entities:
Keywords: RNA-Seq; genotype−phenotype map; molecular phenotype; tandem repeat copy number polymorphism
Mesh:
Year: 2012 PMID: 23275886 PMCID: PMC3516485 DOI: 10.1534/g3.112.004663
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1 Design of the experiment to test the genome-wide effect of TR variation in modulating molecular phenotype.
Figure 2 Select patterns of significant relationships between promoter region TR polymorphism and variation in gene expression in A. flavus. For each example, the regression fit, TR type (microsatellite or minisatellite), locus identifier (in parentheses), regression R, and regression P values are provided. Each plot shows the NAL of TR on the x-axis and gene expression level (RPKM) on the y-axis.
Loci in which promoter region TR variation correlates significantly with gene expression variation
| Type | Species | Gene Promoter | Function | Best-Fit Regression | R2 | |
|---|---|---|---|---|---|---|
| Microsatellite | AO090012000588 | SNF2 family helicase/ATPase | Linear (squared) | 0.72 | 0.008 | |
| AO090012000871 | PAP2 superfamily | Linear (logarithmic) | 0.78 | 0.009 | ||
| AO090102000623 | HLH transcription factor | Quadratic | 0.87 | 0.006 | ||
| AO090206000041 | F-box domain | Cubic | 0.94 | 0.007 | ||
| AO090701000151 | Growth-arrest-specific protein 2 domain | Cubic | 0.95 | 0.004 | ||
| AO090701000375 | RhoGAP domain | Quadratic | 0.96 | 0.002 | ||
| AO090005000013 | Uncharacterized protein | Cubic | 0.96 | 1.00E-04 | ||
| combined | AO090003000121 | 6-phosphogluconate dehydrogenase | Cubic | 0.67 | 0.009 | |
| AO090012000871 | PAP2 domain-containing protein | Cubic | 0.64 | 0.009 | ||
| Minisatellite | AO090005000959 | Hypothetical protein | Linear (reciprocal) | 0.83 | 0.002 | |
| AO090038000210 | Polyketide synthase | Linear (square root) | 0.71 | 0.008 | ||
| AO090102000623 | HLH transcription factor | Linear (reciprocal) | 0.98 | 1.55E-05 | ||
| AO090005000567 | Hypothetical protein | Cubic | 1.00 | 2.77E-27 | ||
| AO090009000040 | Hypothetical protein | Cubic | 0.98 | 0.001 | ||
| AO090010000582 | Eukaryotic-type carbonic anhydrase | Cubic | 0.95 | 0.004 | ||
| combined | AO090005000567 | Hypothetical protein | Quadratic | 0.74 | 2.00E-04 | |
| AO090102000623 | HLH transcription factor | Linear (untransformed) | 0.46 | 0.005 |
TR, tandem repeats.
Figure 3 Gene expression variance is not increased in genes with promoter region TRs. Expression variance comparison between genes containing promoter region TRs and those without promoter region TRs but are located in the same genomic region (A and B) or have similar functions (C). For each box plot, the horizontal line represents the sample median, the box extends from the first to the third quartile, and the whiskers extend to the interquartile ranges. For (C), “TR” represents the TR dataset (combined microsatellite and minisatellite), whereas numbers 1–10 represent each of the randomly selected sets of genes with similar functional classifications lacking promoter region TRs.