| Literature DB >> 18700012 |
Mara K N Lawniczak1, Alisha K Holloway, David J Begun, Corbin D Jones.
Abstract
BACKGROUND: Understanding how DNA sequence polymorphism relates to variation in gene expression is essential to connecting genotypic differences with phenotypic differences among individuals. Addressing this question requires linking population genomic data with gene expression variation.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18700012 PMCID: PMC2575515 DOI: 10.1186/gb-2008-9-8-r125
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Gene feature length, polymorphism and divergence by gene expression variation for each sex
| Male* | Female* | ||||||||
| Genome average | NS† | SIG‡ | NS† | SIG‡ | |||||
| Number of genes | 6,687 | 1,262 | 6,405 | 723 | |||||
| Length | |||||||||
| EXON | 1,675 | 1,726 | 1,357 | 67.07 | *** | 1,768 | 1,416 | 36.94 | *** |
| 5'UTR | 239 | 251 | 198 | 59.68 | *** | 248 | 216 | 16.59 | *** |
| Intron | 2,493 | 2,750 | 1,764 | 16.14 | *** | 2,598 | 2,390 | 4.51 | 0.0336 |
| Number of introns | 3.55 | 3.69 | 3.11 | 16.42 | *** | 3.67 | 3.11 | 13.68 | 0.0002 |
| 3'UTR | 392 | 418 | 299 | 96.22 | *** | 414 | 353 | 28.52 | *** |
| Polymorphism | |||||||||
| CPR | 0.0290 | 0.0290 | 0.0284 | 0.88 | 0.3479 | 0.0297 | 0.0304 | 0.32 | 0.5727 |
| 5'UTR | 0.0112 | 0.0108 | 0.0127 | 13.34 | 0.0003 | 0.0108 | 0.0122 | 5.94 | 0.0148 |
| Nonsynonymous | 0.0024 | 0.0022 | 0.0029 | 43.56 | *** | 0.0021 | 0.0026 | 21.63 | *** |
| Synonymous | 0.0318 | 0.0308 | 0.0357 | 62.93 | *** | 0.0310 | 0.0355 | 28.04 | *** |
| First intron | 0.0277 | 0.0274 | 0.0294 | 6.45 | 0.0100 | 0.0266 | 0.0284 | 6.82 | 0.0090 |
| All introns | 0.0302 | 0.0297 | 0.0324 | 12.53 | 0.0004 | 0.0290 | 0.0317 | 9.56 | 0.0020 |
| 3'UTR | 0.0122 | 0.0114 | 0.0156 | 66.80 | *** | 0.0110 | 0.0151 | 54.52 | *** |
| Divergence¶ | |||||||||
| CPR | 0.0525 | 0.0532 | 0.0468 | 26.96 | *** | 0.0543 | 0.0514 | 3.16 | 0.0757 |
| 5'UTR | 0.0229 | 0.0224 | 0.0225 | 0.01 | 0.9063 | 0.0223 | 0.0216 | 0.11 | 0.7392 |
| Nonsynonymous | 0.0060 | 0.0057 | 0.0065 | 17.96 | *** | 0.0049 | 0.0054 | 13.64 | 0.0002 |
| Synonymous | 0.0531 | 0.0526 | 0.0538 | 5.41 | 0.0200 | 0.0522 | 0.0541 | 5.79 | 0.0160 |
| First intron | 0.0463 | 0.0457 | 0.0472 | 3.07 | 0.0797 | 0.0448 | 0.0480 | 3.70 | 0.0546 |
| All introns | 0.0487 | 0.0480 | 0.0503 | 4.98 | 0.0256 | 0.0472 | 0.0512 | 9.11 | 0.0025 |
| 3'UTR | 0.0228 | 0.0217 | 0.0256 | 22.61 | *** | 0.0209 | 0.0244 | 20.22 | *** |
*Male and female sets include genes that are expressed in that sex, but may also be expressed in the other sex. †NS, not significantly differentially expressed between genotypes (AOV p-value > 0.001). ‡SIG, significantly differentially expressed between genotypes (AOV p-value ≤ 0.001). §X2 and p-values derived from Kruskal Wallis; three asterisks denote p-value < 0.0001. ¶Divergence refers to lineage specific divergence along the D. simulans branch.
Figure 1Significant expression variation between genotypes is associated with elevated levels of sequence polymorphism at most types of sites. The y-axis is the per site nucleotide diversity (note: axis scale varies by feature). The pink line indicates the genomic mean nucleotide diversity and yellow lines indicate 95% confidence intervals around the genomic mean. The x-axis represents the level of expression variation between genotypes for the different gene features as named (5'UTR, untranslated region; CPR, core promoter region; NonSyn, nonsynonymous sites; Syn, synonymous sites). P-values from the AOV of expression variation were sorted and grouped into 15 equal sized bins. Bins on the left side of the figure have no evidence of expression variation and bins on the right have the most variably expressed genes. For each bin, blue circles represent the mean nucleotide diversity with standard error bars. Permutation tests examined whether nucleotide diversity was higher within each bin than in a random sample of genes from the genome. The asterisk marks the bin in which an average p-value = 0.05 occurs. To the right of the asterisk, a positive trend is observed in some gene features, suggesting that the positive relationship between gene expression variation and nucleotide polymorphism is not solely confined to the most dramatically differentially expressed genes.
Gene feature length, polymorphism and divergence in males for genes with high, average and low levels of expression
| Low | Average | High | Tukey's HSD summary* | |||
| Number of genes | 2,073 | 4,167 | 1,709 | |||
| Length | ||||||
| EXON | 1,874 | 1,747 | 1,225 | L>A>H | 306.91 | *** |
| 5'UTR | 282 | 240 | 207 | L>A>H | 16.62 | 0.0002 |
| Intron | 3,644 | 2,521 | 1,450 | L>A>H | 7.11 | 0.0286 |
| Number of introns | 3.86 | 3.75 | 2.88 | L=A>H | 63.89 | *** |
| 3'UTR | 503 | 380 | 338 | L>A>H | 77.77 | *** |
| Polymorphism | ||||||
| CPR | 0.0277 | 0.0295 | 0.0290 | L=A=H | 13.40 | 0.0012 |
| 5'UTR | 0.0114 | 0.0116 | 0.0097 | L=A>H | 24.01 | *** |
| Nonsynonymous | 0.0029 | 0.0023 | 0.0016 | L>A>H | 245.83 | *** |
| Synonymous | 0.0335 | 0.0322 | 0.0277 | L>A>H | 86.68 | *** |
| First intron | 0.0290 | 0.0276 | 0.0263 | L≥A≥H | 7.58 | 0.0226 |
| All introns | 0.0317 | 0.0301 | 0.0284 | L≥A≥H | 9.52 | 0.0086 |
| 3'UTR | 0.0131 | 0.0122 | 0.0109 | L=A>H | 41.23 | *** |
| Divergence‡ | ||||||
| CPR | 0.0493 | 0.0533 | 0.0528 | A=H>L | 19.77 | *** |
| 5'UTR | 0.0225 | 0.0232 | 0.0208 | A≥L≥H | 12.66 | 0.0018 |
| Nonsynonymous | 0.0066 | 0.0060 | 0.0047 | L>A>H | 155.62 | *** |
| Synonymous | 0.0524 | 0.0543 | 0.0494 | A≥L≥H | 35.69 | *** |
| First intron | 0.0475 | 0.0463 | 0.0433 | L=A>H | 7.79 | 0.0203 |
| All introns | 0.0483 | 0.0492 | 0.0462 | A≥L≥H | 7.06 | 0.0293 |
| 3'UTR | 0.0237 | 0.0225 | 0.0204 | L=A>H | 22.83 | *** |
*L, low expression; A, average expression; H, high expression (see Materials and methods). †X2 and p-values derived from Kruskal Wallis; three asterisks denote p-value < 0.0001. ‡Divergence refers to lineage specific divergence along the D. simulans branch.
Gene feature length, polymorphism and divergence in females for genes with high, average and low levels of expression
| Low | Average | High | Tukey's HSD summary* | |||
| Number of genes | 1,652 | 3,999 | 1,477 | |||
| Length | ||||||
| EXON | 1,877 | 1,825 | 1,319 | L=A>H | 198.71 | *** |
| 5'UTR | 287 | 241 | 213 | L>A>H | 15.53 | 0.0004 |
| Intron | 3,845 | 2,521 | 1,386 | L>A>H | 20.64 | *** |
| Number of introns | 3.97 | 3.71 | 2.94 | L=A>H | 48.06 | *** |
| 3'UTR | 547 | 366 | 384 | L>H=A | 108.03 | *** |
| Polymorphism | ||||||
| CPR | 0.0260 | 0.0304 | 0.0315 | H=A>L | 59.21 | *** |
| 5'UTR | 0.0110 | 0.0115 | 0.0094 | A=L>H | 28.06 | *** |
| Nonsynonymous | 0.0028 | 0.0021 | 0.0013 | L>A>H | 341.18 | *** |
| Synonymous | 0.0337 | 0.0325 | 0.0259 | L=A>H | 148.96 | *** |
| First intron | 0.0283 | 0.0272 | 0.0240 | L=A>H | 19.94 | *** |
| All introns | 0.0309 | 0.0298 | 0.0260 | L=A>H | 24.77 | *** |
| 3'UTR | 0.0136 | 0.0116 | 0.0089 | L>A>H | 106.22 | *** |
| Divergence‡ | ||||||
| CPR | 0.0452 | 0.0550 | 0.0597 | H>A>L | 132.46 | *** |
| 5'UTR | 0.0218 | 0.0229 | 0.0209 | A≥L≥H | 6.70 | 0.0350 |
| Nonsynonymous | 0.0066 | 0.0048 | 0.0034 | L>A>H | 243.38 | *** |
| Synonymous | 0.0513 | 0.0546 | 0.0476 | A≥L≥H | 79.80 | *** |
| First intron | 0.0471 | 0.0459 | 0.0411 | L=A>H | 13.88 | 0.0010 |
| All introns | 0.0482 | 0.0489 | 0.0437 | A=L>H | 22.22 | *** |
| 3'UTR | 0.0241 | 0.0221 | 0.0168 | 74.98 | *** |
*L, low expression; A, average expression; H, high expression (see Materials and methods). †X2 and p-values derived from Kruskal Wallis; three asterisks denote p-value < 0.0001. ‡Divergence refers to lineage specific divergence along the D. simulans branch.
Gene feature length, polymorphism and divergence for sex-specific*, sex-biased*, and unbiased genes
| Tukey's HSD summary‡ | Summary§ | |||
| Number of genes | ||||
| Length | ||||
| EXON | 247.10 | *** | Fb≥Fs≥U≥Mb≥Ms | F>U>M |
| 5'UTR | 133.27 | *** | U,Fb≥Mb≥Ms,Fs | NSS>SS |
| Intron | 131.81 | *** | U≥Mb,Fb,Ms>Fs | NSS>SS |
| Number of introns | 64.44 | *** | U,Fb≥Mb≥Ms,Fs | NSS>SS |
| 3'UTR | 236.01 | *** | U≥Fb≥Mb>Ms,Fs | NSS>SS |
| 5' intergenic | 291.9 | *** | Ms>Mb,U,Fs≥Fb | M>F,U |
| 3' intergenic | 274.6 | *** | Ms≥Mb≥U,Fb,Fs | M>F,U |
| Polymorphism | ||||
| CPR | 79.64 | *** | Fb,Fs,U>Ms,Mb | F,U>M |
| 5'UTR | 22.14 | 0.0002 | Ms,Fs≥Fb,Mb≥U | SS>NSS |
| Nonsynonymous | 305.11 | *** | Ms≥Fs≥Mb≥U,Fb | SS>NSS |
| Synonymous | 33.62 | *** | Fs,Ms≥Mb≥U,Fb | SS>NSS |
| First intron | 59.49 | *** | Ms≥Mb≥Fs,U,Fb | M>F,U |
| All introns | 48.10 | *** | Ms≥Mb≥Fs,Fb,U | M>F,U |
| 3'UTR | 156.48 | *** | Ms≥Mb≥Fs>U>Fb | M>F,U |
| Divergence¶ | ||||
| CPR | 212.79 | *** | Fb,Fs>U>Ms,Mb | F>U>M |
| 5'UTR | 80.02 | *** | Fs≥Ms>Fb≥Mb≥U | SS>NSS |
| Nonsynonymous | 533.92 | *** | Ms>Fs,Mb>Fb,U | SS>NSS |
| Synonymous | 81.82 | *** | Ms≥Fs,Fb,Mb≥U | SS>NSS |
| First intron | 68.47 | *** | Ms,Fs≥Mb≥Fb,U | SS>NSS |
| All introns | 55.72 | *** | Ms≥Fs≥Mb≥Fb,U | SS>NSS |
| 3'UTR | 259.87 | *** | Ms≥Fs≥Mb≥Fb≥U | SS>NSS |
*Male- and female-specific sets include genes that are expressed only in that sex, whereas sex-biased are expressed, on average, three-fold higher in one sex than the other. †X2 and p-values derived from Kruskal Wallis; three asterisks denote p-value < 0.0001. ‡Ms, male-specific; Mb, male-biased; Fs, female-specific; Fb, female-biased; U, unbiased. §F, female; M, male; U, unbiased; NSS, non-sex-specific; SS, sex-specific. ¶Divergence refers to lineage specific divergence along the D. simulans branch.