| Literature DB >> 30287792 |
Aaron Sievers1, Frederik Wenz2, Michael Hausmann3, Georg Hildenbrand4,5.
Abstract
In this study, we pairwise-compared multiple genome regions, including genes, exons, coding DNA sequences (CDS), introns, and intergenic regions of 39 Animalia genomes, including Deuterostomia (27 species) and Protostomia (12 species), by applying established k-mer-based (alignment-free) comparison methods. We found strong correlations between the sequence structure of introns and intergenic regions, individual organisms, and within wider phylogenetical ranges, indicating the conservation of certain structures over the full range of analyzed organisms. We analyzed these sequence structures by quantifying the contribution of different sets of DNA words to the average correlation value by decomposing the correlation coefficients with respect to these word sets. We found that the conserved structures within introns, intergenic regions, and between the two were mainly a result of conserved tandem repeats with repeat units ≤ 2 bp (e.g., (AT)n), while other conserved sequence structures, such as those found between exons and CDS, were dominated by tandem repeats with repeat unit sizes of 3 bp in length and more complex DNA word patterns. We conclude that the conservation between intron and intergenic regions indicates a shared function of these sequence structures. Also, the similar differences in conserved structures with known origin, especially to the conservation between exons and CDS resulting from DNA codons, indicate that k-mer composition-based functional properties of introns and intergenic regions may differ from those of exons and CDS.Entities:
Keywords: 3D conformation of DNA; Animalia; alignment-free; intergenic region; intron; k-mer; sequence analysis; sequence patterns; tandem repeats
Year: 2018 PMID: 30287792 PMCID: PMC6211125 DOI: 10.3390/genes9100482
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Organisms and chromosomes used in our analysis (accession numbers of sequences are found within the appendix, Table S1).
| Species | Further Classification | Chromosomes |
|---|---|---|
|
| Deuterostomia—Mammalia | 1–22, X, Y |
|
| Deuterostomia—Mammalia | 1, 2A, 2B, 3-22, X, Y |
|
| Deuterostomia—Mammalia | 1–19, X, Y |
|
| Deuterostomia—Mammalia | 1–20, X, Y |
|
| Deuterostomia—Mammalia | 1–21, X |
|
| Deuterostomia—Mammalia | 1–38, X |
|
| Deuterostomia—Mammalia | A1–A3, B1–B4, C1, C2, D1–D4, E1–E3, F1, F2, X |
|
| Deuterostomia—Mammalia | 1–31, X |
|
| Deuterostomia—Mammalia | 1–29 |
|
| Deuterostomia—Mammalia | 1–29, X |
|
| Deuterostomia—Mammalia | 1–18, X, Y |
|
| Deuterostomia—Mammalia | 1–8, X |
|
| Deuterostomia—Mammalia | 1–7, 10–12, 14, 15, 17, 18, 20, X1, X2, X3, X5 |
|
| Deuterostomia—Aves | 1–28, 30, 33, W, Z |
|
| Deuterostomia—Aves | 1, 1A, 1B, 2-4, 4A, 5–15, 17–28, Z |
|
| Deuterostomia—Aves | 1–15, 17–28, 1A, 4A, Z |
|
| Deuterostomia—Reptilia | 1–11, 13, 15, 17, 19, 21, 22, 24, 25 |
|
| Deuterostomia—Reptilia | 1–6, a, b, c, d, f, g, h |
|
| Deuterostomia—Amphibia | 1–10 |
|
| Deuterostomia—Osteichthyes | 1–22 |
|
| Deuterostomia—Osteichthyes | 1–20, W, Z |
|
| Deuterostomia—Osteichthyes | 1, 2, 3a, 3b, 4–22 |
|
| Deuterostomia—Osteichthyes | 1–23 |
|
| Deuterostomia—Osteichthyes | 1–25 |
|
| Deuterostomia—Osteichthyes | 1–29 |
|
| Deuterostomia—Osteichthyes | 1–29 |
|
| Deuterostomia—Ascidiae | 1–14 |
|
| Protostomia—Trematoda | 1–7, W |
|
| Protostomia—Secernentea | 1, 2, X |
|
| Protostomia—Secernentea | 1–5, X |
|
| Protostomia—Secernentea | 1–5, X |
|
| Protostomia—Insecta | 1–16 |
|
| Protostomia—Insecta | 1–18 |
|
| Protostomia—Insecta | 1–5 |
|
| Protostomia—Insecta | 2L, 3L, 2R, 3R, X |
|
| Protostomia—Insecta | 2, 3 |
|
| Protostomia—Insecta | 2L, 2R, 3L, 3R, 4, X, Y |
|
| Protostomia—Insecta | 2L, 3L, 2R, 3R, 4, X, |
Definitions of genome sequence regions.
| Region ID | Definition used for the Algorithm | Definition/Description |
|---|---|---|
| Genome | Complete genome sequence | Complete genome sequence of the respective organism. |
| Genes | Genes * | Genes of a respective organism (including pseudo and RNA genes **). |
| Exons | Exons * | Exons (transcribed parts) of genes |
| (including pseudo and RNA genes). | ||
| Introns | Genes * with exons * masked | Introns (untranscribed parts) of genes |
| (including pseudo and RNA genes). | ||
| CDS | Coding DNA sequences (CDS *) | CDS (translated parts) of exons |
| (exons without untranslated regions (UTRs)). | ||
| Intergenic | Complete genome sequence with genes (and centromeres) masked. | Non-coding regions between genes (RNA genes are also considered genes in this context). |
*: Referring to the labels found in respective GenBank files [14]. **: RNA genes including tRNA, ncRNA (e.g., long ncRNA, micro RNA, snoRNA).
Figure 1Mean correlation values over all organsims against word length k. Shown are the mean values and uncertainties (see Section 2.1 for details) for k-mer analysis correlation results between pairs of genome regions. We limited the y-axis to values above zero for better readability, and accordingly have not shown the k = 1 correlations for Exon–CDS, Introns–CDS, and Intergenic–CDS.
Figure 2Heatmap of correlations between individual intron and intergenic regions (k = 7). Each row represents the pairwise correlation values (Pearson correlation) of introns for the listed organism, while every column is associated with the intergenic region of an organism. Color scale is limited in the heatmap to values above zero for better readability.
List of 10 k-mer words with highest contribution to correlation values for k = 7. Organism Content, shown for homogenous pairings of sequence regions, is defined by the fraction of analyzed organisms for which the difference between content of the corresponding k-mer word and the mean k-mer word content (4−) is larger than 1 σ with respect to the k-mer word content distribution of the respective organism.
| Correlated Regions | Contribution | Organism Content | |
|---|---|---|---|
| TTTTTTT | 0.0204 | 100% | |
| AAAAAAA | 0.0196 | 100% | |
| GCTGCTG | 0.0045 | 92.3% | |
| CAGCAGC | 0.0044 | 89.7% | |
| CTGCTGC | 0.0032 | 82.1% | |
| GCAGCAG | 0.0032 | 84.6% | |
| TTTTCTT | 0.0029 | 100% | |
| TTTCTTT | 0.0028 | 100% | |
| TTTATTT | 0.0028 | 97.4% | |
| TGCTGCT | 0.0027 | 92.3% | |
| GCTGCTG | 0.0077 | 94.9% | |
| CAGCAGC | 0.0077 | 94.9% | |
| CTGCTGC | 0.0054 | 92.3% | |
| GCAGCAG | 0.0053 | 89.7% | |
| TGCTGCT | 0.0042 | 94.9% | |
| AGCAGCA | 0.0041 | 94.9% | |
| CTCCTCC | 0.0035 | 89.7% | |
| GGAGGAG | 0.0035 | 89.7% | |
| CTTCTTC | 0.0031 | 100% | |
| GAAGAAG | 0.0030 | 100% | |
| TTTTTTT | 0.0553 | 100% | |
| AAAAAAA | 0.0552 | 100% | |
| TGTGTGT | 0.0070 | 89.7% | |
| ACACACA | 0.0070 | 87.2% | |
| TTTATTT | 0.0065 | 100% | |
| AAATAAA | 0.0064 | 100% | |
| ATATATA | 0.0063 | 100% | |
| TATATAT | 0.0063 | 100% | |
| ATTTTTT | 0.0056 | 100% | |
| GTGTGTG | 0.0056 | 82.1% | |
| TTTTTTT | 0.0467 | 100% | |
| AAAAAAA | 0.0466 | 100% | |
| ATATATA | 0.0073 | 100% | |
| TATATAT | 0.0072 | 100% | |
| AAATAAA | 0.0069 | 100% | |
| TTTATTT | 0.0069 | 100% | |
| TGTGTGT | 0.0059 | 89.7% | |
| ACACACA | 0.0057 | 89.7% | |
| ATTTTTT | 0.0055 | 100% | |
| AAAAAAT | 0.0055 | 100% | |
| TTTTTTT | 0.0510 | ||
| AAAAAAA | 0.0510 | ||
| ATATATA | 0.0068 | ||
| TATATAT | 0.0068 | ||
| AAATAAA | 0.0066 | ||
| TTTATTT | 0.0066 | ||
| TGTGTGT | 0.0064 | ||
| ACACACA | 0.0063 | ||
| ATTTTTT | 0.0056 | ||
| AAAAAAT | 0.0056 | ||
| TTTTTTT | 0.0414 | ||
| AAAAAAA | 0.0406 | ||
| TTTATTT | 0.0052 | ||
| AAATAAA | 0.0047 | ||
| ATTTTTT | 0.0044 | ||
| TTTAAAA | 0.0042 | ||
| TTTTAAA | 0.0042 | ||
| AAAAAAT | 0.0041 | ||
| ACACACA | 0.0040 | ||
| TGTGTGT | 0.0039 | ||
| GCTGCTG | 0.0065 | ||
| CAGCAGC | 0.0065 | ||
| CTGCTGC | 0.0047 | ||
| GCAGCAG | 0.0046 | ||
| TGCTGCT | 0.0037 | ||
| AGCAGCA | 0.0037 | ||
| GGAGGAG | 0.0030 | ||
| CTCCTCC | 0.0030 | ||
| TTCTTCT | 0.0025 | ||
| AGAAGAA | 0.0025 |
List of 10 k-mer words with highest contribution to correlation values for k = 11. Organism Content, shown for homogenous pairings of sequence regions, is defined by the fraction of analyzed organisms for which the difference between content of the corresponding k-mer word and the mean k-mer word content (4−) is larger than 1 σ, with respect to the k-mer word content distribution of the respective organism.
| Correlated Regions | Contribution | Organism Content | |
|---|---|---|---|
| TTTTTTTTTTT | 0.0662 | 100% | |
| AAAAAAAAAAA | 0.0657 | 100% | |
| ACACACACACA | 0.0155 | 97.4% | |
| CACACACACAC | 0.0148 | 97.4% | |
| TGTGTGTGTGT | 0.0148 | 92.3% | |
| GTGTGTGTGTG | 0.0142 | 94.9% | |
| ATATATATATA | 0.0069 | 97.4% | |
| TATATATATAT | 0.0068 | 97.4% | |
| TGCTGCTGCTG | 0.0052 | 100% | |
| CAGCAGCAGCA | 0.0052 | 100% | |
| TGCTGCTGCTG | 0.0106 | 100% | |
| CAGCAGCAGCA | 0.0101 | 100% | |
| GCTGCTGCTGC | 0.0082 | 100% | |
| CTGCTGCTGCT | 0.0079 | 100% | |
| GCAGCAGCAGC | 0.0078 | 100% | |
| AGCAGCAGCAG | 0.0075 | 100% | |
| TCCTCCTCCTC | 0.0051 | 100% | |
| GAGGAGGAGGA | 0.0050 | 100% | |
| CCTCCTCCTCC | 0.0036 | 100% | |
| GGAGGAGGAGG | 0.0035 | 100% | |
| AAAAAAAAAAA | 0.0990 | 100% | |
| TTTTTTTTTTT | 0.0987 | 100% | |
| TGTGTGTGTGT | 0.0480 | 97.4% | |
| ACACACACACA | 0.0478 | 97.4% | |
| GTGTGTGTGTG | 0.0455 | 97.4% | |
| CACACACACAC | 0.0453 | 97.4% | |
| ATATATATATA | 0.0363 | 100% | |
| TATATATATAT | 0.0363 | 100% | |
| AGAGAGAGAGA | 0.0167 | 97.4% | |
| TCTCTCTCTCT | 0.0165 | 100% | |
| TTTTTTTTTTT | 0.0837 | 100% | |
| AAAAAAAAAAA | 0.0835 | 100% | |
| TGTGTGTGTGT | 0.0452 | 97.4% | |
| ACACACACACA | 0.0440 | 97.4% | |
| GTGTGTGTGTG | 0.0428 | 97.4% | |
| TATATATATAT | 0.0422 | 100% | |
| ATATATATATA | 0.0422 | 100% | |
| CACACACACAC | 0.0417 | 97.4% | |
| AGAGAGAGAGA | 0.0193 | 100% | |
| TCTCTCTCTCT | 0.0192 | 100% | |
| AAAAAAAAAAA | 0.0884 | ||
| TTTTTTTTTTT | 0.0883 | ||
| TGTGTGTGTGT | 0.0456 | ||
| ACACACACACA | 0.0448 | ||
| GTGTGTGTGTG | 0.0432 | ||
| CACACACACAC | 0.0425 | ||
| ATATATATATA | 0.0395 | ||
| TATATATATAT | 0.0395 | ||
| AGAGAGAGAGA | 0.0181 | ||
| TCTCTCTCTCT | 0.0178 | ||
| TTTTTTTTTTT | 0.1002 | ||
| AAAAAAAAAAA | 0.0999 | ||
| ACACACACACA | 0.0344 | ||
| TGTGTGTGTGT | 0.0338 | ||
| CACACACACAC | 0.0328 | ||
| GTGTGTGTGTG | 0.0323 | ||
| ATATATATATA | 0.0223 | ||
| TATATATATAT | 0.0222 | ||
| AGAGAGAGAGA | 0.0096 | ||
| TCTCTCTCTCT | 0.0095 | ||
| TGCTGCTGCTG | 0.0093 | ||
| CAGCAGCAGCA | 0.0091 | ||
| GCTGCTGCTGC | 0.0073 | ||
| CTGCTGCTGCT | 0.0071 | ||
| GCAGCAGCAGC | 0.0071 | ||
| AGCAGCAGCAG | 0.0069 | ||
| GAGGAGGAGGA | 0.0044 | ||
| TCCTCCTCCTC | 0.0044 | ||
| CCTCCTCCTCC | 0.0032 | ||
| GGAGGAGGAGG | 0.0032 |
Figure 3Histograms representing the correlation contributions of tandem repeat k-mer words with different repeat unit lengths to k = 7 correlation values for intron–intergenic correlation (A) and to exon–CDS correlation (B) (see Section 2.2 for details).
Correlation contributions of tandem repeats with different repeat unit lengths b (in base pairs) for word lengths k = 7 and k = 11, between different genome regions analogous to the data shown in Figure 2.
| Correlated Regions |
| Correlation Contribution | Correlation Contribution | Correlation Contribution | Remaining Words |
|---|---|---|---|---|---|
| 7 | 4.02% | 0.98% | 5.18% | 89.82% | |
| 7 | 0.03% | 0.09% | 8.21% | 91.67% | |
| 7 | 0.41% | 0.20% | 7.17% | 92.22% | |
| 7 | 8.28% | 2.81% | 2.26% | 86.66% | |
| 7 | 11.22% | 5.08% | 1.36% | 82.34% | |
| 7 | 9.62% | 4.82% | 1.54% | 84.02% | |
| 7 | 10.52% | 5.01% | 1.44% | 83.03% | |
| 11 | 13.37% | 8.70% | 5.70% | 72.23% | |
| 11 | 0.02% | 0.03% | 9.86% | 90.09% | |
| 11 | 0.46% | 0.60% | 9.30% | 89.65% | |
|
| 11 | 21.31% | 21.49% | 2.37% | 54.83% |
|
| 11 | 20.45% | 32.35% | 1.54% | 45.66% |
|
| 11 | 17.34% | 33.40% | 2.09% | 47.27% |
|
| 11 | 18.51% | 32.47% | 1.84% | 47.18% |
Correlation contributions of words with different G/C contents for word lengths k = 7 and k = 11, between different genome regions.
| Correlated Regions |
| 0% G/C | 0–25% G/C | 25–50% G/C | 50–75% G/C | 75–100% G/C | 100% G/C |
|---|---|---|---|---|---|---|---|
| 7 | 11.71% | 23.53% | 34.13% | 39.23% | 3.11% | 0.21% | |
| 7 | 0.38% | 4.33% | 38.89% | 52.92% | 3.86% | 0.35% | |
| 7 | 1.91% | 7.95% | 39.16% | 49.27% | 3.63% | 0.29% | |
| 7 | 23.99% | 42.64% | 28.35 | 26.60% | 2.41% | 0.09% | |
| 7 | 33.02% | 55.66% | 21.94% | 18.82% | 3.59% | 0.61% | |
| 7 | 32.49% | 56.00% | 22.01% | 18.31% | 3.69% | 0.71% | |
| 7 | 32.83% | 55.89% | 21.92% | 18.49% | 3.71% | 0.75% | |
| 11 | 18.84% | 33.66% | 29.21% | 33.72% | 3.41% | 1.06% | |
| 11 | 0.16% | 3.56% | 34.72% | 56.19% | 5.53% | 1.32% | |
| 11 | 1.24% | 8.30% | 35.78% | 50.63% | 5.29% | 1.52% | |
| 11 | 32.51% | 52.23% | 25.27% | 20.63% | 1.86% | 1.33% | |
| 11 | 37.31% | 56.78% | 23.69% | 18.35% | 1.18% | 0.69% | |
| 11 | 36.08% | 56.25% | 24.37% | 18.28% | 1.11% | 0.63% | |
| 11 | 36.27% | 56.06% | 24.08% | 18.53% | 1.34% | 0.84% |
Correlation contributions for correlations between introns and intergenic regions for tandem repeat k-mer words with repeat unit length ≤ 2 bp for either no or one allowed mismatches, for k = 7 (top) and k = 11 (bottom).
| No Mismatch | 1 Mismatch | |
|---|---|---|
| AAAAAAA | 5.10% | 11.51% |
| CCCCCCC | 0.17% | 0.20% |
| GGGGGGG | 0.15% | 0.18% |
| TTTTTTT | 5.10% | 11.55% |
| ACACACA | 0.63% | 0.88% |
| AGAGAGA | 0.38% | 0.70% |
| ATATATA | 0.68% | 1.64% |
| CACACAC | 0.50% | 0.57% |
| CGCGCGC | <0.01% | 0.08% |
| CTCTCTC | 0.30% | 0.43% |
| GAGAGAG | 0.30% | 0.43% |
| GCGCGCG | <0.01% | 0.08% |
| GTGTGTG | 0.51% | 0.58% |
| TATATAT | 0.68% | 1.63% |
| TCTCTCT | 0.38% | 0.70% |
| TGTGTGT | 0.64% | 0.89% |
| AAAAAAAAAAA | 8.84% | 10.90% |
| CCCCCCCCCCC | 0.42% | 0.45% |
| GGGGGGGGGGG | 0.41% | 0.44% |
| TTTTTTTTTTT | 8.83% | 10.89% |
| ACACACACACA | 4.48% | 4.61% |
| AGAGAGAGAGA | 1.81% | 1.91% |
| ATATATATATA | 3.95% | 4.20% |
| CACACACACAC | 4.25% | 4.36% |
| CGCGCGCGCGC | <0.01% | <0.01 |
| CTCTCTCTCTC | 1.68% | 1.79% |
| GAGAGAGAGAG | 1.70% | 1.79% |
| GCGCGCGCGCG | <0.01% | <0.01% |
| GTGTGTGTGTG | 4.32% | 4.43% |
| TATATATATAT | 3.95% | 4.19% |
| TCTCTCTCTCT | 1.78% | 1.89% |
| TGTGTGTGTGT | 4.56% | 4.68% |
Contents of specific nucleotides (A, C, G, T) within mismatches of tandem repeat k-mer words (for k = 7 and k = 11). Contents lower than 20% are marked red, and contents higher than 30% are marked green, remaining contents are marked yellow.
| Introns A | C | G | T | Intergenic Region A | C | G | T | |
|---|---|---|---|---|---|---|---|---|
| AAAAAAA | - | 28.48% | 31.04% | 40.48% | - | 28.05% | 31.29% | 40.65% |
| CCCCCCC | 37.93% | - | 16.91% | 45.16% | 37.66% | - | 16.80% | 45.54% |
| GGGGGGG | 45.22% | 16.91% | - | 37.87% | 45.52% | 16.79% | - | 37.69% |
| TTTTTTT | 40.41% | 31.06% | 28.50% | - | 40.66% | 31.28% | 28.06% | - |
| ACACACA | 41.80% | 15.85% | 18.99% | 23.36% | 41.94% | 15.78% | 18.70% | 23.58% |
| AGAGAGA | 43.86% | 16.79% | 21.76% | 17.59% | 44.29% | 16.54% | 21.43% | 17.74% |
| ATATATA | 39.21% | 14.91% | 15.44% | 30.43% | 39.04% | 14.95% | 15.77% | 30.24% |
| CACACAC | 34.04% | 19.40% | 20.78% | 25.78% | 34.30% | 19.28% | 20.60% | 25.82% |
| CGCGCGC | 24.41% | 29.08% | 22.76% | 23.75% | 23.88% | 28.96% | 23.24% | 23.91% |
| CTCTCTC | 17.49% | 25.84% | 18.82% | 37.86% | 17.47% | 25.40% | 18.70% | 38.43% |
| GAGAGAG | 37.89% | 18.82% | 25.87% | 17.43% | 38.50% | 18.72% | 25.34% | 17.44% |
| GCGCGCG | 24.29% | 22.68% | 29.08% | 23.96% | 23.91% | 23.91% | 22.99% | 29.23% |
| GTGTGTG | 25.71% | 20.84% | 19.43% | 34.02% | 25.83% | 20.61% | 19.26% | 34.29% |
| TATATAT | 30.42% | 15.45% | 14.92% | 39.21% | 30.24% | 15.75% | 14.98% | 39.02% |
| TCTCTCT | 17.52% | 21.77% | 16.77% | 43.94% | 17.75% | 21.48% | 16.57% | 44.19% |
| TGTGTGT | 23.27% | 19.02% | 15.86% | 41.86% | 23.59% | 18.75% | 15.74% | 41.91% |
| mean | 32.23% | 21.13% | 21.13% | 32.18% | 32.31% | 21.02% | 21.05% | 32.29% |
| AAAAAAAAAAA | - | 29.72% | 37.36% | 32.91% | - | 29.40% | 37.73% | 32.86% |
| CCCCCCCCCCC | 40.11% | - | 19.32% | 40.56% | 38.89% | - | 19.13% | 41.98% |
| GGGGGGGGGGG | 41.16% | 19.42% | - | 39.42% | 41.57% | 19.31% | - | 39.12% |
| TTTTTTTTTTT | 32.78% | 37.44% | 29.78% | - | 33.01% | 37.70% | 29.29% | - |
| ACACACACACA | 29.68% | 14.15% | 24.26% | 31.91% | 29.95% | 13.88% | 23.69% | 32.48% |
| AGAGAGAGAGA | 42.99% | 17.03% | 27.46% | 12.52% | 43.73% | 16.82% | 26.98% | 12.48% |
| ATATATATATA | 28.64% | 20.88% | 21.22% | 29.26% | 28.40% | 20.84% | 21.49% | 29.27% |
| CACACACACAC | 29.89% | 11.88% | 23.41% | 34.83% | 30.98% | 11.51% | 22.91% | 34.60% |
| CGCGCGCGCGC | 33.67% | 17.94% | 16.99% | 31.40% | 32.56% | 20.27% | 18.52% | 28.65% |
| CTCTCTCTCTC | 12.31% | 28.82% | 17.55% | 41.32% | 12.26% | 28.13% | 17.22% | 42.39% |
| GAGAGAGAGAG | 41.93% | 17.19% | 28.78% | 12.10% | 42.78% | 17.18% | 28.09% | 11.94% |
| GCGCGCGCGCG | 31.17% | 16.91% | 18.14% | 33.78% | 29.61% | 16.89% | 20.61% | 32.89% |
| GTGTGTGTGTG | 33.92% | 23.03% | 12.74% | 30.31% | 34.60% | 23.14% | 11.48% | 30.78% |
| TATATATATAT | 29.22% | 21.34% | 20.79% | 28.65% | 29.21% | 21.55% | 20.83% | 28.41% |
| TCTCTCTCTCT | 12.84% | 27.56% | 17.27% | 42.33% | 12.74% | 26.91% | 16.99% | 43.37% |
| TGTGTGTGTGT | 31.39% | 23.95% | 14.51% | 30.15% | 32.57% | 23.96% | 13.66% | 29.81% |
| mean | 31.45% | 21.82% | 21.97% | 31.43% | 31.52% | 21.83% | 21.91% | 32.40% |