| Literature DB >> 21156062 |
Eduard D Akhunov1, Alina R Akhunova, Olin D Anderson, James A Anderson, Nancy Blake, Michael T Clegg, Devin Coleman-Derr, Emily J Conley, Curt C Crossman, Karin R Deal, Jorge Dubcovsky, Bikram S Gill, Yong Q Gu, Jakub Hadam, Hwayoung Heo, Naxin Huo, Gerard R Lazo, Ming-Cheng Luo, Yaqin Q Ma, David E Matthews, Patrick E McGuire, Peter L Morrell, Calvin O Qualset, James Renfro, Dindo Tabanao, Luther E Talbert, Chao Tian, Donna M Toleno, Marilyn L Warburton, Frank M You, Wenjun Zhang, Jan Dvorak.
Abstract
BACKGROUND: A genome-wide assessment of nucleotide diversity in a polyploid species must minimize the inclusion of homoeologous sequences into diversity estimates and reliably allocate individual haplotypes into their respective genomes. The same requirements complicate the development and deployment of single nucleotide polymorphism (SNP) markers in polyploid species. We report here a strategy that satisfies these requirements and deploy it in the sequencing of genes in cultivated hexaploid wheat (Triticum aestivum, genomes AABBDD) and wild tetraploid wheat (Triticum turgidum ssp. dicoccoides, genomes AABB) from the putative site of wheat domestication in Turkey. Data are used to assess the distribution of diversity among and within wheat genomes and to develop a panel of SNP markers for polyploid wheat.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21156062 PMCID: PMC3022916 DOI: 10.1186/1471-2164-11-702
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Project flow chart.
Lines of tetraploid and hexaploid wheat used for SNP discovery
| Species | Database code | Line | Origin |
|---|---|---|---|
| Td01 | PI 428020 | Diyarbakir, Turkey | |
| Td02 | PI 428027 | Diyarbakir, Turkey | |
| Td03 | PI 428053 | Diyarbakir, Turkey | |
| Td04 | PI 428073 | Diyarbakir, Turkey | |
| Td05 | PI 428064 | Diyarbakir, Turkey | |
| Td06 | PI 428082 | Diyarbakir, Turkey | |
| Td07 | PI 428083 | Diyarbakir, Turkey | |
| Td08 | PI 428086 | Diyarbakir, Turkey | |
| Td09 | G 2844 | Diyarbakir, Turkey | |
| Td10 | G 2040 | Diyarbakir, Turkey | |
| Ta11 | PI 166698 | Turkey | |
| Ta13 | PI 166792 | Turkey | |
| Ta16 | PI 622268 | Iran, | |
| Ta17 | Yangxian Yangqianmai | Shaanxi (5660) | |
| Ta18 | Yecora Rojo | California | |
| Ta19 | PI 119325 | Turkey | |
| Ta20 | PI 622233 | Iran | |
| Ta21 | Chinese Spring | China | |
| Ta23 | Opata 85 | CIMMYT | |
| Ta12 | PI 166305 | Turkey | |
| Ta14 | PI 350731 | Austria | |
| Ta15 | PI 410595 | Pakistan | |
| Ta22 | 405a (DV1132) | Iran |
Synthetic wheats used for SNP discovery
| Name* | Database code | Source of AB genomes | Source of the D genome |
|---|---|---|---|
| ITMI synthetic(1) | Sn24 | CIMMYT W7984, CIGM86.940, | |
| RL5402(2) | Sn25 | TetraCantach | RL5261, |
| RL5403(2) | Sn26 | TetraCantach | RL5266, |
| RL5405(2) | Sn27 | TetraCantach | RL5288. |
| RL5406(2) | Sn28 | TetraCantach | RL5289. |
| 62052_4(1) | Sn29 | CIMMYT 205, | |
| 62056_4(1) | Sn30 | CIMMYT 224 = CLAE 25, | |
| 161725_0(1) | Sn31 | CIMMYT 372 = CIGM86.940, | |
| Sear's synthetic(3) | Sn32 | Unknown | Unknown |
* Accessions designated with (1) were developed and supplies by A. Mujeeb-Kazi, at CIMMYT, those designated with (2) were developed and supplied by E.R. Kerber, Agriculture Canada, Winnipeg, Manitoba, and that designated with (3) was developed and supplied by E.R. Search, USDA-ARS, Columbia, Missouri
Loci mapped on the basis of linkage and synteny and the total number of EST loci with estimated diversity (Div. loci) on the map
| Chromosome | Length (cM) | Total loci mapped | Linkage mapped loci | Synteny mapped loci | Div. loci |
|---|---|---|---|---|---|
| 1 | 180.5 | 212 | 148 | 64 | 125 |
| 2 | 186.9 | 171 | 92 | 79 | 142 |
| 3 | 197.0 | 240 | 186 | 54 | 151 |
| 4 | 127.4 | 198 | 107 | 91 | 155 |
| 5 | 171.2 | 133 | 86 | 47 | 113 |
| 6 | 149.3 | 205 | 119 | 86 | 145 |
| 7 | 154.5 | 195 | 132 | 63 | 156 |
| Total | 1166.8 | 1354 | 870 | 484 | 987 |
Nucleotides sequenced, SNPs discovered, average number of haplotypes (H), and haplotype diversity (h)
| Species | Genome | SNPs | Nucl./SNP | ||||
|---|---|---|---|---|---|---|---|
| A | 13 | 351542 | 966 | 364 | 1.82c* | 0.22c | |
| B | 13 | 348938 | 1008 | 346 | 1.72c | 0.21c | |
| D | 13 | 349748 | 377 | 927 | 1.23d | 0.06d | |
| A | 10 | 351542 | 1516 | 232 | 2.05ab | 0.31b | |
| B | 10 | 348938 | 1979 | 176 | 2.22a | 0.33b | |
| Synthetic 6 | D | 9 | 349029 | 1727 | 202 | 2.39a | 0.47a |
* Means followed by the same letter are not significantly different at the 5% probability level. Individual chromosome means (Table 9) were used as variables in ANOVA.
Agreement between the locations of EST loci on the Ae. tauschii linkage map and wheat deletion-bin maps
| A genome | B genome | D genome | ||||
|---|---|---|---|---|---|---|
| 1S | 62 | 6.5 | 60 | 20.0 | 73 | 6.8 |
| 1L | 38 | 31.6 | 49 | 20.4 | 48 | 16.7 |
| 2S | 24 | 8.3 | 36 | 0.0 | 26 | 0.0 |
| 2L | 39 | 2.6 | 36 | 5.6 | 37 | 0.0 |
| 3S | 48 | 2.1 | 48 | 14.6 | 45 | 0.0 |
| 3L | 82 | 1.2 | 92 | 8.7 | 97 | 9.3 |
| 4S | -* | - | 27 | 14.8 | 29 | 10.3 |
| 4L | -* | - | 55 | 1.8 | 59 | 27.1 |
| 5S | 23 | 21.7 | 24 | 25.0 | 23 | 4.3 |
| 5L | 39 | 12.8 | 37 | 10.8 | 33 | 0.0 |
| 6S | 29 | 3.4 | 26 | 19.2 | 24 | 12.5 |
| 6L | 53 | 9.4 | 59 | 11.9 | 48 | 8.3 |
| 7S | 51 | 3.9 | 56 | 8.9 | 55 | 18.2 |
| 7L | 46 | 23.9 | 49 | 12.2 | 49 | 10.2 |
| Mean | 10.8 | 12.4 | 8.8 | |||
* Chromosome 4A is extensively rearranged, and the congruence of the 4A deletion-bin map could not be meaningfully compared with the Ae. tauschii linkage map.
Numbers of loci on the diversity maps harboring one or more SNPs, the total numbers of loci with estimated diversity (nt), and the total numbers of loci used for analyses (na)
| Synthetic wheat | ||||||
|---|---|---|---|---|---|---|
| 1 | 43 | 49 | 23 | 53 | 50 | 66 |
| 2 | 42 | 45 | 30 | 49 | 49 | 66 |
| 3 | 27 | 36 | 11 | 30 | 40 | 54 |
| 4 | 55 | 28 | 20 | 54 | 62 | 75 |
| 5 | 45 | 35 | 6 | 37 | 31 | 58 |
| 6 | 39 | 47 | 25 | 42 | 50 | 87 |
| 7 | 54 | 56 | 23 | 51 | 56 | 71 |
| Total * | 305a | 296a | 138b | 316a | 338a | 477c |
| 619 | 609 | 704 | 619 | 609 | 704 | |
| 590 | 584 | 679 | 585 | 576 | 650 | |
*Sums followed by the same letter are not statistically different at the 5% probability level.
Total nucleotide diversity, diversity in coding sequences, noncoding sequences (introns and UTRs), replacement and silent codon positions
| Genome | Total diversity | Coding | Noncoding | Replacement | Silent | Ratio replac./silent | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.59c | 0.57c | -0.09b | 0.54b | 0.55b | 0.76bc | 0.72bc | 0.30b | 0.30b | 1.38bc | 1.46a | 0.25a | 0.24a | |
| 0.56c | 0.57c | 0.00b | 0.55b | 0.59b | 0.89bc | 0.89b | 0.32b | 0.36b | 1.40bc | 1.44a | 0.22a | 0.22a | |
| 0.22d | 0.18e | -0.57c | 0.22c | 0.18c | 0.36d | 0.32d | 0.06c | 0.06c | 0.79d | 0.59b | 0.10b | 0.11b | |
| 0.68b | 0.70d | 0.09b | 0.58b | 0.60b | 0.91bc | 0.98b | 0.32b | 0.36b | 1.43bc | 1.47a | 0.23a | 0.26a | |
| 0.76b | 0.72cb | -0.12b | 0.56b | 0.55b | 1.21b | 1.17b | 0.31b | 0.30b | 1.47b | 1.46a | 0.29a | 0.29a | |
| Synth. D | 1.31a | 1.51a | 0.56a | 1.13a | 1.25a | 1.84a | 2.09a | 0.52a | 0.53a | 3.35a | 3.87d | 0.17ab | 0.16b |
* Means sharing the same letter are not significantly different at the 0.05 probability levels.
Average nucleotide polymorphism (θw), nucleotide diversity (θπ), and Tajima's D per chromosome
| Wild emmer | Synthetic wheats | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.67 | 0.64 | -0.14 | 0.66 | 0.71 | 0.07 | 0.31 | 0.32 | -0.10* | 0.69 | 0.73 | 0.09 | 0.82 | 0.90 | 0.37 | 1.18 | 1.28 | 0.42 | |
| 0.56 | 0.57 | 0.08 | 0.74* | 0.87* | 0.30 | 0.40 | 0.29 | -0.65 | 0.83* | 0.96* | 0.50* | 0.79 | 0.81 | 0.06 | 1.07* | 1.23 | 0.57 | |
| 0.49 | 0.46 | -0.21 | 0.66 | 0.72 | 0.30 | 0.11* | 0.08* | -0.79 | 0.60 | 0.65 | 0.21 | 0.81 | 0.76 | -0.10 | 1.16 | 1.37 | 0.73 | |
| 0.45* | 0.42* | -0.11 | 0.24* | 0.16* | -0.78* | 0.15 | 0.10 | -0.77 | 0.59 | 0.50* | -0.39* | 0.75 | 0.61 | -0.56* | 1.19 | 1.41 | 0.61 | |
| 0.77* | 0.76* | 0.15 | 0.45 | 0.48 | 0.21 | 0.09* | 0.06* | -0.84 | 0.54* | 0.51* | -0.18 | 0.56* | 0.47* | -0.58* | 1.65* | 1.82* | 0.39 | |
| 0.60 | 0.66 | 0.26 | 0.65 | 0.66 | 0.06 | 0.21 | 0.21 | -0.27 | 0.77 | 0.80* | 0.05 | 0.79 | 0.76 | -0.07 | 1.67* | 1.90* | 0.44 | |
| 0.63 | 0.51 | -0.57 | 0.66 | 0.59 | -0.28* | 0.24 | 0.16 | -0.86 | 0.69* | 0.77* | 0.38* | 0.77 | 0.74 | -0.05 | 1.34 | 1.63 | 0.78* | |
| Coef. variat. | 0.18 | 0.21 | 0.30 | 0.38 | 0.52 | 0.59 | 0.15 | 0.23 | 0.12 | 0.19 | 0.18 | 0.18 | ||||||
* Means outside of the 99% bootstrap confidence interval of the genome mean
Figure 2Nucleotide diversity . Gene diversity along the A-genome chromosomes in T. aestivum and wild emmer (T. dicoccoides) in the Diyarbakir region in Turkey. Chromosome 4A is excluded because the order of genes does not conform to the Ae. tauschii genetic map. Monomorphic loci are depicted with zero diversity. The gene order along the diversity maps in Additional file 2 Table S1 is used on the X-axis, and the maps are oriented with the most distal gene in the short arm to the left. Centromere is indicated by a triangle. Genetic distances between genes are not depicted.
Figure 3Nucleotide diversity . Gene diversity along the B-genome chromosomes in T. aestivum and wild emmer (T. dicoccoides) in the Diyarbakir region in Turkey. See Figure 2 for details.
The average numbers of haplotypes per locus (H) and haplotype diversity (h)
| Synthetic wheats | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2.09* | 0.29* | 1.92* | 0.26* | 1.25 | 0.08* | 2.13 | 0.30 | 2.17 | 0.36* | 2.47 | 0.52 |
| 2 | 1.85 | 0.23 | 1.79 | 0.25 | 1.33* | 0.11* | 2.25* | 0.33* | 2.59* | 0.34 | 2.40 | 0.47 |
| 3 | 1.65* | 0.17* | 1.78 | 0.24 | 1.13* | 0.03* | 2.03 | 0.30 | 2.11 | 0.32 | 2.26 | 0.41* |
| 4 | 1.56* | 0.17* | 1.31* | 0.07* | 1.20 | 0.06 | 1.92 | 0.28 | 2.17 | 0.29* | 2.24 | 0.43 |
| 5 | 2.04* | 0.28* | 1.68 | 0.21 | 1.11* | 0.03* | 2.01 | 0.31 | 1.99* | 0.27* | 2.43 | 0.50 |
| 6 | 1.71 | 0.22 | 1.86 | 0.25 | 1.27 | 0.08* | 1.84* | 0.27* | 2.22 | 0.35 | 2.63* | 0.50 |
| 7 | 1.84 | 0.18 | 1.74 | 0.20 | 1.31 | 0.06 | 2.14 | 0.35* | 2.33 | 0.35 | 2.31 | 0.47 |
* Means outside of the 99% bootstrap confidence interval of the genome mean
Figure 4Nucleotide diversity . Gene diversity along the D-genome chromosomes in T. aestivum. See Figure 2 for details.
Average values of Wall's B in the N number of genes
| 1A | 18 | 0.29c* | 15 | 0.31ab |
| 2A | 25 | 0.57ab | 13 | 0.60a |
| 3A | 9 | 0.42bc | 17 | 0.36ab |
| 4A | 12 | 0.82a | 16 | 0.59a |
| 5A | 24 | 0.40c | 15 | 0.19b |
| 6A | 20 | 0.45bc | 14 | 0.35ab |
| 7A | 23 | 0.50bc | 14 | 0.42ab |
| Mean | 0.49a | 0.40a | ||
| 1B | 25 | 0.32bc | 23 | 0.44ab |
| 2B | 24 | 0.46ab | 36 | 0.28a |
| 3B | 18 | 0.33bc | 17 | 0.37ab |
| 4B | 9 | 0.67a | 29 | 0.49b |
| 5B | 12 | 0.45ab | 19 | 0.46ab |
| 6B | 26 | 0.42b | 24 | 0.38ab |
| 7B | 27 | 0.70a | 26 | 0.48ab |
| Mean | 0.48a | 0.41a | ||
| 1D | 11 | 0.88a | ||
| 2D | 15 | 0.82a | ||
| 3D | 2 | 1.00a | ||
| 4D | 3 | 0.78a | ||
| 5D | 2 | 1.00a | ||
| 6D | 8 | 0.46b | ||
| 7D | 6 | 0.75ab | ||
| Mean | 0.81b | |||
*Means followed by the same letter are not statistically different at the 5% probability level.
Figure 5The folded site frequency spectra. Folded site frequency spectra of minor SNP alleles at silent and replacement positions in the A and B genomes, Td (A) and Td (B) respectively, in a sample of 10 homozygous accessions of wild emmer (T. dicoccoides). Each homozygous accession is equivalent to one chromosome. The plot depicts numbers of SNPs with the minor allele being observed in an indicated number of sampled chromosomes. (B) The folded site frequency spectrum of minor SNP alleles at the silent and replacement positions in the A, B, and D genomes, Ta (A), Ta (B), and Ta (D), respectively, in a sample of 13 homozygous accessions of T. aestivum. Each homozygous accession is equivalent to one chromosome. The plot depicts numbers of SNPs with the minor allele being observed in an indicated number of sampled chromosomes.
Diploid and tetraploid species used for the development of genome-specific primers
| Species | Database code | Genome | Line | Origin |
|---|---|---|---|---|
| Tu01 | A | PI428198 = G1812 | Turkey | |
| Tu02 | A | ICTW600161 (ICARDA) | Syria | |
| As01 | S | F4 /2-12-4//PI136909-12-II/134-1 | Experimental line | |
| As02 | S | F4 /2-12-4//PI136909-12-II/134-2 | Experimental line | |
| At01 | D | AL8/78 | Armenia | |
| At02 | D | AS75 | China | |
| Lgd1 through 12 | AB | Langdon, 12 clones | Cultivar |