| Literature DB >> 22238659 |
Eleni Bachlava1, Christopher A Taylor, Shunxue Tang, John E Bowers, Jennifer R Mandel, John M Burke, Steven J Knapp.
Abstract
Recent advances in next-generation DNA sequencing technologies have made possible the development of high-throughput SNP genotyping platforms that allow for the simultaneous interrogation of thousands of single-nucleotide polymorphisms (SNPs). Such resources have the potential to facilitate the rapid development of high-density genetic maps, and to enable genome-wide association studies as well as molecular breeding approaches in a variety of taxa. Herein, we describe the development of a SNP genotyping resource for use in sunflower (Helianthus annuus L.). This work involved the development of a reference transcriptome assembly for sunflower, the discovery of thousands of high quality SNPs based on the generation and analysis of ca. 6 Gb of transcriptome re-sequencing data derived from multiple genotypes, the selection of 10,640 SNPs for inclusion in the genotyping array, and the use of the resulting array to screen a diverse panel of sunflower accessions as well as related wild species. The results of this work revealed a high frequency of polymorphic SNPs and relatively high level of cross-species transferability. Indeed, greater than 95% of successful SNP assays revealed polymorphism, and more than 90% of these assays could be successfully transferred to related wild species. Analysis of the polymorphism data revealed patterns of genetic differentiation that were largely congruent with the evolutionary history of sunflower, though the large number of markers allowed for finer resolution than has previously been possible.Entities:
Mesh:
Year: 2012 PMID: 22238659 PMCID: PMC3251610 DOI: 10.1371/journal.pone.0029814
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of DNA sequence data used in the reference assembly (above line) and SNP discovery (below line).
| Sunflower Line | Accession ID | Sequencing Method | Read Length (range in bp) | Read Length (avg. in bp) | Number of Reads | Total Sequence (Mb) |
| RHA280 | PI 552943 | Sanger | 100–809 | 397 | 20,892 | 8.3 |
| RHA801 | PI 599768 | Sanger | 100–814 | 425 | 22,603 | 9.6 |
| HA89 | PI 599773 | Sanger | 100–923 | 712 | 39,569 | 28 |
| HA300b | n/a | Sanger | 85–546 | 354 | 1,485 | 0.5 |
| PSC8 | n/a | Sanger | 100–922 | 478 | 15,837 | 7.6 |
| EMIL | n/a | Sanger | 101–625 | 356 | 2,169 | 0.8 |
| ANN1238 | n/a | Sanger | 100–1013 | 713 | 27,957 | 30 |
| HA89 | See above. | 454 GS FLX XLR | 50–622 | 285 | 66,851 | 19 |
| RHA373 | PI 560141 | Illumina | 36 | 36 | 21,601,273 | 777.7 |
| RHA415 | PI 607506 | Illumina | 36 | 36 | 11,341,180 | 408.3 |
| HA383 | PI 578872 | Illumina | 36 | 36 | 12,717,269 | 457.8 |
| HA434 | PI 633744 | Illumina | 36 | 36 | 25,661,886 | 923.8 |
| RHA455 | PI 642774 | Illumina-PE | 2×90 | 180 | 4,673,377 | 841.2 |
| RHA468 | n/a | Illumina-PE | 2×90 | 180 | 4,459,305 | 802.7 |
| HA89 | See above. | Illumina-PE | 2×90 | 180 | 4,943,677 | 889.9 |
| HA412-HO | PI 642777 | Illumina-PE | 2×90 | 180 | 3,690,226 | 664.2 |
| Total | 89,245,173 | 5,869.40 |
*These sequence datasets were used for both the reference assembly and for SNP identification.
Summary of sunflower lines/accessions genotyped using the SNP array.
| Sunflower Line | Species | Accession ID | Type |
| ANN1238 |
| n/a | Wild (Nebraska) |
| ANN1811 |
| PI 494567 | Wild (Texas) |
| Arikara |
| PI 369357 | Native American Landrace |
| Havasupai |
| PI 369358 | Native American Landrace |
| Hopi |
| PI 369359 | Native American Landrace |
| Seneca |
| PI 369360 | Native American Landrace |
| Mennonite |
| PI 650650 | Open-Pollinated; Non-Oil |
| Shemesh |
| n/a | Open-Pollinated; Non-Oil |
| Peredovik |
| PI 650338 | Open-Pollinated; Oil |
| Pervenets |
| PI 483077 | Open-Pollinated; Oil |
| VNIIMK8931 |
| PI 340790 | Open-Pollinated; Oil |
| RHA280 |
| PI 552943 | RHA Non-Oil |
| HA292 |
| PI 552937 | HA Non-Oil |
| RHA274 |
| PI 599759 | RHA Oil |
| RHA373 |
| PI 560141 | RHA Oil |
| RHA409 |
| PI 603990 | RHA Oil |
| RHA415 |
| PI 607506 | RHA Oil |
| RHA417 |
| PI 600000 | RHA Oil |
| RHA455 |
| PI 642774 | RHA Oil |
| RHA468 |
| n/a | RHA Oil |
| RHA801 |
| PI 599768 | RHA Oil |
| NMS373 |
| PI 560141 | RHA Oil |
| NMS377 |
| PI 560145 | RHA Oil |
| HA89 |
| PI 599773 | HA Oil |
| HA342 |
| PI 509052 | HA Oil |
| HA370 |
| PI 534656 | HA Oil |
| HA372 |
| PI 534658 | HA Oil |
| HA383 |
| PI 578872 | HA Oil |
| HA407 |
| PI 597371 | HA Oil |
| HA412-HO |
| PI 642777 | HA Oil |
| HA434 |
| PI 633744 | HA Oil |
| HA821 |
| PI 599984 | HA Oil |
| ARG1820 |
| PI 494580 | Wild Relative |
| ARG1834 |
| PI 494582 | Wild Relative |
| NIV20 |
| PI 650020 | Wild Relative |
| NIV58 |
| PI 613758 | Wild Relative |
*These DNA samples were genotyped twice each to assess repeatability of genotype calls.
Figure 1STRUCTURE results plot.
Results of STRUCTURE analysis of the 32 H. annuus individuals based on all SNPs with MAF ≥0.10. A) Depicts the results for K = 2. B) Depicts the results for K = 3. Black bars represent dividers between the six groups: OPV/Landraces, HA-oil, HA-nonoil (HA-NO), RHA-oil, RHA-nonoil (RHA-NO), and wild H. annuus.
Figure 2Principal coordinates analysis plot.
Plot of the first two principal coordinates for the 32 H. annuus individuals based on all SNPs with MAF ≥0.10. Each data point represents an accession with one of six groups: OPV/Landraces (OPV/LR), HA-oil, HA-nonoil (HA-NO), RHA-oil, RHA-nonoil (RHA-NO), and wild H. annuus.