| Literature DB >> 20502512 |
A M Anithakumari, Jifeng Tang, Herman J van Eck, Richard G F Visser, Jack A M Leunissen, Ben Vosman, C Gerard van der Linden.
Abstract
Single nucleotide polymorphisms (SNPs) represent the most abundant type of genetic variation that can be used as molecular markers. The SNPs that are hidden in sequence databases can be unlocked using bioinformatic tools. For efficient application of these SNPs, the sequence set should be error-free as much as possible, targeting single loci and suitable for the SNP scoring platform of choice. We have developed a pipeline to effectively mine SNPs from public EST databases with or without quality information using QualitySNP software, select reliable SNP and prepare the loci for analysis on the Illumina GoldenGate genotyping platform. The applicability of the pipeline was demonstrated using publicly available potato EST data, genotyping individuals from two diploid mapping populations and subsequently mapping the SNP markers (putative genes) in both populations. Over 7000 reliable SNPs were identified that met the criteria for genotyping on the GoldenGate platform. Of the 384 SNPs on the SNP array approximately 12% dropped out. For the two potato mapping populations 165 and 185 SNPs segregating SNP loci could be mapped on the respective genetic maps, illustrating the effectiveness of our pipeline for SNP selection and validation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11032-009-9377-5) contains supplementary material, which is available to authorized users.Entities:
Year: 2010 PMID: 20502512 PMCID: PMC2869401 DOI: 10.1007/s11032-009-9377-5
Source DB: PubMed Journal: Mol Breed ISSN: 1380-3743 Impact factor: 2.589
Results of 384 PotSNP array performed in two (C × E and SH × RH) independent assays
| 384 PotSNP array | Mapping |
|---|---|
| 309 out of the 384 are useful markers (80%) | 165 markers could be mapped in C × E |
| 42 dropped out in any sample (11%) | 186 markers could be mapped in SH × RH |
| 33 were monomorphic in all materiala(9%) | 99 markers could be mapped in both populations |
aIncluding a set of 220 tetraploid varieties
Fig. 1Location of the SNP markers on parental maps C and E. The number on the left side is the genetic distance in centiMorgans (cM) right side is marker designations. The parental maps were drawn by the MapChart 2.2 program (Voorrips 2002)
Number of markers used for construction of parental maps (C, E and SH, RH) according to marker type
| Marker type | Total markers used in construction of parental maps | Markers on the map | ||
|---|---|---|---|---|
| C and E | SH and RH | C and E | SH and RH | |
| SNP markers | 165 | 186 | 146 | 168 |
| AFLP markers | 93 | 151 | 82 | 131 |
| SSR markers | 45 | 16 | 33 | 16 |
| CAPS markers | 24 | 21 | 22 | 21 |
Fig. 2Location of the SNP markers on parental maps SH and RH. The number on the left side is the genetic distance in centiMorgans (cM) right side is marker designations. The parental maps were drawn by the MapChart 2.2 program (Voorrips 2002)
Fig. 3Alignment of C8 linkage group with the SH8, and E2, E10 with the RH2, RH10 linkage groups, respectively using markers common to both populations. Left side number indicates genetic distances in centiMorgan (cM), right side marker designations