| Literature DB >> 20543950 |
Camille Lepoittevin1, Jean-Marc Frigerio, Pauline Garnier-Géré, Franck Salin, María-Teresa Cervera, Barbara Vornam, Luc Harvengt, Christophe Plomion.
Abstract
BACKGROUND: There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size ( approximately 23.8 Gb/C). METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2010 PMID: 20543950 PMCID: PMC2882948 DOI: 10.1371/journal.pone.0011034
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
cDNA library information.
| Library | Tissue | Nb of haploid genomes | Maritime pine provenance | Nb of ESTs |
| GEMINI | Xylem | 4 | Corsica | 8,129 |
| Normal aerial parts (AN) | Needles | 300 | Aquitaine | 240 |
| Stressed aerial parts (AS) | Needles | 300 | Aquitaine | 475 |
| Normal roots (RN) | Roots | 300 | Aquitaine | 4,592 |
| Stressed roots (RS) | Roots | 300 | Aquitaine | 4,274 |
| Buds (LG0ACA) | Buds | Unknown | Spain | 8,766 |
| TOTAL | 26,476 |
Figure 1Examples of clustering observed for the P. pinaster SNP array.
Each dot represents the mean intensity derived from a population of beads for a single sample. The normalized R (y axis) is the normalized sum of intensities of the two dyes (Cy3 and Cy5), and the normalized Theta (x axis) is ((2/Л)Tan−1 (Cy5/Cy3)), where a normalized Theta value nearest 0 is a homozygous for allele A and a Theta value nearest 1 is homozygous for allele B. A/ classical pattern with three clusters for a SNP considered as successful and polymorphic. B and C/ “cluster compression” when both homozygous clusters are closer to each other than expected. In panel B, the clustering algorithm is able to distinguish the three clusters and gives a GenTrain score of 0.58, however this kind of pattern was considered as a genotyping failure in our analysis because one of the homozygous cluster normalized Theta value does not fall in the [0, 0.1] or [0.9, 1] ranges. In panel C the clustering algorithm was not able to distinguish the three clusters because of low separation scores, and the SNP was automatically considered as a genotyping failure because of its low GenTrain score. D and E/ SNPs interpreted as genotyping failures either because of abnormal Theta values (D) or because of the presence of subgroups in a cluster (E).
Figure 2Distribution of the 200 in silico SNPs according to the number of ESTs considered for the detection.
Success rate of the genotyping assay.
| Category | Nb of SNPs ( | % of SNPs ( |
| Failed | 127 (50/77) | 33% (27%/38.5%) |
| Monomorphic | 60 (22/38) | 16% (12%/19%) |
| Polymorphic | 197 (112/85) | 51% (61%/42.5%) |
| Total | 384 (184/200) | 100% (100%/100%) |
Failed genotyping, i.e. GenTrain score <0.25 or cluster compression.
Genotyping successful but monomorphic SNPs.
Genotyping successful and polymorphic SNP.
Figure 3Allele frequency spectrum for 257 successfully genotyped in vitro and in silico SNPs.
Figure 4Genotyping success rate according to functionality score for the 384 SNPs of the assay.
The number of SNPs in each functionality score class is indicated above each bar.