| Literature DB >> 22694767 |
Mekki Boussaha1, René Guyomard, Cédric Cabau, Diane Esquerré, Edwige Quillet.
Abstract
BACKGROUND: There is considerable interest in developing high-throughput genotyping with single nucleotide polymorphisms (SNPs) for the identification of genes affecting important ecological or economical traits. SNPs are evenly distributed throughout the genome and are likely to be functionally relevant. In rainbow trout, in silico screening of EST databases represents an attractive approach for de novo SNP identification. Nevertheless, EST sequencing errors and assembly of EST paralogous sequences can lead to the identification of false positive SNPs which renders the reliability of EST-derived SNPs relatively low. Further validation of EST-derived SNPs is therefore required. The objective of this work was to assess the quality of and to validate a large number of rainbow trout EST-derived SNPs.Entities:
Mesh:
Year: 2012 PMID: 22694767 PMCID: PMC3536561 DOI: 10.1186/1471-2164-13-238
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Selection of the validation panel. Filters used to select EST-derived SNPs for validation from the INRA Sigenae SNP database release som10 were summarized.
Distribution of SNPs in EST contigs
| 1 | 6570 | 151 |
| 2 | 2962 | 141 |
| 3 | 1549 | 122 |
| 4 | 850 | 88 |
| 5 | 502 | 67 |
| 6 | 298 | 58 |
| 7 | 199 | 46 |
| 8 | 119 | 33 |
| 9 | 96 | 33 |
| 10 | 56 | 24 |
| > 10 | 173 | 100 |
Minor allele frequency of validated SNPs
| 352 | 390 | 367 | 351 | 346 | |
| 262 | 272 | 268 | 266 | 262 | |
| 344 | 296 | 322 | 341 | 350 | |
| 27 | 7 | 11 | 22 | 21 | |
| 30 | 33 | 30 | 36 | 26 | |
| 58 | 55 | 51 | 73 | 74 | |
| 63 | 55 | 71 | 56 | 66 | |
| 85 | 68 | 77 | 67 | 66 | |
| 81 | 78 | 82 | 87 | 97 | |
SNPs were clustered into different categories based on their observed MAF in rainbow trout DH individuals and in the three population samples analysed
* one snp was not considered for MAF calculation because of genotyping failure in all samples.
Distribution of the genotyped SNPs in contigs
| 1 | 151 | 133 | 34 | 13 | 86 |
| 2 | 154 | 126 | 39 | 21 | 66 |
| 3 | 141 | 115 | 34 | 24 | 57 |
| 4 | 98 | 79 | 30 | 20 | 29 |
| 5 | 91 | 69 | 25 | 20 | 24 |
| 6 | 81 | 72 | 24 | 30 | 18 |
| 7 | 70 | 53 | 29 | 14 | 10 |
| 8 | 48 | 39 | 18 | 11 | 10 |
| 9 | 45 | 40 | 19 | 12 | 9 |
| 10 | 38 | 29 | 7 | 12 | 10 |
| > 10 | 235 | 203 | 87 | 85 | 31 |
| Total | 1152 | 958 | 346 | 262 | 350 |
The number of SNPs identified within each contig type were summarized for the validation panel and for successfully genotyped, monomorphic, heterozygous and for true SNPs.
Figure 2Distribution of observed heterozygosity for true SNPs in three populations. SNPs were clustered into categories based on their observed heterozygosity values.
Figure 3Frequency distribution of PIC values across the three genotyped populations. SNPs were clustered into categories based on their observed PIC values.
Figure 4Prediction of the best value of K. Delta K analysis was performed as previously described (Evanno, 2005) in order to predict the best value of K.
Figure 5Genetic population structure predicted by STRUCTURE software. Genetic population structure was inferred with the structure software using K = 3, a burn-in period of 50,000 200,000 iterations for the likelihood estimation. Individuals 1 to 20 correspond to the INRA-SP population, individuals 21 to 40 correspond to the INRA-SY population and individuals 41 to 84 correspond to the NCCCWA population.
Observed occurrence of the different SNP types (transitions and transversions)
| Type | Validation panel | polymorphic SNPs | Syn. SNPs | Non Syn. SNPs | |
|---|---|---|---|---|---|
| Transition (Ts) | AG | 325 | 88 | 71 | 11 |
| CT | 529 | 184 | 169 | 4 | |
| Transversion (Tv) | AC | 98 | 28 | 22 | 5 |
| AT | 49 | 14 | 8 | 3 | |
| CG | 58 | 16 | 12 | 3 | |
| GT | 93 | 20 | 17 | 2 | |
| TOTAL | | 1152 | 350 | 299 | 28 |
| % ratio Ts/Tv | 2.87 | 3.48 | 4.07 | 1.15 |
Distribution of SNPs within INRA rainbow trout microsatellite-based linkage map
| GL1 | 8 | GL16 | 9 |
| GL2 | 10 | GL17 | 0 |
| GL3 | 9 | GL18 | 5 |
| GL4_25 | 8 | GL19 | 6 |
| GL5 | 3 | GL20 | 11 |
| GL6 | 4 | GL21 | 11 |
| GL7 | 8 | GL22 | 5 |
| GL8 | 13 | GL23 | 11 |
| GL9 | 12 | GL24 | 7 |
| GL10 | 6 | GL26 | 3 |
| GL11 | 3 | GL27 | 17 |
| GL12 | 16 | GL29 | 9 |
| GL13 | 6 | GL30 | 2 |
| GL14 | 5 | GL31 | 9 |
| GL15 | 7 |
The number of SNPs that were integrated in each rainbow trout linkage group (GL) were indicated.