| Literature DB >> 19939274 |
Cecilia Castaño Sánchez1, Timothy P L Smith, Ralph T Wiedmann, Roger L Vallejo, Mohamed Salem, Jianbo Yao, Caird E Rexroad.
Abstract
BACKGROUND: To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19939274 PMCID: PMC2790473 DOI: 10.1186/1471-2164-10-559
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Gel image of rainbow trout DNA digested with . DNA fragments between 400 and 500 bp were excised from both lanes containing HaeIII digested pooled DNA. Slices were cut every 2 mm or approximately 20 bp.
Sequence data analysis results
| Newbler1 | Newbler2 | Velvet | |
|---|---|---|---|
| 149,096 | 125,081 | ||
| 94,000 | 69,000 | ||
| 5.9× | 6.4× | ||
| 47,128 | 22,022 | 33,624 | |
| 24,627 | 13,140 | 20,963 | |
Number of genotyped and validated markers from the three data analysis approaches.
| Approach | Tested | Validated | ||
|---|---|---|---|---|
| #markers | % | #markers | % | |
| 47 | 12.2 | 17 | 36 | |
| 46 | 12.0 | 22 | 48 | |
| 47 | 12.2 | 15 | 32 | |
| 100 | 26.1 | 53 | 53 | |
| 47 | 12.2 | 30 | 64 | |
| 47 | 12.2 | 18 | 38 | |
| 50 | 13.1 | 28 | 56 | |
N1, N2 and V1 represent SNPs predicted by only one of those three analyses; N1N2, V1N1, V1N2, and V1N1N2 indicate SNPs predicted by multiple analyses. Tested percentage refers to the number of markers in each group compared to all; the rate of validation within each approach is given as the validated percentage.
Validated SNPs annotation.
| OMS | GENE | Percentage_homology | E_VALUE | Annotation |
|---|---|---|---|---|
| OMS00035 | CU065093 | 98.01 | 1E-101 | AAI16452-B-cell CLL/lymphoma 9 [Homo sapiens] |
| OMS00028 | TC158345 | 99.01 | 1E-104 | CAG02508-unnamed protein product [Tetraodon nigroviridis] |
| OMS00080 | TC139092 | 99.49 | 6E-106 | Unknown |
| OMS00122 | TC155403 | 99.49 | 3E-105 | ACI66580-Lipocalin precursor [Salmo salar] |