| Literature DB >> 26754638 |
Jinzhuang Dou1,2, Xue Li1, Qiang Fu1, Wenqian Jiao1, Yangping Li1, Tianqi Li1, Yangfan Wang1, Xiaoli Hu1,3, Shi Wang1,4, Zhenmin Bao1,3.
Abstract
The recently developed 2b-restriction site-associated DNA (2b-RAD) sequencing method provides a cost-effective and flexible genotyping platform for aquaculture species lacking sufficient genomic resources. Here, we evaluated the performance of this method in the genomic selection (GS) of Yesso scallop (Patinopecten yessoensis) through simulation and real data analyses using six statistical models. Our simulation analysis revealed that the prediction accuracies obtained using the 2b-RAD markers were slightly lower than those obtained using all polymorphic loci in the genome. Furthermore, a small subset of markers obtained from a reduced tag representation (RTR) library presented comparable performance to that obtained using all markers, making RTR be an attractive approach for GS purpose. Six GS models exhibited variable performance in prediction accuracy depending on the scenarios (e.g., heritability, sample size, population structure), but Bayes-alphabet and BLUP-based models generally outperformed other models. Finally, we performed the evaluation using an empirical dataset composed of 349 Yesso scallops that were derived from five families. The prediction accuracy for this empirical dataset could reach 0.4 based on optimal GS models. In summary, the genotyping flexibility and cost-effectiveness make 2b-RAD be an ideal genotyping platform for genomic selection in aquaculture breeding programs.Entities:
Mesh:
Year: 2016 PMID: 26754638 PMCID: PMC4709697 DOI: 10.1038/srep19244
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
The parameters used for scallop population/family simulation.
| Simulation | Dataset type | Sample size | Marker density | Genetic model |
|---|---|---|---|---|
| 1 | Population-based | 2,000 | HD-SNPs (250 k) MD-SNPs (61 k) LD-SNPs (5 k) | Additive |
| 2 | Family-based | 20 × 50 | LD-SNPs (2,364) | Additive |
| 3 | Family-based | 5 × 50 | LD-SNPs (2,364) | Additive + Dominant |
a20 × 50 denotes a population composed of 20 sub-families with each containing 50 samples.
Accuracy of GEBVs estimated from the simulated population-based datasets (Simulation 1) under different heritabilities.
| Case | Marker density | Method | |||||
|---|---|---|---|---|---|---|---|
| G1 | BLUP | 0.29 | 0.47 | 0.53 | 0.68 | 0.70 | |
| HD | LASSO | 0.50 | 0.54 | 0.65 | 0.79 | 0.78 | |
| RR-BLUP | 0.74 | 0.82 | 0.89 | 0.92 | 0.92 | ||
| BL | 0.29 | 0.42 | 0.47 | 0.63 | 0.65 | ||
| G-BLUP | 0.74 | 0.81 | 0.88 | 0.94 | 0.94 | ||
| BayesA | 0.74 | 0.82 | 0.88 | 0.94 | 0.94 | ||
| BayesB | 0.73 | 0.82 | 0.89 | 0.94 | 0.93 | ||
| MD | LASSO | 0.39 | 0.56 | 0.60 | 0.75 | 0.78 | |
| RR-BLUP | 0.69 | 0.86 | 0.87 | 0.92 | 0.92 | ||
| BL | 0.23 | 0.39 | 0.54 | 0.63 | 0.65 | ||
| G-BLUP | 0.70 | 0.86 | 0.87 | 0.92 | 0.92 | ||
| BayesA | 0.70 | 0.86 | 0.87 | 0.92 | 0.92 | ||
| BayesB | 0.70 | 0.86 | 0.87 | 0.92 | 0.92 | ||
| LD | LASSO | 0.44 | 0.50 | 0.64 | 0.77 | 0.83 | |
| RR-BLUP | 0.47 | 0.72 | 0.88 | 0.90 | 0.92 | ||
| BL | 0.23 | 0.33 | 0.43 | 0.55 | 0.66 | ||
| G-BLUP | 0.69 | 0.82 | 0.84 | 0.90 | 0.92 | ||
| BayesA | 0.72 | 0.82 | 0.86 | 0.90 | 0.92 | ||
| BayesB | 0.76 | 0.82 | 0.90 | 0.90 | 0.92 | ||
| G1- > G2 | BLUP | 0.24 | 0.37 | 0.57 | 0.59 | 0.76 | |
| HD | LASSO | 0.39 | 0.62 | 0.61 | 0.68 | 0.81 | |
| RR-BLUP | 0.77 | 0.90 | 0.91 | 0.91 | 0.94 | ||
| BL | 0.59 | 0.76 | 0.79 | 0.82 | 0.89 | ||
| G-BLUP | 0.74 | 0.87 | 0.92 | 0.93 | 0.94 | ||
| BayesA | 0.73 | 0.86 | 0.91 | 0.93 | 0.93 | ||
| BayesB | 0.73 | 0.87 | 0.91 | 0.93 | 0.95 | ||
| MD | LASSO | 0.31 | 0.48 | 0.63 | 0.77 | 0.78 | |
| RR-BLUP | 0.73 | 0.85 | 0.91 | 0.94 | 0.94 | ||
| BL | 0.26 | 0.39 | 0.48 | 0.59 | 0.65 | ||
| G-BLUP | 0.72 | 0.85 | 0.91 | 0.93 | 0.93 | ||
| BayesA | 0.73 | 0.85 | 0.91 | 0.93 | 0.93 | ||
| BayesB | 0.73 | 0.86 | 0.91 | 0.93 | 0.93 | ||
| LD | LASSO | 0.44 | 0.67 | 0.72 | 0.83 | 0.87 | |
| RR-BLUP | 0.27 | 0.89 | 0.89 | 0.92 | 0.93 | ||
| BL | 0.31 | 0.38 | 0.51 | 0.56 | 0.65 | ||
| G-BLUP | 0.76 | 0.88 | 0.88 | 0.92 | 0.94 | ||
| BayesA | 0.80 | 0.85 | 0.89 | 0.92 | 0.94 | ||
| BayesB | 0.81 | 0.89 | 0.89 | 0.92 | 0.93 |
Accuracy of GEBVs estimated from the simulated family-based datasets (Simulation 2) under the low marker density.
| No. of families | Model | |||||
|---|---|---|---|---|---|---|
| 5 | BLUP | 0.27 | 0.39 | 0.47 | 0.63 | 0.69 |
| LASSO | 0.33 | 0.43 | 0.41 | 0.52 | 0.60 | |
| RR-BLUP | 0.27 | 0.28 | 0.32 | 0.32 | 0.42 | |
| BL | 0.16 | 0.24 | 0.39 | 0.56 | 0.58 | |
| G-BLUP | 0.63 | 0.84 | 0.89 | 0.91 | 0.92 | |
| BayesA | 0.67 | 0.83 | 0.89 | 0.90 | 0.92 | |
| BayesB | 0.66 | 0.83 | 0.90 | 0.90 | 0.92 | |
| 10 | BLUP | 0.31 | 0.43 | 0.55 | 0.63 | 0.69 |
| LASSO | 0.24 | 0.45 | 0.49 | 0.60 | 0.69 | |
| RR-BLUP | 0.24 | 0.28 | 0.70 | 0.61 | 0.66 | |
| BL | 0.14 | 0.39 | 0.41 | 0.53 | 0.65 | |
| G-BLUP | 0.74 | 0.82 | 0.84 | 0.90 | 0.93 | |
| BayesA | 0.79 | 0.81 | 0.84 | 0.90 | 0.93 | |
| BayesB | 0.82 | 0.84 | 0.87 | 0.90 | 0.92 | |
| 15 | BLUP | 0.36 | 0.44 | 0.55 | 0.63 | 0.69 |
| LASSO | 0.25 | 0.48 | 0.61 | 0.72 | 0.77 | |
| RR-BLUP | 0.25 | 0.28 | 0.88 | 0.88 | 0.92 | |
| BL | 0.13 | 0.37 | 0.43 | 0.54 | 0.68 | |
| G-BLUP | 0.66 | 0.83 | 0.89 | 0.88 | 0.93 | |
| BayesA | 0.68 | 0.83 | 0.89 | 0.89 | 0.93 | |
| BayesB | 0.68 | 0.84 | 0.89 | 0.90 | 0.93 | |
| 20 | BLUP | 0.38 | 0.44 | 0.57 | 0.67 | 0.70 |
| LASSO | 0.37 | 0.50 | 0.65 | 0.75 | 0.79 | |
| RR-BLUP | 0.31 | 0.84 | 0.87 | 0.90 | 0.92 | |
| BL | 0.25 | 0.35 | 0.50 | 0.57 | 0.61 | |
| G-BLUP | 0.70 | 0.75 | 0.86 | 0.90 | 0.92 | |
| BayesA | 0.75 | 0.80 | 0.86 | 0.90 | 0.92 | |
| BayesB | 0.80 | 0.84 | 0.87 | 0.91 | 0.92 |
Figure 1Box and whisker plots of three traits shown for five Yesso scallop families.
SH, shell height; SL, shell length; SW, shell width.
Figure 2Distribution of the minor allele frequencies of 2,364 markers in five Yesso scallop families.
Figure 3Principal component analysis (a) and genetic kinships (b) of the five empirical families based on 2,364 markers.
Estimation of variance components and heritabilities for three traits including shell height (SH), shell length (SL) and shell width (SW).
| Across-family | Fam1 | Fam2 | Fam3 | Fam4 | Fam5 | |
|---|---|---|---|---|---|---|
| SH | ||||||
| σa2 | 14.58 (2.56 | 9.21 (2.49) | 6.38 (2.13) | 8.56 (1.10) | 12.27 (4.21) | 17.8 (2.75) |
| σe2 | 15.84 (1.18) | 14.88 (1.05) | 16.31 (1.15) | 7.01 (0.55) | 19.09 (1.51) | 14.2 (0.96) |
| h2 | 0.48 (0.05) | 0.38 (0.07) | 0.28 (0.08) | 0.41 (0.05) | 0.39 (0.09) | 0.54 (0.06) |
| SL | ||||||
| σa2 | 16.30 (2.68) | 9.93 (3.07) | 7.24 (3.11) | 1.79 (0.87) | 7.74 (1.73) | 5.92 (2.51) |
| σe2 | 17.48 (1.16) | 16.42 (1.32) | 20.47 (1.39) | 14.23 (0.98) | 25.81 (2.14) | 15.29 (1.34) |
| h2 | 0.48 (0.05) | 0.38 (0.08) | 0.26 (0.06) | 0.11 (0.05) | 0.23 (0.05) | 0.26 (0.09) |
| SW | ||||||
| σa2 | 2.66 (0.29) | 0.52 (0.37) | 2.15 (0.35) | 0.91 (0.04) | 2.53 (0.14) | 1.91 (0.16) |
| σe2 | 4.68 (0.15) | 3.07 (0.15) | 2.37 (0.15) | 1.24 (0.04) | 1.32 (0.06) | 2.43 (0.08) |
| h2 | 0.36 (0.06) | 0.15 (0.08) | 0.48 (0.05) | 0.67 (0.03) | 0.65 (0.04) | 0.71 (0.04) |
The genetic variances (σa2), error variance (σe2), and narrow-sense heritabilities (h2) were calculated for the entire population and individual families.
aStandard error.
Accuracy of GEBVs assessed by five-fold cross-validation based on a combined dataset consisting of five scallop families.
| SH | SL | SW | SH | SL | SW | |
|---|---|---|---|---|---|---|
| LASSO | 0.20 (0.09) | 0.27(0.13) | 0.15 (0.10) | 0.29 (0.13) | 0.39 (0.19) | 0.25 (0.17) |
| RR-BLUP | 0.30 (0.16) | 0.37 (0.09) | 0.18 (0.08) | 0.43 (0.23) | 0.53 (0.13) | 0.30 (0.13) |
| BL | 0.31 (0.16) | 0.36 (0.08) | 0.15 (0.07) | 0.44 (0.23) | 0.51 (0.12) | 0.25 (0.12) |
| G-BLUP | 0.37 (0.08) | 0.32 (0.09) | 0.33 (0.09) | 0.53 (0.12) | 0.46 (0.13) | 0.55 (0.15) |
| BayesA | 0.40 (0.07) | 0.33 (0.08) | 0.35 (0.09) | 0.57 (0.10) | 0.47 (0.12) | 0.58 (0.15) |
| BayesB | 0.40 (0.07) | 0.34 (0.07) | 0.36 (0.08) | 0.57 (0.10) | 0.49 (0.10) | 0.60 (0.13) |
aStandard error.
bThe correlation between EBV and TBV is calculated as the r(y, EBV) divided by the square root of the heritability of a given trait.
Figure 4Principal component analysis of five GS models based on the estimated genetic effects of 2,364 markers.
G-BLUP is not included in comparison because genetic effect is not estimated for individual markers in this model.
The correlation of marker effects estimated using five GS models based on a combined family dataset for the trait of shell length.
| LASSO | RR-BLUP | BL | BayesA | BayesB | |
|---|---|---|---|---|---|
| LASSO | 1.00 | 0.32 | 0.30 | 0.26 | 0.23 |
| RR-BLUP | 1.00 | 0.82 | 0.72 | 0.72 | |
| BL | 1.00 | 0.62 | 0.62 | ||
| BayesA | 1.00 | 0.87 | |||
| BayesB | 1.00 |