| Literature DB >> 21329522 |
Oscar González-Recio1, Selma Forni.
Abstract
BACKGROUND: Genomic selection has gained much attention and the main goal is to increase the predictive accuracy and the genetic gain in livestock using dense marker information. Most methods dealing with the large p (number of covariates) small n (number of observations) problem have dealt only with continuous traits, but there are many important traits in livestock that are recorded in a discrete fashion (e.g. pregnancy outcome, disease resistance). It is necessary to evaluate alternatives to analyze discrete traits in a genome-wide prediction context.Entities:
Mesh:
Year: 2011 PMID: 21329522 PMCID: PMC3400433 DOI: 10.1186/1297-9686-43-7
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Accuracy (standard error across replicates in parentheses), measured as Pearson correlation between predicted and true genomic assisted values, and area under the operating characteristic curve for different methods and number of QTL
| # QTL | TBA | BTL | RF | L2B | LhB | |
|---|---|---|---|---|---|---|
| Pearson correlation | 90 | 0.26 | 0.33 | 0.36 | 0.37 | |
| 1000 | 0.32 (0.16) | 0.30 (0.03) | 0.24 (0.01) | 0.34 (0.02) | ||
| AUC | 90 | 0.61 | 0.65 | 0.65 | 0.69 | |
| 1000 | 0.66 (0.01) | 0.66 (0.00) | 0.63 (0.01) | 0.66 (0.01) | ||
1Higher value is desirable; the best value is in bold face; TBA = Threshold Bayes A, BTL = Bayesian Threshold LASSO, RF = Random Forest; L2B = L2-boosting algorithm, LhB = Lh-boosting algorithm
Figure 1SNP covariate relative variable importance (VI) in each line using random forest algorithm.
Specificity, sensitivity, phi correlation and misclassification rate for each model at detecting different α and (1-α) percentiles of extreme animals in the testing set within line A
| Parameter | Method | α (number of records) | |||
|---|---|---|---|---|---|
| Specificity1 | TBA | 0.71 | 0.58 | 0.56 | |
| BTL | 0.75 | 0.74 | |||
| RF | 0.88 | ||||
| L2B | 0.75 | 0.71 | 0.64 | 0.65 | |
| LhB | 0.75 | 0.71 | 0.61 | 0.67 | |
| Sensitivity1 | TBA | 0.75 | |||
| BTL | 0.75 | 0.53 | 0.53 | 0.47 | |
| RF | 0.52 | 0.52 | 0.46 | ||
| L2B | 0.75 | 0.48 | 0.48 | 0.51 | |
| LhB | 0.50 | 0.45 | 0.45 | 0.42 | |
| Phi correlation1 | TBA | 0.71 | 0.24 | 0.16 | 0.13 |
| BTL | 0.71 | 0.27 | 0.22 | ||
| RF | 0.33 | ||||
| L2B | 0.48 | 0.16 | 0.12 | 0.17 | |
| LhB | 0.24 | 0.13 | 0.06 | 0.09 | |
| Misclassification rate (%)2 | TBA | 17 | 39 | 42 | 43 |
| BTL | 17 | 40 | |||
| RF | 41 | ||||
| L2B | 25 | 47 | 46 | 42 | |
| LhB | 42 | 49 | 49 | 46 | |
1Higher value is desirable; the best value for each percentile is in bold face;
2Lower value is desirable; the best value for each percentile is in bold face;
TBA = Threshold Bayes A, BTL = Bayesian Threshold LASSO, RF = Random Forest; L2B = L2-boosting algorithm, LhB = Lh-boosting algorithm
Specificity, sensitivity, phi correlation and misclassification rate for each model at detecting different α and (1-α) percentiles of extreme animals in the testing set within line B
| Parameter | Method | α (number of records) | |||
|---|---|---|---|---|---|
| Specificity1 | TBA | 0.75 | |||
| BTL | 0.75 | 0.61 | 0.58 | ||
| RF | 0.75 | 0.57 | 0.48 | 0.37 | |
| L2B | 0.71 | 0.57 | 0.48 | ||
| LhB | 0.75 | 0.71 | 0.57 | 0.63 | |
| Sensitivity1 | TBA | 0.95 | 0.64 | 0.58 | |
| BTL | 0.75 | 0.75 | |||
| RF | |||||
| L2B | 0.72 | 0.56 | 0.64 | ||
| LhB | 0.67 | 0.78 | 0.73 | 0.69 | |
| Phi correlation1 | TBA | 0.75 | 0.80 | 0.34 | 0.34 |
| BTL | 0.75 | 0.34 | 0.32 | ||
| RF | 0.75 | 0.70 | |||
| L2B | 0.40 | 0.12 | 0.12 | ||
| LhB | 0.42 | 0.46 | 0.28 | 0.32 | |
| Misclassification rate (%)2 | TBA | 14 | 8 | 35 | 34 |
| BTL | 14 | 29 | 32 | ||
| RF | 14 | 12 | |||
| L2B | 28 | 44 | 43 | ||
| LhB | 29 | 24 | 32 | 36 | |
1Higher value is desirable; the best value for each percentile is in bold face;
2Lower value is desirable; the best value for each percentile is in bold face;
TBA = Threshold Bayes A, BTL = Bayesian Threshold LASSO, RF = Random Forest; L2B = L2-boosting algorithm, LhB = Lh-boosting algorithm
Specificity, sensitivity, phi correlation and misclassification rate for each model at detecting different α and (1-α) percentiles of extreme animals in the testing set within line C
| Parameter | Method | α (number of records) | |||
|---|---|---|---|---|---|
| Specificity1 | TBA | 0.50 | 0.64 | 0.71 | |
| BL | 0 | 0.25 | 0.61 | 0.71 | |
| RF | 0.75 | 0.75 | 0.71 | ||
| L2B | |||||
| LhB | 0.82 | 0.69 | |||
| Sensitivity1 | TBA | 0.33 | 0.30 | ||
| BL | 0.30 | 0.44 | 0.43 | ||
| RF | 0.33 | 0.52 | 0.51 | ||
| L2B | 0.17 | 0.20 | 0.15 | 0.15 | |
| LhB | 0.33 | 0.20 | 0.46 | 0.45 | |
| Phi correlation1 | TBA | -0.16 | 0.17 | ||
| BL | -0.35 | -0.35 | 0.05 | 0.15 | |
| RF | 0.08 | 0.26 | 0.23 | ||
| L2B | 0.17 | 0.17 | |||
| LhB | 0.15 | ||||
| Misclassification rate (%)2 | TBA | 67 | 43 | ||
| BL | 71 | 50 | 43 | ||
| RF | 39 | ||||
| L2B | 71 | 67 | 56 | 44 | |
| LhB | 67 | 41 | 43 | ||
1Higher value is desirable; the best value for each percentile is in bold face;
2Lower value is desirable; the best value for each percentile is in bold face;
TBA = Threshold Bayes A, BTL = Bayesian Threshold LASSO, RF = Random Forest; L2B = L2-boosting algorithm, LhB = Lh-boosting algorithm
Area under the receiver operating characteristic curve1 for each model and breed line in the field pig data
| TBA | BL | RF | L2B | LhB | |
|---|---|---|---|---|---|
| Line A | 0.64 | 0.65 | 0.55 | 0.60 | |
| Line B | 0.70 | 0.69 | 0.60 | 0.72 | |
| Line C | 0.62 | 0.62 | 0.66 |
TBA = Threshold Bayes A; BTL = Bayesian Threshold LASSO; RF = Random Forest; L2B = L2-boosting algorithm; LhB = Lh-boosting algorithm
1Higher value is desirable; the best value for each line is in bold face