| Literature DB >> 25544265 |
Vivian P S Felipe1, Hayrettin Okut2, Daniel Gianola3, Martinho A Silva4, Guilherme J M Rosa5.
Abstract
BACKGROUND: Genotype imputation is an important tool for whole-genome prediction as it allows cost reduction of individual genotyping. However, benefits of genotype imputation have been evaluated mostly for linear additive genetic models. In this study we investigated the impact of employing imputed genotypes when using more elaborated models of phenotype prediction. Our hypothesis was that such models would be able to track genetic signals using the observed genotypes only, with no additional information to be gained from imputed genotypes.Entities:
Mesh:
Year: 2014 PMID: 25544265 PMCID: PMC4333171 DOI: 10.1186/s12863-014-0149-9
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Overall imputation accuracy and error distribution for 90, 75 and 50% of masked genotypes
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Accuracy | 0.75 | 0.79 | 0.91 | 0.94 | 0.97 | 0.98 |
| 0*<−>1* errora | 0.16 | 0.17 | 0.22 | 0.25 | 0.26 | 0.20 |
| 1<−>2* errorb | 0.50 | 0.54 | 0.61 | 0.63 | 0.62 | 0.65 |
| 0<−>2 errorc | 0.09 | 0.08 | 0.08 | 0.06 | 0.09 | 0.13 |
aError due to change from 0 to 1 genotype code or vice versa.
bError due to change from 1 to 2 genotype code or vice versa.
cError due to change from 0 to 2 genotype code or vice versa.
*Genotypes are coded as 0, 1 and 2 as the number of copies of the more frequent allele.
Correlations between predicted and observed body weight for all masking rates and family layouts
|
| ||||||
|---|---|---|---|---|---|---|
|
|
|
| ||||
| 1809 | 1809ia | 201 | 1809 | 1809ia | 201 | |
| BL | 0.347 | 0.259 | 0.169 | 0.500 | 0.330 | 0.407 |
| RKHS | 0.347 | 0.312 | 0.210 | 0.527 | 0.417 | 0.499 |
| BRANN | 0.330 | 0.217 | 0.144 | 0.490 | 0.274 | 0.392 |
|
| ||||||
|
|
|
| ||||
| 1809 | 1809ib | 453 | 1809 | 1809ib | 453 | |
| BL | 0.343 | 0.291 | 0.262 | 0.499 | 0.447 | 0.430 |
| RKHS | 0.348 | 0.317 | 0.293 | 0.528 | 0.506 | 0.501 |
| BRANN | 0.320 | 0.241 | 0.255 | 0.492 | 0.414 | 0.428 |
|
| ||||||
|
|
|
| ||||
| 1809 | 1809ic | 905 | 1809 | 1809ic | 905 | |
| BL | 0.342 | 0.324 | 0.271 | 0.499 | 0.496 | 0.477 |
| RKHS | 0.343 | 0.345 | 0.306 | 0.530 | 0.530 | 0.520 |
| BRANN | 0.320 | 0.281 | 0.252 | 0.492 | 0.478 | 0.461 |
aImputed from 201 SNPs.
bImputed from 453 SNPs.
cImputed from 905 SNPs.
*BL: Bayesian LASSO; RKHS: Reproducing Kernel Hilbert Spaces (RKHS) and; BRANN: Bayesian Regularized Neural Networks.
Correlations between predicted and observed body mass index for all genotype masking rates and family layouts
|
| ||||||
|---|---|---|---|---|---|---|
|
|
|
| ||||
| 1809 | 1809ia | 201 | 1809 | 1809ia | 201 | |
| BL | 0.227 | 0.193 | 0.191 | 0.199 | 0.164 | −0.047 |
| RKHS | 0.238 | 0.195 | 0.199 | 0.208 | 0.132 | −0.054 |
| BRANN | 0.112 | 0.092 | 0.147 | 0.163 | 0.041 | 0.054 |
|
| ||||||
|
|
|
| ||||
| 1809 | 1809ib | 453 | 1809 | 1809ib | 453 | |
| BL | 0.228 | 0.219 | 0.199 | 0.200 | 0.196 | 0.184 |
| RKHS | 0.238 | 0.226 | 0.211 | 0.208 | 0.204 | 0.200 |
| BRANN | 0.118 | 0.115 | 0.145 | 0.172 | 0.154 | 0.170 |
|
| ||||||
|
|
|
| ||||
| 1809 | 1809ic | 905 | 1809 | 1809ic | 905 | |
| BL | 0.227 | 0.231 | 0.225 | 0.199 | 0.197 | 0.189 |
| RKHS | 0.238 | 0.238 | 0.236 | 0.207 | 0.206 | 0.202 |
| BRANN | 0.118 | 0.131 | 0.149 | 0.172 | 0.168 | 0.149 |
aImputed from 201 SNPs.
bImputed from 453 SNPs.
cImputed from 905 SNPs.
*BL: Bayesian LASSO; RKHS: Reproducing Kernel Hilbert Spaces (RKHS) and; BRANN: Bayesian Regularized Neural Networks.
Prediction mean squared errors for body weight analysis by family layouts and genotype masking rates
|
| ||||||
|---|---|---|---|---|---|---|
|
|
|
| ||||
| 1809 | 1809ia | 201 | 1809 | 1809ia | 201 | |
| BL | 5.03 | 5.32 | 5.67 | 4.18 | 4.99 | 4.71 |
| RKHS | 4.92 | 5.20 | 5.36 | 4.15 | 4.75 | 4.66 |
| BRANN | 5.36 | 5.52 | 5.54 | 5.26 | 5.40 | 5.52 |
|
| ||||||
|
|
|
| ||||
| 1809 | 1809ib | 453 | 1809 | 1809ib | 453 | |
| BL | 5.05 | 5.25 | 5.44 | 4.18 | 4.45 | 4.52 |
| RKHS | 4.92 | 5.04 | 5.11 | 4.13 | 4.23 | 4.21 |
| BRANN | 5.38 | 5.44 | 5.44 | 5.26 | 5.32 | 5.33 |
|
| ||||||
|
|
|
| ||||
| 1809 | 1809ic | 905 | 1809 | 1809ic | 905 | |
| BL | 5.06 | 5.12 | 5.49 | 4.18 | 4.19 | 4.32 |
| RKHS | 4.94 | 4.94 | 5.01 | 4.06 | 4.08 | 4.12 |
| BRANN | 5.20 | 5.24 | 5.44 | 5.26 | 5.27 | 5.28 |
aImputed from 201 SNPs.
bImputed from 453 SNPs.
cImputed from 905 SNPs.
*BL: Bayesian LASSO; RKHS: Reproducing Kernel Hilbert Spaces (RKHS) and; BRANN: Bayesian Regularized Neural Networks.
Prediction mean squared errors for body mass index analysis by family layouts and genotype masking rates
|
| ||||||
|---|---|---|---|---|---|---|
|
|
|
| ||||
| 1809 | 1809ia | 201 | 1809 | 1809ia | 201 | |
| BL | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 |
| RKHS | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 |
| BRANN | 0.013 | 0.014 | 0.010 | 0.042 | 0.036 | 0.024 |
|
| ||||||
|
|
|
| ||||
| 1809 | 1809ib | 453 | 1809 | 1809ib | 453 | |
| BL | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 |
| RKHS | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 |
| BRANN | 0.021 | 0.023 | 0.015 | 0.044 | 0.045 | 0.041 |
|
| ||||||
|
|
|
| ||||
| 1809 | 1809ic | 905 | 1809 | 1809ic | 905 | |
| BL | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 |
| RKHS | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 |
| BRANN | 0.021 | 0.023 | 0.016 | 0.040 | 0.040 | 0.047 |
aImputed from 201 SNPs.
bImputed from 453 SNPs.
cImputed from 905 SNPs.
*BL: Bayesian LASSO; RKHS: Reproducing Kernel Hilbert Spaces (RKHS) and; BRANN: Bayesian Regularized Neural Networks.
Number and distribution of individuals by trait and cross validation strategy employed
|
|
|
|
| ||
|---|---|---|---|---|---|
|
|
|
|
| ||
| BW | 1,200 | 681 | 1,200 | 681 | 1,881 |
| BMI | 1,165 | 658 | 1,161 | 662 | 1,823 |
*BMI: body mass index; BW: body weight at ten weeks of age.
Figure 1Artificial Neural Network architecture with two layers containing 5 neurons in the hidden layer and one neuron in the output layer. The x are the inputs for each animal i, and p is the number of SNPs; the w are the weights where k is the hidden layer neuron indicator and j is the index for SNP; are the hidden layer biases, where k and l are the indexes for neurons and layers, respectively, and b is the output neuron bias.