| Literature DB >> 24004563 |
Chris Hozé1, Marie-Noëlle Fouilloux, Eric Venot, François Guillaume, Romain Dassonneville, Sébastien Fritz, Vincent Ducrocq, Florence Phocas, Didier Boichard, Pascal Croiseau.
Abstract
BACKGROUND: Genotyping with the medium-density Bovine SNP50 BeadChip® (50K) is now standard in cattle. The high-density BovineHD BeadChip®, which contains 777,609 single nucleotide polymorphisms (SNPs), was developed in 2010. Increasing marker density increases the level of linkage disequilibrium between quantitative trait loci (QTL) and SNPs and the accuracy of QTL localization and genomic selection. However, re-genotyping all animals with the high-density chip is not economically feasible. An alternative strategy is to genotype part of the animals with the high-density chip and to impute high-density genotypes for animals already genotyped with the 50K chip. Thus, it is necessary to investigate the error rate when imputing from the 50K to the high-density chip.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24004563 PMCID: PMC3846489 DOI: 10.1186/1297-9686-45-33
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Number of high-density genotyped animals and population structure per breed
| | | | | |
| Abondance (ABO) | 209 | 54 | 3.69 | 15 |
| Brown Swiss (BSW) | 99 | 52 | 1.90 | 28 |
| Holstein (HOL) | 788 | 204 | 2.30 | 21 |
| Montbéliarde (MON) | 530 | 139 | 3.77 | 18 |
| Normande (HOR) | 551 | 138 | 3.82 | 23 |
| Simmental (SIM) | 125 | 55 | 2.24 | 39 |
| Tarentaise (TAR) | 185 | 65 | 2.77 | 15 |
| | | | | |
| Aubrac (AUB) | 254 | 116 | 2.17 | 112 |
| Bazadaise (BAZ) | 89 | 60 | 1.45 | 46 |
| Blonde d'Aquitaine (BLA) | 327 | 187 | 1.74 | 78 |
| Charolais (CHA) | 672 | 310 | 2.14 | 249 |
| Gasconne (GAS) | 163 | 76 | 2.12 | 197 |
| Limousine (LIM) | 462 | 235 | 1.96 | 185 |
| Parthenaise (PAR) | 304 | 97 | 3.02 | 89 |
| Rouge des Prés (RDP) | 149 | 80 | 1.83 | 99 |
| Salers (SAL) | 246 | 186 | 1.31 | 99 |
Figure 1Linkage disequilibrium decay in 16 beef (dashed lines) and dairy (solid lines) cattle breeds.
Within-breed imputation error rate and others parameters affecting imputation error rate
| | | | | | |
| Abondance (ABO) | 169 | 40 | 0.75 | 0.217 | 0.146 |
| Brown Swiss (BSW) | 79 | 20 | 1.92 | 0.255 | 0.074 |
| Holstein (HOL) | 634 | 154 | 0.73 | 0.255 | 0.078 |
| Montbéliarde (MON) | 424 | 106 | 0.51 | 0.196 | 0.116 |
| Normande (HOR) | 444 | 107 | 0.33 | 0.233 | 0.104 |
| Simmental (SIM) | 100 | 25 | 2.55 | 0.209 | 0.050 |
| | | | | | |
| Aubrac (AUB) | 204 | 50 | 2.03 | 0.177 | 0.028 |
| Bazadaise (BAZ) | 72 | 17 | 2.07 | 0.239 | 0.038 |
| Blonde d'Aquitaine (BLA) | 262 | 65 | 1.80 | 0.175 | 0.038 |
| Charolais (CHA) | 539 | 133 | 0.68 | 0.176 | 0.018 |
| Gasconne (GAS) | 131 | 32 | 2.26 | 0.174 | 0.026 |
| Limousine (LIM) | 370 | 92 | 1.09 | 0.164 | 0.014 |
| Parthenaise (PAR) | 245 | 59 | 1.88 | 0.161 | 0.024 |
| Rouge des Prés (RDP) | 119 | 30 | 2.39 | 0.206 | 0.028 |
| Salers (SAL) | 197 | 49 | 1.27 | 0.213 | 0.024 |
Size of the training and validation populations, within-breed imputation error rate, level of linkage disequilibrium (LD, r2)) at 70 kb and average relationship between training and validation populations (RT/V).
Figure 2Relationship between allelic imputation error rate and reference population size in beef (black) and dairy (gray) cattle breeds. Breeds with more than 300 animals in the reference population are represented by a square, those with more than 200 animals by a circle, and those with 200 or less animals by a diamond.
Figure 3Relationship between allelic imputation error rate and number of effective ancestors in beef (black) and dairy (gray) cattle breeds. Breeds with more than 300 animals in the reference population are represented by a square, those with more than 200 animals by a circle, and those with 200 or less animals by a diamond.
Figure 4Relationship between allelic imputation error rate and the genetic relationship between target and validation populations in beef (black) and dairy (gray) cattle breeds. Breeds with more than 300 animals in the reference population are represented by a square, those with more than 200 animals by a circle, and those with 200 or less animals by a diamond.
Results of the multiple linear regression model
| Reference population size | 52% | −0.0026 ± 0.0006 | 0.002 |
| Relationship between reference and validation populations | 25% | −12.6042 ± 3.8937 | 0.008 |
| Level of linkage disequilibrium at 70 kb | 2.5% | −0.0028 ± 0.0026 | 0.305 |
| Number of effective ancestors | 0.03% | −0.0576 ± 4.4641 | 0.899 |
The model tests the effect of reference population size, relationship between reference and validation populations, level of linkage disequilibrium at 70 kb, and number of effective ancestors on imputation error rate.
Figure 5Allelic imputation error rate along the genome in Montbéliarde breed.
Figure 6Relationship between allelic imputation error rate and minor allele frequency in Montbéliarde breed.
Imputation error rate using single-breed populations compared to a multi-breed reference population for three breeds
| Size of training / validation population | 159 / 40 | 422 / 106 | 146 / 36 |
| Single-breed imputation error rate (%) | 0.755 | 0.487 | 0.763 |
| Multi-breed imputation error rate (%) | 0.753 | 0.485 | 0.824 |