| Literature DB >> 26250698 |
Tatiane C S Chud1, Ricardo V Ventura2,3, Flavio S Schenkel4, Roberto Carvalheiro5, Marcos E Buzanskas6, Jaqueline O Rosa7, Maurício de Alvarenga Mudadu8, Marcos Vinicius G B da Silva9, Fabiana B Mokry10, Cintia R Marcondes11, Luciana C A Regitano12, Danísio P Munari13.
Abstract
BACKGROUND: Genotype imputation has been used to increase genomic information, allow more animals in genome-wide analyses, and reduce genotyping costs. In Brazilian beef cattle production, many animals are resulting from crossbreeding and such an event may alter linkage disequilibrium patterns. Thus, the challenge is to obtain accurately imputed genotypes in crossbred animals. The objective of this study was to evaluate the best fitting and most accurate imputation strategy on the MA genetic group (the progeny of a Charolais sire mated with crossbred Canchim X Zebu cows) and Canchim cattle. The data set contained 400 animals (born between 1999 and 2005) genotyped with the Illumina BovineHD panel. Imputation accuracy of genotypes from the Illumina-Bovine3K (3K), Illumina-BovineLD (6K), GeneSeek-Genomic-Profiler (GGP) BeefLD (GGP9K), GGP-IndicusLD (GGP20Ki), Illumina-BovineSNP50 (50K), GGP-IndicusHD (GGP75Ki), and GGP-BeefHD (GGP80K) to Illumina-BovineHD (HD) SNP panels were investigated. Seven scenarios for reference and target populations were tested; the animals were grouped according with birth year (S1), genetic groups (S2 and S3), genetic groups and birth year (S4 and S5), gender (S6), and gender and birth year (S7). Analyses were performed using FImpute and BEAGLE software and computation run-time was recorded. Genotype imputation accuracy was measured by concordance rate (CR) and allelic R square (R(2)).Entities:
Mesh:
Year: 2015 PMID: 26250698 PMCID: PMC4527250 DOI: 10.1186/s12863-015-0251-7
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Number of SNPs in common between LDa panel and the HDb panel
| LD Panel | Label | SNPs in original LD panel | SNPs in common after QCc |
|---|---|---|---|
| Illumina® Bovine 3K | 3K | 2,900 | 2,341 |
| Illumina® Bovine LD | 6K | 6,909 | 6,280 |
| GGP Beef LD | GGP9K | 8,762 | 7,548 |
| GGP Indicus LD | GGP20Ki | 19,721 | 14,305 |
| Illumina® BovineSNP50 | 50K | 54,609 | 38,802 |
| GGP Indicus HD | GGP75Ki | 74,085 | 50,038 |
| GGP Beef HD | GGP80K | 76,992 | 67,143 |
aLD: low-density, bHD: high-density panel, cQC: quality Control
Description of imputation scenarios and number of animals in referencea and targetb population
| Scenarios | Description | Number of animals | |||
|---|---|---|---|---|---|
| Charolais | Canchim | MA | Total | ||
| S1 | Animals born prior to 2005a | 1 | 184 | 68 | 253 |
| Animals born in 2005b | 0 | 99 | 44 | 143 | |
| S2 | All Canchim animalsa | 0 | 283 | 0 | 283 |
| All MA animalsb | 0 | 0 | 112 | 112 | |
| S3 | All MA animalsa | 0 | 0 | 112 | 112 |
| All Canchim animalsb | 0 | 283 | 0 | 283 | |
| S4 | All Canchim + MA animals born prior to 2005a | 0 | 283 | 68 | 351 |
| MA animals were born in 2005b | 0 | 0 | 44 | 44 | |
| S5 | All MA + Canchim animals born prior to 2005a | 0 | 184 | 112 | 296 |
| Canchim animals were born in 2005b | 0 | 99 | 0 | 99 | |
| S6 | All malesa | 1 | 128 | 63 | 192 |
| All femalesb | 0 | 155 | 49 | 204 | |
| S7 | All Males + Females born prior to 2005a | 1 | 228 | 86 | 315 |
| Females born in 2005b | 0 | 55 | 26 | 81 | |
aReference population; bTarget Population
Genomic relationship statistics between reference population and target population
| Scenariosa | Genomic Relationship | ||
|---|---|---|---|
| Minimum | Mean | Maximum | |
| S1 | 0.023 | 0.198 | 0.390 |
| S2 | 0.010 | 0.050 | 0.220 |
| S3 | 0.003 | 0.040 | 0.225 |
| S4 | 0.028 | 0.193 | 0.330 |
| S5 | 0.050 | 0.198 | 0.390 |
| S6 | 0.090 | 0.210 | 0.409 |
| S7 | 0.108 | 0.228 | 0.390 |
aAs described in the section “Genotype imputation” of “Methods”
Imputation accuracy from low-density panel to high-density panel using FImpute and BEAGLE software
| Scenariosa | LD panel | FImpute | BEAGLE | ||
|---|---|---|---|---|---|
| CR%b | R2c | CR%b | R2c | ||
| S1 | 3K | 75.70 | 0.59 | 66.27 | 0.44 |
| 6K | 87.72 | 0.79 | 80.79 | 0.68 | |
| GGP9K | 88.64 | 0.81 | 82.19 | 0.70 | |
| GGP20Ki | 92.43 | 0.87 | 87.50 | 0.71 | |
| 50K | 95.20 | 0.92 | 92.14 | 0.87 | |
| GGP75Ki | 96.68 | 0.94 | 95.03 | 0.92 | |
| GGP80K | 96.96 | 0.95 | 95.26 | 0.92 | |
| S2 | 3K | 62.86 | 0.37 | 59.73 | 0.33 |
| 6K | 76.17 | 0.58 | 72.23 | 0.58 | |
| GGP9K | 77.54 | 0.61 | 73.78 | 0.55 | |
| GGP20Ki | 83.61 | 0.71 | 79.75 | 0.65 | |
| 50K | 89.55 | 0.82 | 86.66 | 0.77 | |
| GGP75Ki | 92.48 | 0.87 | 90.85 | 0.84 | |
| GGP80K | 93.24 | 0.88 | 91.51 | 0.85 | |
| S3 | 3K | 60.21 | 0.33 | 54.83 | 0.25 |
| 6K | 71.46 | 0.51 | 63.00 | 0.38 | |
| GGP9K | 72.93 | 0.54 | 64.15 | 0.40 | |
| GGP20Ki | 79.19 | 0.65 | 69.91 | 0.49 | |
| 50K | 85.92 | 0.76 | 79.95 | 0.66 | |
| GGP75Ki | 89.54 | 0.82 | 85.79 | 0.76 | |
| GGP80K | 90.60 | 0.84 | 87.35 | 0.79 | |
| S4 | 3K | 72.75 | 0.53 | 64.55 | 0.40 |
| 6K | 85.17 | 0.74 | 79.32 | 0.65 | |
| GGP9K | 86.12 | 0.76 | 80.85 | 0.67 | |
| GGP20Ki | 90.60 | 0.84 | 86.55 | 0.77 | |
| 50K | 94.12 | 0.90 | 91.24 | 0.85 | |
| GGP75Ki | 95.94 | 0.93 | 94.36 | 0.90 | |
| GGP80K | 96.28 | 0.93 | 94.53 | 0.91 | |
| S5 | 3K | 77.74 | 0.62 | 68.57 | 0.47 |
| 6K | 89.84 | 0.83 | 83.86 | 0.73 | |
| GGP9K | 90.67 | 0.84 | 85.23 | 0.75 | |
| GGP20Ki | 94.15 | 0.94 | 90.23 | 0.84 | |
| 50K | 96.36 | 0.90 | 93.90 | 0.90 | |
| GGP75Ki | 97.55 | 0.96 | 96.10 | 0.94 | |
| GGP80K | 97.74 | 0.96 | 96.30 | 0.94 | |
| S6 | 3K | 76.52 | 0.60 | 65.80 | 0.43 |
| 6K | 88.71 | 0.81 | 80.35 | 0.67 | |
| GGP9K | 89.56 | 0.82 | 81.71 | 0.70 | |
| GGP20Ki | 93.13 | 0.88 | 87.33 | 0.80 | |
| 50K | 95.60 | 0.93 | 92.23 | 0.87 | |
| GGP75Ki | 96.98 | 0.95 | 95.25 | 0.92 | |
| GGP80K | 97.19 | 0.95 | 95.40 | 0.92 | |
| S7 | 3K | 78.69 | 0.64 | 69.06 | 0.48 |
| 6K | 89.98 | 0.83 | 84.16 | 0.73 | |
| GGP9K | 90.76 | 0.85 | 85.42 | 0.76 | |
| GGP20Ki | 94.06 | 0.90 | 90.20 | 0.84 | |
| 50K | 96.27 | 0.94 | 93.82 | 0.90 | |
| GGP75Ki | 97.47 | 0.96 | 96.04 | 0.93 | |
| GGP80K | 97.66 | 0.96 | 96.20 | 0.94 | |
aAs described in the section “Genotype imputation” of “Methods,bCR = Concordance Rate, cR2: Allelic R square
Fig. 1Genotype concordance rate using FImpute (a) and BEAGLE (b) software for all scenarios tested. S1: animals born prior to 2005 in reference population and in target population animals born in 2005; S2: Canchim animals in reference population and MA animals in target population; S3: MA animals in reference population and Canchim animals in target population; S4: all Canchim + MA animals born prior to 2005 in reference population and MA animals were born in 2005 in target population; S5: All MA + Canchim animals born prior to 2005 in reference population and Canchim animals were born in 2005 in target population; S6: all males in reference population and all females in target population; S7: All Males + Females born prior to 2005 in reference population and Females born in 2005 in target population
Fig. 2Average relationship between reference and target population. Figure 2 shows average relationship between reference and target population considering scenario S1 (animals grouped considering birth year) for genotype imputation from panels 3K (a), 6K (b), GGP9K (c), GGP20Ki (d), 50K (e), GGP 75Ki (f), and GGP80K (g) to High Density (HD) panel. Regression equation was significant (p < 0.01) for all panels
Fig. 3Genotype concordance rate by chromosome using FImpute and BEAGLE software. Considering individuals grouped by birth year (S1) from 50K SNP to HD (a), GGP 75Ki SNP to HD (b), and GGP80K SNP to (c)
Fig. 4Concordance Rate (a) and Allelic R-square (b) using FImpute and BEAGLE software. Considering the scenario S1 (individuals grouped by birth year)