| Literature DB >> 28222685 |
Hubert Pausch1, Iona M MacLeod2, Ruedi Fries3, Reiner Emmerling4, Phil J Bowman2,5, Hans D Daetwyler2,5, Michael E Goddard2,6.
Abstract
BACKGROUND: The availability of dense genotypes and whole-genome sequence variants from various sources offers the opportunity to compile large datasets consisting of tens of thousands of individuals with genotypes at millions of polymorphic sites that may enhance the power of genomic analyses. The imputation of missing genotypes ensures that all individuals have genotypes for a shared set of variants.Entities:
Mesh:
Year: 2017 PMID: 28222685 PMCID: PMC5320806 DOI: 10.1186/s12711-017-0301-x
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Number of polymorphic sites in Fleckvieh and Holstein cattle
| Chr | Chr length (Mb) | Fleckvieh | Holstein | ||
|---|---|---|---|---|---|
| SEQ (MAF < 0.1) | 700 K (MAF < 0.1) | SEQ (MAF < 0.1) | 700 K (MAF < 0.1) | ||
| 1 | 158.32 | 1,444,299 (60.55) | 38,009 (18.40) | 1,382,987 (59.11) | 37,397 (17.01) |
| 5 | 121.18 | 1,098,976 (59.64) | 28,173 (18.55) | 1,073,964 (59.12) | 27,667 (15.90) |
| 10 | 104.30 | 959,206 (59.13) | 25,787 (17.09) | 917,799 (59.48) | 25,497 (16.12) |
| 15 | 85.27 | 866,151 (58.54) | 20,617 (17.24) | 827,526 (58.12) | 20,360 (14.38) |
| 20 | 71.98 | 679,738 (59.94) | 18,466 (18.18) | 649,768 (59.18) | 18,264 (18.13) |
| 25 | 42.85 | 413,371 (59.08) | 11,370 (14.69) | 383,072 (58.14) | 11,179 (14.27) |
Number of sequence (SEQ) and array-derived (Illumina bovineHD, 700 K) variants that were polymorphic in 249 Fleckvieh and 450 Holstein animals. The proportion of variants with a minor allele frequency (MAF) lower than 10% is given in parentheses
Cross-validation imputation accuracy in Fleckvieh and Holstein cattle
| Breed | Within-breed reference populations | Multi-breed reference populations | ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Fleckvieh | 0.898 (0.674) | 0.908 (0.735) | 0.934 (0.771) | 0.921 (0.774) | 0.916 (0.812) | 0.939 (0.838) |
| Holstein | 0.912 (0.681) | 0.929 (0.772) | 0.951 (0.798) | 0.926 (0.732) | 0.928 (0.797) | 0.948 (0.820) |
The values represent the correlation between true and imputed genotypes. The correlation between true and imputed genotypes for sequence variants with a minor allele frequency lower than 10% is given in parentheses. BG best-guess, DOS allele dosage
Fig. 1Accuracy of imputation in Fleckvieh and Holstein cattle. a, b Correlation between true and imputed genotypes as a function of the frequency of the minor allele in Fleckvieh (a) and Holstein (b) animals. c, d The boxplots represent the correlation between true and imputed genotypes for sequence variants on six chromosomes in Fleckvieh (c) and Holstein (d) animals
Accuracy of imputation for two causal variants in Fleckvieh cattle
| Variant | Effect | Minor allele frequency | Proportion of correctly imputed genotypes/rIMP,SEQ | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| ||||||||
| TaqMan | Within-breed | Multi-breed | Within-breed | Multi-breed | Within-breed | Multi-breed | Within-breed | Multi-breed | ||
| 14:1,802,266a/rs109326954 | p.A232K | 0.074 | 0.062 | 0.183 | 0.993/0.976 | 0.998/0.992 | 0.999/0.996 | 0.999/0.996 | –/0.979 | –/0.995 |
| 20:31,909,478/rs385640152 | p.F279Y | 0.074 | 0.072 | 0.121 | 0.861/0.639 | 0.931/NAb | 0.868/NAb | 0.994/0.938 | –/0.703 | –/0.895 |
The minor allele frequencies were calculated using TaqMan-derived and sequence genotypes in either within- or multi-breed reference populations
aThe p.A232K-variant in the DGAT1 gene results from two adjacent SNPs in LD located at 1,802,265 and 1,802,266 bp (rs109234250 and rs109326954). In the current study, we considered only the variant at 1,802,266 bp
bThe imputed variant was not polymorphic in the sample of 902 genotyped animals precluding the calculation of rIMP,SEQ values
Fig. 2Fine-mapping of a fat percentage QTL on bovine chromosome 14. a–e Association between 48,641 imputed sequence variants located at the proximal end of bovine chromosome 14 and milk fat percentage in 6958 Fleckvieh animals. Genotypes for the association studies were imputed using either FImpute (a, d) or Minimac (b, c, e, f) with either within- (a–c) or multi-breed reference populations (d–f). Grey and blue colours represent sequence and array-derived variants. The red symbol represents the p.A232K-mutation in the DGAT1 gene
Fig. 3Fine-mapping of a fat percentage QTL on bovine chromosome 20. a–e Association between 29,205 imputed sequence variants on bovine chromosome 20 and milk fat percentage in 6958 Fleckvieh animals. Genotypes for the association studies were imputed using either FImpute (a, d) or Minimac (b, c, e, f) with either (a–c) or multi breed reference populations (d–f). Grey and blue colours represent sequence and array-derived variants, respectively. The red symbol represents the p.F279Y-mutation in the GHR gene
Fig. 4Fine-mapping of fat percentage QTL in Fleckvieh cattle. a Manhattan plot representing the association of 23,256,743 imputed sequence variants with milk fat percentage in 6958 Fleckvieh animals. Red colours represent significantly associated variants (P < 2.15 × 10−9). The y-axis is truncated at −log10(10−40). b–e Fine-mapping of four QTL for milk fat percentage on bovine chromosomes 5, 6, 11 and 19. Different colours represent the linkage disequilibrium between the most significantly associated variant (violet) and all other variants. Blue arrows indicate the direction of the gene transcription