| Literature DB >> 26949946 |
Danai Jattawa1, Mauricio A Elzo1, Skorn Koonawootrittriron1, Thanathip Suwanasopee1.
Abstract
The objective of this study was to investigate the accuracy of imputation from low density (LDC) to moderate density SNP chips (MDC) in a Thai Holstein-Other multibreed dairy cattle population. Dairy cattle with complete pedigree information (n = 1,244) from 145 dairy farms were genotyped with GeneSeek GGP20K (n = 570), GGP26K (n = 540) and GGP80K (n = 134) chips. After checking for single nucleotide polymorphism (SNP) quality, 17,779 SNP markers in common between the GGP20K, GGP26K, and GGP80K were used to represent MDC. Animals were divided into two groups, a reference group (n = 912) and a test group (n = 332). The SNP markers chosen for the test group were those located in positions corresponding to GeneSeek GGP9K (n = 7,652). The LDC to MDC genotype imputation was carried out using three different software packages, namely Beagle 3.3 (population-based algorithm), FImpute 2.2 (combined family- and population-based algorithms) and Findhap 4 (combined family- and population-based algorithms). Imputation accuracies within and across chromosomes were calculated as ratios of correctly imputed SNP markers to overall imputed SNP markers. Imputation accuracy for the three software packages ranged from 76.79% to 93.94%. FImpute had higher imputation accuracy (93.94%) than Findhap (84.64%) and Beagle (76.79%). Imputation accuracies were similar and consistent across chromosomes for FImpute, but not for Findhap and Beagle. Most chromosomes that showed either high (73%) or low (80%) imputation accuracies were the same chromosomes that had above and below average linkage disequilibrium (LD; defined here as the correlation between pairs of adjacent SNP within chromosomes less than or equal to 1 Mb apart). Results indicated that FImpute was more suitable than Findhap and Beagle for genotype imputation in this Thai multibreed population. Perhaps additional increments in imputation accuracy could be achieved by increasing the completeness of pedigree information.Entities:
Keywords: Imputation Accuracy; Linkage Disequilibrium; Multibreed Dairy Cattle
Year: 2016 PMID: 26949946 PMCID: PMC4782080 DOI: 10.5713/ajas.15.0291
Source DB: PubMed Journal: Asian-Australas J Anim Sci ISSN: 1011-2367 Impact factor: 2.509
Imputation accuracy using Beagle, FImpute, and Findhap
| Software | Algorithm | Total number of Imputed SNP | Number of correctly imputed SNP | Accuracy |
|---|---|---|---|---|
| Beagle | Population – based | 3,296,330 | 2,531,228 | 76.79 |
| FImpute | Family+population – based | 3,296,330 | 3,096,580 | 93.94 |
| Findhap | Family+population – based | 3,296,330 | 2,790,041 | 84.64 |
SNP, single nucleotide polymorphism.
Figure 1Imputation accuracy within the 29 autosomal chromosomes using Beagle, FImpute, and Findhap.
Figure 2Average linkage disequilibrium (correlation coefficient; r2) between adjacent single nucleotide polymorphism makers separated by at most 1 Mb within each autosome.
Figure 3Distribution of average SNP linkage disequilibria (LD; correlation coefficient; r2) for SNP within 1 Mb of each other. SNP, single nucleotide polymorphism.
Figure 4Imputation accuracy by average SNP linkage disequilibrium (LD; correlation coefficient; r2) at distances between SNP lower than or equal to 1 Mb computed using Beagle, FImpute and Findhap. SNP, single nucleotide polymorphism.