| Literature DB >> 25927638 |
Mirjam Frischknecht1,2,3,4, Markus Neuditschko5,6, Vidhya Jagannathan7,8, Cord Drögemüller9,10, Jens Tetens11, Georg Thaller12, Tosso Leeb13,14, Stefan Rieder15,16.
Abstract
BACKGROUND: A cost-effective strategy to increase the density of available markers within a population is to sequence a small proportion of the population and impute whole-genome sequence data for the remaining population. Increased densities of typed markers are advantageous for genome-wide association studies (GWAS) and genomic predictions.Entities:
Mesh:
Year: 2014 PMID: 25927638 PMCID: PMC4180851 DOI: 10.1186/s12711-014-0063-7
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Figure 1Concordance between sequence-derived and SNP-chip-derived genotypes. (A) Concordance per horse: the boxplots illustrate the distribution of the values for each chromosome within each of the 30 sequenced FM horses; two horses that had genotype concordances smaller than 0.95 were excluded from subsequent analyses. (B) Concordance per chromosome: the boxplots illustrate the values for each horse within each autosome; note that there are two outliers for each chromosome that are below the threshold of 0.95; these two outliers correspond to the two horses that were subsequently excluded.
Imputation accuracies on two chromosomes with three software programs
|
|
|
|
|---|---|---|
| Impute2 | 0.927 | 0.920 |
| Beagle | 0.750 | 0.743 |
| FImpute | 0.774 | 0.772 |
The values indicate the concordance between true and imputed genotypes for two different chromosomes.
Imputation accuracies on chromosome ECA31 using Beagle with different parameter settings
|
|
|
|
|---|---|---|
| 0.717 | no | no |
| 0.743 | yes | no |
| 0.753 | yes | yes |
Three different parameter settings for Beagle were evaluated; pre-phasing of the reference population was optionally performed in a separate step using Beagle; the accuracy was also determined with and without feeding pedigree data (duos and trios) into the program.
Imputation accuracies on chromosome ECA31 using Impute 2 with different parameter settings
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| 0.954 | 129 985 | 100 | 100 | yes | 0.00042 | 2 Mb2 | all horses | 6 Mb2 |
| 0.950 | 128 845 | 100 | 100 | yes1 | 0.00042 | 2 Mb2 | all horses | 6 Mb2 |
| 0.953 | 130 103 | 100 | 100 | yes1 | 0.00042 | 2 Mb2 | all horses | 6 Mb2 |
| 0.950 | 128 501 | 100 | 100 | no | 0.00042 | 2 Mb2 | all horses | 6 Mb2 |
| 0.954 | 130 422 | 100 | 100 | yes | 0.00042 | 2 Mb2 | all horses | whole ECA31 |
| 0.953 | 130 001 | 100 | 100 | yes | 0.00042 | 0.5 Mb | all horses | 6 Mb2 |
| 0.952 | 129 511 | 100 | 100 | yes | 0.01 | 2 Mb2 | all horses | 6 Mb2 |
| 0.953 | 129 544 | 15 0002 | 100 | yes | 0.00042 | 2 Mb2 | all horses | 6 Mb2 |
| 0.971 | 54 295 | 100 | 20 0002 | yes | 0.00042 | 2 Mb2 | all horses | 6 Mb2 |
| 0.956 | 128 135 | 100 | 100 | yes | 0.00042 | 2 Mb2 | test pop. only | 6 Mb2 |
The concordance of true and imputed genotypes was calculated with different parameter settings for ECA31; the number of SNPs passing the probability threshold of 0.9 on average per animal is indicated in the second column; Rho: recombination rate in SHAPEIT; the parameters in the first row are the same parameters as in Table 1, except for the quality threshold of 0.9 which was set to 0; 1only in reference population; 2default value of the software.
Figure 2Concordance between true and imputed genotypes. The concordance was calculated in a cross-validation experiment comprising 26 FM horses for which 50 k SNP-chip and genome sequence data were available. The SNPs were divided into six concordance classes depending on the proportion of horses, which had correctly imputed genotypes. Roughly half of the SNPs showed perfect concordance between experimental and imputed genotypes and only very few SNPs had incorrect genotypes in more than 20% of the horses. Note the lower concordance of true and imputed SNPs on ECA12.
Figure 3Accuracy and admixture per horse. The grey shaded bars show the accuracy per individual when only 28 FM horses were used as reference, the green bars show the accuracy when 14 Warmblood horses were added to the reference population and the blue dots show the Warmblood admixture of each evaluated FM horse.