| Literature DB >> 27980652 |
Mohamad Saad1, Alejandro Q Nato2, Fiona L Grimson3, Steven M Lewis3, Lisa A Brown4, Elizabeth M Blue2, Timothy A Thornton4, Elizabeth A Thompson3, Ellen M Wijsman5.
Abstract
BACKGROUND: In the past few years, imputation approaches have been mainly used in population-based designs of genome-wide association studies, although both family- and population-based imputation methods have been proposed. With the recent surge of family-based designs, family-based imputation has become more important. Imputation methods for both designs are based on identity-by-descent (IBD) information. Apart from imputation, the use of IBD information is also common for several types of genetic analysis, including pedigree-based linkage analysis.Entities:
Year: 2016 PMID: 27980652 PMCID: PMC5133511 DOI: 10.1186/s12919-016-0046-5
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Marker sets
| Marker sets | Number of SNPs | Relevant information |
|---|---|---|
| MS-1 | ~15,000 | chr 3: 46,750 Kbp–49,250 Kbp |
| MS-2 | 351 | Mean spacing ~0.64 cM; in linkage equilibrium |
| MS-3 | 48,892 | MAF > 0.05; genotype completion > 99 % |
chr chromosome, MAF minor allele frequency
Random selection of 200 reference/dense SNP panel individuals
| (0–0.01]: #SNPs = 4604 SNPs | (0.01–0.15]: #SNPs = 1765 SNPs | (0.15–0.5]: #SNPs = 979 SNPs | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Imputation approaches | #SNPe | ρ1 | ρ2 | #SNPe | ρ1 | ρ2 | #SNPe | ρ1 | ρ2 |
| BEAGLE-BEAGLE | 4325 | 0.209 | 0.196 | 1673 | 0.587 | 0.556 | 976 | 0.897 | 0.895 |
| SHAPEITped-BEAGLE | 4554 | 0.270 | 0.267 | 1763 | 0.730 | 0.729 | 979 | 0.978 | 0.978 |
| MaCH-MaCH | 4388 | 0.332 | 0.316 | 1755 | 0.595 | 0.591 | 979 | 0.801 | 0.801 |
| MaCH-minimac | 4523 | 0.460 | 0.452 | 1763 | 0.706 | 0.706 | 979 | 0.910 | 0.910 |
| SHAPEITped-minimac | 4507 | 0.642 | 0.629 | 1765 | 0.894 | 0.894 | 979 | 0.985 | 0.985 |
| MaCH-MaCHAdmix | 4340 | 0.420 | 0.396 | 1754 | 0.687 | 0.683 | 979 | 0.896 | 0.896 |
| SHAPEITped-MaCHAdmix | 4352 | 0.527 | 0.498 | 1760 | 0.773 | 0.770 | 979 | 0.904 | 0.904 |
| GIGI | 3763 | 0.610 | 0.498 | 1719 | 0.557 | 0.542 | 979 | 0.581 | 0.581 |
| IMPUTE2-IMPUTE2 | 4230 | 0.370 | 0.340 | 1735 | 0.660 | 0.649 | 978 | 0.898 | 0.897 |
| SHAPEITped-IMPUTE2 | 4401 | 0.485 | 0.464 | 1741 | 0.713 | 0.703 | 979 | 0.923 | 0.923 |
| GIGI + SHAPEITped-BEAGLE | 4507 | 0.643 | 0.630 | 1764 | 0.790 | 0.789 | 979 | 0.972 | 0.972 |
| GIGI + SHAPEITped-minimac | 4507 | 0.693 | 0.678 | 1764 | 0.860 | 0.860 | 979 | 0.980 | 0.980 |
| GIGI + SHAPEITped-IMPUTE2 | 4491 | 0.635 | 0.619 | 1746 | 0.740 | 0.732 | 979 | 0.876 | 0.876 |
#SNPs is the total number of SNPs in the reference panel; #SNPe is the number of imputed SNPs as polymorphic; ρ1 is the mean correlation of all SNPs: sum(correlation)/#SNPe; ρ2 = sum(correlation)/#SNPs
Fig. 1Mean correlations between imputed and reference panels from different imputation approaches. Individuals were selected randomly or via GIGI-Pick. The different imputation approaches for the 3MAF bins (ie, (0–0.01], (0.01–0.15], and (0.15–0.5] are on the x-axis. Mean correlations (ρ ) are on the y-axis
Fig. 2LOD score curves for merged IBD graphs for all 200 simulated traits (cyan lines) with their average (solid black line), and the average LOD score for the unmerged graphs (dashed black line). The location of the MAP4 gene is indicated by the vertical line