| Literature DB >> 35927623 |
Kolja A Thormann1, Viola Tozzi2, Paula Starke2, Heike Bickeböller2, Marcus Baum3, Albert Rosenberger4.
Abstract
BACKGROUND: ImputAccur is a software tool to measure genotype-imputation accuracy. Imputation of untyped markers is a standard approach in genome-wide association studies to close the gap between directly genotyped and other known DNA variants. However, high accuracy for imputed genotypes is fundamental. Several accuracy measures have been proposed, but unfortunately, they are implemented on different platforms, which is impractical.Entities:
Keywords: Accuracy; GWAS; Imputation; Marker selection; Quality control; SNP
Mesh:
Year: 2022 PMID: 35927623 PMCID: PMC9351229 DOI: 10.1186/s12859-022-04863-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1Example of real-data genotype-imputation accuracy-measures for the telomere region on chromosome 9. Top left: hiQ, top right: info, centre left: Iamchance, centre right: IamHWE, bottom left: , bottom right: ; each dot represents one imputed marker; the marker size is according to minor allele frequency (MAF); threshold values are freely selectable; vertical lines: centre of region classified as: “cold”, “tepid”, “hot”, or “very hot” (the definition is given in the Additional file 1).
| 1 rs00001 913 1 0 0 1 0 0 1 0 0 1 0 0 1 0 0 |
| 2 rs00002 402 0.01 0.99 0 0.01 0.99 0 0.01 0.99 0 0.01 0.99 0 0.01 0.99 0 |
| 3 rs00003 644 0.333 0.334 0.333 0.333 0.334 0.333 0.333 0.334 0.333 0.333 0.334 0.333 0.333 0.334 0.333 |
| 4 rs00004 222 0.25 0.5 0.25 0.25 0.5 0.25 0.25 0.5 0.25 0.25 0.5 0.25 0.25 0.5 0.25 |
| 5 rs00005 221 0.47 0.18 0.35 0.89 0.02 0.09 0.03 0.96 0.01 0.94 0 0.06 0.62 0.34 0.04 |
| 6 rs00006 955 0.975 0.002 0.023 0.52 0.154 0.326 0.309 0.21 0.481 0.48 0.509 0.011 0.969 0.004 0.027 |
| 7 rs00007 518 0.63 0.14 0.23 0.86 0.09 0.05 0.35 0.24 0.41 0.01 0.23 0.76 0.76 0.05 0.19 |
| -i | test1.imputed | [path to and name of input-file] |
| -l | 3 | [number of leading columns] |
| -c | 0 | [third genotype probability needs to be calculated—TRUE (1) or FALSE (0)] |
| -p | excluded_PROBE.txt | [path to and name of sample exclusion file] |
| -m | excluded_SNP.txt | [path to and name of SNP exclusion file] |
| -n | SNP_no,SSSS,PPPPPP | [names for leading columns] |
| SNP_no | SNP | Position | N | MAF (%) | Iamchance | IamHWE | hiQ | Accuracy | Info | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 5 | rs00005 | 221 | 5 | 26.0 | 0.546 | 0.448 | 0.739 | HOT | 0.099 | 0.319 | 0.113 |
| 2 | rs00002 | 402 | 5 | 49.5 | 0.97 | 0.968 | 0.929 | TEPID | 0.98 | 0.0 | − 9 |
| 7 | rs00007 | 518 | 5 | 40.3 | 0.344 | 0.289 | 0.72 | HOT | − 0.057 | 0.631 | 0.294 |
| 6 | rs00006 | 955 | 5 | 26.1 | 0.446 | 0.326 | 0.963 | HOT | − 0.056 | 0.488 | 0.179 |