| Literature DB >> 19706367 |
David Ellinghaus1, Stefan Schreiber, Andre Franke, Michael Nothnagel.
Abstract
Genotype imputation for single nucleotide polymorphisms (SNPs) has been shown to be a powerful means to include genetic markers in exploratory genetic association studies without having to genotype them, and is becoming a standard procedure. A number of different software programs are available. In our experience, user-friendliness is often the deciding factor in the choice of software to solve a particular task. We therefore evaluated the usability of three publicly available imputation programs: BEAGLE, IMPUTE and MACH. We found all three programs to perform well with HapMap reference data, with little effort needed for data preparation and subsequent association analysis. Each of them has different strengths and weaknesses, however, and none is optimal for all situations.Entities:
Mesh:
Year: 2009 PMID: 19706367 PMCID: PMC3525198 DOI: 10.1186/1479-7364-3-4-371
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
List of programs that can be used for genotype imputation, together with citations and the websites from which they can be obtained
| Software | Version | URL | Ref. |
|---|---|---|---|
| BEAGLE | 3.0.2 | [ | |
| IMPUTE | 0.5.0 | [ | |
| MACH | 1.10.16 | [ | |
| PLINK | 1.05 | [ | |
| BIMBAM | 0.99 | [ | |
| FAMHAP | 18 | [ |
Only the first three programs were considered here.
Summary of a number of software features for three imputation programs.
| Group | Feature | BEAGLE | IMPUTE | MACH |
|---|---|---|---|---|
| Operating system | Java (platform independent) | Linux, Windows, MacOS X, Solaris | Linux, MacOS X | |
| Licence | Free | Free for academic use | Not clear | |
| Source code | Not available | Not available | Availability announced | |
| Documentation | Commendable | Clearly structured | Incomplete | |
| Authors' response | Quick and detailed | Quick | Quick | |
| GUI | No | No | No | |
| Genotype format | Discrete; custom format | Probabilities; custom format | QTDT (Linkage) | |
| Reference format | Custom format | Custom format; prepared HapMap reference available | HapMap format (custom format) | |
| Conversion utilities | Yes | Yes | No | |
| Target of imputation | Chromosomes | Chromosomes or segments | Chromosomes | |
| Memory-saving mode | Yes | No | Yes | |
| Known checking errors | None | Missing probability and input check | Problematic handling of missing reference | |
| Runtime [chr. 6] | 350 minutes | 433 minutes | 2781 minutes | |
| Maximum memory allocation | 2GB | 14 GB [<1 GB with ~ 10 MB segments] | 7GB | |
| Memory-saving mode | Yes | No | Yes | |
| Strand orientation | Check | Check + autoflip | Check + autoflip | |
| Genotypes | Posterior probability | Posterior probability | Posterior probability; allele dosage | |
| Quality measure for imputation | Allelic | Information measure, average of the maximum posterior probabilities | ||
| Output file size [chr. 6] | 1476 MB | 533 MB | 938 MB | |
| X chromosome imputation | No | Yes | No | |
| Trio data | Yes | No | No | |
| Multi-allelic markers | Yes | No | No | |
| Accuracy estimation | No | Yes | No |
For more details, see the main text
chr., chromosome.