| Literature DB >> 26232609 |
Wenzhi Li1, Wei Xu2, Guoxing Fu3, Li Ma4, Jendai Richards2, Weinian Rao3, Tameka Bythwood2, Shiwen Guo5, Qing Song6.
Abstract
Enormously growing genomic datasets present a new challenge on missing data imputation, a notoriously resource-demanding task. Haplotype imputation requires ethnicity-matched references. However, to date, haplotype references are not available for the majority of populations in the world. We explored to use existing unphased genotype datasets as references; if it succeeds, it will cover almost all of the populations in the world. The results showed that our HiFi software successfully yields 99.43% accuracy with unphased genotype references. Our method provides a cost-effective solution to breakthrough the bottleneck of limited reference availability for haplotype imputation in the big data era.Entities:
Keywords: Big data; Imputation; References
Mesh:
Year: 2015 PMID: 26232609 PMCID: PMC5373555 DOI: 10.1016/j.gene.2015.07.082
Source DB: PubMed Journal: Gene ISSN: 0378-1119 Impact factor: 3.688