Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A fast Data-Driven method for genotype imputation, phasing, and local ancestry inference: MendelImpute.jl.

Literature DB >> 34289008

A fast Data-Driven method for genotype imputation, phasing, and local ancestry inference: MendelImpute.jl.

Benjamin B Chu¹, Eric M Sobel^1,2, Rory Wasiolek¹, Seyoon Ko³, Janet S Sinsheimer^1,2,3, Hua Zhou³, Kenneth Lange^1,2.

Abstract

MOTIVATION: Current methods for genotype imputation and phasing exploit the volume of data in haplotype reference panels and rely on hidden Markov models. Existing programs all have essentially the same imputation accuracy, are computationally intensive, and generally require pre-phasing the typed markers.
RESULTS: We introduce a novel data-mining method for genotype imputation and phasing that substitutes highly efficient linear algebra routines for hidden Markov model calculations. This strategy, embodied in our Julia program MendelImpute.jl, avoids explicit assumptions about recombination and population structure while delivering similar prediction accuracy, better memory usage, and an order of magnitude or better run-times compared to the fastest competing method. MendelImpute operates on both dosage data and unphased genotype data and simultaneously imputes missing genotypes and phase at both the typed and untyped SNPs. Finally, MendelImpute naturally extends to global and local ancestry estimation and lends itself to new strategies for data compression and hence faster data transport and sharing. AVAILABILITY: Software, documentation, and scripts to reproduce our results are available from https://github.com/OpenMendel/MendelImpute.jl. SUPPLEMENTARY INFORMATION: Supplementary data are available from Bioinformatics online.

Entities: Chemical

Year: 2021 PMID： 34289008 PMCID： PMC8665755 DOI： 10.1093/bioinformatics/btab489

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.931

Keyword Cloud
References

26 in total

1. Tabix: fast retrieval of sequence features from generic TAB-delimited files.

Authors: Heng Li
Journal: Bioinformatics Date: 2011-01-05 Impact factor: 6.937

2. Elevated male European and female African contributions to the genomes of African American individuals.

Authors: Joanne M Lind; Holli B Hutcheson-Dilks; Scott M Williams; Jason H Moore; Myron Essex; Eduardo Ruiz-Pesini; Douglas C Wallace; Sarah A Tishkoff; Stephen J O'Brien; Michael W Smith
Journal: Hum Genet Date: 2006-09-28 Impact factor: 4.132

Review 3. Genotype Imputation from Large Reference Panels.

Authors: Sayantan Das; Gonçalo R Abecasis; Brian L Browning
Journal: Annu Rev Genomics Hum Genet Date: 2018-05-23 Impact factor: 8.929

4. MaCH-admix: genotype imputation for admixed populations.

Authors: Eric Yi Liu; Mingyao Li; Wei Wang; Yun Li
Journal: Genet Epidemiol Date: 2012-10-16 Impact factor: 2.135

5. A global reference for human genetic variation.

Authors: Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal: Nature Date: 2015-10-01 Impact factor: 49.962

6. Genetic ancestry, admixture and health determinants in Latin America.

Authors: Emily T Norris; Lu Wang; Andrew B Conley; Lavanya Rishishwar; Leonardo Mariño-Ramírez; Augusto Valderrama-Aguirre; I King Jordan
Journal: BMC Genomics Date: 2018-12-11 Impact factor: 3.969

7. Admixture-enabled selection for rapid adaptive evolution in the Americas.

Authors: Emily T Norris; Lavanya Rishishwar; Aroon T Chande; Andrew B Conley; Kaixiong Ye; Augusto Valderrama-Aguirre; I King Jordan
Journal: Genome Biol Date: 2020-02-07 Impact factor: 13.583

8. A reference panel of 64,976 haplotypes for genotype imputation.

Authors: Shane McCarthy; Sayantan Das; Warren Kretzschmar; Olivier Delaneau; Andrew R Wood; Alexander Teumer; Hyun Min Kang; Christian Fuchsberger; Petr Danecek; Kevin Sharp; Yang Luo; Carlo Sidore; Alan Kwong; Nicholas Timpson; Seppo Koskinen; Scott Vrieze; Laura J Scott; He Zhang; Anubha Mahajan; Jan Veldink; Ulrike Peters; Carlos Pato; Cornelia M van Duijn; Christopher E Gillies; Ilaria Gandin; Massimo Mezzavilla; Arthur Gilly; Massimiliano Cocca; Michela Traglia; Andrea Angius; Jeffrey C Barrett; Dorrett Boomsma; Kari Branham; Gerome Breen; Chad M Brummett; Fabio Busonero; Harry Campbell; Andrew Chan; Sai Chen; Emily Chew; Francis S Collins; Laura J Corbin; George Davey Smith; George Dedoussis; Marcus Dorr; Aliki-Eleni Farmaki; Luigi Ferrucci; Lukas Forer; Ross M Fraser; Stacey Gabriel; Shawn Levy; Leif Groop; Tabitha Harrison; Andrew Hattersley; Oddgeir L Holmen; Kristian Hveem; Matthias Kretzler; James C Lee; Matt McGue; Thomas Meitinger; David Melzer; Josine L Min; Karen L Mohlke; John B Vincent; Matthias Nauck; Deborah Nickerson; Aarno Palotie; Michele Pato; Nicola Pirastu; Melvin McInnis; J Brent Richards; Cinzia Sala; Veikko Salomaa; David Schlessinger; Sebastian Schoenherr; P Eline Slagboom; Kerrin Small; Timothy Spector; Dwight Stambolian; Marcus Tuke; Jaakko Tuomilehto; Leonard H Van den Berg; Wouter Van Rheenen; Uwe Volker; Cisca Wijmenga; Daniela Toniolo; Eleftheria Zeggini; Paolo Gasparini; Matthew G Sampson; James F Wilson; Timothy Frayling; Paul I W de Bakker; Morris A Swertz; Steven McCarroll; Charles Kooperberg; Annelot Dekker; David Altshuler; Cristen Willer; William Iacono; Samuli Ripatti; Nicole Soranzo; Klaudia Walter; Anand Swaroop; Francesco Cucca; Carl A Anderson; Richard M Myers; Michael Boehnke; Mark I McCarthy; Richard Durbin
Journal: Nat Genet Date: 2016-08-22 Impact factor: 38.330

9. An integrated map of genetic variation from 1,092 human genomes.

Authors: Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal: Nature Date: 2012-11-01 Impact factor: 49.962

10. Enhanced methods for local ancestry assignment in sequenced admixed individuals.

Authors: Robert Brown; Bogdan Pasaniuc
Journal: PLoS Comput Biol Date: 2014-04-17 Impact factor: 4.475