Literature DB >> 30032192

Toward fast and accurate SNP genotyping from whole genome sequencing data for bedside diagnostics.

Chen Sun1, Paul Medvedev1,2,3.   

Abstract

Motivation: Genotyping a set of variants from a database is an important step for identifying known genetic traits and disease-related variants within an individual. The growing size of variant databases as well as the high depth of sequencing data poses an efficiency challenge. In clinical applications, where time is crucial, alignment-based methods are often not fast enough. To fill the gap, Shajii et al. propose LAVA, an alignment-free genotyping method which is able to more quickly genotype single nucleotide polymorphisms (SNPs); however, there remains large room for improvements in running time and accuracy.
Results: We present the VarGeno method for SNP genotyping from Illumina whole genome sequencing data. VarGeno builds upon LAVA by improving the speed of k-mer querying as well as the accuracy of the genotyping strategy. We evaluate VarGeno on several read datasets using different genotyping SNP lists. VarGeno performs 7-13 times faster than LAVA with similar memory usage, while improving accuracy. Availability and implementation: VarGeno is freely available at: https://github.com/medvedevgroup/vargeno. Supplementary information: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2019        PMID: 30032192     DOI: 10.1093/bioinformatics/bty641

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

1.  Nebula: ultra-efficient mapping-free structural variant genotyper.

Authors:  Parsoa Khorsand; Fereydoun Hormozdiari
Journal:  Nucleic Acids Res       Date:  2021-05-07       Impact factor: 16.971

2.  Pangenome-based genome inference allows efficient and accurate genotyping across a wide spectrum of variant classes.

Authors:  Jana Ebler; Peter Ebert; Wayne E Clarke; Tobias Rausch; Peter A Audano; Torsten Houwaart; Yafei Mao; Jan O Korbel; Evan E Eichler; Michael C Zody; Alexander T Dilthey; Tobias Marschall
Journal:  Nat Genet       Date:  2022-04-11       Impact factor: 38.330

3.  SPRISS: Approximating Frequent K-mers by Sampling Reads, and Applications.

Authors:  Diego Santoro; Leonardo Pellegrina; Matteo Comin; Fabio Vandin
Journal:  Bioinformatics       Date:  2022-05-18       Impact factor: 6.931

4.  MALVA: Genotyping by Mapping-free ALlele Detection of Known VAriants.

Authors:  Luca Denti; Marco Previtali; Giulia Bernardini; Alexander Schönhuth; Paola Bonizzoni
Journal:  iScience       Date:  2019-07-12

5.  Simplitigs as an efficient and scalable representation of de Bruijn graphs.

Authors:  Michael Baym; Gregory Kucherov; Karel Břinda
Journal:  Genome Biol       Date:  2021-04-06       Impact factor: 13.583

6.  Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper.

Authors:  Phillip Andrew Richmond; Alice Mary Kaye; Godfrain Jacques Kounkou; Tamar Vered Av-Shalom; Wyeth W Wasserman
Journal:  PLoS Comput Biol       Date:  2021-03-22       Impact factor: 4.475

7.  KAGE: fast alignment-free graph-based genotyping of SNPs and short indels.

Authors:  Ivar Grytten; Knut Dagestad Rand; Geir Kjetil Sandve
Journal:  Genome Biol       Date:  2022-10-04       Impact factor: 17.906

8.  Disk compression of k-mer sets.

Authors:  Amatur Rahman; Rayan Chikhi; Paul Medvedev
Journal:  Algorithms Mol Biol       Date:  2021-06-21       Impact factor: 1.405

9.  Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants.

Authors:  Daniel S Standage; C Titus Brown; Fereydoun Hormozdiari
Journal:  iScience       Date:  2019-07-23

10.  Genetic Variants Associated With Intraparenchymal Hemorrhage Progression After Traumatic Brain Injury.

Authors:  Ruchira M Jha; Benjamin E Zusman; Ava M Puccio; David O Okonkwo; Matthew Pease; Shashvat M Desai; Matthew Leach; Yvette P Conley; Patrick M Kochanek
Journal:  JAMA Netw Open       Date:  2021-07-01
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.