Literature DB >> 33503255

Nebula: ultra-efficient mapping-free structural variant genotyper.

Parsoa Khorsand1, Fereydoun Hormozdiari1,2,3.   

Abstract

Large scale catalogs of common genetic variants (including indels and structural variants) are being created using data from second and third generation whole-genome sequencing technologies. However, the genotyping of these variants in newly sequenced samples is a nontrivial task that requires extensive computational resources. Furthermore, current approaches are mostly limited to only specific types of variants and are generally prone to various errors and ambiguities when genotyping complex events. We are proposing an ultra-efficient approach for genotyping any type of structural variation that is not limited by the shortcomings and complexities of current mapping-based approaches. Our method Nebula utilizes the changes in the count of k-mers to predict the genotype of structural variants. We have shown that not only Nebula is an order of magnitude faster than mapping based approaches for genotyping structural variants, but also has comparable accuracy to state-of-the-art approaches. Furthermore, Nebula is a generic framework not limited to any specific type of event. Nebula is publicly available at https://github.com/Parsoa/Nebula.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2021        PMID: 33503255      PMCID: PMC8096284          DOI: 10.1093/nar/gkab025

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  39 in total

1.  Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.

Authors:  Arda Soylev; Thong Minh Le; Hajar Amini; Can Alkan; Fereydoun Hormozdiari
Journal:  Bioinformatics       Date:  2019-10-15       Impact factor: 6.937

2.  Near-optimal probabilistic RNA-seq quantification.

Authors:  Nicolas L Bray; Harold Pimentel; Páll Melsted; Lior Pachter
Journal:  Nat Biotechnol       Date:  2016-04-04       Impact factor: 54.908

Review 3.  Copy number variation in human health, disease, and evolution.

Authors:  Feng Zhang; Wenli Gu; Matthew E Hurles; James R Lupski
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

4.  An integrated map of structural variation in 2,504 human genomes.

Authors:  Peter H Sudmant; Tobias Rausch; Eugene J Gardner; Robert E Handsaker; Alexej Abyzov; John Huddleston; Yan Zhang; Kai Ye; Goo Jun; Markus Hsi-Yang Fritz; Miriam K Konkel; Ankit Malhotra; Adrian M Stütz; Xinghua Shi; Francesco Paolo Casale; Jieming Chen; Fereydoun Hormozdiari; Gargi Dayama; Ken Chen; Maika Malig; Mark J P Chaisson; Klaudia Walter; Sascha Meiers; Seva Kashin; Erik Garrison; Adam Auton; Hugo Y K Lam; Xinmeng Jasmine Mu; Can Alkan; Danny Antaki; Taejeong Bae; Eliza Cerveira; Peter Chines; Zechen Chong; Laura Clarke; Elif Dal; Li Ding; Sarah Emery; Xian Fan; Madhusudan Gujral; Fatma Kahveci; Jeffrey M Kidd; Yu Kong; Eric-Wubbo Lameijer; Shane McCarthy; Paul Flicek; Richard A Gibbs; Gabor Marth; Christopher E Mason; Androniki Menelaou; Donna M Muzny; Bradley J Nelson; Amina Noor; Nicholas F Parrish; Matthew Pendleton; Andrew Quitadamo; Benjamin Raeder; Eric E Schadt; Mallory Romanovitch; Andreas Schlattl; Robert Sebra; Andrey A Shabalin; Andreas Untergasser; Jerilyn A Walker; Min Wang; Fuli Yu; Chengsheng Zhang; Jing Zhang; Xiangqun Zheng-Bradley; Wanding Zhou; Thomas Zichner; Jonathan Sebat; Mark A Batzer; Steven A McCarroll; Ryan E Mills; Mark B Gerstein; Ali Bashir; Oliver Stegle; Scott E Devine; Charles Lee; Evan E Eichler; Jan O Korbel
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

5.  Discovery and genotyping of genome structural polymorphism by sequencing on a population scale.

Authors:  Robert E Handsaker; Joshua M Korn; James Nemesh; Steven A McCarroll
Journal:  Nat Genet       Date:  2011-02-13       Impact factor: 38.330

6.  Precise detection of de novo single nucleotide variants in human genomes.

Authors:  Laura Gómez-Romero; Kim Palacios-Flores; José Reyes; Delfino García; Margareta Boege; Guillermo Dávila; Margarita Flores; Michael C Schatz; Rafael Palacios
Journal:  Proc Natl Acad Sci U S A       Date:  2018-05-07       Impact factor: 11.205

7.  Paragraph: a graph-based structural variant genotyper for short-read sequence data.

Authors:  Sai Chen; Peter Krusche; Egor Dolzhenko; Rachel M Sherman; Roman Petrovski; Felix Schlesinger; Melanie Kirsche; David R Bentley; Michael C Schatz; Fritz J Sedlazeck; Michael A Eberle
Journal:  Genome Biol       Date:  2019-12-19       Impact factor: 13.583

8.  LUMPY: a probabilistic framework for structural variant discovery.

Authors:  Ryan M Layer; Colby Chiang; Aaron R Quinlan; Ira M Hall
Journal:  Genome Biol       Date:  2014-06-26       Impact factor: 13.583

9.  Association mapping from sequencing reads using k-mers.

Authors:  Atif Rahman; Ingileif Hallgrímsdóttir; Michael Eisen; Lior Pachter
Journal:  Elife       Date:  2018-06-13       Impact factor: 8.140

10.  A benchmark of transposon insertion detection tools using real data.

Authors:  Pol Vendrell-Mir; Fabio Barteri; Miriam Merenciano; Josefa González; Josep M Casacuberta; Raúl Castanera
Journal:  Mob DNA       Date:  2019-12-30
View more
  5 in total

1.  Space-efficient representation of genomic k-mer count tables.

Authors:  Yoshihiro Shibuya; Djamal Belazzougui; Gregory Kucherov
Journal:  Algorithms Mol Biol       Date:  2022-03-21       Impact factor: 1.405

Review 2.  Population-scale genotyping of structural variation in the era of long-read sequencing.

Authors:  Cheng Quan; Hao Lu; Yiming Lu; Gangqiao Zhou
Journal:  Comput Struct Biotechnol J       Date:  2022-05-27       Impact factor: 6.155

3.  Dysgu: efficient structural variant calling using short or long reads.

Authors:  Kez Cleal; Duncan M Baird
Journal:  Nucleic Acids Res       Date:  2022-05-20       Impact factor: 19.160

4.  Effective sequence similarity detection with strobemers.

Authors:  Kristoffer Sahlin
Journal:  Genome Res       Date:  2021-10-19       Impact factor: 9.043

5.  Comprehensive evaluation of structural variant genotyping methods based on long-read sequencing data.

Authors:  Xiaoke Duan; Mingpei Pan; Shaohua Fan
Journal:  BMC Genomics       Date:  2022-04-23       Impact factor: 4.547

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.