Literature DB >> 33349238

MegaPath: sensitive and rapid pathogen detection using metagenomic NGS data.

Chi-Ming Leung1,2, Dinghua Li3, Yan Xin3,4, Wai-Chun Law4, Yifan Zhang3,4, Hing-Fung Ting3, Ruibang Luo3,4, Tak-Wah Lam3,4.   

Abstract

BACKGROUND: Next-generation sequencing (NGS) enables unbiased detection of pathogens by mapping the sequencing reads of a patient sample to the known reference sequence of bacteria and viruses. However, for a new pathogen without a reference sequence of a close relative, or with a high load of mutations compared to its predecessors, read mapping fails due to a low similarity between the pathogen and reference sequence, which in turn leads to insensitive and inaccurate pathogen detection outcomes.
RESULTS: We developed MegaPath, which runs fast and provides high sensitivity in detecting new pathogens. In MegaPath, we have implemented and tested a combination of polishing techniques to remove non-informative human reads and spurious alignments. MegaPath applies a global optimization to the read alignments and reassigns the reads incorrectly aligned to multiple species to a unique species. The reassignment not only significantly increased the number of reads aligned to distant pathogens, but also significantly reduced incorrect alignments. MegaPath implements an enhanced maximum-exact-match prefix seeding strategy and a SIMD-accelerated Smith-Waterman algorithm to run fast.
CONCLUSIONS: In our benchmarks, MegaPath demonstrated superior sensitivity by detecting eight times more reads from a low-similarity pathogen than other tools. Meanwhile, MegaPath ran much faster than the other state-of-the-art alignment-based pathogen detection tools (and compariable with the less sensitivity profile-based pathogen detection tools). The running time of MegaPath is about 20 min on a typical 1 Gb dataset.

Entities:  

Keywords:  Abundance detection; Next generation sequencing; Pathogen detection; Read alignment; Shotgun metagenomic sequencing

Mesh:

Year:  2020        PMID: 33349238      PMCID: PMC7751095          DOI: 10.1186/s12864-020-06875-6

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


  14 in total

1.  A novel outbreak enterovirus D68 strain associated with acute flaccid myelitis cases in the USA (2012-14): a retrospective cohort study.

Authors:  Alexander L Greninger; Samia N Naccache; Kevin Messacar; Anna Clayton; Guixia Yu; Sneha Somasekar; Scot Federman; Doug Stryke; Christopher Anderson; Shigeo Yagi; Sharon Messenger; Debra Wadford; Dongxiang Xia; James P Watt; Keith Van Haren; Samuel R Dominguez; Carol Glaser; Grace Aldrovandi; Charles Y Chiu
Journal:  Lancet Infect Dis       Date:  2015-03-31       Impact factor: 25.071

2.  The standard genetic code enhances adaptive evolution of proteins.

Authors:  Wen Zhu; Stephen Freeland
Journal:  J Theor Biol       Date:  2005-12-01       Impact factor: 2.691

3.  MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph.

Authors:  Dinghua Li; Chi-Man Liu; Ruibang Luo; Kunihiko Sadakane; Tak-Wah Lam
Journal:  Bioinformatics       Date:  2015-01-20       Impact factor: 6.937

4.  AC-DIAMOND v1: accelerating large-scale DNA-protein alignment.

Authors:  Huijun Mai; Yifan Zhang; Dinghua Li; Henry Chi-Ming Leung; Ruibang Luo; Chi-Kwong Wong; Hing-Fung Ting; Tak-Wah Lam
Journal:  Bioinformatics       Date:  2018-11-01       Impact factor: 6.937

5.  Validation of Metagenomic Next-Generation Sequencing Tests for Universal Pathogen Detection.

Authors:  Robert Schlaberg; Charles Y Chiu; Steve Miller; Gary W Procop; George Weinstock
Journal:  Arch Pathol Lab Med       Date:  2017-02-07       Impact factor: 5.534

6.  A novel rhabdovirus associated with acute hemorrhagic fever in central Africa.

Authors:  Gilda Grard; Joseph N Fair; Deanna Lee; Elizabeth Slikas; Imke Steffen; Jean-Jacques Muyembe; Taylor Sittler; Narayanan Veeraraghavan; J Graham Ruby; Chunlin Wang; Maria Makuwa; Prime Mulembakani; Robert B Tesh; Jonna Mazet; Anne W Rimoin; Travis Taylor; Bradley S Schneider; Graham Simmons; Eric Delwart; Nathan D Wolfe; Charles Y Chiu; Eric M Leroy
Journal:  PLoS Pathog       Date:  2012-09-27       Impact factor: 6.823

7.  Host Subtraction, Filtering and Assembly Validations for Novel Viral Discovery Using Next Generation Sequencing Data.

Authors:  Gordon M Daly; Richard M Leggett; William Rowe; Samuel Stubbs; Maxim Wilkinson; Ricardo H Ramirez-Gonzalez; Mario Caccamo; William Bernal; Jonathan L Heeney
Journal:  PLoS One       Date:  2015-06-22       Impact factor: 3.240

8.  Centrifuge: rapid and sensitive classification of metagenomic sequences.

Authors:  Daehwan Kim; Li Song; Florian P Breitwieser; Steven L Salzberg
Journal:  Genome Res       Date:  2016-10-17       Impact factor: 9.043

9.  Kraken: ultrafast metagenomic sequence classification using exact alignments.

Authors:  Derrick E Wood; Steven L Salzberg
Journal:  Genome Biol       Date:  2014-03-03       Impact factor: 13.583

10.  An evaluation of the accuracy and speed of metagenome analysis tools.

Authors:  Stinus Lindgreen; Karen L Adair; Paul P Gardner
Journal:  Sci Rep       Date:  2016-01-18       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.