Literature DB >> 31418021

Long-read amplicon denoising.

Venkatesh Kumar1,2, Thomas Vollbrecht2, Mark Chernyshev1,2, Sanjay Mohan2, Brian Hanst3, Nicholas Bavafa2, Antonia Lorenzo1,2, Nikesh Kumar2, Robert Ketteringham4, Kemal Eren2, Michael Golden5, Michelli F Oliveira2, Ben Murrell1,2.   

Abstract

Long-read next-generation amplicon sequencing shows promise for studying complete genes or genomes from complex and diverse populations. Current long-read sequencing technologies have challenging error profiles, hindering data processing and incorporation into downstream analyses. Here we consider the problem of how to reconstruct, free of sequencing error, the true sequence variants and their associated frequencies from PacBio reads. Called 'amplicon denoising', this problem has been extensively studied for short-read sequencing technologies, but current solutions do not always successfully generalize to long reads with high indel error rates. We introduce two methods: one that runs nearly instantly and is very accurate for medium length reads and high template coverage, and another, slower method that is more robust when reads are very long or coverage is lower. On two Mock Virus Community datasets with ground truth, each sequenced on a different PacBio instrument, and on a number of simulated datasets, we compare our two approaches to each other and to existing algorithms. We outperform all tested methods in accuracy, with competitive run times even for our slower method, successfully discriminating templates that differ by a just single nucleotide. Julia implementations of Fast Amplicon Denoising (FAD) and Robust Amplicon Denoising (RAD), and a webserver interface, are freely available.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31418021      PMCID: PMC6765106          DOI: 10.1093/nar/gkz657

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  29 in total

1.  Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons.

Authors:  Brian J Haas; Dirk Gevers; Ashlee M Earl; Mike Feldgarden; Doyle V Ward; Georgia Giannoukos; Dawn Ciulla; Diana Tabbaa; Sarah K Highlander; Erica Sodergren; Barbara Methé; Todd Z DeSantis; Joseph F Petrosino; Rob Knight; Bruce W Birren
Journal:  Genome Res       Date:  2011-01-06       Impact factor: 9.043

2.  RIFRAF: a frame-resolving consensus algorithm.

Authors:  Kemal Eren; Ben Murrell
Journal:  Bioinformatics       Date:  2018-11-15       Impact factor: 6.937

3.  Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples.

Authors:  Melissa Laird Smith; Ben Murrell; Kemal Eren; Caroline Ignacio; Elise Landais; Steven Weaver; Pham Phung; Colleen Ludka; Lance Hepler; Gemma Caballero; Tristan Pollner; Yan Guo; Douglas Richman; Pascal Poignard; Ellen E Paxinos; Sergei L Kosakovsky Pond; Davey M Smith
Journal:  Virus Evol       Date:  2016-07-08

4.  Intrahost dynamics of antiviral resistance in influenza A virus reflect complex patterns of segment linkage, reassortment, and natural selection.

Authors:  Matthew B Rogers; Timothy Song; Robert Sebra; Benjamin D Greenbaum; Marie-Eve Hamelin; Adam Fitch; Alan Twaddle; Lijia Cui; Edward C Holmes; Guy Boivin; Elodie Ghedin
Journal:  MBio       Date:  2015-04-07       Impact factor: 7.867

5.  Evaluation of PacBio sequencing for full-length bacterial 16S rRNA gene classification.

Authors:  Josef Wagner; Paul Coupland; Hilary P Browne; Trevor D Lawley; Suzanna C Francis; Julian Parkhill
Journal:  BMC Microbiol       Date:  2016-11-14       Impact factor: 3.605

6.  Pacific Biosciences Sequencing and IMGT/HighV-QUEST Analysis of Full-Length Single Chain Fragment Variable from an In Vivo Selected Phage-Display Combinatorial Library.

Authors:  Audrey Hemadou; Véronique Giudicelli; Melissa Laird Smith; Marie-Paule Lefranc; Patrice Duroux; Sofia Kossida; Cheryl Heiner; N Lance Hepler; John Kuijpers; Alexis Groppi; Jonas Korlach; Philippe Mondon; Florence Ottones; Marie-Josée Jacobin-Valat; Jeanny Laroche-Traineau; Gisèle Clofent-Sanchez
Journal:  Front Immunol       Date:  2017-12-20       Impact factor: 7.561

7.  Full-Length Envelope Analyzer (FLEA): A tool for longitudinal analysis of viral amplicons.

Authors:  Kemal Eren; Steven Weaver; Robert Ketteringham; Morné Valentyn; Melissa Laird Smith; Venkatesh Kumar; Sanjay Mohan; Sergei L Kosakovsky Pond; Ben Murrell
Journal:  PLoS Comput Biol       Date:  2018-12-13       Impact factor: 4.475

8.  High-throughput amplicon sequencing of the full-length 16S rRNA gene with single-nucleotide resolution.

Authors:  Benjamin J Callahan; Joan Wong; Cheryl Heiner; Steve Oh; Casey M Theriot; Ajay S Gulati; Sarah K McGill; Michael K Dougherty
Journal:  Nucleic Acids Res       Date:  2019-10-10       Impact factor: 16.971

9.  IPED: a highly efficient denoising tool for Illumina MiSeq Paired-end 16S rRNA gene amplicon sequencing data.

Authors:  Mohamed Mysara; Natalie Leys; Jeroen Raes; Pieter Monsieurs
Journal:  BMC Bioinformatics       Date:  2016-04-29       Impact factor: 3.169

10.  Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system.

Authors:  Patrick D Schloss; Matthew L Jenior; Charles C Koumpouras; Sarah L Westcott; Sarah K Highlander
Journal:  PeerJ       Date:  2016-03-28       Impact factor: 2.984

View more
  9 in total

Review 1.  Repair of DNA double-strand breaks in plant meiosis: role of eukaryotic RecA recombinases and their modulators.

Authors:  Côme Emmenecker; Christine Mézard; Rajeev Kumar
Journal:  Plant Reprod       Date:  2022-06-01       Impact factor: 3.767

2.  Comparative genomics and full-length Tprk profiling of Treponema pallidum subsp. pallidum reinfection.

Authors:  Amin Addetia; Lauren C Tantalo; Michelle J Lin; Hong Xie; Meei-Li Huang; Christina M Marra; Alexander L Greninger
Journal:  PLoS Negl Trop Dis       Date:  2020-04-06

3.  An alpaca nanobody neutralizes SARS-CoV-2 by blocking receptor interaction.

Authors:  Leo Hanke; Laura Vidakovics Perez; Daniel J Sheward; Hrishikesh Das; Tim Schulte; Ainhoa Moliner-Morro; Martin Corcoran; Adnane Achour; Gunilla B Karlsson Hedestam; B Martin Hällberg; Ben Murrell; Gerald M McInerney
Journal:  Nat Commun       Date:  2020-09-04       Impact factor: 14.919

4.  A highly multiplexed droplet digital PCR assay to measure the intact HIV-1 proviral reservoir.

Authors:  Claire N Levy; Sean M Hughes; Pavitra Roychoudhury; Daniel B Reeves; Chelsea Amstuz; Haiying Zhu; Meei-Li Huang; Yulun Wei; Marta E Bull; Noah A J Cassidy; Jan McClure; Lisa M Frenkel; Mars Stone; Sonia Bakkour; Elizabeth R Wonderlich; Michael P Busch; Steven G Deeks; Joshua T Schiffer; Robert W Coombs; Dara A Lehman; Keith R Jerome; Florian Hladik
Journal:  Cell Rep Med       Date:  2021-04-12

5.  Long-read sequencing reveals the evolutionary drivers of intra-host diversity across natural RNA mycovirus infections.

Authors:  Deborah M Leigh; Karla Peranić; Simone Prospero; Carolina Cornejo; Mirna Ćurković-Perica; Quirin Kupper; Lucija Nuskern; Daniel Rigling; Marin Ježić
Journal:  Virus Evol       Date:  2021-12-01

6.  MOCHI, a comprehensive cross-platform tool for amplicon-based microbiota analysis.

Authors:  Jun-Jie Zheng; Po-Wen Wang; Tzu-Wen Huang; Yao-Jong Yang; Hua-Sheng Chiu; Pavel Sumazin; Ting-Wen Chen
Journal:  Bioinformatics       Date:  2022-07-25       Impact factor: 6.931

7.  Estimation of Full-Length TprK Diversity in Treponema pallidum subsp. pallidum.

Authors:  Amin Addetia; Michelle J Lin; Quynh Phung; Hong Xie; Meei-Li Huang; Giulia Ciccarese; Ivano Dal Conte; Marco Cusini; Francesco Drago; Lorenzo Giacani; Alexander L Greninger
Journal:  mBio       Date:  2020-10-27       Impact factor: 7.867

8.  Ultra-accurate microbial amplicon sequencing with synthetic long reads.

Authors:  Benjamin J Callahan; Dmitry Grinevich; Siddhartha Thakur; Michael A Balamotis; Tuval Ben Yehezkel
Journal:  Microbiome       Date:  2021-06-05       Impact factor: 14.650

9.  Multivariate mining of an alpaca immune repertoire identifies potent cross-neutralizing SARS-CoV-2 nanobodies.

Authors:  Leo Hanke; Daniel J Sheward; Alec Pankow; Laura Perez Vidakovics; Vivien Karl; Changil Kim; Egon Urgard; Natalie L Smith; Juan Astorga-Wells; Simon Ekström; Jonathan M Coquet; Gerald M McInerney; Ben Murrell
Journal:  Sci Adv       Date:  2022-03-25       Impact factor: 14.136

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.