Heng Li1. 1. Medical Population Genetics, Broad Institute, Cambridge, MA 02142, USA.
Abstract
MOTIVATION: Single Molecule Real-Time (SMRT) sequencing technology and Oxford Nanopore technologies (ONT) produce reads over 10 kb in length, which have enabled high-quality genome assembly at an affordable cost. However, at present, long reads have an error rate as high as 10-15%. Complex and computationally intensive pipelines are required to assemble such reads. RESULTS: We present a new mapper, minimap and a de novo assembler, miniasm, for efficiently mapping and assembling SMRT and ONT reads without an error correction stage. They can often assemble a sequencing run of bacterial data into a single contig in a few minutes, and assemble 45-fold Caenorhabditis elegans data in 9 min, orders of magnitude faster than the existing pipelines, though the consensus sequence error rate is as high as raw reads. We also introduce a pairwise read mapping format and a graphical fragment assembly format, and demonstrate the interoperability between ours and current tools. AVAILABILITY AND IMPLEMENTATION: https://github.com/lh3/minimap and https://github.com/lh3/miniasm CONTACT: hengli@broadinstitute.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Single Molecule Real-Time (SMRT) sequencing technology and Oxford Nanopore technologies (ONT) produce reads over 10 kb in length, which have enabled high-quality genome assembly at an affordable cost. However, at present, long reads have an error rate as high as 10-15%. Complex and computationally intensive pipelines are required to assemble such reads. RESULTS: We present a new mapper, minimap and a de novo assembler, miniasm, for efficiently mapping and assembling SMRT and ONT reads without an error correction stage. They can often assemble a sequencing run of bacterial data into a single contig in a few minutes, and assemble 45-fold Caenorhabditis elegans data in 9 min, orders of magnitude faster than the existing pipelines, though the consensus sequence error rate is as high as raw reads. We also introduce a pairwise read mapping format and a graphical fragment assembly format, and demonstrate the interoperability between ours and current tools. AVAILABILITY AND IMPLEMENTATION: https://github.com/lh3/minimap and https://github.com/lh3/miniasm CONTACT: hengli@broadinstitute.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Konstantin Berlin; Sergey Koren; Chen-Shan Chin; James P Drake; Jane M Landolin; Adam M Phillippy Journal: Nat Biotechnol Date: 2015-05-25 Impact factor: 54.908
Authors: Chen-Shan Chin; David H Alexander; Patrick Marks; Aaron A Klammer; James Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E Eichler; Stephen W Turner; Jonas Korlach Journal: Nat Methods Date: 2013-05-05 Impact factor: 28.547
Authors: Filipe J Ribeiro; Dariusz Przybylski; Shuangye Yin; Ted Sharpe; Sante Gnerre; Amr Abouelleil; Aaron M Berlin; Anna Montmayeur; Terrance P Shea; Bruce J Walker; Sarah K Young; Carsten Russ; Chad Nusbaum; Iain MacCallum; David B Jaffe Journal: Genome Res Date: 2012-07-24 Impact factor: 9.043
Authors: Sergey Koren; Michael C Schatz; Brian P Walenz; Jeffrey Martin; Jason T Howard; Ganeshkumar Ganapathy; Zhong Wang; David A Rasko; W Richard McCombie; Erich D Jarvis Journal: Nat Biotechnol Date: 2012-07-01 Impact factor: 54.908
Authors: Magnus Unemo; Daniel Golparian; Leonor Sánchez-Busó; Yonatan Grad; Susanne Jacobsson; Makoto Ohnishi; Monica M Lahra; Athena Limnios; Aleksandra E Sikora; Teodora Wi; Simon R Harris Journal: J Antimicrob Chemother Date: 2016-07-17 Impact factor: 5.790
Authors: Robert P Auber; Thiti Suttiyut; Rachel M McCoy; Manoj Ghaste; Joseph W Crook; Amanda L Pendleton; Joshua R Widhalm; Jennifer H Wisecaver Journal: Hortic Res Date: 2020-06-01 Impact factor: 6.793
Authors: Bridget K Marcellino; Noushin Farnoud; Bruno Cassinat; Min Lu; Emanuelle Verger; Erin McGovern; Minal Patel; Juan Medina-Martinez; Max Fine Levine; Juanes E Arango Ossa; Yangyu Zhou; Heidi Kosiorek; Meenakshi Mehrotra; Jane Houldsworth; Amylou Dueck; Michael Rossi; John Mascarenhas; Jean-Jacques Kiladjian; Raajit K Rampal; Ronald Hoffman Journal: Blood Adv Date: 2020-11-24
Authors: Scott Quainoo; Jordy P M Coolen; Sacha A F T van Hijum; Martijn A Huynen; Willem J G Melchers; Willem van Schaik; Heiman F L Wertheim Journal: Clin Microbiol Rev Date: 2017-10 Impact factor: 26.132