Literature DB >> 27153593

Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences.

Heng Li1.   

Abstract

MOTIVATION: Single Molecule Real-Time (SMRT) sequencing technology and Oxford Nanopore technologies (ONT) produce reads over 10 kb in length, which have enabled high-quality genome assembly at an affordable cost. However, at present, long reads have an error rate as high as 10-15%. Complex and computationally intensive pipelines are required to assemble such reads.
RESULTS: We present a new mapper, minimap and a de novo assembler, miniasm, for efficiently mapping and assembling SMRT and ONT reads without an error correction stage. They can often assemble a sequencing run of bacterial data into a single contig in a few minutes, and assemble 45-fold Caenorhabditis elegans data in 9 min, orders of magnitude faster than the existing pipelines, though the consensus sequence error rate is as high as raw reads. We also introduce a pairwise read mapping format and a graphical fragment assembly format, and demonstrate the interoperability between ours and current tools.
AVAILABILITY AND IMPLEMENTATION: https://github.com/lh3/minimap and https://github.com/lh3/miniasm CONTACT: hengli@broadinstitute.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2016        PMID: 27153593      PMCID: PMC4937194          DOI: 10.1093/bioinformatics/btw152

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  The fragment assembly string graph.

Authors:  Eugene W Myers
Journal:  Bioinformatics       Date:  2005-09-01       Impact factor: 6.937

3.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

4.  Assembling large genomes with single-molecule sequencing and locality-sensitive hashing.

Authors:  Konstantin Berlin; Sergey Koren; Chen-Shan Chin; James P Drake; Jane M Landolin; Adam M Phillippy
Journal:  Nat Biotechnol       Date:  2015-05-25       Impact factor: 54.908

5.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

Authors:  Chen-Shan Chin; David H Alexander; Patrick Marks; Aaron A Klammer; James Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E Eichler; Stephen W Turner; Jonas Korlach
Journal:  Nat Methods       Date:  2013-05-05       Impact factor: 28.547

6.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

7.  Limitations of next-generation genome sequence assembly.

Authors:  Can Alkan; Saba Sajjadian; Evan E Eichler
Journal:  Nat Methods       Date:  2010-11-21       Impact factor: 28.547

8.  Finished bacterial genomes from shotgun sequence data.

Authors:  Filipe J Ribeiro; Dariusz Przybylski; Shuangye Yin; Ted Sharpe; Sante Gnerre; Amr Abouelleil; Aaron M Berlin; Anna Montmayeur; Terrance P Shea; Bruce J Walker; Sarah K Young; Carsten Russ; Chad Nusbaum; Iain MacCallum; David B Jaffe
Journal:  Genome Res       Date:  2012-07-24       Impact factor: 9.043

9.  Hybrid error correction and de novo assembly of single-molecule sequencing reads.

Authors:  Sergey Koren; Michael C Schatz; Brian P Walenz; Jeffrey Martin; Jason T Howard; Ganeshkumar Ganapathy; Zhong Wang; David A Rasko; W Richard McCombie; Erich D Jarvis
Journal:  Nat Biotechnol       Date:  2012-07-01       Impact factor: 54.908

Review 10.  Genetic variation and the de novo assembly of human genomes.

Authors:  Mark J P Chaisson; Richard K Wilson; Evan E Eichler
Journal:  Nat Rev Genet       Date:  2015-10-07       Impact factor: 53.242

View more
  342 in total

1.  The novel 2016 WHO Neisseria gonorrhoeae reference strains for global quality assurance of laboratory investigations: phenotypic, genetic and reference genome characterization.

Authors:  Magnus Unemo; Daniel Golparian; Leonor Sánchez-Busó; Yonatan Grad; Susanne Jacobsson; Makoto Ohnishi; Monica M Lahra; Athena Limnios; Aleksandra E Sikora; Teodora Wi; Simon R Harris
Journal:  J Antimicrob Chemother       Date:  2016-07-17       Impact factor: 5.790

2.  Hybrid de novo genome assembly of red gromwell (Lithospermum erythrorhizon) reveals evolutionary insight into shikonin biosynthesis.

Authors:  Robert P Auber; Thiti Suttiyut; Rachel M McCoy; Manoj Ghaste; Joseph W Crook; Amanda L Pendleton; Joshua R Widhalm; Jennifer H Wisecaver
Journal:  Hortic Res       Date:  2020-06-01       Impact factor: 6.793

Review 3.  Application of computational approaches to analyze metagenomic data.

Authors:  Ho-Jin Gwak; Seung Jae Lee; Mina Rho
Journal:  J Microbiol       Date:  2021-02-10       Impact factor: 3.422

4.  Minimap2: pairwise alignment for nucleotide sequences.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2018-09-15       Impact factor: 6.937

5.  The design and construction of reference pangenome graphs with minigraph.

Authors:  Heng Li; Xiaowen Feng; Chong Chu
Journal:  Genome Biol       Date:  2020-10-16       Impact factor: 13.583

6.  De novo clustering of long reads by gene from transcriptomics data.

Authors:  Camille Marchet; Lolita Lecompte; Corinne Da Silva; Corinne Cruaud; Jean-Marc Aury; Jacques Nicolas; Pierre Peterlongo
Journal:  Nucleic Acids Res       Date:  2019-01-10       Impact factor: 16.971

7.  Chromatin Organization in Early Land Plants Reveals an Ancestral Association between H3K27me3, Transposons, and Constitutive Heterochromatin.

Authors:  Sean A Montgomery; Yasuhiro Tanizawa; Bence Galik; Nan Wang; Tasuku Ito; Takako Mochizuki; Svetlana Akimcheva; John L Bowman; Valérie Cognat; Laurence Maréchal-Drouard; Heinz Ekker; Syuan-Fei Hong; Takayuki Kohchi; Shih-Shun Lin; Li-Yu Daisy Liu; Yasukazu Nakamura; Lia R Valeeva; Eugene V Shakirov; Dorothy E Shippen; Wei-Lun Wei; Masaru Yagura; Shohei Yamaoka; Katsuyuki T Yamato; Chang Liu; Frédéric Berger
Journal:  Curr Biol       Date:  2020-01-30       Impact factor: 10.834

8.  Long-read sequencing data analysis for yeasts.

Authors:  Jia-Xing Yue; Gianni Liti
Journal:  Nat Protoc       Date:  2018-05-03       Impact factor: 13.491

9.  Transient expansion of TP53 mutated clones in polycythemia vera patients treated with idasanutlin.

Authors:  Bridget K Marcellino; Noushin Farnoud; Bruno Cassinat; Min Lu; Emanuelle Verger; Erin McGovern; Minal Patel; Juan Medina-Martinez; Max Fine Levine; Juanes E Arango Ossa; Yangyu Zhou; Heidi Kosiorek; Meenakshi Mehrotra; Jane Houldsworth; Amylou Dueck; Michael Rossi; John Mascarenhas; Jean-Jacques Kiladjian; Raajit K Rampal; Ronald Hoffman
Journal:  Blood Adv       Date:  2020-11-24

Review 10.  Whole-Genome Sequencing of Bacterial Pathogens: the Future of Nosocomial Outbreak Analysis.

Authors:  Scott Quainoo; Jordy P M Coolen; Sacha A F T van Hijum; Martijn A Huynen; Willem J G Melchers; Willem van Schaik; Heiman F L Wertheim
Journal:  Clin Microbiol Rev       Date:  2017-10       Impact factor: 26.132

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.