Literature DB >> 35668290

A combined de novo assembly approach increases the quality of prokaryotic draft genomes.

Uğur Çabuk1,2,3, Ercan Selçuk Ünlü4.   

Abstract

Next-generation sequencing methods provide comprehensive data for the analysis of structural and functional analysis of the genome. The draft genomes with low contig number and high N50 value can give insight into the structure of the genome as well as provide information on the annotation of the genome. In this study, we designed a pipeline that can be used to assemble prokaryotic draft genomes with low number of contigs and high N50 value. We aimed to use combination of two de novo assembly tools (SPAdes and IDBA-Hybrid) and evaluate the impact of this approach on the quality metrics of the assemblies. The followed pipeline was tested with the raw sequence data with short reads (< 300) for a total of 10 species from four different genera. To obtain the final draft genomes, we firstly assembled the sequences using SPAdes to find closely related organism using the extracted 16 s rRNA from it. IDBA-Hybrid assembler was used to obtain the second assembly data using the closely related organism genome. SPAdes assembler tool was implemented using the second assembly, produced by IDBA-hybrid as a hint. The results were evaluated using QUAST and BUSCO. The pipeline was successful for the reduction of the contig numbers and increasing the N50 statistical values in the draft genome assemblies while preserving the coverage of the draft genomes.
© 2022. Institute of Microbiology, Academy of Sciences of the Czech Republic, v.v.i.

Entities:  

Keywords:  Bacteria; De novo assembly; Draft genome; NGS; Prokaryotes; Short reads

Mesh:

Year:  2022        PMID: 35668290     DOI: 10.1007/s12223-022-00980-7

Source DB:  PubMed          Journal:  Folia Microbiol (Praha)        ISSN: 0015-5632            Impact factor:   2.629


  19 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  DNA-DNA hybridization values and their relationship to whole-genome sequence similarities.

Authors:  Johan Goris; Konstantinos T Konstantinidis; Joel A Klappenbach; Tom Coenye; Peter Vandamme; James M Tiedje
Journal:  Int J Syst Evol Microbiol       Date:  2007-01       Impact factor: 2.747

3.  QUAST: quality assessment tool for genome assemblies.

Authors:  Alexey Gurevich; Vladislav Saveliev; Nikolay Vyahhi; Glenn Tesler
Journal:  Bioinformatics       Date:  2013-02-19       Impact factor: 6.937

4.  Chelatococcus caeni sp. nov., isolated from a biofilm reactor sludge sample.

Authors:  Long Jin; So-Ra Ko; Hyung-Gwan Lee; Byung-Hyuk Kim; Hee-Sik Kim; Chi-Yong Ahn; Hee-Mock Oh
Journal:  Int J Syst Evol Microbiol       Date:  2014-12-18       Impact factor: 2.747

5.  Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.

Authors:  Keith R Bradnam; Joseph N Fass; Anton Alexandrov; Paul Baranay; Michael Bechner; Inanç Birol; Sébastien Boisvert; Jarrod A Chapman; Guillaume Chapuis; Rayan Chikhi; Hamidreza Chitsaz; Wen-Chi Chou; Jacques Corbeil; Cristian Del Fabbro; T Roderick Docking; Richard Durbin; Dent Earl; Scott Emrich; Pavel Fedotov; Nuno A Fonseca; Ganeshkumar Ganapathy; Richard A Gibbs; Sante Gnerre; Elénie Godzaridis; Steve Goldstein; Matthias Haimel; Giles Hall; David Haussler; Joseph B Hiatt; Isaac Y Ho; Jason Howard; Martin Hunt; Shaun D Jackman; David B Jaffe; Erich D Jarvis; Huaiyang Jiang; Sergey Kazakov; Paul J Kersey; Jacob O Kitzman; James R Knight; Sergey Koren; Tak-Wah Lam; Dominique Lavenier; François Laviolette; Yingrui Li; Zhenyu Li; Binghang Liu; Yue Liu; Ruibang Luo; Iain Maccallum; Matthew D Macmanes; Nicolas Maillet; Sergey Melnikov; Delphine Naquin; Zemin Ning; Thomas D Otto; Benedict Paten; Octávio S Paulo; Adam M Phillippy; Francisco Pina-Martins; Michael Place; Dariusz Przybylski; Xiang Qin; Carson Qu; Filipe J Ribeiro; Stephen Richards; Daniel S Rokhsar; J Graham Ruby; Simone Scalabrin; Michael C Schatz; David C Schwartz; Alexey Sergushichev; Ted Sharpe; Timothy I Shaw; Jay Shendure; Yujian Shi; Jared T Simpson; Henry Song; Fedor Tsarev; Francesco Vezzi; Riccardo Vicedomini; Bruno M Vieira; Jun Wang; Kim C Worley; Shuangye Yin; Siu-Ming Yiu; Jianying Yuan; Guojie Zhang; Hao Zhang; Shiguo Zhou; Ian F Korf
Journal:  Gigascience       Date:  2013-07-22       Impact factor: 6.524

6.  GFinisher: a new strategy to refine and finish bacterial genome assemblies.

Authors:  Dieval Guizelini; Roberto T Raittz; Leonardo M Cruz; Emanuel M Souza; Maria B R Steffens; Fabio O Pedrosa
Journal:  Sci Rep       Date:  2016-10-10       Impact factor: 4.379

7.  Complete Genome Sequences for Three Chromosomes of the Burkholderia stabilis Type Strain (ATCC BAA-67).

Authors:  Julia V Bugrysheva; Blake Cherney; David Sue; Andrew B Conley; Lori A Rowe; Kristen M Knipe; Michael A Frace; Vladimir N Loparev; Julie R Avila; Kevin Anderson; David R Hodge; Segaran P Pillai; Linda M Weigel
Journal:  Genome Announc       Date:  2016-11-17

8.  Draft Genome Sequence of Burkholderia reimsis BE51, a Plant-Associated Bacterium Isolated from Agricultural Rhizosphere.

Authors:  Qassim Esmaeel; Alaa Issa; Lisa Sanchez; Christophe Clément; Cédric Jacquard; Essaid Ait Barka
Journal:  Microbiol Resour Announc       Date:  2018-10-04

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  The genome and transcriptome of Haemonchus contortus, a key model parasite for drug and vaccine discovery.

Authors:  Roz Laing; Taisei Kikuchi; Axel Martinelli; Isheng J Tsai; Robin N Beech; Elizabeth Redman; Nancy Holroyd; David J Bartley; Helen Beasley; Collette Britton; David Curran; Eileen Devaney; Aude Gilabert; Martin Hunt; Frank Jackson; Stephanie L Johnston; Ivan Kryukov; Keyu Li; Alison A Morrison; Adam J Reid; Neil Sargison; Gary I Saunders; James D Wasmuth; Adrian Wolstenholme; Matthew Berriman; John S Gilleard; James A Cotton
Journal:  Genome Biol       Date:  2013-08-28       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.