Literature DB >> 34871418

Deeplasmid: deep learning accurately separates plasmids from bacterial chromosomes.

William B Andreopoulos1,2, Alexander M Geller3, Miriam Lucke3, Jan Balewski4, Alicia Clum1, Natalia N Ivanova1, Asaf Levy3.   

Abstract

Plasmids are mobile genetic elements that play a key role in microbial ecology and evolution by mediating horizontal transfer of important genes, such as antimicrobial resistance genes. Many microbial genomes have been sequenced by short read sequencers and have resulted in a mix of contigs that derive from plasmids or chromosomes. New tools that accurately identify plasmids are needed to elucidate new plasmid-borne genes of high biological importance. We have developed Deeplasmid, a deep learning tool for distinguishing plasmids from bacterial chromosomes based on the DNA sequence and its encoded biological data. It requires as input only assembled sequences generated by any sequencing platform and assembly algorithm and its runtime scales linearly with the number of assembled sequences. Deeplasmid achieves an AUC-ROC of over 89%, and it was more accurate than five other plasmid classification methods. Finally, as a proof of concept, we used Deeplasmid to predict new plasmids in the fish pathogen Yersinia ruckeri ATCC 29473 that has no annotated plasmids. Deeplasmid predicted with high reliability that a long assembled contig is part of a plasmid. Using long read sequencing we indeed validated the existence of a 102 kb long plasmid, demonstrating Deeplasmid's ability to detect novel plasmids.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2022        PMID: 34871418      PMCID: PMC8860608          DOI: 10.1093/nar/gkab1115

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  56 in total

1.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

Review 2.  The diversity of conjugative relaxases and its application in plasmid classification.

Authors:  María Pilar Garcillán-Barcia; María Victoria Francia; Fernando de la Cruz
Journal:  FEMS Microbiol Rev       Date:  2009-05       Impact factor: 16.408

3.  Introducing the bacterial 'chromid': not a chromosome, not a plasmid.

Authors:  Peter W Harrison; Ryan P J Lower; Nayoung K D Kim; J Peter W Young
Journal:  Trends Microbiol       Date:  2010-01-18       Impact factor: 17.079

4.  plasmidSPAdes: assembling plasmids from whole genome sequencing data.

Authors:  Dmitry Antipov; Nolan Hartwick; Max Shen; Mikhail Raiko; Alla Lapidus; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2016-07-27       Impact factor: 6.937

5.  The IMG/M data management and analysis system v.6.0: new tools and advanced capabilities.

Authors:  I-Min A Chen; Ken Chu; Krishnaveni Palaniappan; Anna Ratner; Jinghua Huang; Marcel Huntemann; Patrick Hajek; Stephan Ritter; Neha Varghese; Rekha Seshadri; Simon Roux; Tanja Woyke; Emiley A Eloe-Fadrosh; Natalia N Ivanova; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

6.  Broad-host-range properties of plasmid RK2: importance of overlapping genes encoding the plasmid replication initiation protein TrfA.

Authors:  F C Fang; D R Helinski
Journal:  J Bacteriol       Date:  1991-09       Impact factor: 3.490

7.  An SOS inhibitor that binds to free RecA protein: the PsiB protein.

Authors:  Vessela Petrova; Sindhu Chitteni-Pattu; Julia C Drees; Ross B Inman; Michael M Cox
Journal:  Mol Cell       Date:  2009-10-09       Impact factor: 17.970

8.  Post-transcriptional control of expression of the repA gene of plasmid R1 mediated by a small RNA molecule.

Authors:  J Light; S Molin
Journal:  EMBO J       Date:  1983       Impact factor: 11.598

9.  PlasmidSeeker: identification of known plasmids from bacterial whole genome sequencing reads.

Authors:  Märt Roosaare; Mikk Puustusmaa; Märt Möls; Mihkel Vaher; Maido Remm
Journal:  PeerJ       Date:  2018-04-02       Impact factor: 2.984

10.  IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes.

Authors:  I-Min A Chen; Ken Chu; Krishna Palaniappan; Manoj Pillay; Anna Ratner; Jinghua Huang; Marcel Huntemann; Neha Varghese; James R White; Rekha Seshadri; Tatyana Smirnova; Edward Kirton; Sean P Jungbluth; Tanja Woyke; Emiley A Eloe-Fadrosh; Natalia N Ivanova; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  1 in total

1.  Accurate identification of bacteriophages from metagenomic data using Transformer.

Authors:  Jiayu Shang; Xubo Tang; Ruocheng Guo; Yanni Sun
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.