Literature DB >> 30637385

A Curated, Comprehensive Database of Plasmid Sequences.

Lauren Brooks1, Mo Kaze2, Mark Sistrom2.   

Abstract

Plasmid sequences are central to a myriad of microbial functions and processes. Here, we have compiled a database of complete plasmid sequences and associated metadata curated from both NCBI's recent genome database update, which includes plasmids as organisms, and all available annotated bacterial genomes. The resultant database contains 10,892 complete plasmid sequences and associated metadata.

Year:  2019        PMID: 30637385      PMCID: PMC6318356          DOI: 10.1128/MRA.01325-18

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Plasmids are one of the key vectors of horizontal gene transfer in bacteria and archaea (1). Plasmids play a major role in bacterial genetic diversity (2), evolution (3), and adaptation (4). Conjugative exchange (i.e., the transfer of plasmids from one bacterium to another) can lead to the spread of a variety of functions, including degradation of heavy metals and anthropogenic toxic waste (5), bacteriocin and toxin production to ward off predators (6), and, alarmingly, antibiotic resistance and virulence plasmids that inhibit antibiotics and lead to novel and untreatable diseases (7). Plasmids are also extensively used as tools in genetic engineering (8). To generate a comprehensive plasmid database, we started with the recent NCBI genome database update, which has a separate collection of plasmids as organisms. FASTA format files containing plasmid “genome” sequences were downloaded on 5 March 2018 from ftp://ftp.ncbi.nlm.nih.gov/refseq/release/plasmid/, resulting in 11,677 plasmid sequences. Using the R package Rentrez (https://cran.r-project.org/web/packages/rentrez/index.html), we downloaded the metadata available from the nucleotide database for each entry based on the locus number contained in the header file for each plasmid. Metadata from the BioProject, BioSample, and Assembly databases were also pulled for each plasmid sequence when present. An initial review of the metadata demonstrated that not all sequences contained in the downloaded files were complete plasmid sequences. After downloading all sequences labeled as plasmids (n = 11,677), we filtered the database using the nucleotide metadata to remove partial plasmid sequences from the databases (n = 9,763) and again using the assembly metadata to remove incomplete assemblies (n = 7,434). Additionally, 8 sequences labeled as phages were found and removed from the database. This resulted in 7,426 complete and assembled plasmid sequences following this initial screening. In addition to curating the predefined NCBI plasmid database, we extracted plasmid sequences from bacterial genomes with complete assemblies in NCBI’s prokaryotic genome database (https://www.ncbi.nlm.nih.gov/genome/browse#!/prokaryotes/). Genomic assemblies labeled as partially complete or in contigs were not included to ensure that only complete plasmid sequences were included in our final database. Sequences that were already included as part of the original plasmid downloads, as identified by their accession or locus numbers, were removed as duplicates. This allowed us to include an additional 3,466 complete, annotated plasmid sequences, resulting in our database of 10,892 complete and annotated plasmid sequences for subsequent analyses. The two data sets described above were combined to result in a comprehensive, complete, and annotated plasmid database. Metadata for this final list were compiled using the accession version number provided in the header for each plasmid sequence as described above.

Data availability.

The plasmid database is available in fasta format and associated metadata are available in csv format at https://doi.org/10.15146/R33X2J.
  21 in total

Review 1.  Experimental approaches to tracking mobile genetic elements in microbial communities.

Authors:  Christina C Saak; Cong B Dinh; Rachel J Dutton
Journal:  FEMS Microbiol Rev       Date:  2020-09-01       Impact factor: 16.408

2.  CRISPR-Cas is associated with fewer antibiotic resistance genes in bacterial pathogens.

Authors:  Elizabeth Pursey; Tatiana Dimitriu; Fernanda L Paganelli; Edze R Westra; Stineke van Houte
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2021-11-29       Impact factor: 6.237

3.  CRISPR-Cas systems restrict horizontal gene transfer in Pseudomonas aeruginosa.

Authors:  Rachel M Wheatley; R Craig MacLean
Journal:  ISME J       Date:  2020-12-21       Impact factor: 10.302

4.  Complete Assembly of Escherichia coli Sequence Type 131 Genomes Using Long Reads Demonstrates Antibiotic Resistance Gene Variation within Diverse Plasmid and Chromosomal Contexts.

Authors:  Arun Gonzales Decano; Catherine Ludden; Theresa Feltwell; Kim Judge; Julian Parkhill; Tim Downing
Journal:  mSphere       Date:  2019-05-08       Impact factor: 4.389

5.  Contrasting patterns of longitudinal population dynamics and antimicrobial resistance mechanisms in two priority bacterial pathogens over 7 years in a single center.

Authors:  Matthew J Ellington; Eva Heinz; Alexander M Wailan; Matthew J Dorman; Marcus de Goffau; Amy K Cain; Sonal P Henson; Nicholas Gleadall; Christine J Boinett; Gordon Dougan; Nicholas M Brown; Neil Woodford; Julian Parkhill; M Estée Török; Sharon J Peacock; Nicholas R Thomson
Journal:  Genome Biol       Date:  2019-09-02       Impact factor: 13.583

6.  Transcriptional Regulation and Mechanism of SigN (ZpdN), a pBS32-Encoded Sigma Factor in Bacillus subtilis.

Authors:  Aisha T Burton; Aaron DeLoughery; Gene-Wei Li; Daniel B Kearns
Journal:  mBio       Date:  2019-09-17       Impact factor: 7.867

7.  Escherichia coli ST8196 is a novel, locally evolved, and extensively drug resistant pathogenic lineage within the ST131 clonal complex.

Authors:  Priyanka Hastak; Mathieu Fourment; Aaron E Darling; Thomas Gottlieb; Elaine Cheong; John Merlino; Garry S A Myers; Steven P Djordjevic; Piklu Roy Chowdhury
Journal:  Emerg Microbes Infect       Date:  2020-12       Impact factor: 7.163

8.  Legionella pneumophila CRISPR-Cas Suggests Recurrent Encounters with One or More Phages in the Family Microviridae.

Authors:  Shayna R Deecker; Malene L Urbanus; Beth Nicholson; Alexander W Ensminger
Journal:  Appl Environ Microbiol       Date:  2021-08-11       Impact factor: 4.792

9.  A marine plasmid hitchhiking vast phylogenetic and geographic distances.

Authors:  Jörn Petersen; John Vollmers; Victoria Ringel; Henner Brinkmann; Claire Ellebrandt-Sperling; Cathrin Spröer; Alexandra M Howat; J Colin Murrell; Anne-Kristin Kaster
Journal:  Proc Natl Acad Sci U S A       Date:  2019-09-23       Impact factor: 11.205

10.  Phylogenetic and genomic analysis reveals high genomic openness and genetic diversity of Clostridium perfringens.

Authors:  Yuqing Feng; Xuezheng Fan; Liangquan Zhu; Xinyue Yang; Yan Liu; Shiguang Gao; Xiaolu Jin; Dan Liu; Jiabo Ding; Yuming Guo; Yongfei Hu
Journal:  Microb Genom       Date:  2020-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.