| Literature DB >> 30619542 |
Hirokazu Yano1, Masaki Shintani2,3, Masaru Tomita4,5, Haruo Suzuki4,5, Taku Oshima6.
Abstract
Plasmids are genetic parasites of microorganisms. The genomes of naturally occurring plasmids are expected to be polished via natural selection to achieve long-term persistence in the microbial cell population. However, plasmid genomes are extremely diverse, and the rules governing plasmid genomes are not fully understood. Therefore, computationally designing plasmid genomes optimized for model and nonmodel organisms remains challenging. Here, we summarize current knowledge of the plasmid genome organization and the factors that can affect plasmid persistence, with the aim of constructing synthetic plasmids for use in gram-negative bacteria. Then, we introduce publicly available resources, plasmid data, and bioinformatics tools that are useful for computational plasmid design.Entities:
Keywords: Bioinformatics; Genome design; Host-parasite coevolution; Plasmid persistence; Synthetic biology; Vector design
Year: 2018 PMID: 30619542 PMCID: PMC6312765 DOI: 10.1016/j.csbj.2018.12.001
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Lists of plasmids in different incompatibility groups.
| Incompatibility | Representative plasmid (original host) | Accession number | RIP | MOB | MPF | Host range | Reference |
|---|---|---|---|---|---|---|---|
| Inc groups | |||||||
| A/C1 (=IncA) | RA1 ( | NC_012885 | RepA | MOBH | MPFF | [ | |
| A/C2 (=IncC) | pRMH760 ( | KF976462 | RepA | MOBH | MPFF | [ | |
| IncB/O | R3521 ( | GU256641 | RepA | MOBP | MPFI | [ | |
| IncD | R711b ( | NA | NA | NA | NA | [ | |
| IncFI | F ( | AP001918 | RepE (for RepFIA replicon) | MOBF | MPFF | [ | |
| IncFII | R1 ( | KY749247 | RepA | MOBF | MPFF | [ | |
| IncG/U (=IncP-6) | Rms149 ( | AJ877225 | RepA | MOBP | – | [ | |
| RA3 ( | DQ401103 | ||||||
| IncH | R27 ( | AF250878 | RepHI1A, RepHI1B | MOBH | MPFF | [ | |
| IncI | R64 ( | AP005147 | RepZ | MOBP | MPFI | [ | |
| IncJ | R391 ( | AY090559 | – | MOBH | MPFF | NA | [ |
| IncK | R387 ( | NCTC50022 | MOBP | MPFI | NA | [ | |
| IncL | R471 ( | KM406489 | RepA | MOBP | MPFI | [ | |
| pKOI-34 ( | AB715422 | ||||||
| IncM | R69 ( | KM406488 | RepA | MOBP | MPFI | [ | |
| pEL60 ( | NC_005246 | ||||||
| IncN | N3 ( | NC_015599 | RepA | MOBF | MPFT | [ | |
| IncP (=IncP-1) | RK2 ( | BN000925 | TrfA | MOBP | MPFT | [ | |
| IncQ | RSF1010 ( | M28829 | RepABC | MOBQ | – | [ | |
| IncR | pKP1780 ( | JX424614 | RepB | – | – | [ | |
| IncS (=IncHI2) | R478 ( | BX664015 | RepHIA | MOBH | MPFF | [ | |
| IncT | Rts1 ( | AP004237 | MOBH | MPFF | [ | ||
| IncV | R753 ( | NCTC50521 (planned) | NA | NA | NA | [ | |
| IncW | R388 ( | NC_028464 | RepA | MOBF | MPFT | [ | |
| IncX | R6K ( | NCTC50005 | π | MOBP | MPFT | [ | |
| IncY | P1 ( | AF234172 (phage P1 | RepA | – | – | [ | |
| pMCR-1-P3 ( | KX880944 | ||||||
| IncZ | pEI545 ( | M93064 (partial) | RepA | NA | NA | [ | |
| PromA | pMRAD02 ( | NC_010509 | RepA | MOBP | MPFT | [ | |
| pIPO2 ( | AJ297913 | ||||||
| pSN0729-62 ( | AP018705 | ||||||
| IncP-9 | pM3 ( | AF078924 (partial) | Rep | MOBF | MPFT | [ | |
| NAH7 ( | NC_007926 | ||||||
| Representative cloning vectors | |||||||
| Not assigned | pUC18/19, pET, pBluescript | L09136 | pMB1 copy-up type, pMB1 type, pMB1 copy-up type | – | – | [ | |
| Not assigned | pACYC148 | X06403 | p15A type | – | – | [ | |
| Not assigned | pBBR1MCS ( | NC_025015 | pBBR1 Rep | MOBV | – | [ | |
| Not assigned | pSC101 ( | NC_002056 | RepA | MOBQ | – | [ | |
| Not assigned | pABC1 ( | KY031728 | p42b RepC | – | – | [ | |
| Not assigned | pME6041 ( | AF118812 | pVS1 RepA | – | – | [ | |
Several Inc. groups are identical; e.g. IncG = IncU.
Representative plasmids are listed based on Lawley et al. [186].
Accession numbers in National Center for Biotechnology Information (NCBI) GenBank and RefSeq (https://www.ncbi.nlm.nih.gov/Sequin/acc.html) and Wellcome Sanger Institute (prefix “NCTC” https://www.sanger.ac.uk/resources/downloads/plasmids).
Names of replication initiation protein (RIP). “NA” indicates that the nucleotide sequences of the plasmid are not available.
Classification of MOB classes and MPF types is based on Smillie et al. [132] and Guglielmini et al. [187]. “-” indicates that the genes involved in conjugation have not been detected, whereas “NA” indicates that the nucleotide sequences of the plasmid are not available.
Plasmid host range determined based on genome sequencing projects (hosts in which a plasmid has been found) and/or filter mating assays.
Original hosts are unknown because exogenous plasmid capturing was used.
Key factors in the construction of a plasmid vector.
| Factors | Notes (what should be considered) |
|---|---|
| 1. Plasmid gene content | Include a set of plasmid core genes. |
| Include selection marker or a toxin-antitoxin system to prevent generation of plasmid-free cells. | |
| Include | |
| 2. Interaction with host | Select a basic replicon that has evolved in species closely related to a model host. |
| Transcriptional regulator or NAPs (H-NS homologs) for plasmid genes could reduce the fitness cost imposed by the plasmid. | |
| 3. Constraints in genome | The G + C content of the plasmid should match that of the host. |
| Highly expressed essential genes should be on leading strands. |
Fig. 1Modular structure of the plasmid genome. An example of the IncP-1 plasmid genome is shown. The replication module is the only element essential for plasmid maintenance. The other modules assist in plasmid maintenance. IncP-1β and IncP-1α are lineages of IncP-1 plasmids. Horizontal arrows indicate the direction of transcription. Each functional module is regulated by a transcriptional regulator encoded in the module block or global regulators encoded in the central control region [188,189].
Fig. 2Replication module. (A) Replication module of IncP-1 plasmids. Ssb encodes a single-strand DNA binding protein. trfA encodes the Rep protein. (B) Replication initiation of IncP-1 plasmid. Monomeric TrfA bound to iterons opens base pairs in the DUE, and then host DnaA or TrfA itself recruits DnaB onto the unwound DUE. TrfA molecules bind to single-stranded DNA to assist replisome assembly on one strand via direct interaction with β-clamp, starting unidirectional replication. Lagging strand synthesis is not shown for simplicity. Illustration follows [8], with minor modifications. (C) Replication inhibition by Rep protein dimer. Paired oriV can dissociate via proteolysis or an increased Rep protein monomer to dimer ratio [32].
Fig. 3Partition module. (A) par locus of P1 prophage (type I partition system). parA encodes Walker-type ATPase. parB encodes centromere-binding protein. parS is centromere. (B-F) Diffusion-ratchet model of plasmid motion. (B) ParA has two states: ParA*, the ATP-bound form, active in binding DNA; ParA, other forms inactive in binding DNA. ParB binds to parS. For simplicity, only two ParB molecules are shown. (C) ParB binding to ParA* activates ATP hydrolysis by ParA. ParA is released from the nucleoid. (D) ParA slowly exchanges ADP with ATP, then returns to the nucleoid surface. (E) When replicated plasmid copies are present in close proximity, a ParA*-free area is generated between them. Each ParB/plasmid complex diffuses until finding its closest ParA*; thus, their interactions are repulsive. (F) ParB/plasmid complexes are pulled to ParA*-dense areas at opposite ends, following the gradient of ParA*. Illustration follows [5], with modifications.
Fig. 4Box-and-whisker plots displaying the distributions of plasmid sizes (kb) for various plasmid groups based on a five-number summary (minimum, 25th percentile, median, 75th percentile, and maximum). Outliers are plotted as open circles. Data are from Suzuki et al. [17].
Fig. 5Plot of G + C contents of 209 plasmids and their host chromosomes. Each point represents a plasmid-chromosome pair from 209 prokaryotes. To minimize the bias in the numbers of sequenced organisms and replicons available in public databases (e.g., thousands of genome projects for Escherichia coli, and multireplicons for Borrelia species), RefSeq data for completely sequenced prokaryotes that consist of one chromosome and plasmid were retrieved on April 17, 2017 from a list of all selected representative prokaryotic genomes (ftp://ftp.ncbi.nlm.nih.gov/genomes/GENOME_REPORTS/prok_representative_genomes.txt). The G + C contents of plasmids tend to be lower than (and are correlated with) those of the host chromosomes.
List of bioinformatics tools for plasmid sequence analysis and vector design.
| Usage | Name | URL |
|---|---|---|
| ApE (A plasmid Editor) | ||
| SnapGene | ||
| Benchling | ||
| SEVA-DB | ||
| PLACNET | ||
| PLACNETw | ||
| plasmidSPAdes | ||
| Recycler | ||
| PlasmidTron | ||
| PlasmidFinder | ||
| cBar | ||
| PlasFlow | ||
| MOB-suite | ||
| PlasmidSeeker | ||