| Literature DB >> 22392282 |
Robert Lasek1, Lukasz Dziewit, Dariusz Bartosik.
Abstract
The complete nucleotide sequence of plasmid pP62BP1 (34,467 bp), isolated from Arctic Psychrobacter sp. DAB_AL62B, was determined and annotated. The conserved plasmid backbone is composed of several genetic modules, including a replication system (REP) with similarities to the REP region of the iteron-containing plasmid pPS10 of Pseudomonas syringae. The additional genetic load of pP62BP1 includes two highly related type II restriction-modification systems and a set of genes (slfRCHSL) encoding enzymes engaged in the metabolism of organic sulfates, plus a putative transcriptional regulator (SlfR) of the AraC family. The pP62BP1 slf locus has a compact and unique structure. It is predicted that the enzymes SlfC, SlfH, SlfS and SlfL carry out a chain of reactions leading to the transformation of alkyl sulfates into acyl-CoA, with dodecyl sulfate (SDS) as a possible starting substrate. Comparative analysis of the nucleotide sequences of pP62BP1 and other Psychrobacter spp. plasmids revealed their structural diversity. However, the presence of a few highly conserved DNA segments in pP62BP1, plasmid 1 of P. cryohalolentis K5 and pRWF-101 of Psychrobacter sp. PRwf-1 is indicative of recombinational shuffling of genetic information, and is evidence of lateral gene transfer in the Arctic environment.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22392282 PMCID: PMC3346939 DOI: 10.1007/s00792-012-0435-2
Source DB: PubMed Journal: Extremophiles ISSN: 1431-0651 Impact factor: 2.395
Fig. 1Circular representation of pP62BP1. Open reading frames (ORFs) are represented by block arrows on the outer circle. Predicted functions/homologies of the ORFs are indicated by the color key. Several ORF numbers and names are given for reference (see Table 1). The positions of the putative oriV and parS sequences are indicated by white ellipses (see text for details). Solid black blocks represent the genetic modules identified in the plasmid: REP, replication module; PAR, partitioning system; R-M1 and R-M2, restriction-modification systems; SLF, phenotypic module; MRS, multimer resolution system; TA, toxin–antitoxin system. The inner circle is a graduated scale (kb)
Putative ORFs of pP62BP1 and their functions
| ORF name | Position in sequence | Protein length (aa) | Predicted gene function | Most relevant homolog | |||
|---|---|---|---|---|---|---|---|
| Strand | bp | Number of identities/number examined (%) | Organism (plasmid) | GenBank accession number | |||
|
| + | 549–1,466 | 305 | Initiator RepB protein | 154/308 (50) |
| ACY02903 |
|
| + | 1,707–2,345 | 212 | Partitioning protein ParA | 203/211 (96) |
| ABE76262 |
|
| + | 2,355–2,558 | 67 | Partitioning protein ParB | 39/68 (57) |
| EEV21786 |
| ORF4 | − | 3,412–2,711 | 233 | No significant similarity | |||
| ORF5 | + | 3,755–4,567 | 270 | Hypothetical protein | 37/79 (47) |
| BAE70960 |
| ORF6 | + | 4,681–4,815 | 44 | Hypothetical protein | 25/44 (57) |
| CBX21727 |
| ORF7 | + | 4,824–5,219 | 131 | Hypothetical protein | 37/123 (30) |
| CBX21727 |
| ORF8 | − | 6,634–5,303 | 443 | RNA-directed DNA polymerase (reverse transcriptase) | 267/483 (55) |
| EGK13519 |
| ORF9 | + | 7,289–8,563 | 424 | DNA (cytosine-5-)-methyltransferase | 279/404 (69) |
| EEY99607 |
| ORF10 | − | 9,345–8,560 | 261 | Type II site-specific deoxyribonuclease | 135/264 (51) |
| EFV17523 |
| ORF11 | + | 9,533–11,488 | 651 | Putative DNA mismatch repair protein | 406/656 (62) |
| ZP_04661841 |
| ORF12 | − | 11,957–11,697 | 86 | Hypothetical protein | 29/80 (36) |
| EEX49811 |
| ORF13 | − | 12,256–11,978 | 92 | Hypothetical protein | 32/90 (36) |
| EEX49811 |
| ORF14 | − | 14,108–12,483 | 541 | Medium-chain-fatty-acid-CoA ligase | 359/540 (66) |
| EEY97115 |
| ORF15 | − | 16,274–14,307 | 655 | Alkyl sulfatase | 365/631 (58) |
| EFQ36922 |
| ORF16 | − | 18,034–16,274 | 586 | GMC oxidoreductase family protein | 434/565 (77) |
| EEY97634 |
| ORF17 | − | 19,565–18,132 | 477 | NAD-dependent aldehyde dehydrogenase | 364/468 (78) |
| EEY97635 |
| ORF18 | + | 19,732–20,751 | 339 | AraC/XylS family transcriptional regulator | 194/339 (57) |
| EEY97636 |
| ORF19 | + | 21,088–21,720 | 210 | Resolvase domain-containing protein | 166/201 (83) |
| EGK07211 |
| ORF20 | − | 22,172–21,852 | 106 | Hypothetical protein | 17/46 (37) |
| EGK13551 |
| ORF21 | − | 23,486–22,821 | 229 | Hypothetical protein | 125/202 (62) |
| AAZ19228 |
| ORF22 | − | 23,676–23,494 | 60 | No significant similarity | |||
| ORF23 | − | 23,887–23,699 | 62 | Putative toxin of | 33/60 (55) |
| ADL43536 |
| ORF24 | + | 24,421–26,145 | 574 | Hypothetical protein | 123/564 (22) |
| ADU72791 |
| ORF25 | + | 26,411–27,685 | 424 | DNA (cytosine-5-)-methyltransferase | 271/410 (66) |
| EEX04797 |
| ORF26 | − | 28,467–27,682 | 261 | Type II site-specific deoxyribonuclease | 135/264 (51) |
| EFV17523 |
| ORF27 | − | 30,452–28,593 | 619 | SMC domain-containing protein | 270/639 (42) |
| ADO84398 |
| ORF28 | − | 31,016–30,726 | 96 | No significant similarity | |||
| ORF29 | + | 31,407–31,655 | 82 | Hypothetical protein | 20/45 (44) |
| EEX02196 |
| ORF30 | − | 32,055–31,681 | 124 | Hypothetical protein | 41/134 (31) |
| ADU68442 |
| ORF31 | + | 32,253–32,894 | 213 | Hypothetical protein | 22/55 (40) |
| BAD83748 |
| ORF32 | − | 34,151–32,946 | 401 | Hypothetical protein | 86/386 (22) |
| ABC32288 |
| ORF33 | + | 34,210–34,443 | 78 | XRE family transcriptional regulator | 42/66 (64) |
| EEO01051 |
Fig. 2a Structure of the REP module of pP62BP1. A G+C content profile is shown in the upper panel with the relative positions of ORF33, oriV and repB indicated below (1 the arbitrary starting point of the plasmid’s nucleotide sequence). b Sequence organization of the proposed oriV. The dnaA box is shaded. Dashed arrows indicate A+T-rich sequences (DR1–DR4). Six iteron sequences (IT1–IT6) are marked by black arrows. The non-identical residues in the IT6 sequence are written in lowercase. Palindromic sequence is underlined. Additional repeats similar to the 3′-part of iterons (AR1–AR3) are indicated by gray arrows. The repB −35 and −10 promoter elements and a potential ribosome binding site (rbs) are boxed. The coding sequence of repB is marked by a large gray arrow in the background
Fig. 3Comparison of the genetic structures of R-M systems encoding proteins homologous to the enzymes of the R-M systems harbored by pP62BP1. Organism names, the ranges of the compared sequences and their GenBank acc. nos. are shown on the right. (NB Only one genetic structure is shown for pP62BP1, representing both R-M1 and R-M2.) ORFs are marked by block arrows colored according to the proposed function of their product (see legend). The GenBank acc. nos. of the encoded proteins are given above or below each ORF. The percentage values indicate the level of identity between proteins encoded by corresponding ORFs, which are paired by shaded regions
Fig. 4a Genetic organization of the pP62BP1 SLF module. ORFs are represented by block arrows. The location and orientation of predicted promoter sequences (P) for slfC and slfR are indicated by arrows. b Proposed metabolic pathway comprising reactions catalyzed by the enzymes encoded in the slf operon, with dodecyl sulfate as the starting substrate (see text for details). Dashed arrow indicates transport into the cell. The proposed enzyme names are based on their functions: SlfS, alkyl sulfatase; SlfH and SlfC, dehydrogenases catalyzing oxidation of hydroxyl and carbonyl groups, respectively; SlfL, fatty-acid-CoA ligase
Fig. 5Linear alignment of pP62BP1 and plasmid 1 of Psychrobacter cryohalolentis K5. ORFs are marked by block arrows. In the strain K5 plasmid, several GenBank acc. nos. are given for reference, while the ORF whose sequence has not been deposited in GenBank is encircled. In pP62BP1, the defined genetic modules are boxed (see Fig. 1 for the abbreviations used). Vertical black bars mark the positions of 142-nt-long inverted repeats (IRL and IRR; see text for details). Regions of the plasmids sharing >70% identity (E value >1e−10) are connected by black connecting lines; those corresponding to pP62BP1 iterons and the parS sequence are marked accordingly. The scale is indicated by the bar in the lower right-hand corner