| Literature DB >> 29593241 |
Daniel C Lima1,2, Lena K Nyberg3, Fredrik Westerlund3, Silvia R Batistuzzo de Medeiros4.
Abstract
Chromobacterium violaceum is a ß-proteobacterium found widely worldwide with important biotechnological properties and is associated to lethal sepsis in immune-depressed individuals. In this work, we report the discover, complete sequence and annotation of a plasmid detected in C. violaceum that has been unnoticed until now. We used DNA single-molecule analysis to confirm that the episome found was a circular molecule and then proceeded with NGS sequencing. After DNA annotation, we found that this extra-chromosomal DNA is probably a defective bacteriophage of approximately 44 kilobases, with 39 ORFs comprising, mostly hypothetical proteins. We also found DNA sequences that ensure proper plasmid replication and partitioning as well as a toxin addiction system. This report sheds light on the biology of this important species, helping us to understand the mechanisms by which C. violaceum endures to several harsh conditions. This discovery could also be a first step in the development of a DNA manipulation tool in this bacterium.Entities:
Mesh:
Year: 2018 PMID: 29593241 PMCID: PMC5871888 DOI: 10.1038/s41598-018-23708-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Restriction digestion pattern from two independent preparations of the episome. The asterisk denotes the band corresponding to the plasmid. The restriction enzyme used is mentioned on top of each lane. 0.8% TBE Agarose gel stained with ethidium bromide.
Figure 2Restriction digestion of extra-chromosomal DNA extracted from eight different C. violaceum strains. Samples were digested with KpnI, BamHI and EcoRI for one hour at 37 °C. “−” reflects non digested samples and L is DNA ladder. 0.8% TBE agarose gel stained with ethidium bromide.
Figure 3Kymographs showing the extensions of circular and nicked forms of the episome in nanofluidic channels. Competitive binding was used in order to produce the emission intensity pattern along the linear for of the plasmid.
General features of pChV1.
| Lenght, bp | 44,212 |
| G + C content | 65.96% |
| Total ORFs | 39 |
| Percentage of plasmid sequence constituting coding regions | 89.66% |
| Average ORF lenght, bp | 1017 |
| Number of conserved hypothetical proteins | 28 |
| Number of hypothetical proteins | 1 |
Figure 4Map of the pChV1 plasmid. Most of the phage-related genes are present in the same region. The distribution of the plasmid partitioning genes is in accordance to what is seen in the literature. Blue, red and green arrows depict phage, plasmid and hypothetical ORFS respectively.
List of ORFs found in pChV1.
| ORF Number | Gene | Length | Domains | Best Hit Identity |
|---|---|---|---|---|
| ORF_01 | Chromosome partitioning protein ParA | 840 | ParA/Soj/Fer4 NifH | 93% |
| ORF_02 | Partitioning protein ParB | 885 | ParB | 96% |
| ORF_03 | Plasmid replication protein RepA | 951 | Plasmid replication initiator protein | 49% |
| ORF_04 | Conserved hypothetical protein | 477 | none | 82% |
| ORF_05 | Conserved hypothetical protein/putative bacteriophage lysis protein | 1029 | COG4623 | 82% |
| ORF_06 | Conserved hypothetical protein | 312 | SlyX | 97% |
| ORF_07 | Conserved hypothetical protein | 756 | Cadherin repeat/Ca2+ binding | 32% |
| ORF_08 | Conserved hypothetical protein | 1422 | none | 44% |
| ORF_09 | Conserved hypothetical protein | 1887 | none | 60% |
| ORF_10 | Conserved hypothetical protein | 363 | none | 99% |
| ORF_11 | Conserved hypothetical protein | 219 | none | 95% |
| ORF_12 | Conserved hypothetical protein | 1167 | DUF4157 | 48% |
| ORF_13 | Toxin | 801 | RhsA | 52% |
| ORF_14 | Conserved hypothetical protein | 708 | none | 63% |
| ORF_15 | Conserved hypothetical protein | 1125 | none | 98% |
| ORF_16 | Conserved Hypothetical protein | 126 | none | 52% |
| ORF_17 | Conserved hypothetical protein | 615 | Transposase | 99% |
| ORF_18 | Hypothetical protein | 726 | none | ND |
| ORF_19 | Conserved hypothetical protein | 891 | DUF4255 | 95% |
| ORF_20 | Conserved hypothetical protein | 1854 | ATPase AAA domain | 72% |
| ORF_21 | Conserved hypothetical protein | 2415 | none | 62% |
| ORF_22 | Conserved hypothetical protein | 453 | none | 78% |
| ORF_23 | Conserved hypothetical protein | 3768 | DUF342 | 95% |
| ORF_24 | Conserved hypothetical protein | 2439 | none | 82% |
| ORF_25 | Conserved hypothetical protein | 993 | ribonuclease e/dihydrolipoamide succyniltransferase | 66% |
| ORF_26 | Conserved hypothetical protein | 3168 | TIGR02243(phage tail-like region) | 77% |
| ORF_27 | Conserved hypothetical protein | 441 | none | 90% |
| ORF_28 | Phage baseplate assembly protein W | 399 | GPW gp25 | 71% |
| ORF_29 | Conserved hypothetical protein | 1593 | Phage Base V | 59% |
| ORF_30 | Conserved hypothetical protein | 729 | Phage Tube | 41% |
| ORF_31 | Conserved Hypothetical protein | 165 | none | 52% |
| ORF_32 | Conserved hypothetical protein | 456 | Phage T4 gp19 | 40% |
| ORF_33 | Phage tail protein | 447 | Phage T4 gp19 | 91% |
| ORF_34 | Phage tail sheath protein | 1416 | COG3497(phage tail sheath protein FI) | 63% |
| ORF_35 | DNA invertase | 624 | mpi/SR ResInv(Recombinase;DNA binding)/HTH Hin like | 97% |
| ORF_36 | Conserved hypothetical protein | 348 | HTH_XRE (transcriptional regulator family)/xenobiotic response | 97% |
| ORF_37 | Toxin HipA | 1350 | HipA/Rna Pol | 90% |
| ORF_38 | Conserved Hypothetical protein | 126 | none | 52% |
| ORF_39 | Plasmid replication initiator protein | 1182 | RPA | 54% |
Figure 5GC content profile of pChV1. Two points of low GC content are observable in the 1) beginning-end of the plasmid sequence and in the 2) 27000 bp region (second chart, below).
Figure 6GC-skew of pChV1. The cumulative GC-skew (red curve) has two decline points, characteristic of origins of replication and might reflect oriR and oriL.
Pairs of Inverted repeated and palindromic sequences founded in pChV1.
| Coordinates | Strand | Sequence | Type of sequence |
|---|---|---|---|
| 42355–42373 | + | TGTAGCAAGTTGCTACACT | Inverted repeat sequence |
| 5113–5120 | + | AAATATTT | Palindrome |
| 44161–44177 | − | TGTAGCAAg/cTTGCTACA | Palindrome |
| 27010–27016 | − | ATAt/aTAT | Palindrome |
| 1608–1615 | + | TGAATTCA | Palindrome |
| 1465–1478 | + | TTTTTAACTAAAAA | Palindrome |
| 304–312 | − | TTGAt/aTCAA | Palindrome |
| 34202–34209 | + | AATTAATT | Palindrome |
| 34414–34424 | + | ATTTGTCATAT | Palindrome |
| 27884–27891 | − | TATTCATA | Palindrome |
Figure 7Schematic chart showing the genomic context in which the 19 bp inverted repeated sequence are flanking the partitioning-related genes parA and parB. The diagram is not to scale.