| Literature DB >> 29580205 |
Sherwood R Casjens1,2,3, Lia Di4, Saymon Akther5, Emmanuel F Mongodin6, Benjamin J Luft7, Steven E Schutzer8, Claire M Fraser6, Wei-Gang Qiu9,10,11.
Abstract
BACKGROUND: With approximately one-third of their genomes consisting of linear and circular plasmids, the Lyme disease agent cluster of species has the most complex genomes among known bacteria. We report here a comparative analysis of plasmids in eleven Borreliella (also known as Borrelia burgdorferi sensu lato) species.Entities:
Keywords: Borrelia; Borreliella; Burgdorferi; Circular plasmid; Linear plasmid; Lyme disease; Plasmid evolution
Mesh:
Substances:
Year: 2018 PMID: 29580205 PMCID: PMC5870499 DOI: 10.1186/s12864-018-4597-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Number of plasmids in NBu-Borreliella isolates
| Linear plasmids | Circular plasmids | |
|---|---|---|
|
| ||
| ACA-1 | 9 | 5 |
| BO23 | 6a | 2a |
| K78 | 8 | 5 |
| PKo | 9 | 8 |
|
| ||
| DN127 | 7 | 9 |
|
| ||
| SV1 | 5 | 5 |
|
| ||
| Far04 | 6 | 1 |
| PBr | 8 | 3 |
| 20047 | 4a | 1a |
|
| ||
| HO14 | 3a | 1a |
|
| ||
| MN14–1420 | 8 | 7 |
| MN14–1539 | 7 | 7 |
|
| ||
| A14S | 7 | 6b |
|
| ||
| VS116 | 6 | 5 |
aNot all plasmids have been identified and sequenced in this isolate
bBecause the sequences of some circular plasmids in A14S were not closed, it is possible that some cp32s are fused. Thus, the number of different plasmid DNA molecules could be slightly less than this value
Fig. 1Sequenced plasmids in NBu-Borreliella species. The eleven completely sequenced genomes and three partly sequence genomes are shown as columns where shaded cells indicate the plasmid’s presence, “fused” indicates that the indicated plasmids contain apparently intact PFam32 genes of more than one type (see text); and “∆”, indicates the presence of a substantial deletion relative to other cp32s. Hyphens (−) denote plasmids that are known not to be present, and blank cells indicate that it is not known if that plasmid type is present. For the linear plasmids and cp9, Roman numerals indicate plasmid organizational subtype (see text and [20] for subtype definitions); here subtypes are named by the first letter of the genus name and first two letters of the species name (e.g., B. afzelii subtype I is “Baf I”). Similar subtype numerals in different plasmid types (columns in table) does not imply any relationship, and subtype organizations are always different in the different species except in the three cases that are indicated by dark gray cells (see text). The cp26 plasmids do not exhibit organizational variation and cp32s are too variable, so no subtypes are defined for these plasmids; a “+” indicates that a plasmid of that PFam32 type is present, and parentheses (...) around the subtype name denote the cp9 and four cp32 isolate A14S plasmids whose sequences were not closed. Because the sequences of these plasmids in A14S were not closed, it is possible that some cp32s are fused, so the number of different plasmid DNA molecules is not known precisely in this case. (a) The indicated plasmid sequences from isolates B. afzelii BO23 and B. garinii 20047 have been deposited in GenBank as closed sequences (Additional file 1: Table S1); it is not known what other plasmids these isolates may carry (S. Bontemps-Gallo, Pers. Com.); (b) these plasmids represent the full complement of plasmids of B. afzelii K78 and the two B. mayonii isolates; accession numbers are listed in [15, 21], respectively; (c) the plasmid sequences from isolate B. japonica HO14 (ATCC51557) are “minimal draft” quality sequences; they are included here because they are the right size to be full plasmid sequences but the details of their sequences should be interpreted with caution. (d) plasmid is very likely present in the original ACA-1 isolate, see text; (e) called lp28–10 in reference [15], see text; (f) called lp28–1 in reference [21], see text; (g) the lp32 plasmids very likely have the same compatibility type as cp32 of the same number, see text; (h) the BO23 lp54 sequence does not include the PFam54 gene array that distinguishes the subtypes of this plasmid; (i) previously named cp32–2 and cp32–7 have the same PFam32 protein type; we use cp32–7 to represent this group; and (j) several kbp of typically linear plasmid sequence replaces at least part of the deleted cp32 DNA
Fig. 2PFam32 gene tree suggests ancestral radiation of plasmid compatibility groups. The midpoint-rooted gene tree (center) is based on a 392-amino acid long alignment of 411 PFam32 homologues from 34 Borreliella genomes and selected sequences from 8 relapsing-fever Borrelia genomes. The rings, from center outward, indicate the (i) continent of origin of each isolate, (ii) plasmid compatibility type names with different shades of pink, blue, and green for the core genome (chromosome, cp26 and lp54), linear, and circular plasmids, respectively, and (iv) species (colored dots). Lavender central ribbons connect multiple PFam32 alleles found on the same replicon, indicating plasmid fusion events. A vertical PFam32 tree is available as Additional file 1: Figure S1, where two PFam32 homologs (one on lp28–9 from BOL26, another on lp56 from DN127) with positions inconsistent with the genome phylogeny (see Fig. 5 below) – indicative of whole-plasmid transfer – are highlighted. Plasmid types lp5 and some cp9s are not shown in the figure as they do not have a PFam32 partition gene
Fig. 5Borreliella chromosome phylogeny. It is based on a concatenated 144,891-long protein sequence alignment of 450 single-copy orthologues universally present in all Borreliella genomes (see Methods) and is rooted with orthologues from relapsing-fever Borrelia genomes (not shown). The four previously designated B. burgdorferi chromosomal “SNP types” A, B, C and D [1] and a new type E are indicated outside the circle in the same color box as the cognate strain names. Isolate names are shown within the outer ring, and species designations of the genomes are indicated by color in the outer ring. Continental origins of the isolates are indicated by colored dots at the tips
Fig. 3ORF maps of Borreliella lp17 plasmids. The horizontal bars represent all the sequenced lp17 plasmids; identical background colors indicate regions of homologous DNA in the different plasmids. The bacterial species and organizational subtypes (see text) are indicated by Roman numerals and on the right (without their species component, see text), and isolates that carry each subtype are indicated on the left. Selected genes are indicated by red arrows, and black “X”s mark pseudogenes. PFam numbers [19, 30] are indicated in the boxes above; “U”s in boxes are proteins for which no intact gene is known in B. burgdorferi; “J's” in boxes indicate strain B31 homologous proteins for which there is only one gene in strain B31 (i.e., no PFam number exists); adeC, cspZ and fbn refer to adenine deaminase, complement regulator-acquiring surface protein Z and fibronectin binding protein encoding genes, respectively; selected strain B31 gene names are indicated above its map. Black lines between plasmids mark the locations of inversions. See Additional file 1: Figure S3B for locations of indels within generally homologous regions (e.g., deletions within B31-like sections of VS116 lp17 and PBr lp17)
Fig. 4ORF maps of Borreliella lp36 plasmids. Plasmids are represented as in Fig. 3, with the same background colors indicating regions of homologous DNA. The organizational subtypes (see text) are indicated by Roman numerals on the right, and isolates that carry each type are indicated at the left
Linear plasmid locations of selected NBu-Borreliella genes
| Genea | adeC | arp/erpD | bptA | cpsZ | dbpABc | fnb | plzA | ospABc | ospCc | ospD | pncA | res-mod1 | res-mod2 | sagABC DE | vlsE-liked | vslE & | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PFamb | 61 | 163 | 99 | 92 | 74 | none | 180 | 53 | none | none | none | 1 | 102–167 | none | 170 | 170 | |
|
| B31 | lp36 | lp28–1 | lp25 | lp28–3 | lp54 | lp36 | – | lp54 | cp26 | lp38 | lp25 | lp25, | lp56 | – | – | lp28–1 |
|
| DN127 | lp28–7 | – | lp25 | lp28–3 | lp54 | lp28–7 | – | lp54 | cp26 | – | lp25 | lp25, | – | – | – | – |
|
| SV1 | – | – | lp28–2 | – | lp54h | – | lp32–6f | lp54h | cp26 | – | lp28–2 | lp32–12f | – | – | – | lp32–6 |
|
| MN14–1420 | lp36 | – | lp25 | – | lp54 | – | – | lp54 | cp26 | lp28–3 | lp25 | lp25, | – | – | – | lp28–8e |
| MN14–1539 | lp36 | – | lp25 | – | lp54 | – | – | lp54 | cp26 | lp28–3 | lp25 | lp25, | – | – | – | lp28–8e | |
|
| PKo | lp38 | – | lp28–2 | – | lp54 | lp17 | – | lp54 | cp26 | lp32–10 f | lp28–2 | lp28–3, | lp32–10 | lp28–8 | – | lp28–8 |
| K78 | lp38 | – | lp28–2 | – | lp54 | lp17 | – | lp54 | cp26 | – | lp28–2 | lp28–3, | – | lp28–8 | – | lp28–8 | |
| ACA-1 | lp38 | – | lp28–2 | – | lp54 | lp17 | – | lp54 | cp26 | lp32–10f | lp28–2 | lp28–3, | lp32–10 | lp28–8?j | – | lp28–8?j | |
| BO23l | lp38 i | ND | ND | ND | lp54 | lp17 | ND | lp54h | cp26 | ND | ND | lp28–4 | ND | lp28–8 | ND | lp28–8 | |
|
| PBr | lp25 | – | lp25 | lp28–4 | lp54 | lp17 | lp28–9 | lp54 | cp26 | cp32–10k | lp25 | lp28–3, | lp28–9 f | – | lp28–4, | lp28–3 |
| Far04 | lp25 | – | lp25 | – | lp54 | lp17 | – | lp54 | cp26 | lp32–10 | lp25 | lp36 | lp28–9 f | – | – | lp28–9 | |
| 20047l | ND | ND | ND | ND | lp54 | lp17 | ND | lp54 | cp26 | ND | ND | lp36 | ND | ND | lp36 j | ND | |
|
| A14S | lp36 | – | lp38 | lp28–4, | lp54 | lp17 | – | lp54 | cp26 | – | lp38 | lp28–3 | – | lp28–8 | – | lp28–8 |
|
| VS116 | right | – | lp25j | lp36 f | lp54 | – | – | lp54 | cp26 | lp17 | lp25 | lp28–3f | cp32–7k | lp28–8 | lp28–3 | lp28–8 |
|
| HO14l | ND | ND | lp25 | ND | lp54 | ND | NC | lp54 | cp26 | ND | lp25 | ND | ND | lp28–8 | ND | ND |
aThe linear plasmids that carry selected genes are indicated in the table; only apparently intact genes are indicated; hyphens (−) indicate a plasmid is absent from that completely sequenced genome. Gene names are as follows: adeC, adenine deaminase [35]; arp, arthritis related protein [134]; cpsZ, complement regulatory protein Z [86]; dbpAB, decorin binding proteins A and B [135]; fnb, fibronectin protein [136–139]; plzB, cyclic di-GMP binding protein [140, 141]; ospAB, outer surface proteins A and B [142–144]; ospC, outer surface protein C [103, 145], ospD, outer surface protein D [85, 146]; pncA, nicotinamidase [73]; res-mod1, restriction-modification protein PFam01 [147, 148]; res-mod2, restriction-modification protein PFam102/167 [148]; sagABCDE, synthesis of streptolysin S-like toxin [78]; vlsE-like, putative lipoproteins that are related VlsE but are encoded at a location not adjacent to vls cassette array [19]; vlsE and vls, variable outer surface protein cassettes and expression locus [149]
bBorreliella paralogous protein family (see text)
cThe gene contents of lp54 and cp26 gene content is very constant so only a few examples of their many important genes are shown
dvlsE homologue not adjacent to vls cassette region
eNamed lp28–10 in the literature but we include them as lp28–8 type plasmids (see text)
fA relatively large, apparently truncated gene fragment whose functionality is unknown; there is no intact adeC gene in the VS116 genome
gGene is known to have been originally present in this isolate, but its plasmid was lost from the sequenced culture [69]
hlp54 is fused to another plasmid (see text); this gene is in the lp54 portion
ilp38 is fused to lp54; this gene is in the lp38 portion
jTwo genes of this sort are present on this plasmid
kUnusual circular plasmid location; in B31 lp38-like indel in PBr cp32–10 and B31 lp56-like indel in VS116 cp32–7; see Additional file 1: Figure S8
lPlasmid sequences are not complete for this isolate. ND indicates not determined; since all plasmids in this strain have not been sequenced the presence of this gene is not known
Fig. 6Plasmid super-groups revealed by plasmid partition gene clusters. The midpoint-rooted gene tree (center) is based on a 1327 amino acid long concatenated alignment of sequences of four plasmid partition proteins (including PFam57/62, Pfam50, PFam32, and Pfam49). This tree shows a monophyly of all cp32 + cp9 plasmids (left-side, shades of yellow in outer ring). Other monophyletic plasmid super-groups separated by similarly deep branches include (counter-clockwise from bottom) lp28–4/lp28–2, cp26, lp54, lp25, lp28–8, lp28–5, lp28–3/lp28–6/lp36/lp38/lp56, lp28–7, and lp28–1/lp28–9. Plasmids not shown in the figure (lp5, lp17 and lp21) do not have a full set of all four partition genes. A web-interactive version of the tree is available at BorreliaBase website (under the “Replicons” tab)