| Literature DB >> 23145189 |
Jessica N Ricaldi1, Derrick E Fouts, Jeremy D Selengut, Derek M Harkins, Kailash P Patra, Angelo Moreno, Jason S Lehmann, Janaki Purushe, Ravi Sanka, Michael Torres, Nicholas J Webster, Joseph M Vinetz, Michael A Matthias.
Abstract
The whole genome analysis of two strains of the first intermediately pathogenic leptospiral species to be sequenced (Leptospira licerasiae strains VAR010 and MMD0835) provides insight into their pathogenic potential and deepens our understanding of leptospiral evolution. Comparative analysis of eight leptospiral genomes shows the existence of a core leptospiral genome comprising 1547 genes and 452 conserved genes restricted to infectious species (including L. licerasiae) that are likely to be pathogenicity-related. Comparisons of the functional content of the genomes suggests that L. licerasiae retains several proteins related to nitrogen, amino acid and carbohydrate metabolism which might help to explain why these Leptospira grow well in artificial media compared with pathogenic species. L. licerasiae strains VAR010(T) and MMD0835 possess two prophage elements. While one element is circular and shares homology with LE1 of L. biflexa, the second is cryptic and homologous to a previously identified but unnamed region in L. interrogans serovars Copenhageni and Lai. We also report a unique O-antigen locus in L. licerasiae comprised of a 6-gene cluster that is unexpectedly short compared with L. interrogans in which analogous regions may include >90 such genes. Sequence homology searches suggest that these genes were acquired by lateral gene transfer (LGT). Furthermore, seven putative genomic islands ranging in size from 5 to 36 kb are present also suggestive of antecedent LGT. How Leptospira become naturally competent remains to be determined, but considering the phylogenetic origins of the genes comprising the O-antigen cluster and other putative laterally transferred genes, L. licerasiae must be able to exchange genetic material with non-invasive environmental bacteria. The data presented here demonstrate that L. licerasiae is genetically more closely related to pathogenic than to saprophytic Leptospira and provide insight into the genomic bases for its infectiousness and its unique antigenic characteristics.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23145189 PMCID: PMC3493377 DOI: 10.1371/journal.pntd.0001853
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Salient features of the unfinished L. licerasiae genomes.
|
|
|
|
|
|
|
|
| |
| Size (Mbp) | ∼4.2 | ∼4.2 | ∼4.6 | ∼4.7 | ∼3.9 | ∼3.9 | ∼4.0 | ∼4.0 |
| G+C (%) | 41.6 | 41.1 | 35.1 | 36.0 | 40.2 | 40.2 | 38.6 | 38.9 |
| CDS | ||||||||
| Hypothetical proteins | 1302 | 1600 | 1833 | 1579 | 963 | 949 | 1221 | 1532 |
| Proteins with functional assignments | 2629 | 2285 | 1834 | 2123 | 1982 | 1931 | 2379 | 2194 |
| Total | 3931 | 3885 | 3667 | 3702 | 2945 | 2880 | 3600 | 3726 |
| tRNA genes | 37 | 37 | 37 | 37 | 37 | 37 | 35 | 35 |
| Riboswitches and | ||||||||
| Cobalamin | 2 | 2 | 2 | 2 | 2 | 2 | NP | NP |
| TPP | Y | Y | Y | Y | Y | Y | Y | Y |
|
| Y [ | Y [ | NP | NP | NP | NP | NP | NP |
| CRISPR's (# repeats) | NP | NP | Y(4) | Y(18) | NP | NP | NP | NP |
| rRNA | ||||||||
| 23S | 1 | 1 | 2 | 2 | 2 | 2 | 2 | 3 |
| 16S | 1 | 1 | 2 | 2 | 2 | 2 | 2 | 2 |
| 5S | 2 | 2 | 1 | 1 | 1 | 1 | 2 | 2 |
| N50/kb | 522.9 | 463.5 | NA | NA | NA | NA | NA | NA |
| Species-specific genes | 1211 | 649 | 225 | 1553 | ||||
Draft Genome.
Genome version as of July 01, 2011—includes all replicons and plasmids.
N50 = length-weighted mean: the size of the smallest contig such that 50% of the genome is contained in contigs of size N50 or greater.
NP – Not Present.
Y – Yes.
NA – Not Applicable.
Figure 1Comparison of LE1-like prophage regions in L. licerasiae.
(A) Linear representations of CDSs found in each L. licerasiae genome with similarity to bacteriophage LE1 and to non-prophage regions of L. interrogans and L. borgpetersenii encoding efflux pumps. CDSs are labeled by locus identifier and colored by functional role categories as noted in the boxed key. BLASTP matches between CDSs are colored by protein percent identity (see key). (B) Circular depiction of vB-LliZ_VAR010-LE1.
Figure 2Comparison of prophage regions in L. licerasiae similar to a known cryptic prophage in expressed in L. interrogans sv. Lai.
Depicted are linear representations of CDSs found in each genome with similarity to the cryptic prophage identified by Qin et al. in L. interrogans sv. Lai [64]. CDSs are labeled by locus identifier and are colored by functional role categories as noted in the boxed key. BLASTP matches between CDSs are colored by protein percent identity (see key). CDSs colored blue in L. borgpetersenii and L. interrogans genomes denote CDSs flanking the regions identified by Qin et al. (yellow highlighted box).
Figure 3Venn diagram showing the distribution of 3745 L. licerasiae orthologs by leptospiral species.
Orthologs were predicted using QuartetS [45] run with default parameters. Only proteins present in both strains of a given species are shown.
Figure 4Whole-genome comparison of the pathogenic, intermediately pathogenic and saprophytic species of Leptospira.
Line figures depict the results of PROmer analysis. Colored lines denote percent identity of protein translations (see key) and are plotted according to the location in the leptospiral reference genomes (x-axis) and the query genome L. licerasiae strain VAR10 (y-axis). Chromosome I results are in the left column while chromosome II comparisons are in the right column.
Distribution of subsystems in Leptospira. Predicted by RAST server.
| Subsystem | VAR010 | MMD0835 | L1–130 | 56601 | L550 | JB197 | Paris | Ames |
| Cofactor, Vitamins, Prosthetic Groups, Pigments | 156 | 155 | 202 | 201 | 137 | 138 | 141 | 141 |
| Cell Wall and Capsule | 71 | 71 | 100 | 110 | 94 | 94 | 79 | 79 |
| Virulence, Disease and Defense | 55 | 55 | 40 | 37 | 34 | 33 | 62 | 61 |
| Potassium Metabolism | 13 | 13 | 18 | 18 | 9 | 9 | 0 | 0 |
| Miscellaneous | 99 | 99 | 145 | 143 | 95 | 94 | 96 | 96 |
| Membrane Transport | 25 | 26 | 19 | 24 | 11 | 10 | 14 | 28 |
| Iron acquisition and metabolism | 8 | 8 | 2 | 2 | 1 | 1 | 6 | 6 |
| RNA metabolism | 107 | 96 | 121 | 116 | 95 | 92 | 105 | 105 |
| Nucleotides and Nucleosides | 50 | 50 | 55 | 55 | 49 | 48 | 51 | 51 |
| Protein Metabolism | 185 | 185 | 184 | 182 | 204 | 188 | 154 | 182 |
| Cell Division and Cell Cycle | 22 | 20 | 24 | 23 | 20 | 20 | 21 | 21 |
| Motility and Chemotaxis | 82 | 82 | 75 | 78 | 81 | 81 | 106 | 104 |
| Regulation and Cell Signaling | 43 | 40 | 17 | 51 | 20 | 19 | 61 | 64 |
| Secondary Metabolism | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 |
| DNA Metabolism | 68 | 77 | 63 | 59 | 50 | 49 | 73 | 73 |
| Fatty Acids, Lipids and Isoprenoids | 73 | 72 | 77 | 78 | 58 | 45 | 63 | 46 |
| Nitrogen Metabolism |
|
| 4 | 5 | 5 | 4 |
|
|
| Dormancy and Sporulation | 3 | 3 | 3 | 3 | 3 | 3 | 4 | 3 |
| Respiration | 64 | 64 | 51 | 52 | 50 | 50 | 64 | 64 |
| Stress Response | 75 | 75 | 60 | 62 | 57 | 55 | 86 | 86 |
| Metabolism of Aromatic Compounds | 3 | 3 | 1 | 1 | 1 | 1 | 1 | 1 |
| Amino Acids and Derivatives |
|
| 180 | 176 | 172 | 171 |
|
|
| Sulfur Metabolism | 8 | 20 | 19 | 18 | 17 | 17 | 12 | 12 |
| Phosphorous Metabolism |
|
| 14 | 13 | 11 | 12 |
|
|
| Carbohydrates |
|
| 102 | 107 | 130 | 131 |
|
|
Figure 5Structure of Leptospira O-antigen regions.
The O-antigen region and flanking CDSs of L. licerasiae strain VAR10 are compared to regions of homology in L. interrogans Copenhageni (A). Likewise, the O-antigen and flanking CDSs of L. interrogans Copenhageni are compared to the homologous region in L. licerasiae strain VAR10 (B). Yellow shaded boxes mark the locations of the O-antigen regions. CDSs are labeled by locus identifier and colored by functional role categories as noted in the boxed key. Gene symbols, when present, are noted above their respective genes in bold italics. BLASTP matches between CDSs are colored by protein percent identity (see key).
The top 10 BLAST hits from the six genes of the proposed L. licerasiae O-antigen locus identify genes from sixty unique genomes.
|
| LEP1GSC185_2122 | LEP1GSC185_2123 | LEP1GSC185_2124 | LEP1GSC185_2125 | LEP1GSC185_2126 | LEP1GSC185_2127 |
| Annotation | glycosyltransferase group 2 family protein | glycosyltransferase group 1 family protein | O-antigen transporter (flippase), wzx | polysaccharide pyruvoyl transferase domain protein | glycosyltransferase group 1 family protein | O-antigen acetylase |
|
| LlicsVM_010100001060 ZP_09256931 | LlicsVM_010100001055 ZP_09256930 | LlicsVM_010100001050 ZP_09256929 | LlicsVM_010100001045 ZP_09256928 | LlicsVM_010100001040 ZP_09256927 | LlicsVM_010100001035 ZP_09256926 |
| Best BLAST hits |
|
| Flavobacteriaceae bacterium HMQ9 (ZP_09314923) E = 6e-66 |
|
|
|
| Nocardioidaceae bacterium Broad-1 (EGD43512) E = 6e-96 |
|
| Flavobacteriaceae bacterium S85 (ZP_09498960) E = 8e-54 |
|
| |
| marine actinobacterium PHSC20C1 (EAR24699) E = 2e-95 | Rhodobacteraceae bacterium KLH11 (EEE35994) E = 2e-65 |
|
|
|
| |
|
|
|
|
| uncultured methanogenic archaeon RC-I (CAJ36730) E = 2e-04 |
| |
| candidate division TM7 genomospecies GTL1 (EDK72664) E = 7e-92 |
|
|
|
|
| |
|
|
|
|
|
| marine gamma proteobacterium HTCC2148 (EEB79327) E = 2e-135 | |
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
| Verrucomicrobiae bacterium DG1235 (EDY81368) E = 9e-91 |
| butyrate-producing bacterium SS3/4 (CBL39981) E = 5e-51 |
|
|
| |
|
|
|
|
|
|
|
Figure 6Silver-stained polyacrylamide gel of purified L. licerasiae str. VAR10 LPS.
LPS was purified by the hot-phenol water method. E. coli LPS (Lane 1), 1/32 Dilution of VAR010 Phenol phase LPS (Lane 2), 1/16 Dilution of VAR010 Phenol phase LPS (Lane 3), 1/8 Dilution of VAR010 Phenol phase LPS (Lane 4), 1/4 Dilution of VAR010 Phenol phase LPS (Lane 5), 1/2 Dilution of VAR010 Phenol phase LPS (Lane 6), Undiluted VAR010 Phenol phase LPS (Lane 7), Molecular weight marker (M).
Location of Putative Genomic Islands (GIs) in L. licerasiae VAR010.
| Contig | Start | End | Length (bp) | # protein coding genes | TAS (# associated) | Protein ID | |
| First | Last | ||||||
| AHOO02000005 | 172542 | 208590 | 36048 | 46 | No | LEP1GSC185_0932 | LEP1GSC185_0976 |
| AHOO02000009 | 303086 | 331349 | 28263 | 30 | Yes (3) | LEP1GSC185_3852 | LEP1GSC185_3880 |
| AHOO02000013 | 449819 | 471528 | 21709 | 23 | No | LEP1GSC185_2782 | LEP1GSC185_2803 |
| AHOO02000009 | 620 | 8558 | 7938 | 15 | Yes (7) | LEP1GSC185_3556 | LEP1GSC185_3570 |
| AHOO02000005 | 628935 | 635409 | 6474 | 6 | No | LEP1GSC185_1388 | LEP1GSC185_1392 |
| AHOO02000014 | 64218 | 70265 | 6047 | 9 | No | LEP1GSC185_0603 | LEP1GSC185_0612 |
| AHOO02000008 | 41316 | 46316 | 5000 | 7 | No | LEP1GSC185_3455 | LEP1GSC185_3460 |
Putative type II toxin-antitoxin system.
Qin et. Lysogen.
Putative Type II toxin-antitoxin systems (TAS's) in L. licerasiae VAR010.
| Locus | Toxin Family | Annotation | Antitoxin Family | Annotation | |
| LEP1GSC185_0180 | Aha1 | conserved hypothetical protein | LEP1GSC185_0181 | ArsR | toxin-antitoxin system, antitoxin component, ArsR family |
| LEP1GSC185_0262 | Aha1 | conserved hypothetical protein | LEP1GSC185_0263 | ArsR | transcriptional regulator, ArsR family |
| LEP1GSC185_0307 | PIN | PIN domain protein | LEP1GSC185_0308 | Phd/YefM | prevent-host-death family protein |
| LEP1GSC185_0418 | PIN | PIN domain protein | LEP1GSC185_0416 | Phd/YefM | prevent-host-death family protein |
| LEP1GSC185_0630 | PIN | toxin-antitoxin system, toxin component, PIN family | LEP1GSC185_0631 | Phd/YefM | antitoxin Phd/YefM, type II toxin-antitoxin system |
| LEP1GSC185_1922 | PIN | PIN domain protein | LEP1GSC185_1923 | DUF2191 | PF09957 family protein |
| LEP1GSC185_2251 | PIN | PIN domain protein | LEP1GSC185_2250 | Phd/YefM | conserved hypothetical protein |
| LEP1GSC185_2325 | Aha1 | conserved hypothetical protein | LEP1GSC185_2324 | HTH | DNA-binding helix-turn-helix protein |
| LEP1GSC185_2580 | PIN | PIN domain protein | LEP1GSC185_2582 | RHH | toxin-antitoxin system, antitoxin component, ribbon-helix-helix domain protein |
| LEP1GSC185_2709 | Aha1 | conserved hypothetical protein | LEP1GSC185_2710 | ArsR | MarR family protein |
| LEP1GSC185_3193 | PIN | putative toxin-antitoxin system toxin component, PIN family | LEP1GSC185_3194 | RHH | putative toxin-antitoxin system, antitoxin component, ribbon-helix-helix domain protein |
| LEP1GSC185_3380 | Aha1 | conserved hypothetical protein | LEP1GSC185_3379 | ArsR | DNA-binding helix-turn-helix protein |
| LEP1GSC185_3451 | COG2856 | PF06114 domain protein | LEP1GSC185_3450 | XRE | conserved hypothetical protein |
| LEP1GSC185_3530 | PIN | toxin-antitoxin system toxin component, PIN family | LEP1GSC185_3529 | RHH | ribbon-helix-helix protein, CopG family |
| LEP1GSC185_3543 | ChpK | PemK-like protein | LEP1GSC185_3542 | ChpI | ribbon-helix-helix protein, CopG family |
| LEP1GSC185_3550 | PIN | PIN domain protein | LEP1GSC185_3549 | MazE | antidote-toxin recognition MazE |
| LEP1GSC185_3553 | PIN | toxin-antitoxin system, toxin component, PIN family | LEP1GSC185_3552 | COG2442 | putative toxin-antitoxin system, antitoxin component |
| LEP1GSC185_3555 | MazF | putative toxin-antitoxin system, toxin component, MazF family | LEP1GSC185_3554 | RHH | ribbon-helix-helix protein, CopG family |
| LEP1GSC185_3557 | PIN | PIN domain protein | LEP1GSC185_3556 | RHH | toxin-antitoxin system, antitoxin component, ribbon-helix-helix domain protein |
| LEP1GSC185_3559 | PIN | toxin-antitoxin system toxin component, PIN family | LEP1GSC185_3558 | RHH | ribbon-helix-helix protein, CopG family |
| LEP1GSC185_3561 | PIN | PIN domain protein | LEP1GSC185_3560 | AbrB | transcriptional regulator, AbrB family |
| LEP1GSC185_3562 | UNK | putative toxin-antitoxin system, toxin component | LEP1GSC185_3563 | RHH | toxin-antitoxin system, antitoxin component, ribbon-helix-helix domain protein |
| LEP1GSC185_3564 | COG2929 | PF04365 family protein | LEP1GSC185_3565 | RHH | toxin-antitoxin system, antitoxin component, ribbon-helix-helix domain protein |
| LEP1GSC185_3566 | PIN | PIN domain protein | NA | UNK | hypothetical protein |
| LEP1GSC185_3584 | Aha1 | conserved hypothetical protein | LEP1GSC185_3585 | ArsR | MarR family protein |
| NA | DUF497 | hypothetical protein LlicsVM_07915 | LEP1GSC185_3866 | RHH | toxin-antitoxin system, antitoxin component, ribbon-helix-helix domain protein |
| NA | HigB | plasmid maintenance killer protein | LEP1GSC185_3875 | HigA | addiction module antidote protein HigA (higA) |
| LEP1GSC185_3880 | PIN | toxin-antitoxin system toxin component, PIN family | LEP1GSC185_3879 | RHH | ribbon-helix-helix protein, CopG family |
Manually annotated—missed by automated gene caller.