| Literature DB >> 25370836 |
Laura Gomez-Valero, Christophe Rusniok, Monica Rolando, Mario Neou, Delphine Dervins-Ravault, Jasmin Demirtas, Zoe Rouy, Robert J Moore, Honglei Chen, Nicola K Petty, Sophie Jarraud, Jerome Etienne, Michael Steinert, Klaus Heuner, Simonetta Gribaldo, Claudine Médigue, Gernot Glöckner, Elizabeth L Hartland, Carmen Buchrieser.
Abstract
BACKGROUND: The genus Legionella comprises over 60 species. However, L. pneumophila and L. longbeachae alone cause over 95% of Legionnaires’ disease. To identify the genetic bases underlying the different capacities to cause disease we sequenced and compared the genomes of L. micdadei, L. hackeliae and L. fallonii (LLAP10), which are all rarely isolated from humans.Entities:
Mesh:
Year: 2014 PMID: 25370836 PMCID: PMC4256840 DOI: 10.1186/PREACCEPT-1086350395137407
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Intracellular replication of , and (LLAP10). (A) THP-1 derived macrophages at 37°C. (B) A. castellanii culture at 20°C. (C) A. castellanii plate test at 37°C and 30°C. L. pneumophila strain Paris wild type (wt) and ∆dotA were used as positive and negative controls, respectively. Intracellular replication for each strain was determined by recording the number of colony-forming units (CFU) through plating on BCYE agar. Blue, L. pneumophila strain Paris; red, ∆dotA; orange, L. micdadei; violet, L. hackeliae; green, L. fallonii (LLAP10). Results are expressed as Log10 ratio CFU Tn/T0 and each point represents the mean ± standard deviation of two or three independent experiments. The error bars represent the standard deviation, but some were too small to clearly appear in the figure.
General features of the , and genomes compared with and
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|
|
|
|
|
|
| ||
| Genome size (Mb) | 3.5 | 4.1 | 4.2 | 3.3 | 3.4 | |
| G + C content (%) | 38 | 37 | 38 | 40 | 39 | |
| Genes | 3,178 | 3,660 | 3,673 | 3,076 | 3,185 | |
| Protein coding genes | 3,079 | 3,571 | 3,601 | 3,009 | 3,103 | |
| Pseudogenes | 45 | 23 | 7 | 11 | 20 | |
| tRNA | 43 | 46 | 46 | 43 | 43 | |
| 16S/23S/5S | 3/3/3 | 4/4/4 | 4/4/4 | 3/3/3 | 4/4/4 | |
| Coding density (%) | 87 | 84 | 87 | 90 | 89 | |
| Plasmids | 1 | 1 | 2 | 0 | 1 | |
| Plasmid | Plasmid | Plasmid | Plasmid | |||
| A | B | |||||
| Size (kb) | 131.8 | 71.8 | 238.8 | 14.6 | 129.9 | |
| G + C content (%) | 37 | 38 | 39 | 36 | 38 | |
| Genes | 142 | 75 | 248 | 15 | 141 | |
| Protein coding genes | 140 | 75 | 247 | 15 | 141 | |
| Pseudogenes | 2 | 0 | 1 | 0 | 0 | |
| Coding density (%) | 91 | 86 | 89 | 85 | 91 | |
Figure 2Shared and specific content of the different species/strains analyzed in this study. Each petal and color represents one genome. The number in the center of the diagram represents the orthologous genes shared by all the genomes. The number inside of each individual petal corresponds to the specific genes of each genome with non-orthologous genes in any of the other genomes. (A) Core genome of five Legionella species including seven L. pneumophila genomes. (B) Core genome when one representative of each Legionella species is taken into account.
Figure 3Phylogenetic tree of six species and seven strains and their shared Dot/Icm substrates. Neighbor-joining tree based on the concatenation of 816 protein-coding genes from 11 Legionella genomes. C. burnetii was used as out-group. The tree was constructed using MEGA and JTT as model of evolution. The values above nodes indicate the bootstrap values. The values in blue circles represent the number of Dot/Icm substrates shared by the species in the corresponding cluster, suggesting that they were present in the common ancestor. The values inside blues squares are the number of Dot/Icm substrates shared between L. pneumophila strains and the remaining species (for example, the species L. micdadei and L. pneumophila share 33 Dot/Icm substrates).
Figure 4synthesizes cellulose. (A) Genomic organization and Blastx comparison of the regions encoding the cellulose synthesis machinery in E. coli, L. fallonii, L. dumofii and L. anisa. The gray color code represents the Blast matches; the darker the gray the better the blast match. (B) Growth of L. fallonii on calcofluor agar plates that shows cellulose synthesis as visualized under long-wave UV light. L. fallonii is fluorescent due to the binding of calcofluor to cellulose. In contrast L. pneumophila that was used as negative control is not.
Figure 5The and genomes contain specific flagellar-encoding regions Genomic organization and Blastx comparison of the specific flagellar gene clusters in L. micdadei and L. fallonii. The gray color code represents the Blast matches; the darker the gray the better the blast match. Pink arrows point to tRNA genes. Protein names and their predicted function in L. micdadei are indicated below.
Distribution of type IV secretion systems in the analyzed genomes
|
|
|
|
| |||
|---|---|---|---|---|---|---|
|
|
|
|
|
| ||
|
| Paris | 1 | - | - | 1 | 2 |
| Lens | 1 | - | - | 1 | 1 | |
| Philadelphia | 1 | - | 1 | 1 | 1 | |
| Corby | 1 | 2 | - | - | 2 | |
| Lorraine | 1 | 1 | - | - | 1 | |
| HL06041035 | 1 | - | - | 1 | 2 | |
| 130b | 1 | 1 | 2 | 2 | ||
|
| NSW-150 | 1 | 1 | 1 | - | - |
|
| D-4968 | 1 | - | 1 | 1 | 1 |
|
| ATCC35250 | 1 | - | - | - | 1 |
|
| ATCC33218 | 1 | - | - | 1 | 1 |
|
| 02/42 | 1 | 1 | - | 1 | - |
|
| ATCC700992 | 1 | - | 1 | - | 1 |
|
| ||||||
|
| Paris | - | - | 1 | - | - |
| Lens | - | - | 1 | - | - | |
| Lorraine | - | - | 1 | - | - | |
|
| - | - | 1 | - | - | |
|
| - | - | 2 | - | - | |
Figure 6Genome comparison of two strains The complete genome sequences of the two L. micdadei strains included in this study were aligned using the software Mauve. The two strains align perfectly with the exception of three mobile genetic elements specifically present in strain L. micdadei ATCC33218 and one specifically present in the Victorian isolate. The specific regions of each genome are indicated. The 'Lvh region' is indicated, as this region is, with a high number of SNPs, quite divergent between the two isolates.
Core of substrates of the Dot/Icm secretion system present in 11 genomes
|
|
|
|
| |||
|---|---|---|---|---|---|---|
|
|
|
|
| |||
|
| Protein of unknown function [ribosome associated protein] |
|
|
|
| |
|
| N-terminal acetyltransferase, GNAT family |
|
|
|
|
|
|
| Conserved protein of unknown function |
|
|
|
|
|
|
| Membrane protein of unknown function |
|
|
|
|
|
|
| Protein of unknown function |
|
|
|
|
|
|
| Protein of unknown function |
|
|
|
| |
|
| Protein of unknown function [ankyrin]c |
|
|
|
|
|
|
| Putative Beta-lactamase [Sel1-domain]c[TPR repeat]c |
|
|
|
|
|
|
| Protein of unknown function |
|
|
|
| |
|
| Heparan-alpha glucosaminide N-acetyltransferased |
| 57 | 47 | 48 | |
|
| Protein of unknown function |
|
| 44 |
| 58 |
|
| Protein of unknown function |
|
|
|
|
|
|
| Membrane protein of unknown function [hydrolases] |
|
| 44 | 49 | |
|
| Protein of unknown function [ankyrin]c |
|
|
|
|
|
|
| Protein of unknown function [ankyrin] |
|
|
|
| 42 |
|
| Protein of unknown function |
|
|
|
| 33 |
|
| Protein of unknown function [putative GatB/Yqey domain]c |
|
| 73 | 72 | |
|
| Acid sphingomyelinase-like phosphodiesterase |
|
|
| 54 | 33 |
|
| Protein of unknown function |
|
|
|
|
|
|
| Membrane protein of unknown function |
|
|
|
|
|
|
| Protein of unknown function |
|
|
|
| |
|
| Membrane protein of unknown function [coiled-coil]c |
|
|
|
| |
|
| Ribosomal RNA small subunit methyltransferase E |
|
|
|
| |
|
| Protein of unknown function |
|
|
|
|
|
aAmino acid identity with respect to L. pneumophila strain Philadelphia. b ravC has been erroneously named as lpg0170 in previous publications. cThe motif is present in all orthologues. dHigh prediction for enzymatic activity in all the corresponding orthologues (prediction based on PRIAM EC number). Bold numbers indicate synteny between L. pneumophila genes and the corresponding orthologous gene.
Genes encoding proteins containing eukaryotic motifs not previously described
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
|
|
|
|
| Cytosolic IMP-GMP specific 5'-nucleotidase | HAD-superfamily hydrolase, 5'-nucleotidase |
|
|
| Protein of unknown function | DM9 repeat | |||
|
| Apyrase | Nucleoside phosphatase | ||||
|
|
| Protein of unknown function | Thaumatin | |||
|
|
| Protein of unknown function | RhoGTPase | |||
|
|
|
|
|
| Apyrase | Nucleoside phosphatase |
|
| Protein of unknown function | DM9 repeat | ||||
|
|
| Protein of unknown function | DM9 repeat | |||
|
| Protein of unknown function | Synaptobrevin/V-snare | ||||
|
| Protein of unknown function | Synaptobrevin/V-snare | ||||
|
| Protein of unknown function | Thaumatin | ||||
|
| Protein of unknown function | Thaumatin | ||||
|
| Protein of unknown function | Thaumatin | ||||
|
|
|
| Protein of unknown function | DH/RhoGEF | ||
|
|
| Putative serine carboxypeptidase | Peptidase S28 | |||
|
| Protein of unknown function | Mitochondrial substrate/solute carrier repeat | ||||
|
| Protein of unknown function [ARM repeat] | Clathrin/coatomer_adaptine_like | ||||
|
| Protein of unknown function [HEAT-REPEAT DOMAIN] [F-box domain] | Clathrin/coatomer_adaptine_like | ||||
|
| Protein of unknown function [coiled-coil] | RhoGTPase | ||||
|
|
| Protein of unknown function | T-complex protein 10 | |||
|
|
| Protein of unknown function | Peptidase C65 |
aGene encoding a protein that has been demonstrated to be secreted by the Dot/Icm secretion system. bOrthologous proteins in which the corresponding motif is not present.
Genes horizontally transferred from eukaryotic genomes to genomes
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
|
|
|
|
| cytosolic IMP-GMP specific 5'-nucleotidase | |
|
|
|
|
| glucoamylase | ||
|
| ecto-ATP diphosphohydrolase II | |||||
|
|
|
| map Major acid phosphatase Map (histidine-acid phosphatase) | |||
|
| pyruvate decarboxylase | |||||
|
|
|
| thi thiamine biosynthesis protein NMT-1 | |||
|
|
|
|
|
| phosphoribosylamidoimidazole-succinocarboxamide synthase | |
|
|
|
|
|
| putative Apyrase | |
|
|
|
| RalF, translocated into host cells by the Dot/Icm system | |||
|
|
|
| sphingosine-1-phosphate lyase I | |||
|
|
| protein kinase-like | ||||
|
|
|
|
| cysteine synthase A, O-acetylserine sulfhydrolase A subunit | ||
|
|
|
|
|
| conserved protein of unknown function [FADPNR domain] | |
|
|
|
|
|
|
| cysteine synthase A, O-acetylserine sulfhydrolase A subunit |
| protein of unknown function [SNARE domain] | ||||||
|
|
| protein of unknown function [ank domain] [Ras domain] [coiled-coil domain] | ||||
|
|
| putative Dipeptidyl-peptidase II | ||||
|
| conserved protein of unknown function [Leucine-rich repeat domain] | |||||
|
|
|
|
| DWF 7-dehydrocholesterol reductase | ||
|
| putative Sphingomyelin phosphodiesterase | |||||
|
|
|
|
| protein of unknown function [Tubulin-tyrosine ligase domain) | ||
|
| GTP-binding protein ypt1 [Ras small GTPase, Rab type] | |||||
|
| protein of unknown function [Mitochondrial substrate carrier] | |||||
|
|
| phosphoprotein phosphatase | ||||
|
|
| protein of unknown function [Ras small GTPase, Rab type] | ||||
|
| protein of unknown function [RhoGAP motif] [coiled-coil] | |||||
|
| protein of unknown function [cAMP-binding domain-like] | |||||
|
| protein of unknown function [Ras GTPase domain] | |||||
|
| putative Adenosine deaminase | |||||
|
| phosphoprotein phosphatase | |||||
|
| protein of unknown function [Synaptobrevin] | |||||
|
| protein of unknown function | |||||
|
| membrane protein of unknown function [Ras small GTPase, Rab type] | |||||
|
| protein of unknown function [Cytochrome P450 domain] | |||||
|
| membrane protein of unknown function [Ras small GTPase, Rab type] | |||||
|
| endo-1,3(4)-beta-glucanase | |||||
|
| protein of unknown function [H3 methylation by DOT1] | |||||
|
| membrane protein of unknown function [Ras small GTPase, Rab type] | |||||
|
| membrane protein of unknown function [Ras small GTPase, Rab type] | |||||
|
| protein of unknown function [SecA DEAD-like] |
aGene encoding a protein that has been demonstrated to be secreted by the Dot/Icm secretion system.
Figure 7Phylogenetic analysis shows the eukaryotic origin of the carboxypeptidase S28 family protein (Llo0042/Lfa0022). The species belonging to bacteria and eukaryotes are shown in red and green, respectively. Numbers next to tree nodes correspond to bootstrap values. The bar at the bottom represents the estimated evolutionary distance.