Literature DB >> 22059087

Comparative and functional genomics of legionella identified eukaryotic like proteins as key players in host-pathogen interactions.

Laura Gomez-Valero1, Christophe Rusniok, Christel Cazalet, Carmen Buchrieser.   

Abstract

Although best known for its ability to cause severe pneumonia in people whose immune defenses are weakened, Legionella pneumophila and Legionella longbeachae are two species of a large genus of bacteria that are ubiquitous in nature, where they parasitize protozoa. Adaptation to the host environment and exploitation of host cell functions are critical for the success of these intracellular pathogens. The establishment and publication of the complete genome sequences of L. pneumophila and L. longbeachae isolates paved the way for major breakthroughs in understanding the biology of these organisms. In this review we present the knowledge gained from the analyses and comparison of the complete genome sequences of different L. pneumophila and L. longbeachae strains. Emphasis is given on putative virulence and Legionella life cycle related functions, such as the identification of an extended array of eukaryotic like proteins, many of which have been shown to modulate host cell functions to the pathogen's advantage. Surprisingly, many of the eukaryotic domain proteins identified in L. pneumophila as well as many substrates of the Dot/Icm type IV secretion system essential for intracellular replication are different between these two species, although they cause the same disease. Finally, evolutionary aspects regarding the eukaryotic like proteins in Legionella are discussed.

Entities:  

Keywords:  Legionella longbeachae; Legionella pneumophila; comparative genomics; eukaryotic like proteins; evolution; virulence

Year:  2011        PMID: 22059087      PMCID: PMC3203374          DOI: 10.3389/fmicb.2011.00208

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


Introduction

Genomics has the potential to provide an in depth understanding of the genetics, biochemistry, physiology, and pathogenesis of a microorganism. Furthermore comparative genomics, functional genomics, and related technologies, are helping to unravel the molecular basis of the pathogenesis, evolution, and phenotypic differences among different species, strains, or clones and to uncover potential virulence genes. Knowledge of the genomes provides the basis for the application of new powerful approaches for the understanding of the biology of the organisms studied. Although Legionella are mainly environmental bacteria, several species are pathogenic to humans, in particular Legionella pneumophila (Fraser et al., 1977; Mcdade et al., 1977) and Legionella longbeachae (Mckinney et al., 1981). Legionnaires’ disease has emerged in the second half of the twentieth century partly due to human alterations of the environment. The development of artificial water systems in the last decades like air conditioning systems, cooling towers, showers, and other aerosolizing devices has allowed Legionella to gain access to the human respiratory system. When inhaled in contaminated aerosols, pathogenic Legionella can reach the alveoli of the lung where they are subsequently engulfed by macrophages. In contrast to most bacteria, which are destroyed, some Legionella species can multiply within the phagosome and eventually kill the macrophage, resulting in a severe, often fatal pneumonia called legionellosis or Legionnaires’ disease (mortality rate of 5–20%; up to 50% in nosocomial infections; Steinert et al., 2002; Marrie, 2008; Whiley and Bentham, 2011). To replicate intracellularly L. pneumophila manipulates host cellular processes using bacterial proteins that are delivered into the cytosolic compartment of the host cell by a specialized type IV secretion system called Dot/Icm. The proteins delivered by the Dot/Icm system target host factors implicated in controlling membrane transport in eukaryotic cells, which enables L. pneumophila to create an endoplasmic reticulum-like vacuole that supports intracellular replication in both protozoan and mammalian host cells (for a review see Hubber and Roy, 2010). An interesting epidemiological observation is, that among the over 50 Legionella species described today, strains belonging to the species L. pneumophila are responsible for over 90% of the legionellosis cases worldwide and strains belonging to the species L. longbeachae are responsible for about 5% of human legionellosis cases worldwide (Yu et al., 2002). Surprisingly, this distribution is very different in Australia and New Zealand where L. pneumophila accounts for “only” 45.7% of the cases but L. longbeachae is implicated in 30.4% of the human cases. Furthermore, among the strains causing Legionnaires’ disease, L. pneumophila serogroup 1 (Sg1) alone is responsible for over 85% of cases (Yu et al., 2002; Doleans et al., 2004) despite the description of 15 different Sg within this species. In addition, the characterization of over 400 different L. pneumophila Sg1 strains has shown that only a minority among these is responsible for causing most of the human disease (Edelstein and Metlay, 2009). Some of these clones are distributed worldwide like L. pneumophila strain Paris (Cazalet et al., 2008) others have a more restricted geographical distribution, like the recently described endemic clone, prevalent in Ontario, Canada (Tijet et al., 2010). For the species L. longbeachae two serogroups are described to date (Bibb et al., 1981; Mckinney et al., 1981). L. longbeachae Sg1 is predominant in human disease as it causes up to 95% of the cases of legionellosis worldwide and most outbreaks and sporadic cases in Australia (Anonymous, 1997; Montanaro-Punzengruber et al., 1999). The two main human pathogenic Legionella species, L. pneumophila and L. longbeachae cause the same disease and symptoms in humans (Amodeo et al., 2009), however, there exist major differences between both species in niche adaptation and host susceptibility. They are found in different environmental niches, as L. pneumophila is mainly found in natural and artificial water circuits and L. longbeachae is principally found in soil and therefore associated with gardening and use of potting compost (O’Connor et al., 2007). However, although less common, the isolation of L. pneumophila from potting soil in Europe has also been reported (Casati et al., 2009; Velonakis et al., 2009). Human infection due to L. longbeachae is particularly common in Australia but cases have been documented also in other countries like the USA, Japan, Spain, England, or Germany (MMWR, 2000; Garcia et al., 2004; Kubota et al., 2007; Kumpers et al., 2008; Pravinkumar et al., 2010). As described for other Legionella species, person to person transmission of L. longbeachae has not been documented, however, the primary transmission mode seems to be inhalation of dust from contaminated compost or soil that contains the organism (Steele et al., 1990; MMWR, 2000; O’Connor et al., 2007). Furthermore, for L. pneumophila a biphasic life cycle was observed in vitro and in vivo as exponential phase bacteria do not express virulence factors and are unable to replicate intracellularly. The ability of L. pneumophila to replicate intracellularly is triggered at the post-exponential phase by a complex regulatory cascade (Molofsky and Swanson, 2004; Sahr et al., 2009). In contrast, less is known on the L. longbeachae intracellular life cycle and its virulence factors. It was recently shown that unlike L. pneumophila the ability of L. longbeachae to replicate intracellularly is independent of the bacterial growth phase (Asare and Abu Kwaik, 2007) and that phagosome biogenesis is different. Like L. pneumophila, the L. longbeachae phagosome is surrounded by endoplasmic reticulum and does not mature to a phagolysosome; however it acquires early and late endosomal markers (Asare and Abu Kwaik, 2007). Another interesting difference between these two species is their ability to colonize the lungs of mice. While only A/J mice are permissive for replication of L. pneumophila, A/J, C57BL/6, and BALB/c mice are all permissive for replication of L. longbeachae (Asare et al., 2007; Gobin et al., 2009). Resistance of C57BL/6 and BALB/c mice to L. pneumophila has been attributed to polymorphisms in Nod-like receptor apoptosis inhibitory protein 5 (naip5) allele that recognizes the C-terminus of flagellin (Wright et al., 2003; Molofsky et al., 2006; Ren et al., 2006; Lightfield et al., 2008). The current model is that L. pneumophila replication is restricted due to flagellin dependent caspase-1 activation through Naip5-Ipaf and early macrophage cell death by pyroptosis. However, although depletion or inhibition of caspase-1 activity leads to decreased targeting of bacteria to lysosomes, the mechanism of caspase-1-dependent restriction of L. pneumophila replication in macrophages and in vivo is not fully understood (Schuelein et al., 2011). In the last years, six genomes of different L. pneumophila strains (Paris, Lens, Philadelphia, Corby, Alcoy, and 130b (Cazalet et al., 2004; Chien et al., 2004; Steinert et al., 2007; D’Auria et al., 2010; Schroeder et al., 2010) have been published. The genome sequences of all but strain 130b were completely finished. Furthermore, the sequencing and analysis of four genomes of L. longbeachae have been carried out recently (Cazalet et al., 2010). L. longbeachae strain NSW150 of Sg1 isolated in Australia from a patient was sequenced completely, and for the remaining three strains (ATCC33462, Sg1 isolated from a human lung, C-4E7 and 98072, both of Sg2 isolated from patients) a draft genome sequence was reported. A fifth L. longbeachae strain (D-4968 of Sg1, isolated in the US from a patient) was recently sequenced and the analysis of the genome sequences assembled into 89 contigs was reported (Kozak et al., 2010). Here we will describe what we learned from the analysis and comparison of the sequenced Legionella strains. We will discuss their general characteristics and then highlight the specific features or common traits with respect to the different ecological niches and the differences in host susceptibility of these two Legionella species. Emphasis will be put on putative virulence and Legionella life cycle related functions. In the last part we will analyze and discuss the possible evolution of the identified virulence factors. Finally, future perspectives in Legionella genomics are presented.

General Features of the L. pneumophila and L. longbeachae Genomes

Legionella pneumophila and L. longbeachae each have a single, circular chromosome with a size of 3.3–3.5 Mega bases (Mb) for L. pneumophila and 3.9–4.1 Mb for L. longbeachae. For both the average G + C content is 38% (Tables 1). The L. pneumophila strains Paris and Lens each contain different plasmids, 131.9 kb and 59.8 kb in size, respectively. In strain Philadelphia-1, 130b, Alcoy, and Corby no plasmid was identified. The L. longbeachae strains NSW10 and D-4986 carry highly similar plasmids of about 70 kb and DNA identity of 99%, strains C-4E7 and 98072 also contain each a highly similar plasmid of 133.8 kb in size. Thus similar plasmids circulate among L. longbeachae strains, but they seem to be different from those found in L. pneumophila.
Table 1

General features of the sequenced .

A. Complete and draft genomes of L. pneumophila obtained by classical or new generation sequencing
L. pneumophila
ParisLensPhiladelphiaCorbyAlcoy130bc
Chromosome size (kb)a3504 (131.9)b3345 (59.8)3397357635163490
G + C content (%)38.3 (37.4)38.4 (38)38.33838.438.2
No. of genesa3123 (142)2980 (60)3031323731973288
No. of protein coding genesa3078 (140)2921 (60)2999319330973141
Percentage of CDS (%)87.988.090.286.886.087.9
No. of specific genes225181213144182386c
No. of 16S/23S/5S03/03/0303/03/0303/03/0303/03/0303/03/03ND
No. transfer RNA444343434342
Plasmids110000
B. Complete and draft genomes of L. longbeachae obtained by classical or new generation sequencing
L. longbeachae
NSW 150D-4968ATCC3346298072C-4E7
Chromosome size (Kb)4077 (71)4016 (70)40964018 (133.8)3979 (133.8)
G + C content (%)37.1 (38.2)37.037.037.0 (37.8)37 (37.8)
No. of genes3660 (75)3557 (61)
No. of 16S/23S/5S04/04/0404/04/0404/04/0404/04/0404/04/04
No. of contigs > 0.5–300 kbComplete13646563
N50 contig size*Complete138 kb129 kb134 kb
Percentage of coverage**100%96.396.393.493.1
Number of SNP with NSW15001900161116 85316 820
Plasmids11011

.

General features of the sequenced . . A total of ∼3000 and 3500 protein-encoding genes are predicted in the L. pneumophila and L. longbeachae genomes, respectively. No function could be predicted for about 40% of these genes and about 20% are unique to the genus Legionella. Comparative analysis of the genome structure of the L. pneumophila genomes showed high colinearity, with only few translocations, duplications, deletions, or inversions (Figures 1A,B) and identified between 6 and 11% of genes as specific to each L. pneumophila strain. Principally, the genomes contain three large plasticity zones, where the synteny is disrupted: a 260-kb inversion in strain Lens with respect to strains Paris and Philadelphia-1, a 130-kb fragment which is inserted in a different genomic location in strains Paris and Philadelphia-1 and the about 50 kb chromosomal region carrying the Lvh type IV secretion system, previously described in strain Philadelphia-1 (Segal et al., 1999). Furthermore, deletions and insertions of several smaller regions were identified in each strain, as well as regions with variable gene content. In contrast, comparison of the completed chromosome sequences of L. pneumophila and L. longbeachae shows that the two Legionella species have a significantly different genome organization (Figure 1C). Moreover only about 65% of the L. longbeachae genes are orthologous to L. pneumophila genes, whereas about 34% of all genes are specific to L. longbeachae with respect to L. pneumophila Paris, Lens, Philadelphia, and Corby (defined by less than 30% amino acid identity over 80% of the length of the smallest protein).
Figure 1

Synteny plot of the chromosomes of L. pneumophila strains Paris, Lens, Corby, and L. longbeachae NSW150. The plot was created using the mummer software package. (A) Synteny plot of the chromosomes of strains L. pneumophila Paris and Corby (B) and strains L. pneumophila Paris and Lens and (C) strains L. pneumophila Paris and L. longbeachae NSW150. Inversions between the genomic sequences are represented in blue. Genome-wide synteny is disrupted by a 260 kb inversion (blue) and a 130 kb plasticity zone between strain L. pneumophila Paris and Lens. In contrast, synteny between L. pneumophila and L. longbeachae is highly conserved.

Synteny plot of the chromosomes of L. pneumophila strains Paris, Lens, Corby, and L. longbeachae NSW150. The plot was created using the mummer software package. (A) Synteny plot of the chromosomes of strains L. pneumophila Paris and Corby (B) and strains L. pneumophila Paris and Lens and (C) strains L. pneumophila Paris and L. longbeachae NSW150. Inversions between the genomic sequences are represented in blue. Genome-wide synteny is disrupted by a 260 kb inversion (blue) and a 130 kb plasticity zone between strain L. pneumophila Paris and Lens. In contrast, synteny between L. pneumophila and L. longbeachae is highly conserved. Analysis of single nucleotide polymorphisms (SNP) revealed a very low SNP number of less than 0.4% among the four L. longbeachae genomes, which is significantly lower than the polymorphism of about 2% between L. pneumophila Sg1 strains Paris and Philadelphia (Table 1). Comparison of the two L. longbeachae Sg1 genomes (NSW150, ATCC33462) identified 1611 SNPs of which 1426 are located in only seven chromosomal regions mainly encoding putative mobile elements, whereas the remaining 185 SNPs were evenly distributed around the chromosome. A similar number of about 1900 SNPs were identified when comparing strains NSW150 to strain D-4968 (Table 1). In contrast, the SNP number between two strains of different Sg was higher, with about 16000 SNPs present between Sg1 and Sg2 strains (Table 1). This low SNP number and relatively homogeneous distribution of the SNPs around the chromosome suggest recent expansion for the species L. longbeachae (Cazalet et al., 2010). The sequences and their analysis are accessible at http://genolist.pasteur.fr/LegioList/. To investigate the phylogenetic relationship among the L. pneumophila and L. longbeachae strains we here used the nucleotide sequence of recN (recombination and repair protein-encoding gene) aligned based on the protein alignment. Based on an analysis of 32 protein-encoding genes widely distributed among bacterial genomes, RecN was described as the gene with the greatest potential for predicting genome relatedness at the genus or subgenus level (Zeigler, 2003). As depicted in Figure 2, the phylogenetic relationship among the four L. pneumophila strains is very high, and L. longbeachae is clearly more distant.
Figure 2

Phylogenetic tree showing the relationship of the sequenced . The tree was constructed using the recN sequences of each genome and the Neighbor joining method in MEGA. L. longbeachae is indicated without strain designation, as the RecN sequence of all sequenced strains is identical and thus only one representative strain is indicated on the tree. Numbers at branching nodes are percentages of 1000 bootstrap replicates.

Phylogenetic tree showing the relationship of the sequenced . The tree was constructed using the recN sequences of each genome and the Neighbor joining method in MEGA. L. longbeachae is indicated without strain designation, as the RecN sequence of all sequenced strains is identical and thus only one representative strain is indicated on the tree. Numbers at branching nodes are percentages of 1000 bootstrap replicates.

Diversity in Secretion Systems and Their Substrates may Contribute to Differences in Intracellular Trafficking and Niche Adaptation

The capacity of pathogens like Legionella to infect eukaryotic cells is intimately linked to the ability to manipulate host cell functions to establish an intracellular niche for their replication. Essential for the ability of Legionella to subvert host functions are its different secretion systems. The two major ones, known to be involved in virulence of L. pneumophila are the Dot/Icm type IV secretion system (T4BSS) and the Lsp type II secretion system (T2SS; Marra et al., 1992; Berger and Isberg, 1993; Rossier and Cianciotto, 2001). For L. pneumophila type II protein secretion is critical for infection of amebae, macrophages and mice. Analyses of the L. longbeachae genome sequences showed, that it contains all genes to encode a functional Lsp type II secretion machinery (Cazalet et al., 2010; Kozak et al., 2010). Several studies, including the analysis of the L. pneumophila type II secretome indicated that L. pneumophila encodes at least 25 type II secreted substrates (Debroy et al., 2006; Cianciotto, 2009). Although this experimentally defined repertoire of type II secretion-dependent proteins is the largest known in bacteria, it may contain even more than 60 proteins as 35 additional proteins with a signal sequence were identified by in silico analyses (Cianciotto, 2009). A search for homologs of these substrates in the L. longbeachae genome sequences revealed that 9 (36%) of the 25 type II secretion system substrates described for L. pneumophila are absent from L. longbeachae (Table 2). For example the phospholipase C encoded by plcA and the chiA-encoded chitinase, which was shown to promote L. pneumophila persistence in the lungs of A/J mice are not present in L. longbeachae (Debroy et al., 2006). Thus over a third of the T2SS substrates seem to differ between L. pneumophila and L. longbeachae, a feature probably related to the different ecological niches occupied, but also to different virulence properties in the hosts.
Table 2

Distribution of type II secretion-dependent proteins of .

L. pneumophilaL. longbeachaeNameProduct
PhilaParisLensCorbyAlcoy130b*NSWD-4968
lpg0467lpp0532lpl0508lpc2877lpa00713lpw05741llo2721llb2607proAZinc metalloprotease, promotes amebal infection
lpg1119lpp1120lpl1124lpc0577lpa01742llo1016llb0700mapTartrate-sensitive acid phosphatase
lpg2343lpp2291lpl2264lpc1811lpa03353lpw25361llo2819llb2504plaALysophospholipase A
lpg2837lpp2894lpl2749lpc3121lpa04118lpw30971llo0210llb1661plaCGlycerophospholipid:cholestrol transferase
lpg0502lpp0565lpl0541lpc2843lpa00759lpw05821plcAPhospholipase C
lpg0745lpp0810lpl0781lpc2548lpa01148lpw08251llo2076llb3335lipAMono- and triacylglycerol lipase
lpg1157lpp1159lpl1164lpc0620lpa01801lpw12111llo2433llb2928lipBTriacylglycerol lipase
lpg2848lpp2906lpl2760lpc3133lpa04141lpw31111llo0201llb1671srnAType 2 ribonuclease, promotes amebal infection
lpg1116lpp1117lpl1121lpc0574lpa01738lpw11641chiAChitinase, promotes lung infection
lpg2814lpp2866lpl2729lpc3100lpa04088lpw30701llo0255llb1611lapALeucine, phenylalanine, and tyrosine aminopeptidase
lpg0032lpp0031lpl0032lpc0032lpa00041lpw00321lapBLysine and arginine aminopeptidase
lpg0264lpp0335lpl0316lpc0340lpa00461lpw03521llo3103llb2271Weakly similar to bacterial amidase
lpg2622lpp2675lpl2547lpc0519lpa03836lpw28341Weakly similar to bacterial cysteine protease
lpg1918lpp1893lpl1882lpc1372lpa02774lpw19571llo3308llb2032celAEndoglucanase
lpg2999lpp3071lpl2927lpc3315lpa04395lpw32851Predicted astacin-like zink endopeptidase
lpg2644lpp2697lpl2569lpc0495lpa03870Some similarity to collagen like protein
lpg1809lpp1772lpl1773lpc1253lpa02614lpw18401llo1104llb0603Unknown
lpg1385lpp1340lpl1336lpc0801lpa02037lpw13951llo1474llb0177Unknown
lpg0873lpp0936lpl0906lpc2419lpa01320lpw09571llo2475llb2883Unknown
lpg0189lpp0250lpl0249lpc0269lpa00360lpw02811Unknown
lpg0956lpp1018lpl0958lpc2331lpa01443lpw10421llo1935llb3498Unknown
lpg2689lpp2743lpl2616lpc0447lpa03925lpw29431llo0361llb1497icmXLinked to Dot/Icm type IV secretion genes
lpg1244lpp0181lpl0163lpw01541lvrELinked to Lvh type IV secretion genes
lpg1832lpp1795lpl1796lpc1276lpa02647lpw18641llo1152llb0546Weakly similar to VirK
lpg1962lpp1946lpl1936lpc1440lpa02861lpw20131Putative peptidyl-prolyl cis-trans isomerase
lpg0422lpp0489lpl0465lpc2921lpa0657lpw05041llo2801llb2523gamAGlucoamylase

Substrates in this list are according to Cianciotto (.

Distribution of type II secretion-dependent proteins of . Substrates in this list are according to Cianciotto (. Indispensible for replication of L. pneumophila in the eukaryotic host cells is the Dot/Icm T4SS (Nagai and Kubori, 2011), which translocate a large repertoire of bacterial effectors into the host cell. These effectors modulate multiple host cell processes and in particular, redirect trafficking of the L. pneumophila phagosome and mediate its conversion into an ER-derived organelle competent for intracellular bacterial replication (Shin and Roy, 2008; Cianciotto, 2009). The Dot/Icm system is conserved in L. longbeachae with a similar gene organization and protein identities of 47–92% with respect to L. pneumophila (Figure 3). This is similar to what has been reported previously for other Legionella species (Morozova et al., 2004). The only major differences identified are that in L. longbeachae the icmR gene is replaced by the ligB gene, however, the encoded proteins have been shown to perform similar functions (Feldman and Segal, 2004; Feldman et al., 2005) and that the DotG/IcmE protein of L. longbeachae (1525 aa) is 477 amino acids larger than that of L. pneumophila (1048 aa; Cazalet et al., 2010). DotG of L. pneumophila is part of the core transmembrane complex of the secretion system and is composed of three domains: a transmembrane N-terminal domain, a central region composed of 42 repeats of 10 amino acid and a C-terminal region homologous to VirB10. In contrast, the central region of L. longbeachae DotG is composed of approximately 90 repeats. Among the many VirB10 homologs present in bacteria, the Coxiella DotG and the Helicobacter pylori Cag7 are the only ones, which also have multiple repeats of 10 aa (Segal et al., 2005). It will be challenging to understand the impact of this modification on the function of the type IV secretion system. A L. longbeachae T4SS mutant obtained by deleting the dotA gene is strongly attenuated for intracellular growth in Acanthamoeba castellanii and human macrophages (Cazalet et al., 2010, and unpublished data), is outcompeted by the wild type strain 24 and 72 h after infection of lungs of A/J mice and is also dramatically attenuated for replication in lungs of A/J mice upon single infections (Cazalet et al., 2010). Thus, similar to what is seen for L. pneumophila, the Dot/Icm T4SS of L. longbeachae is also central for its pathogenesis and the capacity to replicate in eukaryotic host cells.
Figure 3

Alignment of the chromosomal regions of . The comparison shows that all genes are highly conserved (47–92% identity) between L. pneumophila Paris and L. longbeachae. Red arrows, genes conserved between L. pneumophila and L. longbeachae (>47% identity); black arrows, L. pneumophila specific genes compared to L. longbeachae (<35% identity); blue arrows, genes conserved between L. pneumophila and L. longbeachae but located in different places of the genome; green arrows, L. longbeachae specific genes compared to L. pneumophila. Red arrow boxed in green depicts dotG. N-terminal and C-terminal parts of dotG are highly conserved while the central part composed of repeated sequences differs between L. pneumophila and L. longbeachae.

Alignment of the chromosomal regions of . The comparison shows that all genes are highly conserved (47–92% identity) between L. pneumophila Paris and L. longbeachae. Red arrows, genes conserved between L. pneumophila and L. longbeachae (>47% identity); black arrows, L. pneumophila specific genes compared to L. longbeachae (<35% identity); blue arrows, genes conserved between L. pneumophila and L. longbeachae but located in different places of the genome; green arrows, L. longbeachae specific genes compared to L. pneumophila. Red arrow boxed in green depicts dotG. N-terminal and C-terminal parts of dotG are highly conserved while the central part composed of repeated sequences differs between L. pneumophila and L. longbeachae. This T4SS is crucial for intracellular replication for Legionella as it secretes an exceptionally large number of proteins into the host cell. Using different methods, 275 substrates have been shown to be translocated in the host cell in a Dot/Icm T4SS dependent manner (Campodonico et al., 2005; De Felipe et al., 2005, 2008; Shohdy et al., 2005; Burstein et al., 2009; Heidtman et al., 2009; Zhu et al., 2011). Table 3 shows the distribution of the 275 Dot/Icm substrates identified in L. pneumophila strain Philadelphia and their distribution in the six L. pneumophila and five L. longbeachae genomes sequenced. Their conservation among different L. pneumophila strains is very high, as over 80% of the substrates are present in all L. pneumophila strains analyzed here. In contrast, the search for homologs of these L. pneumophila Dot/Icm substrates in L. longbeachae showed that even more pronounced differences are present than in the repertoire of type II secreted substrates. Only 98 of these 275 L. pneumophila Dot/Icm substrates have homologs in the L. longbeachae genomes (Table 3). However, the repertoire of L. longbeachae substrates seems also to be quite large, as a search for proteins that encode eukaryotic like domains and contain the secretion signal described by Nagai et al. (2005) and the additional criteria defined by Kubori et al. (2008) predicted 51 putative Dot/Icm substrates specific for L. longbeachae NSW150 (Cazalet et al., 2010) indicating that at least over 140 proteins might be secreted by the Dot/Icm T4SS of L. longbeachae. A similar number of L. longbeachae specific putative eukaryotic like proteins and effectors was predicted for strain D-4968 (Kozak et al., 2010). Examples of effector proteins conserved between the two species are RalF, VipA, VipF, SidC, SidE, SidJ, YlfA LepA, and LepB, which contribute to trafficking or recruitment and retention of vesicles to L. pneumophila (Nagai et al., 2002; Chen et al., 2004; Luo and Isberg, 2004; Campodonico et al., 2005; Shohdy et al., 2005; Liu and Luo, 2007). It is interesting to note that homologs of SidM/DrrA and SidD are absent from L. longbeachae but a homolog of LepB is present. For L. pneumophila it was shown that SidM/DrrA, SidD, and LepB act in cooperation to manipulate Rab1 activity in the host cell. DrrA/SidM possesses three domains, an N-terminal AMP-transfer domain (AT), a nucleotide exchange factor (GEF) domain in the central part and a phosphatidylinositol-4-Phosphate binding domain (P4M) in its C-terminal part. After association of DrrA/SidM with the membrane of the Legionella-containing vacuole (LCV) via P4M (Brombacher et al., 2009), it recruits Rab1 via the GEF domain and catalyzes the GDPGTP exchange (Ingmundson et al., 2007; Machner and Isberg, 2007). Rab1 is then adenylated by the AT domain leading to inhibition of GAP-catalyzed Rab1-deactivation (Müller et al., 2010). LepB cannot bind AMPylated Rab1 (Ingmundson et al., 2007). Recently it was shown that SidD deAMPylates Rab1 and enables LepB to bind Rab1 to promote its GTPGDP exchange (Neunuebel et al., 2011; Tan and Luo, 2011). One might assume that other proteins of L. longbeachae not yet identified may perform the functions of DrrA/SidM and SidD. Another interesting observation is, that all except four of the effector proteins of L. pneumophila that are conserved in L. longbeachae are also conserved in all sequenced L. pneumophila genomes (Table 3).
Table 3

Distribution of 275 Dot/Icm substrates identified in strain .

L. pneumophilaL. longbeachaeNameProduct
PhilaParisLensCorbyAlcoy130bNSW 150D-4968AT98072C-4E7
lpg0008lpp0008lpl0008lpc0009lpa0011lpw00071ravAUnknown
lpg0012lpp0012lpl0012lpc0013lpa0016lpw00111cegC1Ankyrin
lpg0021lpp0021lpl0022lpc0022lpa0030lpw00221llo0047llb1841+++Unknown
lpg0030lpp0030lpl0031lpc0031lpa0040lpw00311ravBUnknown
lpg0038lpp0037lpl0038lpc0039lpa0049lpw00381ankQ/legA10Ankyrin repeat
lpg0041lpc0042lpa0056Putative metalloprotease
lpg0045lpp0046lpl0044lpc0047lpa0060lpw00441Unknown
lpg0046lpp0047lpl0045lpc0048lpa0062lpw00451Unknown
lpg0059lpp0062lpl0061lpc0068lpa0085lpw00621ceg2Unknown
lpg0080lpp0094lpa3018lpw00781ceg3Unknown
lpg0081lpp0095lpw00791Unknown
lpg0090lpp0104lpl0089lpc0109lpa0132lpw00881lem1Unknown
lpg0096lpp0110lpl0096lpc0115lpa0145lpw00961llo1322llb0347+++ceg4Unknown
lpg0103lpp0117lpl0103lpc0122lpa0152lpw01031llo3312llb2028+++vipFN-terminal acetyl-transferase, GNAT
lpg0126lpp0140lpl0125lpc0146lpa0185lpw01261cegC2Ninein
lpg0130lpp0145lpl0130lpc0151lpa0194lpw01311llo3270llb2073+++Unknown
lpg0135lpp0150lpl0135lpc0156lpa0204lpw01361llo2439llb2921+++sdhBUnknown
lpg0160lpp0224lpl0224lpc0242lpa0322lpw02541ravDUnknown
lpg0170lpp0232lpl0233lpc0251lpa0335lpw02641llo1378llb0280+++ravCUnknown
lpg0171lpp0233lpl0234lpw02651legU1F-box motif
lpg0172lpp0234lpc0253lpa0339lpw02661Unknown
lpg0181lpp0245lpl0244lpc0265lpa0388lpw02761llo2453llb2907+++Unknown
lpg0191lpp0251lpw02821ceg5Unknown
lpg0195lpp0253lpl0251lpc0272lpa0339lpw02851ravEUnknown
lpg0196lpp0254lpl0252lpw02861llo2549llb2798+++ravFUnknown
lpg0210lpp0269lpl0264lpc0285lpa0388lpw02981ravGUnknown
lpg0227lpp0286lpl0281lpc0303lpa0412lpw03151llo2491llb2864+++ceg7Unknown
lpg0234lpp0304lpl0288lpc0309lpa0419lpw03221llo0425llb1431+++sidE/laiDUnknown
lpg0240lpp0310lpl0294lpc0316lpa0428lpw03291llo1601llb0040+++ceg8Unknown
lpg0246lpp0316lpl0300lpc0323lpa0436lpw03361ceg9Unknown
lpg0257lpp0327lpl0310lpc0334lpa0450lpw03461llo2362llb3009+++sdeAMultidrug resistance protein
lpg0260lpp0332lpl0313lpc0337lpa0456lpw03491Unknown
lpg0275lpp0349lpl0327lpc0351/3529lpa0477lpw03641sdbAUnknown
lpg0276lpp0350lpl0328lpc0353lpa0479lpw03651llo0327llb1533+++legG2Ras guanine nucleotide exchange factor
lpg0284lpp0360lpl0336lpc0361lpa0490lpw03741ceg10Unknown
lpg0285lpp0361lpl0337lpc0362lpa0492lpw03751lem2Unknown
lpg0294lpp0372lpl0347lpc0373lpa0508lpw03861llo0464llb1386+++Unknown
lpg0364lpp0429lpl0405lpc2980lpa0578lpw04431Unknown
lpg0365lpp0430lpl0406lpc2979lpa0580lpw04441llo0525llb1334+++Unknown
lpg0375lpp0442lpl0418lpc2968lpa0596Unknown
lpg0376lpp0443lpl0419lpc2967lpa0597lpw04591llo0548llb1307+++sdhAGRIP, coiled-coil
lpg0390lpp0457lpl0433lpc2954lpa0613lpw04721vipAUnknown
lpg0401lpp0468lpl0444lpc2942lpa0629lpw04831llo2582llb2763+++legA7/ceg11Unknown
lpg0402ankY/legA9Ankyrin, STPK
lpg0403lpp0469lpl0445lpc2941lpa0630lpw04841ankG/ankZ/legA7Ankyrin
lpg0405lpp0471lpl0447lpc2939lpa0633lpw04861llo2845llb2472+++Spectrin domain
lpg0422lpp0489lpl0465lpc2921lpa0657lpw05041llo2801llb2523+++legYPutative Glucan 1,4-alpha-glucosidase
lpg0436lpp0503lpl0479lpc2906lpa0673lpw05181ankJ/legA11Ankyrin
lpg0437lpp0504lpl0480lpc2905lpa0674lpw05191ceg14Unknown
lpg0439lpp0505lpl0481lpc2904lpa0678lpw05201llo2983llb2392+++ceg15Unknown
lpg0483lpp0547lpl0523lpc2861lpa0739lpw05631llo2705llb2623+++ankC/legA12Ankyrin
lpg0515lpp0578lpl0554lpc2829lpa0776lpw05951llo3224llb2129+++legD2Phytanoyl-CoA dioxygenase domain
lpg0518lpp0581lpl0557lpc2826lpa0781lpw05981Unknown
lpg0519ceg17Unknown
lpg0621lpp0675lpl0658lpc2673lpa0975lpw06951sidAUnknown
lpg0634lpp0688lpl0671lpc2660lpa0996lpw07081llo2574llb2771+++Unknown
lpg0642lpp0696/97lpl0679lpc2651lpa1005lpw07161wipBUnknown
lpg0695lpp0750lpl0732lpc2599lpa1082lpw07721ankN/ankX legA8Ankyrin
lpg0696lpp0751lpl0733lpc2598lpa1084lpw07731lem3Unknown
lpg0716lpp0782lpl0753lpc2577lpa1108lpw07931++Unknown
lpg0733lpp0799lpl0770lpc2559lpa1135lpw08111llo0831llb0892+++ravHUnknown
lpg0796lpp0859Unknown
lpg0898lpp0959lpl0929lpc2395lpa1360lpw09801ceg18Unknown
lpg0926lpp0988lpl0957lpc2365lpa1397lpw10111ravIUnknown
lpg0940lpp1002lpl0971lpc2349lpa1415lpw10251lidAUnknown
lpg0944lpp1006lpc2345lpa1421ravJUnknown
lpg0945lpp1007lpl1579lpc2344lpa1423lpw10311legL1LLR
lpg0963lpp1025lpl0992lpc2324lpa1453lpw10491llo0934llb0782+++Unknown
lpg0967lpp1029lpc2320lpa1459lpw10531Unknown
lpg0968lpp1030lpl0997lpc2319lpa1460lpw10541sidKUnknown
lpg0969lpp1031lpl0998lpc2318lpa1461lpw10551llo3265llb2078+++ravKUnknown
lpg1083Unknown
lpg1101lpp1101lpl1100lpc2154*lpa1709lpw11451lem4Unknown
lpg1106lpp1105lpl1105lpc2149lpa1719lpw11501llo1414llb0239/40+++Unknown
lpg1108lpp1108lpl1108lpc2146lpa1724lpw11531llo3030llb2350+++ravLUnknown
lpg1109lpp1109lpc2145lpa1725ravMUnknown
lpg1110lpp1111lpl1114lpc2142lpa1728lpw11571lem5Unknown
lpg1111lpp1112lpl1115lpc2141lpa1730lpw11581llo3126llb2244+++ravNUnknown
lpg1120lpw11681lem6Unknown
lpg1121lpp1121lpl1126lpc0578lpa1743lpw11691llo1321llb0348+++ceg19Unknown
lpg1124lpp1125lpl1129lpc0582lpa1748lpw11741llo3206llb2150+++Unknown
lpg1129lpp1130lpw11801ravOUnknown
lpg1137lpp1139lpl1144lpc0601lpa1776lpw11901llo2404llb2962+++Unknown
lpg1144lpp1146lpl1150lpc0607lpa1785lpw11971cegC3Unknown
lpg1145lpp1147lpl1151lpc0608lpa1787lpw11981lem7Unknown
lpg1147lpp1149lpl1153lpc0610lpa1789lpw12001GCN5-related N-acetyltransferase
lpg1148lpp1150lpl1154lpc0611lpa1790lpw12011Unknown
lpg1152lpp1154lpl1159lpc0615lpa1795lpw12061ravPUnknown
lpg1154lpp1156lpl1161lpc0617lpa1797lpw12081llo2487llb2868+++ravQUnknown
lpg1158lpp1160lpl1165*lpc0621lpa1802lpw12121Unknown
lpg1166lpp1168lpl1174lpc0631lpa1819lpw12211llo1034llb0680+++ravRUnknown
lpg1171lpp1173lpl1179lpc0637lpa1826Spectrin domain
lpg1183lpp1186lpl1192lpc0650lpa1839lpw12401llo2390llb2978+++ravSUnknown
lpg1227lpp1235lpl1235lpc0696lpa1899lpw12861vpdBUnknown
lpg1273lpp1236lpl1236lpc0698lpa1901lpw12871Unknown
lpg1290lpp1253lem8Unknown
lpg1312lpw13261legC1Unknown
lpg1316llo1389llb0269+++ravTUnknown
lpg1317ravWUnknown
lpg1328lpp1283lpl1282lpc0743lpa1958legTThaumatin domain
lpg1355lpp1309sidGCoiled-coil
lpg1426lpp1381lpl1377lpc0842lpa2090lpw14431llo1791llb3606+++vpdCPatatin domain
lpg1449lpp1404lpw14671Unknown
lpg1453lpp1409lpl1591lpc0868lpa2119lpw14711Unknown
lpg1483lpp1439lpl1545lpc0898lpa2161lpw15031llo1682llb3727+++legK1STPK
lpg1484lpp1440lpl1544lpc0899lpa2162lpw15041Unknown
lpg1488lpp1444lpl1540lpc0903*lpa2168lpw15081lgt3/legc5Coiled-coil
lpg1489lpp1445lpl1539lpc0905lpa2169lpw15091ravXUnknown
lpg1491lpp1447lem9Unknown
lpg1496lpp1453lpl1530lpc0915lpa2185lpw15181lem10Unknown
lpg1551lpp1508lpl1475lpc0972lpa2253ravYUnknown
lpg1578lpp4178lpl4143lpc1002lpa2292lpw16011llo1503llb0148+++Unknown
lpg1588lpp1546lpl1437lpc1013lpa2305lpw16131legC6Coiled–coil
lpg1598lpp1556lpl1427lpc1025lpa2317lpw16231lem11Unknown
lpg1602lpp1567lpl1423/26*lpc1028lpa2318lpw16241legL2LRR
lpg1621lpp1591lpl1402lpc1048lpa2346lpw16461llo1014llb0702+++ceg23Unknown
lpg1625lpp1595lpl1398lpc1052lpa2350lpw16511llo0719llb1016+++lem23Unknown
lpg1639lpp1609lpl1387lpc1068lpa2367lpw16651Unknown
lpg1642lpp1612a/blpl1384lpc1071lpa2371lpw16681sidBPutative hydrolase
lpg1654lpp1625lpc1084lpa2390llo0791llb0935+++Unknown
lpg1660lpp1631lpl1625lpc1090lpa2398lpw16861legL3LRR
lpg1661lpp1632lpl1626lpc1091lpa2399lpw16871llo1691llb3715+++Putative N-acetyl transferase
lpg1666lpp1637lpl1631lpc1096lpa2408lpw16921Unknown
lpg1667lpp1638lpl1632lpc1097lpa2409lpw16931Unknown
lpg1670lpp1642lpl1635lpc1101lpa2413lpw16971Unknown
lpg1683lpc1114lpa2431llo2508llb2843+++ravZUnknown
lpg1684lpc1115lpa2432llo2267llb3113+++Unknown
lpg1685lpc1116lpa2433llo3208llb2147+++Unknown
lpg1687lpp1656lpl1650lpc1118lpa2437lpw17121mavAUnknown
lpg1689lpp1658lpl1652lpc1120lpa2439lpw17141llo1697llb3708+++Unknown
lpg1692lpc1123lpa2442Unknown
lpg1701lpp1666lpl1660lpc1130lpa2455lpw17231ppeA/legC3Coiled-coil
lpg1702lpp1667lpl1661lpc1131lpa2456lpw17241ppeBUnknown
lpg1716lpp1681lpl1675lpc1146lpa2474lpw17391Unknown
lpg1717lpp1682lpw17401Unknown
lpg1718lpp1683lpl1682lpc1152lpa2484lpw17411ankI/legAS4Ankyrin
lpg1751lpp1715lpl1715lpc1191lpa2538lpw17761llo2314llb3061+++Unknown
lpg1752lpp1716lpl1716lpc1192lpa2539lpw17771llo2315llb3060+++Unknown
lpg1776lpp1740lpl1740lpc1217lpa2570lpw18031llo1437llb0214*+++Unknown
lpg1797lpc1239lpa2599lpw32931rvfAUnknown
lpg1798lpp1761lpl1761lpc1241lpa2600lpw18281llo0991llb0731+++marBUnknown
lpg1803lpp1766lpl1766lpc1246lpa2606lpw18331llo2611llb2729+++Unknown
lpg1836lpp1799lpl1800lpc1280lpa2652lpw18691ceg25Unknown
lpg1851lpp1818lpl1817lpc1296lpa2675lpw18871llo1047llb0666+++lem14Unknown
lpg1884lpp1848lpl1845lpc1331lpa2714lpw19161ylfB/legC2Coiled-coil
lpg1888lpp1855lpl1850lpc1336lpa2723lpw19211Unknown
lpg1890lpl1852lpc1338lpa2726lpw19231legLC8LRR, coiled-coil
lpg1907lpp1882lpl1871lpc1361lpa2762lpw19461llo1240llb0452+++Unknown
lpg1924lpp1899lpl1888lpc1378lpa2783lpw19631Unknown
lpg1933lpp1914lpl1903lpc1406lpa2811lpw19721lem15Unknown
lpg1947lpp1930lpl1917*lpa2835lpw19951lem16Spectrin domain
lpg1948legLC4LRR, coiled-coil
lpg1949lpp1931lpl1918lpc1422lpa2837lpw19961lem17Unknown
lpg1950lpp1932lpl1919lpc1423lpa2838lpw19971llo1397llb0259+++ralFSec7 domain
lpg1953lpp1935lpl1922lpc1426lpa2842lpw20041legC4Coiled-coil
lpg1958lpp1940legL5LRR
lpg1959lpp1941lpa2857lpw20101Unknown
lpg1960lpp1942lpl1934*lpc1437lpa2859lpw20111llo0565llb1288+++lirAUnknown
lpg1962lpp1946lpl1936lpc1440lpa2861lpw20131lirBRotamase
lpg1963lpc1441/42lpa2863pieA/lirCUnknown
lpg1964pieB/lirDUnknown
lpg1965lpc1443/45lpa2865lpw20141pieC/lirEUnknown
lpg1966lpp1947lpc1446lpa2867lpw20151pieD/lirFUnknown
lpg1969lpp1952lpl1941lpc1452lpa2874lpw20201llo3131llb2239+++pieEUnknown
lpg1972lpp1955lpl1950lpc1459lpa2884lpw20291pieFUnknown
lpg1975lpp1959lpl1953lpc1462lpa2889(1)lpw20351Unknown
lpg1976lpp1959lpl1953lpc1462lpa2889(2)lpw20351pieG/legG1Regulator of chromosome condensation
lpg1978lpp1961lpl1955lpc1464lpa2892lpw20371setAPutative Glyosyltransferase
lpg1986lpp1967lpl1961lpc1469lpa2898lpw20431Unknown
lpg2050lpp2033lpl2028lpc1536lpa2992lpw21141Unknown
lpg2131legA6Unknown
lpg2137lpp2076lpl2066lpc1586lpa3060lpw23101legK2STPK
lpg2144lpp2082lpl2072lpc1593lpa3071lpw23181ankB/legAU13/ceg27Ankyrin, F-box
lpg2147lpp2086lpl2075lpc1596lpa3076lpw23211mavCUnknown
lpg2148lpp2087lpl2076lpc1597lpa3077lpw23221Unknown
lpg2149lpp2088lpl2077lpc1598lpa3078lpw23231Unknown
lpg2153lpp2092lpl2081lpc1602lpa3083lpw23271sdeCUnknown
lpg2154lpp2093lpl2082lpc1603lpa3086lpw23281llo3097llb2278+++sdeCUnknown
lpg2155lpp2094lpl2083lpc1604lpa3087lpw23291llo3096llb2279+++sidJUnknown
lpg2156lpp2095lpl2084lpc1605lpa3088lpw23301llo3095llb2280+++?sdeBUnknown
lpg2157lpp2096lpl2085lpc1618lpa3037lpw23331sdeCUnknown
lpg2166lpp2104lpl2093lpc1626lpa3107lpw23451llo2398llb2969+++lem19Unknown
lpg2160lpp2099lpl2088lpc1621lpa3100lpw23361llo2645llb2690+++Unknown
lpg2176lpp2128lpl2102lpc1635lpa3118lpw23561legS2Sphingosine-1-phosphate lyase
lpg2199lpp2149lpl2123lpc1663lpa3157lpw23811cegC4Unknown
lpg2200lpp2150lpl2124lpc1664lpa3158lpw23821cegC4Unknown
lpg2215lpp2166lpl2140lpc1680lpa3179lpw24011legA2Ankyrin
lpg2216lpp2167lpl2141lpc1681lpa3180lpw24021lem20Unknown
lpg2222lpp2174lpl2147lpc1689lpa3191lpw24081llo1443llb0208+++lpnEPutative beta-lactamase (SEL1 domain)
lpg2223lpp2175lpl2149*lpc1691lpa3196lpw24091Unknown
lpg2224ppgARegulator of chromosome condensation
lpg2239lpp2192lpw24261Unknown
lpg2248lpp2202lpl2174lpc1717lpa3237lpw24371lem21Unknown
lpg2271lpp2225lpl2197lpc1740lpa3268lpw24611llo2530llb2821+++Unknown
lpg2298lpp2246lpl2217lpc1763lpa3296lpw24841llo1707llb3696+++ylfA/legC7Coiled-coil
lpg2300lpp2248lpl2219lpc1765lpa3298lpw24871llo0584llb1266+++ankH/legA3, ankWAnkyrin, NfkappaB inhibitor
lpg2311lpp2259lpl2230lpc1776lpa3312lpw24981ceg28Unknown
lpg2322lpp2270lpl2242lpc1789lpa3328lpw25121llo0570llb1282+++ankK/legA5Ankyrin
lpg2327lpp2275lpl2247lpc1794lpa3335lpw25181Unknown
lpg2328lpp2276lpl2248lpc1795lpa3336lpw25191lem22Unknown
lpg2344lpp2292lpl2265lpc1812lpa3355lpw25371mavEUnknown
lpg2351lpp2300lpl2273lpc1820lpa3367lpw25461llo2850llb2466+++mavFUnknown
lpg2359lpp2308lpl2281lpc1828lpa3376lpw25561llo2856llb2460+++Unknown
lpg2370HipA fragment
lpg2372lpp3009lpc3248lpa4300Unknown
lpg2382lpp2444lpl2300lpc2108lpa3446lpw25841llo1576llb0071+++Unknown
lpg2391lpp2458lpl2315lpc2086lpa3485lpw26021sdbCUnknown
lpg2392lpp2459lpl2316lpc2085lpa3486lpw26041legL6LRR
lpg2400lpl2323lpw26121legL6LRR
lpg2406lpp2472lpl2329lpc2070lpa3506lpw26191llo2172llb3225+++lem23Unknown
lpg2407lpp2474lpc2069lpa3507Unknown
lpg2409lpp2476lpl2332lpc2067lpa3511lpw26241ceg29Unknown
lpg2410lpp2479lpl2334lpc2065lpa3513lpw26261vpdAPatatin domain
lpg2411lpp2480lpl2335lpc2064lpa3515lpw26281llo2227llb3158+++lem24Unknown
lpp2486F-box
lpg2416lpl2339lpc2057lpa3527lpw26351legA1Unknown
lpg2420lpl2343lpc2056lpa3529lpw26391Unknown
lpg2422lpp2487lpl2345lpc2055lpa3530lpw26401llo1650llb3763/64+++lem25Unknown
lpg2424lpp2489lpl2347lpc2053lpa3532lpw26421mavGUnknown
lpg2425lpp2491lpl2348lpc2051lpa3537lpw26431mavHUnknown
lpg2433lpp2500lpl2353lpc2043lpa3548lpw26521ceg30Unknown
lpg2434lpp2501lpl2355lpc2042lpa3550lpw26531Unknown
lpg2443lpp2510lpl2363lpc2033lpa3562Unknown
lpg2444lpp2511lpl2364lpc2032lpa3563lpw26641mavIUnknown
lpg2452lpp2517lpl2370lpc2026lpa3574lpw26701ankF/legA14/ceg31Ankyrin
lpg2456lpp2522lpl2375lpc2020lpa3583lpw26751llo0365llb1493+++ankD/legA15Ankyrin
lpg2461lpp2527lpl2380lpc2015lpa3589lpw26801llo1991llb3433+++Unknown
lpg2464lpl2384lpw26851sidM/drrAUnknown
lpg2465lpl2385lpw26861sidDUnknown
lpg2490lpp2555lpl2411lpc1987lpa3628lpw27131lepBCoiled-coil, Rab1 GAP
lpg2482lpp2546lpl2402lpc1996lpa3615lpw27041sdbBUnknown
lpg2498lpp2566lpl2420lpc1975lpa3646lpw27241mavJUnknown
lpg2504lpp2572lpl2426lpc1967lpa3658lpw27301llo2525llb2826+++sidI/ceg32Unknown
lpg2505lpp2573lpl2427lpc1966lpa3659lpw27311llo2526llb2825+++Unknown
lpg2508lpp2576lpl2430lpc1962/63*lpa3666lpw27341sdjAUnknown
lpg2509lpp2577lpl2431lpc1961lpa3667lpw27351llo3097llb2278+++sdeDUnknown
lpg2510lpp2578lpl2432lpc1960lpa3668llo3098llb2276+++sdcAUnknown
lpg2511lpp2579lpl2433lpc1959lpa3669lpw27371sidCPI(4)P binding domain
lpg2523lpw27501lem26Unknown
lpg2525mavKUnknown
lpg2526lpp2591lpl2446lpc1946lpa3687lpw27521mavLUnknown
lpg2527lpp2592lpl2447lpc1944lpa3688lpw27531llo3335llb2002+++Unknown
lpg2529lpp2594lpl2449lpc1942lpa3692lpw27551llo2238llb3146+++lem27Unknown
lpg2538lpp2604lpl2459lpc1930lpa3706lpw27671Unknown
lpg2539lpp2605lpl2460lpc1929lpa3707lpw27681llo1348llb0317+++Unknown
lpg2541lpp2607lpl2462lpc1927lpa3710lpw27701Unknown
lpg2546lpp2615lpc1919lpa3727lpw27791Unknown
lpg2552lpp2622lpl2473lpc1911lpa3738lpw27871llo1062llb0648+++Unknown
lpg2555lpp2625lpl2480lpc1908lpa3743lpw27901llo2220llb3170+++Unknown
lpg2556lpp2626lpl2481lpc1906lpa3745lpw27911llo2218llb3172+++legK3STPK
lpg2577lpp2629lpl2499lpc0570lpa3768lpw28241mavMUnknown
lpg2584lpp2637lpl2507lpc0561lpa3779lpw28321sidFUnknown
lpg2588lpp2641lpl2511lpc0557lpa3784lpw28361llo2622llb2718+++legS1Unknown
lpg2591lpp2644lpl2514lpc0551lpa3790lpw28391llo0626llb1219+++ceg33Unknown
lpg2603lpp2656lpl2526lpc0539lpa3807lpw28521lem28Unknown
lpg2628lpp2681lpl2553lpc0513lpa3846lpw28781Unknown
lpg2637lpp2690lpl2562lpc0503lpa3859lpw28871Unknown
lpg2638lpp2691lpl2563lpc0502lpa3861lpw28891llo2645llb2690+++mavVUnknown
lpg2692lpp2746lpl2619lpc0444lpa3929lpw29461Unknown
lpg2694lpp2748lpl2621lpc0442lpa3931lpw29481legD1Phyhd1 protein
lpg2718lpp2775lpl2646lpc0415lpa3966lpw29771wipAUnknown
lpg2720lpp2777lpl2648lpc0413lpa3968lpw29791legNcAMP-binding protein
lpg2744lpp2800lpl2669lpc0386lpa4004lpw30031Unknown
lpg2745lpp2801lpl2670lpc0385lpa4005lpw30041llo0308llb1553+++Unknown
lpg2793lpp2839lpl2708lpc3079lpa4063lpw30471lepAEffector protein A
lpg2804lpp2850lpl2719lpc3090lpa4076lpw30591llo0267llb1598+++lem29Unknown
lpg2815lpp2867lpl2730lpc3101lpa4089lpw30711llo0254llb1612+++mavNUnknown
lpg2826lpl2741lpc3113lpa4104lpw30831ceg34Unknown
lpg2828lpp2882lpl2743lpc3115lpa4109lpw30851llo0783llb0944+++Unknown
lpg2829lpp2883/86*lpw30861sidHUnknown
lpg2830lpp2887lpw30881lubX/legU2U-box motif
lpg2831lpp2888lpw30891VipDPatatin-like phopholipase
lpg2832lpp2889lpl2744lpc3116lpa4110lpw30921llo0214llb1656+++Putative hydrolase
lpg2844lpp2903lpl2756lpc3128lpa4133Unknown
lpg2862Lgt2/legC8Coiled-coil
lpg2874lpp2933lpl2787lpc3160lpa4176lpw31411Unknown
lpg2879lpp2938lpl2792lpc3165lpa4186lpw31471llo0192llb1681+++Unknown
lpg2884lpp2943lpl2797lpc3170lpa4193lpw31531llo0197llb1676+++Unknown
lpg2885lpp2944lpl2798lpc3171lpw31541Unknown
lpg2888lpp2947lpl2801lpc3174lpa4199lpw31571llo0200llb1672+++Unknown
lpg2912lpp2980lpl2830lpc3214lpa4255lpw31931Unknown
lpg2936lpp3004lpl2865lpc3243lpa4293lpw32251llo0081llb1804+++rRNA small subunit methyltransferase E
lpg2975lpp3047lpl2904lpc3290lpa4358−?llo3405llb1930+++Unknown
lpg2999lpp3071lpl2927lpc3315lpa4395lpw32851legPAstacin protease
lpg3000lpp3072lpl2928lpc3316lpa4397lpw32861llo3444llb1887+++Unknown

List of substrates is based on Isberg et al. (.

Distribution of 275 Dot/Icm substrates identified in strain . List of substrates is based on Isberg et al. (. Taken together the T2SS Lsp and the T4SS Dot/Icm are highly conserved between L. pneumophila and L. longbeachae. However, more than a third of the known L. pneumophila type II- and over 70% of type IV-dependent substrates differ between both species. These species specific, secreted effectors might be implicated in the different niche adaptations and host susceptibilities. Most interestingly, of the 98 L. pneumophila substrates conserved in L. longbeachae 87 are also present in all L. pneumophila strains sequenced to date. Thus, these 87 Dot/Icm substrates might be essential for intracellular replication of Legionella and represent a minimal toolkit for intracellular replication that has been acquired before the divergence of the two species.

Molecular Mimicry is a Major Virulence Strategy of L. pneumophila and L. longbeachae

The L. pneumophila genome sequence analysis has revealed that many of the predicted or experimentally verified Dot/Icm secreted substrates are proteins similar to eukaryotic proteins or contain motifs mainly or only found in eukaryotic proteins (Cazalet et al., 2004; De Felipe et al., 2005). Thus comparative genomics suggested that L. pneumophila encodes specific virulence factors that have evolved during its evolution with eukaryotic host cells such as fresh-water ameba (Cazalet et al., 2004). The protein-motifs predominantly found in eukaryotes, which were identified in the L. pneumophila genomes are ankyrin repeats, SEL1 (TPR), Set domain, Sec7, serine threonine kinase domains (STPK), U-box, and F-box motifs. Examples for eukaryotic like proteins of L. pneumophila are two secreted apyrases, a sphingosine-1-phosphate lyase and sphingosine kinase, eukaryotic like glycoamylase, cytokinin oxidase, zinc metalloprotease, or an RNA binding precursor (Cazalet et al., 2004; De Felipe et al., 2005; Bruggemann et al., 2006). Function prediction based on similarity searches suggested that many of these proteins are implicated in modulating host cell functions to the pathogens advantage (Cazalet et al., 2004). Recent functional studies confirm these predictions. As a first example, it was shown that L. pneumophila is able to interfere with the host ubiquitination pathway. The L. pneumophila U-box containing protein LubX was shown to be a secreted effector of the Dot/Icm secretion system that mediates polyubiquitination of a host kinase Clk1 (Kubori et al., 2008). Recently, LubX was described as the first example of an effector protein, which targets and regulates another effector within host cells, as it functions as an E3 ubiquitin ligase that hijacks the host proteasome to specifically target the bacterial effector protein SidH for degradation. Delayed delivery of LubX to the host cytoplasm leads to the shutdown of SidH within the host cells at later stages of infection. This demonstrates a sophisticated level of co-evolution between eukaryotic cells and L. pneumophila involving an effector that functions as a key regulator to temporally coordinate the function of a cognate effector protein (Kubori et al., 2010; Luo, 2011). Furthermore, AnkB/Lpp2028, one of the three F-box proteins of L. pneumophila, was shown to be a T4SS effector that is implicated in virulence of L. pneumophila and in recruiting ubiquitinated proteins to the LCV (Al-Khodor et al., 2008; Price et al., 2009; Habyarimana et al., 2010; Lomma et al., 2010). A second example is the apyrases (Lpg1905 and Lpg0971) encoded in the L. pneumophila genomes. Indeed, both are secreted enzymes important for intracellular replication of L. pneumophila. Lpg1905 is a novel prokaryotic ecto-NTPDase, similar to CD39/NTPDase1, which is characterized by the presence of five apyrase-conserved regions and enhances the replication of L. pneumophila in eukaryotic cells (Sansom et al., 2007). Apart from ATP and ADP, Lpg1905 also cleaves GTP and GDP with similar efficiency to ATP and ADP, respectively (Sansom et al., 2008). A third example is a L. pneumophila homolog of the highly conserved eukaryotic enzyme sphingosine-1-phosphate lyase (Spl). In eukaryotes, SPL is an enzyme that catalyzes the irreversible cleavage of sphingosine-1-phosphate (S1P). S1P is implicated in various physiological processes like cell survival, apoptosis, proliferation, migration, differentiation, platelet aggregation, angiogenesis, lymphocyte trafficking and development. Despite the fact that the function of the L. pneumophila Spl remains actually unknown, the hypothesis is that it plays a role in autophagy and/or apoptosis (Cazalet et al., 2004; Bruggemann et al., 2006). Recently it has been shown that the L. pneumophila Spl is a secreted effector of the Dot/Icm T4SS, that it is able to complement the sphingosine-sensitive phenotype of Saccharomyces cerevisiae. Moreover, L. pneumophila Spl co-localizes to the host cell mitochondria (Degtyar et al., 2009). Taken together, the many different functional studies undertaken based on the results of the genome sequence analyses deciphering the roles of the eukaryotic like proteins have clearly established that they are secreted virulence factors that are involved in host cell adhesion, formation of the LCV, modulation of host cell functions, induction of apoptosis and egress of Legionella (Nora et al., 2009; Hubber and Roy, 2010). Most of these effector proteins are expressed at different stages of the intracellular life cycle of L. pneumophila (Bruggemann et al., 2006) and are delivered to the host cell by the Dot/Icm T4SS. Thus molecular mimicry of eukaryotic proteins is a major virulence strategy of L. pneumophila. As expected, eukaryotic like proteins and proteins encoding domains mainly found in eukaryotic proteins are also present in the L. longbeachae genomes. However, between the two species a considerable diversity in the repertoire of these proteins exists. For example Spl, LubX, the three L. pneumophila F-box proteins, and the homolog of one (Lpg1905) of the two apyrases are missing in all sequenced L. longbeachae genomes. In contrast a glycoamylase (Herrmann et al., 2011) and an uridine kinase homolog are present also in L. longbeachae (Cazalet et al., 2010; Kozak et al., 2010; Table 3). However, other proteins encoded by the L. longbeachae genome contain U-box and F-box domains and might therefore fulfill similar functions. Thus, although the specific proteins may not be conserved, the eukaryotic like protein–protein interaction domains found in L. pneumophila are also present in L. longbeachae. The differences in trafficking between L. longbeachae and L. pneumophila mentioned above might be related to specific effectors encoded by L. longbeachae. A search for such specific putative effectors of L. longbeachae identified several proteins that might contribute to these differences like a family of Ras-related small GTPases (Cazalet et al., 2010; Kozak et al., 2010). These proteins may be involved in vesicular trafficking and thus may account at least partly for the specificities of the L. longbeachae life cycle. L. pneumophila is also known to exploit monophosphorylated host phosphoinositides (PI) to anchor the effector proteins SidC, SidM/DrrA, LpnE, and LidA to the membrane of the replication vacuole (Machner and Isberg, 2006; Murata et al., 2006; Weber et al., 2006, 2009; Newton et al., 2007; Brombacher et al., 2009). L. longbeachae may employ an additional strategy to interfere with the host PI as a homolog of the mammalian PI metabolizing enzyme phosphatidylinositol-4-phosphate 5-kinase was identified in its genome. One could speculate that this protein allows direct modulation of the host cell PI levels. Interestingly, although 23 of the 29 ankyrin proteins identified in the L. pneumophila strains are absent from the L. longbeachae genome, L. longbeachae encodes a total of 23 specific ankyrin repeat proteins (Table 3). For example, L. pneumophila AnkX/AnkN that was shown to interfere with microtubule-dependent vesicular transport is missing in L. longbeachae (Pan et al., 2008). However, L. longbeachae encodes a putative tubulin–tyrosine ligase (TTL). TTL catalyzes the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. Although the exact physiological function of alpha-tubulin has so far not been established, it has been linked to altered microtubule structure and function (Eiserich et al., 1999). Thus this protein might take over this function in L. longbeachae. Legionella longbeachae is the first bacterial genome encoding a protein containing an Src Homology 2 (SH2) domain. SH2 domains, in eukaryotes, have regulatory functions in various intracellular signaling cascades. Furthermore, L. longbeachae encodes two proteins with pentatricopeptide repeat (PPR) domains. This family seems to be greatly expanded in plants, where they appear to play essential roles in organellar RNA metabolism (Lurin et al., 2004; Nakamura et al., 2004; Schmitz-Linneweber and Small, 2008). Only 12 bacterial PPR domain proteins have been identified to date, all encoded by two species, the plant pathogens Ralstonia solanacearum and the facultative photosynthetic bacterium Rhodobacter sphaeroides. Thus, genome analysis revealed a particular feature of the Legionella genomes, the presence of many eukaryotic like proteins and protein domains, some of which are common to the two Legionella species, others which are specific and may thus account for the species specific features in intracellular trafficking and niche adaptation in the environment.

Surface Structures – A Clue to Mouse Susceptibility to Infection with Legionella

Despite the presence of many different species of Legionella in aquatic reservoirs, the vast majority of human disease is caused by a single serogroup (Sg) of a single species, namely L. pneumophila Sg1, which is responsible for about 84% of all cases worldwide (Yu et al., 2002). Similar results are obtained for L. longbeachae. Two serogroups are described, but L. longbeachae Sg1 is predominant in human disease. Lipopolysaccharide (LPS) is the basis for the classification of serogroups but it is also a major immunodominant antigen of L. pneumophila and L. longbeachae. Interestingly, it has also been shown that membrane vesicles shed by virulent L. pneumophila containing LPS are sufficient to inhibit phagosome–lysosome fusion (Fernandez-Moreira et al., 2006). Results obtained from large-scale genome comparisons of L. pneumophila suggested that LPS of Sg1 itself might be implicated in the predominance of Sg1 strains in human disease compared to other serogroups of L. pneumophila and other Legionella species (Cazalet et al., 2008). A comparative search for LPS coding regions in the genome of L. longbeachae NSW 150 identified two gene clusters encoding proteins that could be involved in production of lipopolysaccharide (LPS) and/or capsule. Neither shared homology with the L. pneumophila LPS biosynthesis gene cluster suggesting considerable differences in this major immunodominant antigen between the two Legionella species. However, homologs of L. pneumophila lipidA biosynthesis genes (LpxA, LpxB, LpxD, and WaaM) are present. Electron microscopy also demonstrated that, in contrast to L. pneumophila, L. longbeachae produces a capsule-like structure, suggesting that one of the aforementioned gene cluster encodes LPS and the other the capsule (Cazalet et al., 2010). As mentioned in the introduction, only A/J mice are permissive for replication of L. pneumophila, in contrast A/J, C57BL/6, and BALB/c mice are all permissive for replication of L. longbeachae. In C57BL/6 mice cytosolic flagellin of L. pneumophila triggers Naip5-dependent caspase-1 activation and subsequent proinflammatory cell death by pyroptosis rendering them resistant to infection (Diez et al., 2003; Wright et al., 2003; Molofsky et al., 2006; Ren et al., 2006; Zamboni et al., 2006; Lamkanfi et al., 2007; Lightfield et al., 2008). Genome analysis shed light on the reasons for these differences. L. longbeachae does not carry any flagellar biosynthesis genes except the sigma factor FliA, the regulator FleN, the two-component system FleR/FleS and the flagellar basal body rod modification protein FlgD (Cazalet et al., 2010; Kozak et al., 2010). Analysis of the genome sequences of strains L. longbeachae D-4968, ATCC33642, 98072, and C-4E7 as well as a PCR-based screening of 50 L. longbeachae isolates belonging to both serogroups by Kozak et al. (2010) and of 15 additional isolates by Cazalet et al. (2010) did not detect flagellar genes in any isolate confirming that L. longbeachae, in contrast to L. pneumophila does not synthesize flagella. Interestingly, all genes bordering flagellar gene clusters are conserved between L. longbeachae and L. pneumophila, suggesting deletion of these regions from the L. longbeachae genome. This result suggests, that L. longbeachae fails to activate caspase-1 due to the lack of flagellin, which may also partly explain the differences in mouse susceptibility to L. pneumophila and L. longbeachae infection. The putative L. longbeachae capsule may also contribute to this difference. Quite interestingly, although L. longbeachae does not encode flagella, it encodes a putative chemotaxis system. Chemotaxis enables bacteria to find favorable conditions by migrating toward higher concentrations of attractants. In many bacteria, the chemotactic response is mediated by a two-component signal transduction pathway, comprising a histidine kinase CheA and a response regulator CheY. Homologs of this regulatory system are present in the L. longbeachae genomes sequenced (Cazalet et al., 2010; Kozak et al., 2010). Furthermore, two homologs of the “adaptor” protein CheW that associate with CheA or cytoplasmic chemosensory receptors are present. Ligand-binding to receptors regulates the autophosphorylation activity of CheA in these complexes. The CheA phosphoryl group is subsequently transferred to CheY, which then diffuses away to the flagellum where it modulates motor rotation. Adaptation to continuous stimulation is mediated by a methyltransferase CheR. Together, these proteins represent an evolutionarily conserved core of the chemotaxis pathway, common to many bacteria and archea (Kentner and Sourjik, 2006; Hazelbauer et al., 2008). Homologs of all these proteins are present in the L. longbeachae genomes (Cazalet et al., 2010; Kozak et al., 2010) and a similar chemotaxis system is present in Legionella drancourtii LLAP12 (La Scola et al., 2004) but it is absent from L. pneumophila. The flanking genomic regions are highly conserved among L. longbeachae and all L. pneumophila strains sequenced, suggesting that L. pneumophila, although it encodes flagella has lost the chemotaxis system encoding genes by deletion events. Thus these two species differ markedly in their surface structures. L. longbeachae encodes a capsule-like structure, synthesizes a very different LPS, does not synthesize flagella but encodes a chemotaxis system. These differences in surface structures seem to be due to deletion events leading to the loss of flagella in L. longbeachae and the loss of chemotaxis in L. pneumophila leading in part to the adaptation to their different main niches, soil, and water.

Evolution of Eukaryotic Effectors – Acquisition by Horizontal Gene Transfer from Eukaryotes?

Human to human transmission of Legionella has never been reported. Thus humans have been inconsequential in the evolution of these bacteria. However, Legionella have co-evolved with fresh-water protozoa allowing the adaptation to eukaryotic cells. The idea that protozoa are training grounds for intracellular pathogens was born with the finding by Rowbotham (1980) that Legionella has the ability to multiply intracellularly. This lead to a new percept in microbiology: bacteria parasitize protozoa and can utilize the same process to infect humans. Indeed, the long co-evolution of Legionella with protozoa is reflected in its genome by the presence of eukaryotic like genes, many of which are clearly virulence factors used by L. pneumophila to subvert host functions. These genes may have been acquired either through horizontal gene transfer (HGT) from the host cells (e.g., aquatic protozoa) or from bacteria or may have evolved by convergent evolution. Recently it has been reported that L. drancourtii a relative of L. pneumophila has acquired a sterol reductase gene from the Acanthamoeba polyphaga Mimivirus genome, a virus that grows in ameba (Moliner et al., 2009). Thus, the acquisition of some of the eukaryotic like genes of L. pneumophila by HGT from protozoa is plausible. ralF was the first gene suggested to have been acquired by L. pneumophila from eukaryotes by HGT, as RalF carries a eukaryotic Sec 7 domain (Nagai et al., 2002). In order to study the evolutionary origin of eukaryotic L. pneumophila genes, we have undertaken a phylogenetic analysis of the eukaryote-like sphingosine-1-phosphate lyase of L. pneumophila that is encoded by lpp2128 described earlier. The phylogenetic analyses shown in Figure 4 revealed that it was most likely acquired from a eukaryotic organism early during Legionella evolution (Degtyar et al., 2009; Nora et al., 2009) as the Lpp2128 protein sequence of L. pneumophila clearly falls into the eukaryotic clade of SPL sequences.
Figure 4

Phylogenetic tree of a multiple sequence comparison of sphingosine-phosphate lyase proteins present in eukaryotic and prokaryotic genomes. Phylogenetic reconstruction was done with MEGA using the Neighbor-Joining method. Numbers indicate bootstrap values after 1000 bootstrap replicates. The red lines indicate the L. pneumophila sequences that are embedded in the eukaryotic clade. The bar at the bottom represents the estimated evolutionary distance.

Phylogenetic tree of a multiple sequence comparison of sphingosine-phosphate lyase proteins present in eukaryotic and prokaryotic genomes. Phylogenetic reconstruction was done with MEGA using the Neighbor-Joining method. Numbers indicate bootstrap values after 1000 bootstrap replicates. The red lines indicate the L. pneumophila sequences that are embedded in the eukaryotic clade. The bar at the bottom represents the estimated evolutionary distance. We then tested the hypothesis that L. longbeachae might have acquired genes also from plants, which is conceivable as it is found in soil. We thus undertook here a phylogenetic analysis similar to that described above for the L. longbeachae protein Llo2643 that contains PPR repeats, a protein family typically present in plants. A Blast search in the database revealed that homologs of Llo2643 are only found in eukaryotes, in particular in plants and algae. The only prokaryotes encoding this protein are the cyanobacteria Microcoelus vaginatus and Cylindrospermopsis rasiborskii. This rare presence in bacteria is suggestive of a horizontal transfer event from eukaryotes to these bacteria. Figure 5 shows the phylogenetic tree we obtained. The fact that the bacterial proteins group together may also be due to a phenomenon of long branch attraction. Thus, the Llo2643 protein of L. longbeachae appears closer to plant proteins than prokaryotic ones. Once more plant proteins, perhaps from algae, will be in the database, it might become possible to evaluate whether L. longbeachae indeed acquired genes from plants.
Figure 5

Phylogenetic tree of the protein Llo2643 and their homologs after blastp search. The tree was constructed by the Neighbor-joining method using the program MEGA. The red lines indicate the L. longbeachae sequences that are close to sequences derived from plant genomes. Numbers indicate bootstrap support for nodes from 1000 NJ bootstrap replicates. The bar at the bottom represents the estimated evolutionary distance.

Phylogenetic tree of the protein Llo2643 and their homologs after blastp search. The tree was constructed by the Neighbor-joining method using the program MEGA. The red lines indicate the L. longbeachae sequences that are close to sequences derived from plant genomes. Numbers indicate bootstrap support for nodes from 1000 NJ bootstrap replicates. The bar at the bottom represents the estimated evolutionary distance. Legionella is not the only prokaryote whose genome shows an enrichment of proteins with eukaryotic domains. Another example is the genome of “Ca. Amoebophilus asiaticus” a Gram-negative, obligate intracellular ameba symbiont belonging to the Bacteroidetes, which has been discovered within an ameba isolated from lake sediment (Schmitz-Esser et al., 2008) has been reported (Schmitz-Esser et al., 2010). In a recent report Schmitz-Esser et al. (2010) show that the genome of this organism also encodes an arsenal of proteins with eukaryotic domains. To further investigate the distribution of these protein domains in other bacteria the authors have undertaken an enrichment analysis comparing the fraction of all functional protein domains among 514 bacterial proteomes (Schmitz-Esser et al., 2010). This showed that the genomes of bacteria for which the replication in ameba has been demonstrated were enriched in protein domains that are predominantly found in eukaryotic proteins. Interestingly, the domains potentially involved in host cell interaction described above, such as ANK repeats, LRR, SEL1 repeats, and F- and U-box domains, are among the most highly enriched domains in proteomes of ameba-associated bacteria. Bacteria that can exploit amebae as hosts thus share a set of eukaryotic domains important for host cell interaction despite their different lifestyles and their large phylogenetic diversity. This suggests that bacteria thriving within ameba use similar mechanisms for host cell interaction to facilitate survival in the host cell. Due to the phylogenetic diversity of these bacteria, it is most likely that these traits were acquired independently during evolutionary early interaction with ancient protozoa.

Conclusion

Legionella pneumophila and L. longbeachae are two human pathogens that are able to modulate, manipulate, and subvert many eukaryotic host cell functions to their advantage, in order to enter, replicate, and evade protozoa or human alveolar macrophages during disease. In the last years genome analyses, as well as comparative and functional genomics have demonstrated that genome plasticity plays a major role in differences in host cell exploitation and niche adaptation of Legionella. The genomes of these environmental pathogens are shaped by HGT between eukaryotes and prokaryotes, allowing them to mimic host cell functions and to exploit host cell pathways. Genome plasticity and HGT lead in each strain and species to a different repertoire of secreted effectors that may allow subtle adaptations to, e.g., different protozoan hosts. Plasmids can be exchanged among strains and phages and deletions of surface structures like flagella or chemotaxis systems has taken place. Thus genome plasticity is major mechanism by which Legionella may adapt to different niches and hosts. Access to genomic data has revealed many potential virulence factors of L. pneumophila and L. longbeachae as well as metabolic capacities of these bacteria. The increasing information in the genomic database will allow a better identification of the origin and similarity of eukaryotic like proteins or eukaryotic protein domains and other virulence factors. New eukaryotic genomes like that of the natural host of Legionella, A. castellanii are in progress. These additional data will allow studying possible transfer events of genes from the eukaryotic host to Legionella more in depth. Taken together, the progressive increase of information on Legionella as well as on protozoa will allow more complete comparative and phylogenetic studies to shed light on the evolution of virulence in Legionella. However, much work remains to be done to translate the basic findings from genomics research into improved understanding of the biology of this organism. As data are accumulating, new fields of investigation will emerge. Without doubt the investigation and characterization of regulatory ncRNAs will be one such field. Manipulation of host-epigenetic information and investigating host susceptibility to disease will be another. In particular development of high throughput techniques for comparative and functional genomics as well as more and more powerful imaging techniques will accelerate the pace of knowledge acquisition.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  108 in total

1.  The genome of the amoeba symbiont "Candidatus Amoebophilus asiaticus" reveals common mechanisms for host cell interaction among amoeba-associated bacteria.

Authors:  Stephan Schmitz-Esser; Patrick Tischler; Roland Arnold; Jacqueline Montanaro; Michael Wagner; Thomas Rattei; Matthias Horn
Journal:  J Bacteriol       Date:  2009-12-18       Impact factor: 3.490

2.  Pathogen effector protein screening in yeast identifies Legionella factors that interfere with membrane trafficking.

Authors:  Nadim Shohdy; Jem A Efe; Scott D Emr; Howard A Shuman
Journal:  Proc Natl Acad Sci U S A       Date:  2005-03-21       Impact factor: 11.205

3.  A bacterial ecto-triphosphate diphosphohydrolase similar to human CD39 is essential for intracellular multiplication of Legionella pneumophila.

Authors:  Fiona M Sansom; Hayley J Newton; Sandra Crikis; Nicholas P Cianciotto; Peter J Cowan; Anthony J F d'Apice; Elizabeth L Hartland
Journal:  Cell Microbiol       Date:  2007-03-26       Impact factor: 3.715

4.  Virulence factors encoded by Legionella longbeachae identified on the basis of the genome sequence analysis of clinical isolate D-4968.

Authors:  Natalia A Kozak; Meghan Buss; Claressa E Lucas; Michael Frace; Dhwani Govil; Tatiana Travis; Melissa Olsen-Rasmussen; Robert F Benson; Barry S Fields
Journal:  J Bacteriol       Date:  2009-12-11       Impact factor: 3.490

5.  Legionella pneumophila proteins that regulate Rab1 membrane cycling.

Authors:  Alyssa Ingmundson; Anna Delprato; David G Lambright; Craig R Roy
Journal:  Nature       Date:  2007-10-21       Impact factor: 49.962

6.  Analysis of the Legionella longbeachae genome and transcriptome uncovers unique strategies to cause Legionnaires' disease.

Authors:  Christel Cazalet; Laura Gomez-Valero; Christophe Rusniok; Mariella Lomma; Delphine Dervins-Ravault; Hayley J Newton; Fiona M Sansom; Sophie Jarraud; Nora Zidane; Laurence Ma; Christiane Bouchier; Jerôme Etienne; Elizabeth L Hartland; Carmen Buchrieser
Journal:  PLoS Genet       Date:  2010-02-19       Impact factor: 5.917

7.  A specific genomic location within the icm/dot pathogenesis region of different Legionella species encodes functionally similar but nonhomologous virulence proteins.

Authors:  Michal Feldman; Gil Segal
Journal:  Infect Immun       Date:  2004-08       Impact factor: 3.441

8.  Legionella pneumophila goes clonal--Paris and Lorraine strain-specific risk factors.

Authors:  Paul H Edelstein; Joshua P Metlay
Journal:  Clin Infect Dis       Date:  2009-07-15       Impact factor: 9.079

9.  New endemic Legionella pneumophila serogroup I clones, Ontario, Canada.

Authors:  Nathalie Tijet; Patrick Tang; Mya Romilowych; Carla Duncan; Victoria Ng; David N Fisman; Frances Jamieson; Donald E Low; Cyril Guyard
Journal:  Emerg Infect Dis       Date:  2010-03       Impact factor: 6.883

10.  Molecular mimicry by an F-box effector of Legionella pneumophila hijacks a conserved polyubiquitination machinery within macrophages and protozoa.

Authors:  Christopher T Price; Souhaila Al-Khodor; Tasneem Al-Quadan; Marina Santic; Fabien Habyarimana; Awdhesh Kalia; Yousef Abu Kwaik
Journal:  PLoS Pathog       Date:  2009-12-24       Impact factor: 6.823

View more
  68 in total

1.  Vacuolar pathogens value membrane integrity.

Authors:  Craig R Roy
Journal:  Proc Natl Acad Sci U S A       Date:  2012-02-16       Impact factor: 11.205

2.  Whole-genome sequence of the human pathogen Legionella pneumophila serogroup 12 strain 570-CO-H.

Authors:  Francisco Amaro; Jack A Gilbert; Sarah Owens; William Trimble; Howard A Shuman
Journal:  J Bacteriol       Date:  2012-03       Impact factor: 3.490

3.  Life Stage-specific Proteomes of Legionella pneumophila Reveal a Highly Differential Abundance of Virulence-associated Dot/Icm effectors.

Authors:  Philipp Aurass; Thomas Gerlach; Dörte Becher; Birgit Voigt; Susanne Karste; Jörg Bernhardt; Katharina Riedel; Michael Hecker; Antje Flieger
Journal:  Mol Cell Proteomics       Date:  2015-11-06       Impact factor: 5.911

Review 4.  Targeting of host organelles by pathogenic bacteria: a sophisticated subversion strategy.

Authors:  Pedro Escoll; Sonia Mondino; Monica Rolando; Carmen Buchrieser
Journal:  Nat Rev Microbiol       Date:  2015-11-23       Impact factor: 60.633

5.  Genomic analysis of 38 Legionella species identifies large and diverse effector repertoires.

Authors:  David Burstein; Francisco Amaro; Tal Zusman; Ziv Lifshitz; Ofir Cohen; Jack A Gilbert; Tal Pupko; Howard A Shuman; Gil Segal
Journal:  Nat Genet       Date:  2016-01-11       Impact factor: 38.330

6.  Study of Legionella Effector Domains Revealed Novel and Prevalent Phosphatidylinositol 3-Phosphate Binding Domains.

Authors:  Nimrod Nachmias; Tal Zusman; Gil Segal
Journal:  Infect Immun       Date:  2019-05-21       Impact factor: 3.441

7.  Identification of novel Coxiella burnetii Icm/Dot effectors and genetic analysis of their involvement in modulating a mitogen-activated protein kinase pathway.

Authors:  Ziv Lifshitz; David Burstein; Kierstyn Schwartz; Howard A Shuman; Tal Pupko; Gil Segal
Journal:  Infect Immun       Date:  2014-06-23       Impact factor: 3.441

8.  The Legionella longbeachae Icm/Dot substrate SidC selectively binds phosphatidylinositol 4-phosphate with nanomolar affinity and promotes pathogen vacuole-endoplasmic reticulum interactions.

Authors:  Stephanie Dolinsky; Ina Haneburger; Adam Cichy; Mandy Hannemann; Aymelt Itzen; Hubert Hilbi
Journal:  Infect Immun       Date:  2014-07-14       Impact factor: 3.441

9.  Beyond Rab GTPases Legionella activates the small GTPase Ran to promote microtubule polymerization, pathogen vacuole motility, and infection.

Authors:  Hubert Hilbi; Eva Rothmeier; Christine Hoffmann; Christopher F Harrison
Journal:  Small GTPases       Date:  2014

10.  Eliminating Legionella by inhibiting BCL-XL to induce macrophage apoptosis.

Authors:  Mary Speir; Kate E Lawlor; Stefan P Glaser; Gilu Abraham; Seong Chow; Adam Vogrin; Keith E Schulze; Ralf Schuelein; Lorraine A O'Reilly; Kylie Mason; Elizabeth L Hartland; Trevor Lithgow; Andreas Strasser; Guillaume Lessene; David C S Huang; James E Vince; Thomas Naderer
Journal:  Nat Microbiol       Date:  2016-02-24       Impact factor: 17.745

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.