| Literature DB >> 34668737 |
Sho Shimada1,2, Ryosuke Nakai3, Kotaro Aoki1, Sakae Kudoh4,5, Satoshi Imura4,5, Norifumi Shimoeda6, Giichiro Ohno7, Kentaro Watanabe4, Yasunari Miyazaki2, Yoshikazu Ishii1, Kazuhiro Tateda1.
Abstract
Culture-independent analysis shows that Legionella spp. inhabit a wide range of low-temperature environments, but to date, no psychrotolerant or psychrophilic strains have been reported. Here, we characterized the first cultivated psychrotolerant representative, designated strain TUM19329T, isolated from an Antarctic lake using a polyphasic approach and comparative genomic analysis. A genome-wide phylogenetic tree indicated that this strain was phylogenetically separate at the species level. Strain TUM19329T shared common physiological traits (e.g., Gram-negative, limited growth on buffered charcoal-yeast extract α-ketoglutarate [BCYEα] agar with l-cysteine requirements) with its relatives, but it also showed psychrotolerant growth properties (e.g., growth at 4°C to 25°C). Moreover, this strain altered its own cellular fatty acid composition to accumulate unsaturated fatty acid at a lower temperature, which may help maintain the cell membrane fluidity. Through comparative genomic analysis, we found that this strain possessed massive mobile genetic elements compared with other species, amounting to up to 17% of the total genes. The majority of the elements were the result of the spread of only a few insertion sequences (ISs), which were spread throughout the genome by a "copy-and-paste" mechanism. Furthermore, we found metabolic genes, such as fatty acid synthesis-related genes, acquired by horizontal gene transfer (HGT). The expansion of ISs and HGT events may play a major role in shaping the phenotype and physiology of this strain. On the basis of the features presented here, we propose a new species-Legionella antarctica sp. nov.-represented by strain TUM19329T (= GTC 22699T = NCTC 14581T). IMPORTANCE This study characterized a unique cultivated representative of the genus Legionella isolated from an Antarctic lake. This psychrotolerant strain had some common properties of known Legionella species but also displayed other characteristics, such as plasticity in fatty acid composition and an enrichment of mobile genes in the genome. These remarkable properties, as well as other factors, may contribute to cold hardiness, and this first cultivated cold-tolerant strain of the genus Legionella may serve as a model bacterium for further studies. It is worth noting that environmentally derived 16S rRNA gene phylotypes closely related to the strain characterized here have been detected from diverse environments outside Antarctica, suggesting a wide distribution of psychrotolerant Legionella bacteria. Our culture- and genome-based findings may accelerate the ongoing studies of the behavior and pathogenicity of Legionella spp., which have been monitored for many years in the context of public health.Entities:
Keywords: Legionella; cellular fatty acids; comparative genomic analysis; horizontal gene transfer; insertion sequence; mobile genetic elements; psychrotolerant
Mesh:
Substances:
Year: 2021 PMID: 34668737 PMCID: PMC8528123 DOI: 10.1128/Spectrum.00424-21
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
FIG 1Tree inferred with FastME v2.1.6.1 from whole-proteome-based Genome BLAST Distance Phylogeny (GBDP) distances. The branch lengths are scaled via the GBDP distance formula d5. Branch values are GBDP pseudobootstrap support values of >60% from 100 replications, with an average branch support of 92.1%. The trees were rooted at the midpoint. The scale bar indicates the branch length value. The strain names of each species are listed in Table 2.
Summary of the basic genomic data of the 17 Legionella spp. compared in this study
| Taxon name | Strain name | GenBank accession ID | No. of contigs | Size (Mbp) | GC (%) | No. of CDS | No. of rRNA | No. of tRNA |
|---|---|---|---|---|---|---|---|---|
|
| TUM19329T |
| 1 | 3.75 | 39.15 | 3,608 | 9 | 42 |
|
| WIGAT |
| 7 | 4.23 | 37.97 | 3,761 | 9 | 46 |
|
| 72-OH-HT |
| 2 | 4.1 | 36.86 | 3,498 | 9 | 46 |
|
| LLAP-10T |
| 3 | 4.43 | 38.32 | 3,748 | 12 | 46 |
|
| Lansing 2T |
| 2 | 3.57 | 38.96 | 3,183 | 12 | 43 |
|
| BL 540T |
| 1 | 3.13 | 41.74 | 2,865 | 12 | 44 |
|
| 1677-MI-HT |
| 1 | 2.99 | 40.7 | 2,703 | 12 | 43 |
|
| Longbeach 4T |
| 38 | 4.13 | 37 | 3,554 | 5 | 42 |
|
| TATLOCKT |
| 1 | 3.31 | 40.46 | 2,953 | 9 | 43 |
|
| 316-36T |
| 2 | 3.83 | 40.12 | 3,212 | 9 | 41 |
| Philadelphia-1 |
| 1 | 3.41 | 38.33 | 3,021 | 9 | 43 | |
|
| 200/83-1335T |
| 5 | 4.28 | 39.06 | 3,611 | 9 | 43 |
|
| MSH-4T |
| 1 | 4.16 | 37.06 | 3,746 | 12 | 49 |
|
| 214T |
| 76 | 3.51 | 41.56 | 2,933 | 4 | 40 |
|
| 1087-AZ-HT |
| 12 | 3.36 | 37.41 | 2,967 | 3 | 41 |
|
| Wadsworth 81-716AT |
| 2 | 3.6 | 38.08 | 3,189 | 11 | 43 |
|
| 95/83-1347T |
| 2 | 3.16 | 40.4 | 2,705 | 9 | 41 |
FIG 2Proliferative potential of strain TUM19329T in culture medium and eukaryotic cells. (a) Growth curves of strain TUM19329T in BYE broth at different temperatures. (b) Intracellular growth of strain TUM19329T and L. pneumophila Philadlphia-1 in an Acanthamoeba sp. The light micrograph images were obtained by Gimenez staining at each time point. All images were taken at ×1,000 magnification. The yellow arrows indicate bacterial cells. The experiment was conducted twice in triplicate.
CFA compositions of strain TUM19329T and the most closely related strain, LLAP-10T
| Fatty acid or characteristic | Data for strain: | ||
|---|---|---|---|
| TUM19329T | LLAP-10T | ||
| Cultured temp (°C) | 10 | 25 | 30 |
| iso-C13:0 | – | T | – |
| anteiso-C13:0 | – | T | – |
| iso-C14:0 | 1.3 | 3.6 | 4-5 |
| C14:1 | 1.1 | 1.2 | – |
| C14:0 | 1.1 | T | 2-3 |
| C13:0 3-OH / C15:1 iso H (summed feature 1) | – | T | ND |
| iso-C15:0 | 1.5 | 3.9 | T |
| anteiso-C15:0 | 19.9 | 36.2 | 12–15 |
| C15:1ω6c | 1.6 | 1.6 | 3–4 |
| C15:0 | – | – | 2–3 |
| C14:0 3-OH | T | T | – |
| iso-C16:0 | 9.0 | 15.2 | 11–13 |
| C16:1 | 53.7 | 27.2 | 21–27 |
| C16:0 | 4.2 | 1.7 | 15–19 |
| C15:0 2-OH | – | T | – |
| iso-C17:1 | T | T | ND |
| iso-C17:0 | T | T | – |
| anteiso-C17:0 | 3.0 | 4.1 | 2–3 |
| C17:0 cyc | – | – | 2–6 |
| C17:0 | – | T | 2 |
| iso-C18:0 | – | T | T |
| C18:0 | 1.8 | 1.0 | 6-7 |
| C18:0 3-OH | – | T | ND |
| C19:0 cyc | – | T | ND |
| C19:0 | – | – | 1–2 |
| anteiso-C19:0 | – | – | T |
| C20:0 | T | T | 1–2 |
Values are the percentage of total fatty acids. T, trace (∼0.9%); –, not detected; ND, no data. Data of strain LLAP-10T from Adeleke et al. (9). Note that the psychrotolerant Antarctic strain TUM19329T from this study does not grow above 30°C.
This is given as the value of C16:1 ω7c in the previous study.
FIG 3Comparison of the amino acid and codon composition of 17 Legionella species. (a) Coefficient of variation of the amino acid compositions between the compared Legionella species. (b) An amino acid and codon usage heatmap based on the percentages in each of the genomes of the compared Legionella species. A heatmap describing the details of each codon on the horizontal axis is available in Figshare (see “Data Availability”). The strain names of each species are listed in Table 2. (c) Relative synonymous codon usage (RSCU) in TUM19329T and L. fallonii. Stop codons were excluded from the analysis. Codons of amino acids in the x axis were sorted based on the RSCU values of TUM19329T. The amino acids translated by codon are shown in parentheses following the codon. The arrows point to the synonymous codon for leucine, which showed the most usage bias between the two species.
FIG 4Number of protein coding genes classified into each functional COG category. Genes belonging to each category are colored as core, shared (genes shared between at least one species), or unique. Protein coding genes not annotated with any functional category (729 CDS, 20%) or annotated with multiple COG functional categories (46 CDS, 1.2%) were excluded. COG, clusters of orthologous groups.
FIG 5Phylogeny, structure, and distribution of the insertion sequences in the genome of TUM19329T. (a) Maximum-likelihood tree showing the relationship between the genes classified into the four COG IDs with the highest number of genes, COG2801, COG2963, COG1484, and COG4584. The tree was rooted using midpoint rooting. The scale bar represents the branch length value. Numbers along branches are bootstrap values based on 100 replicates. The genes in each COG ID were grouped in three major clusters. Detailed versions of each tree are available in Figshare (see “Data Availability”). (b) Each of the 12 clusters appeared adjacent to each other in a specific pair in the genome, and analysis, which included the up- and downstream sequences, revealed six IS structures (ISLa1 to ISLa6) that appeared repeatedly. (c) The distribution of ISLa1 to ISLa6 in the genome of TUM19329T.
Descriptions of the six IS structures with particularly high copy numbers
| IS (length [bp]) | No. of full-length copies | Gene | Description of the gene (COG ID) | BLAST similarity search | Putative HGT | ||
|---|---|---|---|---|---|---|---|
| Host of the gene | Query coverage (%) | Identity (%) | |||||
| IS | 93 | Cluster 4 (285) | Hypothetical protein (COG2963) | 96 | 78.02 | No | |
|
| 96 | 72.53 | |||||
| 97 | 70.65 | ||||||
| Cluster 1 (846) | Transposase InsF for insertion sequence IS | 100 | 98.58 | Yes | |||
| 98 | 82.31 | ||||||
|
| 98 | 82.31 | |||||
| IS | 33 | Cluster 5 (285) | Hypothetical protein (COG2963) | 100 | 77.66 | Yes | |
| 98 | 59.14 | ||||||
|
| 94 | 55.06 | |||||
| Cluster 2 (837) | Transposase for insertion sequence element IS | 99 | 65.34 | Yes | |||
| 97 | 66.05 | ||||||
| 97 | 65.56 | ||||||
| (258) | Hypothetical protein (not annotated) | “ | 58 | 49.06 | No | ||
| “ | 31 | 70.37 | |||||
| “ | 28 | 70.83 | |||||
| IS | 6 | Cluster 6 (291) | Transposase InsN for insertion sequence element IS | 100 | 98.58 | Yes | |
| 98 | 82.31 | ||||||
|
| 98 | 82.31 | |||||
| Cluster 3 (876) | Insertion element IS | 97 | 95.09 | Yes | |||
| 100 | 73.29 | ||||||
|
| 100 | 73.04 | |||||
| IS | 17 | Cluster 7 (1,512) | Transposase (COG1484) | 97 | 61.05 | Yes | |
|
| 100 | 56.26 | |||||
|
| 100 | 56.26 | |||||
| Cluster 10 (729) | IS | 100 | 65.29 | Yes | |||
|
| 100 | 61.57 | |||||
| 99 | 58.51 | ||||||
| IS | 13 | Cluster 8 (1,533) | Putative transposase for insertion sequence element IS | 100 | 98.24 | No | |
| 99 | 73.87 | ||||||
| 99 | 73.67 | ||||||
| Cluster 11 (756) | Putative ATP-binding protein in insertion sequence (COG4584) | 100 | 98.8 | Yes | |||
| 100 | 82.47 | ||||||
| “ | 97 | 77.96 | |||||
| IS | 7 | Cluster 9 (1,551) | Putative transposase for insertion sequence IS | 100 | 74.08 | Yes | |
| 98 | 53.24 | ||||||
| 98 | 51.95 | ||||||
| Cluster 12 (738) | Insertion sequence IS | 100 | 82.86 | Yes | |||
| 97 | 56.25 | ||||||
| “ | 94 | 57.94 | |||||
Cluster number defined based on the results of phylogenetic analysis of the same COG ID (see Fig. 5a).
The results of the top three hits of the BLASTP similarity search for each gene.
Genes presumed to have been acquired by HGT using HGTector.
Genes interrupted by ISLa1 to ISLa6
| Genes | COG ID | Product | IS interrupting the gene |
|---|---|---|---|
| TUM19329_00019, TUM19329_00022 | L:COG0507 | Hypothetical protein | IS |
| TUM19329_00269, TUM19329_00272 | V:COG0286 | Site-specific DNA-methyltransferase (adenine-specific) | IS |
| TUM19329_00485, TUM19329_00488 | G:ENOG410XNQK | Arabinose-proton symporter | IS |
| TUM19329_00643, TUM19329_00646 | S:COG0121 | Putative glutamine amidotransferase | IS |
| TUM19329_00913, TUM19329_00917 | I:COG3243 | Poly(3-hydroxyalkanoate) polymerase | IS |
| TUM19329_00959, TUM19329_00962 | I:COG1960 | Acyl-CoA dehydrogenase family member 11 | IS |
| TUM19329_01188, TUM19329_01191 | V:COG1680 | Serine-type | IS |
| TUM19329_01427, TUM19329_01430 | T:ENOG410XNMH | Histidine kinase | IS |
| TUM19329_01576, TUM19329_01580, TUM19329_01584 | T:ENOG410XNMH | Histidine kinase | IS |
| TUM19329_01604, TUM19329_01607 | S:ENOG4112B5D | Hypothetical protein | IS |
| TUM19329_01615, TUM19329_01618 | S:ENOG4112A90 | Protein TraM | IS |
| TUM19329_01623, TUM19329_01626 | L:COG2189 | Site-specific DNA-methyltransferase (adenine-specific) | IS |
| TUM19329_01804, TUM19329_01807 | I:COG1835 | IS | |
| TUM19329_02422, TUM19329_02428 | M:COG2230 | Cyclopropane-fatty-acyl-phospholipid synthase | IS |
| TUM19329_02454, TUM19329_02458 | S:COG1988 | Hypothetical protein | IS |
| TUM19329_02485, TUM19329_02491 | M:COG0438 | 1,4-alpha-glucan branching enzyme | IS |
| TUM19329_02509, TUM19329_02512 | S:COG3173 | Hypothetical protein | IS |
| TUM19329_02618, TUM19329_02621 | E:COG0028 | Acetolactate synthase | IS |
| TUM19329_03112, TUM19329_03115 | S:ENOG410XUV4 | Protein BcsG-like protein | IS |
| TUM19329_03264, TUM19329_03267 | I:COG1597 | Undecaprenyl-diphosphate phosphatase | IS |
Mobile genes interrupted by ISs are excluded from the list.