| Literature DB >> 29097994 |
Ya Zhang1, Masaaki Kitajima2, Andrew J Whittle3, Wen-Tso Liu1.
Abstract
The occurrence of pathogenic bacteria in drinking water distribution systems (DWDSs) is a major health concern, and our current understanding is mostly related to pathogenic species such as Legionella pneumophila and Mycobacterium avium but not to bacterial species closely related to them. In this study, genomic-based approaches were used to characterize pathogen-related species in relation to their abundance, diversity, potential pathogenicity, genetic exchange, and distribution across an urban drinking water system. Nine draft genomes recovered from 10 metagenomes were identified as Legionella (4 draft genomes), Mycobacterium (3 draft genomes), Parachlamydia (1 draft genome), and Leptospira (1 draft genome). The pathogenicity potential of these genomes was examined by the presence/absence of virulence machinery, including genes belonging to Type III, IV, and VII secretion systems and their effectors. Several virulence factors known to pathogenic species were detected with these retrieved draft genomes except the Leptospira-related genome. Identical clustered regularly interspaced short palindromic repeats-CRISPR-associated proteins (CRISPR-Cas) genetic signatures were observed in two draft genomes recovered at different stages of the studied system, suggesting that the spacers in CRISPR-Cas could potentially be used as a biomarker in the monitoring of Legionella related strains at an evolutionary scale of several years across different drinking water production and distribution systems. Overall, metagenomics approach was an effective and complementary tool of culturing techniques to gain insights into the pathogenic characteristics and the CRISPR-Cas signatures of pathogen-related species in DWDSs.Entities:
Keywords: CRISPR; Legionella; Leptospira; Mycobacterium; Parachlamydia; drinking water distribution systems; genomic analysis; virulence
Year: 2017 PMID: 29097994 PMCID: PMC5654357 DOI: 10.3389/fmicb.2017.02036
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Flowing chart on the analysis of metagenomic data for the reconstruction of draft genomes and the identification of genetic signatures.
Figure 2Detected potential pathogens and eukaryotes (nematodes, amoebae, fungi, and flagellates) by 16S rRNA gene amplicon analysis (1), SSU genes extracted from metagenomes (2), ddPCR (3), and real-time PCR (4). Here, relative abundance was reported. The concentration of specific pathogens determined with ddPCR was summarized in Table S2. We divided the samples into water and biofilm phases.
General features of the recovered genomes of pathogen-related species.
| BC.3.64 | BC | 94.44 | 30.13 | 62 | 2.27E+06 | 40.1 | 2112 | 5 | 31,419 | 150,921 | |
| BC.3.72 | BC | 94.51 | 23.78 | 22 | 1.95E+06 | 40.6 | 1829 | 11 | 74,242 | 336,208 | |
| FW.3.37 | FW | 94.15 | 27.68 | 63 | 2.10E+06 | 40.3 | 1926 | 14 | 18,840 | 221,613 | |
| DS3.009 | DS3 | 98.83 | 45.78 | 140 | 3.36E+06 | 39.4 | 3159 | 39 | 16,314 | 165,891 | |
| DS1.3.26 | DS1 | 99.86 | 79.34 | 217 | 7.43E+06 | 66.8 | 6689 | 64 | 16,573 | 250,869 | |
| DS2.013 | DS2 | 99.86 | 23.74 | 219 | 7.96E+06 | 66.5 | 7334 | 77 | 15,428 | 244,689 | |
| PR.002 | PR | 89.12 | 451.94 | 919 | 6.78E+06 | 67.0 | 6179 | 120 | 4,016 | 89,735 | |
| BC.030 | BC | 100.00 | 24.81 | 39 | 3.04E+06 | 41.5 | 2763 | 15 | 54,962 | 289,998 | |
| FW.030 | FW | 95.88 | 15.42 | 114 | 3.73E+06 | 35.1 | 3613 | 19 | 15,672 | 307,203 |
Figure 3Phylogenomic tree of recovered draft genomes constructed based on up to 400 conserved protein sequences. (A) Legionella; (B) Mycobacterium; (C) Leptospira; (D) Parachlamydia. The nine draft genomes recovered from this study were bold. Known pathogenic species were shaded with gray. Scale bar, 1 expected substitutions per site.
Figure 4VFs identified with the draft genomes recovered in this study and related genomes from public databases. VFs were grouped based on general categories (secretion systems and associated effectors, attachment and invasion, endotoxin, amino acid biosynthesis, and others). The genomes were organized by their taxonomic affiliations. There were some shared VFs among different genera, including T2SS among Legionella, Leptospira, and Parachlamydia, the mip gene between Legionella and Parachlamydia, and the mce gene between Mycobacterium and Leptospira. The accession number of known genomes were listed in Table S3.
Figure 5CRISPR-Cas loci identified in the draft genomes recovered in this study and known genomes of Legionella. They were organized according to the subtypes (Type I-F, II-B, and I-C) of CRISPR-Cas loci.
Prophages identified in the retrieved draft genomes.
| BC.3.64 | R1 | 9.5 | ||
| R2 | 31.1 | |||
| FW.3.37 | R1 | 9.5 | ||
| R2 | 26.1 | |||
| DS3.009 | R1 | 37.0 | ||
| R2 | 23.5 | |||
| DS1.3.26 | R1 | 19.0 | ||
| DS2.103 | R1 | 28.3 | ||
| R2 | 12.2 | Molluscum contagiosum virus subtype 1 | ||
| R3 | 27.7 | |||
| R4 | 31.6 | |||
| R5 | 40.1 | |||
| PR.002 | R1 | 17.2 | ||
| R2 | 37.1 | |||
| FW.030 | R1 | 29.9 | ||
| BC.030 | R1 | 19.3 |
Potential targets of CRISPR-Cas spacers in Legionella-related genomes.
| BC.3.64 | Sp6 | Marine metagenome genome assembly TARA_030_DCM_0.22 (CENH01030675) | 27 | 5 | GG | 8 |
| Lgp | Chrm_Sp23 | 35 | 1 | GG | 6 | |
| Chrm_Sp35 | 27 | 5 | GG | 3 | ||
| Plsm_Sp22 | Activated sludge metagenome contig16020 (AERA01015926) | 37 | 0 | GG | – | |
| Plsm_Sp46 | 35 | 1 | GG | 7 | ||
| Plsm_Sp12 | 31 | 3 | GG | 7 | ||
| Plsm_Sp12 | 31 | 3 | GG | 7 | ||
| Plsm_Sp10 | 30 | 1 | Not match | N/A | ||
| Plsm_Sp8 | Uncultured marine Microviridae clone SOG3-01 major capsid protein gene, partial cds (KC131005) | 29 | 4 | GG | 1 | |
| Plsm_Sp47 | Activated sludge metagenome contig16020 (AERA01015926) | 29 | 4 | GG | – | |
| Plsm_Sp50 | Marine metagenome 1096626097875, whole genome shotgun sequence (AACY023989113) | 29 | 4 | GG | 5 | |
| Plsm_Sp7 | Activated sludge metagenome contig06523 (AERA01006474) | 29 | 5 | GG | 3, 5 | |
| Plsm_Sp13 | 26 | 3 | Not match | N/A | ||
| Plsm_Sp32 | 24 | 4 | Not match | N/A | ||
| Plsm_Sp7 | 24 | 4 | Not match | N/A | ||
| Lgp Alcoy | Sp32 | Uncultured Gokushovirinae clone WSBWG10n1 major capsid protein gene (KF689311) | 31 | 3 | GG | 8 |
| Sp28 | Marine metagenome genome assembly TARA_122_SRF_0.1-0.22 (CETN01079705) | 29 | 4 | GG | – | |
| Sp3 | 26 | 3 | Not match | N/A | ||
| Lgp Paris | Sp33 | Activated sludge metagenome contig28417 (AERA01027227) | 37 | 3 | CCA | 6, 9 |
| Sp4 | 29 | 5 | CCA | – | ||
| Sp15 | 28 | 3 | Not match | N/A | ||
| Sp14 | 28 | 4 | Not match | N/A | ||
| Lgp 130b | Sp40 | 37 | 0 | CCA | – | |
| Sp41 | Hypersaline lake metagenome ctg7180000052828 (APHM01003927) | 30 | 5 | CCA | 10 | |
| Sp27 | 30 | 2 | Not match | N/A | ||
| Sp27 | 30 | 2 | Not match | N/A |
Lgp, Lg. pneumophila;
PAMs, protospacer adjacent motifs.