| Literature DB >> 30787911 |
Utkarsh Sood1,2, Princy Hira1, Roshan Kumar1,2,3, Abhay Bajaj1,4, Desiraju Lakshmi Narsimha Rao5, Rup Lal1,2, Mallikarjun Shakarad1.
Abstract
Genomic information for outlier strains of Pseudomonas aeruginosa is exiguous when compared with classical strains. We sequenced and constructed the complete genome of an environmental strain CR1 of P. aeruginosa and performed the comparative genomic analysis. It clustered with the outlier group, hence we scaled up the analyses to understand the differences in environmental and clinical outlier strains. We identified eight new regions of genomic plasticity and a plasmid pCR1 with a VirB/D4 complex followed by trimeric auto-transporter that can induce virulence phenotype in the genome of strain CR1. Virulence genotype analysis revealed that strain CR1 lacked hemolytic phospholipase C and D, three genes for LPS biosynthesis and had reduced antibiotic resistance genes when compared with clinical strains. Genes belonging to proteases, bacterial exporters and DNA stabilization were found to be under strong positive selection, thus facilitating pathogenicity and survival of the outliers. The outliers had the complete operon for the production of vibrioferrin, a siderophore present in plant growth promoting bacteria. The competence to acquire multidrug resistance and new virulence factors makes these strains a potential threat. However, we identified major regulatory hubs that can be used as drug targets against both the classical and outlier groups.Entities:
Keywords: Pseudomonas aeruginosa; drug targets; environmental genomics; outliers; positive selection
Year: 2019 PMID: 30787911 PMCID: PMC6372532 DOI: 10.3389/fmicb.2019.00053
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1General genomic attributes of Pseudomonas aeruginosa CR1. (A) Circular map of chromosome 1 of CR1 strain. From outside to inside Ring 1: COG categories of ORFs in the positive strand. Ring 2: COG categories of ORFs in the negative strand. Ring 3: Regions of Genomic plasticity starting from zero bp mark in a clockwise direction. Ring 4: BLASTn alignment (expected threshold = 1e-220) between CR1 and previously sequenced outlier PA7 (teal); Ring 5: PAO1 (aqua); Ring 6: UCBPP-PA14 (orange). Rings 7 and 8: GC content and GC skew, respectively. (B) Circular map of plasmid pCR1. From outside to inside Rings 1, 2, 3, and 4 depicts COG categories of ORFs of the positive strand, genes on positive strand, genes on negative strand and COG categories of ORFs of negative strand, respectively. Rings 5 and 6 represents GC content and GC skew, respectively.
General genomic attributes of 14 outlier strains of P. aeruginosa: ANIm values were calculated between CR1 and other outliers.
| Strain | Origin (Country) | Genome size | Number of contigs | Genes | Sequence type | tRNAs | rRNAs (5S, 16S, 23S) | GC content (%) | CRISPR arrays | Prophage regions | Completeness (Genes/%) | ANIm (%) | DDH∗(%) | Reference |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PA7 | Burn wound (Argentina) | 6588339 | 1 | 6302 | 1195 | 75 | 4,4,4 | 66.5 | 3 | 4 | 107 (100) | 98.97 | 91.1 | |
| CR1 | Rhizosphere (India) | 6164858 | 2 | 5838 | 3198 | 76 | 4,4,4 | 66.8 | 5 | 2 | 107 (100) | 100 | 100 | This study |
| ATCC 9027 | Otitis (Australia) | 6362326 | 80 | 6121 | 2230 | 69 | 1,1,1 | 66.6 | 1 | 2 | 106 (99.06) | 99.33 | 93.9 | |
| AZPAE14901 | Abcess/Pus (India) | 6881448 | 232 | 6709 | 2212 | 63 | 1,1,1 | 66.2 | 5 | 2 | 105 (98.13) | 99.32 | 86.6 | |
| AZPAE14941 | Peritoneal (Hong Kong) | 6881480 | 183 | 6722 | Not Assigned | 66 | 2,1,1 | 66.0 | 3 | 2 | 105 (98.13) | 98.94 | 85.7 | |
| AZPAE15042 | Urinary (Germany) | 6636169 | 143 | 6470 | 2211 | 69 | 1,1,1 | 66.3 | 3 | 1 | 105 (98.13) | 99.36 | 90.7 | |
| EML528 | Unknown (Germany) | 6365411 | 53 | 6064 | 3043 | 70 | 2,1,1 | 66.6 | 4 | 1 | 107 (100) | 98.95 | 91.7 | |
| EML545 | Unknown (Germany) | 6442089 | 64 | 6142 | 2228 | 68 | 3,3,3 | 66.6 | 3 | 2 | 107 (100) | 99.37 | 94.0 | |
| EML548 | Unknown (Germany) | 6333473 | 120 | 6088 | 2230 | 73 | 4,4,4 | 66.6 | 1 | 2 | 103 (96.26) | 99.34 | 93.8 | |
| WH-SGI-V-07055 | Clinical (United States) | 6757340 | 102 | 6532 | 2023 | 67 | 7,1,1 | 66.4 | 3 | 5 | 107 (100) | 98.90 | 89.6 | |
| WH-SGI-V-07064 | Clinical (United States) | 6336003 | 87 | 6066 | 1195 | 68 | 6,1,1 | 66.7 | 2 | 3 | 105 (98.13) | 98.96 | 93.8 | |
| WH-SGI-V-07072 | Clinical (United States) | 6373176 | 88 | 6103 | 1195 | 68 | 6,1,1 | 66.6 | 2 | 1 | 105 (98.13) | 98.95 | 93.5 | |
| WH-SGI-V-07370 | Clinical (France) | 6581092 | 174 | 6358 | 2047 | 67 | 6,1,1 | 66.4 | 2 | 2 | 105 (98.13) | 98.94 | 91.1 | |
| WH-SGI-V-07618 | Clinical (United States) | 6845114 | 159 | 6622 | 2023 | 68 | 9,1,1 | 66.2 | 3 | 4 | 105 (98.13) | 98.90 | 88.7 |
FIGURE 2Phylogenetic analysis of P. aeruginosa strains using pan-genome matrix. The consensus pan-genome matrix was generated using the intersection of pan genes predicted by COG and OMCL algorithms to construct a parsimony tree using PARS from the PHYLIP. Three major clusters represented by PAO1 (Blue), UCBPP-PA14 (Green), and PA7 (Maroon) were observed. The outlier clade represented by PA7 was split into 2 clades one with a representative genome of PA7 (Maroon) and second by strain CR1 (red).
Details to RGPs found in strain CR1: number of ORFs, GC content, tRNA, integrase and transposase genes, major annotation, similarity/origin, and putative functions.
| GI (RGP) | Size (bp) | %G+C | tRNA | Integrase | Transposase | Major annotations | Similarity | Putative functions |
|---|---|---|---|---|---|---|---|---|
| RGP98 | 4,694 | 55.86 | No | No | Yes | Mobile Genetic Elements, Helicase, Bacterial Partitioning Protein | Helicase and Partitioning proteins have origins from | Transposition of genetic elements |
| RGP99 | 41,377 | 64.21 | Yes | Yes | No | Phage-related proteins, Hypothetical proteins | Major hits from | Recombination (presence of attL and attR sites) |
| RGP100 | 7,638 | 64.68 | No | No | No | Type I secretion protein (TolC), ABC Transporters, ATP-binding proteins, haemolysin D, YceK/YidQ family proteins | Non- | TISS (pathogenic potential) |
| RGP101 | 7,313 | 51.21 | No | Yes | No | Hypothetical proteins | Classical strains of | Hypothetical proteins |
| RGP102 | 11,296 | 66.10 | No | No | No | Hypothetical proteins | Non- | Unknown |
| RGP103 | 14,440 | 65.93 | Yes | Yes | No | Formaldehyde activating enzyme, aldehyde dehydrogenase; aldo/keto reductase, Gluconate dehydratase. | Aldehyde Detoxification; | |
| RGP104 | 12,652 | 58.79 | Yes | No | No | Amino acid permease, arginine/agmatine antiporter, SAM-dependent methyltransferase | Origin from classical strains of | Acid tolerance/resistance |
| RGP105 | 4,251 | 60.50 | No | No | No | Origin from soil inhabiting bacteria like | Rhizosphere expressed genes |
The list of genes under positive natural selection [ω represents the value of dN/dS, p-value: probability value; q-value: false discovery rate (FDR < 10%)].
| Gene | Function | Cluster name | ω | ||
|---|---|---|---|---|---|
| Carbohydrate-binding protein with metzincin domain (B7D75_11240) | Protease secreted by biofilm of | ORTHOMCL2836 | 29.07471 | 0 | 0.000844 |
| Epoxide hydrolase (B7D75_07235) | Virulence factor; hydrolysis of CFTR in lungs | ORTHOMCL5445 | 30.65061 | 0 | 0.000576 |
| Transglutaminase cysteine protease (B7D75_11075) | Role in biofilm formation and dispersal | ORTHOMCL2866 | 23.91616 | 0.000006 | 0.004772 |
| Metallo β-lactamase fold metallo-hydrolase (B7D75_21135) | Degrade beta-lactam antibiotics | ORTHOMCL4367 | 21.31702 | 0.000023 | 0.010209 |
| Outer membrane porin, OprD family (B7D75_11385) | Mutation in | ORTHOMCL2807 | 18.80725 | 0.000082 | 0.028645 |
| Peptidase M48 (B7D75_26190) | Chaperone and metalloprotease | ORTHOMCL1727 | 17.31162 | 0.000174 | 0.045382 |
| LysE family translocator (B7D75_10555) | Lysine exporter | ORTHOMCL2297 | 21.92155 | 0.000017 | 0.00845 |
| Hydrogen peroxide inducible genes activator (LysR family) (B7D75_26850) | Expression of hydrogen peroxide-inducible genes | ORTHOMCL481 | 25.94273 | 0.000002 | 0.002425 |
| High affinity sulfate family inorganic anion transporter (B7D75_00150) | Sulfate transporter | ORTHOMCL1053 | 22.29005 | 0.000014 | 0.00845 |
| Branched-chain amino acid transport system II carrier protein (B7D75_16930) | Transport system for branched-chain amino acids (BCAAs; Ile, Leu, and Val) | ORTHOMCL3746 | 17.34394 | 0.000171 | 0.045382 |
| Amino acid permease (B7D75_21325) | Amino acid transporters | ORTHOMCL4632 | 17.92312 | 0.000128 | 0.039327 |
| Type IV pilus secretin PilQ (B7D75_26155) | Involved in type IV pilus formation, competence for transformation, type III secretion, and type II secretion | ORTHOMCL1798 | 20.61727 | 0.000033 | 0.013371 |
| ProQ activator of osmoprotectant transporter ProP (B7D75_12075) | Transport betaine and other osmoprotectants | ORTHOMCL2663 | 18.45609 | 0.000098 | 0.032009 |
| Nuclease SbcCD subunit C (B7D75_21985) | Double stranded exo- and endonuclease | ORTHOMCL4686 | 24.01605 | 0.000006 | 0.004772 |
| ATP-dependent DNA helicase (B7D75_08610) | DNA replication | ORTHOMCL2136 | 21.94135 | 0.000017 | 0.00845 |
| Error-prone DNA polymerase (B7D75_21945) | DNA replication | ORTHOMCL4522 | 17.55637 | 0.000154 | 0.044617 |
| hmga2e, putative (B7D75_11510) | Unknown | ORTHOMCL2780 | 21.86925 | 0.000018 | 0.00845 |
| Hypothetical protein (B7D75_05465) | Unknown | ORTHOMCL1549 | 38.14207 | 0 | 0.000027 |
| Hypothetical protein (B7D75_14605) | Unknown | ORTHOMCL3251 | 27.56469 | 0.000001 | 0.001347 |
| Terminase (B7D75_05835) | Viral terminase | ORTHOMCL6048 | 20.30993 | 0.000039 | 0.014478 |
FIGURE 3Estimation of core and pan-genome of P. aeruginosa strains with Tettelin fit (blue: pangenome; red: core-genome). (A,B) Represents the estimate of the pan and core genome of 14 outlier strains; (C,D) represents the estimate of core and pan-genome of 64 P. aeruginosa strains included in the current study and (E,F) represents estimate of the core and pan-genome curves for 78 genomes. The x-axis represents the number of genomes (g) while the y-axis represents the core genome and pan-genome size (number of genes). The equation for estimating pan and core-genome size according to Tettelin fit are given with the respected graphs.
FIGURE 4Comparative functional analysis based on COG categories between core and accessory genes in (A) outlier strains (n = 14) and (B) classical strains (n = 64). COG-based binning of core genes and accessory genes. The abscissa denotes different COG functional categories. The ordinate denotes the number of genes in each COG category. Four COG functional categories (RNA processing and modification, Chromatin structure and dynamics, Extracellular structures, and Cytoskeleton) including only one or without homologs in the COG collection are not displayed. Significant enrichment of gene occurrence in the individual category is marked by asterisks (∗p < 0.05, ∗∗p < 0.001; FDR < 10% Chi-square test).
FIGURE 5Functional analysis of Pseudomonas outlier strains. (A) The amino acid sequences of all the 14 genomes were assigned KO number by KAAS server. The protein families were then mapped for a minimal set of pathways using a parsimony approach. The heat map was constructed based on most abundant top 50 subsystems using Pearson correlation. (B) The ORF’s were mapped against the COG database using rpsblast. For both the heatmaps rows are centered; unit variance scaling is applied to rows. Both rows and columns are clustered using correlation distance and average linkage.
Pathogenic potential of the 14 outlier strains.
| Strain | Probability | Input proteome coverage (%) | Matched pathogenic families | Matched not pathogenic families |
|---|---|---|---|---|
| ATCC9027 | 0.617 | 5.61 | 238 | 92 |
| EML548 | 0.615 | 5.68 | 239 | 93 |
| CR1 | 0.660 | 5.51 | 244 | 69 |
| AZPAE14901 | 0.638 | 5.15 | 248 | 82 |
| EML545 | 0.656 | 5.42 | 249 | 71 |
| AZPAE15042 | 0.639 | 5.45 | 252 | 83 |
| EML528 | 0.640 | 8.75 | 383 | 125 |
| AZPAE14941 | 0.653 | 7.84 | 387 | 115 |
| WH-SGI-V-07618 | 0.663 | 7.92 | 395 | 107 |
| WH-SGI-V-07055 | 0.661 | 8.04 | 395 | 109 |
| WH-SGI-V-07072 | 0.675 | 15.35 | 719 | 179 |
| WH-SGI-V-07064 | 0.674 | 15.51 | 721 | 180 |
| WH-SGI-V-07370 | 0.676 | 14.92 | 730 | 180 |
| PA7 | 0.672 | 15.72 | 757 | 191 |
FIGURE 6PCA analysis on pathogenic potential of P. aeruginosa outlier strains. The PCA- analysis was performed on the four parameters namely number of pathogenic and non-pathogenic families annotated, probability of being a pathogen and input proteome percentage among the 14 strains. PA7 along with WH-SGI-V-07370, WH-SGI-V-07064, and WH-SGI-V-07072 clustered in a different quadrant from the other strains showing high pathogenicity.
FIGURE 7Protein–Protein Interaction network of (A) Core genes of outlier strains (n = 14) and (B) Core genes of classical strains (n = 64). The circles represent protein hubs and line represent edges. The radius of major regulatory hubs decreases as the number of interactions decreases. The topological properties of these networks depicting the correlation coefficient values (r2). (C) Node degree distribution, (D) average clustering coefficient, (E) average neighborhood connectivity. All these properties follow the power distribution and show the nature of the scale-free network and hierarchical organization.
Major regulatory hubs from the core genome of an outlier and classical strains.
| Regulatory hub | Degree | Protein | Function |
|---|---|---|---|
| 239 | Glutamate synthase | Provides glutamate for the glutamine synthetase reaction, absent in animals | |
| 221 | Pyruvate carboxylase | Irreversible carboxylation of pyruvate to form oxaloacetate (OAA) | |
| 171 | DNA polymerase I | Prokaryotic DNA replication | |
| 159 | Histidine kinase | Play a role in signal transduction across the cellular membrane. | |
| 146 | GMP synthase | Converts xanthosine monophosphate to guanosine monophosphate in the | |
| 127 | Glutamate synthase | Provides glutamate for the glutamine synthetase reaction, absent in animals | |
| 67 | Chaperone protein | DnaK is also involved in chromosomal DNA replication, possibly through an analogous interaction with the DnaA | |
| 57 | Inosine 5′-monophosphate dehydrogenase | Purine biosynthetic enzyme; catalyzes the nicotinamide adenine dinucleotide (NAD+)-dependent oxidation of inosine monophosphate (IMP) to xanthosine monophosphate (XMP) | |
| 57 | Uroporphyrin decarboxylase | Catalyzes the decarboxylation of four acetate groups of uroporphyrinogen-III to yield coproporphyrinogen-III | |
| 55 | DNA-directed RNA polymerase | Essential for life; significant role in transcription | |