| Literature DB >> 26754847 |
Jeanneth Mosquera-Rendón1,2, Ana M Rada-Bravo3,4, Sonia Cárdenas-Brito5, Mauricio Corredor6, Eliana Restrepo-Pineda7, Alfonso Benítez-Páez8,9,10.
Abstract
BACKGROUND: Drug treatments and vaccine designs against the opportunistic human pathogen Pseudomonas aeruginosa have multiple issues, all associated with the diverse genetic traits present in this pathogen, ranging from multi-drug resistant genes to the molecular machinery for the biosynthesis of biofilms. Several candidate vaccines against P. aeruginosa have been developed, which target the outer membrane proteins; however, major issues arise when attempting to establish complete protection against this pathogen due to its presumably genotypic variation at the strain level. To shed light on this concern, we proposed this study to assess the P. aeruginosa pangenome and its molecular evolution across multiple strains.Entities:
Mesh:
Year: 2016 PMID: 26754847 PMCID: PMC4710005 DOI: 10.1186/s12864-016-2364-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Main features of the Pseudomonas aeruginosa pangenome
| Features analysed |
|
|---|---|
| Genomes | 181 |
| Total genes | 1,117,803 |
| Average genome size | 6175 genes |
| Pangenome size (non-redundant genes) | 16,820 |
| Core genome | 2503 genes |
| Accessory genome | 9108 genes |
| Unique genes | 5209 |
| Average unique genes/strain | 16 |
| Gene families under positive selection | 233 |
Fig. 1The P. aeruginosa pangenome. a Rarefaction curve of the 200 different pangenomes calculated from random combinations of strains. Iterations and combinations are shown as the dots cloud indicating the total number of non-redundant genes included in the pangenome as genomes are included in the analysis. Red filled circles indicate the median of each iteration. b Decay function for new genes discovered during pangenome reconstruction. Iterations and combinations are shown as dots cloud indicating the number of new genes incorporated to core genome. Red filled circles indicate the median of each iteration. The power law alpha parameter shown inside the plot is the average of such values retrieved individually in each iteration after fitting ± sd. The theta (θ) value was calculated from the horizontal asymptote where the exponential regression converges. c Histogram for the prevalence of different gene families of the pangenome. The 16,820 non-redundant gene families determined to be present in the P. aeruginosa pangenome were distributed according to their frequency across all strains analysed. Three gene categories are clearly distinguished, highlighting the core genome (gene families present in all strains analysed), the unique genes (genes present in only one strain), and the accessory genome (gene families exhibiting a variable frequency)
Fig. 2Functional annotation of the pangenome according to gene family categorization. Two Venn diagrams are presented, indicating the functional annotation at the orthology level (left diagram) and molecular pathway level (right diagram) for the three different categories established in concordance with gene frequency across strains. The redundancy of functions was predominantly at the pathway level and permitted to discern distinctive elements for each gene category. Those distinctive pathways are listed in Table 2
KEGG functional modules distinctive for the P. aeruginosa pangenome gene category
| KEGG module number | Description | Genes |
|---|---|---|
| Core genome | ||
| M00064 | ADP-L-glycero-D-manno-heptose biosynthesis | 2 |
| M00493 | AlgZ-AlgR (alginate production) two-component regulatory system | 1 |
| M00235 | Arginine/ornithine transport system | 3 |
| M00531 | Assimilatory nitrate reduction | 1 |
| M00475 | BarA-UvrY (central carbon metabolism) two-component regulatory system | 2 |
| M00086 | Beta-Oxidation | 1 |
| M00123, M00573, M00577 | Biotin biosynthesis | 3 |
| M00364, M00366 | C10-C20 isoprenoid biosynthesis | 2 |
| M00170, M00171 | C4-dicarboxylic acid cycle | 2 |
| M00168 | CAM (Crassulacean acid metabolism) | 1 |
| M00722, M00727, M00728 | Cationic antimicrobial peptide (CAMP) resistance | 3 |
| M00256 | Cell division transport system | 1 |
| M00010 | Citrate cycle | 4 |
| M00120 | Coenzyme A biosynthesis | 3 |
| M00338 | Cysteine biosynthesis | 1 |
| M00154, M00155 | Cytochrome c oxidase | 3 |
| M00417 | Cytochrome o ubiquinol oxidase | 3 |
| M00552 | D-galactonate degradation, De Ley-Doudoroff pathway | 2 |
| M00596 | Dissimilatory sulfate reduction | 1 |
| M00542 | EHEC/EPEC pathogenicity signature | 2 |
| M00008 | Entner-Doudoroff pathway | 1 |
| M00445 | EnvZ-OmpR (osmotic stress response) two-component regulatory system | 3 |
| M00515 | FlrB-FlrC (polar flagellar synthesis) two-component regulatory system | 1 |
| M00729 | Fluoroquinolone resistance | 1 |
| M00344, M00345 | Formaldehyde assimilation | 3 |
| M00497 | GlnL-GlnG (nitrogen regulation) two-component regulatory system | 1 |
| M00605 | Glucose/mannose transport system | 2 |
| M00012 | Glyoxylate cycle | 1 |
| M00050 | Guanine ribonucleotide biosynthesis, IMP | 3 |
| M00259 | Heme transport system | 1 |
| M00045 | Histidine degradation | 4 |
| M00226 | Histidine transport system | 1 |
| M00620 | Incomplete reductive citrate cycle | 3 |
| M00131 | Inositol phosphate metabolism | 1 |
| M00190 | Iron (III) transport system | 4 |
| M00535 | Isoleucine biosynthesis | 3 |
| M00113 | Jasmonic acid biosynthesis | 1 |
| M00505 | KinB-AlgB (alginate production) two-component regulatory system | 2 |
| M00080 | Lipopolysaccharide biosynthesis | 1 |
| M00320 | Lipopolysaccharide export system | 2 |
| M00255 | Lipoprotein-releasing system | 1 |
| M00116 | Menaquinone biosynthesis | 1 |
| M00740 | Methylaspartate cycle | 2 |
| M00189 | Molybdate transport system | 3 |
| M00711 | Multidrug resistance, efflux pump MdtIJ | 1 |
| M00115 | NAD biosynthesis | 3 |
| M00144 | NADH:quinone oxidoreductase | 13 |
| M00471 | NarX-NarL (nitrate respiration) two-component regulatory system | 1 |
| M00622 | Nicotinate degradation | 1 |
| M00615 | Nitrate assimilation | 2 |
| M00438 | Nitrate/nitrite transport system | 1 |
| M00439 | Oligopeptide transport system | 1 |
| M00209 | Osmoprotectant transport system | 2 |
| M00004, M00007 | Pentose phosphate pathway | 6 |
| M00024 | Phenylalanine biosynthesis | 3 |
| M00434 | PhoR-PhoB (phosphate starvation response) | 1 |
| M00222 | Phosphate transport system | 4 |
| M00501 | PilS-PilR (type 4 fimbriae synthesis) two-component regulatory system | 1 |
| M00133 | Polyamine biosynthesis | 3 |
| M00015 | Proline biosynthesis | 3 |
| M00247, M00258 | Putative ABC transport system | 3 |
| M00193 | Putative spermidine/putrescine transport system | 7 |
| M00046 | Pyrimidine degradation | 1 |
| M00053 | Pyrimidine deoxyribonuleotide biosynthesis, CDP/CTP | 5 |
| M00052 | Pyrimidine ribonucleotide biosynthesis, UMP | 3 |
| M00377 | Reductive acetyl-CoA pathway (Wood-Ljungdahl pathway) | 1 |
| M00167 | Reductive pentose phosphate cycle | 5 |
| M00523 | RegB-RegA (redox response) two-component regulatory system | 2 |
| M00125 | Riboflavin biosynthesis, GTP | 2 |
| M00394 | RNA degradosome | 1 |
| M00308 | Semi-phosphorylative Entner-Doudoroff pathway | 2 |
| M00185 | Sulfate transport system | 2 |
| M00436 | Sulfonate transport system | 3 |
| M00435 | Taurine transport system | 1 |
| M00089 | Triacylglycerol biosynthesis | 2 |
| M00332 | Type III secretion system | 2 |
| M00025, M00040 | Tyrosine biosynthesis | 4 |
| M00117, M00128 | Ubiquinone biosynthesis | 7 |
| M00029 | Urea cycle | 3 |
| M00651 | Vancomycin resistance | 2 |
| M00241 | Vitamin B12 transport system | 1 |
| M00660 | Xanthomonas spp. pathogenicity signature | 2 |
| M00242 | Zinc transport system | 2 |
| Accessory genome | ||
| M00502 | GlrK-GlrR (amino sugar metabolism) two-component regulatory system | 1 |
| M00533 | Homoprotocatechuate degradation | 2 |
| M00240 | Iron complex transport system | 3 |
| M00005 | PRPP biosynthesis | 1 |
| M00473 | UhpB-UhpA (hexose phosphates uptake) two-component regulatory system | 1 |
| M00644 | Vanadium resistance | 1 |
| Unique genes | ||
| M00653 | AauS-AauR (acidic amino acids utilization) two-component regulatory system | 1 |
| M00500 | AtoS-AtoC (cPHB biosynthesis) two-component regulatory system | 1 |
| M00450 | BaeS-BaeR (envelope stress response) two-component regulatory system | 1 |
| M00104 | Bile acid biosynthesis | 1 |
| M00581 | Biotin transport system | 1 |
| M00569 | Catechol meta-cleavage | 4 |
| M00582 | Energy-coupling factor transport system | 1 |
| M00760 | Erythromycin resistance | 1 |
| M00524 | FixL-FixJ (nitrogen fixation) two-component regulatory system | 1 |
| M00713 | Fluoroquinolone resistance | 1 |
| M00059 | Glycosaminoglycan biosynthesis | 1 |
| M00499 | HydH-HydG (metal tolerance) two-component regulatory system | 1 |
| M00714, M00645 | Multidrug resistance | 2 |
| M00664 | Nodulation | 1 |
| M00549 | Nucleotide sugar biosynthesis | 1 |
| M00267 | PTS system, N-acetylglucosamine-specific II component | 1 |
Catalogue of the KEGG functional modules (M) distinctively found in three gene categories of the P. aeruginosa pangenome: core, accessory, and unique genes. The number of modules correlated with those numbers presented in Fig. 2 (Venn diagram on the right)
Fig. 3Molecular evolution of the P. aeruginosa pangenome. a Histogram showing the distribution of omega (ω) values across the P. aeruginosa pangenome. The light blue histogram shows the original distribution with the tendency towards values indicating purifying selection (shift to left from neutrality). The superposed light red histogram indicates the Z-scores for the selection of genes with ω significantly different than 1. Those with significant ω <1 were considered to be under strong purifying selection for functional analysis, and those with significant ω >1 were selected to be under strong positive selection for the MLST approach. b Scatter plot to represent the distribution of normalized dN and dS rates for all gene families detected in the P. aeruginosa pangenome. Gene families under strong purifying selection are highlighted in blue, whereas gene families under positive selection (ω >2) are highlighted in red. The set of gene family candidates for MLST under strong positive selection are highlighted in green. The diagonal dashed line indicates the boundary for neutrality
Domain enrichment in proteins under positive selection
| SMART/Pfam domain | Description | Fisher’s test |
|---|---|---|
| Chromate_transp | Probably act as chromate transporters in bacteria | 0.0000 |
| Sulfatase | Present in esterases hydrolysing steroids, carbohydrates and proteins | 0.0020 |
| PepSY_TM | Conserved transmembrane helix found in bacterial protein families | 0.0041 |
| PrmA | Present in the Ribosomal protein L11 methyltransferase | 0.0123 |
| Cons_hypoth95 | Present in 16S RNA methyltransferase D | 0.0166 |
| MTS | Present in the 16S RNA methyltransferase C | 0.0182 |
| DUF1329 | Putative outer membrane lipoprotein | 0.0215 |
| DUF4102 | Putative phage integrase | 0.0235 |
| CHASE | Extracellular domain of bacterial transmembrane receptors | 0.0284 |
| G3P_acyltransf | Enzymes converting glycerol-3-phosphate into lysophosphatidic acid | 0.0284 |
| AceK | Bacterial isocitrate dehydrogenase kinase/phosphatase protein | 0.0284 |
| Choline_sulf_C | C-terminus of enzyme producing choline from choline-O-sulfate | 0.0284 |
| DUF2165 | Unknown function | 0.0284 |
| DUF2909 | Unknown function | 0.0284 |
| DUF3079 | Unknown function | 0.0284 |
| DUF444 | Unknown function | 0.0284 |
| DUF533 | Unknown function; integral membrane protein | 0.0284 |
| DUF791 | Unknown function | 0.0284 |
| DUF972 | Unknown function | 0.0284 |
| Glu_cys_ligase | Enzyme carrying out the first step of glutathione biosynthesis | 0.0284 |
| Herpes_UL6 | Present in proteins similar to herpes simplex UL6 virion protein | 0.0284 |
| His_kinase | Membrane sensor, a two-component regulatory system | 0.0284 |
| Inhibitor_I42 | Protease inhibitor | 0.0284 |
| PPDK_N | Present in enzymes catalysing the conversion of pyrophosphate to PEP | 0.0284 |
| Sigma54_AID | Activating interacting domain of the Sigma-54 factor | 0.0284 |
| Sigma54_CBD | Core binding domains of the Sigma-54 factor | 0.0284 |
| Sigma54_DBD | DNA binding domain of the Sigma-54 factor | 0.0284 |
| PAS, PAS 4/9 | Present in signalling proteins working as signal sensors | 0.0330 |
| MFS | Major Facilitator Superfamily of small molecule transporters | 0.0359 |
| Autoind_synth | Autoinducer synthase involved in quorum-sensing response | 0.0423 |
| AzIC | Putative protein involved in branched-chain amino acid transport | 0.0423 |
| Chitin_bind | Present in carbohydrate-active enzymes (glycoside hydrolases) | 0.0423 |
| DUF3299 | Unknown function | 0.0423 |
| PTS_EIIC / IIB | Phosphoenolpyruvate-dependent phosphotransferase system | 0.0423 |
| TctC | Member of the tripartite tricarboxylate receptors | 0.0423 |
| UPF0004 | Domain found in tRNA methythiotransferases | 0.0423 |
The SMART and Pfam domains are presented in a non-redundant manner. Function description was recovered from annotations in SMART or Pfam databases. Fisher’s test values correspond to p-values (p ≤ 0.05), supporting the over-representation of the corresponding domain in the set of proteins under positive selection
Potential genetic markers for MLST in P. aeruginosa strains
| Gene familya | Functionb | Omega ( | Length (bp) | Strain frequencyc |
|---|---|---|---|---|
| 3333 | Chitin binding protein | 108 | 1170 | 98.9 % (179) |
| 3675 | Flagellar basal-body rod protein FlgF | 5884 | 750 | 99.5 % (180) |
| 4766 | Predicted branched-chain amino acid permease AzIC | 86 | 763 | 96.7 % (175) |
| 5348 | Unknown function | 32 | 573 | 99.5 % (180) |
aNomenclature according to pangenome gene inventory
bFunction inferred from KEGG, SMART, and/or BLAST-based search
cNumber of strains carrying respective genes are denoted in parenthesis
Fig. 4Circular phylogenetic tree showing the genetic relationships among 170 reference PATRIC strains and our six P. aeruginosa isolates. The phylogenetic tree was built from the best evolutionary model explaining evolution at the concatenated gene families 3333, 3675, 4766, and 5348 after a sequential likelihood ratio test [54]. A total of 176 P. aeruginosa strains are located in the tree, and the localization of our clinical isolate is indicated. A close view of this tree permitted us to infer relationships among our clinical isolates with virulent and multi-drug resistant strains