| Literature DB >> 30018396 |
Chan Yeong Kim1, Muyoung Lee1, Keehoon Lee2, Sang Sun Yoon2, Insuk Lee3.
Abstract
Staphylococcus aureus is a gram-positive bacterium that causes a wide range of infections. Recently, the spread of methicillin-resistant S. aureus (MRSA) strains has seriously reduced antibiotic treatment options. Anti-virulence strategies, the objective of which is to target the virulence instead of the viability of the pathogen, have become widely accepted as a means of avoiding the emergence of new antibiotic-resistant strains. To increase the number of anti-virulence therapeutic options, it is necessary to identify as many novel virulence-associated genes as possible in MRSA. Co-functional networks have proved useful for mapping gene-to-phenotype associations in various organisms. Herein, we present StaphNet (www.inetbio.org/staphnet), a genome-scale co-functional network for an MRSA strain, S. aureus subsp. USA300_FPR3757. StaphNet, which was constructed by the integration of seven distinct types of genomics data within a Bayesian statistics framework, covers approximately 94% of the coding genome with a high degree of accuracy. We implemented a companion web server for network-based gene prioritization of the phenotypes of 31 different S. aureus strains. We demonstrated that StaphNet can effectively identify genes for virulence-associated phenotypes in MRSA. These results suggest that StaphNet can facilitate target discovery for the development of anti-virulence drugs to treat MRSA infection.Entities:
Mesh:
Year: 2018 PMID: 30018396 PMCID: PMC6050336 DOI: 10.1038/s41598-018-29120-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Construction and assessment of StaphNet. (A) Schematic overview of the process used to construct StaphNet. The functional associations were inferred from the seven distinct data types and integrated into StaphNet. (B) Methicillin-resistant Staphylococcus aureus (MRSA) and its component networks were assessed against UniProt-GO annotation. The graph represents the precision and its corresponding gene coverage for every 1,000 links. (C) The violin plots represent the distribution of betweenness centrality (left) and degree centrality (right) for each gene set. P-values were calculated using the Mann–Whitney U test.
Gene Expression Omnibus (GEO) expression data used to construct StaphNet.
| GSE | Platform | Description | # gene | # link |
|---|---|---|---|---|
| GSE26249 | GPL10597 | Gene expression analysis of daptomycin resistance in | 859 | 22,485 |
| GSE10605 | GPL1339 | Microarray analysis of toxicogenomic effects of ortho-phenylphenol on | 882 | 19,510 |
| GSE13424 | GPL1339 | Profiling downregulation of the mevalonate pathway in | 1,349 | 30,499 |
| GSE14669 | GPL1339 | Transcriptional analysis of | 581 | 22,515 |
| GSE15394 | GPL1339 | 945 | 23,492 | |
| GSE22233 | GPL1339 | Expression and mRNA half-life data for acid- and alkaline-shocked | 1,666 | 35,508 |
| GSE3415 | GPL1339 | Global transcriptome analysis of | 1,802 | 48,499 |
| GSE39627 | GPL1339 | Comparison of the oxacillin stress response in vraS and vraT mutants | 2,066 | 30,501 |
| GSE40448 | GPL1339 | Microarray analysis of toxicogenomic effect of ortho-benzyl-para-chlorophenol (OBPCP) on | 1,913 | 58,496 |
| GSE40450 | GPL1339 | Comparative analysis of the toxicogenomic effects of ortho-benzyl-para-chlorophenol (OBPCP) and para-tertiary amylphenol (PTAP) on | 1,580 | 45,490 |
| GSE50675 | GPL1339 | Global transcriptome analysis of | 798 | 22,495 |
| GSE58938 | GPL1339 | Phenotype and expression profile analysis of | 904 | 14,504 |
| GSE20973 | GPL1339 | Direct targets of CodY in | 980 | 17,494 |
| GSE70040 | GPL20586 | Whole-transcriptome analysis of | 480 | 11,418 |
| GSE25454 | GPL8069 | Global changes in | 1,863 | 44,494 |
| GSE65827 | GPL18484 | SaeRS-dependent inhibition of biofilm formation in | 249 | 9,507 |
| GSE59851 | GPL19006 | Potential influence of | 1,592 | 45,497 |
| GSE68772 | GPL19006 | The C-terminal region of the RNA helicase CshA is required for the interaction with the degradosome and turnover of bulk RNA in the opportunistic pathogen | 1,687 | 18,506 |
Summary of the seven component networks for distinct data types and the integrated StaphNet.
| Network code | Description | # gene | # link |
|---|---|---|---|
| SA-CX | Co-functional network inferred by co-expression analysis of | 2,539 | 24,031 |
| SA-GN | Co-functional network inferred by gene neighborhood of | 2,031 | 21,803 |
| SA-DP | Co-functional network inferred by domain profile similarity between | 1,448 | 7,026 |
| SA-HT | Co-functional network derived from high-throughput protein–protein interaction assays for | 290 | 4,130 |
| SA-PG | Co-functional network inferred by phylogenetic profile similarity between | 910 | 8,407 |
| EC-CX | Co-functional network inferred by co-expression analysis of | 754 | 5,232 |
| EC-HT | Co-functional network derived from high-throughput protein–protein interaction assays for | 479 | 2,626 |
| StaphNet | The integrated co-functional network for methicillin-resistant | 2,674 | 60,513 |
Figure 2Genes for virulence-associated phenotypes are interconnected in StaphNet. (A) For various virulence-associated phenotypes, the within-group edge count in StaphNet, which means the number of the edges between genes for the same phenotype, was compared with that of the same size of random gene sets. For each phenotype, a violin plot and an inner box plot represent the distribution of the within-group edge counts by a million random gene sets. Red stars represent the within-group edge counts for the given virulence-associated phenotypes. (B) The SAFE algorithm was used to detect the local enrichment modules for seven virulence-associated phenotypes in StaphNet. The various phenotypes are represented by different colors.
Figure 3Schematic diagrams of network-based gene prioritization algorithms implemented in StaphNet server. (A) Pathway-centric search algorithm. (B) Context-centric search algorithm.
Candidate gene list for hemolysis activity by pathway-centric search.
| Rank | Gene ID (New locus tag) | Pathway-Centric Search Score | Mutant Strain Availability (Tested Rank) | Significant Change in Hemolysis Ability |
|---|---|---|---|---|
| 1 | SAUSA300_RS11335 | 24.58 | TRUE (1) | FALSE |
| 2 | SAUSA300_RS05175 | 22.24 | TRUE (2) | FALSE |
| 3 | SAUSA300_RS07105 | 22.04 | TRUE (3) | FALSE |
| 4 | SAUSA300_RS11410 | 20.84 | FALSE | Not tested |
| 5 | SAUSA300_RS11425 | 20.64 | TRUE (4) | FALSE |
| 6 | SAUSA300_RS07100 | 19.34 | TRUE (5) | FALSE |
| 7 | SAUSA300_RS11990 | 18.85 | FALSE | Not tested |
| 8 | SAUSA300_RS01250 | 18.38 | TRUE (6) | FALSE |
| 9 | SAUSA300_RS11355 | 18.36 | TRUE (7) | FALSE |
| 10 | SAUSA300_RS02850 | 18.23 | FALSE | Not tested |
| 11 | SAUSA300_RS07405 | 17.96 | TRUE (8) | FALSE |
| 12 | SAUSA300_RS07430 | 17.58 | FALSE | Not tested |
| 13 | SAUSA300_RS12145 | 17.27 | FALSE | Not tested |
| 14 | SAUSA300_RS11400 | 17.06 | FALSE | Not tested |
| 15 | SAUSA300_RS11380 | 16.85 | TRUE (9) | FALSE |
| 16 | SAUSA300_RS02805 | 16.66 | FALSE | Not tested |
| 17 | SAUSA300_RS09090 | 16.66 | FALSE | Not tested |
| 18 | SAUSA300_RS06220 | 16.59 | FALSE | Not tested |
|
|
|
|
|
|
Figure 4Network-based identification of novel genes for virulence-associated phenotypes. (A) The blood agar plate streaked with wild-type Staphylococcus aureus exhibited beta-type hemolysis, whereas the plate streaked with the mutant S. aureus with the SAUSA300_RS06165 gene exhibited gamma-type hemolysis. (B) Hemolysis activity (OD450) of the wild-type S. aureus and the mutant S. aureus with the SAUSA300_RS06165 gene. Error bars represent the standard error, and the double asterisk signifies a P-value ≤ 0.01 (two-tailed t-test). (C) Hemolysis gene network including a novel hemolysis gene, RS06165 (red node) and its within-group connectivity score (red dot). Significance of the observed network connectivity was measured by using a null distribution based on 10000 random gene sets. (D) Biofilm-forming capability was normalized to the bacterial growth (OD550/OD600) of the wild-type and mutant S. aureus. Error bars represent the standard error, the single asterisk represents a P-value ≤ 0.05, and the double asterisks represent P-values ≤ 0.01 (two-tailed t-test). (E) Biofilm gene network including five novel biofilm genes (red nodes) identified in this study and its within-group connectivity score (red dot). Significance of the observed connectivity was tested as for (C).
Candidate gene list for biofilm forming ability by context-centric search.
| Rank | Gene ID (New locus tag) | Context-Centric Search p-value | Mutant Strain Availability (Tested Rank) | Significant Change in Biofilm Forming Ability (Significance) |
|---|---|---|---|---|
| 1 | SAUSA300_RS12890 | 1.73569E-08 | FALSE | Not tested |
| 2 | SAUSA300_RS12950 | 9.84818E-08 | TRUE (1) | FALSE |
| 3 | SAUSA300_RS12940 | 1.04527E-05 | TRUE (2) | FALSE |
| 4 | SAUSA300_RS12935 | 3.05251E-05 | FALSE | Not tested |
| 5 | SAUSA300_RS03540 | 3.89378E-05 | FALSE | Not tested |
| 6 | SAUSA300_RS04955 | 5.47043E-05 | FALSE | Not tested |
| 7 | SAUSA300_RS12930 | 0.000143066 | FALSE | Not tested |
|
|
|
|
|
|
| 9 | SAUSA300_RS12695 | 0.000271935 | FALSE | Not tested |
| 10 | SAUSA300_RS09310 | 0.000287212 | TRUE (4) | FALSE |
| 11 | SAUSA300_RS12380 | 0.000384348 | FALSE | Not tested |
| 12 | SAUSA300_RS07240 | 0.000402737 | TRUE (5) | FALSE |
| 13 | SAUSA300_RS00945 | 0.000498033 | FALSE | Not tested |
|
|
|
|
|
|
| 15 | SAUSA300_RS04750 | 0.000510171 | FALSE | Not tested |
|
|
|
|
|
|
|
|
|
|
|
|
| 18 | SAUSA300_RS14020 | 0.0014589 | FALSE | Not tested |
| 19 | SAUSA300_RS11780 | 0.001491269 | FALSE | Not tested |
| 20 | SAUSA300_RS06610 | 0.001493588 | TRUE (9) | FALSE |
| 21 | SAUSA300_RS13915 | 0.001493588 | FALSE | Not tested |
| 22 | SAUSA300_RS00475 | 0.001826731 | FALSE | Not tested |
| 23 | SAUSA300_RS00625 | 0.002214592 | FALSE | Not tested |
| 24 | SAUSA300_RS13275 | 0.002659488 | FALSE | Not tested |
| 25 | SAUSA300_RS01860 | 0.003166104 | FALSE | Not tested |
|
|
|
|
|
|