| Literature DB >> 29760087 |
Daniel M Cornforth1,2, Justine L Dees3, Carolyn B Ibberson1,2, Holly K Huse4, Inger H Mathiesen5, Klaus Kirketerp-Møller6, Randy D Wolcott7,8, Kendra P Rumbaugh9, Thomas Bjarnsholt10,11, Marvin Whiteley12,2.
Abstract
Laboratory experiments have uncovered many basic aspects of bacterial physiology and behavior. After the past century of mostly in vitro experiments, we now have detailed knowledge of bacterial behavior in standard laboratory conditions, but only a superficial understanding of bacterial functions and behaviors during human infection. It is well-known that the growth and behavior of bacteria are largely dictated by their environment, but how bacterial physiology differs in laboratory models compared with human infections is not known. To address this question, we compared the transcriptome of Pseudomonas aeruginosa during human infection to that of P. aeruginosa in a variety of laboratory conditions. Several pathways, including the bacterium's primary quorum sensing system, had significantly lower expression in human infections than in many laboratory conditions. On the other hand, multiple genes known to confer antibiotic resistance had substantially higher expression in human infection than in laboratory conditions, potentially explaining why antibiotic resistance assays in the clinical laboratory frequently underestimate resistance in patients. Using a standard machine learning technique known as support vector machines, we identified a set of genes whose expression reliably distinguished in vitro conditions from human infections. Finally, we used these support vector machines with binary classification to force P. aeruginosa mouse infection transcriptomes to be classified as human or in vitro. Determining what differentiates our current models from clinical infections is important to better understand bacterial infections and will be necessary to create model systems that more accurately capture the biology of infection.Entities:
Keywords: Pseudomonas aeruginosa; chronic wounds; cystic fibrosis; human transcriptome; machine learning
Mesh:
Year: 2018 PMID: 29760087 PMCID: PMC5984494 DOI: 10.1073/pnas.1717525115
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
The human samples used in this study
| Sample | Location of collection | Total reads | Reads mapped to PAO1 orthologous genes |
| Human_Sputum_A | Denmark | 71,926,942 | 492,733 |
| Human_Sputum_B | Denmark | 105,647,965 | 2,047,185 |
| Human_Sputum_C | Denmark | 61,212,590 | 1,958,877 |
| Human_Sputum_D | Denmark | 124,266,260 | 414,990 |
| Human_Sputum_E | Denmark | 62,985,339 | 453,389 |
| Human_Sputum_F | Denmark | 68,022,834 | 184,178 |
| Human_Sputum_G | Denmark | 78,869,961 | 10,213,756 |
| Lubbock_CW_15 | Texas, USA | 109,403,011 | 2,251,929 |
| Lubbock_CW_16 | Texas, USA | 94,788,090 | 191,274 |
| Lubbock_CW_18 | Texas, USA | 91,993,624 | 713,108 |
| Lubbock_CW_21 | Texas, USA | 109,252,642 | 30,836 |
| Lubbock _Burn_2_3 | Texas, USA | 46,780,195 | 682,982 |
| Lubbock_ Burn_2_4 | Texas, USA | 34,878,564 | 105,278 |
| Danish_Chronic_Wound_1 | Denmark | 106,188,256 | 5,112 |
| Danish_Chronic_Wound_2 | Denmark | 99,773,828 | 10,177 |
These samples were analyzed separately from the other samples in our study due to their low P. aeruginosa read counts and were only used in Fig. 1.
Fig. 1.PCA of P. aeruginosa RNA-seq results. This includes human samples listed in Table 1, as well as mouse and in vitro experiments from our laboratory and others (Dataset S1). (A) Analysis was performed with 1,707 genes that were expressed (i.e., contained at least 1 RNA-seq read) in all samples. (B) To include two chronic wound samples from Denmark with low-read coverage (labeled in figure), analysis was performed with 761 genes that were expressed in all samples.
Fig. 2.Gene categories that are significantly different in in vitro transcriptomes compared with human transcriptomes. Analysis was performed with 1,707 genes that were expressed (i.e., contained at least 1 RNA-seq read) in all samples. Categories and enrichment calculation were obtained from the BioCyc database using Grossmann’s parent–child-union variation of the Fisher’s exact test with a P value cut-off of 0.05 (48). Plotted are the genes with a P-adjusted value of <0.05. “Considered” genes indicates the number of genes within that category that were analyzed (i.e., included in the 1,707 genes).
Fig. 3.The expression of genes in the las core QS regulon in human samples compared with in vitro samples (18). (A) Average relative expression of 42 core QS genes in vitro and in humans. Fold-change in expression was calculated as a ratio of the geometric mean of relative expressions for each gene among the in vitro transcriptomes to the geometric mean of relative expressions of each gene among the human transcriptomes. Values above 0 indicate higher gene expression in vitro. Samples with fewer than three reads for a gene were removed from the analysis. (B) Average relative expression of 42 core QS genes within in vitro biofilm/aggregate transcriptomes or in vitro planktonic transcriptomes, compared with human transcriptomes. Fold-change in expression of each gene was calculated as above. Bars represent SEM.
P. aeruginosa genes induced in human samples that are also important for antimicrobial tolerance (19)
The above genes were both expressed higher in our human infections than in vitro samples (log2 fold-change > 1.5, FDR < 0.05) and were fitness determinants when grown in the presence of antimicrobials [log2 fold-change < −1.5, FDR < 0.05 (19)]. Blue boxes shade genes up-regulated in human CF sputum, yellow shade genes up-regulated in human chronic wounds, and green shade genes up-regulated in human burn wounds. Also included is whether these genes were induced in mouse burn or surgical wounds [fold-change > 2, P value < 0.05 (12)] and induced by any of 14 antimicrobials in vitro [log2 fold-change > 0, FDR < 0.05 (19)]. AMP, ampicillin; BLCH, bleach; CAR, carbenicillin; CFP, cefoperazone; CIP, ciprofloxacin; GEN, gentamicin; NEO, neomycin; PMB, polymyxin B; TOB, tobramycin.
Gene appears in all three infection types.
Gene appears in two of the three human infection types in the table.
Genes used to distinguish P. aeruginosa human infection transcriptome from in vitro transcriptome
| Locus tag | Gene name | Human infection vs. in vitro (log2 fold-change) | Soft tissue infection vs. in vitro (log2 fold-change) | CF sputum vs. in vitro (log2 fold-change) |
| PA2911 | 4.1 | 3.8 | 4.2 | |
| PA2914 | 3.2 | 3.1 | 3.2 | |
| PA5535 | 4.1 | 3.7 | 4.4 | |
| PA1414 | 4.6 | 5.4 | 3.2 | |
| PA0781 | 2.9 | 2.6 | 3.2 | |
| PA2382 | 4.9 | 2.1 | 5.7 | |
| PA4835 | 2.8 | 2.2 | 3.1 | |
| PA2943 | 3.2 | 1.8 | 3.7 | |
| PA3598 | 3.5 | 2.3 | 4.0 | |
| PA4063 | 2.9 | 2.0 | 3.4 | |
| PA1797 | 2.8 | 3.2 | 2.5 | |
| PA4570 | 2.8 | 2.0 | 3.2 | |
| PA3237 | 7.0 | 4.0 | 7.9 | |
| PA2018 | 2.3 | 2.0 | 2.5 | |
| PA4495 | 4.0 | 1.5 | 4.8 | |
| PA4709 | 2.4 | 1.9 | 2.8 | |
| PA4710 | 3.4 | 0.8 | 4.2 | |
| PA2662 | 3.7 | 1.7 | 4.4 | |
| PA4843 | −3.2 | −2.6 | −3.9 | |
| PA4470 | 1.8 | 0.6 | 2.3 | |
| PA2931 | 2.2 | 1.7 | 2.6 | |
| PA2562 | 1.7 | 0.3 | 2.3 | |
| PA2396 | 1.7 | 1.1 | 2.1 | |
| PA4227 | 1.6 | 0.5 | 2.1 | |
| PA2386 | 1.0 | −0.1 | 1.5 | |
| PA3418 | −0.5 | −0.1 | −0.9 | |
| PA0865 | −2.6 | −3.5 | −2.1 | |
| PA3691 | −0.1 | 0.8 | −2.6 | |
| PA2291 | −2.8 | −2.6 | −2.9 | |
| PA2553 | −2.3 | −2.8 | −2.0 |
Genes were ranked according to their ROC importance score in a trained SVM model (26). Also shown is the DESeq2 log2 fold-changes when in vitro samples were compared with all human infections (human infection vs. in vitro), human soft-tissue infections (soft tissue infection vs. in vitro) and CF sputum (CF sputum vs. in vitro) (29). Positive values indicate genes up-regulated in human infection.
Classification of P. aeruginosa transcriptomes from mouse models of infection as: human or in vitro, and human soft tissue or CF sputum
| Sample | Human vs. in vitro | Soft tissue vs. CF sputum | ||
| Mean probability human | SD | Mean probability soft tissue | SD | |
| Mouse burn 1 | 0.07 | 0.04 | 0.73 | 0.17 |
| Mouse burn 2 | 0.15 | 0.08 | 0.72 | 0.19 |
| Mouse surgical 1 | 0.72 | 0.15 | 0.68 | 0.19 |
| Mouse surgical 2 | 0.75 | 0.12 | 0.68 | 0.21 |
| Mouse pneumonia | 0.08 | 0.08 | 0.67 | 0.25 |
For both distinctions (human vs. in vitro and soft tissue vs. CF sputum), feature (gene) selection and model training were repeated 50 times. The mean probability according to the SVM model and its SD were calculated using the Caret package (26). Mean probability is the likelihood that a mouse transcriptome is human (for the human vs. in vitro) or soft tissue (for the soft tissue vs. CF sputum) based on the SVM model as calculated by the Caret package. For example, a mean probability of 1 would indicate that the sample is unambiguously categorized as human for the human vs. in vitro.