Literature DB >> 23326304

Gene expression-based classifiers identify Staphylococcus aureus infection in mice and humans.

Sun Hee Ahn1, Ephraim L Tsalik, Derek D Cyr, Yurong Zhang, Jennifer C van Velkinburgh, Raymond J Langley, Seth W Glickman, Charles B Cairns, Aimee K Zaas, Emanuel P Rivers, Ronny M Otero, Tim Veldman, Stephen F Kingsmore, Joseph Lucas, Christopher W Woods, Geoffrey S Ginsburg, Vance G Fowler.   

Abstract

Staphylococcus aureus causes a spectrum of human infection. Diagnostic delays and uncertainty lead to treatment delays and inappropriate antibiotic use. A growing literature suggests the host's inflammatory response to the pathogen represents a potential tool to improve upon current diagnostics. The hypothesis of this study is that the host responds differently to S. aureus than to E. coli infection in a quantifiable way, providing a new diagnostic avenue. This study uses Bayesian sparse factor modeling and penalized binary regression to define peripheral blood gene-expression classifiers of murine and human S. aureus infection. The murine-derived classifier distinguished S. aureus infection from healthy controls and Escherichia coli-infected mice across a range of conditions (mouse and bacterial strain, time post infection) and was validated in outbred mice (AUC>0.97). A S. aureus classifier derived from a cohort of 94 human subjects distinguished S. aureus blood stream infection (BSI) from healthy subjects (AUC 0.99) and E. coli BSI (AUC 0.84). Murine and human responses to S. aureus infection share common biological pathways, allowing the murine model to classify S. aureus BSI in humans (AUC 0.84). Both murine and human S. aureus classifiers were validated in an independent human cohort (AUC 0.95 and 0.92, respectively). The approach described here lends insight into the conserved and disparate pathways utilized by mice and humans in response to these infections. Furthermore, this study advances our understanding of S. aureus infection; the host response to it; and identifies new diagnostic and therapeutic avenues.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23326304      PMCID: PMC3541361          DOI: 10.1371/journal.pone.0048979

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Septicemia causes substantial morbidity and mortality among patients in the United States, with a rising burden of Staphylococcus aureus infection [1], [2]. Although blood cultures are the diagnostic gold standard for blood stream infection (BSI), sensitivity is limited and results are not rapidly available [3]. Such diagnostic delays can extend the time to administration of effective antibiotics, which is an independent risk factor for mortality [4], [5]. Conversely, diagnostic uncertainty also leads to high rates of empiric overtreatment, fueling the burden of antimicrobial resistance [6], [7]. Thus, novel approaches that are faster and more accurate are needed to differentiate between the major pathogens causing sepsis and BSI. Whereas conventional diagnostic approaches have focused on identifying the infecting pathogen, a growing body of evidence suggests that the host’s inflammatory response to the pathogen also represents a potential diagnostic tool. In vitro and in vivo experiments have revealed fundamental differences in host response to Gram-positive and Gram-negative bacterial infection [8]–[10], including significant differences in Toll-like receptor (TLR) signaling [11], [12] and cytokine production [13], [14]. Distinctive gene expression profiles exist for viral [15], [16], bacterial [17], [18], and fungal infections [19], [20] in both animal model systems and ex vivo stimulation of human peripheral blood leukocytes. Peripheral blood mononuclear cell (PBMC) gene expression signatures have also been evaluated in humans for a variety of conditions including severe infection [21], bacterial vs. viral illness [10], systemic lupus erythematosus [22], atherosclerosis [23], and radiation exposure [24]. Taken together, these studies provide strong evidence that global changes in host blood gene expression patterns can be used to differentiate disease states. The current study used S. aureus and Escherichia coli as prototypical Gram-positive and Gram-negative bacteria due to their prevalence and clinical relevance. Host gene expression was measured in mice with bacterial infection across multiple conditions. From these data, we derived a molecular classifier for S. aureus infection in inbred mice and validated it in a cohort of outbred mice. Next, we used host gene expression data from a well-characterized cohort of septic human subjects to identify a molecular classifier that accurately distinguished S. aureus BSI from E. coli BSI or uninfected controls. Murine and human S. aureus classifiers exhibited significant similarity particularly in comparing S. aureus infection to the healthy state. Furthermore, both murine and human classifiers were validated in an independent human cohort. This study is the first to demonstrate that the in vivo host response to Gram-positive infections is conserved from mouse to human and can be harnessed as a novel diagnostic strategy in patients with bacterial sepsis.

Materials and Methods

Ethics Statement

All animal experiments were carried out in strict accordance with the recommendations of NIH guidelines, the Animal Welfare Act, and US federal law. All animal procedures were approved by the Institutional Animal Care and Use Committee (IACUC) of Duke University (IACUC number: #1310905) which has been accredited by the Association for Assessment and Accreditation of Laboratory Animal Care (AAALAC) International. All animals were housed in a centralized and AAALAC accredited research animal facility that is fully staffed with trained husbandry, technical, and veterinary personnel. The Institutional Review Boards at Duke University Medical Center, the Durham VA Medical Center, and Henry Ford Hospital approved the human studies referenced in this work. Written informed consent was obtained for all subjects after the nature and possible consequences of the studies were explained.

Preparation of Bacterial Cells

One methicillin-susceptible S. aureus (Sanger 476) and three methicillin-resistant S. aureus genetic backgrounds (USA100, USA300, and MW2) were used. Overnight S. aureus cultures were inoculated into fresh tryptic soy broth and incubated aerobically at 30°C to log-phase growth (optical density 600 nm of ∼1.0) [25]. Cells were harvested by centrifugation, rinsed, and resuspended in phosphate-buffered saline (PBS). E. coli O18:K1:H7 was grown at 30°C overnight in Luria-Bertani broth [26]. Cultures were then diluted with fresh medium and grown for an additional 1 to 2 hours. Upon reaching log phase, cells were treated as described for S. aureus.

Human Subjects

Subjects were enrolled at Duke University Medical Center (DUMC; Durham, NC), Durham VAMC (Durham, NC), UNC Hospitals (Chapel Hill, NC), and Henry Ford Hospital (Detroit, Michigan) as part of a prospective, NIH-sponsored study to develop novel diagnostic tests for severe sepsis and community-acquired pneumonia (ClinicalTrials.gov NCT00258869) [27], [28]. Enrolled patients had a known or suspected infection and exhibited two or more Systemic Inflammatory Response Syndrome criteria [29]. Patients were excluded if they had an imminently terminal co-morbid condition, advanced AIDS (CD4 count <50), were being appropriately treated with an antibiotic pre-enrollment, or were enrolled in another clinical trial. Blood was drawn for microarray analysis on the day of hospital presentation with the exception of two subjects (S19 and S29). In these latter two cases, blood was not available for microarray preparation from that time point. However, blood drawn 24 hours into the hospitalization was available and so was used. Subjects in the current report had culture-confirmed monomicrobial BSI due to S. aureus (n = 32; median age 58 years; range 24–91) or E. coli (n = 19; median age 58; range 25–91). Uninfected controls (n = 43; median age 30 years; range 23–59) were enrolled at DUMC as part of a study on the effect of aspirin on platelet function among healthy volunteers [30]. Subjects were recruited through advertisements posted on the Duke campus. Blood used to derive gene expression data in these healthy controls was drawn prior to aspirin challenge.

Murine Sepsis Experiments

Except where noted, mice were purchased from The Jackson Laboratory (Bar Harbor, ME) and allowed to acclimate for 7 days. All experiments were performed on 6–8 week old mice. For the murine S. aureus classifier, seven inbred mouse strains (3 mice/strain: 129S1/SvImJ, A/J, AKR/J, BALB/cByJ, C57BL/6J, C3H/HeJ, and NOD/LtJ) were IP inoculated with 107 CFU/g of S. aureus Sanger476, euthanized at 2h after injection, and bled. This was repeated using the four different S. aureus genetic backgrounds (USA100, USA300, MW2, and Sanger476) in A/J mice (n = 3 per S. aureus background). For time series experiments, both A/J and C57BL/6J mouse strains were IP inoculated with S. aureus Sanger476 as above, and sacrificed at 2, 4, 6, and 12 h after injection (n = 5 per mouse strain at each time point). For survival experiments, mice were monitored twice daily after injection and culled upon reaching a moribund state. Animal sacrifice was carried out by carbon dioxide inhalation. Blood was collected by intracardiac puncture and stored in RNAlater at −70°C for microarray experiments. The murine E. coli infection model was carried out as described above except a smaller inoculum (6×104 CFU/g) was used. Furthermore, the time at which animals were sickest but still alive was 24 hours for E. coli inoculation, which is later than for S. aureus. Consequently, A/J and C57BL/6J mice inoculated with E. coli were sacrificed 24 h after challenge (n = 5 per mouse strain). Control mice were not injected. Outbred CD-1 mice were purchased from Charles River Laboratories (Wilmington, MA) to validate the murine S. aureus classifier. CD-1 mice were IP inoculated with 107 CFU/g of S. aureus (USA300 and Sanger 476) and 6×104 CFU/g of E. coli. Animals including controls were sacrificed at 2 and 24 h post-infection (n = 10 mice per pathogen at each time point). Blood was collected and stored as described for the derivation cohort.

Microarray Preparation (Additional Details Available in Methods S1)

Total RNA was extracted from mouse blood using the Mouse RiboPure Blood RNA kit (Ambion, Austin, TX) according to the manufacturer’s instructions. Globin mRNA was removed from whole blood RNA using the Globinclear kit (Ambion, Austin, TX). All samples passed the quality criteria of the Agilent Bioanalyzer and were used for microarray analysis. Since the total RNA yield of many samples was low, one round of linear amplification was performed for all samples using the MessageAmp Premier kit (Ambion, Austin, TX). RNA integrity numbers were calculated for all samples and found to be within tolerance limits. Microarrays were normalized using Robust Multichip Average (RMA). Affymetrix GeneChip Mouse Genome 430 2.0 Arrays were used (Santa Clara, CA). Biotin-labeled cDNA was hybridized to the arrays for 16 hours at 45°C according to the manufacturer’s instructions. Arrays were then washed and labeled with streptavidinphycoerythrin (strep-PE), and the signal was amplified using biotinylated antistreptavidin followed by another round of staining with strep-PE. These steps were performed on the Affymetrix fluidics station according to the recommended protocol. Amplification and microarray hybridization were performed at the Duke University Microarray Core. Labeled gene chips were scanned using an Affymetrix Genechip Scanner 7G (Santa Clara, CA). This array contains 45,101 probe sets to analyze the expression level of over 39,000 transcripts and variants from over 34,000 mouse genes. Human microarrays were prepared by first extracting total RNA from human blood using the PAXgene Blood RNA Kit (Qiagen, Valencia, CA) according to the manufacturer’s recommended protocol including DNase treatment. RNA quantity and quality was assessed using the Agilent 2100 Bioanalyzer (Agilent, Santa Clara, CA). RNA integrity numbers were calculated for all samples and found to be within tolerance limits. Microarrays were normalized using RMA. Hybridization and microarray data collection was then performed at Expression Analysis (Durham, NC) using the GeneChip Human Genome U133A 2.0 Array (Affymetrix, Santa Clara, CA) according to the “Affymetrix Technical Manual”. Fluorescent images were detected in a GeneChip Scanner 3000 and expression data was extracted using the GeneChip Operating System v 1.1 (Affymetrix). All GeneChips were scaled to a median intensity setting of 500. Murine and human microarray data have been deposited in the NCBI GEO (accession # GSE33341).

Deriving the Murine and Human S. aureus Classifiers

Microarray data was analyzed in two steps following the analysis strategy previously outlined and utilized [19]. First, a Bayesian sparse factor model was fit to the expression data without regard to phenotype [31], [32]. Second, factors were then used as independent variables to build a penalized binary regression with variable selection model [33] trained to identify S. aureus infection. In order to minimize issues with overfitting, batch was not included in the regression models. We used a Bayesian penalized regression technique for variable selection which allows for weighted model averaging of the resultant models, such that weights are computed from model fit on the training data [33]. The model averaging approach incorporates uncertainty in choice of model as well as regression coefficient. This has been shown to lead to out of sample predictive accuracies that are superior to penalized maximum likelihood approaches [34]. Assumptions for this approach are typical of probit regression including a linear response surface between predictors and the transformed latent probability variable. Genes were filtered for analysis using non-specific filtering for genes with high mean expression and high variance across samples. Samples with a high number of outlying genes were removed during the factor analysis. Mice were batched into discrete experiments with each experiment containing the relevant controls to avoid confounding. The development and application of this methodological approach has been previously described [15], [19], [31], [32], [35]–[42]. Using the same murine experimental data, another classifier was derived to classify methicillin-resistant vs. methicillin-sensitive S. aureus infection. The methodology was otherwise the same as that described above. We fit a factor model on the human data independently from the mouse data. The factor model was fit to 9,109 genes after non-specific filtering to remove unexpressed and uniformly expressed genes. Z-scores were computed independently for each gene without regard to experimental design. Subjects with absolute z-scores greater than 3 in more than 5% of the genes on the array were identified as outliers and were not used to fit the factor model. The factor model was trained on the 91 samples (after removal of three outliers) from three batches of expression data, and this resulted in 79 factors. These 79 factors were then projected onto the full data set (including the three subjects removed for validation) with the goal of distinguishing S. aureus BSI from healthy controls or E. coli BSI. Leave-one-out cross-validation was utilized in order to control for overfitting of the penalized binary regression model. In order to minimize issues with overfitting, batch was not included in the regression models. Matlab (Natick, MA, USA) scripts to perform these operations are available. Nonparametric testing was used to evaluate model performance (Wilcoxon rank sum for 2-group comparisons or Kruskal-Wallis for 3 or more-group comparisons) unless otherwise indicated. One limitation of this approach is that the marginal significance of genes within the factor-based classifier cannot be defined. Instead, gene lists were created to identify genes with differential expression between specified groups with respect to gene-level and factor-level analyses. For 3-group comparisons (S. aureus vs. E. coli vs. Healthy controls) one-way analysis of variance (ANOVA) was used. For pairwise comparisons, Student’s t-test was used. Results were statistically significant at p<0.05 after Bonferroni correction for multiple testing. Spreadsheets of gene/factor lists are provided as supplemental material.

Creating a Human Ortholog of the Murine S. aureus Classifier

We used Chip Comparer (http://chipcomparer.genome.duke.edu/) to identify human orthologs for all possible mouse genes. When there were multiple orthologs, we preferentially used the anti-sense target probes that shared the fewest probes with other genes as identified by the probe label. Chip Comparer identified 17,600 probe sets on the Affymetrix GeneChip Human Genome U133A 2.0 Array that have orthologs in the Affymetrix GeneChip Mouse Genome 430 2.0 Array. Factor scores from the mouse factor model were estimated using this set of 17,600 genes as follows: Given a matrix of expression values, X, and a factor model X = BF+e, we first replaced missing values by mean expression levels for those genes. Step 2: Inverse regression was utilized to compute F*, to estimate the factor scores. Step 3: We estimated X by computing BF* and replaced missing values with the corresponding values from this matrix. Steps 2 and 3 were then repeated until the estimates for the missing values converged.

External Validation in an Independent Cohort

To externally validate the murine and human S. aureus classifiers, we utilized publically available expression data from a pediatric cohort with S. aureus infection and healthy controls [18]. Hospitalized children with invasive S. aureus infection were enrolled with sample collection occurring after microbiological confirmation. Healthy controls included children undergoing elective surgical procedures and at healthy outpatient clinic visits. This dataset includes multiple expression platforms. For the purposes of consistency, we only included subjects with Affymetrix U133A data yielding 46 S. aureus-infected patients and 10 healthy controls. Given the absence of subjects with E. coli infection in the validation cohort, we derived new murine and human S. aureus classifiers that excluded animals or subjects with E. coli infection. These classifiers were derived and then projected onto the 56-sample validation cohort as described heretofore.

Heat Map Generation

In order to generate heat maps of gene expression, we first turned to the factors from the murine and human S. aureus classifiers. Probes from each factor were identified and tested for differential expression in a one-way ANOVA. Probes with significantly different levels of expression after Bonferroni correction were retained. For the murine data, there were thousands of probes (∼1000–3000, typically) meeting these criteria. Consequently, the p-values were sorted in ascending order and the 100 most significant probes from each factor were retained. Duplicate probes across the factors were removed. The human expression heat map was created in the same manner except all significant probes are presented considering there were fewer factors and genes in the human S. aureus classifier as compared to the murine classifier. Heat maps were generated using Matlab (Natick, MA, USA).

Pathway Analysis

Pathway analysis for functional annotation of genes was performed with the MetaCore tool of the GeneGO package (GeneGo, Inc., St. Joseph, MI, USA) (http://www.genego.com). P-values were assigned to pathways based on the number of genes mapping to a particular pathway relative to the total number of genes in that pathway. Statistically significant pathways were defined as a p-value <0.05 (False Discovery Rate [FDR]-adjusted) based on hypergeometric distributions [19].

Results

Murine Sepsis due to S. aureus and E. coli

Clinically relevant S. aureus infections in humans typically arise from a primary focus with secondary dissemination. To mimic this process, mice were inoculated via the intraperitoneal (IP) route [25]. Infection-susceptible and infection-resistant inbred mouse strains (A/J and C57BL/6J, respectively) [43], [44] were inoculated with S. aureus (Sanger476) or E. coli (O18:K1:H7) (n = 5 per mouse strain and bacterial species). A survival analysis was carried out to determine the optimal duration of infection for subsequent experiments (). Based on this data, A/J and C57BL/6J mice were infected with S. aureus (sacrificed at t = 0, 2, 4, 6, and 12 hours post-infection; n = 10 animals/time point) or E. coli (t = 0, 2, 6, 12, and 24 hours post-infection; n = 10 animals/time point). The effect of infection status, bacterial pathogen, and duration of infection on global patterns of gene expression was assessed using principal component analysis (PCA) (Partek Genomics Suite) () [45]. Gene expression patterns clustered by infection status and by pathogen (S. aureus vs. E. coli). Animals infected with S. aureus demonstrated a time-dependent change in gene expression that first manifested at two hours, by which time bacteremia has occurred [46]. This pattern remained stable through 12 hours, when most animals have succumbed to sepsis. E. coli-infected animals did not reveal this time-dependent progression based on the time points sampled, but had a distinctly different pattern of gene expression that was evident at 2 hours and persisted through 24 hours following infection. A heat map depicting the time-dependent nature of these gene expression changes is presented in .

Peripheral Blood Gene Expression Signatures Classify S. aureus-infected from Uninfected Mice

To create a host gene expression-based classifier for S. aureus infection, mice from a variety of experimental conditions were utilized (n = 187 total). Seven strains of inbred mice were challenged with 4 S. aureus genetic backgrounds via IP inoculation and sacrificed at various time points as described in Experimental Procedures. The comparator group for model derivation included 50 A/J or C57BL/6J mice inoculated with E. coli (O18:K1:H7) as well as 54 non-inoculated mice. Whole blood mRNA was used to generate microarray expression data. A list of differentially expressed genes is presented in . presents the number of overlapping genes in each pairwise comparison. Patterns of co-expressing genes were defined using sparse latent factor regression in an unsupervised manner (i.e. without knowledge of the source animal’s infection status) [31], [32]. Factor models are a well-known technique for describing correlation structure in high dimension, low sample size data sets. Our sparse latent factor model works by collecting genes that are highly correlated into groups. Predictive models are then built from the latent factors – vectors that describe the aggregate behavior of the group. Subsequently, these factors served as independent variables in a variable selection, binary regression model to distinguish animals with and without S. aureus infection. This approach was taken in lieu of using individual gene expression changes for several reasons. A given gene with biological relevance may be differentially expressed in response to S. aureus infection but not to the degree that would meet statistical significance. Considering this altered gene expression exists amid a network of other such changes, the collective perturbations in that particular pathway would be more easily detected using factor analysis. Furthermore, changes across multiple biological pathways will be reflected across multiple factors. These can then be collectively harnessed for their diagnostic potential using a binary regression model. Thirty factors were identified, of which 16 demonstrated a pattern of expression significantly associated with infection status (mFactors 15, 7, 23, 13, 9, 29, 28, 2, 17, 16, 21, 1, 5, 4, 26, and 19 in order of greatest significance; ANOVA; p<0.0017 for S. aureus vs. control vs. E. coli after Bonferroni correction; ). These 30 factors were fitted into a penalized binary regression model, termed the “murine S. aureus classifier”. The best performing model, as defined by the model with the largest log-likelihood value, included four factors (mFactors 7, 15, 23, and 26). Other models may be just as adequate, but we are only referring to this “top” model. Leave-one-out cross-validation was used to control overfitting and to estimate the model’s performance in subgroups of experimental conditions as described below (mouse strain, S. aureus genetic background, duration of infection, and bacterial species [S. aureus vs. E. coli]). A schematic of the derivation and validation experiments is depicted in .
Figure 1

Schematic of derivation and validation cohorts.

The Murine Derivation Cohort includes S. aureus infection (n = 83), healthy control mice (n = 54), and E. coli infection (n = 50). It served as a validation cohort to assess Mouse Strain Effect, S. aureus Genetic Background Effect, Time Course, and to compare S. aureus vs. E. coli and E. coli vs. Healthy. The murine S. aureus classifier was externally validated in Outbred Mice (n = 30) and the CAPSOD Human Cohort. The CAPSOD Human Cohort includes S. aureus BSI (n = 32), healthy volunteers (n = 43), and E. coli BSI (n = 19). It served as a validation cohort to compare S. aureus vs. Healthy, S. aureus vs. E. coli, and E. coli vs. Healthy. Model derivation and validation using the entire cohort of animals or humans is depicted by the blue outline and arrows. An independent classifier was generated using only subjects with S. aureus or E. coli BSI (green outline). This classifier was validated using leave one out cross validation (green arrow). The Human Pediatric Cohort (n = 46 S. aureus, 10 Healthy) used for external validation does not include patients with E. coli infection. Therefore, S. aureus classifiers were generated from the murine and CAPSOD cohorts that excluded E. coli data (red outline and thick red arrow). The Human Pediatric Cohort was used to derive a Human S. aureus vs. Healthy classifier which was validated in the S. aureus-infected and Healthy populations within the murine and CAPSOD human cohorts (thin red arrow).

Schematic of derivation and validation cohorts.

The Murine Derivation Cohort includes S. aureus infection (n = 83), healthy control mice (n = 54), and E. coli infection (n = 50). It served as a validation cohort to assess Mouse Strain Effect, S. aureus Genetic Background Effect, Time Course, and to compare S. aureus vs. E. coli and E. coli vs. Healthy. The murine S. aureus classifier was externally validated in Outbred Mice (n = 30) and the CAPSOD Human Cohort. The CAPSOD Human Cohort includes S. aureus BSI (n = 32), healthy volunteers (n = 43), and E. coli BSI (n = 19). It served as a validation cohort to compare S. aureus vs. Healthy, S. aureus vs. E. coli, and E. coli vs. Healthy. Model derivation and validation using the entire cohort of animals or humans is depicted by the blue outline and arrows. An independent classifier was generated using only subjects with S. aureus or E. coli BSI (green outline). This classifier was validated using leave one out cross validation (green arrow). The Human Pediatric Cohort (n = 46 S. aureus, 10 Healthy) used for external validation does not include patients with E. coli infection. Therefore, S. aureus classifiers were generated from the murine and CAPSOD cohorts that excluded E. coli data (red outline and thick red arrow). The Human Pediatric Cohort was used to derive a Human S. aureus vs. Healthy classifier which was validated in the S. aureus-infected and Healthy populations within the murine and CAPSOD human cohorts (thin red arrow). The ability of the murine-derived host gene expression classifier to identify S. aureus infection was tested in 7 inbred mouse strains of varying infection susceptibilities [43]. In all 7 strains, the murine S. aureus classifier accurately differentiated S. aureus-infected from control mice (p = 4.89×10−16; AUC = 0.9964) ( ). The ability to characterize S. aureus infection persisted when A/J mice (infection-susceptible) were challenged with four different S. aureus backgrounds: USA100 (the predominant US nosocomial methicillin resistant S. aureus [MRSA] genetic background); USA300 (the predominant US community-acquired MRSA genetic background); USA400 (MW2); and Sanger 476 (a methicillin-susceptible genetic background) (p = 1.92×10−10 vs. control mice; AUC = 1.00) ( ). Furthermore, the murine S. aureus classifier consistently discriminated S. aureus infected mice from controls at 2, 4, 6, and 12 hours post-inoculation (p = 4.41×10−16 vs. uninfected mice; AUC 1.00) ( ). This time interval was selected because two hours is the earliest time point at which S. aureus can be cultured from blood; while 12 hours was the point at which animals began to die of S. aureus sepsis (). In summary, a classifier based on murine-derived host gene expression accurately identified the presence of S. aureus infection in mice under a variety of host, pathogen, and temporal conditions.
Figure 2

Murine S. aureus classifier accurately identifies S. aureus infection under a variety of conditions.

Conditions represented include different murine hosts (A), bacterial genetic backgrounds (B), and time from inoculation (C). Animals with S. aureus infection are depicted by a red “x”. Uninfected control mice are depicted by black circles.

Murine S. aureus classifier accurately identifies S. aureus infection under a variety of conditions.

Conditions represented include different murine hosts (A), bacterial genetic backgrounds (B), and time from inoculation (C). Animals with S. aureus infection are depicted by a red “x”. Uninfected control mice are depicted by black circles.

Murine S. aureus Classifier Distinguishes S. aureus-infected from E. coli-infected Mice

Next, we determined whether the murine S. aureus classifier could differentiate S. aureus from E. coli infection. Both the infection-susceptible A/J and infection-resistant C57BL/6J strains were infected with either S. aureus (Sanger 476) or E. coli (O18:K1:H7). Animals were sacrificed at 2, 6, and 12 hours after inoculation. The murine S. aureus classifier correctly identified 50 of 53 (94.3%) animals as either infected with S. aureus or not at 2 hours (50/53), 100% of animals at 6 hours (n = 20), and 96.7% of animals at 12 hours (29/30) ( ). This corresponded to an overall p-value of 7.94×10−26 by Kruskal-Wallis test (comparing S. aureus vs. E. coli vs. Healthy controls) with an AUC of 0.9935 across all time points. Next, the murine S. aureus classifier was independently validated in outbred CD-1 mice with S. aureus infection (Sanger 476 or USA300), E. coli infection (O18:K1:H7), or uninfected controls (10 animals per condition). The murine-derived S. aureus model accurately classified 95% of all animals where the reference standard was the known experimental condition (38/40; p = 1.47×10−6; 90% sensitivity and 100% specificity; AUC = 0.9775) ( ).
Figure 3

The murine S. aureus classifier differentiates S. aureus from E. coli infection.

(A) Inbred mice were tested under three conditions: uninfected controls (black circles), S. aureus infected (red “x”), and E. coli infected (blue triangles). The y-axis represents the predicted probability that a given animal was infected with S. aureus. (B) The murine S. aureus classifier is validated in outbred CD-1 mice where it differentiates S. aureus infection from E. coli infection and uninfected controls.

The murine S. aureus classifier differentiates S. aureus from E. coli infection.

(A) Inbred mice were tested under three conditions: uninfected controls (black circles), S. aureus infected (red “x”), and E. coli infected (blue triangles). The y-axis represents the predicted probability that a given animal was infected with S. aureus. (B) The murine S. aureus classifier is validated in outbred CD-1 mice where it differentiates S. aureus infection from E. coli infection and uninfected controls. The murine S. aureus classifier was generated to identify S. aureus infection within a population including both healthy and E. coli-infected animals. However, it is possible this classifier is primarily distinguishing “sick” from “not-sick” phenotypes. In such a case, it would be expected that the classifier would still differentiate animals with E. coli infection from uninfected controls. However, this was not observed (AUC 0.5089; p = 0.8785) demonstrating the specificity of this classifier for S. aureus infection. Thus, a murine-derived host gene expression classifier accurately distinguished S. aureus-infected from E. coli-infected or uninfected mice across multiple host strains, pathogens, post-infection time points, and was validated in outbred mice. Given this ability to discriminate infection due to different bacterial species, we further explored the potential for a factor-based classifier to distinguish infection due to methicillin-resistant (MRSA) or methicillin-sensitive S. aureus (MSSA), which have been shown to differ in their pathogenicity and virulence. The same 30 factors described above were fitted into a penalized binary regression model with the specific aim of differentiating MRSA from MSSA infection. Leave-one-out cross-validation was used to control overfitting and to estimate the model’s performance in a population of 19 MRSA-infected and 84 MSSA-infected mice (). Despite some overlap, this classifier accurately differentiated infection due to MRSA or MSSA (AUC 0.8396; p = 4.14×10−6). Genes discriminating infection due to MRSA or MSSA that remained significant after adjusting for multiple tests are presented in .

Human S. aureus Classifier

We next determined whether peripheral blood gene expression in humans could yield a classifier for S. aureus BSI. Peripheral whole blood mRNA from 32 patients with S. aureus BSI, 19 patients with E. coli BSI, and 43 healthy control subjects were used to generate microarray data ( ). A list of differentially expressed genes is presented in . Figure S6 presents the number of overlapping genes in each pairwise comparison. Seventy-nine factors were defined and fitted into a linear regression model trained to identify the presence of S. aureus BSI. Although 17 factors were independently associated with S. aureus BSI (), only two factors remained in the best-performing model (hFactors 20 and 74). Similar to the murine S. aureus classifier, the human S. aureus classifier was generated blind to microbiological diagnosis in an unsupervised manner. Gender was controlled for in the model’s derivation considering the predilection for female sex in E. coli BSI ( ). We then estimated the model’s performance in phenotypic subgroups using leave-one-out cross-validation. The classifier accurately differentiated those with S. aureus BSI from healthy controls (72/75 correctly classified; AUC = 0.9898; p = 5.41×10−13) ( ). The human S. aureus classifier also correctly distinguished S. aureus from E. coli BSI in 82% (42/51) of cases (AUC = 0.8372; p = 6.77×10−4). When the human S. aureus classifier was applied to subjects with E. coli BSI vs. healthy controls, we observed an intermediate level of discrimination (56/62 correctly classified; AUC 0.9229; p = 1.38×10−7). This suggests that the human classifier is partially pathogen specific since E. coli BSI could also be distinguished from healthy controls but not with the same degree of accuracy as S. aureus BSI. A heat map depicting these gene expression changes is presented in .
Table 1

Description of human subjects used to generate a S. aureus classifier.

SubjectSubject CategoryRaceAgeGenderSource of InfectionPositive Culture SourcesWBCPMN %
S1 S. aureus White82MaleEndocarditisBlood23.696.4
S2 S. aureus White70FemaleSkinBlood, Wound, Operative cultures11.6N/Aa
S3 S. aureus Black41MaleCatheterb Blood14.8N/A
S4 S. aureus White81MaleSkinBlood, Pleural fluid16.2N/A
S5 S. aureus White81MaleBoneBlood17.3N/A
S6 S. aureus Black55MaleCatheterBlood, Vascular catheter site14.389.8
S7 S. aureus Black69FemaleCatheterBlood13N/A
S8 S. aureus Black44FemaleCatheterBlood1390
S9 S. aureus Black51MaleSkinBlood6.973
S10 S. aureus Black47MaleSkinBlood1287
S11 S. aureus White36FemaleEndocarditisBlood22.785
S12 S. aureus White54MaleBoneBlood9.888
S13 S. aureus Black55MaleBoneBlood, Skin, Synovial fluid18.387
S14 S. aureus Black42MaleUnknownBlood, Urine, Sputum7.682
S15 S. aureus Black52MaleBoneBlood10.979
S16 S. aureus Black55MaleBoneBlood, Skin28.695
S17 S. aureus White52MaleSkinBlood, Skin17.984
S18 S. aureus N/A51FemaleLungBlood19.878
S19 c S. aureus Black40MaleSkinBlood, Skin14.793
S20 S. aureus White60MaleSkinBloodN/AN/A
S21 S. aureus Black59MaleCatheterBlood7.575.5
S22 S. aureus Black58MaleBoneBlood27.9N/A
S23 S. aureus Black77MaleUrinary tractBlood, Urine, Skin9.880.3
S24 S. aureus Black91MaleBoneBlood15.193
S25 S. aureus White75FemaleCatheterBlood692
S26 S. aureus Black58MaleCatheterBlood23.987
S27 S. aureus Black24MaleUrinary TractBlood, Urine, Sputum1676.4
S28 S. aureus White74MaleSkinBlood, Abscess33.289
S29 c S. aureus Black70MaleSkinBlood19.682
S30 S. aureus White61MaleBone/CNSd Blood, Abscess10.486
S31 S. aureus, S. pneumoniae Black52MaleLungBlood (S. aureus); Antigen test (S. pneumoniae)6.193
S32 S. aureus Black38MaleEndocarditisBlood16.894
E1 E. coli Black43FemaleUrinary tractBlood, Urine32.687.3
E2 E. coli White49FemaleUrinary tractBlood1492.4
E3 E. coli Black44FemaleUrinary tractBlood, Urine15.7N/A
E4 E. coli White70FemaleUrinary tractBlood, Urine20.788
E5 E. coli Black40MaleUrinary tractBlood, Urine1583
E6 E. coli White91FemaleUrinary tractBlood, Urine5.6N/A
E7 E. coli Black25FemaleUrinary tractBlood, Urine11.188
E8 E. coli White62MaleUrinary tractBlood, Urine13.3N/A
E9 E. coli Black70MaleUrinary tractBlood2.494
E10 E. coli Black32FemaleUrinary tractBlood, Urine25.1N/A
E11 E. coli White54FemaleUrinary tractBlood, Urine10.890
E12 E. coli White74FemaleUrinary tractBlood, Urine7.397
E13 E. coli Black79FemaleLungBlood16.977
E14 E. coli Black41MaleUrinary tractBlood, Urine14.377.6
E15 E. coli White65MaleUrinary tractBlood, Urine21.685
E16 E. coli White63FemaleUrinary tractBlood, Urine8.5N/A
E17 E. coli White81FemaleUrinary tractBlood, Urine14.186.5
E18 E. coli Black69FemaleUrinary tractBlood, Urine11.167.6
E19 E. coli White55FemaleUrinary tractBlood, Urine7.287.5
H1HealthyBlack27Male
H2HealthyWhite24Female
H3HealthyWhite29Female
H4HealthyWhite26Male
H5HealthyAsian30Male
H6HealthyBlack24Male
H7HealthyWhiteN/AMale
H8HealthyAsian24Male
H9HealthyAsian23Male
H10HealthyWhite50Female
H11HealthyWhite23Female
H12HealthyWhite24Female
H13HealthyWhite44Male
H14HealthyWhite24Female
H15HealthyWhite28Female
H16HealthyWhite26Male
H17HealthyAsian30Female
H18HealthyBlack26Male
H19HealthyWhite25Male
H20HealthyWhite24Male
H21HealthyWhite24Male
H22HealthyAsian25Female
H23HealthyBlack24Female
H24HealthyWhite43Female
H25HealthyWhite26Female
H26HealthyBlack59Male
H27HealthyBlack25Female
H28HealthyWhite24Male
H29HealthyWhite25Male
H30HealthyWhite26Male
H31HealthyWhite24Male
H32HealthyWhite26Male
H33HealthyN/A25Male
H34HealthyWhite53Female
H35HealthyBlack45Female
H36HealthyWhite23Male
H37HealthyWhite26Female
H38HealthyWhite27Male
H39HealthyAsian43Female
H40HealthyBlack32Female
H41HealthyN/A25Female
H42HealthyBlack43Female
H43HealthyWhiteN/AFemale

N/A – Not available.

Catheter refers to vascular catheters.

Gene expression data for S19 and S29 was generated from blood drawn on the second hospital day. Blood drawn on the day of admission was otherwise used for all other infected subjects.

This subject had vertebral osteomyelitis associated with an epidural abscess.

Table 2

Characteristics of human subjects used for S. aureus classifier derivation.

S. aureus (n = 32)Gram-negative (n = 19)Healthy (n = 43)
Age in years, mean (range)58 (24–91)58 (25–91)30 (23–59)
Gender, n (%)
Female6 (19)14 (74)21 (49)
Male26 (81)5 (26)22 (51)
Race, n (%)
Black20 (63)9 (47)9 (21)
White11 (34)10 (53)26 (60)
Asian006 (14)
Unknown1 (3)02 (5)
Dialysis, n (%)12 (38)00
Diabetes, n (%)13 (41)3 (16)0
Immunosuppression, n (%)2 (6)2 (11)0
Figure 4

Performance of the human S. aureus classifier.

(A) The human S. aureus classifier differentiates S. aureus BSI from both uninfected controls and E. coli BSI. (B) A separate classifier was generated using only S. aureus and E. coli-infected human subjects and tested using leave-one-out cross-validation.

Performance of the human S. aureus classifier.

(A) The human S. aureus classifier differentiates S. aureus BSI from both uninfected controls and E. coli BSI. (B) A separate classifier was generated using only S. aureus and E. coli-infected human subjects and tested using leave-one-out cross-validation. N/A – Not available. Catheter refers to vascular catheters. Gene expression data for S19 and S29 was generated from blood drawn on the second hospital day. Blood drawn on the day of admission was otherwise used for all other infected subjects. This subject had vertebral osteomyelitis associated with an epidural abscess. In the human S. aureus classifier described above, it is the inclusion of healthy controls that drives the discrimination from S. aureus BSI. Considering the clinical importance of differentiating Gram-positive from Gram-negative infections, rather than sick vs. healthy, we created a penalized binary regression model with the specific aim of differentiating human S. aureus (n = 32) from E. coli (n = 19) BSI. In this cohort, 52 factors were identified (different from the 79 factors identified when Healthy was included) of which only hFactor 40 remained in the top performing model after controlling for gender. Using leave-one-out cross-validation ( ), this model had a sensitivity of 62.5% (20/32 S. aureus BSIs correctly classified) but a specificity of 94.7% (18/19 E. coli BSIs correctly classified). This corresponds to an AUC of 0.8503 (p = 3.47×10−5).

A Murine S. aureus Classifier Identifies S. aureus Infection in Humans

We then determined whether the murine S. aureus classifier could identify S. aureus BSI in humans. To accomplish this, the murine S. aureus classifier was projected onto human gene expression data. Specifically, Chip Comparer (http://chipcomparer.genome.duke.edu/) provided a modified representation of the Affymetrix Mouse Genome 430 2.0 Array that only included orthologs of transcripts represented on the Affymetrix Human Genome U133A 2.0 Array. This resulted in a murine S. aureus classifier consisting only of genes with human orthologs (68.6% of the total array representation). We then evaluated this classifier in our human cohort. To account for potential species-specific variation in gene expression, predicted probabilities were plotted on a logit rather than a probabilistic scale. Using this murine S. aureus classifier, human patients with S. aureus BSI were distinguished from healthy controls (AUC = 0.9484; p = 4.00×10−11) ( ). Thus, the host response to S. aureus infection was sufficiently conserved that a predictive model generated in one species (Mus musculus) identified S. aureus BSI in another (Homo sapiens). However, the murine-derived S. aureus classifier did not differentiate between S. aureus and E. coli BSI in humans (AUC = 0.5905; p = 0.2883).
Figure 5

Projecting the mouse S. aureus classifier onto human subjects.

The murine S. aureus classifier identifies humans with S. aureus BSI, but does not differentiate S. aureus from E. coli BSI.

Projecting the mouse S. aureus classifier onto human subjects.

The murine S. aureus classifier identifies humans with S. aureus BSI, but does not differentiate S. aureus from E. coli BSI.

Validation of Murine and Human Classifiers in an Independent Pediatric Population

We externally validated the murine and human S. aureus classifiers in an independent human cohort [18]. This validation cohort consisted of pediatric patients hospitalized due to invasive S. aureus infection (n = 46) and healthy controls (n = 10) who had gene expression data generated on a compatible platform (U133A array) with that used in this study. This cohort did not enroll children with E. coli infections. For this reason, we excluded E. coli infection from both classifiers. New murine and human S. aureus classifiers were developed in the same manner described above but without E. coli-related expression data. This modified murine S. aureus classifier was comprised of mFactors 7, 15, and 26 but not mFactor23. The modified human S. aureus classifier only contained hFactor4. Both the murine and human S. aureus classifiers differentiated children with S. aureus infection from healthy controls in this validation cohort (murine classifier AUC = 0.9522, p-value = 9.03×10−6 [ ]; human classifier AUC 0.9217, p-value 3.48×10−5 [ ]). The converse was also true. A S. aureus classifier trained on this independent pediatric cohort accurately discriminated S. aureus infection from healthy controls in our CAPSOD human cohort (70/75 correctly classified; AUC = 0.9775, p-value = 2.03×10−12) and murine cohort (123/137 correctly classified; AUC = 0.9255; p = 4.56×10−17).
Figure 6

Validation in an independent human cohort [18].

(A) The murine S. aureus classifier differentiates between S. aureus infection and healthy. (B) The human S. aureus classifier differentiates between S. aureus infection and healthy.

Validation in an independent human cohort [18].

(A) The murine S. aureus classifier differentiates between S. aureus infection and healthy. (B) The human S. aureus classifier differentiates between S. aureus infection and healthy.

S. aureus Infection Induces Similar Host Gene-expression Responses in Mouse and Human

Pairwise comparisons were performed to identify genes with significantly different levels of expression (after Bonferroni correction). Comparisons included S. aureus infection vs. Healthy, E. coli infection vs. Healthy, and S. aureus vs. E. coli infection in mice and humans (). Genes from each pairing were entered into the GeneGo pathway map database. The 50 most significant biological pathways arising from the pairwise comparisons are presented in . The genes represented within common pathways are presented in . A similar number of pathways overlapped between the murine and human responses to S. aureus (12 of the top 50) and E. coli (14 of the top 50) infection. Most of the overlapping pathways in the murine and human responses to both S. aureus and E. coli belonged to the broad category of immune response including CD28, ICOS, and the MEF2 pathway. Cytoskeletal remodeling (TGF and WNT) and apoptosis were also common to both infection types in mice and humans. Some pathways were highly significant in the S. aureus vs. Healthy comparison but not manifest in E. coli vs. Healthy such as NF-κB-associated pathways; the CD40 immune response pathway; and clathrin-coated vesicle transport. As expected, these pathways were also differentially manifest in the direct comparison of murine S. aureus and E. coli infection. We did not identify any statistically significant probes that distinguished human S. aureus from E. coli BSI. One probe, corresponding to the F2RL3 gene, nearly met this statistical cutoff (p-value 5.94×10−6 with a cutoff of 2.24×10−6). F2RL3 encodes proteinase-activated receptor 4 [47]. This molecule is a G-protein coupled receptor activated by thrombin and trypsin but has not previously been implicated in the sepsis or immune response. It is expressed in multiple tissues with high levels in the lung, pancreas, thyroid, testis, and small intestine but not peripheral blood or lymphoid tissues [47].

Discussion

Early diagnostic strategies for S. aureus BSI could improve patient care by reducing the time required to establish the diagnosis and provide appropriate treatment while avoiding unnecessary anti-MRSA antibiotics. The current investigation contributes to this goal through three key findings. First, S. aureus infection induces conserved host gene expression responses in mice that can differentiate from E. coli-infected or uninfected mice. This discovery was consistent and robust across multiple inbred mouse strains, S. aureus genetic backgrounds, time points, and was validated in outbred mice. The validation step strengthens generalizability and is an important improvement over previous murine gene-expression based classifiers that were developed and tested in only a single inbred mouse strain including the fields of infectious diseases [19], [48], [49]; cancer progression [50], [51]; and aging [52], [53]. Furthermore, this murine predictor was specific for S. aureus infection and not simply a marker of illness based on the observation that mice with E. coli sepsis could not be distinguished from healthy, uninfected animals. The murine S. aureus classifier performed equally well at multiple time points despite progression of illness lending additional support to the specificity of this classifier. Second, human-derived host gene expression signatures differentiated S. aureus BSI from E. coli BSI or uninfected controls. In contrast to the murine-based classifier, the human-based model was less pathogen specific but still provided a significant degree of differentiation between S. aureus and E. coli BSI. Finally, the responses to S. aureus infection are highly conserved at the transcriptional and pathway level. This conserved response allowed us to validate the murine- and human-derived S. aureus classifiers in an independent cohort of S. aureus-infected patients. Previous efforts to identify a discriminatory host gene expression signature for Gram-positive versus Gram-negative infections have yielded inconsistent results. This is likely due to the observation that transcriptional data derived from complex phenotypes such as infection do not produce just one predictive gene set, but rather generate multiple gene sets associated with the phenotype in question [54]. Some studies report a common pattern of host gene expression [55]–[57], whereas others have identified different expression profiles [8]–[10], [58]. In the current investigation, we utilized well-established methodologies [15], [19], [31], [36], [38]–[41] to derive predictors for S. aureus infection in both mice and humans from gene expression data. A key component of this methodology was a dimensional reduction step generating sets of co-expressed genes, termed “factors”. We observed that multiple, individual factors differentiated between various infection states although none performed universally well. For example, mFactor15 was associated with the lowest overall p-value during model generation. The AUC was 0.9587 for S. aureus vs. uninfected control mice but only 0.7942 for S. aureus vs. E. coli. In contrast, mFactor23 had an AUC of 0.9800 for S. aureus vs. E. coli but an AUC of 0.5926 for S. aureus vs. uninfected control mice. In order to generate a more robust classifier, factors were used as independent variables to generate a binary regression model. Factor models are an excellent technique for estimating correlation structure in very high dimensional data sets. This comprised the second step in generating the S. aureus predictors. It was only by including all factors to build the classifier that we could validate the model in the broadest set of conditions including different bacterial pathogens. However, because factors are typically made up of many genes, it is difficult to estimate the marginal effect of removing single genes from predictors. As such, it can be challenging to move from predictors based on factors to predictors based on small gene subsets. Although redundancy among the genes in a molecular classifier is expected and is a potential limitation, such redundancy can also improve robustness for a specific phenotype [54] as is likely to be the case in discriminating S. aureus from E. coli infection in mice. Comparisons at the individual gene level, as with pairwise comparisons, are likely to reveal differences in relatively simple biological responses. In contrast, dimension reduction with factor modeling as utilized in this study incorporates differences across multiple pathways, allowing for the detection of changes in a more complex pathobiology. Additionally, our factor model construction does not incorporate known biological pathways. This leads to gene groupings that are sometimes difficult to interpret. The advantage of the approach is the extreme dimension reduction which allows for discovery and cross-validation on very small data sets. This is one possible explanation for why the human S. aureus classifier differentiated S. aureus from E. coli whereas no genes met the threshold for differential expression after Bonferroni correction in a pairwise comparison between these two patient populations. The strength of this approach is offset by the possibility that smaller or transient changes in gene expression might be missed. It should be noted that the classifiers described in this study are not intended to be of clinical grade, which would require a more restrictive set of discriminating genes. Furthermore, there are likely many combinations of genes and factors that would perform similarly to that described here. This study presents findings related to the best performing classifier using the described methodologies. Defining the smallest, non-redundant set of genes that retains adequate discriminating power would be a vital next step in generating a clinically-useful diagnostic. In addition, any host response-based diagnostic requires validation across a range of clinical states. Immunocompromise is a particular condition in which it cannot be assumed the host immune response follows the paradigms identified here. The murine model has been effectively used to gain insights into the pathophysiology of sepsis in general and S. aureus in particular [43], [44]. Murine-derived gene expression signatures have also been successfully translated to non-infectious human conditions such as radiation exposure and breast cancer [24], [59], [60]. Here, we describe the robust performance of a murine-derived S. aureus classifier in both mice and humans and also offer several lines of evidence supporting a partially conserved host response to S. aureus infection in both host species. First, the murine-based predictor could differentiate human S. aureus BSI from uninfected controls. Second, the genetic pathways were highly conserved. For example, most of the relevant murine pathways were also significantly associated with S. aureus BSI in humans. Finally, the murine-based predictor was highly accurate in classifying S. aureus infection in an independent human cohort. Despite the robust performance of the murine classifier when applied to a human population, the ideal animal model for human sepsis remains elusive [61]–[63]. For example, virtually no murine-based sepsis studies have been replicated in patients [64], [65]. Other sepsis studies in mice and humans yield discordant results. For example, the impact of TNF-α receptor therapy on septic mice [66] and humans [67] yielded contradictory results. In fact, more than 60 incongruities between murine and human immune systems have been recognized many of which involve host-pathogen interactions [68], [69]. Our results are consistent with these earlier observations. For example, we encountered inconsistences between murine and human responses to S. aureus such that a minority (12) of the top 50 pathways overlapped between the two species. Moreover, the human response to S. aureus when compared to E. coli was differentiated by only one gene, F2RL3, which nearly reached the threshold for statistical significance. This is in contrast to the many genes identified differentiating the murine response to S. aureus and E. coli infection although F2RL3 is notably absent from this list. These host species-specific differences in sepsis, as well as infection-specific characteristics such as anatomic site of infection (e.g. genitourinary tract for E. coli vs. skin/soft tissue for S. aureus) limit our ability to apply knowledge gained from animal sepsis models to humans. It is also worth noting that batch effects and their correction may introduce bias in the form of false positives in the gene selection output [54]. However, this effect would be equally distributed among the S. aureus infected, E. coli infected, and healthy subjects. Finally, the ability to distinguish bacterial sepsis from healthy is expectedly easier than the finer distinction between two offending bacterial pathogens. It is therefore not surprising that the murine S. aureus predictor did not differentiate S. aureus from E. coli infection in the human cohort. Comorbid disease such as diabetes or end-stage renal failure, which we observed in a minority of the infected human cohort, could be confounding the analysis and driving the differentiation between healthy human controls and those with infection (S. aureus or E. coli BSI). Without controlling for comorbid disease, such a confounding effect cannot be excluded. However, the human S. aureus classifier performed exceptionally well in differentiating infected individuals from healthy controls even in those patients without comorbid disease. Furthermore, the murine classifier (derived from mice without comorbid disease) could still differentiate infected human subjects from the healthy human cohort. These factors make it unlikely that comorbid disease is playing a significant role in the analysis although future attempts at deriving a gene-expression-based classifier should make accommodations for the possible confounding effect of comorbid disease. Gene expression changes in peripheral blood cells drive the derivation of both the murine and human S. aureus classifiers. It is conceivable these gene expression changes are reflective of transcript abundance driven by myeloid cell lineage expansions and are not pathogen or infection specific. However, previously published data and work presented here suggest this is not the case. For example, Ardura et al. found no differences in the absolute numbers of total B and T cells in patients with S. aureus infection compared to healthy controls [18]. Yet the abundance of lymphocyte-specific transcripts was significantly reduced. In contrast, expansion of the myeloid lineage was associated with high levels of expression among genes associated with neutrophil function. A similar independence between lymphocyte counts and differential gene expression within this lineage was observed in an independent pediatric sepsis cohort [70]. In another example, transcript abundance due to cell lineage expansions was not the primary factor in the development of a tuberculosis-specific gene expression signature [71]. Rather, it is changes in cellular composition and altered gene expression that drive such signatures. The data presented here also indicates that the S. aureus classifiers are not being driven by lineage-specific transcript abundance. Specifically, the total leukocyte count and cell lineage distribution (based on routine automated differential measurements) were not different between patients with S. aureus infection and E. coli infection (15.7×109/L with 86.2% neutrophils vs. 14.1×109/L with 85.8% neutrophils, respectively). However, the human S. aureus classifier was still able to differentiate infection due to the two pathogens. The murine S. aureus classifier was highly successful in differentiating S. aureus infection from healthy and from E. coli infection yet was unable to differentiate E. coli from healthy. This result would not be expected if transcript abundance was driving the derivation of the classifier. The overlap we observed in the gene expression response to S. aureus infection in mouse and human was also consistent with published studies. NF-κB signaling pathways have been identified as a critical component of the murine response to infection [72], which was mirrored in the murine and human data presented here. Similar gene expression-based analyses of the human response to bacterial infection have also revealed the importance of other biological pathways including MHC class I and II antigen presentation, immunological synapse formation, TGF-β receptor signaling, TGF and WNT-dependent cytoskeleton remodeling, and T-cell receptor signaling [10], [18], [73], all of which were significantly associated with S. aureus infection in this study. Hence, mice and humans utilize many of the same or overlapping pathways in response to bacterial sepsis supporting the potential utility of murine-based diagnostics for human disease. S. aureus continues to evolve as a pathogen and leads to a disproportionate burden of sepsis morbidity and mortality. This study takes significant steps forward on multiple levels in the ongoing effort to understand this pathogen; the host response to it; and identify new diagnostic and therapeutic avenues. We describe a potential diagnostic modality capable of differentiating infection from health across species. More importantly, host gene expression classifiers can differentiate infection due to S. aureus from that of E. coli but this effect is less pronounced in the complex human host. The approach described here also affords great insight into the conserved and disparate pathways utilized by mice and humans in response to these infections. Not only have we provided evidence to support the paradigm shift in how we think about diagnostics, but we have also identified new areas for research into the pathways that subserve sepsis pathophysiology. Bacterial challenge experiments. (A) Survival curves for A/J and C57BL/6J mice following an intra-peritoneal infection with S. aureus (1×107 CFU/g) or E. coli (6×104 CFU/g). Principal Components Analysis plots of the samples in the dataset. Samples are colored by infection status and pathogen. (B) S. aureus infection by time after inoculation (n = 10 animals/time point). (C) E. coli infection by time after inoculation (n = 10 animals/time point). (D) PCA differentiated by pathogen. (DOC) Click here for additional data file. Heat maps of genes contributing to the murine classifier. (A) Genes within the top five factors contributing to the murine S. aureus classifier were identified and ranked by p-value after Bonferroni correction. A subset of genes (393 after removing duplicates) is depicted here, stratified by pathogen. (B) The same genes depicted in part (A) are categorized first pathogen and then by time since infection. (DOC) Click here for additional data file. Venn diagram demonstrating the number of overlapping probes in each murine experimental group pairwise comparison. Probes were included that had significantly different levels of expression after Bonferroni correction. (DOC) Click here for additional data file. Sixteen murine factors independently associated with infection projected onto healthy controls (left panel, black circles), animals with infection (middle panel, blue triangles), and animals with infection (right panel, red “x”). The y-axis represents the factor score. (DOC) Click here for additional data file. A factor-based classifier distinguishes MRSA from MSSA infection in mice. An ROC curve is shown for this classification. (DOC) Click here for additional data file. Venn diagram demonstrating the number of overlapping probes in each human experimental group pairwise comparison. Probes were included that had significantly different levels of expression after Bonferroni correction. No probes met this cutoff for the S. aureus vs. E. coli comparison. (DOC) Click here for additional data file. Seventeen human factors independently associated with BSI projected onto healthy controls (left panel, black circles), subjects with BSI (middle panel, blue triangles), and subjects with BSI (right panel, red “x”). The y-axis represents the factor score. (DOC) Click here for additional data file. Heat map of genes contributing to the human classifier. Genes within the top two factors contributing to the human S. aureus classifier were identified and ranked by p-value after Bonferroni correction. A subset of genes (86 after removing duplicates) is depicted here, stratified by pathogen. (DOC) Click here for additional data file. Probes, ranked by p-value, that were differentially expressed (after Bonferroni correction) in mice with infection vs. Healthy controls; vs. infection; and vs. Healthy controls. Also presented is the average probe expression in each comparator group and the fold-change within the pairwise comparison. (XLSX) Click here for additional data file. Probes and corresponding genes that were differentially expressed (after Bonferroni correction) in mice with MRSA vs. MSSA infection. (DOC) Click here for additional data file. Probes, ranked by p-value, that were differentially expressed (after Bonferroni correction) in humans with infection vs. Healthy controls; vs. infection; and vs. Healthy controls. Also presented is the average probe expression in each comparator group and the fold-change within the pairwise comparison. (XLSX) Click here for additional data file. Pathway analysis for the genes from pairwise comparisons in the mouse and human study. Top 50 ranked pathways from GeneGo MetaCore pathway analysis based upon p-value. Shaded text corresponds to pathways that are present in both the mouse and human response to the specified pathogen. (XLSX) Click here for additional data file. Genes in pathways common to murine and human responses to infection. Human genes are in the shaded cells. Murine genes are in the unshaded cells. (XLSX) Click here for additional data file. A detailed description of microarray preparation. (DOC) Click here for additional data file.
  68 in total

1.  Stereotyped and specific gene expression programs in human innate immune responses to bacteria.

Authors:  Jennifer C Boldrick; Ash A Alizadeh; Maximilian Diehn; Sandrine Dudoit; Chih Long Liu; Christopher E Belcher; David Botstein; Louis M Staudt; Patrick O Brown; David A Relman
Journal:  Proc Natl Acad Sci U S A       Date:  2002-01-22       Impact factor: 11.205

2.  A bayesian analysis strategy for cross-study translation of gene expression biomarkers.

Authors:  Joseph Lucas; Carlos Carvalho; Mike West
Journal:  Stat Appl Genet Mol Biol       Date:  2009-02-04

3.  Inadequate antimicrobial treatment of infections: a risk factor for hospital mortality among critically ill patients.

Authors:  M H Kollef; G Sherman; S Ward; V J Fraser
Journal:  Chest       Date:  1999-02       Impact factor: 9.410

Review 4.  Why do animal models (sometimes) fail to mimic human sepsis?

Authors:  Charles T Esmon
Journal:  Crit Care Med       Date:  2004-05       Impact factor: 7.598

5.  A modular analysis framework for blood genomics studies: application to systemic lupus erythematosus.

Authors:  Damien Chaussabel; Charles Quinn; Jing Shen; Pinakeen Patel; Casey Glaser; Nicole Baldwin; Dorothee Stichweh; Derek Blankenship; Lei Li; Indira Munagala; Lynda Bennett; Florence Allantaz; Asuncion Mejias; Monica Ardura; Ellen Kaizer; Laurence Monnet; Windy Allman; Henry Randall; Diane Johnson; Aimee Lanier; Marilynn Punaro; Knut M Wittkowski; Perrin White; Joseph Fay; Goran Klintmalm; Octavio Ramilo; A Karolina Palucka; Jacques Banchereau; Virginia Pascual
Journal:  Immunity       Date:  2008-07-18       Impact factor: 31.745

6.  Gene-expression profiling of gram-positive and gram-negative sepsis in critically ill patients.

Authors:  Benjamin M P Tang; Anthony S McLean; Ian W Dawes; Stephen J Huang; Mark J Cowley; Ruby C Y Lin
Journal:  Crit Care Med       Date:  2008-04       Impact factor: 7.598

7.  Immunological mechanisms underlying the genetic predisposition to severe Staphylococcus aureus infection in the mouse model.

Authors:  Maren von Köckritz-Blickwede; Manfred Rohde; Sonja Oehmcke; Lloyd S Miller; Ambrose L Cheung; Heiko Herwald; Simon Foster; Eva Medina
Journal:  Am J Pathol       Date:  2008-10-30       Impact factor: 4.307

8.  Sepsis-induced human lymphocyte apoptosis and cytokine production in "humanized" mice.

Authors:  Jacqueline Unsinger; Jacquelyn S McDonough; Leonard D Shultz; Thomas A Ferguson; Richard S Hotchkiss
Journal:  J Leukoc Biol       Date:  2009-04-15       Impact factor: 4.962

9.  An interferon-inducible neutrophil-driven blood transcriptional signature in human tuberculosis.

Authors:  Matthew P R Berry; Christine M Graham; Finlay W McNab; Zhaohui Xu; Susannah A A Bloch; Tolu Oni; Katalin A Wilkinson; Romain Banchereau; Jason Skinner; Robert J Wilkinson; Charles Quinn; Derek Blankenship; Ranju Dhawan; John J Cush; Asuncion Mejias; Octavio Ramilo; Onn M Kon; Virginia Pascual; Jacques Banchereau; Damien Chaussabel; Anne O'Garra
Journal:  Nature       Date:  2010-08-19       Impact factor: 49.962

10.  Gene expression signatures that predict radiation exposure in mice and humans.

Authors:  Holly K Dressman; Garrett G Muramoto; Nelson J Chao; Sarah Meadows; Dawn Marshall; Geoffrey S Ginsburg; Joseph R Nevins; John P Chute
Journal:  PLoS Med       Date:  2007-04       Impact factor: 11.069

View more
  30 in total

1.  A comprehensive time-course-based multicohort analysis of sepsis and sterile inflammation reveals a robust diagnostic gene set.

Authors:  Timothy E Sweeney; Aaditya Shidham; Hector R Wong; Purvesh Khatri
Journal:  Sci Transl Med       Date:  2015-05-13       Impact factor: 17.956

Review 2.  Emerging and resistant infections.

Authors:  Cornelius J Clancy; Andre C Kalil; Vance G Fowler; Elodie Ghedin; Jay K Kolls; M Hong Nguyen
Journal:  Ann Am Thorac Soc       Date:  2014-08

Review 3.  Infective endocarditis.

Authors:  Thomas L Holland; Larry M Baddour; Arnold S Bayer; Bruno Hoen; Jose M Miro; Vance G Fowler
Journal:  Nat Rev Dis Primers       Date:  2016-09-01       Impact factor: 52.329

4.  Unsupervised Analysis of Transcriptomics in Bacterial Sepsis Across Multiple Datasets Reveals Three Robust Clusters.

Authors:  Timothy E Sweeney; Tej D Azad; Michele Donato; Winston A Haynes; Thanneer M Perumal; Ricardo Henao; Jesús F Bermejo-Martin; Raquel Almansa; Eduardo Tamayo; Judith A Howrylak; Augustine Choi; Grant P Parnell; Benjamin Tang; Marshall Nichols; Christopher W Woods; Geoffrey S Ginsburg; Stephen F Kingsmore; Larsson Omberg; Lara M Mangravite; Hector R Wong; Ephraim L Tsalik; Raymond J Langley; Purvesh Khatri
Journal:  Crit Care Med       Date:  2018-06       Impact factor: 7.598

5.  Mortality Prediction in Sepsis With an Immune-Related Transcriptomics Signature: A Multi-Cohort Analysis.

Authors:  Louis Kreitmann; Maxime Bodinier; Aurore Fleurie; Katia Imhoff; Marie-Angelique Cazalis; Estelle Peronnet; Elisabeth Cerrato; Claire Tardiveau; Filippo Conti; Jean-François Llitjos; Julien Textoris; Guillaume Monneret; Sophie Blein; Karen Brengel-Pesce
Journal:  Front Med (Lausanne)       Date:  2022-06-30

6.  A host-based RT-PCR gene expression signature to identify acute respiratory viral infection.

Authors:  Aimee K Zaas; Thomas Burke; Minhua Chen; Micah McClain; Bradly Nicholson; Timothy Veldman; Ephraim L Tsalik; Vance Fowler; Emanuel P Rivers; Ronny Otero; Stephen F Kingsmore; Deepak Voora; Joseph Lucas; Alfred O Hero; Lawrence Carin; Christopher W Woods; Geoffrey S Ginsburg
Journal:  Sci Transl Med       Date:  2013-09-18       Impact factor: 17.956

7.  Host gene expression classifiers diagnose acute respiratory illness etiology.

Authors:  Ephraim L Tsalik; Ricardo Henao; Marshall Nichols; Thomas Burke; Emily R Ko; Micah T McClain; Lori L Hudson; Anna Mazur; Debra H Freeman; Tim Veldman; Raymond J Langley; Eugenia B Quackenbush; Seth W Glickman; Charles B Cairns; Anja K Jaehne; Emanuel P Rivers; Ronny M Otero; Aimee K Zaas; Stephen F Kingsmore; Joseph Lucas; Vance G Fowler; Lawrence Carin; Geoffrey S Ginsburg; Christopher W Woods
Journal:  Sci Transl Med       Date:  2016-01-20       Impact factor: 17.956

8.  Benchmarking Sepsis Gene Expression Diagnostics Using Public Data.

Authors:  Timothy E Sweeney; Purvesh Khatri
Journal:  Crit Care Med       Date:  2017-01       Impact factor: 7.598

9.  Robust classification of bacterial and viral infections via integrated host gene expression diagnostics.

Authors:  Timothy E Sweeney; Hector R Wong; Purvesh Khatri
Journal:  Sci Transl Med       Date:  2016-07-06       Impact factor: 17.956

10.  Rapid, Sample-to-Answer Host Gene Expression Test to Diagnose Viral Infection.

Authors:  Ephraim L Tsalik; Ayeaye Khine; Abdossamad Talebpour; Alaleh Samiei; Vilcy Parmar; Thomas W Burke; Micah T Mcclain; Geoffrey S Ginsburg; Christopher W Woods; Ricardo Henao; Tino Alavie
Journal:  Open Forum Infect Dis       Date:  2019-10-30       Impact factor: 3.835

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.