| Literature DB >> 35847809 |
Louis Kreitmann1,2, Maxime Bodinier1,2, Aurore Fleurie1,2, Katia Imhoff3, Marie-Angelique Cazalis1,2, Estelle Peronnet1,2, Elisabeth Cerrato1,2, Claire Tardiveau1,2, Filippo Conti1,4, Jean-François Llitjos1,2,5, Julien Textoris6, Guillaume Monneret1,4, Sophie Blein3, Karen Brengel-Pesce1,2.
Abstract
Background: Novel biomarkers are needed to progress toward individualized patient care in sepsis. The immune profiling panel (IPP) prototype has been designed as a fully-automated multiplex tool measuring expression levels of 26 genes in sepsis patients to explore immune functions, determine sepsis endotypes and guide personalized clinical management. The performance of the IPP gene set to predict 30-day mortality has not been extensively characterized in heterogeneous cohorts of sepsis patients.Entities:
Keywords: biomarker discovery; gene expression analysis; mortality; predictive modeling; sepsis; transcriptomics
Year: 2022 PMID: 35847809 PMCID: PMC9280291 DOI: 10.3389/fmed.2022.930043
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Characteristics of the cohorts, patients and microarray data included in the study.
| Dataset accession | First author | Country | CA vs. HCA | Time points | Age | Sex | Arrays | Patients | Controls | Sepsis | Bacterial | Viral | Alive | Deceased | Chip | Normalization |
|
| Berdal | Norway | CA | d1 d6 d7 | 41.1 | 85.7 | 21 | 14 | 7 | 7 | 0 | 7 | 5 | 2 | Affymetrix | RMA |
|
| Dolinay | United States | CA | d1 | 57.1 | 54.2 | 103 | 103 | 55 | 48 | NA | NA | 86 | 17 | Illumina | Quantile |
|
| Lill | Estonia | CA | d1 | 46.1 | NA | 39 | 39 | 18 | 21 | 21 | 0 | 19 | 2 | Affymetrix | RMA |
|
| Wong | United States | CA | d1 | 3.7 | 63.1 | 276 | 276 | 77 | 199 | NA | NA | 248 | 28 | Affymetrix | gcRMA |
|
| Bermejo-Martin | Canada | CA | d1 | NA | NA | 15 | 15 | 4 | 11 | 0 | 11 | 7 | 4 | Illumina | Quantile |
|
| Parnell | Australia | CA | d1-d5 | 59.8 | 41.7 | 163 | 54 | 18 | 36 | 36 | 0 | 26 | 10 | Illumina | Quantile |
|
| Parnell | Australia | CA | d1-d7 | NA | NA | 55 | 22 | 18 | 4 | 0 | 4 | 22 | 0 | Illumina | Cubic spline |
|
| Parnell | Australia | CA | d1-d5 | NA | 45.5 | 129 | 42 | 31 | 11 | 3 | 11 | 42 | 0 | Illumina | Quantile |
|
| Cazalis | France | CA HCA | d1 d2 d3 | 62.7 | 67.9 | 107 | 53 | 25 | 28 | 28 | 0 | 22 | 6 | Affymetrix | RMA |
|
| Suarez | United States | CA | d1 | 62.1 | 41.5 | 158 | 158 | 40 | 118 | 47 | 96 | 158 | 0 | Illumina | Quantile |
|
| Scicluna | Netherland | CA HCA | d1 | 61 | 56.8 | 521 | 521 | 42 | 479 | NA | NA | 365 | 114 | Affymetrix | RMA + quantile |
|
| Tabone | France | CA | d1 d2 d3 | 62.1 | 64.7 | 124 | 71 | 20 | 51 | NA | NA | 56 | 17 | Illumina | Quantile |
|
| Almansa | Spain | CA | d1 | NA | 50 | 16 | 16 | 4 | 12 | 5 | 3 | 16 | 0 | Agilent | Normexp |
|
| Almansa | Spain | HCA | d1 | 69.2 | 67.1 | 155 | 155 | 73 | 82 | NA | NA | 138 | 17 | Agilent | Normexp |
|
| Burnham | United Kingdom | CA | d1 d3 d5 | 65.4 | 53 | 337 | 253 | 10 | 243 | NA | NA | 204 | 39 | Illumina | VSN |
|
| Planka | Thailand | CA HCA | d1 | 53.7 | 54.7 | 92 | 92 | 29 | 63 | 63 | 0 | 52 | 20 | Illumina | Quantile |
|
| Smith | United Kingdom | CA HCA | d1 | 0.25 | 56.8 | 88 | 88 | 44 | 44 | 37 | 5 | 84 | 4 | Illumina + Affymetrix | Spline |
|
| ||||||||||||||||
|
|
|
|
|
|
|
|
|
| ||||||||
*Normalization method was not specified in the original study but was verified graphically and assumed to follow the method specified in the table based on usual methods for the associated chip.
**Community- vs. healthcare associated sepsis cases: CA is for community-acquired and HCA for healthcare-associated infections.
Demographics and clinical characteristics in the discovery and validation sets computed with microarray data sampled at day 1 following study enrolment.
| Discovery set | Validation set | ||
| Age [mean (SD)] | 52.09 (26.35) | 49.79 (26.83) | 0.138 |
| Gender (n,%) | 0.142 | ||
| Female | 419 (41.6) | 189 (44.0) | |
| Male | 566 (56.2) | 225 (52.3) | |
| NA | 22 (2.2) | 16 (3.7) | |
| Infection setting (n,%) | 0.133 | ||
| Community-associated | 685 (68.0) | 313 (72.8) | |
| Healthcare-associated | 69 (6.9) | 30 (7.0) | |
| NA | 253 (25.1) | 87 (20.2) | |
| Microbiology (n,%) | 0.972 | ||
| Viral sepsis | 79 (7.8) | 33 (7.7) | |
| bacterial sepsis | 143 (14.2) | 63 (14.7) | |
| NA | 785 (78.0) | 334 (77.7) | |
| Ethnic background (n,%) | 0.976 | ||
| Asian | 46 (4.6) | 19 (4.4) | |
| Black | 17 (1.7) | 6 (1.4) | |
| Latino | 13 (1.3) | 7 (1.6) | |
| White | 52 (5.2) | 21 (4.9) | |
| NA | 879 (87.3) | 377 (87.7) | |
| Platform (n,%) | 0.708 | ||
| Affymetrix | 552 (54.8) | 226 (52.6) | |
| Agilent | 66 (6.6) | 28 (6.5) | |
| Illumina | 389 (38.6) | 176 (40.9) | |
| Survival (n,%) | 816 (81.0) | 349 (81.2) | > 0.999 |
NA indicates values missing in the original studies.
FIGURE 1Effect of ComBat co-normalization on patient-level gene expression data assessed by principal component analysis (PCA) across 17 microarray studies. We computed a 2-dimensional PCA plot of individual gene expression data from sepsis patients at day 1 following admission (7,122 genes assessed on 1,437 arrays sampled on 1,437 patients) before (left panel) and after (right panel) ComBat co-normalization using controls with the COCONUT R package. Each of the 17 studies maps to one color, showing how co-normalization attenuates the segregation of individual data points in clusters determined by the study to which they belong.
FIGURE 2Predictive performance of the IPP gene set on “day 1” discovery and validation sets. We trained machine learning models on the “day 1” discovery (n = 1,007) and validation (n = 430) data sets by 5 repeats of 10-fold cross-validation, and computed areas under the receiver operating characteristic (AUROC, right panel) and precision-recall curves (AUPRC, left panel) on the resampled discovery set (box plots) and by prediction on the validation set (gray diamonds). Gray dashed line on the AUPRC facet indicates baseline probability of the outcome (death).
FIGURE 3Comparison of predictive performances of the IPP gene set with that obtained with other genes on the “day 1” data set. We compared the predictive performance of the IPP gene set to that obtained with other genes common to the 17 microarray studies by computing ROC curves obtained by prediction on the validation set with IPP, “all genes” and “top 29 genes” models trained on GE data collected at day 1 following admission. Gray areas indicate 95% confidence intervals of corresponding AUROCs.
FIGURE 4Predictive performance of the IPP gene set on the “days > 2” data set. We assessed the predictive performance of the IPP and “all genes” set by computing ROC curves on the validation set.
FIGURE 5Prognostic enrichment with the IPP tool. We used the best IPP models (trained on the “day 1” and “days > 2” discovery sets) and computed a test threshold using the top-left method on corresponding validation sets. This enabled us to divide the validation sets in 2 sub-groups with a low and a high predicted risk of death. Then, we compared the actual proportion of sepsis patients deceased at day 30 in both sub-groups, to assess if IPP could be used for prognostic enrichment at the bedside.