| Literature DB >> 29342870 |
Gabriela Turk1, Yanina Ghiglione2, Macarena Hormanstorfer3, Natalia Laufer4,5, Romina Coloccini6, Jimena Salido7, César Trifone8, María Julia Ruiz9, Juliana Falivene10, María Pía Holgado11, María Paula Caruso12, María Inés Figueroa13,14, Horacio Salomón15, Luis D Giavedoni16, María de Los Ángeles Pando17, María Magdalena Gherardi18, Roberto Daniel Rabinovich19, Pedro A Pury20, Omar Sued21.
Abstract
Progression of HIV infection is variable among individuals, and definition disease progression biomarkers is still needed. Here, we aimed to categorize the predictive potential of several variables using feature selection methods and decision trees. A total of seventy-five treatment-naïve subjects were enrolled during acute/early HIV infection. CD4⁺ T-cell counts (CD4TC) and viral load (VL) levels were determined at enrollment and for one year. Immune activation, HIV-specific immune response, Human Leukocyte Antigen (HLA) and C-C chemokine receptor type 5 (CCR5) genotypes, and plasma levels of 39 cytokines were determined. Data were analyzed by machine learning and non-parametric methods. Variable hierarchization was performed by Weka correlation-based feature selection and J48 decision tree. Plasma interleukin (IL)-10, interferon gamma-induced protein (IP)-10, soluble IL-2 receptor alpha (sIL-2Rα) and tumor necrosis factor alpha (TNF-α) levels correlated directly with baseline VL, whereas IL-2, TNF-α, fibroblast growth factor (FGF)-2 and macrophage inflammatory protein (MIP)-1β correlated directly with CD4⁺ T-cell activation (p < 0.05). However, none of these cytokines had good predictive values to distinguish "progressors" from "non-progressors". Similarly, immune activation, HIV-specific immune responses and HLA/CCR5 genotypes had low discrimination power. Baseline CD4TC was the most potent discerning variable with a cut-off of 438 cells/μL (accuracy = 0.93, κ-Cohen = 0.85). Limited discerning power of the other factors might be related to frequency, variability and/or sampling time. Future studies based on decision trees to identify biomarkers of post-treatment control are warrantied.Entities:
Keywords: HIV; HLA; acute infection; biomarkers; decision trees; disease progression; immune responses; soluble plasma factors
Mesh:
Substances:
Year: 2018 PMID: 29342870 PMCID: PMC5795447 DOI: 10.3390/v10010034
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1(A) Definition of the three classes (C1, C2 and C3) used in this study to segregate subjects into “progressors” and “non-progressors” or in relation to their ability to control viral replication. These classes were used to construct decision trees. (B) The whole database was subdivided into three self-including parts, i.e., the small database (N = 27), the intermediate base (N = 48) and the entire base (N = 75). The dataset used in each analysis is indicated in the text. CD4+ and CD8+ T-cell counts and viral load (VL) were determined in all subjects. Additionally, Human Leukocyte Antigen (HLA) and C-C chemokine receptor type 5 (CCR5) genotyping (and the corresponding genetic scores (GS, Appendix A)), HIV-specific immune responses and immune activation were determined in a subset of 48 subjects. Finally, 40 plasma soluble factors (39 cytokines and chemokines plus lipopolysaccharide (LPS) were quantified in a smaller group (N = 27). The values obtained for each soluble factor were used individually but also additive scores were constructed (CS, cytokine scores, Appendix A variables plasma soluble factor (PSF)41 to PSF52). IA: Immune activation; IR: Immune response.
Characteristics of HIV+ subjects enrolled in this study.
| 75 | |||
| 1:3 | |||
| 30 (24–38) | |||
| 75 (54–113) | |||
| 61,045 (12,736–455,417) | 18,951 (4298–62,739) | 16,988 (5695–40,105) | |
| 4.6 ± 1 | 4 ± 1 | 4 ±0.89 | |
| 525 (361–698) | 571 (406–673) | 464 (387–585) | |
| 0.6 (0.32–0.83) | 0.55 (0.34–0.8) | 0.61 (0.39–0.93) | |
| −0.62 (−0.31 – −0.03) | |||
| 24.4 (16–36.2) | |||
| 4.6 (1.5–11.01) | |||
| 1.2 (0.47–2.8) | |||
| 45.2 (21.3–57.1) | |||
| 26.5 (15.2–41) | |||
| 13.8 (6.7–30.6) | |||
a Versant HIV-1 RNA 3.0 assay, Siemens. Lower and upper detection limits are 50 and 500,000 RNA copies/mL, respectively (1.7log10 and 5.7log10); b Flow cytometry double platform, FACSCanto, BD Biosciences; c Immune activation was only evaluated at baseline samples by flow cytometry. IQR25–75: Interquartile range 25–75%. VL: Viral Load. pi: postinfection. SD: Standard deviation. IQR: interquartile range; HLA-DR: Human leukocyte antigen - antigen D Related.
Figure 2CD4+ T-cell counts, plasma viral load (VL), and immune activation of enrolled subjects. All samples were obtained as long as the subjects remained treatment naïve. Longitudinal determination of CD4+ T-cell counts (A); and plasma viral load (B) are shown (Baseline = enrollment sample, 6 and 12 months postinfection). Dots represent data from individual subjects and lines join matched values for each subject. Boxes represent the interquartile 25–75% range (IQR25–75) and whiskers extend from 10th to 90th percentiles. Horizontal lines within boxes represent the median. Immune activation (C) was evaluated at baseline as the percentage of CD38+, HLA-DR+ or CD38+/HLA-DR+ CD4+ (left panel) or CD8+ (right panel) T-cells. Dots represent data from individual subjects. Median and IQR25-75 are shown in red. In A and B, p-values were calculated using Wilcoxon test (baseline versus 6-month or 12-month samples). In C, p-values were calculated using Kruskal–Wallis test followed by Dunn’s post-test to compare preselected pairs of datasets. PHI: Primary HIV infection cohort. HD: Healthy donors. Asterisks denote p-values as follows: ** p < 0.01, *** p < 0.005, **** p <0.0001.
Figure 3Plasma levels of IL-1α, IL-10, IL-15, IP-10, MIP-1α, sIL-2Rα, and TNF-α in samples obtained at enrollment of recently infected HIV+ subjects (PHI, baseline samples) and healthy donors (HD). Dots represent data from individual subjects. Median and interquartile ranges (IQR25–75) are shown in red. p-values were calculated using Mann–Whitney test. Asterisks denote p-values as follows: * p < 0.05, ** p < 0.01, *** p < 0.005, **** p < 0.0001. After false discovery rate (FDR) adjustment, all but macrophage inflammatory protein (MIP)-1α remained significantly different.
Figure 4(A) Correlations between plasma level of denoted cytokines and chemokines (evaluated at baseline samples) versus percentages of CD4+ T-cells evaluated at the denoted time-points. (B) Correlations between the plasma level of denoted cytokines and chemokines (evaluated at baseline samples) versus daily CD4+ T-cell count decay rate (CD4 slope, cells/μL/day). Dots represent data from individual subjects. In the inset, r (upper line) and p (lower line) values correspond to Spearman’s test. After correction for multiple comparisons (FDR procedure) was applied, none of these correlations remained statistically significant.
Figure 5Correlations between plasma level of denoted cytokines and chemokines (evaluated at baseline samples) versus: plasma viral load (VL) (A); and baseline immune activation (B) (percentages of CD38+/HLA-DR+ CD4+ (upper panels) and CD8+ (lower panels) T-cells). Dots represent data from individual subjects. In the inset, r (upper line) and p (lower line) values correspond to Spearman’s test. After correction for multiple comparisons (FDR procedure) was applied, only those correlations shown in red remained statistically significant.
Figure 6(A) Variable hierarchization using correlation based feature selection between classes C1 and C2 with all other individual variables as well as cytokine-based (CS) and genetic scores (GS), along with the three databases (small (N = 27), medium (N = 48), and large (N = 75)). Only strong correlations (>0.6) are shown. No correlation was found between the variables studied and C3. Decision trees were constructed to discriminate “progressors” from “non-progressors” as defined by: C3 (B); and C1 (C). To build the tree shown in (B), the clinical variables were not included in the analysis. (D) Baseline CD4+ T-cell counts of the individuals enrolled segregated as “progressors” (yes, red dots) and “non-progressors” (no, black dots) according to C1. The cut-off value as defined by the tree shown in C is depicted by the vertical dashed line. CST4: cytokine score CST4 (see Appendix A). GS8: genetic score 8 (see Appendix A). The number of instances considered in D and C was the result of the elimination of cases with missing class value.
List and description of variables evaluated in this study. In addition to individual variables, scores were constructed to compile host genetic (variables G1 to G10) and plasma soluble factor (PSF41 to PSF52) data for each subject.
| Variable ID | Variable Name | Variable Description |
|---|---|---|
| C1 | Baseline CD4 | Absolute CD4+ T-cell count evaluated at enrollment |
| C2 | Baseline %CD4 | Percentage of CD4+ T-cells evaluated at enrollment |
| C3 | Baseline CD4/CD8 | CD4/CD8 ratio evaluated at enrollment |
| C4 | 6mo CD4 | Absolute CD4+ T-cell count evaluated at 6 months postinfection |
| C5 | 6mo %CD4 | Percentage of CD4+ T-cells evaluated at 6 months postinfection |
| C6 | 6mo CD4/CD8 | CD4/CD8 ratio evaluated at 6 months postinfection |
| C7 | 12mo CD4 | Absolute CD4+ T-cell count evaluated at 12 months postinfection |
| C8 | 12mo %CD4 | Percentage of CD4+ T-cells evaluated at 12 months postinfection |
| C9 | 12mo CD4/CD8 | CD4/CD8 ratio evaluated at 12 months postinfection |
| C10 | CD4 Slope | Rate of CD4+ T-cell decay over the first year postinfection (cells/µL/day) |
| C11 | Baseline VL | Plasma viral load evaluated at enrollment (RNA copies/mL) |
| C12 | Baseline log10VL | Log10 plasma viral load evaluated at enrollment |
| C13 | 6mo VL | Plasma viral load evaluated at 6 months postinfection (RNA copies/mL) |
| C14 | 6mo log10VL | Log10 plasma viral load evaluated at 6 months postinfection |
| C15 | 12mo VL | Plasma viral load evaluated at 12 months postinfection (RNA copies/mL) |
| C16 | 12mo log10VL | Log10 plasma viral load evaluated at 12 months postinfection |
| IA1 | CD4CD38 | Percentage of CD4+ CD38+ T-cells evaluated at enrollment |
| IA2 | CD4HLADR | Percentage of CD4+ HLA-DR+ T-cells evaluated at enrollment |
| IA3 | CD4double | Percentage of CD4+ CD38+ HLA-DR+ T-cells evaluated at enrollment |
| IA4 | CD8CD38 | Percentage of CD8+ CD38+ T-cells evaluated at enrollment |
| IA5 | CD8HLADR | Percentage of CD8+ HLA-DR+ T-cells evaluated at enrollment |
| IA6 | CD8double | Percentage of CD8+ CD38+ HLA-DR+ T-cells evaluated at enrollment |
| IR1 | %Nef response | Percentage of T-cell response directed to Nef (over total HIV-specific response) evaluated at enrollment by ELISPOT |
| IR2 | Absolute Nef response | Magnitude of Nef-specific response evaluated at enrollment by ELISPOT (SFU/million PBMC) |
| IR3 | %Gag response | Percentage of T-cell response directed to Gag (over total HIV-specific response) evaluated at enrollment by ELISPOT |
| IR4 | Absolute Gag response | Magnitude of Gag-specific response evaluated at enrollment by ELISPOT (SFU/million PBMC) |
| G1 | GS1 | Additive genetic score constructed based on the presence or absence of certain HLA alleles, as described previously [ |
| G2 | GS3 | Additive genetic score constructed based on the presence or absence of certain HLA alleles and CCR5 genotypes, as described previously [ |
| G3 | GS5 | Additive genetic score constructed based on adding +1 when a protective allele was present and subtracting −1 when a risk allele was present. Protective and risk alleles were determined based on the odd ratio obtained for each allele in our cohort with a |
| G4 | GS6 | Idem to GS5 but the cut-off level for significance was established at |
| G5 | GS7 | Constructed by multiplying the odds ratio corresponding to the 6-month CD4+ T-cell count for each of the 6 HLA alleles (A, B and C) of each individual. |
| G6 | GS8 | Constructed by multiplying the odds ratio corresponding to the 6-month VL for each of the 6 HLA alleles (A, B and C) of each individual. |
| G7 | GS9 | Constructed by multiplying the odds ratio corresponding to the 6-month CD4+ T-cell count for each of the 2 CCR5 haplotypes of each individual. |
| G8 | GS10 | Constructed by multiplying the odds ratio corresponding to the 6-month VL for each of the 2 CCR5 haplotypes of each individual. |
| G9 | GS11 | Constructed by multiplying the odds ratio corresponding to the 6-month CD4+ T-cell count for each of the 6 HLA alleles (A, B and C) and the 2 CCR5 haplotypes of each individual. |
| G10 | GS12 | Constructed by multiplying the odds ratio corresponding to the 6-month VL for each of the 6 HLA alleles (A, B and C) and the 2 CCR5 haplotypes of each individual. |
| PSF1 | EGF | Plasma level of EGF (Endothelial growth factor) evaluated at enrollment by Luminex (pg/mL) |
| PSF2 | Eotaxin | Plasma level of Eotaxin evaluated at enrollment by Luminex (pg/mL) |
| PSF3 | FGF2 | Plasma level of FGF-2 (fibroblast growth factor 2) evaluated at enrollment by Luminex (pg/mL) |
| PSF4 | Flt3Ligand | Plasma level of Flt-3 (Fms-like tyrosine kinase 3) ligand evaluated at enrollment by Luminex (pg/mL) |
| PSF5 | Fractalkine | Plasma level of Fractalkine evaluated at enrollment by Luminex (pg/mL) |
| PSF6 | GCSF | Plasma level of G-CSF (granulocyte colony-stimulating factor) evaluated at enrollment by Luminex (pg/mL) |
| PSF7 | GMCSF | Plasma level of GM-CSF (granulocyte monocyte colony-stimulating factor) evaluated at enrollment by Luminex (pg/mL) |
| PSF8 | GRO | Plasma level of GRO evaluated at enrollment by Luminex (pg/mL) |
| PSF9 | IFN-α2 | Plasma level of IFN-α2 (interferon alpha 2) evaluated at enrollment by Luminex (pg/mL) |
| PSF10 | IFN-γ | Plasma level of IFN-γ (interferon gamma) evaluated at enrollment by Luminex (pg/mL) |
| PSF11 | IL1α | Plasma level of IL-1α (interleukin 1 alpha) evaluated at enrollment by Luminex (pg/mL) |
| PSF12 | IL1β | Plasma level of IL-1β (interleukin 1 beta) evaluated at enrollment by Luminex (pg/mL) |
| PSF13 | IL1ra | Plasma level of IL-1ra (interleukin 1 receptor antagonist) evaluated at enrollment by Luminex (pg/mL) |
| PSF14 | IL2 | Plasma level of IL-2 (interleukin 2) evaluated at enrollment by Luminex (pg/mL) |
| PSF15 | IL3 | Plasma level of IL-3 (interleukin 3) evaluated at enrollment by Luminex (pg/mL) |
| PSF16 | IL4 | Plasma level of IL-4 (interleukin 4) evaluated at enrollment by Luminex (pg/mL) |
| PSF17 | IL5 | Plasma level of IL-5 (interleukin 5) evaluated at enrollment by Luminex (pg/mL) |
| PSF18 | IL6 | Plasma level of IL-6 (interleukin 6) evaluated at enrollment by Luminex (pg/mL) |
| PSF19 | IL7 | Plasma level of IL-7 (interleukin 7) evaluated at enrollment by Luminex (pg/mL) |
| PSF20 | IL8 | Plasma level of IL-8 (interleukin 8) evaluated at enrollment by Luminex (pg/mL) |
| PSF21 | IL9 | Plasma level of IL-9 (interleukin 9) evaluated at enrollment by Luminex (pg/mL) |
| PSF22 | IL10 | Plasma level of IL-10 (interleukin 10) evaluated at enrollment by Luminex (pg/mL) |
| PSF23 | IL12p40 | Plasma level of IL-12p40 (interleukin 12 subunit p40) evaluated at enrollment by Luminex (pg/mL) |
| PSF24 | IL12p70 | Plasma level of IL-12p70 (interleukin 12) evaluated at enrollment by Luminex (pg/mL) |
| PSF25 | IL13 | Plasma level of IL-13 (interleukin 13) evaluated at enrollment by Luminex (pg/mL) |
| PSF26 | IL15 | Plasma level of IL-15 (interleukin 15) evaluated at enrollment by Luminex (pg/mL) |
| PSF27 | IL17 | Plasma level of IL-17 (interleukin 17) evaluated at enrollment by Luminex (pg/mL) |
| PSF28 | IP10 | Plasma level of IP10 (interferon gamma-induced protein 10, CXCL10) evaluated at enrollment by Luminex (pg/mL) |
| PSF29 | MCP1 | Plasma level of MCP1 (monocyte chemoattractant protein 1) evaluated at enrollment by Luminex (pg/mL) |
| PSF30 | MCP3 | Plasma level of MCP3 (monocyte chemoattractant protein 3) evaluated at enrollment by Luminex (pg/mL) |
| PSF31 | MDC | Plasma level of MDC (macrophage derived chemokine, CCL22) evaluated at enrollment by Luminex (pg/mL) |
| PSF32 | MIP1α | Plasma level of MIP-1α (macrophage inflammatory protein 1 alpha) evaluated at enrollment by Luminex (pg/mL) |
| PSF33 | MIP1β | Plasma level of MIP-1β (macrophage inflammatory protein 1 beta) evaluated at enrollment by Luminex (pg/mL) |
| PSF34 | sCD40L | Plasma level of sCD40L (soluble CD40 ligand) evaluated at enrollment by Luminex (pg/mL) |
| PSF35 | sIL2Rα | Plasma level of sIL-2Rα (soluble interleukin 2 receptor alpha) evaluated at enrollment by Luminex (pg/mL) |
| PSF36 | TGFα | Plasma level of TGF-α (tumor growth factor alpha) evaluated at enrollment by Luminex (pg/mL) |
| PSF37 | TNFα | Plasma level of TNF-α (tumor necrosis factor alpha) evaluated at enrollment by Luminex (pg/mL) |
| PSF38 | TNFβ | Plasma level of TNF-β (tumor necrosis factor beta) evaluated at enrollment by Luminex (pg/mL) |
| PSF39 | VEGF | Plasma level of VEGF (vascular endothelial growth factor) evaluated at enrollment by Luminex (pg/mL) |
| PSF40 | LPS | Plasma level of LPS (lipopolysaccharide) evaluated at enrollment by Lal assay (EU/mL) |
| PSF41 | CSVL1 | Cytokines that significantly correlated with baseline VL were considered (sIL-2Rα, TNF-α, IP-10 and IL-10) to construct an additive score based on 1s and −1s. If the cytokine value of the subject was above 75% IQR corresponding to the group of healthy donors (HD), then this cytokine was assigned a value of 1. If the value was below 25% IQR corresponding to HD, it was assigned a value of −1. If the value was within the range IQR25–75% corresponding to HD, it was assigned a value of 0. |
| PSF42 | CSCD41 | Cytokines that significantly correlated with baseline CD4+ T-cell counts were considered (sIL-2Rα, IP-10 and G-CSF) to construct an additive score based on 1s and −1s. If the cytokine value of the subject was above 75% IQR corresponding to the group of healthy donors (HD), then this cytokine was assigned a value of −1. If the value was below 25% IQR corresponding to HD, it was assigned a value of 1. If the value was within the range IQR25–75% corresponding to HD, it was assigned a value of 0. |
| PSF43 | CST1 | Score defined as the arithmetic sum of CSVL1 + CSCD41. |
| PSF44 | CSVL2 | Cytokines that significantly correlated with baseline VL were considered (sIL-2Rα, TNF-α, IP-10 and IL-10) to construct an additive score based on adding the corresponding Spearman’s R values. If the cytokine value of the subject was above 75% IQR corresponding to the group of healthy donors (HD), then the Spearman´s R value corresponding to that cytokine was added to the score. If the value was below 25% IQR corresponding to HD, the Spearman´s R value corresponding to that cytokine was subtracted from the score. If the value was within the range IQR25–75% corresponding to HD, it was assigned a value of 0. |
| PSF45 | CSCD42 | Cytokines that significantly correlated with baseline CD4+ T-cell counts were considered (sIL-2Rα, IP-10 and G-CSF) to construct an additive score based on adding the corresponding Spearman’s R values. If the cytokine value of the subject was above 75% IQR corresponding to the group of healthy donors (HD), then the Spearman´s R value corresponding to that cytokine was subtracted from the score. If the value was below 25% IQR corresponding to HD, the Spearman´s R value corresponding to that cytokine was added to the score. If the value was within the range IQR25–75% corresponding to HD, it was assigned a value of 0. |
| PSF46 | CST2 | Score defined as the arithmetic sum of CSVL2 + CSCD42 |
| PSF47 | CSVL3 | Cytokines that significantly correlated with baseline VL were considered (sIL-2Rα, TNF-α, IP-10 and IL-10) to construct an additive score based on normalizing the value of each of these cytokines over the cytokine median of the PHI group and multiplying this adjusted value by the corresponding Spearman´s R. |
| PSF48 | CSCD43 | Cytokines that significantly correlated with baseline CD4+ T-cell counts were considered (sIL-2Rα, IP-10 and G-CSF) to construct an additive score based on normalizing the value of each of these cytokines over the cytokine median of the PHI group and multiplying this adjusted value by the corresponding Spearman´s R. |
| PSF49 | CST3 | Score defined as the arithmetic sum of CSVL3 + CSCD43. |
| PSF50 | CSVL4 | Cytokines that significantly correlated with baseline VL were considered (sIL-2Rα, TNF-α, IP-10 and IL-10) to construct an additive score based on multiplying the log10 value of each of these cytokines by the corresponding Spearman´s R. |
| PSF51 | CSCD44 | Cytokines that significantly correlated with baseline CD4+ T-cell counts were considered (sIL-2Rα, IP-10 and G-CSF) to construct an additive score based on normalizing the log10 value of each of these cytokines by the corresponding Spearman´s R |
| PSF52 | CST4 | It was defined as the arithmetic sum of CSVL4 + CSCD44 |