| Literature DB >> 32818204 |
George A Robinson1,2, Junjie Peng1,2, Pierre Dönnes1,3, Leda Coelewij1,4, Meena Naja2, Anna Radziszewska2, Chris Wincup1, Hannah Peckham2, David A Isenberg1,2, Yiannis Ioannou2,5, Ines Pineda-Torra4, Coziana Ciurtin1,2, Elizabeth C Jury1.
Abstract
BACKGROUND: Juvenile-onset systemic lupus erythematosus (SLE) is a rare autoimmune rheumatic disease characterised by more severe disease manifestations, earlier damage accrual, and higher mortality than in adult-onset SLE. We aimed to use machine-learning approaches to characterise the immune cell profile of patients with juvenile-onset SLE and investigate links with the disease trajectory over time.Entities:
Year: 2020 PMID: 32818204 PMCID: PMC7425802 DOI: 10.1016/S2665-9913(20)30168-5
Source DB: PubMed Journal: Lancet Rheumatol ISSN: 2665-9913
Figure 1Study design and analysis plan flow diagram
BRF=balanced random forest. PBMCs=peripheral blood mononuclear cells. SLE=systemic lupus erythematosus. sPLS-DA=sparse partial least squares-discriminant analysis.
Demographic and clinical table of all patients and healthy controls
| Sex | |||
| Female | 22 (56%) | 54 (81%) | |
| Male | 17 (44%) | 13 (19%) | |
| Age, years | 18 (16–25) | 19 (13–25) | |
| Body-mass index, kg/m2 | 23·1 (19·9–24·5) | 22·41 (20·3–26·6) | |
| Ethnicity | |||
| White | 20 (51%) | 20 (30%) | |
| Asian | 10 (26%) | 24 (36%) | |
| Black | 2 (5%) | 17 (25%) | |
| Other or unknown | 7 (18%) | 6 (9%) | |
| Disease characteristics | |||
| Age at diagnosis | .. | 12·2 (6·4) | |
| Disease duration, years | .. | 7·1 (4·9) | |
| SLEDAI-2K | .. | 2·4 (0·0–4·0) | |
| SLEDAI-2K ≥4 | .. | 10 (15%) | |
| SLEDAI-2K <4 | .. | 57 (85%) | |
| Systemic Lupus International Collaborating Clinics | .. | 0·1 (0·4) | |
| Lupus low disease activity state | .. | 49 (73%) | |
| Current organ involvement | |||
| Neurological | .. | 12 (18%) | |
| Serositis | .. | 9 (13%) | |
| Cutaneous | .. | 57 (85%) | |
| Haematological | .. | 28 (42%) | |
| Musculoskeletal | .. | 55 (82%) | |
| Renal | .. | 21 (31%) | |
| Serology | |||
| Anti-dsDNA antibodies, IU/mL (normal range ≤50) | .. | 14 (2–154) | |
| Anti-dsDNA antibodies outside normal range | .. | 24 (36%) | |
| CRP, mg/L (normal range <5) | .. | 1·00 (0·60–2·55) | |
| CRP, outside normal range | .. | 8 (12%) | |
| Complement component C3, g/L (normal range 0·9–1·8) | .. | 1·02 (0·76–1·21) | |
| Complement component C3 outside normal range | .. | 24 (36%) | |
| Lymphocyte count, ×109 cells per L (normal range 1·3–3·5) | .. | 1·50 (1·28–2·06) | |
| Lymphocyte count outside normal range | .. | 34 (51%) | |
| Neutrophil count, ×109 cells per L (normal range 2·0–7·5) | .. | 3·01 (2·22–7·68) | |
| Neutrophil count outside normal range | .. | 19 (28%) | |
| Urine protein:creatinine ratio, mg/mmol (normal range 0–13) | .. | 8 (6–13) | |
| Urine protein:creatinine ratio outside normal range | .. | 16 (24%) | |
| Haemoglobin, g/L (normal range 115–155) | .. | 122 (112–134) | |
| Haemoglobin outside normal range | .. | 25 (37%) | |
| Platelet count, ×109 cells per L (normal range 150–400) | .. | 276 (211–337) | |
| Platelet count outside normal range | .. | 7 (10%) | |
| Antinuclear antibody positive | .. | 55 (82%) | |
| Extractable nuclear antigen-positive | .. | 42 (63%) | |
| Clinical lipids | |||
| Cholesterol, mmol/L (normal range <5) | .. | 4·0 (3·4–4·3) | |
| Triglycerides, mmol/L (normal range <3) | .. | 0·8 (0·6–1·15) | |
| HDL cholesterol, mmol/L (normal range >1) | .. | 1·5 (1·2–1·7) | |
| LDL cholesterol, mmol/L (normal range <3) | .. | 2·1 (1·6–2·4) | |
| Cholesterol:HDL cholesterol ratio (normal range <4) | .. | 2·75 (2·30–3·28) | |
| Current treatment | |||
| Hydroxychloroquine | .. | 62 (93%) | |
| Mycophenolate mofetil | .. | 27 (40%) | |
| Prednisolone | .. | 32 (48%) | |
| Vitamin D | .. | 13 (19%) | |
| Methotrexate | .. | 6 (9%) | |
| Azathioprine | .. | 15 (22%) | |
Data are n (%), median (IQR), or mean (SD). CRP=C-reactive protein. SLE=systemic lupus erythematosus. SLEDAI-2K=SLE disease activity index 2000.
Figure 2The immunological architecture is altered in juvenile-onset SLE
(A) Volcano plot displaying comparison between patients with juvenile-onset SLE and heath controls. Fold change versus log10 p values are displayed from unpaired t tests. The red line indicates adjusted p value following 5% false discovery rate adjustment for multiple comparisons. (B, C) Violin plots displaying antigen presenting cells (panel B) and T-cell subsets (panel C) that were significantly different between healthy controls and patients with juvenile-onset SLE by unpaired t test. The solid line indicates the mean and the dashed line indicates the SE. Adjusted p values are shown. (D) Correlation comparison analysis performed on immune phenotyping data described in panel A. The upper left of the heat map shows the correlation between immune cell types (28 immunological variables) in healthy controls. Spearman correlation coefficients for each pair of cell types are represented by colour. Asterisks indicate significant correlations, p<0·05. The bottom right of the heat map shows the correlation between immune cell types in patients with juvenile-onset SLE. Grey indicates that the Spearman correlation coefficient is not signficantly different from that of healthy controls. Significantly different correlations in patients with juvenile-onset SLE compared with healthy controls are coloured (p<0·05) and outlined in black (p<0·01). CM=central memory. EM=effector memory. EMRA=effector memory cells re-expressing CD45RA. HC=healthy controls. iNKT=invariant natural killer T cells. JSLE=juvenile-onset SLE. PDCs=plasmacytoid dendritic cells. SLE=systemic lupus erythematosus. Treg=regulatory T cells. Tresp=responder T cells.
Figure 3BRF analysis of immunophenotype data
(A) Building a predictive model using a BRF approach (appendix pp 5–6). (B) Comparison of 28 different immune cell subsets in healthy controls (n=39) versus patients with juvenile-onset SLE (n=67) using the BRF model. (C) ROC analysis for the BRF model. (D) The top ten variables contributing to the BRF model are shown. The mean decrease in Gini measures the importance of each variable to the model: a higher score indicates a higher importance of the variable. (E) ROC with AUC from univariate models showing the sensitivity and specificty of the top ten markers identified by the model. (F) ROC analysis without CD19 unswitched memory B cells (the most predictive parameter). (G) The top ten contributing variables in the BRF model trained on 27 immunological parameters (excluding CD19+ unswitched memory cells). AUC=area under the curve. BRF=balanced random forest. EM=effector memory. iNKT=invariant natural killer T cells. ROC=receiver operating characteristic. SLE=systemic lupus erythematosus.
Figure 4Top hits from BRF model validated with logistic regression analysis and sPLS-DA
(A) Odds ratios (error bars indicate 95% CIs) of 28 immunological parameters were computed with univaraite logistic regression analysis. iNKT and PDC data is shown seperately inset because of very different CI values. (B) sPLS-DA model optimisation using ten-fold cross-validation. (C) sPLS-DA plot to validate the top hits from the predictive model. Individual distribution points and confidence ellipses (ovals) are plotted for the healthy control and juvenile-onset SLE groups. (D) Using this analysis, the weighting of each cell type in component 1 and 2 is displayed (inner circle is the 0·5 cutoff). (E) Factor loading weights in component 1 for the top ten ranked immunological parameters. The bars indicate the class with maximal mean value. Variables excluded from the plot have no weight in component 1. BRF=balanced random forest. CM=central memory. EM=effector memory. EMRA=effector memory cells re-expressing CD45RA. iNKT=invariant natural killer T cells. PDCs=plasmacytoid dendritic cells. SLE=systemic lupus erythematosus. sPLS-DA=sparse partial least squares-discriminant analysis. Treg=regulatory T cells. Tresp=responder T cells.
Figure 5Patient clustering by top-weighted immunological parameters in patients with juvenile-onset SLE compared with healthy controls
(A) Top-weighted immunological parameters of patients with juvenile-onset SLE (appendix pp 11–12) were stratified using k-means clustering. Immunophenotype is standardised within each column by Z score and plotted as a heat map, representing the relationship to the mean of the group (red represents relatively high frequency and blue represents relatively low frequency). Each row represents a patient with juvenile-onset SLE. Four groups of patients were recognised with distinct immune cell profiles. (B) Scatter dot plots displaying top-weighted immunological parameters between the k-means clustered groups. Mean (error bars indicate SE) was calculated with one-way ANOVA, and p values were calculated with Tukey's multiple comparisons test. The dashed lines represent the mean for the healthy control population for each cell type. (C) sPLS-DA plot showing the clustering of the validated top-weighted immunological parameters in patients with juvenile-onset SLE between k-means clustered juvenile-onset SLE groups. Individual distribution points and confidence ellipses (ovals) are plotted for each group. (D) Using this analysis, the weighting of each cell type in component 1 and 2 is displayed, where the inner circle is the 0·5 cutoff. (E) Box and whisker plots displaying baseline measures over 3–7 years of follow-up of clinical measures of disease activity between the k-means clustered groups of patients with juvenile-onset SLE. (F) Average measure over 3–7 years of follow-up of clinical measures of disease activity between the k-means clustered groups of patients with juvenile-onset SLE. Mean (error bars indicate SE) was calculated with one-way ANOVA, and p values were calculated with Tukey's multiple comparisons test. Dashed lines represent the clinical cutoff for active disease in the C3 plot and the assigned cutoff associated with active lupus in the SLEDAI-2K plot. (G) Box and whisker plot displaying longitudinal disease activity, using the same dataset from panel F, assessed as LLDAS. Mean (error bars indicate SE) was calculated with one-way ANOVA, and p values were calculated with Tukey's multiple comparisons test. (H) Individual patient trajectory of SLEDAI-2K and C3 over 15 clinical encounters displayed as spaghetti plots. Each line represents one patient with juvenile-onset SLE. Smoothing lines were added to indicate the trend of juvenile-onset SLE groups from previous k-means clustering. C3=complement component C3. iNKT=invariant natural killer T cells. LLDAS=lupus low disease activity state. SLE=systemic lupus erythematosus. SLEDAI-2K=systemic lupus erythematosus disease activity index 2000. sPLS-DA=sparse partial least squares-discriminant analysis.