| Literature DB >> 31921420 |
William A Figgett1, Katherine Monaghan2, Milica Ng2, Monther Alhamdoosh2, Eugene Maraskovsky2, Nicholas J Wilson2, Alberta Y Hoi3, Eric F Morand3, Fabienne Mackay1,4.
Abstract
OBJECTIVES: Systemic lupus erythematosus (SLE) is a heterogeneous autoimmune disease that is difficult to treat. There is currently no optimal stratification of patients with SLE, and thus, responses to available treatments are unpredictable. Here, we developed a new stratification scheme for patients with SLE, based on the computational analysis of patients' whole-blood transcriptomes.Entities:
Keywords: RNA‐seq; SLE; autoimmunity; stratification; transcriptomics
Year: 2019 PMID: 31921420 PMCID: PMC6946916 DOI: 10.1002/cti2.1093
Source DB: PubMed Journal: Clin Transl Immunology ISSN: 2050-0068
Cohorts of patients and healthy donors, for whole‐blood RNA‐seq data
| Data set and reference | Subjects | Collection site | Clinical metadata | RNA‐sequencing method |
|---|---|---|---|---|
| Data set 1 | ||||
|
Hung Accession: PRJNA294187 |
99 SLE (93 female and 6 male) | UCSF Medical Center, USA |
Anti‐Ro (‘none’, ‘medium’ and ‘high’) ISM (‘low’ and ‘high’) |
Whole blood collected in PAXgene tubes, RNA extracted with TRIzol (Invitrogen, Waltham, MA, USA) RIN checked but not specified TruSeq Library Preparation Kit (Illumina, San Diego, CA, USA) HiSeq 2000 platform (Illumina) 50‐bp SE reads |
| 18 healthy (female) | ||||
| Data set 2 | ||||
|
This study Accession: PRJNA439269 |
30 SLE (28 female and 2 male) | Monash Medical Centre, Melbourne, Australia |
Age Race SLEDAI‐2k, PGA Clinical manifestations Flow cytometry Medications |
Whole blood collected in PAXgene tubes, RNA extracted with PAXgene kit (Qiagen, Hilden, Germany) RIN > 7 TruSeq Library Preparation Kit (Illumina) HiSeq 2500 platform (Illumina) 100‐bp SE reads |
|
29 healthy (27 female and 2 male) | ||||
| Data set 3 | ||||
|
Tokuyama (2019) Accession: PRJNA505280 |
20 SLE 6 healthy | Yale‐New Haven Hospital, USA |
Age Race |
Whole blood collected in heparin tubes, RNA extracted using RNeasy kit (Qiagen) Library preparation kit for polyA RNA (Illumina) Illumina HiSeq 2500 or NextSeq 500 150‐bp PE reads |
| All female | ||||
| Data set 4 | ||||
|
Rai Accession: PRJNA318253 |
12 SLE 4 healthy | Sir Sunderlal Hospital, Banaras Hindu University, India |
Age SLEDAI‐2k Anti‐DNA (±) Anti‐ENA (±) Clinical manifestations Medications |
Whole blood collected in heparin tubes, RBC lysis buffer, RNA extracted with TRI reagent (Sigma) RIN > 7 TruSeq Library Preparation Kit (Illumina) HiSeq 2000 platform (Illumina) 100‐bp PE reads |
| All female | ||||
| Meta‐analysis | ||||
|
This study. Data sets 1 + 2 + 3 + 4 |
161 SLE 57 healthy | As above | As above. | As above |
All RNA‐seq data are publicly available from the Sequence Read Archive (SRA).63 Data sets are numbered in descending order of size. Excluded sample in Data set 2: ‘SLE_21’ (SRR6970317), which was later found to not have SLE.
ENA, extractable nuclear antigens; ISM, interferon signature metric; PE, paired‐end; PGA, Physician Global Assessment; RIN, RNA integrity number; SE, single end; SLE, systemic lupus erythematosus; SLEDAI‐2k, SLE disease activity index 2000; UCSF, University of California, San Francisco.
Figure 1Differential gene expression in SLE. 161 SLE (orange symbols) and 57 healthy donor (blue symbols) transcriptomes from four data sets (see Table 1, shown with different symbol shapes) were examined using multivariate statistics methods. (a) Principal components analysis (PCA) was applied to visualise the overall variance between individuals. The same data points are coloured by data set source (left plots) or disease state (right plots) as indicated. (b) Partial least squares discriminant analysis (PLSDA), a supervised clustering method, applies weighting to genes, which separate healthy donors and unstratified SLE patients. Ovals indicate the 80% prediction interval. (c) Standardised expression levels of top‐weighted genes from the PLSDA model were plotted as a heatmap. Each row is an individual, and each column is a gene.
Figure 2Patient clustering. (a) PCA visualisation of 161 SLE whole‐blood transcriptomes after clustering using the k‐means algorithm. Four clusters of patients were segregated and displayed with different symbols. Three data sets were combined (see Table 1). (b) Venn diagram displaying selected top‐ranking disturbed gene sets (from MSigDB hallmark gene sets) in each SLE cluster C1‐C4 compared to the healthy control group; highest ranking gene sets are bolded. (c) Percentage of anti‐Ro autoantibody levels in 99 patients from Data set 1, rated as ‘none’, ‘medium’ or ‘high’, derived from Data set 1 metadata.13 The odds ratio of anti‐Ro positivity and Fisher's exact test P‐values were calculated for each cluster compared to other patients.
Figure 3Disease severity and clinical features in SLE subtypes. SLE clusters C1‐C4 in Data set 2 were compared by clinical features. Blue bars represent the mean, and symbols represent patients. Red + symbols represent patients experiencing flares (temporary period of worsened symptoms) at the time of sampling. (a) SLE disease activity index 2000 (SLEDAI‐2k). (b) Physician Global Assessment (PGA). (c) Ratio of anti‐dsDNA autoantibodies, in C4 vs the other clusters combined. (d) Circulating neutrophil numbers. (e) Total number of ACR criteria each patient was positive for. (f) Percentage map of patients in each cluster, who are positive for particular disease features as detailed (ACR criteria) and flare activity.
Figure 4Relative expression levels of known SLE‐associated genes. Expression levels (log2 fold‐change relative to the mean of the healthy controls) of (a) TNFSF13B (BAFF), (b) TNFSF10 (TRAIL), (c) CFLAR, (d) TLR7, (e) PELI1, (f) TSC22D3 (GILZ), (g) CD40LG, (h) IFNAR1 and (i) CTLA4. Expression of interferon signature metric (ISM) genes: (j) HERC5, (k) CMPK2 and (l) EPSTI1. Therapeutics are indicated in red text above genes coding for the relevant target protein. Three data sets were combined (see Table 1) with batch effects modelled using limma. Significant differences between healthy and SLE samples, using Benjamini–Hochberg‐adjusted P‐values, are indicated (*P < 0.05, **P < 0.01, ***P < 0.001 and ****P < 0.0001). Gene expression in unstratified patients is provided in Supplementary figure 9.
Figure 5Gene signature for SLE flare activity. Whole‐blood RNA‐seq data from 30 SLE patients (24 without flares and six with flares) and 29 healthy donors were compared (Data set 2, see Table 1). (a) Principal components analysis (PCA) to visualise the variation between samples (in all genes); different symbols represent individuals in each group as shown. (b) Partial least squares discriminant analysis (PLSDA) was used to select genes that distinguish the groups. (c–f) Relative expression of flare‐associated genes, shown as the log2 fold‐change relative to the mean of the healthy donor group (‘H’). BH‐adjusted P‐values for differential expression (on count data) were calculated using limma (*P < 0.05, **P < 0.01). Gene set enrichment analysis is provided in Supplementary figure 11.