| Literature DB >> 22532634 |
Arthur G Pratt1, Daniel C Swan, Sarah Richardson, Gillian Wilson, Catharien M U Hilkens, David A Young, John D Isaacs.
Abstract
OBJECTIVE: We sought clinically relevant predictive biomarkers present in CD4 T-cells, or in serum, that identified those patients with undifferentiated arthritis (UA) who subsequently develop rheumatoid arthritis (RA).Entities:
Mesh:
Substances:
Year: 2012 PMID: 22532634 PMCID: PMC3396452 DOI: 10.1136/annrheumdis-2011-200968
Source DB: PubMed Journal: Ann Rheum Dis ISSN: 0003-4967 Impact factor: 19.103
Clinical characteristics of the rheumatoid arthritis (RA) and non-RA comparator groups used to generate list of differentially expressed genes, which together comprise a training cohort for machine-learning (total n=111) and the independent undifferentiated arthritis (UA) validation cohort (n=62)
| Training cohort | Test cohort | |||
|---|---|---|---|---|
| RA (n=47) | Non-RA (n=64) | p | UA | |
| Age (years; mean, SD range) | 60 (46–74) | 48 (34–62) | 0.01 | 52 (37–67) |
| % Female | 65 | 61 | NS | 77 |
| % White Caucasian | 96 | 92 | NS | 90 |
| Symptom duration (weeks; median, IQR) | 12 (8–24) | 21 (10.5–52) | 0.026 | 14 (12–26) |
| Tender joint count (median, IQR) | 10 (4–15) | 7 (2–14) | NS | 8 (3–16.5) |
| Swollen joint count (median, IQR) | 6 (2–10) | 0 (0–2) | <0.001 | 1 (0–3) |
| Morning stiffness (hours; median, IQR) | 1 (0.75–2) | 0.75 (0.1–2) | 0.007 | 1 (0.5–2) |
| ESR (s; median, IQR) | 56 (30–78) | 24 (14–52) | <0.001 | 30 (18–60) |
| CRP (g/l; median, IQR) | 17 (9–62) | 5 (2.5–19) | <0.001 | 8.5 (0–17) |
| ACPA+ (number; per cent) | 29 (62) | 0 (0) | <0.001 | 13 (21) |
| RF+ (number; per cent) | 36 (77) | 3 (6) | <0.001 | 20 (32) |
| DAS28 (median) | 5.37 | NA | – | |
| Leiden prediction score (median, IQR) | NA | NA | – | 6.4 (5–7.6) |
| Outcome diagnoses (number, per cent) | ||||
| RA | 47 (100) | 0 (0) | – | 25 (40) |
| Seronegative sponyloarthropathy | – | 22 (34) | – | 8 (13) |
| Self-limiting inflammatory | – | 12 (19) | – | 9 (15) |
| Other inflammatory | – | 3 (5) | – | 2 (3) |
| OA/non-inflammatory | – | 27 (42) | – | 18 (29) |
Values are mean (1 SD range), median (IQR) or % for normally distributed, skewed or dichotomous data, respectively.
Statistical tests for significant difference between RA and non-RA groups; t-test, Mann–Whitney U or Fisher's exact test for normally distributed, skewed or dichotomous data, respectively.
Demographic, clinical and serological parameters are given for UA–RA and UA–non-RA subgroups in online supplementary table S2. ACPA, anti-citrullinated peptide antibodies; CRP, C reactive protein; DAS28, disease activity score (incorporating 28-swollen/tender joint counts); ESR, erythrocyte sedimentation rate; NS, not significant; OA, osteoarthritis; RA, rheumatoid arthritis; RF, rheumatoid factor; UA, undifferentiated arthritis.
Fold-change and significance level for genes differentially expressed at inception among peripheral blood CD4 T cells between early arthritis clinic patients with inception diagnoses of RA and non-RA (confirmed at ≥1 year; median 28 months follow-up). The official gene symbol and RefSeq accession number are given as identifiers. Listed STAT3-regulated genes are italicised. 12 genes included in statistically most robust ‘RA signature’ appear in boldface, and additional STAT3-regulated genes referred to in text are also provided
| Microarray data (47 RA vs 64 non-RA) | qRT-PCR data (32 RA vs 41 non-RA | ||||
|---|---|---|---|---|---|
| Gene (Accn. No.) 12-Gene RA Signature: | FC | Uncorr. p | Corr. p | FC | p |
| 1.59 | 2.6×10−5 | 0.03 | 2.15 | 0.005 | |
| 1.55 | 3.4×10−6 | 0.03 | 1.83 | 0.002 | |
| 1.52 | 6.8×10−6 | 0.03 | 1.67 | 0.001 | |
| 1.47 | 1.2×10−5 | 0.03 | 1.13 | 0.158 | |
| 1.23 | 3.8×10−5 | 0.04 | 1.25 | 0.003 | |
| 1.2 | 1.7×10−5 | 0.03 | 1.40 | 0.003 | |
| −1.32 | 3.1×10−5 | 0.03 | −1.59 | 0.004 | |
| 1.42 | 1.0×10−5 | 0.03 | ND | ND | |
| 1.31 | 1.1×10−7 | 0.002 | ND | ND | |
| 1.28 | 2.3×10−5 | 0.03 | ND | ND | |
| 1.26 | 2.0×10−5 | 0.03 | ND | ND | |
| 1.32 | 2.1×10−4 | 0.049 | ND | ND | |
| −1.3 | 5.2×10−4 | 0.16 | ND | ND | |
| 1.2 | 0.04 | 0.75 | 1.29 | 0.01 | |
Baseline characteristics of Quantitative real-time PCR validation sub-cohort are similar to those of the training cohort overall (table 1) and are given in the online supplementary table S4.
Calculations based on normalised expression values of array data; Welch's t-test, raw and multiple-test-corrected p values given (see methods).
Calculations based on expression data normalised to the house-keeping gene β-actin (2−ΔCt); Mann–Whitney U test (see methods).
Transcript CR743148 (Illumina Probe ID 6370082) has been retired from NCBI, but the expressed sequence tag corresponds to splice variant(s) within the GPRIN3 gene (chromosome 4.90).
FC, linearised fold-change expression in RA relative to non-RA (ie, negative values represent genes downregulated in RA relative to non-RA by n-fold); ND, not done; RA, rheumatoid arthritis.
Figure 1(A) Hierarchical clustering of training set: 111 samples represented by columns, and indicated individual genes by rows (italicised genes are STAT3 targets). Colour at each co-ordinate indicates gene-wise fold-expression relative to median (colour scale upper right). Underlying colour bar labels samples by inception diagnosis. (B) Receiver operator characteristic (ROC) plot for a rheumatoid arthritis (RA) risk metric derived from normalised gene expression values in the training cohort. Area under curve=0.85; SEM=0.04; p<0.001. (C) Hierarchical clustering of anti-citrullinated peptide antibodies -negative undifferentiated arthritis ‘validation’ sub-cohort samples based on expression patterns of the same genes (interpretation as for figure 1A). (D) ROC curves comparing discriminatory value, in anti-citrullinated peptide antibodies-negative undifferentiated arthritis, of Leiden prediction rule (grey line) with a modified metric incorporating the 12-gene signature. Modified metric confers added value: area under ROC curve (original Leiden prediction rule)=0.74; SEM=0.08 versus area under ROC curve (modified metric incorporating gene signature)=0.84; SEM=0.06; p<0.001 in both cases.
Figure 2Functional analysis of array data. Genes differentially expressed (>1.2-fold change; p<0.05) between osteoarthritis (OA) and three separate inflammatory comparator groups were overlapped in a Venn diagram (see text, and online supplementary gene lists 2–4, for detailed list compositions). Genes uniquely deregulated in rheumatoid arthritis (RA) (anti-citrullinated peptide antibodies (ACPA)-negative, ACPA-positive or both) could thereby be identified and subjected to pathway analysis (see text). The top two over-represented biological functions identified for the three indicated sets are shown, along with the proportion of the set associated with the function in question, and a p value relating to the likelihood of given proportions occurring by chance (Fisher's exact test). Online supplementary gene lists 5–7 summarise functionally related genes thereby identified. The three indicated sets were combined to identify canonical pathways over-represented among genes differentially expressed between RA and OA in general. Pathways of particular interest in the biological context are listed (genes in question are listed in online supplementary gene list 8), *hypergeometric p-values (Fisher's exact) in each case <0.01.
Figure 3(A–B) Baseline profiles of indicated STAT3-regulated genes across four outcome groups; see online supplementary figure S4 and table S2 for additional examples and patient characteristics, respectively. (C–D) Baseline serum interleukin-6 (C; n=131) and C reactive protein (D; n=173) measurements across outcome groups. P-values: non-parametric analyses of variance (Kruskall–Wallis); for post-hoc analyses, 1, 2 and 3 asterisks denote p<0.05, 0.01 and 0.001, respectively (Dunn's multiple comparison analysis). ACPA, anti-citrullinated peptide antibodies; IL-6, interleukin 6; NS, not significant; RA, rheumatoid arthritis; IA, inflammatory arthritis.
Figure 4(A–D) Serum interleukin 6 (IL-6) concentrations correlate with STAT3-inducible gene expression in peripheral blood CD4 T cells (see online supplementary figure S5, for additional examples). Data are shown for 131 individuals in whom paired, contemporaneous samples were available at baseline; Pearson's R and associated p values are shown.