| Literature DB >> 30301895 |
Matthew T Patrick1, Philip E Stuart1, Kalpana Raja1,2, Johann E Gudjonsson1, Trilokraj Tejasvi1,3, Jingjing Yang4,5, Vinod Chandran6,7,8,9, Sayantan Das4, Kristina Callis-Duffin10, Eva Ellinghaus11, Charlotta Enerbäck12, Tõnu Esko13,14, Andre Franke11, Hyun M Kang4, Gerald G Krueger10, Henry W Lim15, Proton Rahman16, Cheryl F Rosen17, Stephan Weidinger18, Michael Weichenthal18, Xiaoquan Wen4, John J Voorhees1, Gonçalo R Abecasis4, Dafna D Gladman6,7,8, Rajan P Nair1, James T Elder1,3, Lam C Tsoi19,20,21.
Abstract
Psoriatic arthritis (PsA) is a complex chronic musculoskeletal condition that occurs in ~30% of psoriasis patients. Currently, no systematic strategy is available that utilizes the differences in genetic architecture between PsA and cutaneous-only psoriasis (PsC) to assess PsA risk before symptoms appear. Here, we introduce a computational pipeline for predicting PsA among psoriasis patients using data from six cohorts with >7000 genotyped PsA and PsC patients. We identify 9 new loci for psoriasis or its subtypes and achieve 0.82 area under the receiver operator curve in distinguishing PsA vs. PsC when using 200 genetic markers. Among the top 5% of our PsA prediction we achieve >90% precision with 100% specificity and 16% recall for predicting PsA among psoriatic patients, using conditional inference forest or shrinkage discriminant analysis. Combining statistical and machine-learning techniques, we show that the underlying genetic differences between psoriasis subtypes can be used for individualized subtype risk assessment.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30301895 PMCID: PMC6177414 DOI: 10.1038/s41467-018-06672-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Computational pipeline to predict psoriasis subtypes. a Overview of pipeline, through quality control, phasing and imputation, association analysis, meta-analysis, and stepwise conditional analysis. b The machine-learning process included separating data randomly into training (cross-validation to optimize the model) and test (holdout) sets, as well as evaluating the results with and without the PAGE Immunochip dataset. PsA psoriatic arthritis; PsC cutaneous-only psoriasis; QC quality control
Number of patients and markers in each Genetic Cohort
| Cohort | Patients | Markers (genotyped and well-imputed) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| PsV | PsA | PsC | Control | Genotyped | SNPa | INDELa | HLA/AAa | Total | |
| PsA GWAS | 1430 | 1430 | NA | 1417 | 972,453 | 17,510,941 | 1,278,891 | 1251 | 18,791,083 |
| CASP GWAS | 1338 | 349 | 639 | 1370 | 438,609 | 15,759,031 | 1,063,919 | 1247 | 16,824,197 |
| Kiel GWAS | 464 | 33 | 269 | 1135 | 504,625 | 13,315,820 | 1,077,158 | 1236 | 14,394,214 |
| Genizon GWAS | 760 | 139 | 399 | 993 | 489,501 | 13,624,904 | 1,093,913 | 1224 | 14,720,041 |
| Exomechip | 3863 | 752 | 1374 | 4027 | 461,092 | 16,411,455 | 976,233 | 1254 | 17,388,942 |
| PAGE Immunochip | 3169 | 971 | 885 | 7394 | 160,228 | 1,414,274 | 84,270 | 1245 | 1,499,789 |
| New Total | 11,024 | 3674 | 3566 | 16,336 | New Union | 23,657,701 (8,730,264b) | 1,403,045 (1,021,305b) | 1270 (1217b) | 25,062,016 (9,752,786b) |
| New GWAS Total | 7855 | 2703 | 2681 | 8943 | New intersection (All) | 1,120,138 (43,356c) | 66,845 (3301c) | 1203 (546c) | 1,188,186 (47,203c) |
| Previous[ | 9293 | 3061 | 3110 | 17,393 | New intersection (GWAS) | 9,771,987 (247,740c) | 870,338 (27,115c) | 1205 (546c) | 10,643,530 (275,401c) |
| Previous[ | 4007 | 1946 | 1363 | 4934 | Previous[ | 8,265,477 (7,091,979b) | 681,304 (627,111b) | 1342 (1216b) | 8,948,123 (7,720,306b) |
| Previous[ | 40,249 (8,775c) | 3187 (717c) | 1141 (309c) | 44,577 (9801c) | |||||
| Previous[ | 6,964,145 (229,722c) | 589,032 (20,195c) | 1269 (326c) | 7,554,446 (250,243c) | |||||
PsV psoriasis vulgaris; PsA psoriatic arthritis; PsC cutaneous-only psoriasis; NA not available
aWell-imputed markers (r2 ≥ 0.7)
bUnion of markers filtered using MAF ≥ 0.01 (these are the markers used in our unconditional meta-analysis)
cIntersection of markers filtered using MAF ≥ 0.01 and p ≤ 0.05 (these are the markers used in our conditional meta-analysis). All the samples are of Caucasian descent
Fig. 2Meta-analysis results. New loci identified by this study are highlighted in red, whereas loci identified in a previous study that were not genome-wide significant in this study are highlighted in blue, for meta-analysis results based on the following comparisons: a PsV vs. Control; b PsA vs. Control; and c PsC vs. Control
Meta-analysis results for possible new psoriasis loci
| Marker ID | Chr | Position (hg19)a | Alleles (risk/nonrisk) | Nearby gene | Phenotype comparison | Direction (PCKGEI)b | Control AFc | Case AFc | Meta ORd | Meta |
|---|---|---|---|---|---|---|---|---|---|---|
| rs9591325 | 13 | 50811220 | T/C |
| PsV-ctl | ++++++ | 0.921 | 0.934 | 1.25 | 7 × 10−9 |
| rs7612823 | 3 | 101613923 | T/C |
| PsA-ctl | ++++++ | 0.806 | 0.836 | 1.25 | 3 × 10−8 |
| rs848 | 5 | 131996500 | C/A |
| PsA-ctl | ++++++ | 0.787 | 0.827 | 1.27 | 1 × 10−9 |
| rs588177 | 11 | 64024056 | C/A |
| PsA-ctl | ++−+++ | 0.301 | 0.339 | 1.20 | 1 × 10−8 |
| rs1177202 | 2 | 61074576 | C/G |
| PsC-ctl | ++++++ | 0.566 | 0.606 | 1.18 | 2 × 10−8 |
| rs2111485 | 2 | 163110536 | G/A |
| PsC-ctl | +++++ | 0.605 | 0.641 | 1.18 | 4 × 10−8 |
| rs14990525 | 16 | 31006289 | TGGTGCTA/- |
| PsC-ctl | +++++ | 0.362 | 0.402 | 1.20 | 9 × 10−10 |
| rs34536443 | 19 | 10463118 | G/C |
| PsC-ctl | ?++++ | 0.955 | 0.978 | 2.08 | 2 × 10−13 |
| rs34685920 | 20 | 48572650 | A/− |
| PsC-ctl | +++++ | 0.568 | 0.608 | 1.20 | 1 × 10−10 |
Chr chromosome, AF allele frequency, OR odds ratio, p p value
aFor insertions or deletions of the reference sequence, position of first base before the insertion point or of first base of the deleted sequence is shown, respectively
bFor six studies of discovery meta-analysis (P = PsA GWAS, C = CASP GWAS, K = Kiel GWAS, G = Genizon GWAS, E = Exomechip, I = PAGE Immunochip) indicates whether OR of risk allele is ≥1 (+), <1 (−), or undetermined due to low imputation quality (?). PsA GWAS directions are only included for PsV-ctl and PsA-ctl, since the PsA GWAS cohort does not contain any patients with PsC subphenotype
cAFs are represented according to the risk allele
dOR and p value for fixed effects meta-analysis with inverse variance weighting
Fig. 3Enrichment for regulatory elements. Enrichment calculated using active enhancers predicted using H3K27ac: a heatmap showing the relative overlap (compared to the 0–0.05 bin) with each cell type for markers with different p values (from gray: no overlap, through to dark green: high overlap) and b overlap for the top five cell types (with highest relative increase from the baseline percentage)
Fig. 4Risk prediction and assessment. a Benchmarking performance, on the cross-validation set, of the top five classifiers (penalized.ridge logistic ridge regression; lda linear discriminant analysis; earth multivariate adaptive regression splines; randomForest random forest; cforest conditional inference forest) out of 26 MLR classifiers in the complete benchmark (Supplementary Figure 4). b Classifier performance (on both the CV and test set), calculated using the area under the receiver operator curve (AUC). c Trade-off between precision and recall when predicting different proportions of samples as having PsA in hold-out test set. d Evaluation of classifier calibration, under different prior probabilities for PsA (on the 3:7 PsA/PsC ratio test set), which we subsequently used to predict the risk of patients with, as of yet, undiagnosed psoriasis subtype developing PsA