| Literature DB >> 35309349 |
Jared Liu1, Sugandh Kumar1, Julie Hong1, Zhi-Ming Huang1, Diana Paez2, Maria Castillo2, Maria Calvo2, Hsin-Wen Chang1, Daniel D Cummins1, Mimi Chung1, Samuel Yeroushalmi1, Erin Bartholomew1, Marwa Hakimi1, Chun Jimmie Ye2,3,4,5,6,7, Tina Bhutani1, Mehrdad Matloubian2,8, Lianne S Gensler2, Wilson Liao1,3.
Abstract
Early diagnosis of psoriatic arthritis (PSA) is important for successful therapeutic intervention but currently remains challenging due, in part, to the scarcity of non-invasive biomarkers. In this study, we performed single cell profiling of transcriptome and cell surface protein expression to compare the peripheral blood immunocyte populations of individuals with PSA, individuals with cutaneous psoriasis (PSO) alone, and healthy individuals. We identified genes and proteins differentially expressed between PSA, PSO, and healthy subjects across 30 immune cell types and observed that some cell types, as well as specific phenotypic subsets of cells, differed in abundance between these cohorts. Cell type-specific gene and protein expression differences between PSA, PSO, and healthy groups, along with 200 previously published genetic risk factors for PSA, were further used to perform machine learning classification, with the best models achieving AUROC ≥ 0.87 when either classifying subjects among the three groups or specifically distinguishing PSA from PSO. Our findings thus expand the repertoire of gene, protein, and cellular biomarkers relevant to PSA and demonstrate the utility of machine learning-based diagnostics for this disease.Entities:
Keywords: CITE-seq; diagnostic test; machine learning; psoriasis; psoriatic arthritis; single cell
Mesh:
Substances:
Year: 2022 PMID: 35309349 PMCID: PMC8924042 DOI: 10.3389/fimmu.2022.835760
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Cell types and subsets among PSA, PSO, and healthy individuals. (A) UMAP of SCTransform-normalized RNA expression integrated with ADT expression, colored by cell subset. (B) Mean percentages of each cell type within the total PBMCs of each subject. Error bars indicate standard error of the mean; * indicates both Wilcoxon and FDR-adjusted Kruskall-Wallis p-values < 0.05.
Figure 2Differentially expressed features between PSA, PSO, and healthy subjects within cell types. Counts of differentially expressed (A) genes and (B) cell surface proteins are shown for each comparison within each cell type. Top 30 differentially expressed (C) genes and (D) cell surface proteins in each cell type are ranked by highest absolute log2 fold change (for genes) or absolute mean difference (for proteins) between PSA cells vs. PSO (circles) or healthy (triangles) cells.
Figure 3Immune cell subsets differentially abundant in psoriatic and healthy individuals. (A) UMAP of de novo clusters identified within select cell types containing clusters with significant abundance differences. (B) Average percentage of cells from each PSA, PSO, or healthy subject in a given cluster out of total cells from that subject in the given cell type. (C) Volcano plots of genes and cell surface proteins upregulated and downregulated in each cluster relative to other cells of the same cell type. * indicates Wilcoxon p-value < 0.05.
Figure 4Machine learning classification of healthy, PSA and PSO subjects. (A) Classification rate (Importance) of top 20 DEGs, along with corresponding (B) accuracy and kappa of eleven different ML classifiers trained on these features. (C) ROC curve of RF model for healthy, PSA, and PSO classification. Analogous plots shown for (D–F) DEPs and (G–I) DEGs combined with DEPs. Error bars indicate 95% confidence interval.
Figure 5Machine learning classification of PSA vs. PSO subjects. (A) Classification rate (Importance) of top 20 DEGs, along with corresponding (B) accuracy and kappa of eleven different ML classifiers trained on these features. (C) ROC curve of RF model. Analogous plots shown for (D–F) DEPs and (G–I) DEGs combined with DEPs. Error bars indicate 95% confidence interval.
Figure 6Machine learning classification of PSA vs. PSO subjects based on 200 PSA-associated genetic risk loci. (A) Accuracy and kappa of eleven ML models. (B) ROC curve for RF model. Error bars indicate 95% confidence interval. Error bars indicate 95% confidence interval.