| Literature DB >> 32171325 |
Chase Correia1, Seamus Mawe2, Shane Lofgren3, Roberta G Marangoni1, Jungwha Lee4,5, Rana Saber1,4, Kathleen Aren1, Michelle Cheng6, Shannon Teaw6, Aileen Hoffmann1, Isaac Goldberg1, Shawn E Cowper7,8, Purvesh Khatri9, Monique Hinchcliff10,11,12, J Matthew Mahoney13,14.
Abstract
BACKGROUND: Skin fibrosis is the clinical hallmark of systemic sclerosis (SSc), where collagen deposition and remodeling of the dermis occur over time. The most widely used outcome measure in SSc clinical trials is the modified Rodnan skin score (mRSS), which is a semi-quantitative assessment of skin stiffness at seventeen body sites. However, the mRSS is confounded by obesity, edema, and high inter-rater variability. In order to develop a new histopathological outcome measure for SSc, we applied a computer vision technology called a deep neural network (DNN) to stained sections of SSc skin. We tested the hypotheses that DNN analysis could reliably assess mRSS and discriminate SSc from normal skin.Entities:
Keywords: AlexNet; Computer vision; Deep neural network; Histology; Modified Rodnan skin score; Outcome measures; Outcomes; Quantitative image features; Scleroderma; Systemic sclerosis
Mesh:
Substances:
Year: 2020 PMID: 32171325 PMCID: PMC7071594 DOI: 10.1186/s13075-020-2127-0
Source DB: PubMed Journal: Arthritis Res Ther ISSN: 1478-6354 Impact factor: 5.156
Fig. 1Deep neural network (DNN) processing of trichrome-stained skin sections. A) Trichrome-stained skin biopsy sections from patients with SSc and healthy controls were photomicrographed at 40x resolution. To sample variability in tissue structure, we randomly selected 100 image patches from the dermis (red box) corresponding to ~ 0.16 mm2. B) Each image patch was used as input to the AlexNet DNN. AlexNet maps the raw pixel values of the input image to a series of more complex image features. The final output is a 4096-dimensional signature of abstract Quantitative Image Features that were used for subsequent multivariate statistical analyses. C) Principal components analysis and multivariate analyses using QIF as the predictor variables were conducted in order to develop, D) a Biopsy Score, E) a Diagnostic Score, F) a Fibrosis Score that was compared to mRSS and skin gene expression biomarkers
Description of Analyses and Terms
| Term | Analysis tool | Purpose |
|---|---|---|
| Image Patch Score | Principal Component Analysis was applied to the 4096 Quantitative Image Features (QIF) generated by the deep neural network (DNN) algorithm for each of the 100 image patches/biopsy | To quantitatively summarize the variance in SSc biopsy histology |
| Biopsy Score | The mean of the 100 Image Patch Scores for each biopsy section | Used as a discovery tool to assess the utility of applying DNN algorithms to stained SSc biopsies. Defining the Biopsy score as the mean of the 100 Image Patch Scores enabled generation of one quantitative histologic score for each biopsy section |
| Diagnostic Score | Logistic regression | To identify QIF that are associated with SSc versus health control biopsy |
| Fibrosis Score | Linear regression | To identify QIF that are associated with mRSS |
Clinical characteristics of patients with systemic sclerosis
| Clinical Characteristic | Primary Cohort | Secondary Cohort | |
|---|---|---|---|
| Mean ± Standard Deviation or as indicated | SSc patients (n = 6) | SSc patients ( | Healthy controls ( |
| Age | 54 ± 6 | 50 ± 11 | 42 ± 12 |
| Women (n, %) | 5 (83%) | 52 (87%) | 13 (81%) |
| Race, Caucasian (n, %) | 5 (83%) | 45 (75%) | 12 (75%) |
| SSc disease duration (months)* | 11 ± 3 | 38 ± 51 | |
| dcSSc (n, %) | 6 (100%) | 42 (70%) | |
| mRSS (median, IQR) | 15 (8) | 15 (8) | |
| Serum autoantibodies (n, %) | |||
| Anticentromere | 0 | 4 (7%) | |
| Anti-RNA polymerase III | 3 (50%) | 15 (25%) | |
| Anti-topoisomerase I (Scl-70) | 2 (33%) | 19 (31%) | |
*Months between date of onset of first non-Raynaud systemic sclerosis (SSc) symptom and date of skin biopsy. dcSSc Diffuse cutaneous SSc
Fig. 2Association between DNN-derived signatures and mRSS. A) To visualize the distribution of Quantitative Image Features (QIFs) for the primary cohort, a heat map was generated with columns consisting of image patches (26 patients × 100 image patches) and rows consisting of 4096 QIFs. For display, the rows and columns of the heat map were sorted by the first principal component, which highlights both the correlation among features across the set of image patches and the differences in QIF signatures across image patches. B) The distribution of the 100 Image Patch Scores for two biopsies is shown for a patient with low (orange), and high (blue) modified Rodnan Skin Score (mRSS). The mean Image Patch Score for each biopsy, termed the Biopsy Score (dotted vertical line) is shown. C) Biopsy Scores significantly correlated with mRSS (R = 0.55, p = 0.010, repeated measures test)
Fig. 3Prediction of SSc status in a secondary cohort. A) We developed and independently tested a logistic regression model composed of Quantitative Image Features (QIFs) to predict SSc vs. healthy control. The model output was a Diagnostic Score, i.e. the predicted probability that the image was from a patient with SSc. Box plots of the of the cross-validation (CV) and independently tested Diagnostic Scores show significant separation between groups. Using a hard threshold of 0.5, the model achieved a 1.9% misclassification rate using cross validation in the training data and 6.6% on the independent testing data set. B) As an alternative visualization of the model performance, the ROC curves for the logistic regression model show that the model achieves high area under the curve (AUC) for both CV (AUC =1.00) and testing (AUC =0.99)
Fig. 4Prediction of fibrosis and association with molecular pathology. A) We developed and independently tested a linear regression model composed of Quantitative Image Features (QIFs) to predict mRSS. The model output was a Fibrosis Score measured arbitrary units (AU). Using cross validation (CV) on the training data set, the Fibrosis Score strongly correlated with modified Rodnan Skin Score (mRSS) (R = 0.70, open squares). This correlation generalized to the independent test set (R = 0.55, p = 5.3 × 10− 6; solid dots). B) The DNN-derived Fibrosis Score significantly correlated with the validated Scleroderma Skin Severity Score (4S) (R = 0.69, p = 2.9 × 10− 17)