| Literature DB >> 34095386 |
Miyuraj Harishchandra Hikkaduwa Withanage1, M Paula Gomez Hernandez2, Emily E Starman3, Andrew B Davis4, Erliang Zeng1, Scott M Lieberman5, Kim A Brogden2, Emily A Lanzel6.
Abstract
Sjögren's syndrome is an autoimmune disease that can also occur in children. The disease is not well defined and there is limited information on the presence of chemokines, cytokines, and biomarkers (CCBMs) in the saliva of children that could improve their disease diagnosis. In a recent study [1], we reported a large dataset of 105 CCBMs that were associated with both lymphocyte and mononuclear cell functions [2] in the saliva of 11 children formally diagnosed with Sjögren's syndrome and 16 normal healthy children. Here, we extend those findings and use the Mendeley dataset [2] to identify CCBMs that have predictive power for Sjögren's syndrome in female children. Datasets of CCBMs from all saliva samples and female children saliva samples were standardized. We used machine learning methods to select Sjögren's syndrome associated CCBMs and assessed the predictive power of selected CCBMs in these two datasets using receiver operating characteristic (ROC) curves and associated areas under curve (AUC) as metrics. We used eight classifiers to identify 16 datasets that contained from 2 to 34 CCBMs with AUC values ranging from 0.91 to 0.94.Entities:
Keywords: Biomarkers; Chemokines; Children; Cytokines; Saliva; Sjögren's syndrome
Year: 2021 PMID: 34095386 PMCID: PMC8165406 DOI: 10.1016/j.dib.2021.107139
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Comparison of the highest area under curve (AUC) values resulting from classifiers using two datasets: maximum AUC of all samples (red) and maximum AUC of female samples only (blue). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Comparison of receiver operating characteristic (ROC) curves of classifiers on different feature sets. The area under curve (AUC) values indicate CCBMs can be served as predictor biomarkers for Sjögren syndrome diagnosis in children. Eight classifiers were used, including k-NN: k-Nearest Neighbor, RF: Random Forest, GP: Gaussian Process, SVM (rbf): Support Vector Machine with rbf Kernel, SVM (Linear): Support Vector Machine with Linear Kernel, LR: Logistic Regression, AB: AdaBoost, and NB: Naïve Bayes.
The area under curve (AUC) values of best performing models.
| Classifier | Model Name | Feature set (a) | AUC |
|---|---|---|---|
| Random Forest (RR) | RR_fe_1 | IL27, MIA, CCL4, CXCL11 | 0.93 |
| RR_fe_2 | IL27, MIA, CCL4, CXCL11, IL23A | ||
| RR_fe_3 | IL27, MIA, CCL4, CXCL11, TNFRSF18 | ||
| Logistic Regression (LR) | LR_fe_1 | IL27, CCL4, CXCL11 | 0.93 |
| Gaussian Process (GP) | GP_fe_1 | IL27, CCL4, CXCL11, IL23A | 0.91 |
| GP_fe_2 | IL27, CCL4, CXCL11, MIA | ||
| GP_fe_3 | IL27, CCL4, CXCL11, MIA, TNFRSF18, CCL19, IL12B, TNFRSF9, LIF, CCM2, GZMB | ||
| GP_fe_4 | IL27, CCL4, CXCL11, IL23A, TNFRSF18, CCL19, IL12B, TNFRSF9, LIF | ||
| k-Nearest Neighbor (NN) | NN_fe_1 | IL27, CCL4 | 0.94 |
| SVM(rbf) | SVMr_fe_1 | CCL4, IL27, CXCL11, TNFRSF18 | 0.94 |
| SVMr_fe_2 | CCL4, IL27, CXCL11 | ||
| SVM(Linear) | SVML_fe_1 | IL27, CCL4, CXCL11, TNFRSF18 | 0.94 |
| SVML_fe_2 | IL27, CCL4, CXCL11, TNFRSF18, MIA | ||
| AdaBoost (AB) | AB_fe_1 | CCL4, IL27, CXCL11, TNFRSF18, IL12B, ALCAM, CCL19, IL23A, TSLP, IL16, LIF, TNFSF5, TNFRSF8, CCL20, IRX1, CCL15, IL15, TNFRSF9, CXCL13, CCM2, CD40, CCL21, IL1B, MIA, XCL1, MMP9, CCL11, S100A8, GZMB, ULBP2, TNFSF11, CCL5, CXCL10, IFNB1 | 0.94 |
| Naïve Bayes (NB) | NB_fe_1 | IL27, CCL4, CXCL11, MIA, IL23A, CCL21, CCL19, ACAN, CCM2, TNFRSF9 | 0.94 |
| NB_fe_2 | IL27, CCL4, CXCL11, MIA, IL23A, CCL21, CCL19, ACAN, CCM2, TNFRSF9, IL12B |
Two different feature sets could have the same prediction power, that is, same AUC values. For example, models “SVML_fe_1″ and “SVML_fe_1″ have the same AUC value (0.94).
Catalog and lot numbers for kits of fluorescent microparticle-based immunoassays used to determine the concentrations of chemokines, cytokines, and biomarkers (CCBMs) in the saliva of children formally diagnosed with Sjögren's syndrome and from normal healthy children, matched for gender and age, who served as study controls.
| Kit/Lot | No. of CCBMs in Plex | CCBMs in kit |
|---|---|---|
| LXSAHM-07 (lot 129,053) | 7 | CCL5, CCL17, IL12A, TIMP1, TNFRSF13B, TNFRSF17, TNFRSF1A |
| LXSAHM-36 (lot 129,052) | 36 | B2M, CALCA, CCL1, CCL11, CCL20, CCL22, CCL26, CCL28, CCL3, CCL7, CCL8, CD274, CD40, CXCL13, CXCL14, CXCL4, CXCL9, FCER1G, GZMA, IFNA1, IFNB1, IFNG, IFNGR1, IL16, IL1A, IL1B, IL1R2, IL21, IL6, LGALS3, LGALS9, SLPI, TNFRSF7, TNFRSF8, TNFSF11, TNFSF5 |
| LXSAHM-22 (lot 128,990) | 22 | IL11RA, C9, CCL21, CST3, FSTL1, IFNL3, IGFBP3, IL12B, IL15, IL27, LGALS3BP, LIF, LTF, MIA, NAGLU, S100A8, S100A9, TNFRSF18, TNFRSF1B, TSLP, ULBP2, XCL1 |
| LXSAHM-40 (lot 129,245) | 40 | A2M, ACAN, ALCAM, AMBP, C5, CA9, CCL13, CCL14, CCL15, CCL18, CCL19, CCL2, CCL23, CCL24, CCL25, CCL27, CCL4, CCM2, CD276, CTSS, CXCL10, CXCL11, FASLG, FSTL3, GAS6, GDF2, GZMB, IFNL2, IL10, IL23A, IL2RA, IL7, IRX1, MMP9, PECAM1, SELL, TNFA, TNFRSF9, TNFSF13, TNFSF13B |
Luminex Human Magnetic Assay, R&D Systems, Minneapolis, MN USA.
| Subject | Immunology |
| Specific subject area | Sjögren's syndrome in children |
| Type of data | Figure and Table |
| How data were acquired | 1. Fluorescent microparticle-based immunoassays (Luminex Human Magnetic Assay, R&D Systems, Minneapolis, MN) |
| 2. Luminex model 100 IS (Luminex, Austin, TX USA) | |
| 3. xPonent v3.1 software (Luminex, Austin, TX) | |
| 4. Milliplex Analyst v5.1 software (EMD Millipore, Billerica, MA) | |
| Data format | Raw and analyzed |
| Parameters for data collection | CCBM concentrations in saliva samples from children with Sjögren's syndrome and from healthy children of the same gender and age |
| Description of data collection | Saliva samples were collected from August 30, 2016 to May 23, 2017. CCBM data was collected from July 12, 2019 to August 16, 2019 |
| Data source location | Institution: University of Iowa College of Dentistry |
| Data accessibility | Repository name: Mendeley Data |
| Related research article | M.P. Gomez Hernandez, E.E. Starman, A.B. Davis, M.H. Hikkaduwa Withanage, E. Zeng, S.M. Lieberman, K.A. Brogden, E.A. Lanzel. A unique profile of chemokines, cytokines, and biomarkers in the saliva of children with Sjögren syndrome. Rheumatology (2021) 1–13. |