| Literature DB >> 32415284 |
Jonas Bianchi1,2, Antônio Carlos de Oliveira Ruellas3, João Roberto Gonçalves4, Beatriz Paniagua5, Juan Carlos Prieto6, Martin Styner6, Tengfei Li7, Hongtu Zhu7, James Sugai8, William Giannobile8, Erika Benavides8, Fabiana Soki8, Marilia Yatabe3, Lawrence Ashman9, David Walker10, Reza Soroushmehr11, Kayvan Najarian11, Lucia Helena Soares Cevidanes3.
Abstract
After chronic low back pain, Temporomandibular Joint (TMJ) disorders are the second most common musculoskeletal condition affecting 5 to 12% of the population, with an annual health cost estimated at $4 billion. Chronic disability in TMJ osteoarthritis (OA) increases with aging, and the main goal is to diagnosis before morphological degeneration occurs. Here, we address this challenge using advanced data science to capture, process and analyze 52 clinical, biological and high-resolution CBCT (radiomics) markers from TMJ OA patients and controls. We tested the diagnostic performance of four machine learning models: Logistic Regression, Random Forest, LightGBM, XGBoost. Headaches, Range of mouth opening without pain, Energy, Haralick Correlation, Entropy and interactions of TGF-β1 in Saliva and Headaches, VE-cadherin in Serum and Angiogenin in Saliva, VE-cadherin in Saliva and Headaches, PA1 in Saliva and Headaches, PA1 in Saliva and Range of mouth opening without pain; Gender and Muscle Soreness; Short Run Low Grey Level Emphasis and Headaches, Inverse Difference Moment and Trabecular Separation accurately diagnose early stages of this clinical condition. Our results show the XGBoost + LightGBM model with these features and interactions achieves the accuracy of 0.823, AUC 0.870, and F1-score 0.823 to diagnose the TMJ OA status. Thus, we expect to boost future studies into osteoarthritis patient-specific therapeutic interventions, and thereby improve the health of articular joints.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32415284 PMCID: PMC7228972 DOI: 10.1038/s41598-020-64942-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The spectrum of Data Science to advance TMJ OA diagnosis includes Data capture and acquisition, Data processing with a web-based data management, Data Analytics involving in-depth statistical analysis, machine learning approaches, and Data communication to help the decision-making support in TMJ OA diagnosis.
Descriptive and demographic values for each clinical variable.
| Variables | Abbreviation | Control Group (n = 46) Female (39) Male (7) | TMJ OA Group (n = 46) Female (39) Male (7) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Median | Mean | 95% CI | SD | Median | Mean | 95% CI | SD | ||||
| Lower | Upper | Lower | Upper | ||||||||
| Age | Age | 38.50 | 39.83 | 35.89 | 43.76 | 13.26 | 38.00 | 37.65 | 33.99 | 41.32 | 12.34 |
| Years of Pain Onset (years) | PainY | — | — | — | — | — | 3.75 | 4.34 | 3.35 | 5.33 | 3.34 |
| Facial Current Pain (years) | PainCur | — | — | — | — | — | 3.00 | 3.07 | 2.47 | 3.66 | 2.00 |
| Facial last 6 months Worst Pain (0 to 10) | PainWor | — | — | — | — | — | 7.00 | 6.89 | 6.06 | 7.73 | 2.81 |
| Facial last 6 months Average Pain (0 to 10) | PainAve | — | — | — | — | — | 4.50 | 4.52 | 3.87 | 5.17 | 2.20 |
| Last 6 Months Distressed by Headaches (0 to 10) | Headaches | 0.00 | 0.63 | 0.33 | 0.93 | 1.02 | 2.00 | 1.65 | 1.33 | 1.97 | 1.08 |
| Last 6 Months Distressed by Muscle Soreness (0 to 10) | MusSor | 0.00 | 0.37 | 0.16 | 0.58 | 0.71 | 1.00 | 1.07 | 0.74 | 1.39 | 1.10 |
| Vertical Range Unassisted Without Pain (mm) | RangeWOpain | 44.50 | 44.91 | 42.42 | 47.40 | 8.39 | 36.35 | 39.00 | 32.44 | 40.26 | 13.16 |
| Vertical Range Unassisted Max (mm) | RangeUnaMax | 47.50 | 46.83 | 44.62 | 49.03 | 7.41 | 45.00 | 44.28 | 41.41 | 47.16 | 9.68 |
| Vertical Range Assisted Max (mm) | RangeAssMax | 50.00 | 49.21 | 47.15 | 51.27 | 6.94 | 49.00 | 47.54 | 44.69 | 50.40 | 9.61 |
CI: Confidence Interval; SD: Standard Deviation.
Descriptive values for each biomolecular variable.
| Variables | Abbreviation | Control Group (n = 46) Female (39) Male (7) | TMJ OA Group (n = 46) Female (39) Male (7) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Median | Mean | 95% CI | SD | Median | Mean | 95% CI | SD | ||||
| Lower | Upper | Lower | Upper | ||||||||
| Angiogenin Serum | ANG_Ser | 1467.10 | 1454.86 | 1389.09 | 1520.62 | 221.46 | 1459.05 | 1457.15 | 1368.65 | 1545.65 | 298.02 |
| BDNF Serum | BDNF_Ser | 280.25 | 719.56 | 378.08 | 1061.04 | 1149.92 | 286.35 | 1121.62 | 544.88 | 1698.35 | 1942.12 |
| CXCL16 Serum | CXCL16_Ser | 3726.70 | 3741.37 | 3550.00 | 3932.73 | 644.40 | 3827.50 | 3988.57 | 3687.27 | 4289.87 | 1014.61 |
| ENA-78 Serum | ENA-78_Ser | 348.70 | 664.23 | 453.24 | 875.21 | 710.48 | 276.40 | 593.05 | 394.98 | 791.11 | 666.97 |
| MMP3 Serum | MMP3_Ser | 2358.25 | 2373.10 | 2073.33 | 2672.87 | 1009.45 | 2305.20 | 2367.03 | 2091.83 | 2642.24 | 926.72 |
| MMP7 Serum | MMP7_Ser | 496.55 | 527.66 | 444.90 | 610.42 | 278.69 | 453.75 | 554.15 | 419.32 | 688.98 | 454.02 |
| OPG Serum | OPG_Ser | 2539.15 | 3010.92 | 2165.32 | 3856.52 | 2847.49 | 2428.10 | 3116.79 | 2415.06 | 3818.51 | 2363.01 |
| PAI-1 Serum | PAI-1_Ser | 6505.60 | 7930.35 | 6486.74 | 9373.96 | 4861.24 | 6693.65 | 7237.11 | 5904.80 | 8569.41 | 4486.42 |
| TGF-β1 Serum | TGF-β1_Ser | 91.20 | 140.68 | 98.90 | 182.47 | 140.70 | 99.15 | 177.84 | 103.81 | 251.87 | 249.29 |
| TIMP-1 Serum | TIMP-1_Ser | 7382.15 | 7280.32 | 7020.93 | 7539.71 | 873.47 | 7351.65 | 7382.74 | 7099.45 | 7666.03 | 953.95 |
| TRANCE Serum | TRANCE_Ser | 2078.70 | 2200.67 | 1885.39 | 2515.95 | 1061.69 | 2507.15 | 2560.51 | 2231.90 | 2889.12 | 1106.56 |
| VE-cadherin Serum | VE-cad_Ser | 6259.20 | 6527.08 | 5140.64 | 7913.53 | 4668.73 | 5308.05 | 6154.80 | 4988.12 | 7321.48 | 3928.70 |
| VEGF Serum | VEGF_Ser | 93.90 | 115.32 | 76.18 | 154.46 | 131.80 | 87.30 | 117.40 | 85.32 | 149.47 | 108.01 |
| Angiogenin Saliva | ANG_Sal | 721.85 | 720.83 | 652.98 | 788.69 | 228.48 | 754.05 | 758.02 | 702.29 | 813.74 | 187.65 |
| BDNF Saliva | BDNF_ Sal | 5.20 | 7.60 | 5.13 | 10.08 | 8.34 | 3.95 | 8.32 | 4.03 | 12.62 | 14.47 |
| CXCL16 Saliva | CXCL16_Sal | 109.40 | 183.94 | 121.59 | 246.29 | 209.95 | 100.60 | 207.09 | 130.24 | 283.95 | 258.80 |
| ENA-78 Saliva | ENA-78_Sal | 2424.60 | 2218.02 | 1925.14 | 2510.90 | 986.25 | 2482.60 | 2410.40 | 2087.41 | 2733.39 | 1087.63 |
| MMP7 Saliva | MMP7_Sal | 3290.90 | 3615.28 | 2949.23 | 4281.33 | 2242.87 | 3594.20 | 3666.79 | 3040.77 | 4292.82 | 2108.08 |
| OPG Saliva | OPG_Sal | 555.75 | 855.69 | 560.07 | 1151.32 | 995.49 | 732.30 | 1329.53 | 754.68 | 1904.38 | 1935.77 |
| PAI-1 Saliva | PAI-1_Sal | 24.40 | 85.02 | 45.07 | 124.96 | 134.51 | 40.40 | 93.14 | 32.77 | 153.52 | 203.32 |
| TGF-β1 Saliva | TGF-β1_Sal | 40.70 | 64.66 | 41.86 | 87.46 | 76.78 | 54.45 | 69.20 | 46.64 | 91.76 | 75.97 |
| TIMP-1 Saliva | TIMP-1_Sal | 4070.90 | 3963.81 | 3663.14 | 4264.48 | 1012.49 | 3889.80 | 3880.69 | 3664.71 | 4096.67 | 727.30 |
| TRANCE Saliva | TRANCE_Sal | 627.60 | 794.52 | 533.49 | 1055.56 | 879.02 | 698.75 | 1041.55 | 622.45 | 1460.64 | 1411.27 |
| VE-cadherin Saliva | VE-cad_Sal | 643.10 | 1008.24 | 657.65 | 1358.84 | 1180.60 | 666.70 | 1313.82 | 585.33 | 2042.30 | 2453.11 |
| VEGF Saliva | VEGF_Sal | 1181.35 | 1342.62 | 1125.23 | 1560.02 | 732.07 | 1441.65 | 1419.39 | 1281.99 | 1556.79 | 462.68 |
CI: Confidence Interval; SD: Standard Deviation.
Descriptive values for each imaging variable.
| Variables | Abbreviation | Control Group (n = 46) Female (39) Male (7) | TMJ OA Group (n = 46) Female (39) Male (7) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Median | Mean | 95% CI | SD | Median | Mean | 95% CI | SD | ||||
| Lower | Upper | Lower | Upper | ||||||||
| Energy | Energy | 0.30 | 0.31 | 0.28 | 0.33 | 0.07 | 0.25 | 0.25 | 0.23 | 0.27 | 0.07 |
| Entropy | Entropy | 2.29 | 2.30 | 2.20 | 2.40 | 0.33 | 2.56 | 2.62 | 2.49 | 2.75 | 0.42 |
| Inverse Difference Moment | InvDifMom | 0.91 | 0.90 | 0.90 | 0.91 | 0.02 | 0.89 | 0.89 | 0.89 | 0.90 | 0.02 |
| Inertia | Inertia | 0.19 | 0.19 | 0.19 | 0.20 | 0.03 | 0.21 | 0.21 | 0.20 | 0.22 | 0.03 |
| Haralick Correlation | HarCor | 317.48 | 375.56 | 303.63 | 447.49 | 242.23 | 410.36 | 603.40 | 467.71 | 739.10 | 456.94 |
| Short Run Emphasis | ShortRE | 0.33 | 0.34 | 0.35 | 0.34 | 0.03 | 0.35 | 0.35 | 0.34 | 0.36 | 0.03 |
| Long Run Emphasis | LongRE | 16.58 | 16.51 | 16.01 | 17.01 | 1.68 | 15.44 | 15.64 | 15.10 | 16.18 | 1.81 |
| Grey Level Non Uniformity | GreyLNU | 2405.84 | 2374.26 | 2272.67 | 2475.84 | 342.08 | 2240.61 | 2249.65 | 2158.63 | 2340.67 | 306.50 |
| Run Length Non Uniformity | RunLNU | 1239.23 | 1287.96 | 1209.65 | 1366.27 | 263.72 | 1443.88 | 1459.22 | 1367.19 | 1551.25 | 309.91 |
| Low Grey Level Run Emphasis | LowGLRE | 0.06 | 0.06 | 0.06 | 0.06 | 0.01 | 0.06 | 0.06 | 0.05 | 0.06 | 0.01 |
| High Grey Level Run Emphasis | HighGLRE | 19.10 | 19.98 | 18.76 | 21.19 | 4.09 | 21.05 | 22.47 | 20.95 | 23.98 | 5.11 |
| Short Run Low Grey Level Emphasis | ShortRLowGLE | 0.02 | 0.02 | 0.02 | 0.02 | 0.00 | 0.02 | 0.02 | 0.02 | 0.02 | 0.00 |
| Short Run High Grey Level Emphasis | ShortRHighGLE | 6.96 | 7.25 | 6.72 | 7.77 | 1.77 | 8.15 | 8.56 | 7.89 | 9.24 | 2.27 |
| Long Run Low Grey Level Emphasis | LongRLowGLE | 1.05 | 1.05 | 0.98 | 1.11 | 0.23 | 0.95 | 0.95 | 0.87 | 1.03 | 0.28 |
| Long Run High Grey Level Emphasis | LongRHighGLE | 299.09 | 303.82 | 283.69 | 323.95 | 67.80 | 309.96 | 317.43 | 298.88 | 335.97 | 62.45 |
| Bone Volume (%) | BV/TV | 0.54 | 0.54 | 0.48 | 0.60 | 0.20 | 0.60 | 0.58 | 0.52 | 0.64 | 0.20 |
| Trabecular Thickness (mm) | Tb.Th | 0.35 | 0.38 | 0.33 | 0.43 | 0.16 | 0.41 | 0.44 | 0.38 | 0.50 | 0.19 |
| Trabecular Separation (mm) | Tb.Sp | 0.28 | 0.34 | 0.27 | 0.40 | 0.21 | 0.26 | 0.35 | 0.25 | 0.44 | 0.31 |
| Trabecular Number (mm−1) | Tb.N | 1.47 | 1.44 | 1.38 | 1.51 | 0.23 | 1.45 | 1.36 | 1.28 | 1.44 | 0.28 |
| Bone Surface to Bone Volume Ratio (mm−1) | BS/BV | 5.79 | 6.08 | 5.43 | 6.73 | 2.18 | 4.89 | 5.30 | 4.65 | 5.95 | 2.18 |
CI: Confidence Interval; SD: Standard Deviation.
Figure 2Mann-Whitney U test comparison between the TMJ OA and control groups showing the variables included in our diagnosis prediction models; (A) Biomolecular features; (B) Radiomics features; (C) Clinical features.
Figure 3(A,C) General association analysis of risk factors. The outer circle shows the AUC, middle circle shows the p-values, and the inner circle shows the q-values for each single feature. (A,C) for 52 features, and 39 interactions, respectively. (B,D) for 52 features and 1326 interactions, respectively. (B,D) The upper graphic shows the AUC, the middle graph shows the p-values, and the lower category shows the q-values for each category of features.
Figure 4Top features with mean contribution (according to feature importance) greater than 80% for 10 times 5-fold CV. (A) Top 13 features in the XGBoost prediction model for 10 times 5-fold CV; (B) Top 7 features the LightGBM prediction model for 10 times 5-fold CV.
Figure 5Top features to diagnose disease status. (A) Boxplots of normalized features; (B) ROC curves of diagnostic sensitivity and specificity for individual features with top mean importance and the mean prediction of XGBoost, LightGBM and their ensemble method IN the 10-times 5-fold CV.
Accuracy, precision, recall, AUROC and F1-score for the methods tested with different hyperparameters evaluated by 10 times 5-fold Cross Validation (mean and standard deviation of the 10 times’ division).
| (η,W,C,S) | Accuracy | Precision.OA | Precision.Control | ||
|---|---|---|---|---|---|
| ( | — | 0.737 (0.025) | 0.760 (0.032) | 0.718 (0.023) | |
| ( | — | 0.763 (0.050) | 0.770 (0.060) | 0.762 (0.053) | |
| ( | (0.001,2,0.7,0.5) | 0.793 (0.032) | 0.793 (0.028) | 0.797 (0.046) | |
| ( | (0.001,1,0.7,0.5) | 0.804 (0.022) | 0.804 (0.020) | 0.808 (0.038) | |
| ( | (0.01, 2, 0.7,0.5) | 0.807 (0.034) | 0.804 (0.029) | 0.812 (0.046) | |
| ( | (0.01, 1, 0.7,0.5) | 0.813 (0.023) | 0.811 (0.022) | 0.817 (0.032) | |
| ( | (0.01,1,0.5,0.5) | 0.814 (0.025) | 0.807 (0.026) | 0.822 (0.028) | |
| ( | (0.01,1,0.7,0.5) | 0.802 (0.039) | 0796 (0.035]) | 0.811 (0.054) | |
| ( | (0.001,1,0.7,0.5) | 0.800 (0.034) | 0.795 (0.033) | 0.807 (0.043) | |
| ( | (0.01,2,0.7,0.5) | 0.805 (0.044) | 0.800 (0.039) | 0.814 (0.058) | |
| ( | (0.01,2,0.7,0.7) | 0.805 (0.044) | 0.800 (0.039) | 0.814 (0.058) | |
| ( | — | 0.795 (0.035) | 0.790 (0.036) | 0.802 (0.042) | |
| ( | |||||
| ( | — | 0.693 (0.030) | 0.780 (0.038) | 0.805 (0.026) | 0.736 (0.025) |
| ( | — | 0.757 (0.072) | 0.770 (0.080) | 0.838 (0.024) | 0.762 (0.051) |
| ( | (0.001,2,0.7,0.5) | 0.796 (0.061) | 0.791 (0.036) | 0.858 (0.025) | 0.793 (0.032) |
| ( | (0.001,1,0.7,0.5) | 0.807 (0.053) | 0.802 (0.030) | 0.861 (0.033) | 0.804 (0.022) |
| ( | (0.01, 2, 0.7,0.5) | 0.811 (0.060) | 0.802 (0.347) | 0.868 (0.031) | 0.806 (0.034) |
| ( | (0.01, 1, 0.7,0.5) | 0.817 (0.040) | 0.809 (0.027) | 0.875 (0.038) | 0.813 (0.023) |
| ( | (0.01,1,0.5,0.5) | 0.826 (0.031) | 0.802 (0.030) | 0.870 (0.029) | 0.814 (0.025) |
| ( | (0.01,1,0.7,0.5) | 0.813 (0.063) | 0.791 (0.040) | 0.859 (0.035) | 0.802 (0.039) |
| ( | (0.001,1,0.7,0.5) | 0.809 (0.050) | 0.791 (0.039) | 0.864 (0.029) | 0.800 (0.034) |
| ( | (0.01,2,0.7,0.5) | 0.815 (0.068) | 0.796 (0.044) | 0.861 (0.035) | 0.805 (0.044) |
| ( | (0.01,2,0.7,0.7) | 0.815 (0.068) | 0.796 (0.044) | 0.861 (0.035) | 0.805 (0.044) |
| ( | — | 0.804 (0.048) | 0.785 (0.044) | 0.795 (0.035) | 0.794 (0.035) |
| ( | |||||
Figure 6Image volume of interested selection to extract radiomics and bone morphometry features.