Literature DB >> 32415284

Osteoarthritis of the Temporomandibular Joint can be diagnosed earlier using biomarkers and machine learning.

Jonas Bianchi1,2, Antônio Carlos de Oliveira Ruellas3, João Roberto Gonçalves4, Beatriz Paniagua5, Juan Carlos Prieto6, Martin Styner6, Tengfei Li7, Hongtu Zhu7, James Sugai8, William Giannobile8, Erika Benavides8, Fabiana Soki8, Marilia Yatabe3, Lawrence Ashman9, David Walker10, Reza Soroushmehr11, Kayvan Najarian11, Lucia Helena Soares Cevidanes3.   

Abstract

After chronic low back pain, Temporomandibular Joint (TMJ) disorders are the second most common musculoskeletal condition affecting 5 to 12% of the population, with an annual health cost estimated at $4 billion. Chronic disability in TMJ osteoarthritis (OA) increases with aging, and the main goal is to diagnosis before morphological degeneration occurs. Here, we address this challenge using advanced data science to capture, process and analyze 52 clinical, biological and high-resolution CBCT (radiomics) markers from TMJ OA patients and controls. We tested the diagnostic performance of four machine learning models: Logistic Regression, Random Forest, LightGBM, XGBoost. Headaches, Range of mouth opening without pain, Energy, Haralick Correlation, Entropy and interactions of TGF-β1 in Saliva and Headaches, VE-cadherin in Serum and Angiogenin in Saliva, VE-cadherin in Saliva and Headaches, PA1 in Saliva and Headaches, PA1 in Saliva and Range of mouth opening without pain; Gender and Muscle Soreness; Short Run Low Grey Level Emphasis and Headaches, Inverse Difference Moment and Trabecular Separation accurately diagnose early stages of this clinical condition. Our results show the XGBoost + LightGBM model with these features and interactions achieves the accuracy of 0.823, AUC 0.870, and F1-score 0.823 to diagnose the TMJ OA status. Thus, we expect to boost future studies into osteoarthritis patient-specific therapeutic interventions, and thereby improve the health of articular joints.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32415284      PMCID: PMC7228972          DOI: 10.1038/s41598-020-64942-0

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Osteoarthritis (OA) affects millions of people worldwide, causing them many years with pain and disability[1]. Trends in the global burden of the disease from 1990 to 2016 show that OA is the second most rapidly rising condition associated with disability, with a 46 percent increase in years lived with disability, just behind diabetes at 52 percent[2]. With the aging population and higher rates of obesity, this burden is expected to rise. The rapid increase in the prevalence of OA will lead to a growing impact and major challenges for health care and public health systems. OA can occur in different joints in the musculoskeletal system, such as the knee, hips, back, hand, and temporomandibular joint (TMJ), having a multifactorial etiology that includes: excessive mechanical stress, hormonal changes, genetics, aging and others[3-5]. The TMJ is a unique model to study early bone changes in OA, as the articular bone surface is covered only by a thin layer of fibrocartilage in the TMJ condyle. Osteoarthritis of the TMJ (TMJ OA) is a multi-system disease, involving numerous pathophysiological processes, and requiring comprehensive assessments to characterize progressive cartilage degradation, subchondral bone remodeling, and chronic pain[4,6-8]. Studies using in vivo OA disease models now benefit from high-resolution cone-beam tomography imaging (HR-CBCT)[9]. HR-CBCT scans allow diagnosis of the bone environment with sub-millimeter resolution comparable to micro-CT, but with a much lower radiation dose[10], and have been widely used by clinicians and researchers[11-14]. As treatments to reverse the chronic damage of TMJ OA are unavailable, early diagnosis may provide the best opportunity to prevent extensive and permanent joint damage. However, current diagnosis is based on pre-existent clinical/imaging signs and symptoms markers using standard protocols recommended for Diagnostic Criteria for Temporomandibular Disorders (DC/TMD), meaning to diagnose TMJ OA degradation of the joint must have already occurred[15,16]. The DC/TMD criteria are based on pre-existent condylar damage, such as subcortical cysts, surface erosions, osteophytes, or generalized sclerosis that are present mainly in the later stages of the disease. Towards an early diagnosis that is predictive of disease status, animal studies indicate that the bone microarchitecture[6,8,17,18] is an important factor in the OA pathogenesis initiation, preceding articular cartilage changes[17,19], and should be investigated in human studies. There is an estimated increase in OA prevalence over the next decades, which reflects in more data acquisition, demanding advances in computational machine learning and data management[20-22]. For this reason, there is a need for precise data mining algorithms, data capture, standardization, management and processing from multiple centers to provide personalized treatment and diagnosis[15,20,22-25]. For disease diagnosis, machine learning approaches have been applied in the medical field[26-29]. Most of the studies have pointed out algorithms and multi-source biomarkers to predict the disease status, such as XGBoost[30], LightGBM[31], deep learning algorithms[32], random forest algorithms[27], etc. The models have been tested with different features, including radiographic and magnetic resonance (MRI) data[33,34], proteomics[28], and clinical information[27] for creating patient-specific prediction models[23]. However, most studies addressed the OA involvement in the knee. For the temporomandibular joint, we found two studies that were done by our group evaluating only the morphological changes in mandibular condyles[35,36]. In addition, most of the literature is focused on multi-center database, or late stages of OA (chronic stages) assessed using routine exams. Here, we addressed surrogate biomarkers such as the radiomics, which can be useful to explore the subchondral bone organization and maybe play a pivotal role in a true early diagnosis of the TMJ OA. We propose novel standardized data representation/processing, statistical learning, and interactive visualization to fully explore biomarker interactions to disease and health. Our data-driven approaches integrate information patterns to provide new insights into the complex etiology of TMJ OA[37]. Data management includes standardized imaging[38], clinical[15] and biomolecular[39] acquisition, and control of patient information from multiple data sources, with standardized demographic for matching OA patients and healthy controls. We have evaluated fifty-two variables to determine the most relevant integrative feature pools using machine-learning algorithms to detect TMJ OA status (Fig. 1). We hypothesize that by combining standardized patient features from multiple sources using statistical machine-learning approaches, we can accurately diagnose TMJ OA status.
Figure 1

The spectrum of Data Science to advance TMJ OA diagnosis includes Data capture and acquisition, Data processing with a web-based data management, Data Analytics involving in-depth statistical analysis, machine learning approaches, and Data communication to help the decision-making support in TMJ OA diagnosis.

The spectrum of Data Science to advance TMJ OA diagnosis includes Data capture and acquisition, Data processing with a web-based data management, Data Analytics involving in-depth statistical analysis, machine learning approaches, and Data communication to help the decision-making support in TMJ OA diagnosis.

Results

Web-based platform to store and compute data analytics of clinical, radiomics and biomolecular markers

Our Data Storage for Computation and Integration (DSCI)[40] web-based system was used for data management with storage and integration of patient information from multiple sources. The DCSI communicates with 3D Slicer[41] platform through the Data Base lnteractor[42] plugin that allows the user to upload the clinical, imaging and biological markers. The data was exported in a.csv file and we show in Tables 1–3, the descriptive statistics for each variable and their respective nomenclature. As most of the variables did not show parametric distribution (after evaluation of the asymmetry, kurtosis and Shapiro-wilk test), the descriptive statistics display the median, mean, standard deviation and upper/lower limits for the 95% confidence interval. The following variables were measured only for the TMJ OA group since the control patients did not present facial and/or TMJ pain: years of pain onset (PainY), current facial pain (PainCur), last six months worst facial pain (PainWor) and last six months average facial pain (PainAve). The TMJ OA and control groups were age and sex matched. We can note that patients with OA present less range of mouth opening, for radiomics and biomolecular variables both present similar values and in Supplementary Fig. 1 we present the statistical differences between them. Finally, the MMP3 protein was not described for saliva, since the expression levels were not detected.
Table 1

Descriptive and demographic values for each clinical variable.

VariablesAbbreviationControl Group (n = 46) Female (39) Male (7)TMJ OA Group (n = 46) Female (39) Male (7)
MedianMean95% CISDMedianMean95% CISD
Clinical VariablesLowerUpperLowerUpper
AgeAge38.5039.8335.8943.7613.2638.0037.6533.9941.3212.34
Years of Pain Onset (years)PainY3.754.343.355.333.34
Facial Current Pain (years)PainCur3.003.072.473.662.00
Facial last 6 months Worst Pain (0 to 10)PainWor7.006.896.067.732.81
Facial last 6 months Average Pain (0 to 10)PainAve4.504.523.875.172.20
Last 6 Months Distressed by Headaches (0 to 10)Headaches0.000.630.330.931.022.001.651.331.971.08
Last 6 Months Distressed by Muscle Soreness (0 to 10)MusSor0.000.370.160.580.711.001.070.741.391.10
Vertical Range Unassisted Without Pain (mm)RangeWOpain44.5044.9142.4247.408.3936.3539.0032.4440.2613.16
Vertical Range Unassisted Max (mm)RangeUnaMax47.5046.8344.6249.037.4145.0044.2841.4147.169.68
Vertical Range Assisted Max (mm)RangeAssMax50.0049.2147.1551.276.9449.0047.5444.6950.409.61

CI: Confidence Interval; SD: Standard Deviation.

Table 3

Descriptive values for each biomolecular variable.

VariablesAbbreviationControl Group (n = 46) Female (39) Male (7)TMJ OA Group (n = 46) Female (39) Male (7)
MedianMean95% CISDMedianMean95% CISD
Biomolecular Variables (pg/ml)LowerUpperLowerUpper
Angiogenin SerumANG_Ser1467.101454.861389.091520.62221.461459.051457.151368.651545.65298.02
BDNF SerumBDNF_Ser280.25719.56378.081061.041149.92286.351121.62544.881698.351942.12
CXCL16 SerumCXCL16_Ser3726.703741.373550.003932.73644.403827.503988.573687.274289.871014.61
ENA-78 SerumENA-78_Ser348.70664.23453.24875.21710.48276.40593.05394.98791.11666.97
MMP3 SerumMMP3_Ser2358.252373.102073.332672.871009.452305.202367.032091.832642.24926.72
MMP7 SerumMMP7_Ser496.55527.66444.90610.42278.69453.75554.15419.32688.98454.02
OPG SerumOPG_Ser2539.153010.922165.323856.522847.492428.103116.792415.063818.512363.01
PAI-1 SerumPAI-1_Ser6505.607930.356486.749373.964861.246693.657237.115904.808569.414486.42
TGF-β1 SerumTGF-β1_Ser91.20140.6898.90182.47140.7099.15177.84103.81251.87249.29
TIMP-1 SerumTIMP-1_Ser7382.157280.327020.937539.71873.477351.657382.747099.457666.03953.95
TRANCE SerumTRANCE_Ser2078.702200.671885.392515.951061.692507.152560.512231.902889.121106.56
VE-cadherin SerumVE-cad_Ser6259.206527.085140.647913.534668.735308.056154.804988.127321.483928.70
VEGF SerumVEGF_Ser93.90115.3276.18154.46131.8087.30117.4085.32149.47108.01
Angiogenin SalivaANG_Sal721.85720.83652.98788.69228.48754.05758.02702.29813.74187.65
BDNF SalivaBDNF_ Sal5.207.605.1310.088.343.958.324.0312.6214.47
CXCL16 SalivaCXCL16_Sal109.40183.94121.59246.29209.95100.60207.09130.24283.95258.80
ENA-78 SalivaENA-78_Sal2424.602218.021925.142510.90986.252482.602410.402087.412733.391087.63
MMP7 SalivaMMP7_Sal3290.903615.282949.234281.332242.873594.203666.793040.774292.822108.08
OPG SalivaOPG_Sal555.75855.69560.071151.32995.49732.301329.53754.681904.381935.77
PAI-1 SalivaPAI-1_Sal24.4085.0245.07124.96134.5140.4093.1432.77153.52203.32
TGF-β1 SalivaTGF-β1_Sal40.7064.6641.8687.4676.7854.4569.2046.6491.7675.97
TIMP-1 SalivaTIMP-1_Sal4070.903963.813663.144264.481012.493889.803880.693664.714096.67727.30
TRANCE SalivaTRANCE_Sal627.60794.52533.491055.56879.02698.751041.55622.451460.641411.27
VE-cadherin SalivaVE-cad_Sal643.101008.24657.651358.841180.60666.701313.82585.332042.302453.11
VEGF SalivaVEGF_Sal1181.351342.621125.231560.02732.071441.651419.391281.991556.79462.68

CI: Confidence Interval; SD: Standard Deviation.

Descriptive and demographic values for each clinical variable. CI: Confidence Interval; SD: Standard Deviation. Descriptive values for each imaging variable. CI: Confidence Interval; SD: Standard Deviation. Descriptive values for each biomolecular variable. CI: Confidence Interval; SD: Standard Deviation.

Radiomic features differentiate control subjects and TMJ OA patients

We used a non-invasive protocol validated by our group to detect the initial morphological changes in the mandibular condyle trabecular bone based on radiomics information[10]. We extracted 20 imaging features (GLCM, GLRLM and bone morphometry described in Table 2 [12,38]. These radiomic features were tested using the Mann-Whitney U test (Fig. 2B, Supplementary Fig. 1B and Supplementary Data 1) for group comparisons. From the 20 variables, 13 showed statistically significant differences between the disease and control groups. These findings suggest, and corroborate the literature, that the trabecular bone has an important role in the TMJ OA pathogenesis[6,17,43].
Table 2

Descriptive values for each imaging variable.

VariablesAbbreviationControl Group (n = 46) Female (39) Male (7)TMJ OA Group (n = 46) Female (39) Male (7)
MedianMean95% CISDMedianMean95% CISD
Radiomics VariablesLowerUpperLowerUpper
EnergyEnergy0.300.310.280.330.070.250.250.230.270.07
EntropyEntropy2.292.302.202.400.332.562.622.492.750.42
Inverse Difference MomentInvDifMom0.910.900.900.910.020.890.890.890.900.02
InertiaInertia0.190.190.190.200.030.210.210.200.220.03
Haralick CorrelationHarCor317.48375.56303.63447.49242.23410.36603.40467.71739.10456.94
Short Run EmphasisShortRE0.330.340.350.340.030.350.350.340.360.03
Long Run EmphasisLongRE16.5816.5116.0117.011.6815.4415.6415.1016.181.81
Grey Level Non UniformityGreyLNU2405.842374.262272.672475.84342.082240.612249.652158.632340.67306.50
Run Length Non UniformityRunLNU1239.231287.961209.651366.27263.721443.881459.221367.191551.25309.91
Low Grey Level Run EmphasisLowGLRE0.060.060.060.060.010.060.060.050.060.01
High Grey Level Run EmphasisHighGLRE19.1019.9818.7621.194.0921.0522.4720.9523.985.11
Short Run Low Grey Level EmphasisShortRLowGLE0.020.020.020.020.000.020.020.020.020.00
Short Run High Grey Level EmphasisShortRHighGLE6.967.256.727.771.778.158.567.899.242.27
Long Run Low Grey Level EmphasisLongRLowGLE1.051.050.981.110.230.950.950.871.030.28
Long Run High Grey Level EmphasisLongRHighGLE299.09303.82283.69323.9567.80309.96317.43298.88335.9762.45
Bone Volume (%)BV/TV0.540.540.480.600.200.600.580.520.640.20
Trabecular Thickness (mm)Tb.Th0.350.380.330.430.160.410.440.380.500.19
Trabecular Separation (mm)Tb.Sp0.280.340.270.400.210.260.350.250.440.31
Trabecular Number (mm−1)Tb.N1.471.441.381.510.231.451.361.281.440.28
Bone Surface to Bone Volume Ratio (mm−1)BS/BV5.796.085.436.732.184.895.304.655.952.18

CI: Confidence Interval; SD: Standard Deviation.

Figure 2

Mann-Whitney U test comparison between the TMJ OA and control groups showing the variables included in our diagnosis prediction models; (A) Biomolecular features; (B) Radiomics features; (C) Clinical features.

Mann-Whitney U test comparison between the TMJ OA and control groups showing the variables included in our diagnosis prediction models; (A) Biomolecular features; (B) Radiomics features; (C) Clinical features.

Control and TMJ OA patients present similar expression levels of selected serum and saliva protein biomarkers

We selected proteins that have previously been detected in the TMJ synovial fluid of OA patients[39]. We collected saliva and serum using less invasive procedures and promising screening diagnostic tools[44] to evaluate the diagnostic performance of each protein and their interactions with radiomics and clinical markers. To analyze each protein’s expression level, we used a customized human protein micro-array kit (RayBiotech, Norcross, GA) with duplicate samples for each patient, controlling the standard curve and limiting expression as can be seen in Supplementary Fig. 2. In both the saliva and serum samples, the levels of proteins did not differ, and large data distribution variability was observed, as described in Table 3. We show in Fig. 2A and Supplementary Data 2, the Mann-Whitney U-test results for comparison between the TMJ OA and control groups. Even though our results showed no differences between our groups, the next sections detail the contribution and diagnostic performance of those proteins to diagnose TMJ OA status

Clinical features differentiate control subjects and TMJ OA patients

We present the Mann-Whitney U test in Supplementary Fig. 1c and Supplementary Data 3 for comparison of both groups for the following clinical variables: RangeAssMax, MusSor, RangeWOpain, Headaches and RangeUnaMax, defined in Table 1. We chose these features because they were measured in both groups and are part of the “Diagnostic Criteria for Temporomandibular Disorders (DC/TMD) for Clinical and Research Applications: Recommendations of the International RDC/TMD Consortium Network and Orofacial Pain Special Interest Group”[15]. Our results show that only RangeAssMax and RangeUnaMax were not statistically significantly different between the TMJ OA and control groups. The clinical features that presented statistically significant differences were correlated with pain or limited by pain. For example, for the maximum opening without pain (RangeWOpain), patients were instructed to open their mouths until they start to feel pain within their TMJs. This approach reduces the amount of opening for the TMJ OA patients that often present pain during opening; however, when the patients were asked to open the mouth as much as they could even with pain (i.e., RangeUnaMAx) the values were not statistically significant between the groups. In Fig. 2C, we display only the variables that were included in our diagnostic models, described in the following sections.

Diagnostic performance to predict TMJ OA status

We had 55 features shown in Tables 1–3 plus gender; however, 4 clinical features are expressed only in OA patients, and for this reason, were not included in the next analysis, resulting in a total of 52. In Fig. 3A, we show the values for the Area Under the curve (AUC), p-value, and q-value for 52 features. Figure 3B shows the AUC (upper), p-value (medium) and q-value (lower) for each category of variables (biological, clinical and radiomics). Most of the features with significant AUC values are clinical or radiomics; no biomolecular feature is detected with AUC > 0.65; nevertheless, it is shown that the interaction of biomolecular features can attain an AUC of 0.74 (Fig. 3C) and have a large contribution in prediction of TMJ OA status(Figs. 3B and 4).
Figure 3

(A,C) General association analysis of risk factors. The outer circle shows the AUC, middle circle shows the p-values, and the inner circle shows the q-values for each single feature. (A,C) for 52 features, and 39 interactions, respectively. (B,D) for 52 features and 1326 interactions, respectively. (B,D) The upper graphic shows the AUC, the middle graph shows the p-values, and the lower category shows the q-values for each category of features.

Figure 4

Top features with mean contribution (according to feature importance) greater than 80% for 10 times 5-fold CV. (A) Top 13 features in the XGBoost prediction model for 10 times 5-fold CV; (B) Top 7 features the LightGBM prediction model for 10 times 5-fold CV.

(A,C) General association analysis of risk factors. The outer circle shows the AUC, middle circle shows the p-values, and the inner circle shows the q-values for each single feature. (A,C) for 52 features, and 39 interactions, respectively. (B,D) for 52 features and 1326 interactions, respectively. (B,D) The upper graphic shows the AUC, the middle graph shows the p-values, and the lower category shows the q-values for each category of features. Top features with mean contribution (according to feature importance) greater than 80% for 10 times 5-fold CV. (A) Top 13 features in the XGBoost prediction model for 10 times 5-fold CV; (B) Top 7 features the LightGBM prediction model for 10 times 5-fold CV.

Contributions assessment of top features and interactions

Top features were filtered with {AUC > 0.7} calculated from the training subjects and then fed into an XGBoost[30]/LightGBM[31] model to make diagnostic predictions. More details can be found in the Method section. We demonstrate (Fig. 4) the contributions of the top (>80% contribution in sum) features selected from the 52 features with mutual interactions, according to feature importance of XGBoost (Fig. 4A) and LightGBM (Fig. 4B) prediction models, using10 times 5-fold cross-validation,. We find that 13 features using the XGBoost model have a mean contribution larger than 80%, while for LightGBM a subset including 7 features contributes >80% information.

Diagnosis of TMJ OA status based on the top features and interactions

After the selection of the best features and interactions (Fig. 4), Fig. 5A displays the boxplots for comparison between OA and control groups. Figure 5B is showing the ROC curves of diagnostic sensitivity and specificity for individual features with top mean importance and the mean prediction of XGBoost, LightGBM and their ensemble method IN the 10-times 5-fold CV. The accuracy, precision, recall, AUC and F1-score of five described methods (), (), (), (), (), () and () (see method section) are shown in Table 4 using 10 times’ 5-fold cross-validation. Even though the main effect of biomolecular features is low, their interaction effects with clinical and radiomics features are important in diagnosis. For the 10 times’ 5-fold CV, the XGBoost + LightGBM with the interaction features achieves the best average accuracy of 0.823, AUC 0.870, and F1-score 0.823.
Figure 5

Top features to diagnose disease status. (A) Boxplots of normalized features; (B) ROC curves of diagnostic sensitivity and specificity for individual features with top mean importance and the mean prediction of XGBoost, LightGBM and their ensemble method IN the 10-times 5-fold CV.

Table 4

Accuracy, precision, recall, AUROC and F1-score for the methods tested with different hyperparameters evaluated by 10 times 5-fold Cross Validation (mean and standard deviation of the 10 times’ division).

(η,W,C,S)AccuracyPrecision.OAPrecision.Control
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{1},\,{{\mathscr{P}}}_{1}$$\end{document}F1,P1)0.737 (0.025)0.760 (0.032)0.718 (0.023)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{1}$$\end{document}F2,P1)0.763 (0.050)0.770 (0.060)0.762 (0.053)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.001,2,0.7,0.5)0.793 (0.032)0.793 (0.028)0.797 (0.046)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.001,1,0.7,0.5)0.804 (0.022)0.804 (0.020)0.808 (0.038)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.01, 2, 0.7,0.5)0.807 (0.034)0.804 (0.029)0.812 (0.046)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.01, 1, 0.7,0.5)0.813 (0.023)0.811 (0.022)0.817 (0.032)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.01,1,0.5,0.5)0.814 (0.025)0.807 (0.026)0.822 (0.028)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{3}$$\end{document}F2,P3)(0.01,1,0.7,0.5)0.802 (0.039)0796 (0.035])0.811 (0.054)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{3}$$\end{document}F2,P3)(0.001,1,0.7,0.5)0.800 (0.034)0.795 (0.033)0.807 (0.043)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{3}$$\end{document}F2,P3)(0.01,2,0.7,0.5)0.805 (0.044)0.800 (0.039)0.814 (0.058)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{3}$$\end{document}F2,P3)(0.01,2,0.7,0.7)0.805 (0.044)0.800 (0.039)0.814 (0.058)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{4}$$\end{document}F2,P4)0.795 (0.035)0.790 (0.036)0.802 (0.042)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}+{{\mathscr{P}}}_{3}$$\end{document}F2,P2+P3)(0.01,1,0.5,0.5)+ (0.01,2,0.7,0.5)0.823 (0.029)0.815 (0.033)0.833 (0.035)
(η,W,C,S)Recall.OARecall.ControlAUCMean.F1.Score
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{1},\,{{\mathscr{P}}}_{1}$$\end{document}F1,P1)0.693 (0.030)0.780 (0.038)0.805 (0.026)0.736 (0.025)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{1}$$\end{document}F2,P1)0.757 (0.072)0.770 (0.080)0.838 (0.024)0.762 (0.051)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.001,2,0.7,0.5)0.796 (0.061)0.791 (0.036)0.858 (0.025)0.793 (0.032)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.001,1,0.7,0.5)0.807 (0.053)0.802 (0.030)0.861 (0.033)0.804 (0.022)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.01, 2, 0.7,0.5)0.811 (0.060)0.802 (0.347)0.868 (0.031)0.806 (0.034)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.01, 1, 0.7,0.5)0.817 (0.040)0.809 (0.027)0.875 (0.038)0.813 (0.023)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}$$\end{document}F2,P2)(0.01,1,0.5,0.5)0.826 (0.031)0.802 (0.030)0.870 (0.029)0.814 (0.025)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{3}$$\end{document}F2,P3)(0.01,1,0.7,0.5)0.813 (0.063)0.791 (0.040)0.859 (0.035)0.802 (0.039)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{3}$$\end{document}F2,P3)(0.001,1,0.7,0.5)0.809 (0.050)0.791 (0.039)0.864 (0.029)0.800 (0.034)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{3}$$\end{document}F2,P3)(0.01,2,0.7,0.5)0.815 (0.068)0.796 (0.044)0.861 (0.035)0.805 (0.044)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{3}$$\end{document}F2,P3)(0.01,2,0.7,0.7)0.815 (0.068)0.796 (0.044)0.861 (0.035)0.805 (0.044)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{4}$$\end{document}F2,P4)0.804 (0.048)0.785 (0.044)0.795 (0.035)0.794 (0.035)
(\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\mathscr{F}}}_{2},\,{{\mathscr{P}}}_{2}+{{\mathscr{P}}}_{3}$$\end{document}F2,P2+P3)

(0.01,1,0.5,0.5)+

(0.01,2,0.7,0.5)

0.837 (0.044)0.809 (0.043)0.870 (0.033)0.823 (0.029)
Top features to diagnose disease status. (A) Boxplots of normalized features; (B) ROC curves of diagnostic sensitivity and specificity for individual features with top mean importance and the mean prediction of XGBoost, LightGBM and their ensemble method IN the 10-times 5-fold CV. Accuracy, precision, recall, AUROC and F1-score for the methods tested with different hyperparameters evaluated by 10 times 5-fold Cross Validation (mean and standard deviation of the 10 times’ division). (0.01,1,0.5,0.5)+ (0.01,2,0.7,0.5)

Cross-validation to control for overfitting

In order to select risk factors from the high-dimensional 52 features plus 1326 interactions, the control of overfitting is necessary. To take advantage of a larger training sample size to fulfill this aim, we use the 10 times’ 5-fold cross-validation and give evaluation and comparison using the average performance of different approaches on validation subjects. Each time in the 10 times’ 5-fold CV, we select hyperparameters, i.e., the iteration steps, by further splitting the training subjects for 10-fold (or 5-fold) cross-validation.

Discussion

We report here, the diagnostic performance of machine learning approaches to predict TMJ osteoarthritis status. Through data acquisition, management and processing, we achieve one of the main challenges of healthcare delivery, which is to integrate the patient data information from multiple sources for accurate diagnosis and meaningful indicators of individual health[45]. To obtain patient-specific, precise diagnostic information, Data Science has become indispensable in medicine, with the integration of data capture, management/processing and in-depth analysis with rigorous and standardized protocols[46,47]. In the carefully controlled data acquisition methods of our study, all subjects had the same imaging acquisition protocol, all clinical assessments were performed by the same pain specialist, and a single investigator collected the clinical, biological and imaging data. This study database was composed initially of 107 subjects, and 15 subjects were excluded due to incomplete or inadequate quality of data. As part of the data management and processing, we extracted radiomics information from each subject’s HR-CBCT scan (Fig. 6) to obtain information that is hidden to the clinicians’ naked eyes[48]. Our result in Supplementary Fig. 1 shows that most radiomics features were able to differentiate between control and TMJ OA patients. Advanced statistical learning approaches, shown in Fig. 4, demonstrate among the radiomics features, that Entropy, Energy, HarCor, are included in our most accurate prediction models and corroborate our previous findings that found correlations between these features and the bone status[38], where a decreased energy was associated with bone sclerosis/loss, and the increased values for HarCor and Entropy was correlated to bone sclerosis/loss. For the knee OA diagnosis, Brahim et al.[26], (2019) using imaging features, evaluated the performance of machine learning algorithms to detect the disease, with x-rays radiographic from the public dataset Osteoarthritis Initiative (OAI). Their results showed an OA detection with 82.98% of accuracy using Random Forest and Naive Bayes classifiers. Even though the results were good, a standardization of the images was necessary to classify the images from multi-centers correctly. Here, we addressed this challenge, using a rigorous protocol for the imaging markers; however, our sample size (n = 46 per group) in comparison with Brahim et al.[26] (n = 516 per group) was not enough to get a good accuracy only using the radiomics, where we found an AUC of 0.70 approximately (Energy, Entropy, Haralick Correlation, Inverse Difference Moment and Trabecular Separation). For this reason, we also included clinical features and biomolecular information that improved our diagnostic model performance. Also, we have found in the literature only two papers showing the application of machine learning approaches in the temporomandibular joint osteoarthritis diagnosis, and both were focused on developing a model only for bone surface classification[35,36].
Figure 6

Image volume of interested selection to extract radiomics and bone morphometry features.

Image volume of interested selection to extract radiomics and bone morphometry features. All clinical data were obtained using the DC/TMD[15] criteria. We have chosen features measured in both groups, as described in Fig. 3, where headaches had the highest AUC among all the features. This marker is highly correlated to temporomandibular disorders in the literature[49], and now our study shows its improved diagnostic performance to predict OA status in conjunction with other features and machine learning approaches. Interestingly, the interaction of headaches with TGF-β1_Sal together was one of the top features with >80% mean contribution to information gain in our statistical learning models. A possible clinical explanation is that patients with headaches had increased levels of this protein, as previous studies have indicated the expression and correlation of this cytokine with mandibular bone degradation in TMJ OA patients[50-53]. Other clinical markers included in our disease model were: RangeWOpain and its interaction with PAI-1_Sal. As TMJ OA patients present with pain in their TMJs, the decreased amount of mouth opening without pain was an important disease sign with an AUC of 0.70; and its interaction with PAI-1_Sal was an exciting finding, as this feature was increased in the OA patients (Fig. 4). PAI-1 is a serine protease inhibitor of tissue plasminogen activator and prevents the formation of plasmin; in OA, PAI-1 has a role in the cascade of enzymatic activities, compromising repair and increasing the cartilage degradation[19,54]. A recent study by our group showed that PAI-1 is correlated with areas of flattening in the lateral surface of mandibular condyles with OA[39], corroborating the results of this study. For the demographic’s aspect, Gender and its interaction with MusSor was another significant feature included in our statistical prediction models. However, as TMJ OA prevalence is higher in women[55], a limitation of this study is the unequal number of male and female subjects; out of the 46 subjects in each group (sample size n = 92), 39 were females and only 7 males. The role of MusSor in this interaction may be due to differences in pain sensitivity in women and men[56], and to the central sensitization caused by painful osteoarthritis as patients with temporomandibular disorders also presented higher prevalence of muscle-related symptoms[57]. In our study, the variables and/or interactions of: PAI-1_Sal*Headaches (AUC: 0.697), RangeWOpain (AUC: 0.7), Headaches (AUC: 0.778), TGF-β1_Sal*Headaches (AUC: 0.707), Gender*MusSor (AUC: 0.716), VE-cad_Sal*Headaches (AUC: 0.698), Headaches*ShortRLowGLE (AUC: 0.692)), PAI-1_Sal*RangeWOpain (AUC: 0.701) were the clinical features presenting >80% mean contribution to information gain, included in LightGBM and/or XGBoost prediction models. A study from 2013[58] that also investigated clinical markers (age, sex, weight, height), and genes (FAS844, FAS670, FASL377, FASL124) with support vector machines and probabilistic neural networks found a high classification performance, showing an AUC of approximately 0.95. This study indicates that our findings may be improved with the addition of genetics information. However, besides the challenges involving genetic studies, here, we used relatively simple approaches to assess our features, that could be applied to the clinical practice. For the biomolecular markers, no differences between OA and control subjects (Supplementary Fig. 2A) were found; however, our prediction models show that the interaction between VE-cad_Ser*ANG_Sal (AUC: 0.74), PAI-1_Sal*Headaches, TGF-β1_Sal*Headaches (AUC: 0.707), VE-cad_Sal*Headaches (AUC: 0.698), PAI-1_Sal*RangeWOpain (AUC: 0.701), TGF-β1_Sal*Headaches and PAI-1_Sal*RangeWOpain (AUC: 0.701) are top features with mean>80% contribution to the information gain in the XGBoost and LightGBM predictive models. As markers of inflammation, VE-cad, ANG, TGF-β1 and PAI-1 have been previously shown[39] to be expressed in the TMJ synovial fluid and plasma and to be correlated with the condylar morphology in OA patients. It should be highlighted that in the present study, those markers were obtained from saliva and blood samples utilizing less invasive procedures for the patient, and circulating levels of pro-inflammatory proteins have been shown to contribute to the pathophysiology of disorders of the TMJ[59]. In addition, saliva has been described as a promising, accurate and non-invasive tool for a reliable diagnosis[44]. A study from Kellesarian et al.[60], showed the cytokines profile in the synovial fluid of patients with TMJ disorders, and the most proteins that we used in our study were listed in their systematic review. For the knees OA, Hear et al.[28], (2014) also evaluated the performance of machine learning algorithms using artificial neural network (TreeBagger decision tree) and cytokines from serum as features to classify patients with TMJ OA, rheumatoid arthritis and normal. Interestingly, the authors found only 12 statistically significant differences in the cytokines from the 38 studied, and the patients were classified based on late symptoms. The CNN showed the high performance to classify the patients (sensitivity and specify higher than 90%), and the author suggests that a combination of the cytokines is more critical for the classification than the individual levels, going towards to the findings of our present study. In a diagnostic perspective, Ahmed et al.[61] aimed to assess the early stages of knee OA. They used random forest, stepwise generalized linear model, generalized linear models with elastic net as classifiers and proteins from plasma, synovial fluid and serum as features, using leave one out cross-validation and k-fold cross-validation to test the model’s performance. The subjects were classified as early OA, control, early arthritis rheumatoid, and non-rheumatoid arthritis (other inflammatory arthritis). As results, the highest F1 score was for eOA diagnosis (0.78) and the lowest for non-RA (0.36) with GLMNET algorithm. In our study, we had a similar sample size and goal (detection of OA in an early stage), and we found a slightly better F1 score, close to 0.83. A recent study, in 2017[27], evaluated machine learning approaches for the identification of new biomarkers for knee osteoarthritis diagnosis. The baseline number of variables for each participant consisted of 186 features, including questionnaires (demographics, anamneses, pain, nutrition, etc.), radiography, magnetic resonance scores, physical/clinical examinations, biomarkers from serum, and urine, etc. The algorithm used was RGIFE (random forest algorithm), and for testing the model, they selected 10-fold cross-validation. The authors found five good prediction models, including different subsets of features in each. The overall discrimination of knee OA among the patients was considered good with good (AUC between 0.80 and 0.90). Each prediction model presented multi-source biomarkers such as clinical information, imaging-based information, pain, food questionnaires, and molecular markers. These results suggest that the disease has a complexity etiology, corroborating with our findings and confirming that there is a need to investigate the association of clinical, imaging, and proteins to better categorized this complex disease. Nowadays, the complex, high-dimensional, and biomedical data from multiple sources benefit from data science, computational advances, and machine learning approaches to improve knowledge in terms of diagnosis, disease classification, clustering data, and disease progression prediction[32,62-64]. For osteoarthritis, studies using mathematical algorithms for diagnosis and personalized treatment decisions are increasing[65]. We have previously shown the diagnostic performance to predict the disease status based on the condylar surface morphology and deep learning approaches[35,40], and now we show an integrative approach based on clinical, imaging radiomics and biomolecular patient-specific data. A limitation of this study is that the cross-sectional study design does not allow assessment of the disease progression and how different disease stages affect the proposed biomarkers. Other studies that assessed early stages of the knee OA using machine learning approaches[33], used subjective radiologic interpretation of 2D x-rays, rather than the high-resolution 3D images included in the present study. Here, with the use of the radiomics, clinical, and protein information, our predictive model with XGBoost + LightGBM and 1378 features/interactions showed an accuracy of 0.823, AUC 0.870, and F1-score 0.823 to determine disease status. Future studies using the proposed machine learning models and longitudinal data will provide better information on the feature’s behavior and disease progression. In conclusion, our in-depth statistical learning analysis was based on the integration and interactions of 52 features. We screened the diagnostic performance of each feature (Figs. 3–5) and built our machine learning models based on the most relevant features. Our final prediction model had an accuracy of 0.823 (SD: 0.029) to predict TMJ OA status using LightGBM + XGBoost with 1378 features interactions. Importantly, we show a comprehensive integration of new tools, data acquisition, management, and approaches to improve articular joint health and predict patient-specific TMJ OA status.

Methods

We followed the “Strengthening the Reporting of Observational studies in Epidemiology” (STROBE) guidelines for observational studies[66]. All experiments were performed in accordance with the guidelines and regulations approved by the Institutional Review Board approval (HUM00105204 and HUM00113199) from the University of Michigan and the informed consent was obtained from all participants.

Study design, setting and participants

After the Institutional Review Board approval (HUM00105204 and HUM00113199) from the University of Michigan, we enrolled patients and subjects from January 2016 to December 2018 that composed our TMJ OA and Control groups, respectively. This cross-sectional study sample was composed of 92 patients, 46 TMJ OA and 46 age and sex-matched control subjects who were selected based on rigorous inclusion criteria. The general health conditions of the participants included: age between 21–70 years old, no history of cancer, no history of jaw joint trauma, no previous surgery in the TMJ or recent jaw joint injections, absence of systemic diseases; no current pregnancy and no congenital bone or cartilage disease. All patients were examined by a single temporomandibular disorders specialist at the Hospital of the University of Michigan (Medicine Oral Surgery Clinic) through the Diagnostic Criteria for Temporomandibular Disorders (DC/TMD)[15] for TMJ osteoarthritis diagnosis. The patients were diagnosed as early stages of TMJ osteoarthritis when they presented: pain in at least one TMJ for less than 10 years, TMJ noise during movement or function in the last 30 days and crepitus detected during mandibular excursive movements. The Control group subjects were recruited by advertisement and evaluated for the absence of TMJ OA clinical and radiographic signs and symptoms. The diagnosis for the TMJ OA group and side of choice (left or right) was confirmed utilizing the radiographic criteria[16], including initial stages of subchondral cyst, erosion, generalized sclerosis and/or osteophytes. For the matching control condyle, the side of choice was the one without any clinical or radiographic findings. The exclusion criteria for the TMJ OA group were patients with middle to chronic TMJ OA diagnosis, evaluated when they present more than 10 years of TMJ pain diagnosis and/or severe stages of bone destruction, subchondral cyst, erosion and generalized sclerosis evaluated using the hr-CBCT by a radiologist.

Variables

Our study was composed by 3 main sub-groups of variables, which were: biomolecular features (composed by proteins of serum and saliva), imaging features (composed by trabecular bone radiomics and morphometry) and clinical features.

Biomolecular data

We evaluated 14 proteins in serum and saliva associated with arthritis initiation and progression, such as nociception, inflammation, angiogenesis and bone resorption, which were: 6ckine, Angiogenin, BDNF, CXCL16, ENA-78, MMP-3, MMP-7, OPG, PAI-1, TGFb1, TIMP-1, TRANCE, VE-Cadherin and VEGF. However, the expression of 6ckine was not expressed in the serum and saliva samples in this study, and MMP-3 was not expressed in saliva. The raw data can be seen in the Supplementary Fig. 2. The reason to select those proteins, besides their participation in the TMJ OA inflammation process[60], was due to our previous studies that detected these markers in the TMJ synovial fluid and saliva of OA patients, showing correlations with bone surface changes[35,39].

Blood and saliva acquisition protocol

The participants had 5 ml of venous blood collected by a trained nurse at the University of Michigan. The blood was centrifuged for 20 minutes at 1000 RPM to separate only the serum that was then aliquoted in 2 ml Eppendorf tubes and stored at −80C. For the saliva collection, the participants received a 14 ml sterile test tube with a funnel inserted; they were instructed to tilt their head forward and drip the saliva off into the tube until 2 ml was collected. They were informed to not spit, talk, or swallow during this process[67].

Custom micro-array

Custom human quantibody protein microarrays obtained from RayBiotech, Inc. Norcross, GA, was used to quantitatively assess the saliva and serum samples for the 14 specific biomarkers. Each participant had duplicates run for the saliva and serum samples (detailed description provided by Jiang et al.[68] and Huang et al.[69]). Supplementary Figures 2, 3 shows the raw values obtained for each participant and the standard curves for each protein.

Clinical signs and symptoms acquisition protocol

The same investigator collected and measured the clinical signs and symptoms of the participants based on the DC/TMD[15] criteria. The variables measured and selected for further statistical analysis were: Age pain began in years - TMJ OA Group only, Current Facial Pain -TMJ OA Group only, Worst Facial Pain in last 6 months -TMJ OA Group only, Average Pain -TMJ OA Group only, Last 6 Months Distressed by Headaches, Last 6 Months Distressed by Muscle Soreness, Vertical Range Unassisted Without Pain (mm), Vertical Range Unassisted Maximum (mm), Vertical Range Assisted Maximum (mm).

Imaging data acquisition

We acquired cone-beam computed tomography scans of each subject using the 3D Accuitomo (J. Morita MFG. CORP Tokyo, Japan) machine at the University of Michigan, School of Dentistry. The protocol for the temporomandibular joint high-resolution CBCT was field of view 40 × 40 mm; 90 kVp, 5 mAs, scanning time of 30.8 s and a voxel size of 0.08 mm[3]. The images were exported in DICOM (.dcm) using the manufacture software: i-Dixel (J. Morita MFG. CORP Tokyo, Japan) and optimization manufacture filter: G_103 + H_009. Finally, the images were coded and de-identified to avoid investigator bias in the statistical analysis.

Imaging trabecular texture-based features

We previously described the optimal parameters to extract radiomics features from the HR-CBCT scans in our study conditions and we followed these parameters to extract the information from our imaging data, using the BoneTexture module[38]. The region analyzed was the internal condylar lateral region (Fig. 6) due to our pilot results that showed this region to be the most significantly different between Control and TMJ OA patients. The textural information evaluated were: Energy, Entropy, Inverse Difference Moment, Inertia, Haralick Correlation, Short Run Emphasis, Long Run Emphasis, Grey Level Non Uniformity, Run Length Non Uniformity, Low Grey Level Run Emphasis, High Grey Level Run Emphasis, Short Run Low Grey Level Emphasis, Short Run High Grey Level Emphasis, Long Run Low Grey Level Emphasis, Long Run High Grey Level Emphasis, Bone Volume, Trabecular Thickness, Trabecular Separation, Trabecular Number and Bone Surface to Bone Volume Ratio.

Exploratory tests

We first did a traditional statistical analysis to explore our data and to test the hypothesis that there is no difference between our groups. Our data does not show normality distribution and for this reason, we chose non-parametrical tests for our analysis. The descriptive analysis, Mann-Whitney U test was done using the software GraphPad Prisma V 8.11 (GraphPad Software, Inc., San Diego, CA). For the descriptive analysis, we showed the median in addition to the mean, the 95% confidence intervals and the standard deviation. The Mann-Whitney U test was used to test our hypothesis and we used a two-tailed test with α of 5%.

Machine learning approaches

We diagnose the OA/control disease status based on the 52 features including five clinical variables, 20 radiomics features, 25 biomolecular features (13 from serum and 12 from saliva) and two demographic variables (age and gender). First, we normalized all features to have zero mean and one standard deviation. Next, we calculated the AUROC (Area under the Receiver Operating Characteristic curve), p-value and q-value[70] from a two-sample Mann-Whitney U test to evaluate the significance of each feature (Fig. 3). Afterward, we compared four different prediction methods, each of which follows the four steps: (I) Cross-validation to avoid overfitting (II) feature selection (III) risk prediction (IV) method evaluation. We used one-sided paired DeLong test[71,72] to validate the significance of AUC comparison between different approaches.

Cross-validation (CV)

We applied the 10 times’ 5-fold CV by taking 4 folds as training and the remaining one-fold as validation with 10 times’ repetition. At each time, we normalized the original 52 features denoted as F1 based on the training subjects and then took the product between each pair of them to generate additional 1326 interactions and denoted the set of 1378 features as . We performed the following two-step procedures by using only the training dataset and feature pools and , respectively, where represents the set of original 52 features, and we took the product between each pair of to generate an additional 1326 interactions and denoted the set of 1378 features as . Afterwards, we applied the 10 times’ 5-fold CV by taking 4 folds as training and the remaining one-fold as validation with 10 times’ repetition. This will further evaluate the sensitivity of the model.

Feature selection

We calculate the AUC for each single feature in and select top features according to {f ∈ F1|AUC of f > 0.7} and {f ∈ F2|AUC of f > 0.7} for feature pools met F1 and F2, respectively.

Evaluation and risk prediction

We trained the logistic regression model (method P1), Extreme Gradient Boosting (XGBoost; method )[30], Light Gradient Boosting Machine (LightGBM; method )[31], and Random Forest (method )[73] model by using the extracted features from the last step for risk prediction of the validation subject. For both XGBoost and LightGBM models, we fix the depth D = 1, and tune the iteration steps by further splitting the training subjects into training and validation subjects for 10-fold cross validation, where AUC is chosen as the evaluation criterion. We evaluate the prediction performance of six pairs of feature set and methods (), (), (), (), () and () by using the accuracy, precision, recall, AUROC and -score[74] on the 10 times 5-fold validation subjects. We also compare the results with other different hyperparameters. For example, we show in Table 4 the results for min_child_weight W∈{1,2}, colsample_bytree C∈{0.5,0.7}, subsample S∈ {0.5,0.7} and the learning rate η∈{0.001,0.01}. Our results showed that the XGBoost and LightGBM model by averaging the prediction probability () has the best performance on the validation subjects in the 10 times 5-fold CV; here the combination of XGBoost and LightGBM is recommended for its robustness in 10 times’ 5-fold CV. Supplementary information. Supplementary information2. Supplementary information3. Supplementary information4. Supplementary information5.
  42 in total

Review 1.  Current understanding of pathogenesis and treatment of TMJ osteoarthritis.

Authors:  X D Wang; J N Zhang; Y H Gan; Y H Zhou
Journal:  J Dent Res       Date:  2015-03-05       Impact factor: 6.116

2.  Quantitative Assessment of Variation in CT Parameters on Texture Features: Pilot Study Using a Nonanatomic Phantom.

Authors:  K Buch; B Li; M M Qureshi; H Kuno; S W Anderson; O Sakai
Journal:  AJNR Am J Neuroradiol       Date:  2017-03-24       Impact factor: 3.825

3.  Role of subchondral bone during early-stage experimental TMJ osteoarthritis.

Authors:  M Embree; M Ono; T Kilts; D Walker; J Langguth; J Mao; Y Bi; J L Barth; M Young
Journal:  J Dent Res       Date:  2011-09-13       Impact factor: 6.116

4.  Subchondral bone loss following orthodontically induced cartilage degradation in the mandibular condyles of rats.

Authors:  Kai Jiao; Li-Na Niu; Mei-Qing Wang; Juan Dai; Shi-Bin Yu; Xiao-Dong Liu; Jun Wang
Journal:  Bone       Date:  2010-09-17       Impact factor: 4.398

5.  Altered functional loading causes differential effects in the subchondral bone and condylar cartilage in the temporomandibular joint from young mice.

Authors:  J Chen; K P Sorensen; T Gupta; T Kilts; M Young; S Wadhwa
Journal:  Osteoarthritis Cartilage       Date:  2008-09-11       Impact factor: 6.576

Review 6.  Primary osteoarthritis no longer primary: three subsets with distinct etiological, clinical, and therapeutic characteristics.

Authors:  Gabriel Herrero-Beaumont; Jorge A Roman-Blas; Santos Castañeda; Sergio A Jimenez
Journal:  Semin Arthritis Rheum       Date:  2009-07-09       Impact factor: 5.532

7.  Accuracy of biomarkers obtained from cone beam computed tomography in assessing the internal trabecular structure of the mandibular condyle.

Authors:  Fouad H Ebrahim; Antonio C O Ruellas; Beatriz Paniagua; Erika Benavides; Karl Jepsen; Larry Wolford; Joao Roberto Goncalves; Lucia H S Cevidanes
Journal:  Oral Surg Oral Med Oral Pathol Oral Radiol       Date:  2017-08-24

Review 8.  Osteoarthritis year in review 2018: imaging.

Authors:  M T Nieminen; V Casula; M T Nevalainen; S Saarakkala
Journal:  Osteoarthritis Cartilage       Date:  2018-12-25       Impact factor: 6.576

Review 9.  Subchondral bone in osteoarthritis: insight into risk factors and microstructural changes.

Authors:  Guangyi Li; Jimin Yin; Junjie Gao; Tak S Cheng; Nathan J Pavlos; Changqing Zhang; Ming H Zheng
Journal:  Arthritis Res Ther       Date:  2013       Impact factor: 5.156

View more
  18 in total

1.  Automatic Segmentation of Mandibular Ramus and Condyles.

Authors:  Celia Le; Romain Deleat-Besson; Juan Prieto; Serge Brosset; Maxime Dumont; Winston Zhang; Lucia Cevidanes; Jonas Bianchi; Antonio Ruellas; Liliane Gomes; Marcela Gurgel; Camila Massaro; Aron Aliaga-Del Castillo; Marilia Yatabe; Erika Benavides; Fabiana Soki; Najla Al Turkestani; Karine Evangelista; Joao Goncalves; Jose Valladares-Neto; Maria Alves Garcia Silva; Cauby Chaves; Fabio Costa; Daniela Garib; Heesoo Oh; Jonathan Gryak; Martin Styner; Jean-Christophe Fillion-Robin; Beatriz Paniagua; Kayvan Najarian; Reza Soroushmehr
Journal:  Annu Int Conf IEEE Eng Med Biol Soc       Date:  2021-11

2.  Temporomandibular Joint Osteoarthritis Diagnosis Using Privileged Learning of Protein Markers.

Authors:  Winston Zhang; Jonas Bianchi; Najla Al Turkestani; Celia Le; Romain Deleat-Besson; Antonio Ruellas; Lucia Cevidanes; Marilia Yatabe; Joao Goncalves; Erika Benavides; Fabiana Soki; Juan Prieto; Beatriz Paniagua; Kayvan Najarian; Jonathan Gryak; Reza Soroushmehr
Journal:  Annu Int Conf IEEE Eng Med Biol Soc       Date:  2021-11

3.  Artificial intelligence and infrared thermography as auxiliary tools in the diagnosis of temporomandibular disorder.

Authors:  Elisa Diniz de Lima; José Alberto Souza Paulino; Ana Priscila Lira de Farias Freitas; José Eraldo Viana Ferreira; Jussara da Silva Barbosa; Diego Filipe Bezerra Silva; Patrícia Meira Bento; Ana Marly Araújo Maia Amorim; Daniela Pita Melo
Journal:  Dentomaxillofac Radiol       Date:  2021-10-06       Impact factor: 2.419

4.  Kindlin-2 loss in condylar chondrocytes causes spontaneous osteoarthritic lesions in the temporomandibular joint in mice.

Authors:  Yumei Lai; Wei Zheng; Minghao Qu; Christopher C Xiao; Sheng Chen; Qing Yao; Weiyuan Gong; Chu Tao; Qinnan Yan; Peijun Zhang; Xiaohao Wu; Guozhi Xiao
Journal:  Int J Oral Sci       Date:  2022-07-04       Impact factor: 24.897

Review 5.  The Role of Ultrasound in Temporomandibular Joint Disorders: An Update and Future Perspectives.

Authors:  Beatrice Maranini; Giovanni Ciancio; Stefano Mandrioli; Manlio Galiè; Marcello Govoni
Journal:  Front Med (Lausanne)       Date:  2022-06-20

6.  TMJOAI: An Artificial Web-Based Intelligence Tool for Early Diagnosis of the Temporomandibular Joint Osteoarthritis.

Authors:  Celia Le; Romain Deleat-Besson; Najla Al Turkestani; Lucia Cevidanes; Jonas Bianchi; Winston Zhang; Marcela Gurgel; Hina Shah; Juan Prieto; Tengfei Li
Journal:  Clin Image Based Proced Distrib Collab Learn Artif Intell Combat COVID 19 Secur Priv Preserv Mach Learn (2021)       Date:  2021-11-14

Review 7.  Clinical decision support systems in orthodontics: A narrative review of data science approaches.

Authors:  Najla Al Turkestani; Jonas Bianchi; Romain Deleat-Besson; Celia Le; Li Tengfei; Juan Carlos Prieto; Marcela Gurgel; Antonio C O Ruellas; Camila Massaro; Aron Aliaga Del Castillo; Karine Evangelista; Marilia Yatabe; Erika Benavides; Fabiana Soki; Winston Zhang; Kayvan Najarian; Jonathan Gryak; Martin Styner; Jean-Christophe Fillion-Robin; Beatriz Paniagua; Reza Soroushmehr; Lucia H S Cevidanes
Journal:  Orthod Craniofac Res       Date:  2021-05-24       Impact factor: 1.826

Review 8.  From pixel to image analysis.

Authors:  P F van der Stelt
Journal:  Dentomaxillofac Radiol       Date:  2020-11-11       Impact factor: 2.419

9.  Decision Support Systems in Temporomandibular Joint Osteoarthritis: A review of Data Science and Artificial Intelligence Applications.

Authors:  Jonas Bianchi; Antonio Ruellas; Juan Carlos Prieto; Tengfei Li; Reza Soroushmehr; Kayvan Najarian; Jonathan Gryak; Romain Deleat-Besson; Celia Le; Marilia Yatabe; Marcela Gurgel; Najla Al Turkestani; Beatriz Paniagua; Lucia Cevidanes
Journal:  Semin Orthod       Date:  2021-05-19       Impact factor: 1.340

Review 10.  Applications of artificial intelligence and machine learning in orthodontics: a scoping review.

Authors:  Yashodhan M Bichu; Ismaeel Hansa; Aditi Y Bichu; Pratik Premjani; Carlos Flores-Mir; Nikhilesh R Vaid
Journal:  Prog Orthod       Date:  2021-07-05       Impact factor: 2.750

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.