| Literature DB >> 30962509 |
Jaynal Abedin1, Joseph Antony2, Kevin McGuinness2, Kieran Moran2,3, Noel E O'Connor2, Dietrich Rebholz-Schuhmann4,5, John Newell4,6.
Abstract
Knee osteoarthritis (KOA) is a disease that impairs knee function and causes pain. A radiologist reviews knee X-ray images and grades the severity level of the impairments according to the Kellgren and Lawrence grading scheme; a five-point ordinal scale (0-4). In this study, we used Elastic Net (EN) and Random Forests (RF) to build predictive models using patient assessment data (i.e. signs and symptoms of both knees and medication use) and a convolution neural network (CNN) trained using X-ray images only. Linear mixed effect models (LMM) were used to model the within subject correlation between the two knees. The root mean squared error for the CNN, EN, and RF models was 0.77, 0.97 and 0.94 respectively. The LMM shows similar overall prediction accuracy as the EN regression but correctly accounted for the hierarchical structure of the data resulting in more reliable inference. Useful explanatory variables were identified that could be used for patient monitoring before X-ray imaging. Our analyses suggest that the models trained for predicting the KOA severity levels achieve comparable results when modeling X-ray images and patient data. The subjectivity in the KL grade is still a primary concern.Entities:
Mesh:
Year: 2019 PMID: 30962509 PMCID: PMC6453934 DOI: 10.1038/s41598-019-42215-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Distribution of KOA severity between training and validation data.
| Severity level | Training: Freq (%) | Validation: Freq (%) | Total: Freq (%) |
|---|---|---|---|
| Level 0 | 1818 (43.2) | 685 (40.5) | 2503 (42.4) |
| Level 1 | 728 (17.3) | 312 (18.4) | 1040 (17.6) |
| Level 2 | 1045 (24.8) | 416 (24.6) | 1461 (24.8) |
| Level 3 | 503 (12.5) | 237 (14.0) | 740 (12.5) |
| Level 4 | 115 (2.7) | 42 (2.5) | 157 (2.7) |
Summary statistics of patient characteristics between complete, training and validation data.
| Characteristics | Training: Mean (SD) | Validation: Mean (SD) | Total: Mean (SD) |
|---|---|---|---|
| Age (year) | 60.3 (9.2) | 61.1 (8.9) | 60.5 (9.1) |
| Female (Freq. %) | 1177 (56.0) | 454 (53.7) | 1631 (55.3) |
| Height (mm) | 1685.2 (93.2) | 1687.3 (92.6) | 1685.8 (93.0) |
| Weight (kg) | 80.7 (16.3) | 80.5 (15.7) | 80.6 (16.1) |
| BMI (kg/ | 28.3 (4.8) | 28.2 (4.6) | 28.3 (4.7) |
| Systolic (mmHg) | 123.3 (15.9) | 123.7 (16.7) | 123.3 (16.1) |
| Diastolic (mmHg) | 75.5 (9.8) | 75.4 (9.6) | 75.5 (9.8) |
Figure 1Boxplots of patient characteristics by Knee Osteoarthritis Severity.
Figure 2Correlation among predictors (dark color indicates stronger correlation).
Figure 3Estimation of hyper-parameter λ. The Y-axis represents RMSE for different values of λ whereas the upper horizontal axis represents the number of predictors. The RMSE increases as the number of predictors decrease but stabilizes after a certain number of predictors are added. The red points represents RMSE and the gray line segments represents a 95% confidence interval corresponding to each RMSE. The optimal value of λ is the minimum value corresponding to a steady-state RMSE.
Figure 4Contribution of each variable on KOA severity score prediction by Elastic Net Regression.
Estimated RMSE from different models for each level of KOA severity level.
| Severity Level | Elastic Net Regression | Linear Mixed Model (LMM) | Random Forest Regression | CNN Regression |
|---|---|---|---|---|
| Level 0 | 0.917 | 0.920 | 0.909 | 0.816 |
| Level 1 | 0.563 | 0.591 | 0.511 | 0.485 |
| Level 2 | 0.881 | 0.895 | 0.853 | 0.840 |
| Level 3 | 1.320 | 1.320 | 1.270 | 0.795 |
| Level 4 | 2.140 | 2.10 | 2.02 | 0.846 |
|
| 0.973 | 0.978 | 0.943 | 0.770 |
Figure 5Data pre-processing work flow. (a) Inspecting entire dataset manually to get subset of relevant candidate variables, (b) calculate percentage of missing values for each variables and also inspect sparsity of the categorical variables. Drop a variable that has more than 15% missing values or very low e.g. less than 5% into one category in a binary variable, (c) creating dummy variables from multi-category variables and then split the dataset into training and test data for predictive model building.
Best performing CNN for classifying the knee images.
| Layer | Kernels | Kernel Size | Strides | Output shape |
|---|---|---|---|---|
| conv1 | 32 | 11 × 11 | 2 | 32 × 100 × 150 |
| maxPool1 | — | 3 × 3 | 2 | 32 × 49 × 74 |
| conv2 | 64 | 5 × 5 | 1 | 64 × 49 × 74 |
| maxPool2 | — | 3 × 3 | 2 | 64 × 24 × 36 |
| conv3 | 96 | 3 × 3 | 1 | 96 × 24 × 36 |
| maxPool3 | — | 3 × 3 | 2 | 96 × 11 × 17 |
| conv4 | 128 | 3 × 3 | 1 | 128 × 11 × 17 |
| maxPool4 | — | 3 × 3 | 2 | 128 × 5 × 8 |
Figure 6Learning curves: training and validation losses, and accuracy of the fully trained CNN.