| Literature DB >> 30974803 |
Jihye Lim1, Jungyoon Kim2, Songhee Cheon3.
Abstract
A large number of people suffer from certain types of osteoarthritis, such as knee, hip, and spine osteoarthritis. A correct prediction of osteoarthritis is an essential step to effectively diagnose and prevent severe osteoarthritis. Osteoarthritis is commonly diagnosed by experts through manual inspection of patients' medical images, which are usually collected in hospitals. Checking the occurrence of osteoarthritis is somewhat time-consuming for patients. In addition, the current studies are focused on automatically detecting osteoarthritis through image-based deep learning algorithms. This needs patients' medical images, which requires patients to visit the hospital. However, medical utilization and health behavior information as statistical data are easier to collect and access than medical images. Using indirect statistical data without any medical images to predict the occurrence of diverse forms of OA can have significant impacts on pro-active and preventive medical care. In this study, we used a deep neural network for detecting the occurrence of osteoarthritis using patient's statistical data of medical utilization and health behavior information. The study was based on 5749 subjects. Principal component analysis with quantile transformer scaling was employed to generate features from the patients' simple background medical records and identify the occurrence of osteoarthritis. Our experiments showed that the proposed method using deep neural network with scaled PCA resulted in 76.8% of area under the curve (AUC) and minimized the effort to generate features. Hence, this methos can be a promising tool for patients and doctors to prescreen for possible osteoarthritis to reduce health costs and patients' time in hospitals.Entities:
Keywords: deep learning; feature extraction; osteoarthritis; prediction
Mesh:
Year: 2019 PMID: 30974803 PMCID: PMC6480580 DOI: 10.3390/ijerph16071281
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1The selection process for the study population (KNHANES: Korea National Health and Nutrition Examination Survey).
Figure 2The proposed system architecture of the deep neural network (DNN) with scaled principal component analysis (PCA).
Figure 3Two-dimensional plots of first and second principal components. (a) Unscaled-PCA. (b) Min-max scaling with PCA. (c) Standard scaling with PCA. (d) Quantile transformer scaling with PCA.
Confusion matrix of the proposed method.
| Confusion Matrix | Predicted (T) | Predicted (F) |
|---|---|---|
| Actual (T) | 270 | 135 |
| Actual (F) | 413 | 1137 |
Figure 4Receiver operating characteristic (ROC) curve for the predictive performance of DNN with scaled-PCA and area under the curve (AUC) of 76.8%.
Correlation coefficient of initial features. BMI: body mass index.
| Feature | Correlation Coefficient | Feature | Correlation Coefficient |
|---|---|---|---|
| Year | −0.023309 | Angina | 0.058126 |
| Region | −0.017512 | Osteoporosis |
|
| Sex |
| Diabetic mellitus | 0.034071 |
| Age |
| Alcohol | -0.160021 |
| Education | −0.24423 | Smoking | 0.190024 |
| Household income | −0.158579 | Physical activity | 0.073899 |
| Married | −0.008994 | BMI | 0.147746 |
| Health status |
| BMI group | 0.14497 |
| Hypertension | 0.103292 | Obesity | 0.118041 |
| Dyslipidemia | 0.129874 | Chronic disease count |
|
| Stroke | 0.029012 | Region category | −0.012289 |
| Myocardial infarction | 0.016415 | Income quartile | −0.182163 |
Bold: the top two best correlation values (±0.25), : the best correlation values (±0.2).