| Literature DB >> 31872173 |
Toru Hirano1, Masayuki Nishide1, Naoki Nonaka2, Jun Seita2, Kosuke Ebina3, Kazuhiro Sakurada2, Atsushi Kumanogoh1.
Abstract
OBJECTIVE: The purpose of this research was to develop a deep-learning model to assess radiographic finger joint destruction in RA.Entities:
Keywords: artificial intelligence; joint destruction; rheumatoid arthritis
Year: 2019 PMID: 31872173 PMCID: PMC6921374 DOI: 10.1093/rap/rkz047
Source DB: PubMed Journal: Rheumatol Adv Pract ISSN: 2514-1775
. 1Flow of machine learning
(A) The first step of the machine learning is a detection of finger joints, and the second step is a scoring of joint destruction. These steps are combined as the assessment model for radiographic finger joint destruction. (B) The first step used 3720 images for machine learning (*). The second step used 8760 images derived from 146 radiographs for training (train dataset) the convolutional neural network (CNN), and 2400 images derived from 40 radiographs for validation during the training process (validation dataset). Thirty radiographs were used for testing the performance of the model (test dataset). (C) The network of the CNN consists of two convolution layers, two pooling layers and three fully connected layers.
Characteristics of the patients
| Characteristic | Total | Train/validation | Test |
|---|---|---|---|
|
| 108 | 93 | 15 |
| Sex, female/male | 90/18 | 77/16 | 13/2 |
| Age, years | 64.9 (53.5, 72.6) | 64.9 (53.4, 72.6) | 64.2 (56.8, 76.0) |
| Disease duration, years | 12.2 (6.4, 17.6) | 12.3 (6.8, 18.6) | 9.4 (0.7, 14.1) |
| Class I/II/III/IV | 39/56/13/0 | 35/46/12/0 | 4/10/1/0 |
| Stage I/II/III/IV | 28/19/29/32 | 24/16/26/27 | 4/3/3/5 |
| ACPA positive, | 73 (67.6) | 63 (67.7) | 10 (66.7) |
| Number of radiographs | 216 | 186 | 30 |
Values of age and disease duration are given as the median and interquartile range. Other values are numbers in each category.
Scoring of joint destruction on training/validation dataset
| Score | JSN score | Erosion score | ||
|---|---|---|---|---|
| PIP/IP | MCP | PIP/IP | MCP | |
| 0 | 128 | 510 | 644 | 761 |
| 1 | 58 | 28 | 74 | 22 |
| 2 | 356 | 184 | 93 | 33 |
| 3 | 326 | 127 | 28 | 28 |
| 4 | 62 | 81 | 22 | 15 |
| 5 | N.D. | N.D. | 69 | 71 |
IP: IP joint of the thumb; JSN: joint space narrowing; N.D.: not defined.
. 2A representative image processed by the model
(A) A whole hand image processed by the model. The red rectangle indicates joints, such as PIP, IP or MCP. The number at the upper left in the rectangle indicates the joint space narrowing (JSN) score (yellow letter) and that at the lower right indicates erosion score (blue letter). (B) An enlarged image shows the joints with JSN score 0, 2, 3 or 4 and those with erosion score 0, 4 or 5. (C) Another enlarged image shows the joints with JSN score 0, 2 or 4 and those with erosion score 0, 3 or 5. IP: IP joint of the thumb.
. 3Test of the model
(A, B) The accuracy, identical to the percentage of exact agreement (PEA), and the loss of joint space narrowing (JSN) score during the process for training dataset (red line) and validation dataset (blue line). (C, D) The accuracy (PEA) and the loss of erosion score for training dataset (red line) and validation dataset (blue line). (E) Distribution of the JSN score assigned by the model (black bar) and by clinicians (light and dark grey bars). (F) Correlation of JSN score between the model and clinicians. (G) Distribution of erosion score assigned by the model (black bar) and by clinicians (light and dark grey bars). (H) Correlation of erosion score between the model and clinicians.
Consistency of scores by the model and clinicians
| Evaluator | Index | Total (%) | PIP/IP (%) | MCP (%) |
|---|---|---|---|---|
| For JSN | ||||
| Model | PEA | 65.4 | 58.3 | 72.5 |
| PCA | 85.3 | 84.0 | 86.6 | |
| Model | PEA | 49.3 | 24.3 | 74.6 |
| PCA | 64.0 | 43.1 | 85.2 | |
| Clinician 1 | PEA | 55.5 | 36.7 | 74.5 |
| PCA | 67.6 | 52.7 | 82.6 | |
| For erosion | ||||
| Model | PEA | 74.1 | 66.0 | 82.4 |
| PCA | 84.3 | 81.9 | 86.6 | |
| Model | PEA | 70.6 | 65.2 | 76.1 |
| PCA | 84.3 | 81.3 | 87.3 | |
| Clinician 1 | PEA | 70.6 | 66.0 | 75.2 |
| PCA | 88.0 | 88.7 | 87.2 | |
A total of 286 joints were assessed. Fourteen joints were not identified by the model. PEA is the percentage of exact agreement, and PCA is the ratio of close agreement (within 1.0 score difference) among evaluators.