| Literature DB >> 31304376 |
Chin-Chi Kuo1,2, Chun-Min Chang3, Kuan-Ting Liu3, Wei-Kai Lin3, Hsiu-Yin Chiang1, Chih-Wei Chung1, Meng-Ru Ho3, Pei-Ran Sun4, Rong-Lin Yang4, Kuan-Ta Chen3.
Abstract
Prediction of kidney function and chronic kidney disease (CKD) through kidney ultrasound imaging has long been considered desirable in clinical practice because of its safety, convenience, and affordability. However, this highly desirable approach is beyond the capability of human vision. We developed a deep learning approach for automatically determining the estimated glomerular filtration rate (eGFR) and CKD status. We exploited the transfer learning technique, integrating the powerful ResNet model pretrained on an ImageNet dataset in our neural network architecture, to predict kidney function based on 4,505 kidney ultrasound images labeled using eGFRs derived from serum creatinine concentrations. To further extract the information from ultrasound images, we leveraged kidney length annotations to remove the peripheral region of the kidneys and applied various data augmentation schemes to produce additional data with variations. Bootstrap aggregation was also applied to avoid overfitting and improve the model's generalization. Moreover, the kidney function features obtained by our deep neural network were used to identify the CKD status defined by an eGFR of <60 ml/min/1.73 m2. A Pearson correlation coefficient of 0.741 indicated the strong relationship between artificial intelligence (AI)- and creatinine-based GFR estimations. Overall CKD status classification accuracy of our model was 85.6% -higher than that of experienced nephrologists (60.3%-80.1%). Our model is the first fundamental step toward realizing the potential of transforming kidney ultrasound imaging into an effective, real-time, distant screening tool. AI-GFR estimation offers the possibility of noninvasive assessment of kidney function, a key goal of AI-powered functional automation in clinical practice.Entities:
Keywords: Epidemiology; Outcomes research; Ultrasonography
Year: 2019 PMID: 31304376 PMCID: PMC6550224 DOI: 10.1038/s41746-019-0104-2
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
The clinical characteristics of the study datasets at patient and image level
| Variables | Patients ( | Ultrasound Images ( | Ultrasound Images in nontesting dataset ( | Ultrasound Images in testing dataset ( | |
|---|---|---|---|---|---|
| Demographics, median (IQR) | |||||
| Age (years) | 65 (53, 74) | 65 (52, 74) | 65 (53, 74) | 63 (51, 74) | 0.269 |
| Male, n (%) | 715 (55.1) | 2464 (54.7) | 2201 (54.9) | 263 (53.1) | 0.459 |
| Comorbidity, n (%) | |||||
| Cardiovascular disease | 1001 (77.2) | 3504 (77.8) | 3117 (77.7) | 387 (78.2) | 0.820 |
| Hypertension | 968 (74.6) | 3405 (75.6) | 3024 (75.4) | 381 (77.0) | 0.446 |
| Diabetes | 533 (41.1) | 1888 (41.9) | 1697 (42.3) | 191 (38.6) | 0.112 |
| Biochemical value, median (IQR) | |||||
| Serum creatinine (mg/dL) | 2.1 (1.4, 4.4) | 2.1 (1.4, 4.3) | 2.1 (1.4, 4.4) | 2.0 (1.4, 3.6) | 0.003 |
| eGFR (mL/min/1.73 m2) | 30.0 (12.3, 48.6) | 30.1 (12.6, 48.7) | 29.9 (12.3, 48.2) | 31.6 (15.7, 55.5) | 0.001 |
| Sonographic parameter, median (IQR) | |||||
| Kidney length (cm) | 9.72 (8.76, 10.53) | 9.62 (8.69, 10.51) | 9.63 (8.66, 10.51) | 9.54 (8.80, 10.48) | 0.703 |
ap values denotes probability for difference between the nontesting and testing datasets and are calculated by Wilcoxon rank sum test for continuous variables and Chi-square test (or Fisher’s exact test as appropriate) for categorical variables
Fig. 1CNN architecture for kidney function estimation based on kidney sonographic images. a Proposed neural network architecture included 33 residual blocks (100 convolution layers in total) as a CNN-based feature extractor and three fully connected layers of 512, 512, and 256 neurons as a regressor for eGFR prediction. Feature maps are colored in blue and the regressor is specified in yellow. The dropout probability was set at 0.5. (b) Components of the first residual block in the CNN
Fig. 2Flowchart shows the summary of the data processing from bagging in the training phase to the final evaluation phase. Briefly, we obtained 10 ResNet models for predicting continuous eGFR and 10 XGBoost models for CKD status classification. In the evaluation phase, we averaged the output of 10 models as the final prediction result
Fig. 3Performance of predicting continuous eGFR (estimated glomerular filtration rate) levels. a Learning curve of the ResNet model. Because we restored the model with minimum validation loss, in this case, we kept the model at epoch 14, where the smallest overfitting occurred. b Scatter plot of both predicted and actual eGFRs with a linear regression prediction line. γ, Pearson correlation coefficient. c Bland-Altman plot of difference between predicted and actual eGFR (predicted- actual eGFR) against mean eGFR. The blue solid line indicates the mean of difference from zero (thin black dotted line) with crude 95% confidence interval (blue dotted line) and the bold black dotted line represents the 95% crude limits of agreement. A linear regression line (red line) with 95% limits of agreement (red dotted line) characterizes the relationship between mean difference and the magnitude of eGFR with a slope of −0.82 (p value < 0.01). d Bland-Altman plot where mean differences are presented in percentage. Shaded grey areas in (c) and (d) represent the range of mean eGFR less than 60 ml/min/1.73 m2
Fig. 4Performance of predicting CKD status. a Confusion matrix of the CKD status classification (eGFR < 60 ml/min/1.73 m2). b ROC curves of the CKD status classification using different eGFR cutoff values based on our proposed CNN model
Fig. 5Tailored image-cropping method, based on two markers that annotated the kidney length, was used to remove the irrelevant peripheral region of the kidneys. To unify the image size to our neural network model, we resized cropped images to 224 × 224 pixels. Data augmentation schemes comprising shift, rotation, and horizontal flip were performed