| Literature DB >> 36186840 |
Hidetaka Uryu1,2, Ohsuke Migita3,4,5, Minami Ozawa5, Chikako Kamijo3, Saki Aoto1, Kohji Okamura6, Fuyuki Hasegawa7,8, Torayuki Okuyama9, Motomichi Kosuga10,11,9, Kenichiro Hata4,12.
Abstract
Fabry disease is a congenital lysosomal storage disease, and most of these cases develop organ damage in middle age. There are some promising therapeutic options for this disorder, which can stabilize the progression of the disease. However, a long delay in diagnosis prevents early intervention, resulting in treatment failure. Because Fabry disease is a rare disease, it is not well recognized and disease specific screening tests are rarely performed. Hence, a novel approach to for detecting patients with a widely practiced clinical test is crucial for the early detection of the disease. Recently, decision support systems based on artificial intelligence (AI) have been developed in many clinical fields. However, the construction of these models requires datasets from a large number of samples; this aspect is one of the main obstacles in AI-based approaches for rare diseases. In this study, with a novel image amplification method to construct the dataset for AI-model training, we built the deep neural-network model to detect Fabry cases from their urine samples. Sensitivity, specificity, and the AUC of the models on validation dataset were 0.902 (95% CI, 0.900-0.903), 0.977 (0.950-0.980), and 0.968 (0.964-0.972), respectively. This model could also extract disease-specific findings that are interpretable with human recognition. These results indicate that we can apply novel AI models for rare diseases based on this image amplification method we developed. We expect this approach could contribute to the diagnosis of Fabry disease. Synopsis: This is the first reported AI-based decision support system to detect undiagnosed Fabry cases, and our new image amplification method will contribute to the AI models for other rare disorders.Entities:
Keywords: AI, artificial intelligence; AUC, area under the curve; AdHE, adaptive histogram equalization; Artificial intelligence; CNN, convolutional neural network; CntStr, contrast stretching; Deep learning; ERT, enzyme replacement therapy; Fabry disease; Image augmentation; InceptResNet, InceptionResNetV2; Mulberry cells; OrdHE, ordinary histogram equalization; ROC, receiver operating characteristic; Xcep, Xception; alpha-Gal A, α- galactosidase A
Year: 2022 PMID: 36186840 PMCID: PMC9523392 DOI: 10.1016/j.ymgmr.2022.100921
Source DB: PubMed Journal: Mol Genet Metab Rep ISSN: 2214-4269
Fig. 1Training and validation datasets for detecting Fabry disease.
All enrolled images are divided into training or validation groups, preprocessed, and collected in each dataset.
Fig. 2Overview of the image preprocessing workflow for training datasets.
Each image in the training group was first processed using histogram equalization for contrast adjustment, as shown in Fig. 3A. 1:4 scale-sized rectangles were randomly clipped from these images, as shown in Figs. 3B–3D. Among these processed segmented images, the segmented positive images were categorized as positive training datasets, while the negative segmented images and the background of the positive images were categorized as negative datasets. For training the neural network models, these datasets were randomly separated as three-fourth for training and one-fourth for hyper-parameter tuning.
Fig. 3Image preprocessing of the original images into segmented images.
(A) Representative images before and after equalization are shown in the upper row. Histograms of the pixel intensity before and after equalization are shown in the lower row. (B) Rectangular areas that were ¼ the size of the original image, including the center of the mulberry cells, were randomly clipped out for the positive training dataset. (C) As the positive-image backgrounds, rectangular areas that were ¼ the size of the original image and did not overlap any part of mulberry cells were randomly clipped out. (D) As negative segmented images, rectangular areas that were ¼ the size of the original image were randomly clipped out. The segmented images from (B) were incorporated into the positive dataset, whereas those from (C) and (D) were incorporated into the negative dataset for training the neural network, as shown in Fig. 2.
Fig. 4Model evaluation based on each training dataset.
(A, B) Representative ROC curves (A) and boxplots of AUCs for each model (B) were shown (N = 15–25 per group, means ± standard error). *p < .05; **p < .01.
(C-E) Representative histograms of the scores for the negative (C), positive (D), and Fabry background segments (E), predicted with the model C. The intervals on the y-axis scale were logarithmically transformed.
(F) Visualization of the class activation area on the segmented images. The original urinary sediment images (left) were segmented and displayed as heatmaps according to the contribution of the model activation for classification (center). Each segment was processed using the models and converted to scores for Fabry disease prediction (right).