| Literature DB >> 35887524 |
Kun-Yi Lin1, Yuan-Ta Li2, Juin-Yi Han3, Chia-Chun Wu1, Chi-Min Chu4, Shao-Yu Peng5, Tsu-Te Yeh1.
Abstract
OBJECTIVE: To use deep learning to predict the probability of triangular fibrocartilage complex (TFCC) injury in patients' MRI scans.Entities:
Keywords: data analysis; deep learning; magnetic resonance imaging; retrospective study; triangular fibrocartilage complex
Year: 2022 PMID: 35887524 PMCID: PMC9322609 DOI: 10.3390/jpm12071029
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1The local and external datasets for model training, validation, and testing.
Figure 2(A) An overview of both algorithms used in this study. The topmost data flow shows how Algorithm 1 (ResNet50) processes images along the coronal axis. ResNet50 itself was originally a model for processing 2D images. For ResNet50 to integrate the information from the z-axis, the MR images would first go through the z-direction cropping, which selects the middle eight of a set of MR images. Z-compression then transforms the eight images into three channels. In the ResNet50 architecture, this study uses the model’s channel dimension as the z-dimension. Other data flows show how Algorithm 2 (MRNet) processes images along three axes. Training of MRNet is two-stage. Three AlexNets are optimized respectively, indicated by the three log-likelihood maximization. In the second stage training, their outputs are then passed to a logistic regression classifier for an ensemble result. In the design for the MRNet, where the backbone AlexNet was also designed for 2D images, batch dimension was used for the z-dimension, and information along the z-axis was integrated with z-max pooling. (B)The architecture of ResNet50, the backbone of Algorithm 1 in this study. ResNet50 features residual links, indicated by the jumping arrows to the right of the layer stacks, facilitating the passing of information into a very deep network. Each convolution layer (colored block) is followed by a batch normalization and ReLU activation function, which are not shown in this Figure 2. (C) The architecture of AlexNet, the main backbone of MRNet in this study. The feature extraction of AlexNet consists of five convolution layers and three max-pooling layers, in the depicted order. In the original AlexNet, feature extraction ends with global average pooling to 256 × 6 × 6 and the output tensor were flattened to a 9216-dimensional vector for the downstream classifier. Here, we followed the MRNet’s design and average-pool the output to 256 × 1 × 1 before entering the fully-connected network.
Figure 3The ROC curves of algorithms (a) 1 and (b) 2 for the internal dataset. The ROC curves of algorithms (c) 1 and (d) 2 for the external dataset. ROC, receiver operating characteristic.
Figure 4The activation heatmaps overlaid on the original MRI scans. The white arrows indicate the overlapping area of the contrast media and heatmaps.