| Literature DB >> 29396120 |
Bradley J Erickson1, Panagiotis Korfiatis2, Timothy L Kline2, Zeynettin Akkus2, Kenneth Philbrick2, Alexander D Weston2.
Abstract
Deep learning (DL) is a popular method that is used to perform many important tasks in radiology and medical imaging. Some forms of DL are able to accurately segment organs (essentially, trace the boundaries, enabling volume measurements or calculation of other properties). Other DL networks are able to predict important properties from regions of an image-for instance, whether something is malignant, molecular markers for tissue in a region, even prognostic markers. DL is easier to train than traditional machine learning methods, but requires more data and much more care in analyzing results. It will automatically find the features of importance, but understanding what those features are can be a challenge. This article describes the basic concepts of DL systems and some of the traps that exist in building DL systems and how to identify those traps.Entities:
Keywords: Deep learning; computer-aided diagnosis; machine learning
Mesh:
Year: 2018 PMID: 29396120 PMCID: PMC5877825 DOI: 10.1016/j.jacr.2017.12.027
Source DB: PubMed Journal: J Am Coll Radiol ISSN: 1546-1440 Impact factor: 5.532
Fig 1Example of three activation functions used in neural networks: (a) rectified linear unit (ReLU), (b) leaky ReLU, (c) sigmoid, and (d) Tanh. Traditional neural networks used sigmoidal functions that simulated actual neurons, but are less effective in current networks, likely because they do not adequately reward very strong activations.
Fig 2Architecture of two popular networks: (a) AlexNet and (b) VGGNet. Input, input image; Conv, convolutional layer; Pool, maximum value pooling layer; Full Conn, fully connected layer; SoftMax, softmax function, also known as normalized exponential function that takes an input vector and maps it to the range of (0,1). This is the class probability. If there are 1,000 outputs in this layer, each value in the 1,000-element vector would correspond with the probability of the input image being that class. The highest value(s) are the predicted class(es).