| Literature DB >> 31376272 |
Igbe Tobore1,2, Jingzhen Li1, Liu Yuhang1, Yousef Al-Handarish1, Abhishek Kandwal1, Zedong Nie1, Lei Wang1.
Abstract
The use of deep learning (DL) for the analysis and diagnosis of biomedical and health care problems has received unprecedented attention in the last decade. The technique has recorded a number of achievements for unearthing meaningful features and accomplishing tasks that were hitherto difficult to solve by other methods and human experts. Currently, biological and medical devices, treatment, and applications are capable of generating large volumes of data in the form of images, sounds, text, graphs, and signals creating the concept of big data. The innovation of DL is a developing trend in the wake of big data for data representation and analysis. DL is a type of machine learning algorithm that has deeper (or more) hidden layers of similar function cascaded into the network and has the capability to make meaning from medical big data. Current transformation drivers to achieve personalized health care delivery will be possible with the use of mobile health (mHealth). DL can provide the analysis for the deluge of data generated from mHealth apps. This paper reviews the fundamentals of DL methods and presents a general view of the trends in DL by capturing literature from PubMed and the Institute of Electrical and Electronics Engineers database publications that implement different variants of DL. We highlight the implementation of DL in health care, which we categorize into biological system, electronic health record, medical image, and physiological signals. In addition, we discuss some inherent challenges of DL affecting biomedical and health domain, as well as prospective research directions that focus on improving health management by promoting the application of physiological signals and modern internet technology. ©Igbe Tobore, Jingzhen Li, Liu Yuhang, Yousef Al-Handarish, Abhishek Kandwal, Zedong Nie, Lei Wang. Originally published in JMIR Mhealth and Uhealth (http://mhealth.jmir.org), 02.08.2019.Entities:
Keywords: ECG; EEG; artificial intelligence; big data; biologicals; biomedical; deep learning; electronic health record; mHealth; machine learning; medical imaging
Year: 2019 PMID: 31376272 PMCID: PMC6696854 DOI: 10.2196/11966
Source DB: PubMed Journal: JMIR Mhealth Uhealth ISSN: 2291-5222 Impact factor: 4.773
Figure 1Google Trends for “big data in healthcare” between 2010 and 2018; (a) occurrence timeline graph and (b) prevalence occurrence by country.
Figure 2How machine learning techniques scale with amount of data.
Figure 3Relationship between artificial intelligence, machine learning and deep learning with emerging timeline.
Figure 4Trends of published papers that implement deep learning techniques. The data are generated by searching for “deep learning” on PubMed database.
Figure 5Structure of a simple autoencoder showing input, hidden, and output layers. The interconnection between the neurons is shown in the direction of the arrows.
Figure 6Feedforward recurrent neural network implementation. The final output from the output layer is fed back as part the input in the input layer. Where It and Ot are the input and output at time t and Ot-1 is the output for the previous input at time t-1.
Figure 7Long short-term memory representation for output sequence influence by input sequence and previous output. Where It-2 and It-1, Ot-2 and Ot-1, Mt-2 and Mt-1 are inputs, outputs, memory respectively for previous time steps. It, Ot, and Mt are the current input, output and memory state of the LSTM cell. Ot+1, and Mt+1 represent subsequent output and memory state respectively for subsequent time step input It+1. I, O, M represent the recurrent input, output and memory state respectively for a simplified LSTM cell operation and Wr is the weight for the computation in the cell.
Figure 8Simple implementation of convolutional neural network to show the sequence of operation to identify “X” with 2 filters.
Figure 9Structure of convolutional neural network with 3 convolution and pooling (conv+pool) layers and 2 fully connected (FC) layers.
Figure 10Left: A general Boltzmann machine. The top layer represents a vector of stochastic binary “hidden” features and the bottom layer represents a vector of stochastic binary “visible” variables. Right: A restricted Boltzmann machine with no hidden-to-hidden and no visible-to-visible connections. Where L, J, and W represent the visible layer, hidden layer, and connection weight between the layers respectively.
Figure 11Left: A 3-layer deep belief network. Right: A stack of modified restricted Boltzmann machine constructed to create a deep Boltzmann machine. V: visible vector; h: a set of hidden neurons; w: connections.
Figure 12Research publications in different category of deep learning methods. These statistics are obtained from PubMed database by searching for publications containing any of the deep learning method in title or abstract.
Figure 13Publications distributions for 5 years in different categories of deep learning methods. These figures are extracted from the Institute of Electrical and Electronics Engineers database of papers from conferences and journals and magazines by using advanced query to search in publication title and abstract containing any of the deep learning methods ((“Publication Title”: Autoencoder) OR (“Abstract”: Autoencoder)).
Figure 14Descriptive summary of biomedical and health applications category and example implemented with deep learning methods: convolutional neural network (CNN), recurrent neural network (RNN), autoencoder (AE), deep boltzmann machine (DBM) and deep belief network (DBN).
Deep learning implementation in biological systems.
| Reference | Task | Method | Remark |
| Saha et al, 2018 [ | Mitosis detection | CNNa | The prediction model has an improved 92% precision, 88% recall, and 90% F-score over conventional machine learning methods |
| Saha et al, 2018 [ | Cell membranes and nuclei classification | CNN | The identification model achieved predictive value of 98.33% accuracy and 6.84% false-positive rate which was comparable with human expert |
| Xu et al, 2017 [ | Red blood cells classification | CNN | Framework classified sickle-shaped red blood cells in an automated manner with above 90% accuracy |
| Wang et al, 2017 [ | Segmentation of adipose tissue | CNN | The model was tested on 2 datasets; the accuracy produced 95.8% and 96.8% for computed tomography slice selection-CNN and fat pixel segmentation-CNN, respectively |
| Xu et al, 2017 [ | Classification, segmentation of tissue | CNN | Outcome generates patterns that reveal biological insights that have been verified by pathologist |
| Hughes et al, 2016 [ | Reactivity to biological macromolecules | CNN | The model captured molecules that would have been missed by standard reactivity screening experiments |
| Song et al, 2018 [ | Segmentation of cervical cytoplasm | CNN | Experimental results achieved an accuracy of 94.50% for nucleus region detection and a precision of 0.9143(SD 0.0202) and a recall of 0.8726(SD 0.0008) for nucleus cell segmentation |
| Gurcan et al, 2001 [ | Detection of microcalcifications | CNN | The results demonstrated optimization of cost surface whose characteristics are not known |
| Han et al, 2015 [ | Membrane bioreactor permeability | RNNb | Simulation and experimental results demonstrate the reliability and effectiveness of the proposed intelligent detection system |
| Zhang et al, 2017 [ | Sequence-specific correction for RNA | RNN | RNN-based bias correction method compares well with the state-of-the-art sequence-specific bias correction method |
| Ren et al, 2018 [ | Prostate cancer differentiation | RNN | Their study demonstrates that prostate cancer patients with Gleason score of 4+3 have a higher risk of disease progression and recurrence compared with prostate cancer patients with Gleason score of 3+4 |
| Wang et al, 2018 [ | Detecting biomedical event trigger for protein and gene | RNN | F-score potentially reached about 80%, which is better than comparative experimental methods |
| Chen et al, 2018 [ | Effective drug combination | DBNc | Predict effective drug combination from gene expression and pathway and ontology fingerprints properties for treating cancer |
| Beevi et al, 2017 [ | Cell mitosis detection | DBN | The algorithm provides improved performance compared with other state-of-the-art techniques with average F-score of 84.29% for the MITOSd dataset and 75% for the clinical dataset from Regional Cancer Centre |
| Zhang et al, 2014 [ | Identification of critical proteins | DBN | The results of comparison showed that DBN had higher reconstruction rate compared with baseline methods and more proteins of critical value to yeast cell cycle process were identified |
| Jiang et al, 2016 [ | Huntington disease identification | DBMe | Results demonstrate that the model can detect important information for differential analysis of time series gene expression datasets |
| Ghasemi et al, 2017[ | Biological activity prediction | DBN | The output of the model demonstrated significant superiority to traditional neural network with random parameters |
| Eraslan et al, 2019 [ | Single cell RNA-seq denoising | AEf | Outperforms existing methods for enhancing biological discovery and data imputation in terms of quality and speed |
| Guan et al, 2018 [ | Gene function annotation | AE | The model can capture intermediate representations to partial corruption of input pattern and generate low-dimensional codes superior to conditional dimension reduction tools |
| Chen et al, 2018 [ | Genomics functional characterization | AE | Retains sufficient biological information with regard to tumor subtypes and clinical prognostic significance. Provides high reproducibility on survival analysis and accurate prediction for cancer subtypes |
| Wang et al, 2018 [ | Visualization of single cell RNA- seq | AE | Reconstructs the cell dynamics in preimplantation embryos and identifies several candidate marker genes associated with early embryo development |
| Maggio et al, 2018 [ | Prognostic profiling for survival prediction | AE | The embedding technique can be used to better stratify patients’ survival |
| Hu et al, 2018 [ | Prediction of drug-likeness | AE | The classification accuracy of drug-like/nondrug–like models are 91.04% on WDI-ACDg databases and 91.20% on MDDR-ZINCh database |
aCNN: convolutional neural network.
bRNN: recurrent neural network.
cDBN: deep belief network.
dMITOS: mitosis detection in breast cancer histological images.
eDBM: deep Boltzmann machine.
fAE: autoencoder.
gWDI-ACD: world drug index-available chemicals directory.
hMDDR-ZINC: MDL drug data report-zinc compound.
Deep learning implementation in electronic health records and medical report management.
| Reference | Task | Method | Remark |
| Wickramasinghe et al, 2017 [ | Extract features from medical records | CNNa | It achieves superior accuracy compared with traditional techniques to detect meaningful clinical motifs and uncovers the underlying structure of the disease |
| Lin et al, 2017 [ | Disease code classification | CNN | The method had a higher testing accuracy (mean AUCb=0.9696; mean F-score=0.9086) than traditional NLPc-based approaches (mean AUC range 0.8183-0.9571; mean F-score range 0.5050-0.8739) |
| Cheng et al, 2016 [ | Risk prediction of chronic congestive heart failure | CNN | The model performance increases the prediction accuracy by 1.5% when 60% training data were used and 5.2% when it is 90% training data |
| Zeng et al, 2017 [ | MobileDeepPill: Recognition of unconstrained pill image | CNN | DLd-based pill image recognition algorithm won the first price of the NIHe NLMf Pill Image Recognition Challenge |
| Li et al, 2018 [ | Extraction of adverse drug events | RNNg | The DL model achieved a result of F-score=65.9%, which is higher than F-score=61.7% from the best system in the MADEh1.0 challenge |
| Zhang et al, 2018 [ | Identify clinical named entity | RNN | CRFi and bidirectional LSTMj-CRF achieved a precision of 0.9203 and 0.9112, recall of 0.8709 and 0.8974, and F-score score of 0.8949 and 0.9043, respectively |
| Jagannatha et al, 2016 [ | Prediction based on sequence labeling | RNN | Prediction model improved detection of the exact phrase for various medical entities |
| Jagannatha et al, 2016 [ | Extraction of medical events | RNN | Cross-validated microaverage of precision, recall, and F-score for all medical tags for gated recurrent unit–documents are 0.812, 0.7938, and 0.8031, respectively, which are higher than other methods |
| Rajkomar et al, 2018 [ | Representation of patients’ record | RNN | Achieved high accuracy for tasks such as predicting in-hospital mortality, prolonged length of stay, and all of a patient’s final discharge diagnoses |
| Hou et al, 2018 [ | Extraction of drug-drug interaction | RNN | DL can efficiently aid in information extraction (drug-drug interaction) from text. The F-score ranged from 49% to 81% |
| Choi et al, 2015 [ | Predicting clinical events | RNN | On the basis of separate blind test set evaluation, the model can perform differential diagnosis with up to 79% recall, which is significantly higher than several baselines |
| Choi et al, 2016 [ | Detection of heart failure onset | RNN | When using an 18-month observation window, the AUC for the RNN model increased to 0.883 and was significantly higher than the 0.834 AUC for the best of the baseline methods |
| Volkova et al, 2017 [ | Forecasting influenza-like illness | RNN | LSTM model outperformed previously used models in all metrics, for example, Pearson correlation (0.79), RMSEk (0.01), RMSPEl (29.52), and MAPEm (69.54) |
| Yadav et al, 2016 [ | Patient data deidentification | RNN | The proposed approach achieved best performance, with 89.63, 90.73, 90.18 for recall, precision, and F-score, respectively |
| Hassanien et al, 2013[ | Classification of diagnoses | RNN | Models outperformed several strong baselines, including a multilayer perceptron trained on hand-engineered features |
| Li et al, 2014 [ | Identifying informative risk factors and predicting bone disease | DBNn | Proposed framework predicted the progression of osteoporosis from risk factors and provided information to improve the understanding of the disease |
| Che et al, 2015 [ | Detection of characteristic patterns of physiology | DBN | The empirical efficacy of the technique was demonstrated on 2 real-world hospital datasets and the model was able to learn interpretable and clinically relevant features |
| Tran et al, 2015 [ | Harness electronic health record with minimal human supervision | DBMo | The model achieved F-scores of 0.21 for moderate-risk and 0.36 for high-risk, which are significantly higher than those obtained by clinicians and competitive with the results obtained by support vector machine |
| Miotto et al, 2016 [ | Predict future of patients | AEp | Results significantly outperformed those achieved using representations based on raw electronic health record data and alternative feature learning strategies |
| Lv et al, 2016 [ | Clinical relation extraction | AE | The proposed model is validated on the dataset of i2b2 2010. The DL method for feature optimization showed great potential |
| Lasko et al, 2013 [ | Inferring phenotypic patterns | AE | The model distinguished the uric acid signatures of gout and acute leukemia despite not being optimized for the task |
aCNN: convolutional neural network.
bAUC: area under the curve.
cNLP: natural language processing.
dDL: deep learning.
eNIH: national institutes of health.
fNLM: national library of medicine.
gRNN: recurrent neural network.
hMADE: medication and adverse drug events.
iCRF: conditional random fields.
jLSTM: long short-term memory.
kRMSE: root mean square error.
lRMSPE: root mean square percentage error.
mMAPE: mean absolute percentage error.
nDBN: deep belief network.
oDBM: deep Boltzmann machine.
pAE: autoencoder.
Application of deep learning techniques in medical images.
| Reference | Task | Method | Remark |
| Shi et al, 2018 [ | Occult invasive disease prediction | CNNa | The performance result exceeded handcrafted computer vision technique that was designed with prior domain knowledge. It achieved operating characteristic curve of 0.70 and 95% CI |
| Wang et al, 2018 [ | Alcohol detection | CNN | The method used multiple images in the experiment and achieved 96.88% sensitivity, specificity of 97.18%, and accuracy of 97.04% |
| Moeskops et al, 2016 [ | Tissue segmentation | CNN | The result demonstrates accurate segmentation in all datasets and the robustness to different age and acquisition protocol |
| Abiyev et al, 2017 [ | Chest disease detection | CNN | Demonstrate accurate classification of chest pathologies such as chronic obstructive pulmonary disease, pneumonia, asthma, tuberculosis, and lung diseases in chest x-rays |
| Liu et al, 2016 [ | Food image recognition for dietary assessment | CNN | These results outperformed all other reported work such as DeepFoodCam using UEC-256 and Food-101 dataset |
| Nie et al, 2017 [ | Detection of standard sagittal plane in pregnancy | DBNb | The model provides knowledge to avoid unnecessary massive searching and corresponding huge computation load |
| Zhang et al, 2016 [ | Benign and malignant breast tumors differentiation | DBMc | Results showed that the deep learning method achieved better classification performance with an accuracy of 93.4%, a sensitivity of 88.6%, a specificity of 97.1%, and an area under the receiver operating characteristic curve of 0.947 |
| Cao et al, 2014 [ | Cancer clinical practice and research | DBM | Experimental results with large volume of real-world medical images showed that multimodal approach is a promising solution for the next generation medical image indexing and retrieval system |
| Wu et al, 2018 [ | Tracking motion of the heart | DBM | Heart shape model that characterizes the statistical variations in heart shapes present in a training dataset for tracking motion of the heart |
| Jang et al, 2017 [ | Four sensorimotor classification | DBM | Identified task-specific (left hand clenching, right hand clenching, auditory attention, and visual stimulus) features and classification of functional/structural magnetic resonance imaging volumes |
| Suk et al, 2014 [ | Alzheimer disease identification | DBM | Achieved maximum accuracy of 95.35%, outperforming other computing methods |
| Khatami et al, 2016 [ | Extract high-level features from medical images | DBN | Experimental results show that the proposed model improves about 0.07% performance compared with other models |
| Zhang et al, 2019 [ | Discovering hierarchical common brain networks | DBN | Three hierarchical layers with hundreds of common and consistent brain networks across individual brains were successfully constructed |
| Hu et al, 2019 [ | Cancer diagnosis | AEd | Diagnosed malignant mesothelioma, a rare but aggressive cancer because of its composite epithelial/mesenchymal pattern |
| Uzunova et al, 2018 [ | Pathology detection | AE | Experiments on 2-dimensional and 3-dimensional datasets show that the approach is suitable for detection of pathologies and deliver reasonable dice coefficient result |
| Lee et al, 2018 [ | Benign and malignant tumor classification | AE | The results show that when deep learning algorithm is applied on sonograms after intensity inhomogeneity correction, there is a significant increase of the tumor classification accuracy |
| Seebock et al, 2018 [ | Age-related macular degeneration classification | AE | Used markers to classify early and late age-related macular degeneration cases. The model yields an accuracy of 81.40% |
| Wang et al, 2019 [ | Spine disease diagnosis | AE | Achieved higher localization accuracy, low model complexity, and without the need for any assumptions about visual field in computed tomography scans |
| Malek et al, 2017 [ | Image description for visually impaired | AE | Fusing a set of AE-learned features gave higher classification rates with regard to using the features individually |
| Zhang et al, 2016 [ | Histopathological images analysis | AE | The method effectively combined the strength of multiple features adaptively as inputs and achieves 91.67% classification accuracy |
| Xia et al, 2016 [ | Human attention process | AE | Experimental results on several benchmark datasets show that in accordance with different inputs, the network can learn distinct basic features for saliency modeling in its encoding layer |
| Mano et al, 2018 [ | Chronic back pain detection | AE | Experimental results from patients in the United Kingdom and Japan (41 patients, 56 controls), achieved accuracy of 63%, with 68% in cross-validation of all data |
aCNN: convolutional neural network.
bDBN: deep belief network.
cDBM: deep Boltzmann machine.
dAE: autoencoder.
Deep learning technique for sensors and physiological signal task.
| Reference | Task | Method | Remark |
| Zeng et al, 2018 [ | Predict mental states of drivers | CNNa | Predicted the mental states of drivers from electroencephalography (EEG) signals using 2 mental state classification models called EEG-Conv and EEG-Conv-R |
| Zhang et al, 2017 [ | Sleep stage classification | CNN | The total accuracy and kappa coefficient of the proposed method are 92% and 0.84, respectively |
| Veiga et al, 2017 [ | Human activity classification | CNN | The exercises could be recognized with 95.89% accuracy |
| Lenz et al, 2011 [ | Interactions in human brain | CNN | Result showed regions of human brain affected by interactions and activities |
| Murad et al, 2017 [ | Human activity recognition | RNNb | Experimental results showed that the proposed method outperforms methods employing conventional machine learning algorithms, such as support vector machine and k-nearest neighbors |
| Liu et al, 2016 [ | Predicting driving fatigue | RNN | Identified brain dynamics in predicting car driving fatigue. The model was evaluated using the generalized cross-subject approach |
| Yu et al, 2015 [ | Human action classification | RNN | Real-time human action classification for recording and regenerating both action sequences and action classification tasks from continuous signal |
| Vakulenko et al, 2017[ | Human body motion analysis | RNN | The method generated missing information from human body motions from sparse control marker settings |
| Mo L et al, 2016; Ordóñez et al, 2016 [ | Human physical activity recognition | RNN | The model can recognize 12 types of activities and the accuracy rate was 81.8%. CNN was applied for feature extraction and long short-term memory model for the human physical activity recognition |
| Mathews et al, 2018 [ | Cardiac arrhythmias diagnosis | DBMc | Single-lead electrocardiogram model detected cardiac abnormalities which was comparable with human expert |
| Chu et al, 2018 [ | Recovery motor imagery | DBNd | Recognize and restructure the incomplete motor imagery in electroencephalogram signals for recovery motor imagery-based treatment |
| Turner et al, 2014 [ | Seizure detection | DBN | High resolution biosensing multichannel model achieved personalized health monitoring |
| Chao et al, 2018 [ | Emotion recognition | DBM | The results showed that the proposed framework outperforms other machine learning classifiers |
| Jindal, 2016 [ | Heart rate monitoring | DBM | Technique was able to predict heart rate with a 10-fold cross-validation error margin of 4.88% |
| Hassan et al, 2018 [ | Human activity recognition | DBN | The proposed approach outperformed traditional expression recognition approaches such as typical multiclass support vector machine and artificial neural network |
| Ruiz-Rodríguez et al, 2014 [ | Blood pressure monitoring | DBM | Continuous noninvasive blood pressure monitoring system with performance higher than benchmark methods |
| Yuan et al, 2019 [ | Epileptic seizures detection | AEe | Experimental results showed that the proposed model was able to achieve higher average accuracy and F-score of 94.37% and 85.34%, respectively |
| Jirayucharoensak et al, 2014 [ | Detection of emotion | AE | The model provided better performance compared with support vector machine and Naive Bayes classifiers |
| Jokanović 2017 [ | Human fall detection | AE | Experimental data were used to demonstrate the superiority of the model over principal component analysis method |
| Xia et al, 2018 [ | Cardiac arrhythmia classification | AE | The results from the model (99.8% accuracy) showed that the classification performance of the proposed approach outperforms most of the state-of-the-art methods |
aCNN: convolutional neural network.
bRNN: recurrent neural network.
cDBM: deep Boltzmann machine.
dDBN: deep belief network.
eAE: autoencoder.
Figure 15Current working technique for application of deep learning with biomedical data.
Figure 16Expected required technique in biomedical application of deep learning.
Figure 17Deep learning biomedical data streaming architecture, challenges, and applications.
Figure 18Overview of external challenges in acquiring and analyzing medical big data.
Figure 19Deep learning service for medical internet of things (IoT) with edge computing and mobile apps for continuous health care monitoring using magnetic resonance images (MRI) and signals such as electrocardiogram (ECG), electroencephalogram (EEG), electromyography (EMG).