| Literature DB >> 34426802 |
Iqbal H Sarker1,2.
Abstract
Deep learning (DL), a branch of machine learning (ML) and artificial intelligence (AI) is nowadays considered as a core technology of today's Fourth Industrial Revolution (4IR or Industry 4.0). Due to its learning capabilities from data, DL technology originated from artificial neural network (ANN), has become a hot topic in the context of computing, and is widely applied in various application areas like healthcare, visual recognition, text analytics, cybersecurity, and many more. However, building an appropriate DL model is a challenging task, due to the dynamic nature and variations in real-world problems and data. Moreover, the lack of core understanding turns DL methods into black-box machines that hamper development at the standard level. This article presents a structured and comprehensive view on DL techniques including a taxonomy considering various types of real-world tasks like supervised or unsupervised. In our taxonomy, we take into account deep networks for supervised or discriminative learning, unsupervised or generative learning as well as hybrid learning and relevant others. We also summarize real-world application areas where deep learning techniques can be used. Finally, we point out ten potential aspects for future generation DL modeling with research directions. Overall, this article aims to draw a big picture on DL modeling that can be used as a reference guide for both academia and industry professionals.Entities:
Keywords: Artificial intelligence; Artificial neural network; Deep learning; Discriminative learning; Generative learning; Hybrid learning; Intelligent systems
Year: 2021 PMID: 34426802 PMCID: PMC8372231 DOI: 10.1007/s42979-021-00815-1
Source DB: PubMed Journal: SN Comput Sci ISSN: 2661-8907
Fig. 1Schematic representation of the mathematical model of an artificial neuron (processing element), highlighting input (), weight (w), bias (b), summation function (), activation function (f) and output signal (y)
Fig. 2An illustration of the position of deep learning (DL), comparing with machine learning (ML) and artificial intelligence (AI)
Fig. 3An illustration of the performance comparison between deep learning (DL) and other machine learning (ML) algorithms, where DL modeling from large amounts of data can increase the performance
Fig. 4A typical DL workflow to solve real-world problems, which consists of three sequential stages (i) data understanding and preprocessing (ii) DL model building and training (iii) validation and interpretation
Fig. 5A general architecture of a a shallow network with one hidden layer and b a deep neural network with multiple hidden layers
Fig. 6A taxonomy of DL techniques, broadly divided into three major categories (i) deep networks for supervised or discriminative learning, (ii) deep networks for unsupervised or generative learning, and (ii) deep networks for hybrid learning and relevant others
Fig. 7An example of a convolutional neural network (CNN or ConvNet) including multiple convolution and pooling layers
Fig. 8Basic structure of a gated recurrent unit (GRU) cell consisting of reset and update gates
Fig. 9Schematic structure of a standard generative adversarial network (GAN)
Fig. 10Schematic structure of a sparse autoencoder (SAE) with several active units (filled circle) in the hidden layer
A summary of deep learning tasks and methods in several popular real-world applications areas
| Application areas | Tasks | Methods | References |
|---|---|---|---|
| Healthcare and Medical applications | Regular health factors analysis | CNN-based | Ismail et al. [ |
| Identifying malicious behaviors | RNN-based | Xue et al. [ | |
| Coronary heart disease risk prediction | Autoencoder based | Amarbayasgalan et al. [ | |
| Cancer classification | Transfer learning based | Sevakula et al. [ | |
| Diagnosis of COVID-19 | CNN and BiLSTM based | Aslan et al. [ | |
| Detection of COVID-19 | CNN-LSTM based | Islam et al. [ | |
| Natural Language Processing | Text summarization | Auto-encoder based | Yousefi et al. [ |
| Sentiment analysis | CNN-LSTM based | Wang et al. [ | |
| Sentiment analysis | CNN and Bi-LSTM based | Minaee et al. [ | |
| Aspect-level sentiment classification | Attention-based LSTM | Wang et al. [ | |
| Speech recognition | Distant speech recognition | Attention-based LSTM | Zhang et al. [ |
| Speech emotion classification | Transfer learning based | Latif et al. [ | |
| Emotion recognition from speech | CNN and LSTM based | Satt et al. [ | |
| Cybersecurity | Zero-day malware detection | Autoencoders and GAN based | Kim et al. [ |
| Security incidents and fraud analysis | SOM-based | Lopez et al. [ | |
| Android malware detection | Autoencoder and CNN based | Wang et al. [ | |
| intrusion detection classification | DBN-based | Wei et al. [ | |
| DoS attack detection | RBM-based | Imamverdiyev et al. [ | |
| Suspicious flow detection | Hybrid deep-learning-based | Garg et al. [ | |
| Network intrusion detection | AE and SVM based | Al et al. [ | |
| IoT and Smart cities | Smart energy management | CNN and Attention mechanism | Abdel et al. [ |
| Particulate matter forecasting | CNN-LSTM based | Huang et al. [ | |
| Smart parking system | CNN-LSTM based | Piccialli et al. [ | |
| Disaster management | DNN-based | Aqib et al. [ | |
| Air quality prediction | LSTM-RNN based | Kok et al. [ | |
| Cybersecurity in smart cities | RBM, DBN, RNN, CNN, GAN | Chen et al. [ | |
| Smart Agriculture | A smart agriculture IoT system | RL-based | Bu et al. [ |
| Plant disease detection | CNN-based | Ale et al. [ | |
| Automated soil quality evaluation | DNN-based | Sumathi et al. [ | |
| Business and Financial Services | Predicting customers’ purchase behavior | DNN based | Chaudhuri [ |
| Stock trend prediction | CNN and LSTM based | anuradha et al. [ | |
| Financial loan default prediction | CNN-based | Deng et al. [ | |
| Power consumption forecasting | LSTM-based | Shao et al. [ | |
| Virtual Assistant and Chatbot Services | An intelligent chatbot | Bi-RNN and Attention model | Dhyani et al. [ |
| Virtual listener agent | GRU and LSTM based | Huang et al. [ | |
| Smart blind assistant | CNN-based | Rahman et al. [ | |
| Object Detection and Recognition | Object detection in X-ray images | CNN-based | Gu et al. [ |
| Object detection for disaster response | CNN-based | Pi et al. [ | |
| Medicine recognition system | CNN-based | Chang et al. [ | |
| Face recognition in IoT-cloud environment | CNN-based | Masud et al. [ | |
| Food recognition system | CNN-based | Liu et al. [ | |
| Affect recognition system | DBN-based | Kawde et al. [ | |
| Facial expression analysis | CNN and LSTM based | Li et al. [ | |
| Recommendation and Intelligent system | Hybrid recommender system | DNN-based | Kiran et al. [ |
| Visual recommendation and search | CNN-based | Shankar et al. [ | |
| Recommendation system | CNN and Bi-LSTM based | Rosa et al. [ | |
| Intelligent system for impaired patients | RL-based | Naeem et al. [ | |
| Intelligent transportation system | CNN-based | Wang et al. [ |
Fig. 11A general structure of transfer learning process, where knowledge from pre-trained model is transferred into new DL model
Fig. 12Schematic structure of deep reinforcement learning (DRL) highlighting a deep neural network
Fig. 13Several potential real-world application areas of deep learning