| Literature DB >> 35999828 |
Mahmoud M Bassiouni1, Ripon K Chakrabortty2, Omar K Hussain3, Humyun Fuad Rahman4.
Abstract
The ongoing COVID-19 pandemic has created an unprecedented predicament for global supply chains (SCs). Shipments of essential and life-saving products, ranging from pharmaceuticals, agriculture, and healthcare, to manufacturing, have been significantly impacted or delayed, making the global SCs vulnerable. A better understanding of the shipment risks can substantially reduce that nervousness. Thenceforth, this paper proposes a few Deep Learning (DL) approaches to mitigate shipment risks by predicting "if a shipment can be exported from one source to another", despite the restrictions imposed by the COVID-19 pandemic. The proposed DL methodologies have four main stages: data capturing, de-noising or pre-processing, feature extraction, and classification. The feature extraction stage depends on two main variants of DL models. The first variant involves three recurrent neural networks (RNN) structures (i.e., long short-term memory (LSTM), Bidirectional long short-term memory (BiLSTM), and gated recurrent unit (GRU)), and the second variant is the temporal convolutional network (TCN). In terms of the classification stage, six different classifiers are applied to test the entire methodology. These classifiers are SoftMax, random trees (RT), random forest (RF), k-nearest neighbor (KNN), artificial neural network (ANN), and support vector machine (SVM). The performance of the proposed DL models is evaluated based on an online dataset (taken as a case study). The numerical results show that one of the proposed models (i.e., TCN) is about 100% accurate in predicting the risk of shipment to a particular destination under COVID-19 restrictions. Unarguably, the aftermath of this work will help the decision-makers to predict supply chain risks proactively to increase the resiliency of the SCs.Entities:
Keywords: COVID-19; Classifiers; Convolutional network; Deep learning; Supply chain risk; Temporal convolutional network
Year: 2022 PMID: 35999828 PMCID: PMC9389854 DOI: 10.1016/j.eswa.2022.118604
Source DB: PubMed Journal: Expert Syst Appl ISSN: 0957-4174 Impact factor: 8.665
Summary of the Literature Review.
| Reference | Problem | Data | Mechanism | Algorithm | Results |
|---|---|---|---|---|---|
| Predicting product shortage risk | Drug distribution company in Iran | Stochastic approach | Multi-stage stochastic integer programming | Their model helped to enhance the company profit by nearly 3.27% | |
| Model a second-generation Biodiesel SC network under risk | Two main biodiesel production in Iran known as JCL and UCO | Fuzzy logic approach | Probabilistic fuzzy logic programming method | Reduction in the total cost of Biodiesel SC | |
| Random disruptions in SC | Plastic pipe industry | Hybrid approach | Stochastic (bi-objective optimization based on fuzzy-means clustering) | Maximize the overall sustainability performance in disruptions | |
| Accommodation of carbon tax with tax rate uncertainty | Data obtained from carbon factories, warehouses, and collection centers | Hybrid approach | Hybrid robust stochastic combined with probabilistic scenario | Product flows adjustment to tax rates shows a small benefit | |
| Capture the interdependence between risks | Global manufacturing SC | Network approach | BBN | Prioritizing risks and strategies | |
| Risks in the production and delivery of goods from a source to a destination | Shipping goods from China to Brazil | Agent approach | Agent-based model | Enabling the ability to model, analyze, control, and monitor the shipment of goods | |
| Selection of supplier during a set of risk factors | Hypothetical data for five different suppliers | Reasoning approach | A rule-based fuzzy inference engine | Helped to obtain the best supplier | |
| Risks and factors affect the SC | Some Financial enterprises | Machine learning | BPNN | Provide good references for enterprise effective decision system | |
| Classification of the retailers based on risk levels | ISACO: a leading distributor for motor vehicles | Machine learning | SOM | Formulation of the risk mitigation methods based on the level of risks | |
| Transport time risks in air cargo SC | Leading forwarder on routes served by airlines | Machine learning | Bayesian parametric model | Assist the forwarder to offer reasonable service and price, enable fair supplier evaluation | |
| Factors and occurrence of risk propagation | Leading automotive organization in India | Machine learning | Bayes network | Detection of the mean service level at maximum or minimum when disruption occurs | |
| Achieve resilience in case of disasters | Data collected from tweets, news, Facebook, WordPress | Machine learning | Big data model | Concluded that Swift trust, information sharing, public–private partnership are the important factors for resilience SC | |
| Prediction of financial data | 20 market stocks | Machine learning | (PLS-WSVM) | Able to detect the turning points if the market falls or rises to a point for a long time | |
| Risk assessment of SC finance | Financial companies from China | Machine learning | CSVM, PSVM, EN-CSVM, EN-PSVM, En-AdaPSVM | Enhancing the credit assessment accuracy | |
| Forecasting multi-channel retail demand | Data obtained from a multi-channel retailer | Deep learning | RF | Ranked the explanatory variable according to the relative importance | |
| Dynamic demand for station-free bike-sharing | Data obtained from downtown area of Nanjing city | Deep learning | LSTM-NNs | Forecast the gap between inflow and outflow of sharing bike trip so a re-balance can be formed during sharing bikes | |
| Practical flight delay prediction | Real data of arrival and departure flights from PEK airport | Deep learning | DBN | An efficient handling of large data to obtain main factors of flight delays. | |
| Reconstruction of the investment portfolio | Data obtained from Yahoo finance | Deep learning | LSTM | Achieved competitive financial performance and social influence | |
| Predicting the growth rates, demand for products and services during the COVID-19 pandemic | Data obtained from google trends and governmental decision of the lockdown | DL with other approaches | Time-series, | Helped the policy makers and planners to make better decisions during the next pandemics | |
| Risks occurred in the agricultural SC during COVID-19 pandemic | Data obtained from 20 companies from their investment in plants | Fuzzy logic approach | FLQ-QWAO | Efficient prediction of different risks in all companies whether micro, small, medium, and multi-national | |
Fig. 1The proposed methodology for predicting shipment data during the COVID restrictions.
Fig. 3Deep learning model based on the BiLSTM layers.
Fig. 2Deep learning model based on the LSTM Layers.
Parameters of the RNN LSTM network layers.
| Layer no. | Layer name | Parameters of each layer | Activations | Learnables |
|---|---|---|---|---|
| 1 | Sequence input layer | Number of Inputs | 16 | – |
| 2 | First LSTM layer | Number of hidden units | 150 | IW |
| 3 | First drop out layer | Drop out Quantity | 150 | |
| 4 | Second LSTM layer | Number of hidden units | 200 | IW |
| 5 | Second drop out layer | Drop Out Quantity | 200 | W |
| 6 | Fully connected layer | Output size | 2 | – |
| 7 | SoftMax Layer | Number of Outputs | 2 | – |
| 8 | Classification layer | Loss function | – | – |
Fig. 4Deep learning model based on the GRU layers.
Parameters of the stacked BiLSTM network layers.
| Layer no. | Layer name | Parameters of each layer | Activations | Learnables |
|---|---|---|---|---|
| 1 | Sequence input layer | Number of Inputs | 16 | – |
| 2 | First BiLSTM layer | Number of hidden units | 300 | IW |
| 3 | First drop out layer | Drop out Quantity | 300 | – |
| 4 | Second BiLSTM layer | Number of hidden units | 300 | IW |
| 5 | Second drop out layer | Drop Out Quantity | 300 | W |
| 6 | Fully connected layer | Output size | 2 | – |
| 7 | SoftMax Layer | Number of Outputs | 2 | – |
| 8 | Classification layer | Loss function | – | – |
Fig. 5(a) Overall TCN model (b) Structure of residual block (c) Structure of dilated casual convolutional layer.
Parameters of the stacked GRU network layers.
| Layer No. | Layer name | Parameters of each layer | Activations | Learnables |
|---|---|---|---|---|
| 1 | Sequence input layer | Number of Inputs | 16 | – |
| 2 | First GRU layer | Number of hidden units | 150 | IW |
| 3 | First drop out layer | Drop out Quantity | 150 | – |
| 4 | Second GRU layer | Number of hidden units | 300 | IW |
| 5 | Second drop out layer | Drop Out Quantity | 300 | W |
| 6 | Fully connected layer | Output size | 2 | – |
| 7 | SoftMax layer | Number of Outputs | 2 | – |
| 8 | Classification layer | Loss function | – | – |
Parameters of the TCN network layers.
| Layers parameters | Values assigned experimentally | |||
|---|---|---|---|---|
| Number of blocks | 4 | |||
| Number of filters | 175 | |||
| Filter size | 3 | |||
| Drop out factor | 0.05 | |||
| Number of input channels | 16 | |||
| Blocks | ||||
| Block 1 | Conv1: Weights | Conv2: Weights | Conv3: Weights | |
| Block 2 | Conv1: Weights | Conv1: Weights | ||
| Block 3 | Conv1: Weights | Conv1: Weights | ||
| Block 4 | Conv1: Weights | Conv1: Weights | ||
| Optional 1 × 1 convolutional layer | Weights: 1 × 16 × 175 | |||
| Fully connected layer | Weights: 2 × 175 | |||
Parameters of each classifier applied on the methodologies.
| Classifiers | Parameters |
|---|---|
| SoftMax | Loss function |
| Support vector machine (SVM) | Batch Size |
| Artificial neural network (ANN) | Learning Rate |
| Random trees (RT) | Batch Size |
| Random Forest (RF) | Batch Size |
| k-nearest neighbor (KNN) | KNN |
Parameters of the training options for the proposed RNN Models.
| Training parameters | Values tested before reaching the final model | Stacked LSTM model | Stacked BiLSTM model | Stacked GRU model |
|---|---|---|---|---|
| Optimizer | Sgdm, Adam, Rmsprop | Adam | Adam | Adam |
| Gradient decay factor | 0.5, 0.7, 0.9, 0.95 0.99 | 0.95 | 0.7 | 0.95 |
| Mini batch size | 1, 8, 16, 32, 64 | 64 | 16 | 32 |
| Initial learning rate | 0.01, 0.001, 0.0001 | 0.001 | 0.001 | 0.001 |
| learning rate schedule | “Constant”, “Piece wise” | “Constant” | “Constant” | “Constant” |
| Max epochs | 10, 15, 20, 25, 30 | 15 | 10 | 15 |
| Iterations per epochs | 38 836 | 38 836 | 38 836 | 38 836 |
| Total number of iterations | 388 360,582 540,776 720, 970 900,1 165 080 | 582 540 | 388 360 | 582 540 |
| L2 Regularization | 0.1, 0.01, 0.001, 0.0001 | 0.001 | 0.1 | 0.0001 |
| Gradient threshold method | “l2-norm”, “global-l2norm” | “l2-norm” | “l2-norm” | “l2-norm” |
| Gradient threshold value | 1, 2, 3, 4, 5, Inf | Inf | Inf | Inf |
| Validation frequency | 50 000, 101 059, 150 000 | 101 059 | 101 059 | 101 059 |
Parameters of the training options for the proposed TCN Model.
| Training parameters | Values tested before reaching the final model | TCN Model |
|---|---|---|
| Optimizer | Sgdm, Adam, Rmsprop | Adam |
| Gradient decay factor | 0.5, 0.7, 0.9, 0.95, 0.99 | 0.99 |
| Mini batch size | 1, 8, 16, 32, 64 | 1 |
| Initial learning rate | 0.01, 0.001, 0.0001 | 0.1 |
| Learning rate schedule | “Constant”, “Piece wise” | “Piecewise” |
| Learning rate drop period | 2, 5, 7, 9 | 5 |
| Learning rate drop factor | 0.2, 0.4, 0.5, 0.7 0.9 | 0.9 |
| Max epochs | 100, 200, 300, 400, 500 | 400 |
| Iterations per epochs | 1 | 1 |
| Total number of iterations | 100, 200, 300, 400, 500 | 400 |
| L2 Regularization | 0.1, 0.01, 0.001, 0.0001 | 0.0001 |
| Gradient threshold method | “l2-norm”, “global-l2norm” | “global-l2norm” |
| Gradient threshold value | 1, 2, 3, 4, 5, Inf | 1 |
| Validation frequency | 20, 50, 70 100 | 70 |
Fig. 6Training and loss curves of the stacked LSTM model for shipments prediction.
The results of the stacked LSTM model on the validation data in terms of various statistical performance measurements.
| Classifiers | TP | FP | FN | TN | SEN | SPEC | P | A | FPR | FNR | F1 | MCC | K |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Softmax | 469 437 | 10 262 | 16 104 | 4198 | 0.966 | 0.290 | 0.978 | 0.709 | 0.033 | 0.972 | 0.218 | 0.215 | |
| RT | 468 528 | 11 171 | 15 895 | 4407 | 0.967 | 0.282 | 0.976 | 0.717 | 0.032 | 0.971 | 0.220 | 0.218 | |
| RF | 473 440 | 6259 | 16 539 | 3763 | 0.966 | 0.375 | 0.986 | 0.624 | 0.033 | 0.976 | 0.242 | 0.227 | |
| KNN | 473 290 | 6409 | 16 340 | 3962 | 0.966 | 0.382 | 0.986 | 0.617 | 0.033 | 0.976 | 0.251 | 0.237 | |
| ANN | 479 263 | 436 | 17 740 | 2562 | 0.964 | 0.854 | 0.999 | 0.145 | 0.035 | 0.981 | 0.320 | 0.211 | |
| SVM | 473 290 | 6409 | 16 340 | 3962 | 0.966 | 0.382 | 0.986 | 0.617 | 0.033 | 0.976 | 0.251 | 0.237 |
The results of the stacked LSTM model on the test data in terms of various statistical performance measurements.
| Classifiers | TP | FP | FN | TN | SEN | SPEC | P | A | FPR | FNR | F1 | MCC | K |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Softmax | 469 162 | 10 355 | 16 306 | 4178 | 0.966 | 0.287 | 0.978 | 0.712 | 0.033 | 0.972 | 0.215 | 0.211 | |
| RT | 468 277 | 11 240 | 16 173 | 4311 | 0.966 | 0.277 | 0.976 | 0.722 | 0.033 | 0.054 | 0.213 | 0.211 | |
| RF | 473 194 | 6323 | 16 794 | 3690 | 0.986 | 0.180 | 0.967 | 0.819 | 0.013 | 0.976 | 0.236 | 0.221 | |
| KNN | 473 015 | 6502 | 16 584 | 3900 | 0.966 | 0.374 | 0.986 | 0.625 | 0.033 | 0.976 | 0.245 | 0.231 | |
| ANN | 479 110 | 407 | 18 079 | 2405 | 0.963 | 0.855 | 0.9991 | 0.144 | 0.036 | 0.9810 | 0.308 | 0.198 | |
| SVM | 473 015 | 6502 | 16 584 | 3900 | 0.9661 | 0.374 | 0.986 | 0.625 | 0.033 | 0.976 | 0.245 | 0.231 |
Fig. 7ROC of the stacked LSTM model using various classifiers on the shipments test data prediction.
Fig. 10ROC of the stacked BiLSTM model using various classifiers on the shipments test data prediction.
Fig. 8Confusion matrix of the stacked LSTM model using various classifiers on the shipments test data prediction.
Fig. 9Training and loss curves of the stacked BiLSTM model for shipments prediction.
The results of the stacked-BiLSTM model on the validation data in terms of various statistical performance measurements.
| Softmax | 476 955 | 2562 | 16 987 | 3497 | 0.965 | 0.577 | 0.994 | 0.422 | 0.034 | 0.979 | 0.299 | 0.249 | |
| RT | 476 428 | 3089 | 16 334 | 4150 | 0.966 | 0.573 | 0.993 | 0.426 | 0.033 | 0.980 | 0.325 | 0.284 | |
| RF | 478 230 | 1287 | 17 045 | 3439 | 0.965 | 0.727 | 0.997 | 0.272 | 0.034 | 0.981 | 0.338 | 0.261 | |
| KNN | 479 397 | 120 | 17 966 | 2518 | 0.963 | 0.954 | 0.999 | 0.045 | 0.036 | 0.981 | 0.335 | 0.210 | |
| ANN | 479 419 | 98 | 18 113 | 2371 | 0.963 | 0.960 | 0.999 | 0.039 | 0.036 | 0.981 | 0.326 | 0.199 | |
| SVM | 479 318 | 199 | 17 242 | 3241 | 0.965 | 0.942 | 0.999 | 0.057 | 0.034 | 0.982 | 0.378 | 0.262 |
The results of the stacked-BiLSTM model on the test data in terms of various statistical performance measurements.
| Classifiers | TP | FP | FN | TN | SEN | SPEC | P | A | FPR | FNR | F1 | MCC | K |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Softmax | 477 224 | 2475 | 16 633 | 3669 | 0.966 | 0.597 | 0.994 | 0.402 | 0.033 | 0.980 | 0.314 | 0.263 | |
| RT | 476 707 | 2992 | 16 062 | 4240 | 0.967 | 0.586 | 0.993 | 0.413 | 0.032 | 0.980 | 0.334 | 0.292 | |
| RF | 478 484 | 1215 | 16 704 | 3598 | 0.966 | 0.747 | 0.997 | 0.252 | 0.033 | 0.981 | 0.353 | 0.275 | |
| KNN | 479 578 | 121 | 17 665 | 2637 | 0.964 | 0.956 | 0.999 | 0.043 | 0.035 | 0.981 | 0.345 | 0.221 | |
| ANN | 479 604 | 95 | 17 810 | 2492 | 0.964 | 0.963 | 0.999 | 0.036 | 0.035 | 0.981 | 0.337 | 0.210 | |
| SVM | 479 503 | 196 | 16 947 | 3355 | 0.965 | 0.944 | 0.999 | 0.055 | 0.034 | 0.982 | 0.387 | 0.272 |
Fig. 11Confusion matrix of the stacked BiLSTM model using various classifiers on the shipments test data prediction.
Fig. 12Training and loss curves of the stacked GRU model for shipments prediction.
The results of the stacked-GRU model on the validation data in terms of various statistical performance measurements.
| Classifiers | TP | FP | FN | TN | SEN | SPEC | P | A | FPR | FNR | F1 | MCC | K |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Softmax | 468 261 | 1438 | 17 747 | 2555 | 0.963 | 0.639 | 0.996 | 0.360 | 0.036 | 0.979 | 0.272 | 0.199 | |
| RT | 474 314 | 5385 | 16 379 | 3923 | 0.966 | 0.451 | 0.990 | 0.548 | 0.033 | 0.978 | 0.274 | 0.245 | |
| RF | 474 993 | 4706 | 16 436 | 3866 | 0.966 | 0.451 | 0.990 | 0.548 | 0.033 | 0.978 | 0.274 | 0.249 | |
| KNN | 479 602 | 97 | 18 017 | 2285 | 0.963 | 0.959 | 0.999 | 0.040 | 0.036 | 0.981 | 0.322 | 0.194 | |
| ANN | 481 349 | 864 | 17 788 | 2514 | 0.964 | 0.744 | 0.998 | 0.025 | 0.035 | 0.980 | 0.294 | 0.203 | |
| SVM | 479 644 | 55 | 18 188 | 2114 | 0.963 | 0.974 | 0.999 | 0.025 | 0.036 | 0.981 | 0.312 | 0.181 |
Fig. 13ROC of the stacked GRU model using various classifiers on the shipments test data prediction.
Fig. 14Confusion matrix of the stacked GRU model using various classifiers on the shipments test data prediction.
The results of the stacked-GRU model on the test data in terms of various statistical performance measurements.
| Classifiers | TP | FP | FN | TN | SEN | SPEC | P | A | FPR | FNR | F1 | MCC | K |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Softmax | 478 199 | 1318 | 18 075 | 2409 | 0.963 | 0.646 | 0.997 | 0.353 | 0.035 | 0.980 | 0.264 | 0.188 | |
| RT | 473 995 | 5522 | 16 680 | 3804 | 0.9659 | 0.439 | 0.990 | 0.560 | 0.034 | 0.977 | 0.264 | 0.235 | |
| RF | 474 770 | 4747 | 16 757 | 3727 | 0.965 | 0.439 | 0.990 | 0.560 | 0.034 | 0.977 | 0.264 | 0.239 | |
| KNN | 479 435 | 82 | 18 347 | 2137 | 0.963 | 0.963 | 0.998 | 0.036 | 0.036 | 0.981 | 0.310 | 0.181 | |
| ANN | 478 693 | 824 | 18 090 | 2394 | 0.963 | 0.743 | 0.998 | 0.256 | 0.036 | 0.980 | 0.285 | 0.193 | |
| SVM | 479 469 | 48 | 18 518 | 1966 | 0.962 | 0.976 | 0.999 | 0.023 | 0.037 | 0.981 | 0.300 | 0.168 |
Fig. 15Training and loss curves of the TCN model for shipments prediction.
The results of the TCN model on the validation data in terms of various statistical performance measurements.
| Classifiers | TP | FP | FN | TN | SEN | SPEC | P | A | FPR | FNR | F1 | MCC | K |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Softmax | 478 931 | 768 | 17 008 | 3294 | 0.965 | 0.810 | 0.998 | 0.189 | 0.034 | 0.981 | 0.353 | 0.260 | |
| RT | 479 662 | 37 | 54 | 20 248 | 0.999 | 0.998 | 0.999 | 0.001 | 0.000 | 0.999 | 0.997 | 0.997 | |
| RF | 479 699 | 0 | 0 | 20 302 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1 | |
| KNN | 479 699 | 0 | 0 | 20 302 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1 | |
| ANN | 468 794 | 10 905 | 1271 | 19 031 | 0.997 | 0.635 | 0.977 | 0.364 | 0.002 | 0.987 | 0.750 | 0.745 | |
| SVM | 479 699 | 0 | 8104 | 12 198 | 0.983 | 1.000 | 1.000 | 0.000 | 0.016 | 0.991 | 0.768 | 0.742 |
The results of the TCN model on the test data in terms of various statistical performance measurements.
| Classifiers | TP | FP | FN | TN | SEN | SPEC | P | A | FPR | FNR | F1 | MCC | K |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Softmax | 478 726 | 791 | 17 223 | 3261 | 0.965 | 0.804 | 0.998 | 0.195 | 0.034 | 0.981 | 0.348 | 0.255 | |
| RT | 479 517 | 0 | 0 | 20 484 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1 | |
| RF | 479 517 | 0 | 0 | 20 484 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1 | |
| KNN | 479 517 | 0 | 0 | 20 484 | 1.0000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1 | |
| ANN | 479 517 | 0 | 8179 | 12 305 | 0.983 | 1.000 | 1.000 | 0.000 | 0.016 | 0.991 | 0.768 | 0.742 | |
| SVM | 479 473 | 44 | 59 | 20 425 | 0.998 | 0.997 | 0.999 | 0.002 | 0.000 | 0.999 | 0.997 | 0.997 |
Fig. 16ROC of the TCN model using various classifiers on the shipments test data prediction.
Fig. 17ROC of the TCN model using various classifiers on the shipments test data prediction.
Fig. 18Representation for the average sensitivity, specificity, precision and accuracy for the four proposed DL methodologies on the proposed experiment.
Fig. 19Visual representation of the average TP, TN, FP and FN measurements for shipments exported based on the DL methods.
Fig. 20(a) represents the radar plot for the proposed DL models based on the six classifiers (b) describes the box plot for the classifiers values obtained from the DL models.