| Literature DB >> 34884024 |
Ahmad Kamal Mohd Nor1, Srinivasa Rao Pedapati1, Masdi Muhammad1, Víctor Leiva2.
Abstract
Surveys on explainable artificial intelligence (XAI) are related to biology, clinical trials, fintech management, medicine, neurorobotics, and psychology, among others. Prognostics and health management (PHM) is the discipline that links the studies of failure mechanisms to system lifecycle management. There is a need, which is still absent, to produce an analytical compilation of PHM-XAI works. In this paper, we use preferred reporting items for systematic reviews and meta-analyses (PRISMA) to present a state of the art on XAI applied to PHM of industrial assets. This work provides an overview of the trend of XAI in PHM and answers the question of accuracy versus explainability, considering the extent of human involvement, explanation assessment, and uncertainty quantification in this topic. Research articles associated with the subject, since 2015 to 2021, were selected from five databases following the PRISMA methodology, several of them related to sensors. The data were extracted from selected articles and examined obtaining diverse findings that were synthesized as follows. First, while the discipline is still young, the analysis indicates a growing acceptance of XAI in PHM. Second, XAI offers dual advantages, where it is assimilated as a tool to execute PHM tasks and explain diagnostic and anomaly detection activities, implying a real need for XAI in PHM. Third, the review shows that PHM-XAI papers provide interesting results, suggesting that the PHM performance is unaffected by the XAI. Fourth, human role, evaluation metrics, and uncertainty management are areas requiring further attention by the PHM community. Adequate assessment metrics to cater to PHM needs are requested. Finally, most case studies featured in the considered articles are based on real industrial data, and some of them are related to sensors, showing that the available PHM-XAI blends solve real-world challenges, increasing the confidence in the artificial intelligence models' adoption in the industry.Entities:
Keywords: AI; PHM; PRISMA; XAI; explainable deep learning; machine learning; reliability; sensing and data extraction
Mesh:
Year: 2021 PMID: 34884024 PMCID: PMC8659640 DOI: 10.3390/s21238020
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Interest shown for the term “explainable AI” in Google searches.
Value and classification of the indicated metric.
| Value | <50% | 50–75% | 75–90% | 90–100% | |
|---|---|---|---|---|---|
| Metric | |||||
| Accuracy | Bad | Fair | Good | Very good | |
| Precision | Bad | Fair | Good | Very good | |
|
| |||||
|
| |||||
| F1 | Bad | Fair | Good | Very good | |
| AUC | Bad | Fair | Good | Very good | |
| PRAUC | Bad | Fair | Good | Very good | |
| Kappa | Bad | Fair | Good | Very good | |
Search strategy.
| Database and Date | Number of | Search Field and Keywords | Filters Applied |
|---|---|---|---|
| IEEE | 144 | Journals, Early Access Article, | |
| Science Direct | 607 | (“explainable” OR “interpretable”) AND (“prognostic” OR “diagnostic” OR “prognosis” OR “diagnosis” OR “anomaly detection” OR “RUL” OR “remaining useful life”) (“explainable AI” OR “explainable machine learning” OR “explainable deep learning” OR “XAI”) AND (“prognostic” OR “diagnostic” OR “anomaly detection” OR “RUL” OR “remaining useful life”) (“explainable AI” OR “explainable machine learning” OR “explainable deep learning” OR “XAI”) AND (“prognosis” OR “diagnosis” OR “anomaly detection” OR “RUL” OR “remaining useful life”) (“interpretable AI” OR “interpretable machine learning” OR “interpretable deep learning” OR “XAI”) AND (“prognostic” OR “diagnostic” OR “anomaly detection” OR “RUL” OR “remaining useful life”) (“interpretable AI” OR “interpretable machine learning” OR “interpretable deep learning” OR “XAI”) AND (“prognosis” OR “diagnosis” OR “anomaly detection” OR “RUL” OR “remaining useful life”) | |
| Springer | 291 | “explainable” OR “interpretable” AND “prognos” “explainable” OR “interpretable” AND “prognos” “explainable” OR “interpretable” AND “diagnos” “explainable” OR “interpretable” AND “diagnos” “explainable” OR “interpretable” AND “RUL” “explainable” OR “interpretable” AND “RUL” “explainable” OR “interpretable” AND “remaining useful life” “explainable” OR “interpretable” AND “remaining useful life” “explainable” OR “interpretable” AND “anomaly detection” “explainable” OR “interpretable” AND “anomaly detection” | |
| ACM Digital Library | 75 | ||
| Scopus | 1931 |
(“explainable” OR “interpretable”) AND (“prognostic” OR “diagnostic” OR “prognosis” OR “diagnosis” OR “anomaly detection” OR “RUL” OR “remaining useful life”) |
Analysis results of selected articles.
| ID | Authors and Year | Title | Publisher, | PHM | XAI Approach | Performance | XAI Assist PHM | Metric | Human Role | Uncertainty Management | Case Study |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | [ | On equivalence of FIS and ELM for interpretable rule-based knowledge representation | IEEE, IEEE Transactions on Neural Networks and Learning Systems | Diagnostic | Rule- and knowledge-based | Accuracy: 85.14% | Yes | No | No | No | Real— |
| 2 | [ | K-PdM: KPI-oriented machinery deterioration estimation framework for oredictive maintenance using cluster-based hidden Markov model | IEEE, | Prognostic | Rule- and knowledge-based | RMSE: 14.28 | No | No | No | Probabilistic state transition model | Simulated—Turbofan engine |
| 3 | [ | Unsupervised classification of multichannel profile data using PCA: An application to an emission control system | Elsevier, | Diagnostic | Cluster- | MSE: 2.127 × 10−5 | Yes | No | Yes | No | Real—Emission control system |
| 4 | [ | Forecasting remaining useful life: Interpretable deep learning approach via variational Bayesian inference | Elsevier, | Prognostic | Interpretable model | MAE: 13.267 | No | No | No | Uncertainty in model parameters | Simulated— |
| 5 | [ | Fault isolation in manufacturing systems based on learning algorithm and fuzzy rule selection | Springer, | Diagnostic | Rule- and knowledge- based | Accuracy: 97.01% | Yes | No | No | Probabilistic classification by Bayes decision rule | Real— |
| 6 | [ | Interpretable logic tree analysis: A data-driven fault tree methodology for causality analysis | Elsevier, | Diagnostic | LAD | Mean and standard errors are less than 2% and 1% | Yes | No | Yes | FTA—Expert opinion | Simulated— |
| 7 | [ | Unsupervised wireless spectrum anomaly detection with interpretable features | IEEE, IEEE Transactions on Cognitive Communications and Networking | Anomaly detection | Autoencoder | Generally better than | Yes | No | No | Probabilistic classification error by discriminator | Real—software defined radio spectrum |
| 8 | [ | An attention-augmented deep architecture for hard drive status monitoring in large-scale storag systems | ACM, ACM Transactions on Storage | Prognostic, diagnostic | Attention mechanism | Prognostic precision: 94.5–98.3% | Diag: Yes | No | No | No | Real— |
| 9 | [ | Visualization and explainable machine learning for efficient manufacturing and system operations | ASTM, | Diagnostic | Others | N/A 1 | Yes | No | Yes | No | Simulated—turbofan |
| 10 | [ | Interpretable anomaly prediction: Predicting anomalous behavior in industry 4.0 settings via regularized logistic regression tools | Elsevier, | Anomaly detection | Interpretable model | Kappa: 0.4–0.6 | Yes | No | No | Statistical feature extraction | Real— |
| 11 | [ | A dynamic structure-adaptive symbolic approach for slewing bearings life prediction under variable working conditions | Sage, | Prognostic | Interpretable model | RMSE: 18.19 | Yes | No | No | No | Real— |
| 12 | [ | Digital twin, physics-based model, and machine learning applied to damage detection in structures | Elsevier, | Diagnostic | Interpretable model | Accuracy: 74.8–93.3% | No | No | No | No | Not specified— |
| 13 | [ | Progress toward interpretable machine learning based disruption predictors across tokamaks | Taylor and Francis, Fusion Science and Technology | Diagnostic | Interpretable model | N/A | No | No | No | Physic-based indicator | Real DIII—D and JET tokamaks |
| 14 | [ | Investigating the physics of tokamak global stability with interpretable ML tools | MDPI, | Anomaly detection | Mathematic equation | Success Rate > 90% | No | No | No | No | Type unspecified—Tokamak |
| 15 | [ | Fault diagnosis of gas turbine based on partly interpretable convolutional neural networks | Elsevier, | Diagnostic | Tree-based | Accuracy: 95.52% | Yes | No | No | No | Simulated— |
| 16 | [ | Addressing noise and skewness in interpretable health-condition assessment by learning model confidence | MDPI, | Diagnostic | Rule- and knowledge- based | F1 Score: 0.8005 | No | No | No | No | Real— |
| 17 | [ | Knowledge extraction and insertion to deep belief network for gearbox fault diagnosis | Elsevier, Knowledge-Based Systems | Diagnostic | Rule- and Knowledge-based | Accuracy: 92.33 | Yes | No | No | No | Real— |
| 18 | [ | Isotonic boosting classification rules | Springer, | Diagnostic | Rule- and knowledge-based | Total Misclassification Probability (TMP): 0.036-0.164 | Yes | No | No | No | Real— |
| 19 | [ | Using an autoencoder in the design of an anomaly detector for smart manufacturing | Elsevier, | Anomaly detection | Autoencoder | Precision: | Yes | No | No | No | Simulated— |
| 20 | [ | Robust interpretable deep learning for intelligent fault diagnosis of induction motors | IEEE, | Diagnostic | Filter-based | Accuracy: 99.95% ± 0.05% | Yes | No | No | No | Real— |
| 21 | [ | Tscatnet: An interpretable cross-domain intelligent diagnosis model with antinoise and few-shot learning capability | IEEE, | Diagnostic | Filter-based | Accuracy: 100% | Yes | No | No | No | Real—Bearing, |
| 22 | [ | Waveletkernelnet: an interpretable deep neural network for industrial intelligent diagnosis. | IEEE, | Diagnostic | Filter-based | Accuracy: | Yes | No | No | No | Real—Bearing, |
| 23 | [ | Vibration signals analysis by explainable artificial intelligence approach: Application on bearing faults diagnosis | IEEE, | Diagnostic | Attention mechanism | N/A | No | No | No | No | Real— |
| 24 | [ | Vision-based fault diagnostics using explainable deep learning with class activation maps | IEEE, | Diagnostic | Attention mechanism | Accuracy: 95.85% | No | No | No | No | Real— |
| 25 | [ | VODCA: Verification of diagnosis using CAM-based approach for explainable process monitoring | MDPI, Sensors | Diagnostic | Attention mechanism | Accuracy: | Yes | No | No | True positive and true negative indicators | Simulated— |
| 26 | [ | Fouling modeling and prediction approach for heat exchangers using deep learning | Elsevier, International Journal of Heat and Mass Transfer | Failure Prediction | Model agnostic | Accuracy: | No | No | No | No | Simulated— |
| 27 | [ | Remaining useful life prognosis for turbofan engine using explainable deep neural network with dimensional reduction | MDPI, Sensors | Prognostic | Model Agnostic | RMSE: 10.41 | No | No | No | No | Simulated—Turbofan engine |
| 28 | [ | Interpretable convolutional neural network through layer-wise relevance propagation for machine fault diagnosis | IEEE, | Diagnostic | LRP | Accuracy: 100% | No | No | No | No | Real— |
| 29 | [ | ProtoSteer: Steering deep sequence model with prototypes | IEEE, | Diagnostic | Others | N/A | Yes | No | Yes | No | Real— |
| 30 | [ | Frequency-temporal-logic-based bearing fault diagnosis and fault interpretation using Bayesian optimization &ANN | Elsevier, | Diagnostic | Others | Better error percentage, error rate and robustness | Yes | No | No | No | Real—Bearings |
| 31 | [ | FLAGS: A methodology for adaptive anomaly detection and root cause analysis on sensor data streams by fusing expert knowledge with machine learning | Elsevier, | Anomaly detection, diagnostic | Rule- and knowledge- based | Accuracy: 75% | Yes, for both | No | Yes | FMEA and FTA—Expert opinion | Real—Train |
| 32 | [ | A new interpretable learning method for fault diagnosis of rolling bearings | IEEE, | Diagnostic | Cluster- based | Accuracy: | Yes | No | No | No | Real— |
| 33 | [ | Stable and explainable deep learning damage prediction for prismatic cantilever steel beam | Elsevier, | Diagnostic | Model Agnostic | Accuracy for 19% | Yes, by LIME only | Stability-fit compensation index (SFC)—Quality indicator of the explanations | No | Yes | Real— |
| 34 | [ | An explainable convolutional neural network for fault diagnosis in linear motion guide | IEEE, | Diagnostic | Attention mechanism | Accuracy: | No | No | No | No | Real— |
| 35 | [ | Stationary subspaces autoregressive with exogenous terms methodology for degradation trend estimation of rolling and slewing bearings | Elsevier, | Prognostic | Others | MAE: 0.0375–0.0414 | Yes | No | No | No | Real— |
1 N/A = Item not included in the studied work.
Excluded articles according to the publication year.
| ID | Authors, Date | Title | Publisher, Publication Name | Exclusion Reason |
|---|---|---|---|---|
| 1 | [ | Adaptive cluster tendency visualization and anomaly detection for streaming data | ACM, ACM Transactions on Knowledge Discovery from Data | Non-PHM-XAI implementation/case study |
| 2 | [ | Improved fault detection and diagnosis using sparse global-local preserving projections | Elsevier, | Process monitoring and anomaly detection |
| 3 | [ | Interpretative identification of the faulty conditions in a cyclic manufacturing process | Elsevier, | Process monitoring and diagnosis |
| 4 | [ | Fault diagnosis in industrial chemical processes using interpretable patterns based on logical analysis of data | Elsevier, | Process monitoring and fault diagnosis |
| 5 | [ | Fisher discriminative sparse representation based on DBN for fault diagnosis of complex system | MDPI, | Process monitoring and fault diagnosis |
| 6 | [ | Knowledge-data-integrated sparse modeling for batch process monitoring | Elsevier, Chemical Engineering Science | Process anomaly detection and diagnosis |
| 7 | [ | An enhanced variable selection and Isolation Forest based methodology for anomaly detection with OES data | Elsevier, | Process anomaly detection and diagnosis |
| 8 | [ | Monitoring influent measurements at water resource recovery facility using data-driven soft sensor approach | IEEE, | Process anomaly detection |
| 9 | [ | Weakly correlated profile monitoring based on sparse multi-channel functional principal component analysis | Taylor and Francis, | Process monitoring |
| 10 | [ | Industrial process monitoring based on knowledge-data integrated sparse model and two-level deviation magnitude plots | ACS, Industrial and Engineering Chemistry Research | Process monitoring, anomaly detection and diagnosis |
| 11 | [ | EasyMiner.eu: web framework for interpretable machine learning based on rules and frequent item sets | Elsevier, | Only development version offers anomaly detection |
| 12 | [ | A condition change detection method for solar conversion efficiency in solar cell manufacturing processes | IEEE, | Process monitoring and anomaly detection |
| 13 | [ | Evolving rule-based explainable artificial intelligence for unmanned aerial vehicles | IEEE, | Interpret why agent deviate from its mission, not because of system failure |
| 14 | [ | Dynamic soft sensor development based on convolutional neural networks | ACS, | Process modelling |
| 15 | [ | Explicit and interpretable nonlinear soft sensor models for influent surveillance at a full-scale wastewater treatment plant | Elsevier, | Process monitoring and variable prediction |
| 16 | [ | Intelligent online catastrophe assessment and preventive control via a stacked denoising autoencoder | Elsevier, Neurocomputing | Black-box |
| 17 | [ | Predictive maintenance using tree-based classification techniques: a case of railway switches | Elsevier, | Predict maintenance need, activity type and maintenance trigger status |
| 18 | [ | Deep understanding in industrial processes by complementing human expertise with interpretable patterns of machine learning | Elsevier, | Process monitoring and fault diagnosis |
| 19 | [ | Sparse robust principal component analysis with applications to fault detection and diagnosis | ACS, Industrial and Engineering Chemistry Research | Process monitoring, fault detection and diagnosis |
| 20 | [ | Process abnormity identification by fuzzy logic rules and expert estimated thresholds derived certainty factor | Elsevier, Chemometrics and Intelligent Laboratory Systems | Process anomaly diagnosis |
| 21 | [ | Dual Bayesian inference for risk-informed vibration-based diagnosis | Wiley, Computer-Aided Civil and Infrastructure Engineering | Uncertainty interpretation, not model’s interpretation |
| 22 | [ | ALVEN: Algebraic learning via elastic net for static and dynamic nonlinear model identification | Elsevier, Computers and Chemical Engineering | Process monitoring and variable prediction |
| 23 | [ | Combining k-means and XGBoost models for anomaly detection using log datasets | MDPI, | Anomaly in project, not engineered system |
| 24 | [ | A modern data-mining approach based on genetically optimized fuzzy systems for interpretable and accurate smart-grid stability prediction | MDPI, Energies | Electrical grid demand stability in financial perspective |
| 25 | [ | Data or interpretations impacts of information presentation strategies on diagnostic processes | Wiley, Human Factors and Ergonomics in Manufacturing and Service Industries | Experiment with operator effectivity following quality of interpretability |
| 26 | [ | Least squares sparse principal component analysis and parallel coordinates for real-time process monitoring | ACS, Industrial and Engineering Chemistry Research | Process monitoring and diagnosis |
| 27 | [ | Process control via random forest classification of profile signals: an application to a tapping process | Elsevier, | Process monitoring and anomaly detection |
| 28 | [ | Diagnosing root causes of intermittent slow queries in cloud databases | ACM, | Diagnosing slow query due to lack of resources, not failure |
| 29 | [ | Performance prediction and interpretation of a refuse plastic fuel fired boiler | IEEE, | Performance prediction |
| 30 | [ | SurvLIME: a method for explaining machine learning survival models | Elsevier, | Medical survival model |
| 31 | [ | A robust algorithm for explaining unreliable machine learning survival models using the Kolmogorov.Smirnov bounds | Elsevier, | Medical survival model |
| 32 | [ | Cryptomining detection in container clouds using system calls and explainable machine learning | IEEE, IEEE Transactions on Parallel and Distributed Systems | Network attack |
| 33 | [ | Decision trees for informative process alarm definition and alarm-based fault classification | Elsevier, Process Safety and Environmental Protection | Process monitoring and anomaly detection |
| 34 | [ | Fuzzy heuristics and decision tree for classification of statistical feature-based control chart patterns | MDPI, Symmetry | Process monitoring and diagnosis |
| 35 | [ | DTDR-ALSTM: Extracting dynamic time-delays to reconstruct multivariate data for improving attention-based LSTM industrial time series prediction models | Elsevier, | Process monitoring and variable prediction |
Figure 2PRISMA flow diagram of the search strategy for our review on PHM-XAI.3.3 (“*” indicates that “n = ” in the database field corresponds to the total number of records from all the databases specified below; and “**” states that the Zotero software was used for duplication analysis).
Figure 3Distribution of PHM tasks for the selected articles.
Figure 4Distribution of the selected articles according to the indicated publisher.
Figure 5Distribution of the excluded articles according to the topic.
Figure 6Distribution of the excluded articles according to the publisher.
Figure 7Distribution of the selected articles according to the indicated year.
Figure 8Distribution of the XAI approach type in the selected articles.
Figure 9Distribution of the XAI assistance in the indicated PHM task.
Figure 10Distribution of the performance of AI models according to the indicated task.
Figure 11(a) Distribution of the type of case study in the selected articles; (b) distribution of human involvement (yes/no) in the selected articles; (c) distribution of explanation metric inclusion (yes/no) in the selected articles; and (d) distribution of uncertainty management inclusion (yes/no) in the selected articles.