| Literature DB >> 36187621 |
Mohamed Zul Fadhli Khairuddin1,2, Khairunnisa Hasikin1,3, Nasrul Anuar Abd Razak1, Khin Wee Lai1, Mohd Zamri Osman4, Muhammet Fatih Aslan5, Kadir Sabanci5, Muhammad Mokhzaini Azizan6, Suresh Chandra Satapathy7, Xiang Wu8.
Abstract
Workplace accidents can cause a catastrophic loss to the company including human injuries and fatalities. Occupational injury reports may provide a detailed description of how the incidents occurred. Thus, the narrative is a useful information to extract, classify and analyze occupational injury. This study provides a systematic review of text mining and Natural Language Processing (NLP) applications to extract text narratives from occupational injury reports. A systematic search was conducted through multiple databases including Scopus, PubMed, and Science Direct. Only original studies that examined the application of machine and deep learning-based Natural Language Processing models for occupational injury analysis were incorporated in this study. A total of 27, out of 210 articles were reviewed in this study by adopting the Preferred Reporting Items for Systematic Review (PRISMA). This review highlighted that various machine and deep learning-based NLP models such as K-means, Naïve Bayes, Support Vector Machine, Decision Tree, and K-Nearest Neighbors were applied to predict occupational injury. On top of these models, deep neural networks are also included in classifying the type of accidents and identifying the causal factors. However, there is a paucity in using the deep learning models in extracting the occupational injury reports. This is due to these techniques are pretty much very recent and making inroads into decision-making in occupational safety and health as a whole. Despite that, this paper believed that there is a huge and promising potential to explore the application of NLP and text-based analytics in this occupational injury research field. Therefore, the improvement of data balancing techniques and the development of an automated decision-making support system for occupational injury by applying the deep learning-based NLP models are the recommendations given for future research.Entities:
Keywords: artificial intelligence; deep learning; machine learning; natural language processing; occupational health and safety
Mesh:
Year: 2022 PMID: 36187621 PMCID: PMC9521307 DOI: 10.3389/fpubh.2022.984099
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Inclusion and exclusion criterion.
|
|
|
|
|---|---|---|
| Sources | Journal/Research Article | Conference papers, journal reviews, news, editorial papers, book series, book and book chapters. |
| Language | English | Non-English |
| Period | 2016 to 2021 | <2016 |
| Area | Engineering, Occupational Safety and Health, Public Health, Artificial Intelligence | Other than Engineering, Occupational Safety and Health, Public Health, Artificial Intelligence |
Search strings for eight databases.
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| Occupational injury AND natural language processing | 15 | 35 | 0 | 8 | 0 | 13 | 0 | 2 |
| Occupational accident AND natural language processing | 14 | 47 | 1 | 8 | 3 | 4 | 0 | 1 |
| Occupational injury AND text mining | 2 | 25 | 1 | 0 | 2 | 2 | 0 | 0 |
| Occupational accident AND text mining | 1 | 27 | 0 | 0 | 1 | 10 | 0 | 0 |
| Occupational injury AND injury narratives | 13 | 39 | 0 | 4 | 2 | 17 | 0 | 4 |
| Occupational accident AND injury narratives | 4 | 39 | 0 | 0 | 0 | 5 | 0 | 0 |
| Workplace injury AND natural language processing AND machine learning | 5 | 3 | 0 | 0 | 0 | 2 | 9 | 0 |
| Workplace injury AND natural language processing AND deep learning | 2 | 3 | 0 | 0 | 0 | 1 | 0 | 0 |
| Total including duplicates | 56 | 218 | 2 | 20 | 8 | 74 | 9 | 7 |
| Sub-total including duplicates | 394 | |||||||
| Total selected articles | 27 | |||||||
Figure 1PRISMA flowchart.
Figure 2Percentage of three main algorithms for 27 articles.
Similarity of techniques in existing related studies.
|
|
|
|
|---|---|---|
| ( | 8 | |
| ( | 12 | |
| ( | 7 |
Short Summary of review papers.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Yedla et al. ( | Identify the potential of text narratives in predicting the injury outcomes and days away from work | LR, DT, RF, and ANN | ANN had the best overall accuracy (0.78) for fixed field entries and RF had the best overall accuracy (0.94) for injury narratives. | Data imbalance problems are not considered. | Future studies can expand by using deep learning models such as CNN and RNN. The use of a Generative Adversarial Network (GAN) to overcome data imbalance problems should be explored. |
| Tixier et al. ( | Extract valuable new safety knowledge from large datasets, in terms of “safety clashes” | Graph mining, Hierarchical clustering on Principal Components | Graphical features are useful in identifying the combinations of attributes. | Findings are limited to one dataset only. | Follow-up research should expand the generalizability of the methods to other occupational contexts or settings. |
| Nanda et al. ( | Test the Bayesian decision support system to auto-codes large datasets | NB models; Single-Word (SW) and Two-word Sequence (TW) | TW had higher sensitivity (0.69) than SW (0.66); accuracy increased when the two models agreed (0.80) | Not include information on the nature of the injury and affected body parts. | To include the coded information on the nature of the injury and body parts in the models. |
| Bertke et al. ( | Compare the performance of NB and LR models; Investigate the performance of adding TW into a single model and test the feasibility of the models with external datasets | NB and regularized LR | LR performed better than NB, accuracy (0.80), and adding TW improved the performances of both models. | Lack of quality control on the narratives. | Evaluation of a database with less descriptive narratives will likely have lower success with auto-coder. |
| Kim and Chi ( | Suggests an NLP-based prototype of a Construction Accident Management System | Semantic retrieval model using Okapi BM25 and thesaurus; Tacit knowledge extraction using rule-based, conditional random field (CRF) | Retrieved results 97% relevant to the accident reports; Knowledge accuracy using rule-based (93.75%) and CRF models (84.13%) | Practical limitations in rule generation involve grammatical errors and various expressions from the injury reports. | Required more data to fully learn the tacit knowledge feature. The necessity to apply the proposed system to the real-world construction field for system optimization. |
| Cheng et al. ( | Suggests a hybrid model address sequential problems in text characteristics of accident reports | DT, KNN, NB, LR. SVM, LSTM, GTU and hybrid model–SGRU | SGRU had the best overall performances (0.69) | Existence of imbalanced data distribution in the dataset. | Exploration in sequential learning models such as RNN variants. Focus on the application of data balancing techniques such as over-sampling/under-sampling to tackle the issue of imbalanced data distribution in datasets. |
| Liu et al. ( | Suggests a novel framework, JUMPER understands the sequential decision process in text | JUMPER model; CNN as SentEnc and RNN as controller. | JUMPER reduced the length of text reading up to 40%; up to 30% speedup for prediction; finding key rationale up to 6%; classification, JUMPER achieved better performances on all tasks. | The inaccuracy of neural networks was fed with too much irrelevant information. | Incorporating symbolic reasoning into the output layer in a multitask setting to explicitly handle inference. |
| Xu et al. ( | Provide an improved approach to extracting risk factors from accident reports | Text-mining approach–domain lexicon | Verified TF-H is favored in measuring risk factors | Limited source of text documents. | Extraction of valuable information from different text documents will be given different corpus and tasks and produce a better model. |
| Chokor et al. ( | Assess the strength of unsupervised machine learning-based NLP in re-arranging the type of accidents | K-means clustering | Four accident attributes of clusters–“fall,” “struck by objects,” “electrocutions” and “trenches collapse” | Limited to only one specific geographical data. | Models can be improved by investigating a larger sample of occupational injury reports. |
| Goh and Ubeynarayana ( | Evaluate various text-mining models to classify the accidents | SVM, LR, RF, KNN, DT, NB | SVM had the best average f1 score (0.62); Linear SVM with uni-gram and RF with uni-gram were the best classifiers | An excessive number of terms/features and unrelated elaboration of narratives in the reports. | Developing a more intelligent pre-processing of the narrative such as using rule-based methods may eliminate unrelated narratives to the occupational injury. |
| Luo et al. ( | Suggests a text-based analytic method using fall accident cases for accident analysis | R software; Apriori algorithm | TF-IDF calculation identified 28 causal factors and six groups of accident types; the strong correlation between the causal factors (confidence level = 100%); the occurrence of an accident is the result of the synergistic effects of the causal factors. | Lack of input on the external environment such as temporal characteristics. | Expansion of detailed association analysis including the environmental factors for more comprehensive models. |
| Marucci-Wellman et al. ( | Compare the human-machine approaches to classify the occupational injury narratives | SVM, LR, NB (SW and bi-gram) | LR model had the best performance (0.74); SVM-NB bi-gram models performed as the paired models (0.89); SVM-NB bi-gram-NB SW had improved performance with 0.93 | Handling short noisy injury narratives in many administrative datasets. | Research on finding rare categories of occupational injury narratives shall be enhanced by the integration of NLP and ensemble approaches. |
| Oyedele et al. ( | Compare the state-of-art algorithms with conventional machine learning to analyze the accident reports | R software; DNN, GBM, XGB, SVM, KNN | Deep learning outperformed boosted trees and other algorithms (0.967); GBM-XGB-DNN had better accuracy (>0.90) | Focused on one construction company. | Findings should be validated through additional research by collecting data from several organizations. Implement robust interface techniques and develop deep feedforward neural networks for holistic safety management. |
| Song and Suh ( | Utilize text-mining and LOF models to detect anomalous accidents type | LOF algorithm | Prioritized major clusters–“filling related,” “detection-related,” “ventilation-related,” and “waste-related” accidents | Lack of data quality; poorly written reports on the accident sequence and insufficient keywords. | Documents containing more keywords will produce better text-analytic. Research on forecasting for preventive processes using text documents should be proposed. |
| Suh ( | Identify sectoral patterns and common factors of accident processes using injury narratives | LDA algorithm; R software | Five sectoral patterns were identified; eight topics of accident factors were discovered. | Inconsistency of the data quality; poor quality of narrative text consisting of few words and usage of the single data source. | The value of big data analytics can be enhanced by using multiple data sources and incorporating other external factors related to occupational injury in data analysis. |
| Zhang et al. ( | Classify the causes of accidents and identify the common objects that cause the accidents | SVM, LR, KNN, DT, NB, proposed ensemble model; Rule-based chunking approach | The proposed ensemble model with optimized weights achieved the best performance (0.68); 11 labels as the causes; 10 most common objects identified | The issue on the vagueness of natural language processing techniques. | Exploration of more advanced RNN variants and NLP frameworks such as Natural Node. Emphasize the application of data balancing techniques. |
| Zhong et al. ( | Suggests the deep learning methods to extract unstructured text automatically and provide a visual presentation of accident classification | CNN, SVM, NB, KNN, LDA-based network analysis | CNN outperformed all methods (0.63); nodes with a higher sample degree of centrality were “falls” and “collapse of objects” | Focused only on construction dataset and issues relates to labeling. | Testing the algorithms on much larger samples and developing a multi-label classifier to process occupational injury texts with multiple labels. |
| Tixier et al. ( | Apply RF and SGTB in predicting the injury | RF, SGTB | SGTB models reached higher predictive skills; models predicted three safety outcomes (0.236<RPSS<0.436)–“injury type.” “energy type.” “body part” | Focused only on the construction industry which limits the generalizability of the models. | More training on model stacking algorithms and using training data extracted from other sectors to widen the model application. |
| Sarkar et al. ( | Develop a model to predict injury severity based on reactive and proactive data | SVM, ANN, NB, KNN, CART, RF; LDA-based topic modeling | RF outperformed other models; performances of classifiers were better in mixed data; KMSMOTE performed better in oversampling technique | Focused only on the steel industry that limits generalizability and the dataset used has limited observations. | Analysis of a larger amount of data for better generalizability of the results. Exploring the data balancing techniques such as oversampling, under-sampling, algorithm-level, or cost-sensitive. Consider including other factors as input data. |
| Tixier et al. ( | Test the attributes and safety outcomes can be extracted automatically and accurately from the injury reports | R software based on hand-coded rules and keywords | R capable to scan the narratives with high recall (0.97), precision (0.95), and f1 score (0.96) | The system is not robust to erroneous input such as misspelled, missing, or unseen words. | NLP systems should be hybrid with different ML algorithms. Explore the potential of data-mining methods such as hierarchical clustering. |
| Baker et al. ( | Predict the safety outcomes | RF, XGB, SVM, CART | XGB, RF, and SVM performed comparably for classification; XGB-RF models as model stacking performed better than in single model | Addressing the limitations of judgement bias with empirical data. | Utilization of more powerful predictive algorithms such as neural networks to improve human decision-making. An interesting area of research is to predict the success or failure of occupational injury occurrences. |
| Ganguli et al. ( | Analyze the injury reports on public databases to be applied to private datasets | RF | With the high success of 95% classification on MHSA data; models were able to classify with about 96% accuracy in a non-MHSA data | Too dependent on the terminology and the report writing style. | Improve automation by standardization of occupational injury report writing. |
| Zhang ( | Explore the state-of-art text mining techniques for the automatic classification of occupational accident reports. | Hybrid structured deep neural network | Proposed neural networks outperform each baseline model in terms of weighted average f1 score with 0.723 | The size of the corpus used in this study is relatively small. | Application of data balancing techniques such as oversampling when pre-processing the accident causes. Building a larger domain-specific corpus can be beneficial for improving the quality of learned word embedding. |
| Guanyang et al. ( | Generate word clusters of words as contributory factors and form causal dependency. | NLP with K-means clustering and text mining techniques of co-occurrence network | Both methods are capable of identifying contributing factors. The co-occurrence network approach exhibits advantages in extracting dependency among the contributory factors, while K-means clustering is only able to indicate general correlations. | A co-occurrence network can inevitably omit important contributing factors. | Incorporating supervised learning techniques and fundamental network theory to identify underlying patterns of how the nodes (key objects) are connected. |
| Neththi et al. ( | Extract sources of hazards from occupational injury reports by using Text Mining (TM) and Natural Language Processing (NLP) techniques | Rule-based extraction tool, SVM, Kernel SVM, KNN, NB, and RF | The F1 score obtained through the rule-based model is 0.95. The worker factor is the highest contributor to construction site accidents | Limited literature focusing on extracting sources of hazard in the construction industry. | Further modified and utilized to extract any other reports in various domains by adjusting the N-gram files accordingly, provided that the N-grams be enriched with relevant words and phrases |
| Zhong et al. ( | Develop a novel framework that provides the ability to analyze hazard records automatically | Latent Dirichlet Allocation (LDA) model, CNN, Word Co-occurrence Network (WCN), and Word Cloud (WC) | The trained CNN-based deep learning model outperforms the shallow learning model | The complexity of the framework lies in the architecture of the CNN, especially on the hyper-parameter tuning. | Focus on determining how the integration of advanced semantic and syntactic features with the domain-specific knowledge of CNN models can result in improvements in the classification process. |
| Jing et al. ( | Developed a text-mining method for chemical accident cases based on word embedding and deep learning. | word2vec model and LSTM | Trends in chemical accidents could are obtained through correlation analysis based on word embedding | Complete injury reports can be hard to obtain. Data from websites are often incomplete, and complete cases are not fully disclosed to the public. | Establish a high-quality chemical accident case dataset. |
Figure 3Distribution by year of publication.
Figure 4Industrial sectors in existing literature.
Figure 5Proposed framework of multimodal prediction learning.