| Literature DB >> 29518921 |
Lluís Sanmiquel1, Marc Bascompta2, Josep M Rossell3, Hernán Francisco Anticoi4, Eduard Guash5.
Abstract
An analysis of occupational accidents in the mining sector was conducted using the data from the Spanish Ministry of Employment and Social Safety between 2005 and 2015, and data-mining techniques were applied. Data was processed with the software Weka. Two scenarios were chosen from the accidents database: surface and underground mining. The most important variables involved in occupational accidents and their association rules were determined. These rules are composed of several predictor variables that cause accidents, defining its characteristics and context. This study exposes the 20 most important association rules in the sector-either surface or underground mining-based on the statistical confidence levels of each rule as obtained by Weka. The outcomes display the most typical immediate causes, along with the percentage of accidents with a basis in each association rule. The most important immediate cause is body movement with physical effort or overexertion, and the type of accident is physical effort or overexertion. On the other hand, the second most important immediate cause and type of accident are different between the two scenarios. Data-mining techniques were chosen as a useful tool to find out the root cause of the accidents.Entities:
Keywords: association rules; data mining; overexertion; previous cause; type of accident
Mesh:
Year: 2018 PMID: 29518921 PMCID: PMC5877007 DOI: 10.3390/ijerph15030462
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Frequency distribution for the target variable Type of Accident.
Feature selection: Attribute evaluators, search methods and selection modes.
| Target Attribute | Attribute Evaluators | Search Methods | Selection Modes |
|---|---|---|---|
| ChiSquaredAttributeEval | Ranker | Full training set and | |
| CfsSubsetEval | GreedyStepwise | ||
| ClassifierSubsetEval | RandomSearch | ||
| InfoGainAttributeEval | Ranker |
Ranking of the variables for Scenario I.
| Variables | PC | S | PA | E | PO | C | A | WH | DH |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
Ranking of the variables for Scenario II.
| Variables | PC | PA | P | E | S | A | PO | DH | C | WH |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
The 20 best association rules for Scenario I with their corresponding confidence level and percentage of accidents for each association.
| Predictor Variable 1 | Predictor Variable 2 | Predictor Variable 3 | Predictor Variable 4 | Target Variable | Confidence | % Accidents |
|---|---|---|---|---|---|---|
| C1 | PC6 | PA4 | PO3 | TA6 | 0.868 | 6.21 |
| PC6 | PA4 | PO3 | S5 | TA6 | 0.867 | 6.30 |
| C1 | PO3 | PA4 | S5 | TA6 | 0.863 | 7.10 |
| E4 | C1 | PC6 | TA6 | 0.842 | 5.05 | |
| E4 | C1 | PC6 | TA6 | 0.828 | 5.60 | |
| C1 | PC6 | WH2 | E4 | TA6 | 0.816 | 8.37 |
| C1 | PC6 | PO3 | TA6 | 0.814 | 13.40 | |
| PC6 | WH2 | PO3 | TA6 | 0.814 | 7.32 | |
| C1 | PC6 | WH2 | PO3 | TA6 | 0.814 | 7.20 |
| S5 | PC2 | PA2 | TA3 | 0.812 | 5.88 | |
| C1 | PC2 | PA4 | TA3 | 0.811 | 6.52 | |
| S5 | C1 | PC2 | PA1 | TA3 | 0.804 | 5.34 |
| PC2 | PA2 | WH2 | TA3 | 0.803 | 5.73 | |
| PC2 | PA4 | PO3 | TA3 | 0.799 | 5.73 | |
| C1 | PC6 | WH3 | TA6 | 0.799 | 5.28 | |
| C1 | PC2 | PA4 | PO3 | TA3 | 0.799 | 5.59 |
| S5 | PC2 | WH3 | TA3 | 0.798 | 5.13 | |
| S5 | C1 | PC6 | PO3 | TA6 | 0.798 | 8.56 |
| S5 | PC6 | PO3 | TA6 | 0.797 | 8.72 | |
| S5 | C1 | PC6 | TA6 | 0.796 | 9.59 |
The 20 best association rules for Scenario II with their corresponding confidence level and percentage of accidents for each association.
| Predictor Variable 1 | Predictor Variable 2 | Predictor Variable 3 | Predictor Variable 4 | Target Variable | Confidence | % Accidents |
|---|---|---|---|---|---|---|
| C1 | PC6 | WH2 | TA6 | 0.811 | 7.11 | |
| C1 | PC6 | WH2 | PO5 | TA6 | 0.809 | 5.73 |
| PC6 | PA4 | P1 | E1 | TA6 | 0.807 | 7.65 |
| PC6 | PA4 | PO5 | TA6 | 0.806 | 6.38 | |
| S3 | PC6 | P1 | WH2 | TA6 | 0.796 | 5.68 |
| PC6 | WH2 | PO5 | TA6 | 0.795 | 9.23 | |
| PC6 | DH2 | PO5 | TA6 | 0.794 | 6.04 | |
| C1 | PC6 | PO5 | E1 | TA6 | 0.792 | 10.75 |
| PC6 | P3 | PO5 | TA6 | 0.792 | 6.35 | |
| C1 | PC6 | P1 | TA6 | 0.786 | 7.30 | |
| C1 | PC6 | P1 | PO5 | TA6 | 0.785 | 5.25 |
| PC6 | DH3 | PO5 | TA6 | 0.783 | 5.56 | |
| PC6 | P1 | PO5 | TA6 | 0.771 | 8.38 | |
| S3 | PC6 | PO5 | TA6 | 0.771 | 5.17 | |
| PC4 | PA6 | P1 | PO5 | TA2 | 0.766 | 6.25 |
| S3 | E1 | PC6 | PO5 | TA6 | 0.753 | 5.34 |
| C1 | PC4 | PO5 | TA2 | 0.750 | 5.68 | |
| S3 | PC6 | PO5 | TA6 | 0.745 | 6.40 | |
| PC6 | WH3 | PO5 | TA6 | 0.743 | 5.73 | |
| C1 | PA6 | PO5 | TA2 | 0.464 | 6.44 |
Number of times a variable appears in an association rule for Scenario I.
| Variables | PC | C | PO | WH | S | PA | DH | A | E |
|---|---|---|---|---|---|---|---|---|---|
| 75 | 57 | 47 | 42 | 39 | 33 | 17 | 15 | 7 | |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
Number of times a variable appears in an association rule for Scenario II.
| Variables | PO | C | PC | WH | P | DH | PA | E | S | A |
|---|---|---|---|---|---|---|---|---|---|---|
| 64 | 42 | 37 | 32 | 25 | 20 | 16 | 13 | 10 | 4 | |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
Data from the Spanish Ministry of Employment and Social Safety between 2005–2015.
| Variable | ≤49 Workers | 50–99 Workers | 100–499 Workers | ≥500 Workers |
|---|---|---|---|---|
| 12.2% | 9.4% | 50.5% | 27.9% | |
| 4.2% | 5.3% | 25.8% | 64.7% | |
| 2.9 | 1.8 | 2.0 | 0.4 |