| Literature DB >> 36236727 |
Paweł Stefaniak1, Maria Stachowiak1, Wioletta Koperska1, Artur Skoczylas1, Paweł Śliwiński2.
Abstract
Systems that use automatic speech recognition in industry are becoming more and more popular. They bring benefits especially in cases when the user's hands are often busy or the environment does not allow the use of a keyboard. However, the accuracy of algorithms is still a big challenge. The article describes the attempt to use ASR in the underground mining industry as an improvement in the records of work in the heavy machinery chamber by a foreman. Particular attention was paid to the factors that in this case will have a negative impact on speech recognition: the influence of the environment, specialized mining vocabulary, and the learning curve. First, the foreman's workflow and documentation were recognized. This allowed for the selection of functionalities that should be included in the application. A dictionary of specialized mining vocabulary and a source database were developed which, in combination with the string matching algorithms, aim to improve correct speech recognition. Text mining analysis, machine learning methods were used to create functionalities that provide assistance in registering information. Finally, the prototype of the application was tested in the mining environment and the accuracy of the results were presented.Entities:
Keywords: augmented reality; automatic speech recognition; self-propelled machine; text mining; transcription; underground mining; voice interface; wearable computer
Mesh:
Year: 2022 PMID: 36236727 PMCID: PMC9573029 DOI: 10.3390/s22197628
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Hourly schedule of the foreman’s work.
Summary of tables and fields that are accessible by the foreman.
| System | Table | No. of | Data Types | No. of Data Type Occurrences |
|---|---|---|---|---|
| CMMS | Register of | 14 | Categorical | 3 |
| [0–1] | 7 | |||
| Numerical | 4 | |||
| Register of entry and exit | 11 | Categorical | 5 | |
| Numerical | 4 | |||
| Time | 2 | |||
| e-Raport | Machines working plan | 3 | Categorical | 3 |
| Maintenance performed | 13 | Categorical | 5 | |
| [0–1] | 3 | |||
| Numerical | 2 | |||
| Text | 3 | |||
| Settlement of work for | 12 | Categorical | 3 | |
| Numerical | 8 | |||
| Text | 1 | |||
| Settlement of work for | 19 | Categorical | 13 | |
| Numerical | 2 | |||
| Time | 3 | |||
| Text | 1 |
Description of fields that can be used by the voice assistant.
| System | Table | Variables | Data Type | Explanation |
|---|---|---|---|---|
| CMMS | Register of machines | Machine | Categorical | Machine name |
| PRD | [0–1] | Part of work spent in production | ||
| DPL | [0–1] | Part of work spent on a planned service | ||
| DNP | [0–1] | Part of work spent on an unplanned service | ||
| NPP | [0–1] | Part of work spent on a planned repair | ||
| NPL | [0–1] | Part of work spent on an unplanned repair | ||
| NAW | [0–1] | Part of work spent on an emergency repair | ||
| AWR | [0–1] | Part of work spent on an emergency stop | ||
| Milometer value | Numerical | Milometer value | ||
| Register of entry and exit | Company | Categorical | Company name | |
| Subcontracting company | Categorical | Name of the subcontracting company | ||
| Name and surname | Categorical | Name and surname of the employee | ||
| eRaport | Machines working plan | Machine | Categorical | Machine name |
| Operator | Categorical | Name and surname of the employee | ||
| Mining department | Categorical | A mining department | ||
| Maintenance performed | Machine | Categorical | Machine name | |
| Works to be done | Free written text | Description of the work to be performed on the current shift in free written text | ||
| Operator | Categorical | Name and surname of the employee | ||
| Mechanics | Categorical | Name and surname of the employee | ||
| Work | Free written text | Listed elements for maintenance | ||
| Work to be done on the next shift | Free written text | Description of the work to be performed on the next shift in free written text | ||
| Milometer value | Numerical | Milometer value from machine | ||
| Settlement of work for machines | Machine | Categorical | Machine names | |
| Operator | Categorical | Name and surname of the employee | ||
| Mining department | Categorical | A mining department that requests a machine | ||
| Settlement of work for employees | Employee | Categorical | Name, surname, HMC, employee ID | |
| Mining department | Categorical | A mining department that requests a machine | ||
| Machine | Categorical | Machine name |
Review of the quality of algorithms on sample with different noises.
| Noise Type | Noise | Well-Caught Words (%) | Word Error Rate | Levenshtein | Jaro | Jaro–Winkler |
|---|---|---|---|---|---|---|
| None | None | 100 | 0 | 1.0 | 1.0 | 1.0 |
| Normal | Traffic jam | 93.33 | 0.06 | 0.99 | 0.99 | 0.99 |
| Cleaning sounds | 100.00 | 0.00 | 1.00 | 1.00 | 1.00 | |
| Moving stuff | 100.00 | 0.00 | 1.00 | 1.00 | 1.00 | |
| Hair dryer | 81.25 | 0.20 | 0.97 | 0.98 | 0.99 | |
| Industrial | Electric | 81.25 | 0.20 | 0.93 | 0.96 | 0.97 |
| Factory sounds | 93.33 | 0.06 | 0.97 | 0.98 | 0.99 | |
| Grinder | 87.50 | 0.13 | 0.99 | 0.99 | 0.99 | |
| Ship engine | 73.33 | 0.26 | 0.92 | 0.96 | 0.98 | |
| Car engine | 87.50 | 0.13 | 0.98 | 0.99 | 0.99 |
Figure 2Process of creating a matrix of words.
Categories prepared to train the model.
| Category | Count |
|---|---|
| Drive system | 2825 |
| Fire extinguishing system | 1900 |
| Service brake system | 1516 |
| Electrical installation | 699 |
| Suspension arms and bushings | 593 |
| Hydraulic system | 453 |
| Central lubrication system | 388 |
| Running gear | 329 |
| Air conditioning system | 319 |
Confusion matrix for prepared model.
| Electrical Installation | Fire Extinguishing System | Suspension Arms and Bushings | Hydraulic System | Central Lubrication System | Running Gear | Air Conditioning System | Drive System | Service Brake System | |
|---|---|---|---|---|---|---|---|---|---|
| Electrical installation | 206 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 |
| Fire extinguishing system | 0 | 570 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Suspension arms and bushings | 6 | 0 | 172 | 0 | 0 | 0 | 0 | 0 | 0 |
| Hydraulic system | 0 | 0 | 1 | 115 | 0 | 0 | 0 | 0 | 0 |
| Central lubrication system | 0 | 0 | 0 | 0 | 136 | 0 | 0 | 0 | 0 |
| Running gear | 0 | 0 | 0 | 0 | 0 | 99 | 0 | 0 | 0 |
| Air conditioning system | 0 | 0 | 0 | 0 | 0 | 0 | 96 | 0 | 0 |
| Drive system | 3 | 0 | 14 | 0 | 0 | 1 | 1 | 828 | 0 |
| Service brake system | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 455 |
Figure 3Diagram of commands for main tasks.
Figure 4Diagram of the data validation algorithm.
Figure 5Client-server architecture for the assistant.
Table describing testers.
| Person | Number of Recordings | Experience |
|---|---|---|
| P1 | 6 | Device developer |
| P2 | 5 | Had no previous contact with the device |
| P3 | 5 | Knew the commands but had never used the device |
Figure 6Map of the chamber with marked places of recordings.
Results for each recording.
| Person | Recording | Time | Avg. Entry Correctness (%) | Entries Fully Filled (%) |
|---|---|---|---|---|
| P1 | 1 | 02:25 | 89.58% | 68.75% |
| P1 | 2 | 02:39 | 68.75% | 56.25% |
| P1 | 3 | 02:53 | 87.50% | 68.75% |
| P1 | 4 | 02:04 | 72.92% | 50.00% |
| P1 | 5 | 02:21 | 87.50% | 62.50% |
| P1 | 6 | 01:42 | 83.33% | 62.50% |
| P2 | 7 | 01:28 | 75.00% | 50.00% |
| P2 | 8 | 02:02 | 72.92% | 50.00% |
| P2 | 9 | 01:47 | 81.25% | 50.00% |
| P2 | 10 | 01:54 | 77.08% | 50.00% |
| P2 | 11 | 01:42 | 85.42% | 68.75% |
| P3 | 12 | 02:09 | 79.17% | 50.00% |
| P3 | 13 | 02:00 | 83.33% | 56.25% |
| P3 | 14 | 02:03 | 79.17% | 50.00% |
| P3 | 15 | 02:21 | 79.17% | 43.75% |
| P3 | 16 | 02:17 | 72.92% | 43.75% |
Results for different data types.
| The Type of the Variable | Accuracy |
|---|---|
| Machine | 86.33% |
| Operator | 92.58% |
| Department | 60.16% |