| Literature DB >> 35937881 |
Yutong Gu1, Chao Zheng2, Masahiro Todoh3, Fusheng Zha4.
Abstract
A sign language translation system can break the communication barrier between hearing-impaired people and others. In this paper, a novel American sign language (ASL) translation method based on wearable sensors was proposed. We leveraged inertial sensors to capture signs and surface electromyography (EMG) sensors to detect facial expressions. We applied a convolutional neural network (CNN) to extract features from input signals. Then, long short-term memory (LSTM) and transformer models were exploited to achieve end-to-end translation from input signals to text sentences. We evaluated two models on 40 ASL sentences strictly following the rules of grammar. Word error rate (WER) and sentence error rate (SER) are utilized as the evaluation standard. The LSTM model can translate sentences in the testing dataset with a 7.74% WER and 9.17% SER. The transformer model performs much better by achieving a 4.22% WER and 4.72% SER. The encouraging results indicate that both models are suitable for sign language translation with high accuracy. With complete motion capture sensors and facial expression recognition methods, the sign language translation system has the potential to recognize more sentences.Entities:
Keywords: American sign language; electromyography; inertial measurement units; long short-term memory; transformer
Year: 2022 PMID: 35937881 PMCID: PMC9345758 DOI: 10.3389/fnins.2022.962141
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
Forty commonly used American sign language sentences.
|
|
|
|
|
|---|---|---|---|
| 1. I'm happy! | 11. Today I feel sad. | 21. Are you deaf? | 31. I'm fine. |
| 2. Wow the steak is delicious! | 12. I don't like cat. | 22. Are you finish? | 32. I'm busy. |
| 3. Happy new year! | 13. Why you are sad. | 23. Are you alright? | 33. I need help. |
| 4. Merry Christmas! | 14. I'm afraid of spider. | 24. Do you want milk and cookies? | 34. You like him. |
| 5. Wow the dessert is delicious! | 15. Running, growing up, I hate it. | 25. Do you like ice-cream? | 35. I go to church on Sunday. |
| 6. Haha the commercial is funny! | 16. I don't know where, sad. | 26. Are you happy with studying history? | 36. I'm a broke college student. |
| 7. With you I'm happy! | 17 My friend dislikes wrestling. | 27. Do you come to church on Sunday? | 37. I go to beach this summer. |
| 8. Happy thanksgiving! | 18. His wife dislikes cooking. | 28. Do you also want fries? | 38. We are hungry. |
| 9. Happy mother's day! | 19. I'm worried. they are angry. | 29. Did you finish eating vegetable? | 39. I go back home. |
| 10. This year we are happy! | 20. I feel annoyed. | 30. Does this food have strawberry? | 40. They enjoy eating hamburgers. |
Figure 1Devices for data collection: (A) perception Neuron motion capture system; (B) electromyography (EMG) signal acquisition system.
Figure 2Signal preprocessing flowchart.
Figure 3An example of electromyography (EMG) data preprocessing: (A) raw EMG data from sentence no. 21; (B) corresponding preprocessed EMG data.
Figure 4Facial expressions' classification model.
The vocabulary for 40 American sign language (ASL) sentences.
| ! | , | . | ? | Christmas | I | I'm | Sunday |
| A | Afraid | Alright | Also | And | Angry | Annoyed | Are |
| Back | Beach | Broke | Busy | Cat | Church | College | Come |
| Commercial | Cooking | Cookies | Day | Deaf | Delicious | Dessert | Did |
| Dislikes | Do | Does | Don't | Eating | Enjoy | Feel | Fine |
| Finish | Food | Friend | Fries | Funny | Go | Growing | Haha |
| Hamburgers | Happy | Hate | Have | Help | Him | His | History |
| Home | Hungry | Ice-cream | Is | It | Know | Like | Merry |
| Milk | Mother's | My | Need | New | Of | On | Running |
| Sad | Spider | Steak | Strawberry | Student | Studying | Summer | Thanksgiving |
| The | They | This | To | Today | Up | Vegetable | Want |
| We | Where | Why | Wife | With | Worried | Wow | Wrestling |
| Year | You | <BOS> | <EOS> | <PAD> |
Figure 5Architecture of long short-term memory (LSTM)-based translation model.
Figure 6Detailed structure of long short-term memory (LSTM) unit.
Figure 7Transformer-based translation model: (A) architecture of the model; (B) detailed structure of the self-attention layer.
Figure 8Facial expressions classification results: (A) accuracy of five cross-validation sets; (B) total confusion matrix of cross-validation steps.
Figure 9Long short-term memory (LSTM) model evaluation result.
Figure 10Transformer model evaluation result.
Word error rate comparison.
|
|
| |
|---|---|---|
| Input with EMG | 7.74% | 4.22% |
| Input without EMG | 11.86% | 8.43% |
Sentence error rate comparison.
|
|
| |
|---|---|---|
| Input with EMG | 9.17% | 4.72% |
| Input without EMG | 14.72% | 8.89% |
User-independent validation results.
|
|
|
| |
|---|---|---|---|
| Participant 2 | 41.95% | 44.50% | 93.25% |
| Participant 3 | 41.12% | 46.00% | 95.00% |