Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Multimodal Attention Network for Trauma Activity Recognition from Spoken Language and Environmental Sound.

Literature DB >> 32201857

Multimodal Attention Network for Trauma Activity Recognition from Spoken Language and Environmental Sound.

Yue Gu¹, Ruiyu Zhang¹, Xinwei Zhao¹, Shuhong Chen¹, Jalal Abdulbaqi¹, Ivan Marsic¹, Megan Cheng², Randall S Burd².

Abstract

Trauma activity recognition aims to detect, recognize, and predict the activities (or tasks) during a trauma resuscitation. Previous work has mainly focused on using various sensor data including image, RFID, and vital signals to generate the trauma event log. However, spoken language and environmental sound, which contain rich communication and contextual information necessary for trauma team cooperation, are still largely ignored. In this paper, we propose a multimodal attention network (MAN) that uses both verbal transcripts and environmental audio stream as input; the model extracts textual and acoustic features using a multi-level multi-head attention module, and forms a final shared representation for trauma activity classification. We evaluated the proposed architecture on 75 actual trauma resuscitation cases collected from a hospital. We achieved 72.4% accuracy with 0.705 F1 score, demonstrating that our proposed architecture is useful and efficient. These results also show that using spoken language and environmental audio indeed helps identify hard-to-recognize activities, compared to previous approaches. We also provide a detailed analysis of the performance and generalization of the proposed multimodal attention network.

Entities: Chemical Disease Gene Species

Keywords: environmental sound; multimodal attention network; spoken language; trauma activity recognition

Year: 2019 PMID： 32201857 PMCID： PMC7085888 DOI： 10.1109/ichi.2019.8904713

Source DB: PubMed Journal: IEEE Int Conf Healthc Inform ISSN： 2575-2626

9 in total

1 in total

1. Video-based Concurrent Activity Recognition for Trauma Resuscitation.

Authors: Yanyi Zhang; Yue Gu; Ivan Marsic; Yinan Zheng; Randall S Burd
Journal: IEEE Int Conf Healthc Inform Date: 2021-03-12

1 in total

Multimodal Attention Network for Trauma Activity Recognition from Spoken Language and Environmental Sound.

1. Statistical modeling and recognition of surgical workflow.

2. Speech Intention Classification with Multimodal Deep Learning.

3. Multimodal Affective Analysis Using Hierarchical Attention Strategy with Word-Level Alignment.

4. Communication during trauma resuscitation: do we know what is happening?

5. Face recognition: a convolutional neural-network approach.

6. Language-Based Process Phase Detection in the Trauma Resuscitation.

7. Activity Recognition for Medical Teamwork Based on Passive RFID.

8. Deep Learning for RFID-Based Activity Recognition.

9. Hybrid Attention based Multimodal Network for Spoken Language Classification.

1. Video-based Concurrent Activity Recognition for Trauma Resuscitation.