| Literature DB >> 35655713 |
Yanyi Zhang1, Xinyu Li1,2, Ivan Marsic1.
Abstract
Multi-label activity recognition is designed for recognizing multiple activities that are performed simultaneously or sequentially in each video. Most recent activity recognition networks focus on single-activities, that assume only one activity in each video. These networks extract shared features for all the activities, which are not designed for multi-label activities. We introduce an approach to multi-label activity recognition that extracts independent feature descriptors for each activity and learns activity correlations. This structure can be trained end-to-end and plugged into any existing network structures for video classification. Our method outperformed state-of-the-art approaches on four multi-label activity recognition datasets. To better understand the activity-specific features that the system generated, we visualized these activity-specific features in the Charades dataset. The code will be released later.Entities:
Year: 2021 PMID: 35655713 PMCID: PMC9159520 DOI: 10.1109/cvpr46437.2021.01439
Source DB: PubMed Journal: Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit ISSN: 1063-6919