Literature DB >> 33799412

Detection of Important Scenes in Baseball Videos via a Time-Lag-Aware Multimodal Variational Autoencoder.

Kaito Hirasawa1, Keisuke Maeda2, Takahiro Ogawa3, Miki Haseyama3.   

Abstract

A new method for the detection of important scenes in baseball videos via a time-lag-aware multimodal variational autoencoder (Tl-MVAE) is presented in this paper. Tl-MVAE estimates latent features calculated from tweet, video, and audio features extracted from tweets and videos. Then, important scenes are detected by estimating the probability of the scene being important from estimated latent features. It should be noted that there exist time-lags between tweets posted by users and videos. To consider the time-lags between tweet features and other features calculated from corresponding multiple previous events, the feature transformation based on feature correlation considering such time-lags is newly introduced to the encoder in MVAE in the proposed method. This is the biggest contribution of the Tl-MVAE. Experimental results obtained from actual baseball videos and their corresponding tweets show the effectiveness of the proposed method.

Entities:  

Keywords:  Twitter; detection of important scenes; multimodal variational autoencoder; sports video; time-lags

Year:  2021        PMID: 33799412     DOI: 10.3390/s21062045

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  1 in total

1.  Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets.

Authors:  Kaito Hirasawa; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Journal:  Sensors (Basel)       Date:  2022-03-23       Impact factor: 3.576

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.