Literature DB >> 29994351

Multimodal Machine Learning: A Survey and Taxonomy.

Tadas Baltrusaitis, Chaitanya Ahuja, Louis-Philippe Morency.   

Abstract

Our experience of the world is multimodal - we see objects, hear sounds, feel texture, smell odors, and taste flavors. Modality refers to the way in which something happens or is experienced and a research problem is characterized as multimodal when it includes multiple such modalities. In order for Artificial Intelligence to make progress in understanding the world around us, it needs to be able to interpret such multimodal signals together. Multimodal machine learning aims to build models that can process and relate information from multiple modalities. It is a vibrant multi-disciplinary field of increasing importance and with extraordinary potential. Instead of focusing on specific multimodal applications, this paper surveys the recent advances in multimodal machine learning itself and presents them in a common taxonomy. We go beyond the typical early and late fusion categorization and identify broader challenges that are faced by multimodal machine learning, namely: representation, translation, alignment, fusion, and co-learning. This new taxonomy will enable researchers to better understand the state of the field and identify directions for future research.

Entities:  

Year:  2018        PMID: 29994351     DOI: 10.1109/TPAMI.2018.2798607

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  68 in total

1.  Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis.

Authors:  Yao-Hung Hubert Tsai; Martin Q Ma; Muqiao Yang; Ruslan Salakhutdinov; Louis-Philippe Morency
Journal:  Proc Conf Empir Methods Nat Lang Process       Date:  2020-11

Review 2.  The future of sleep health: a data-driven revolution in sleep science and medicine.

Authors:  Ignacio Perez-Pozuelo; Bing Zhai; Joao Palotti; Raghvendra Mall; Michaël Aupetit; Juan M Garcia-Gomez; Shahrad Taheri; Yu Guan; Luis Fernandez-Luque
Journal:  NPJ Digit Med       Date:  2020-03-23

3.  Disentangled-Multimodal Adversarial Autoencoder: Application to Infant Age Prediction With Incomplete Multimodal Neuroimages.

Authors:  Dan Hu; Han Zhang; Zhengwang Wu; Fan Wang; Li Wang; J Keith Smith; Weili Lin; Gang Li; Dinggang Shen
Journal:  IEEE Trans Med Imaging       Date:  2020-11-30       Impact factor: 10.048

Review 4.  Application of machine learning in the management of acute myeloid leukemia: current practice and future prospects.

Authors:  Jan-Niklas Eckardt; Martin Bornhäuser; Karsten Wendt; Jan Moritz Middeke
Journal:  Blood Adv       Date:  2020-12-08

Review 5.  Multimodal deep learning for biomedical data fusion: a review.

Authors:  Sören Richard Stahlschmidt; Benjamin Ulfenborg; Jane Synnergren
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

6.  COVID-19 Detection Through Transfer Learning Using Multimodal Imaging Data.

Authors:  Michael J Horry; Subrata Chakraborty; Manoranjan Paul; Anwaar Ulhaq; Biswajeet Pradhan; Manas Saha; Nagesh Shukla
Journal:  IEEE Access       Date:  2020-08-14       Impact factor: 3.367

7.  Consistent cross-modal identification of cortical neurons with coupled autoencoders.

Authors:  Rohan Gala; Agata Budzillo; Fahimeh Baftizadeh; Jeremy Miller; Nathan Gouwens; Anton Arkhipov; Gabe Murphy; Bosiljka Tasic; Hongkui Zeng; Michael Hawrylycz; Uygar Sümbül
Journal:  Nat Comput Sci       Date:  2021-02-22

8.  Multi-Modal Learning from Video, Eye Tracking, and Pupillometry for Operator Skill Characterization in Clinical Fetal Ultrasound.

Authors:  Harshita Sharma; Lior Drukker; Aris T Papageorghiou; J Alison Noble
Journal:  Proc IEEE Int Symp Biomed Imaging       Date:  2021-05-25

9.  Bayesian metamodeling of complex biological systems across varying representations.

Authors:  Barak Raveh; Liping Sun; Kate L White; Tanmoy Sanyal; Jeremy Tempkin; Dongqing Zheng; Kala Bharath; Jitin Singla; Chenxi Wang; Jihui Zhao; Angdi Li; Nicholas A Graham; Carl Kesselman; Raymond C Stevens; Andrej Sali
Journal:  Proc Natl Acad Sci U S A       Date:  2021-08-31       Impact factor: 11.205

10.  Real-Time Human Recognition at Night via Integrated Face and Gait Recognition Technologies.

Authors:  Samah A F Manssor; Shaoyuan Sun; Mohammed A M Elhassan
Journal:  Sensors (Basel)       Date:  2021-06-24       Impact factor: 3.576

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.