Multimodal Deep Learning

This book is the result of a seminar in which we reviewed multimodal approaches and attempted to create a solid overview of the field, starting with the current state-of-the-art approaches in the two subfields of Deep Learning individually. Further, modeling frameworks are discussed where one modality is transformed into the other, as well as models in which one modality is utilized to enhance representation learning for the other. To conclude the second part, architectures with a focus on handling both modalities simultaneously are introduced. Finally, we also cover other modalities as well as general-purpose multimodal models, which are able to handle different tasks on different modalities within one unified architecture. One interesting application (Generative Art) caps off the book.

Authors: Cem Akkus, Luyang Chu, Vladana Djakovic, Steffen Jauch-Walser, Philipp Koch, Giacomo Loss, Christopher Marquardt, Marco Moldovan, Matthias Aßenmacher, Christian Heumann, Daniel Schalk, Jann Goschenhofer

Topics: Machine Learning, Computation and Language

preprint / 2023
