Discriminative multimodal learning via conditional priors in generative models

Andrade Mancisidor, Rogelio; Kampffmeyer, Michael Christian; Aas, Kjersti; Jenssen, Robert

dc.contributor.author	Andrade Mancisidor, Rogelio
dc.contributor.author	Kampffmeyer, Michael Christian
dc.contributor.author	Aas, Kjersti
dc.contributor.author	Jenssen, Robert
dc.date.accessioned	2024-01-11T13:37:25Z
dc.date.available	2024-01-11T13:37:25Z
dc.date.issued	2023-11-02
dc.description.abstract	Deep generative models with latent variables have recently been used to learn joint representations and generative processes from multi-modal data, which depict an object from different viewpoints. These two learning mechanisms can, however, conflict with each other, and representations can fail to embed information on the data modalities. This research studies the realistic scenario in which all modalities and class labels are available for model training, e.g. images or handwriting, but where some modalities and labels required for downstream tasks are missing, e.g. text or annotations. We show that, in this scenario, the variational lower bound limits mutual information between joint representations and missing modalities. To counteract these problems, we introduce a novel conditional multi-modal discriminative model that uses an informative prior distribution and optimizes a likelihood-free objective function that maximizes mutual information between joint representations and missing modalities. Extensive experimentation demonstrates the benefits of our proposed model: empirical results show that it achieves state-of-the-art results in representative problems such as downstream classification, acoustic inversion, and image and annotation generation.	en_US
dc.identifier.citation	Andrade Mancisidor, Kampffmeyer, Aas, Jenssen. Discriminative multimodal learning via conditional priors in generative models. Neural Networks. 2023;169	en_US
dc.identifier.cristinID	FRIDAID 2214165
dc.identifier.doi	10.1016/j.neunet.2023.10.048
dc.identifier.issn	0893-6080
dc.identifier.issn	1879-2782
dc.identifier.uri	https://hdl.handle.net/10037/32434
dc.language.iso	eng	en_US
dc.publisher	Elsevier	en_US
dc.relation.journal	Neural Networks
dc.rights.accessRights	openAccess	en_US
dc.rights.holder	Copyright 2023 The Author(s)	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/4.0	en_US
dc.rights	Attribution 4.0 International (CC BY 4.0)	en_US
dc.title	Discriminative multimodal learning via conditional priors in generative models	en_US
dc.type.version	publishedVersion	en_US
dc.type	Journal article	en_US
dc.type	Tidsskriftartikkel	en_US
dc.type	Peer reviewed	en_US

Associated file(s)

Name: article.pdf
Size: 2.431Mb
Format: PDF

This item appears in the following collection(s)

Artikler, rapporter og annet (fysikk og teknologi) [1062]

Unless otherwise stated, this item's license is described as Attribution 4.0 International (CC BY 4.0)