Vis enkel innførsel

dc.contributor.authorAndrade Mancisidor, Rogelio
dc.contributor.authorKampffmeyer, Michael Christian
dc.contributor.authorAas, Kjersti
dc.contributor.authorJenssen, Robert
dc.date.accessioned2024-01-11T13:37:25Z
dc.date.available2024-01-11T13:37:25Z
dc.date.issued2023-11-02
dc.description.abstractDeep generative models with latent variables have been used lately to learn joint representations and generative processes from multi-modal data, which depict an object from different viewpoints. These two learning mechanisms can, however, conflict with each other and representations can fail to embed information on the data modalities. This research studies the realistic scenario in which all modalities and class labels are available for model training, e.g. images or handwriting, but where some modalities and labels required for downstream tasks are missing, e.g. text or annotations. We show, in this scenario, that the variational lower bound limits mutual information between joint representations and missing modalities. We, to counteract these problems, introduce a novel conditional multi-modal discriminative model that uses an informative prior distribution and optimizes a likelihood-free objective function that maximizes mutual information between joint representations and missing modalities. Extensive experimentation demonstrates the benefits of our proposed model, empirical results show that our model achieves state-of-the-art results in representative problems such as downstream classification, acoustic inversion, and image and annotation generation.en_US
dc.identifier.citationAndrade Mancisidor, Kampffmeyer, Aas, Jenssen. Discriminative multimodal learning via conditional priors in generative models. Neural Networks. 2023;169en_US
dc.identifier.cristinIDFRIDAID 2214165
dc.identifier.doi10.1016/j.neunet.2023.10.048
dc.identifier.issn0893-6080
dc.identifier.issn1879-2782
dc.identifier.urihttps://hdl.handle.net/10037/32434
dc.language.isoengen_US
dc.publisherElsevieren_US
dc.relation.journalNeural Networks
dc.rights.accessRightsopenAccessen_US
dc.rights.holderCopyright 2023 The Author(s)en_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0en_US
dc.rightsAttribution 4.0 International (CC BY 4.0)en_US
dc.titleDiscriminative multimodal learning via conditional priors in generative modelsen_US
dc.type.versionpublishedVersionen_US
dc.typeJournal articleen_US
dc.typeTidsskriftartikkelen_US
dc.typePeer revieweden_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Attribution 4.0 International (CC BY 4.0)
Med mindre det står noe annet, er denne innførselens lisens beskrevet som Attribution 4.0 International (CC BY 4.0)