Associative Variational Auto-Encoder with Distributed Latent Spaces and Associators
Author(s) -
Dae Ung Jo,
ByeongJu Lee,
Jongwon Choi,
Haanju Yoo,
Jin Young Choi
Publication year - 2020
Publication title -
proceedings of the aaai conference on artificial intelligence
Language(s) - English
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v34i07.6778
Subject(s) - associative property , modality (human–computer interaction) , computer science , latent variable , modalities , modal , autoencoder , encoder , space (punctuation) , artificial intelligence , association (psychology) , pattern recognition (psychology) , artificial neural network , mathematics , psychology , social science , chemistry , sociology , polymer chemistry , pure mathematics , operating system , psychotherapist
In this paper, we propose a novel structure for a multi-modal data association referred to as Associative Variational Auto-Encoder (AVAE). In contrast to the existing models using a shared latent space among modalities, our structure adopts distributed latent spaces for multi-modalities which are connected through cross-modal associators. The proposed structure successfully associates even heterogeneous modality data and easily incorporates the additional modality to the entire network via the associator. Furthermore, in our structure, only a small amount of supervised (paired) data is enough to train associators after training auto-encoders in an unsupervised manner. Through experiments, the effectiveness of the proposed structure is validated on various datasets including visual and auditory data.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom