z-logo
open-access-imgOpen Access
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Author(s) -
Yidi Li,
Hong Liu,
Hao Tang
Publication year - 2022
Publication title -
proceedings of the ... aaai conference on artificial intelligence
Language(s) - English
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v36i2.20035
Subject(s) - computer science , robustness (evolution) , artificial intelligence , speech recognition , modal , audio signal processing , modalities , audio visual , perception , sensor fusion , audio signal , computer vision , pattern recognition (psychology) , speech coding , social science , biochemistry , chemistry , multimedia , neuroscience , sociology , biology , polymer chemistry , gene

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here