Open Access
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Author(s) - Yidi Li, Hong Liu, Hao Tang
Publication year - 2022
Publication title - Proceedings of the AAAI Conference on Artificial Intelligence
Language(s) - English
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v36i2.20035
Subject(s) - computer science, robustness (evolution), artificial intelligence, speech recognition, modal, audio signal processing, modalities, audio visual, perception, sensor fusion, audio signal, computer vision, pattern recognition (psychology), speech coding, social science, biochemistry, chemistry, multimedia, neuroscience, sociology, biology, polymer chemistry, gene
