Open Access
History Aware Multimodal Transformer for Vision-and-Language Navigation
Author(s) - Shizhe Chen, Pierre-Louis Guhur, Cordelia Schmid, Ivan Laptev
Publication year - 2021
Publication title - HAL (Le Centre pour la Communication Scientifique Directe)
Language(s) - Uncategorized
Resource type - Conference proceedings
Subject(s) - computer science, transformer, artificial intelligence, reinforcement learning, spatial relation, relation (database), computer vision, human–computer interaction, engineering, voltage, database, electrical engineering