Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN | Zendy

Hang Xu | Zendy; Linpu Fang | Zendy; Xiaodan Liang | Zendy; Wenxiong Kang | Zendy; Zhenguo Li | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN

Author(s) -

Hang Xu,

Linpu Fang,

Xiaodan Liang,

Wenxiong Kang,

Zhenguo Li

Publication year - 2020

Publication title -

proceedings of the aaai conference on artificial intelligence

Language(s) - English

Resource type - Journals

eISSN - 2374-3468

pISSN - 2159-5399

DOI - 10.1609/aaai.v34i07.6937

Subject(s) - computer science , exploit , artificial intelligence , graph , granularity , representation (politics) , domain (mathematical analysis) , object detection , semantic feature , feature (linguistics) , object (grammar) , feature learning , machine learning , pattern recognition (psychology) , natural language processing , theoretical computer science , mathematics , politics , political science , law , operating system , mathematical analysis , linguistics , philosophy , computer security

The dominant object detection approaches treat each dataset separately and fit towards a specific domain, which cannot adapt to other domains without extensive retraining. In this paper, we address the problem of designing a universal object detection model that exploits diverse category granularity from multiple domains and predict all kinds of categories in one system. Existing works treat this problem by integrating multiple detection branches upon one shared backbone network. However, this paradigm overlooks the crucial semantic correlations between multiple domains, such as categories hierarchy, visual similarity, and linguistic relationship. To address these drawbacks, we present a novel universal object detector called Universal-RCNN that incorporates graph transfer learning for propagating relevant semantic information across multiple datasets to reach semantic coherency. Specifically, we first generate a global semantic pool by integrating all high-level semantic representation of all the categories. Then an Intra-Domain Reasoning Module learns and propagates the sparse graph representation within one dataset guided by a spatial-aware GCN. Finally, an Inter-Domain Transfer Module is proposed to exploit diverse transfer dependencies across all domains and enhance the regional feature representation by attending and transferring semantic contexts globally. Extensive experiments demonstrate that the proposed method significantly outperforms multiple-branch models and achieves the state-of-the-art results on multiple object detection benchmarks (mAP: 49.1% on COCO).

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research