Subgraph and object context‐masked network for scene graph generation | Zendy

Zheng Zhenxing | Zendy; Li Zhendong | Zendy; An Gaoyun | Zendy; Feng Songhe | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Subgraph and object context‐masked network for scene graph generation

Author(s) -

Zheng Zhenxing,

Li Zhendong,

An Gaoyun,

Feng Songhe

Publication year - 2020

Publication title -

iet computer vision

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.38

H-Index - 37

eISSN - 1751-9640

pISSN - 1751-9632

DOI - 10.1049/iet-cvi.2019.0896

Subject(s) - computer science , artificial intelligence , spatial contextual awareness , inference , scene graph , object (grammar) , encode , context (archaeology) , spatial relation , relation (database) , graph , subgraph isomorphism problem , path (computing) , pattern recognition (psychology) , cognitive neuroscience of visual object recognition , computer vision , theoretical computer science , data mining , paleontology , biology , rendering (computer graphics) , biochemistry , chemistry , gene , programming language

Scene graph generation is to recognise objects and their semantic relationships in an image and can help computers understand visual scene. To improve relationship prediction, geometry information is essential and usually incorporated into relationship features. Existing methods use coordinates of objects to encode their spatial layout. However, in this way, they neglect the context of objects. In this study, to take full use of spatial knowledge efficiently, the authors propose a novel subgraph and object context‐masked network (SOCNet) consisting of spatial mask relation inference (SMRI) and hierarchical message passing (HMP) modules to address the scene graph generation task. In particular, to take advantage of spatial knowledge, SMRI masks partial context of object features depending on their spatial layout of objects and corresponding subgraph to facilitate their relationship recognition. To refine the features of objects and subgraphs, they also propose HMP that passes highly correlated messages from both microcosmic and macroscopic aspects through a triple‐path structure including subgraph–subgraph, object–object, and subgraph–object paths. Finally, statistical co‐occurrence probability is used to regularise relationship prediction. SOCNet integrates HMP and SMRI into a unified network, and comprehensive experiments on visual relationship detection and visual genome datasets indicate that SOCNet outperforms several state‐of‐the‐art methods on two common tasks.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research