z-logo
open-access-imgOpen Access
Deformable Part-based Fully Convolutional Network for Object Detection
Author(s) -
Taylor Mordan,
Nicolas Thome,
Matthieu Cord,
Gilles Hénaff
Publication year - 2017
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5244/c.31.88
Subject(s) - pascal (unit) , minimum bounding box , computer science , discriminative model , pooling , artificial intelligence , object detection , convolutional neural network , bounding overwatch , focus (optics) , pattern recognition (psychology) , computer vision , object (grammar) , image (mathematics) , physics , optics , programming language
Existing region-based object detectors are limited to regions with fixed box geometry to represent objects, even if those are highly non-rectangular. In this paper we introduce DP-FCN, a deep model for object detection which explicitly adapts to shapes of objects with deformable parts. Without additional annotations, it learns to focus on discriminative elements and to align them, and simultaneously brings more invariance for classification and geometric information to refine localization. DP-FCN is composed of three main modules: a Fully Convolutional Network to efficiently maintain spatial resolution, a deformable part-based RoI pooling layer to optimize positions of parts and build invariance, and a deformation-aware localization module explicitly exploiting displacements of parts to improve accuracy of bounding box regression. We experimentally validate our model and show significant gains. DP-FCN achieves state-of-the-art performances of 83.1% and 80.9% on PASCAL VOC 2007 and 2012 with VOC data only.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom