Generative Models for Labeling Multi-object Configurations in Images
Author(s) -
Yali Amit,
Alain Trouvé
Publication year - 2006
Publication title -
lecture notes in computer science
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.249
H-Index - 400
eISSN - 1611-3349
pISSN - 0302-9743
DOI - 10.1007/11957959_19
Subject(s) - computer science , conditional independence , artificial intelligence , object (grammar) , generative model , computation , independence (probability theory) , class (philosophy) , generative grammar , hidden variable theory , cognitive neuroscience of visual object recognition , binary number , pattern recognition (psychology) , machine learning , algorithm , mathematics , statistics , physics , arithmetic , quantum mechanics , quantum
We propose a generative approach to the problem of label- ing images containing congurations of objects from multiple classes. The main building blocks are dense statistical models for individual ob- jects. The models assume conditional independence of binary oriented edge variables conditional on a hidden instantiation parameter, which also determines an object support. These models are then be composed to form models for object congurations with various interactions includ- ing occlusion. Choosing the optimal conguration is entirely likelihood based and no decision boundaries need to be pre-learned. Training in- volves estimation of model parameters for each class separately. Both training and classication involve estimation of hidden pose variables which can be computationally intensive. We describe two levels of ap- proximation which facilitate these computations: the Patchwork of Parts (POP) model and the coarse part based models (CPM). A concrete im- plementation of the approach is illustrated on the problem of reading zip-codes.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom