Learning to Segment Object Proposals via Recursive Neural Networks

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

To avoid the exhaustive search over locations and scales, current state-of-the-art object detection systems usually involve a crucial component generating a batch of candidate object proposals from images. In this paper, we present a simple yet effective approach for segmenting object proposals via a deep architecture of recursive neural networks (RNNs), which hierarchically groups regions for detecting object candidates over scales. Unlike traditional methods that mainly adopt fixed similarity measures for merging regions or finding object proposals, our approach adaptively learns the region merging similarity and the objectness measure during the process of hierarchical region grouping. Specifically, guided by a structured loss, the RNN model jointly optimizes the cross-region similarity metric with the region merging process as well as the objectness prediction. During inference of the object proposal generation, we introduce randomness into the greedy search to cope with the ambiguity of grouping regions. Extensive experiments on standard benchmarks, e.g., PASCAL VOC and ImageNet, suggest that our approach is capable of producing object proposals with high recall while well preserving the object boundaries and outperforms other existing methods in both accuracy and efficiency.

Related collections

Most cited references 11

Record: found
Abstract: not found
Book Chapter: not found

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Kaiming He, Xiangyu Zhang, Shaoqing Ren … (2014)

0 comments Cited 303 times – based on 0 reviews

Bookmark

Record: found
Abstract: not found
Book Chapter: not found

Edge Boxes: Locating Object Proposals from Edges

C Zitnick, Piotr Dollar (2014)

0 comments Cited 244 times – based on 0 reviews

Bookmark

Record: found
Abstract: found
Article: not found

Measuring the objectness of image windows.

Bogdan Alexe, Thomas Deselaers, Vittorio D. Ferrari (2012)

We present a generic objectness measure, quantifying how likely it is for an image window to contain an object of any class. We explicitly train it to distinguish objects with a well-defined boundary in space, such as cows and telephones, from amorphous background elements, such as grass and road. The measure combines in a Bayesian framework several image cues measuring characteristics of objects, such as appearing different from their surroundings and having a closed boundary. These include an innovative cue to measure the closed boundary characteristic. In experiments on the challenging PASCAL VOC 07 dataset, we show this new cue to outperform a state-of-the-art saliency measure, and the combined objectness measure to perform better than any cue alone. We also compare to interest point operators, a HOG detector, and three recent works aiming at automatic object segmentation. Finally, we present two applications of objectness. In the first, we sample a small numberof windows according to their objectness probability and give an algorithm to employ them as location priors for modern class-specific object detectors. As we show experimentally, this greatly reduces the number of windows evaluated by the expensive class-specific model. In the second application, we use objectness as a complementary score in addition to the class-specific model, which leads to fewer false positives. As shown in several recent papers, objectness can act as a valuable focus of attention mechanism in many other applications operating on image windows, including weakly supervised learning of object categories, unsupervised pixelwise segmentation, and object tracking in video. Computing objectness is very efficient and takes only about 4 sec. per image.

0 comments Cited 190 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Publication date Created: 2016-12-03

Article

ArXiV ID: 1612.01057

SO-VID: e639b94a-3c4d-4747-89f1-07519dce41cb

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

History

Custom metadata

Comments 12 pages

Categories cs.CV

ScienceOpen disciplines: Computer vision & Pattern recognition

Data availability:

ScienceOpen disciplines: Computer vision & Pattern recognition

Learning to Segment Object Proposals via Recursive Neural Networks

Read this article at

Abstract

Related collections

Recursive Rule based Visual Categorization

Most cited references 11

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Edge Boxes: Locating Object Proposals from Edges

Measuring the objectness of image windows.

Author and article information

Journal

Article

History

Custom metadata

Comments

Comment on this article

Similar content 93

Most referenced authors 71