Object Detection Combining Recognition and Segmentation

Loading...
Thumbnail Image
Penn collection
Departmental Papers (CIS)
Degree type
Discipline
Subject
Computer Sciences
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Wang, Liming
Song, Gang
Shang, I-fan
Contributor
Abstract

We develop an object detection method combining top-down recognition with bottom-up image segmentation. There are two main steps in this method: a hypothesis generation step and a verification step. In the top-down hypothesis generation step, we design an improved Shape Context feature, which is more robust to object deformation and background clutter. The improved Shape Context is used to generate a set of hypotheses of object locations and figureground masks, which have high recall and low precision rate. In the verification step, we first compute a set of feasible segmentations that are consistent with top-down object hypotheses, then we propose a False Positive Pruning(FPP) procedure to prune out false positives. We exploit the fact that false positive regions typically do not align with any feasible image segmentation. Experiments show that this simple framework is capable of achieving both high recall and high precision with only a few positive training examples and that this method can be generalized to many object classes.

Advisor
Date of presentation
2007-01-01
Conference name
Departmental Papers (CIS)
Conference dates
2023-05-17T07:09:12.000
Conference location
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Liming Wang, Jianbo Shi, Gang Song, and I-Fan Shen. 2007. Object detection combining recognition and segmentation. In Proceedings of the 8th Asian conference on Computer vision - Volume Part I (ACCV'07), Yasushi Yagi, Sing Bing Kang, In So Kweon, and Hongbin Zha (Eds.), Vol. Part I. Springer-Verlag, Berlin, Heidelberg, 189-199. © ACM, 2007. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Proceedings of the 8th Asian conference on Computer vision - Volume Part I , {(2007)}
Recommended citation
Collection