Shape Representations for Object Recognition

dc.contributor.advisorKostas Daniilidis
dc.contributor.advisorBen Taskar
dc.contributor.advisorJianbo Shi
dc.contributor.authorToshev, Alexander
dc.contributor.authorToshev, Alexander
dc.date2023-05-17T05:55:28.000
dc.date.accessioned2023-05-22T17:44:17Z
dc.date.available2011-01-26T00:00:00Z
dc.date.issued2011-05-16
dc.date.submitted2011-01-26T07:45:40-08:00
dc.description.abstractThe problem of object recognition has been at the forefront of computer vision research in the last decade. The most successful approaches have used mainly edge- or texture-based representations. The shape of the object outline, albeit widely used for pre-segmented objects, has found limited applicability to the detection problem in real images. The fact that shape is a truly holistic global percept is challenging because background structure and interior object contours can easily clutter a global shape descriptor and render it unusable. Therefore, figure-ground organization, which segments the object of interest and removes the cluttering contours, is of paramount importance. However, purely bottom-up segmentation rarely provides a good object outline suitable for shape-based detection. In this thesis, we study a novel shape representation, called a chordiogram, which allows us to address the above challenges. The chordiogram is a holistic shape descriptor capturing global geometric relationships between object boundaries. Based on the chordiogram, we introduce a boundary structure segmentation model which efficiently integrates region and boundary grouping principles with shape-based matching. This method uses holistic shape for simultaneous object segmentation and detection in highly cluttered scenes. We apply it on established recognition benchmarks and achieve state-of-the art results. Further, we study the applicability of shape for object detection in videos. We show that shape-based representations can be used not only to robustly detect moving objects but also to provide a rough estimate of their pose. For this purpose, we utilize freely available large datasets of 3D synthetic models. Beyond linking shape matching with perceptual grouping, we study the interplay between feature matching and perceptual grouping. We introduce co-salient regions -- coherent, corresponding segments in two or more images -- and describe two algorithms for their detection. Co-salient regions are applied to two problems -- wide-baseline stereo and motion segmentation. In the former problem we show how to estimate correspondences between regions and improve feature matches, while in the latter segments representing same object parts are tracked across multiple frames in a video.
dc.description.degreeDoctor of Philosophy (PhD)
dc.identifier.urihttps://repository.upenn.edu/handle/20.500.14332/30574
dc.legacy.articleid1407
dc.legacy.fulltexturlhttps://repository.upenn.edu/cgi/viewcontent.cgi?article=1407&context=edissertations&unstamped=1
dc.source.issue354
dc.source.journalPublicly Accessible Penn Dissertations
dc.source.statuspublished
dc.subject.otherobject recognition
dc.subject.othershape
dc.subject.othershape matching
dc.subject.otherobject segmentation
dc.subject.otherArtificial Intelligence and Robotics
dc.titleShape Representations for Object Recognition
dc.typeDissertation/Thesis
digcom.contributor.authorisAuthorOfPublication|email:toshev@seas.upenn.edu|institution:University of Pennsylvania|Toshev, Alexander
digcom.date.embargo2011-01-26T00:00:00-08:00
digcom.identifieredissertations/354
digcom.identifier.contextkey1746747
digcom.identifier.submissionpathedissertations/354
digcom.typedissertation
dspace.entity.typePublication
relation.isAuthorOfPublication4fbf34f8-6dea-4e6f-9463-73f44961c266
relation.isAuthorOfPublication.latestForDiscovery4fbf34f8-6dea-4e6f-9463-73f44961c266
upenn.graduate.groupComputer and Information Science
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Dissertation.pdf
Size:
20.26 MB
Format:
Adobe Portable Document Format