Lab Papers (GRASP)

Document Type

Conference Paper

Date of this Version



Copyright 2009 IEEE. Reprinted from:

Toshev, A.; Makadia, A.; Daniilidis, K., "Shape-based object recognition in videos using 3D synthetic object models," Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on , vol., no., pp.288-295, 20-25 June 2009


This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of the University of Pennsylvania's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to By choosing to view this document, you agree to all provisions of the copyright laws protecting it.


In this paper we address the problem of recognizing moving objects in videos by utilizing synthetic 3D models. We use only the silhouette space of the synthetic models making thus our approach independent of appearance. To deal with the decrease in discriminability in the absence of appearance, we align sequences of object masks from video frames to paths in silhouette space. We extract object silhouettes from video by an integration of feature tracking, motion grouping of tracks, and co-segmentation of successive frames. Subsequently, the object masks from the video are matched to 3D model silhouettes in a robust matching and alignment phase. The result is a matching score for every 3D model to the video, along with a pose alignment of the model to the video. Promising experimental results indicate that a purely shape-based matching scheme driven by synthetic 3D models can be successfully applied for object recognition in videos.


image matching, image segmentation, image sequences, object recognition, video signal processing, 3D synthetic object model, alignment phase, cosegmentation, feature tracking, object mask sequence, pose alignment, robust matching phase, shape-based matching scheme, shape-based object recognition, silhouette space, video frame



Date Posted: 08 October 2009