Abstract:Category-specific 3D object shape models have greatly boosted the recent advances in object detection, recognition and segmentation. However, even the most advanced approach for learning 3D object shapes still requires heavy manual annotations on large-scale 2D images. Such annotations include object categories, object keypoints, and figure-ground segmentation for the instances in each image. In particular, annotating figure-ground segmentation is unbearably labor-intensive and time-consuming. To address this problem, this paper devotes to learn category-specific 3D shape models under weak supervision, where only object categories and keypoints are required to be manually annotated on the training 2D images. By exploring the underlying relationship between two tasks: object segmentation and category-specific 3D shape reconstruction, we propose a novel weakly-supervised learning framework to jointly address these two tasks and combine them to boost the final performance of the learned 3D shape models. Moreover, learning without using figure-ground segmentation leads to ambiguous solutions. To this end, we develop the confidence weighting schemes in the viewpoint estimation and 3D shape learning procedure. These schemes effectively reduce the confusion caused by the noisy data and thus increase the chances for recovering more reliable 3D object shapes. Comprehensive experiments on the challenging PASCAL VOC benchmark show that our framework achieves comparable performance with the state-of-the-art methods that use expensive manual segmentation-level annotations. In addition, our experiments also demonstrate that our 3D shape models improve object segmentation performance.

Deep Learning Shape Priors for Object Segmentation

LEARNING SHAPE PRIORS BY PAIRWISE COMPARISON FOR ROBUST SEMANTIC SEGMENTATION

Shape Sparse Representation for Joint Object Classification and Segmentation

Learning Universal Shape Dictionary for Realtime Instance Segmentation

Scribble-Based 3D Shape Segmentation via Weakly-Supervised Learning

Deep Convolutional Neural Networks Meet Variational Shape Compactness Priors for Image Segmentation

Deep Optimized Priors for 3D Shape Modeling and Reconstruction

Optimal Multi-Object Segmentation with Novel Gradient Vector Flow Based Shape Priors

Unsupervised 3D Shape Segmentation and Co-Segmentation Via Deep Learning.

Local Deep Feature Learning Framework for 3D Shape.

Reduced Set Density Estimator For Object Segmentation Based On Shape Probabilistic Representation

Learning Shape Priors for Single-View 3D Completion and Reconstruction

Learning with Explicit Shape Priors for Medical Image Segmentation

Nonparametric Joint Shape and Feature Priors for Image Segmentation

DeepShape: Deep-Learned Shape Descriptor for 3D Shape Retrieval

Weakly-Supervised Learning of Category-Specific 3D Object Shapes.

Shape generation via learning an adaptive multimodal prior

Deep-SLAM++: Object-level RGBD SLAM based on class-specific deep shape priors

Simultaneous variational image segmentation and object recognition via shape sparse representation

PartNet: A Recursive Part Decomposition Network for Fine-grained and Hierarchical Shape Segmentation

Deep Convolutional Neural Networks with Spatial Regularization, Volume and Star-shape Priori for Image Segmentation