A Benchmark Grocery Dataset of Realworld Point Clouds From Single View

Shivanand Venkanna Sheshappanavar, Tejas Anvekar, Shivanand Kundargi, Yufan Wang, Chandra Kambhamettu
2024-04-08
Abstract: Fine-grained grocery object recognition is an important computer vision problem with broad applications in automatic checkout, in-store robotic navigation, and assistive technologies for the visually impaired. Existing datasets on groceries are mainly 2D images. Models trained on these datasets are limited to learning features from the regular 2D grids. While portable 3D sensors such as Kinect were commonly available for mobile phones, sensors such as LiDAR and TrueDepth have recently been integrated into mobile phones. Despite the availability of mobile 3D sensors, there are currently no dedicated real-world large-scale benchmark 3D datasets for grocery. In addition, existing 3D datasets lack fine-grained grocery categories and have limited training samples. Furthermore, collecting data by going around the object, rather than taking a traditional single photo, makes data collection cumbersome. Thus, we introduce a large-scale grocery dataset called 3DGrocery100. It comprises 100 classes, with a total of 87,898 3D point clouds created from 10,755 RGB-D single-view images. We benchmark our dataset on six recent state-of-the-art 3D point cloud classification models. We also benchmark the dataset on few-shot and continual learning point cloud classification tasks. Project Page:
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses a gap in grocery recognition data: existing grocery datasets consist mainly of 2D images, and there is no large-scale, real-world 3D point cloud dataset for groceries. This limits the training and evaluation of 3D deep learning models on grocery recognition tasks. Existing 3D datasets also lack fine-grained grocery categories and have limited training samples, which makes it difficult to develop robust grocery recognition systems.

To close this gap, the paper proposes 3DGrocery100, a large-scale 3D point cloud grocery dataset with 100 categories and a total of 87,898 3D point clouds generated from 10,755 RGB-D single-view images. With this dataset, the authors aim to:

1. **Provide a benchmark 3D point cloud grocery dataset** for training and evaluating 3D point cloud classification models.
2. **Support fine-grained grocery category recognition**, which existing 3D datasets do not cover.
3. **Promote research in few-shot learning and continual learning**: the 3DGrocery63 subset supports few-shot and class-incremental learning tasks, which evaluate how well a model generalizes to new categories.

In summary, this paper aims to advance research and applications in 3D computer vision for grocery recognition by providing a large-scale, high-quality 3D point cloud grocery dataset.
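
To make the idea of creating a point cloud from a single-view RGB-D image more concrete, the sketch below back-projects a depth map into 3D points using the standard pinhole camera model. It is only an illustration and not the authors' pipeline: the function name `depth_to_point_cloud`, the intrinsics `fx`, `fy`, `cx`, `cy`, and the toy depth values are all assumptions introduced here, and the abstract does not describe how the point clouds were actually generated.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def depth_to_point_cloud(
    depth: Sequence[Sequence[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point]:
    """Back-project a depth map into camera-frame 3D points.

    Hypothetical helper for illustration. `depth[v][u]` is the depth at
    pixel column u, row v; values <= 0 are treated as missing and skipped.
    fx, fy are focal lengths in pixels and (cx, cy) is the principal point.
    """
    points: List[Point] = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:
                continue  # no valid depth reading at this pixel
            # Pinhole model: x = (u - cx) * z / fx, y = (v - cy) * z / fy
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


# Toy 2x2 depth map (assumed values) with two missing pixels.
depth = [
    [0.0, 1.0],
    [2.0, 0.0],
]
cloud = depth_to_point_cloud(depth, fx=1.0, fy=1.0, cx=0.5, cy=0.5)
print(cloud)
```

In practice the color of each pixel would be attached to its point, and real intrinsics would come from the sensor's calibration rather than the placeholder values used here.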