A Benchmark Grocery Dataset of Realworld Point Clouds From Single View

Shivanand Venkanna Sheshappanavar,Tejas Anvekar,Shivanand Kundargi,Yufan Wang,Chandra Kambhamettu

2024-04-08

Abstract:Fine-grained grocery object recognition is an important computer vision problem with broad applications in automatic checkout, in-store robotic navigation, and assistive technologies for the visually impaired. Existing datasets on groceries are mainly 2D images. Models trained on these datasets are limited to learning features from the regular 2D grids. While portable 3D sensors such as Kinect were commonly available for mobile phones, sensors such as LiDAR and TrueDepth, have recently been integrated into mobile phones. Despite the availability of mobile 3D sensors, there are currently no dedicated real-world large-scale benchmark 3D datasets for grocery. In addition, existing 3D datasets lack fine-grained grocery categories and have limited training samples. Furthermore, collecting data by going around the object versus the traditional photo capture makes data collection cumbersome. Thus, we introduce a large-scale grocery dataset called 3DGrocery100. It constitutes 100 classes, with a total of 87,898 3D point clouds created from 10,755 RGB-D single-view images. We benchmark our dataset on six recent state-of-the-art 3D point cloud classification models. Additionally, we also benchmark the dataset on few-shot and continual learning point cloud classification tasks. Project Page:

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The problem this paper attempts to address is: existing grocery datasets are primarily focused on 2D images and lack large-scale real-world 3D point cloud datasets, which limits the training and evaluation of 3D deep learning models in grocery recognition tasks. Additionally, existing 3D datasets lack fine-grained grocery categories and have limited training samples. These issues make it difficult to develop robust grocery recognition systems. Specifically, the paper proposes a large-scale 3D point cloud grocery dataset named 3DGrocery100, which contains 100 categories and a total of 87,898 3D point clouds generated from 10,755 RGB-D single-view images. Through this dataset, the authors aim to: 1. **Provide a benchmark 3D point cloud grocery dataset**: for training and evaluating 3D point cloud classification models. 2. **Support fine-grained grocery category recognition**: existing 3D datasets lack fine-grained grocery categories, and the 3DGrocery100 dataset fills this gap. 3. **Promote research in few-shot learning and continual learning**: by introducing the 3DGrocery63 subset, this dataset can be used for few-shot learning and class-incremental learning tasks, evaluating the model's generalization ability on new categories. In summary, this paper aims to advance research and applications in 3D computer vision for grocery recognition by providing a large-scale, high-quality 3D point cloud grocery dataset.

A Benchmark Grocery Dataset of Realworld Point Clouds From Single View

Benchmarking Large-Scale Multi-View 3D Reconstruction Using Realistic Synthetic Images

BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation

A Hierarchical Grocery Store Image Dataset with Visual and Semantic Labels

Building3D: An Urban-Scale Dataset and Benchmarks for Learning Roof Structures from Point Clouds

Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges

Large-Scale Indoor Visual-Geometric Multimodal Dataset and Benchmark for Novel View Synthesis

Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking Applications

RPC: A Large-Scale Retail Product Checkout Dataset

KITchen: A Real-World Benchmark and Dataset for 6D Object Pose Estimation in Kitchen Environments

Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data

UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles

Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving

ClearPose: Large-scale Transparent Object Dataset and Benchmark

Towards Real-World Multi-View Object Classification: Dataset, Benchmark, and Analysis

360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images

The Food Recognition Benchmark: Using Deep Learning to Recognize Food in Images

A 3D INDOOR-OUTDOOR BENCHMARK DATASET FOR LoD3 BUILDING POINT CLOUD SEMANTIC SEGMENTATION

Monocular Image-Based 3-D Model Retrieval: A Benchmark

An Aerial Photogrammetry Benchmark Dataset for Point Cloud Segmentation and Style Translation

A Dataset and Benchmark for Shape Completion of Fruits for Agricultural Robotics