Toward New Retail: A Benchmark Dataset for Smart Unmanned Vending Machines

Haijun Zhang,Donghai Li,Yuzhu Ji,Haibin Zhou,Weiwei Wu,Kai Liu
DOI: https://doi.org/10.1109/tii.2019.2954956
IF: 12.3
2020-12-01
IEEE Transactions on Industrial Informatics
Abstract:Deep learning is a popular direction in computer vision and digital image processing. It is widely utilized in many fields, such as robot navigation, intelligent video surveillance, industrial inspection, and aerospace. With the extensive use of deep learning techniques, classification and object detection algorithms have been rapidly developed. In recent years, with the introduction of the concept of "unmanned retail," object detection, and image classification play a central role in unmanned retail applications. However, open-source datasets of traditional classification and object detection have not yet been optimized for application scenarios of unmanned retail. Currently, classification and object detection datasets do not exist that focus on unmanned retail solely. Therefore, in order to promote unmanned retail applications by using deep learning-based classification and object detection, in this article we collected more than 30 000 images of unmanned retail containers using a refrigerator affixed with different cameras under both static and dynamic recognition environments. These images were categorized into ten kinds of beverages. After manual labeling, images in our constructed dataset contained 155 153 instances, each of which was annotated with a bounding box. We performed extensive experiments on this dataset using ten state-of-the-art deep learning-based models. Experimental results indicate great potential of using these deep learning-based models for real-world smart unmanned vending machines.
automation & control systems,computer science, interdisciplinary applications,engineering, industrial
What problem does this paper attempt to address?
The paper attempts to address the issue of the current lack of specialized image classification and object detection datasets for Unmanned Vending Machines (UVMs) in the unmanned retail scenario. Although existing classification and object detection datasets perform well in other fields, they are not optimized for the specific application scenarios of unmanned retail. Therefore, the authors collected over 30,000 images from inside UVMs and divided them into 10 common beverage categories, with each instance manually annotated, including bounding box information. By constructing this large-scale dataset, the authors hope to promote the development of UVM applications based on deep learning. Specifically, the main objectives of the paper include: 1. **Constructing a benchmark dataset**: To facilitate the application of UVMs, the authors constructed a large-scale multi-category beverage detection dataset, including images of static and dynamic purchasing events. 2. **Evaluating deep learning models**: The authors evaluated 10 state-of-the-art deep learning models on this dataset to verify the potential application value of these models in actual UVMs. 3. **Promoting technological development**: By providing a high-quality dataset specifically for UVMs, the authors hope to promote further research and application of computer vision technology in the unmanned retail field. In summary, this paper aims to address the shortcomings of existing datasets in the unmanned retail scenario by constructing and evaluating a specialized image dataset for UVMs, thereby promoting the development of UVM technology.