VOMTC: Vision Objects for Millimeter and Terahertz Communications

Sunwoo Kim,Yongjun Ahn,Daeyoung Park,Byonghyo Shim
DOI: https://doi.org/10.1109/TCCN.2024.3435909
2024-09-14
Abstract:Recent advances in sensing and computer vision (CV) technologies have opened the door for the application of deep learning (DL)-based CV technologies in the realm of 6G wireless communications. For the successful application of this emerging technology, it is crucial to have a qualified vision dataset tailored for wireless applications (e.g., RGB images containing wireless devices such as laptops and cell phones). An aim of this paper is to propose a large-scale vision dataset referred to as Vision Objects for Millimeter and Terahertz Communications (VOMTC). The VOMTC dataset consists of 20,232 pairs of RGB and depth images obtained from a camera attached to the base station (BS), with each pair labeled with three representative object categories (person, cell phone, and laptop) and bounding boxes of the objects. Through experimental studies of the VOMTC datasets, we show that the beamforming technique exploiting the VOMTC-trained object detector outperforms conventional beamforming techniques.
Networking and Internet Architecture,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?