CODE: Compact IoT Data Collection with Precise Matrix Sampling and Efficient Inference

Huali Lu,Feng Lyu,Ju Ren,Jiadi Yu,Fan Wu,Yaoxue Zhang,Xuemin (Sherman) Shen
DOI: https://doi.org/10.1109/icdcs54860.2022.00077
2022-01-01
Abstract:It is unpractical to conduct full-size data collection in ubiquitous IoT data systems due to the energy constraints of IoT sensors and large system scales. Although sparse sensing technologies have been proposed to infer missing data based on partial sampled data, they usually focus on data inference while neglecting the sampling process, restraining the inference efficiency. In addition, their inferring methods highly depend on data linearity correlations, which become less effective when data are not linearly correlated. In this paper, we propose, Compact IOT Data CollEction, namely CODE, to conduct precise data matrix sampling and efficient inference. Particularly, CODE integrates two major components, i.e., cluster-based matrix sampling and Generative Adversarial Networks (GAN)-based matrix inference, to reduce the data collection cost and guarantee the data benefits, respectively. In the sampling component, a cluster-based sampling approach is devised, in which data clustering is first conducted and then a two-step sampling is performed in accordance with the number of clusters and clustering errors. For the inference component, a GAN-based model is developed to estimate the full matrix, which consists of a generator network that learns to generate a fake matrix, and a discriminator network that learns to discriminate the fake matrix from the real one. A reference implementation of CODE is conducted under three operational large-scale IoT systems, and extensive data-driven experiment results are provided to demonstrate its efficiency and robustness.
What problem does this paper attempt to address?