Dynamic Data Source Selection: A Case of Weather Stations for IoT Applications

John Byabazaire,Gregory M. P. O'Hare,Rem Collier,Declan Delaney
DOI: https://doi.org/10.1109/WF-IOT54382.2022.10152030
2022-01-01
Abstract:Quality data is crucial to dependable, trustworthy data analytics and data-driven decision-making in IoT-based applications. Data pre-processing and data cleaning are the only considered strategies for data quality assurance. Whilst these have provided usable results, within IoT, a problem still exists. Data quality issues within IoT proliferate throughout all the stages of the data cycle, with the majority at the source. This work proposes a solution to achieve two main objectives; 1) improving data quality of IoT streams by using alternative sources. 2) demonstrate that improved data quality results in improved application performance. Firstly, results from two experimental settings based on weather data from a real-world deployment show that by seeking alternative sources, data quality improved by 15.6% and 0.2% for completeness and accuracy, respectively. Secondly, two distinct scenarios show that by improving data quality, the model's RMSE improved by an average of 2 and 2.5 from 4 and 17.5, respectively.
What problem does this paper attempt to address?