Detection of anomalies and Data Drift in a time-series dismissal prediction system

Nataliya Boyko,Roman Kovalchuk
DOI: https://doi.org/10.52866/ijcsm.2024.05.03.012
2024-07-08
Iraqi Journal for Computer Science and Mathematics
Abstract:The purpose of the study is to develop a system that automatically processes data based on existingand newly entered data, especially with the aim of ensuring high data quality by detecting and eliminatinganomalies. The quantile filtering method, Chebyshev’s inequality, Kolmogorov-Smirnov two-sample test, andothers should be noted among the methods used. In the course of the research, the theoretical aspects of themethods, various principles of detecting anomalies for different types of data were considered and analysed.Different principles and approaches applied to anomaly detection in different contexts were explored. The resultsof the analysis and the selection of optimal methods for detecting anomalies in various types of data are importantfor the effective functioning of the automatic data processing system. This will make it possible to achieveaccuracy and reliability in the detection of anomalies and ensure high quality of data used in the machine learningsystem.
What problem does this paper attempt to address?