A comprehensive ensemble classification techniques detecting and managing concept drift in dynamic imbalanced data streams

DOI: https://doi.org/10.1007/s11276-024-03742-0
IF: 2.701
2024-04-24
Wireless Networks
Abstract: Data stream mining is essential in fields such as education, the Internet of Things (IoT), social media, entertainment, weather monitoring, and finance, because applications in these sectors continuously generate huge amounts of data. These data streams are prone to concept drift and are also heterogeneous and imbalanced. Contemporary methods for imbalanced learning in data mining often employ classifiers tailored to the number of features required for categorization. Because data distributions change constantly and data streams are unbounded and fast, controlling concept drift is a necessity. Concept drift is an obstacle in heterogeneous stream data mining, marked by noticeable variations that range from massive to more complex changes. Conventional approaches to drift often employ fixed-size blocks or windows, which makes it hard to handle events that change continuously. This paper introduces a novel approach called "Ensemble Classification Techniques Detecting and Managing Concept Drift in Dynamic and Imbalanced Data Streams" to address these issues. The method aims to adjust to different types of concept drift by providing a precise and flexible classification of distinct data streams. The proposed ensemble classifier contributes to stream data mining by addressing the intricate challenges associated with dynamic concept drift. In the experiments, the proposed method outperforms existing methods: it obtains a precision of 69.28% and a recall of 69.54%, which gives it an advantage over other methods that produce almost identical results.
computer science, information systems; telecommunications; engineering, electrical & electronic
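
The abstract's remark that fixed-size blocks or windows struggle with continuously changing events can be illustrated with a minimal sketch. This is not the paper's ensemble method; the function name `fixed_window_drift`, the 0/1 error stream, and the `window` and `threshold` parameters are all assumptions chosen for illustration. The sketch compares the error rate of each fixed-size block with that of the previous block and raises an alarm when the rate jumps by more than the threshold.

```python
def fixed_window_drift(errors, window=50, threshold=0.2):
    """Hypothetical fixed-block drift check (illustration only).

    `errors` is a sequence of 0/1 flags (1 = the classifier was wrong).
    The stream is cut into consecutive, non-overlapping blocks of
    `window` samples. An alarm is recorded when a block's error rate
    exceeds the previous block's rate by more than `threshold`.
    Each alarm is reported as the number of samples seen so far,
    i.e. the end of the block that triggered it.
    """
    alarms = []
    prev_rate = None
    for start in range(0, len(errors) - window + 1, window):
        block = errors[start:start + window]
        rate = sum(block) / window
        if prev_rate is not None and rate - prev_rate > threshold:
            alarms.append(start + window)
        prev_rate = rate
    return alarms


# Synthetic stream (assumed data): the classifier is always right for
# 75 samples, then always wrong, so the drift occurs at sample 75,
# in the middle of the second block.
stream = [0] * 75 + [1] * 75

late_alarms = fixed_window_drift(stream, window=50, threshold=0.2)
missed_alarms = fixed_window_drift(stream, window=50, threshold=0.6)
print(late_alarms)    # alarm positions with a loose threshold
print(missed_alarms)  # alarm positions with a strict threshold
```

With the loose threshold, the first alarm arrives only at the end of the block that contains the drift, 25 samples after the change. With the strict threshold, the change is split across two blocks, each jump looks smaller than it is, and no alarm is raised at all. Both effects follow from the block boundaries being fixed in advance rather than adapting to the stream.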