Abstract: One of the most challenging problems in the field of online learning is concept drift, which deeply influences the classification stability of streaming data. If the data stream is imbalanced, it is even more difficult to detect concept drifts and make an online learner adapt to them. Ensemble algorithms have been found effective for the classification of streaming data with concept drift, whereby an individual classifier is built for each incoming data chunk and its associated weight is adjusted to manage the drift. However, it is difficult to adjust the weights to achieve a balance between the stability and adaptability of the ensemble classifiers. In addition, when the data stream is imbalanced, the use of a fixed-size chunk to build a single classifier can create further problems; the data chunk may contain too few or even no minority class samples (i.e., only majority class samples). A classifier built on such a chunk is unstable in the ensemble. In this article, we propose a chunk-based incremental learning method called adaptive chunk-based dynamic weighted majority (ACDWM) to deal with imbalanced streaming data containing concept drift. ACDWM utilizes an ensemble framework by dynamically weighting the individual classifiers according to their classification performance on the current data chunk. The chunk size is adaptively selected by statistical hypothesis tests to assess whether the classifier built on the current data chunk is sufficiently stable. ACDWM has four advantages compared with the existing methods as follows: 1) it can maintain stability when processing nondrifted streams and rapidly adapt to the new concept; 2) it is entirely incremental, i.e., no previous data need to be stored; 3) it stores a limited number of classifiers to ensure high efficiency; and 4) it adaptively selects the chunk size in the concept drift environment. Experiments on both synthetic and real data sets containing concept drift show that ACDWM outperforms both state-of-the-art chunk-based and online methods.
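
The chunk-based weighting idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' ACDWM implementation: the 1-D nearest-centroid base learner, the G-mean score used as the weight, the `min_weight` pruning threshold, and the names `ChunkEnsemble`, `train_centroid`, and `gmean` are all assumptions made for illustration. The adaptive chunk-size selection by hypothesis tests is left out entirely.

```python
import math
from statistics import mean
from typing import Callable, List, Tuple

Sample = Tuple[float, int]          # (feature value, label in {0, 1})
Classifier = Callable[[float], int]


def train_centroid(chunk: List[Sample]) -> Classifier:
    """Hypothetical base learner: 1-D nearest-centroid classifier."""
    pos = [x for x, y in chunk if y == 1]
    neg = [x for x, y in chunk if y == 0]
    if not pos or not neg:
        # The chunk holds a single class, so always predict that class.
        only = 1 if pos else 0
        return lambda x: only
    c_pos, c_neg = mean(pos), mean(neg)
    return lambda x: 1 if abs(x - c_pos) < abs(x - c_neg) else 0


def gmean(clf: Classifier, chunk: List[Sample]) -> float:
    """Geometric mean of per-class recall (assumed imbalance-aware score)."""
    recalls = []
    for label in (0, 1):
        xs = [x for x, y in chunk if y == label]
        if xs:
            recalls.append(sum(clf(x) == label for x in xs) / len(xs))
    return math.prod(recalls) ** (1 / len(recalls)) if recalls else 0.0


class ChunkEnsemble:
    """Keeps a bounded set of classifiers, re-weighted on every new chunk."""

    def __init__(self, max_size: int = 5, min_weight: float = 0.1) -> None:
        self.max_size = max_size      # assumed cap on stored classifiers
        self.min_weight = min_weight  # assumed pruning threshold
        self.members: List[list] = []  # each entry is [classifier, weight]

    def update(self, chunk: List[Sample]) -> None:
        # Re-weight existing members by their score on the current chunk.
        for member in self.members:
            member[1] = gmean(member[0], chunk)
        # Drop members that no longer fit the current concept.
        self.members = [m for m in self.members if m[1] >= self.min_weight]
        # Train a new member on the current chunk; the chunk is then discarded.
        self.members.append([train_centroid(chunk), 1.0])
        # Keep only the highest-weighted members.
        self.members.sort(key=lambda m: m[1], reverse=True)
        del self.members[self.max_size:]

    def predict(self, x: float) -> int:
        # Weighted majority vote: class 1 votes +w, class 0 votes -w.
        vote = sum(w if clf(x) == 1 else -w for clf, w in self.members)
        return 1 if vote > 0 else 0
```

Calling `update` once per incoming chunk and `predict` on new samples is the intended usage. Because each chunk is used only inside `update`, no past data is stored, and a member whose score collapses after a drift is removed on the next chunk.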

Emril: Ensemble Method Based on Reinforcement Learning for Binary Classification in Imbalanced Drifting Data Streams

Reinforcement Online Active Learning Ensemble for Drifting Imbalanced Data Streams

Intensive Class Imbalance Learning in Drifting Data Streams

Pro-IDD: Pareto-based Ensemble for Imbalanced and Drifting Data Streams

A comprehensive ensemble classification techniques detecting and managing concept drift in dynamic imbalanced data streams

A comprehensive active learning method for multiclass imbalanced data streams with concept drift

Adaptive Chunk-Based Dynamic Weighted Majority for Imbalanced Data Streams With Concept Drift

Imbalanced Data Stream Classification using Dynamic Ensemble Selection

Bin.Ini: an Ensemble Approach for Dynamic Data Streams

Hybrid Firefly Optimised Ensemble Classification for Drifting Data Streams with Imbalance

Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance

Drift-Aware Multi-Memory Model for Imbalanced Data Streams

A Systematic Study of Online Class Imbalance Learning with Concept Drift

An Ensemble Classifier Method for Classifying Data Streams with Recurrent Concept Drift

A Hybrid Active-Passive Approach to Imbalanced Nonstationary Data Stream Classification

An Ensemble Classifier Algorithm for Mining Data Streams Based on Concept Drift

Online Boosting Adaptive Learning under Concept Drift for Multistream Classification

Recurring Concept Meta-learning for Evolving Data Streams

A survey on learning from imbalanced data streams: taxonomy, challenges, empirical study, and reproducible experimental framework

Rarity updated ensemble with oversampling: An ensemble approach to classification of imbalanced data streams

Online Active Learning for Drifting Data Streams