Abstract: Data quality is central to machine learning, and in dynamic data streams it is threatened by concept drift. Addressing this challenge is vital to ensuring the quality and accuracy of predictive models. We propose an incremental decision tree algorithm for learning regression trees and model trees from time-varying data streams. The algorithm is designed to operate at high speed and to accommodate data arriving at any scale, including potentially unbounded streams. Its components include a probabilistically defined sampling strategy that supports the learning process and an automatic method for handling non-stationary data distributions. The primary innovation, however, lies in how concept drift is detected. Rather than reacting only to an increase in overall error, we monitor the quality of individual subtrees by tracking the evolution of their errors. This allows changes in the objective function to be detected promptly, leading to timely adaptations of the model structure. Through extensive experimentation, we evaluate the proposed algorithm in terms of prediction accuracy, model size, and change detection capability. The proposed algorithm offers a competitive alternative to existing stream classifiers.
It shows superior performance in terms of precision, recall, F-measure, and scalability, which indicates its potential to support decision-making across domains by adapting quickly to changing data patterns while maintaining high accuracy. Its incremental learning of decision rules, together with an adaptive extension for handling concept drift, holds promise for real-world applications where accurate and timely insights are paramount. Overall, the algorithm's robustness, adaptability, and efficiency make it a valuable tool for stream data classification and decision support systems.
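
The idea of tracking the error evolution of individual subtrees can be illustrated with a small sketch. The abstract does not name the change detector it uses, so this example assumes a Page-Hinkley test as a stand-in. The class names `PageHinkley` and `SubtreeMonitor`, the node identifiers, and the values of `delta` and `threshold` are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class PageHinkley:
    """Page-Hinkley test for an increase in the mean of a stream of values.

    Assumption: the abstract does not say which detector is used; this is
    a common choice for monitoring errors in streams.
    """
    delta: float = 0.005     # tolerated magnitude of change (illustrative)
    threshold: float = 5.0   # alarm threshold, often written lambda (illustrative)
    n: int = 0
    mean: float = 0.0
    cum: float = 0.0         # cumulative deviation from the running mean
    cum_min: float = 0.0     # smallest cumulative deviation seen so far

    def update(self, x: float) -> bool:
        """Add one observation; return True if a change is signalled."""
        self.n += 1
        self.mean += (x - self.mean) / self.n
        self.cum += x - self.mean - self.delta
        self.cum_min = min(self.cum_min, self.cum)
        return self.cum - self.cum_min > self.threshold


@dataclass
class SubtreeMonitor:
    """Keeps one detector per tree node (hypothetical helper)."""
    detectors: dict = field(default_factory=dict)

    def observe(self, path, abs_error):
        """Feed the absolute error of one prediction to every node on the
        root-to-leaf path and return the nodes whose detector alarms."""
        alarms = []
        for node_id in path:
            detector = self.detectors.setdefault(node_id, PageHinkley())
            if detector.update(abs_error):
                alarms.append(node_id)
        return alarms


if __name__ == "__main__":
    monitor = SubtreeMonitor()
    # Stable phase: both subtrees predict with a small error.
    for _ in range(200):
        monitor.observe(["root", "left"], 0.1)
        monitor.observe(["root", "right"], 0.1)
    # Drift phase: only examples routed to the left subtree get worse.
    for step in range(10):
        alarmed = monitor.observe(["root", "left"], 2.0)
        monitor.observe(["root", "right"], 0.1)
        if alarmed:
            print(f"step {step}: change signalled at {alarmed}")
            break
```

In this sketch, each node sees only the errors of examples routed through it, so an alarm points to the region of the tree where the error has risen. The detector here is never reset after an alarm, and how the tree is adapted afterwards is not covered by the abstract, so that step is left out.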

Fast Unsupervised Online Drift Detection Using Incremental Kolmogorov-Smirnov Test

On the Reliable Detection of Concept Drift from Streaming Unlabeled Data

A comprehensive analysis of concept drift locality in data streams

A novel framework for concept drift detection using autoencoders for classification problems in data streams

Concept Drift Detection and Adaptation with Weak Supervision on Streaming Unlabeled Data

Online Semi-Supervised Concept Drift Detection with Density Estimation

Unsupervised Concept Drift Detection from Deep Learning Representations in Real-time

Handling Adversarial Concept Drift in Streaming Data

No Free Lunch Theorem for concept drift detection in streaming data classification: A review

Concept Drift Detection in Data Stream Mining : A literature review

Diagnosing Concept Drift in Streaming Data

Online Drift Detection with Maximum Concept Discrepancy

A Neighbor-Searching Discrepancy-based Drift Detection Scheme for Learning Evolving Data

Mitigating concept drift in data streams: an incremental decision tree approach

A Concept Drift Detection Algorithm for Power Data Stream

An Approach For Concept Drift Detection in a Graph Stream Using Discriminative Subgraphs

Concept Drift Detection Based on Typicality and Eccentricity

Online Active Learning for Drifting Data Streams

Temporal Attention for Few-Shot Concept Drift Detection in Streaming Data

Incremental Learning with Concept Drift Detection and Prototype-based Embeddings for Graph Stream Classification

Concept Drift Detection Delay Index