Abstract:In this paper, we find that existing online forecasting methods have the following issues: 1) They do not consider the update frequency of streaming data and directly use labels (future signals) to update the model, leading to information leakage. 2) Eliminating information leakage can exacerbate concept drift and online parameter updates can damage prediction accuracy. 3) Leaving out a validation set cuts off the model's continued learning. 4) Existing GPU devices cannot support online learning of large-scale streaming data. To address the above issues, we propose a novel online learning framework, Act-Now, to improve the online prediction on large-scale streaming data. Firstly, we introduce a Random Subgraph Sampling (RSS) algorithm designed to enable efficient model training. Then, we design a Fast Stream Buffer (FSB) and a Slow Stream Buffer (SSB) to update the model online. FSB updates the model immediately with the consistent pseudo- and partial labels to avoid information leakage. SSB updates the model in parallel using complete labels from earlier times. Further, to address concept drift, we propose a Label Decomposition model (Lade) with statistical and normalization flows. Lade forecasts both the statistical variations and the normalized future values of the data, integrating them through a combiner to produce the final predictions. Finally, we propose to perform online updates on the validation set to ensure the consistency of model learning on streaming data. Extensive experiments demonstrate that the proposed Act-Now framework performs well on large-scale streaming data, with an average 28.4% and 19.5% performance improvement, respectively. Experiments can be reproduced via <a class="link-external link-https" href="https://github.com/Anoise/Act-Now" rel="external noopener nofollow">this https URL</a>.

Active Learning for Streaming Networked Data

Clustering-based Active Learning Classification towards Data Stream

Online Active Learning for Drifting Data Streams

Streaming Active Learning with Deep Neural Networks

A Framework of Online Learning with Imbalanced Streaming Data.

An Adaptive Learning Approach for Noisy Data Streams

Active Learning for Networked Data Based on Non-Progressive Diffusion Model

Learning with Asynchronous Labels

Graph-based Reinforcement Learning for Active Learning in Real Time: An Application in Modeling River Networks

An Online Active Broad Learning Approach for Real-Time Safety Assessment of Dynamic Systems in Nonstationary Environments

Distributed Active Learning.

Mining Noisy Data Streams Via A Discriminative Model

Effective Active Learning Method for Spiking Neural Networks.

Adaptive Learning for Weakly Labeled Streams

Act Now: A Novel Online Forecasting Framework for Large-Scale Streaming Data

A reliable adaptive prototype-based learning for evolving data streams with limited labels

Neural Active Learning Beyond Bandits

STREAMLINE: Streaming Active Learning for Realistic Multi-Distributional Settings

Toward Mining Capricious Data Streams: A Generative Approach

Streaming Label Learning for Modeling Labels on the Fly.

An Ensemble Classifier Algorithm for Mining Data Streams Based on Concept Drift