Abstract: In the recently introduced multistream classification setting, two data streams are involved: a source stream and a target stream. The source stream continuously generates labeled data instances from one domain, while the target stream continuously generates unlabeled instances from another domain. Existing approaches assume that the domains of both data streams are identical, which often does not hold, since data streams from different sources may contain distinct features and may even have different numbers of features. Furthermore, obtaining labels for every instance in a data stream is often expensive and time-consuming. It has therefore become important to explore whether the classes of labeled instances from related streams can help predict the classes of unlabeled instances in a different stream, even when the source and target domains have distinct feature spaces and data distributions. Our objective is to predict the class labels of data instances in the target stream using classifiers trained on the source stream. We propose a multistream classification framework that uses data projected into a common latent feature space, into which both the source and target domains are embedded. This framework is also valuable for enterprise system defenders in detecting cross-platform attacks, such as Advanced Persistent Threats (APTs). We perform empirical evaluation and analysis on both real-world and synthetic datasets to validate the effectiveness of the proposed algorithm in comparison with state-of-the-art techniques. Experimental results show that our approach significantly outperforms existing approaches.
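The data flow described in the abstract (two streams with different feature counts, a shared latent space, a classifier trained only on the source) can be illustrated with a minimal sketch. The abstract does not specify how the latent space is learned, so this sketch substitutes independent Gaussian random projections and an incrementally updated nearest-centroid classifier. These are stand-ins chosen for brevity, not the proposed method, and independent projections do not align the two domains. All names (`random_projection`, `CentroidClassifier`, the `benign`/`attack` labels, the feature counts and `k`) are hypothetical.

```python
import random

def random_projection(n_features, k, rng):
    """Build an n_features x k Gaussian projection matrix (stand-in embedding)."""
    scale = 1.0 / k ** 0.5
    return [[rng.gauss(0.0, scale) for _ in range(k)] for _ in range(n_features)]

def project(x, W):
    """Map one instance x (list of floats) into the k-dimensional latent space."""
    return [sum(xi * row[j] for xi, row in zip(x, W)) for j in range(len(W[0]))]

class CentroidClassifier:
    """Nearest-centroid classifier updated one labeled instance at a time."""

    def __init__(self):
        self.sums, self.counts = {}, {}

    def learn_one(self, z, label):
        # Accumulate per-class sums so centroids can be derived on demand.
        s = self.sums.setdefault(label, [0.0] * len(z))
        for j, v in enumerate(z):
            s[j] += v
        self.counts[label] = self.counts.get(label, 0) + 1

    def predict_one(self, z):
        # Return the label whose centroid has the smallest squared distance to z.
        def dist(label):
            n = self.counts[label]
            return sum((v - s / n) ** 2 for v, s in zip(z, self.sums[label]))
        return min(self.counts, key=dist)

rng = random.Random(0)
k = 3                                   # hypothetical latent dimensionality
W_src = random_projection(5, k, rng)    # assumed: source stream has 5 features
W_tgt = random_projection(8, k, rng)    # assumed: target stream has 8 features

clf = CentroidClassifier()
for _ in range(100):                    # labeled source instances (synthetic)
    label = rng.choice(["benign", "attack"])
    mean = 1.0 if label == "attack" else -1.0
    x = [rng.gauss(mean, 1.0) for _ in range(5)]
    clf.learn_one(project(x, W_src), label)

predictions = []
for _ in range(20):                     # unlabeled target instances (synthetic)
    x = [rng.gauss(0.0, 1.0) for _ in range(8)]
    predictions.append(clf.predict_one(project(x, W_tgt)))
```

Because the two projections here are unrelated, the target predictions carry no real signal. The sketch shows only how instances of different dimensionality can reach one classifier; a usable system would need an embedding that is learned jointly from both domains.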

Multistream Classification with Heterogeneous Feature Space

Multistream Classification for Cyber Threat Data with Heterogeneous Feature Space

Heterogeneous Domain Adaptation for Multistream Classification on Cyber Threat Data

Integrating Data-Driven Segmentation, Local Feature Extraction and Fisher Kernel Encoding to Improve Time Series Classification

A Domain Adaptation Approach For Multistream Classification

Data stream classification in dynamic feature space using feature mapping

Classification with Streaming Features: an Emerging-Pattern Mining Approach

Streaming Classification with Emerging New Class by Class Matrix Sketching

Social Stream Classification with Emerging New Labels

Cross-scene Hyperspectral Image Classification Based on DWT and Manifold-Constrained Subspace Learning

High-Dimensional Multi-Label Data Stream Classification with Concept Drifting Detection

HClustream: A Novel Approach for Clustering Evolving Heterogeneous Data Stream

A New Ensemble Method for Multi-label Data Stream Classification in Non-stationary Environment

SACCOS: A Semi-Supervised Framework for Emerging Class Detection and Concept Drift Adaption Over Data Streams

A Feature Weighted Ensemble Classifier on Stream Data

Low-Rank Transfer Learning for Multi-stream Data Classification

Heterogeneous Spectral-Spatial Feature Transfer With Structure Preserved Distribution Alignment for Hyperspectral Image Classification

Fuzzy Mutual Information-Based Multilabel Feature Selection with Label Dependency and Streaming Labels

Streaming Feature Selection for Multilabel Learning Based on Fuzzy Mutual Information

Double-Coupling Learning for Multi-Task Data Stream Classification

Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification