Abstract: Many causal relationships are easy to specify in the prediction tasks of ubiquitous computing, such as human activity prediction, mobility prediction, and health prediction. However, most existing methods in these fields fail to take advantage of this prior causal knowledge. They typically make predictions based only on correlations in the data, which hurts prediction performance in real-world scenarios because a distribution shift between training data and testing data generally exists. To fill this gap, we propose a GAN-based Causal Information Learning prediction framework (GCIL), which effectively leverages causal information to improve the prediction performance of existing ubiquitous computing deep learning models. Specifically, the framework addresses a unique challenge: the treatment variable, that is, the intervention that influences the target in a causal relationship, is generally continuous in ubiquitous computing. To handle this, the framework employs a representation learning approach with a GAN-based deep learning model. By projecting all variables except the treatment into a latent space, it effectively minimizes confounding bias and uses the learned latent representation for accurate predictions. In this way, it handles the continuous treatment challenge and, at the same time, can be easily integrated with existing deep learning models to lift their prediction performance in practical scenarios with causal information. Extensive experiments on two large-scale real-world datasets demonstrate its superior performance over multiple state-of-the-art baselines. We also propose an analytical framework, together with extensive experiments, to show empirically that our framework achieves larger performance gains under two conditions: when the distribution differences between the training data and the testing data are more significant, and when the treatment effects are larger.
Overall, this work suggests that learning causal information is a promising way to improve the prediction performance of ubiquitous computing tasks. We release both our dataset and code and call for more research attention in this area.
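The claim that correlation-only predictors suffer under distribution shift can be illustrated with a toy example. The sketch below is not GCIL and uses no GAN; it is a minimal linear simulation, under assumed coefficients, in which a confounder `c` drives both a continuous treatment `t` and the target `y`. All names (`simulate`, `fit_naive`, `fit_adjusted`) and the data-generating process are hypothetical and chosen only for illustration.

```python
import random

def simulate(n, confounded, seed):
    """Draw (c, t, y) rows with y = 2*t + 3*c + noise (assumed toy process).

    When confounded is True, the treatment t depends on the confounder c.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        c = rng.gauss(0.0, 1.0)
        t = (c if confounded else 0.0) + rng.gauss(0.0, 1.0)
        y = 2.0 * t + 3.0 * c + rng.gauss(0.0, 0.1)
        rows.append((c, t, y))
    return rows

def fit_naive(rows):
    """Least-squares slope of y on t alone (no intercept; variables are zero-mean)."""
    stt = sum(t * t for _, t, _ in rows)
    sty = sum(t * y for _, t, y in rows)
    return sty / stt

def fit_adjusted(rows):
    """Least-squares coefficients of y on (t, c) via the 2x2 normal equations."""
    stt = sum(t * t for c, t, y in rows)
    scc = sum(c * c for c, t, y in rows)
    stc = sum(t * c for c, t, y in rows)
    sty = sum(t * y for c, t, y in rows)
    scy = sum(c * y for c, t, y in rows)
    det = stt * scc - stc * stc
    b_t = (sty * scc - scy * stc) / det
    b_c = (scy * stt - sty * stc) / det
    return b_t, b_c

def mse(rows, predict):
    """Mean squared error of predict(c, t) against y."""
    return sum((y - predict(c, t)) ** 2 for c, t, y in rows) / len(rows)

# Training data is confounded; the shifted test set breaks the t-c correlation.
train = simulate(5000, confounded=True, seed=0)
test_same = simulate(5000, confounded=True, seed=1)
test_shifted = simulate(5000, confounded=False, seed=2)

slope = fit_naive(train)
b_t, b_c = fit_adjusted(train)

naive_same = mse(test_same, lambda c, t: slope * t)
naive_shifted = mse(test_shifted, lambda c, t: slope * t)
adjusted_shifted = mse(test_shifted, lambda c, t: b_t * t + b_c * c)

print(f"naive slope on t: {slope:.2f}, adjusted effect of t: {b_t:.2f} (true 2.0)")
print(f"naive MSE in-distribution: {naive_same:.2f}, shifted: {naive_shifted:.2f}")
print(f"adjusted MSE shifted: {adjusted_shifted:.3f}")
```

In this toy setting the correlation-only slope absorbs part of the confounder's influence, so its error grows once the link between `t` and `c` changes at test time, while the model that accounts for `c` recovers the assumed treatment effect. The abstract describes a learned latent representation in place of the explicit adjustment used here.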

On the causality-preservation capabilities of generative modelling

From Identifiable Causal Representations to Controllable Counterfactual Generation: A Survey on Causal Generative Modeling

Principled Knowledge Extrapolation with GANs.

Causal-TGAN: Generating Tabular Data Using Causal Generative Adversarial Networks

Empowering Predictive Modeling by GAN-based Causal Information Learning

On the Opportunity of Causal Deep Generative Models: A Survey and Future Directions

On the Use of Generative Models in Observational Causal Analysis

Emerging Synergies in Causality and Deep Generative Models: A Survey

De-Biasing Generative Models using Counterfactual Methods

Using GPT-4 to guide causal machine learning

Counterfactual Generative Modeling with Variational Causal Inference

Implicit Causal Models for Genome-wide Association Studies

Modeling and Discovering Direct Causes for Predictive Models

Causal generative explainers using counterfactual inference: a case study on the Morpho-MNIST dataset

Causality Learning with Wasserstein Generative Adversarial Networks

Semi-Supervised Learning for Deep Causal Generative Models

Synthesizing Property & Casualty Ratemaking Datasets using Generative Adversarial Networks

Latent generative modeling of long genetic sequences with GANs

Modular Learning of Deep Causal Generative Models for High-dimensional Causal Inference

A Generative Approach for Financial Causality Extraction