Abstract:Purpose Partial least squares structural equation modeling (PLS-SEM) has become popular in the information systems (IS) field for modeling structural relationships between latent variables as measured by manifest variables. However, while researchers using PLS-SEM routinely stress the causal-predictive nature of their analyses, the model evaluation assessment relies exclusively on criteria designed to assess the path model's explanatory power. To take full advantage of the purpose of causal prediction in PLS-SEM, it is imperative for researchers to comprehend the efficacy of various quality criteria, such as traditional PLS-SEM criteria, model fit, PLSpredict, cross-validated predictive ability test (CVPAT) and model selection criteria. Design/methodology/approach A systematic review was conducted to understand empirical studies employing the use of the causal prediction criteria available for PLS-SEM in the database of Industrial Management and Data Systems (IMDS) and Management Information Systems Quarterly (MISQ). Furthermore, this study discusses the details of each of the procedures for the causal prediction criteria available for PLS-SEM, as well as how these criteria should be interpreted. While the focus of the paper is on demystifying the role of causal prediction modeling in PLS-SEM, the overarching aim is to compare the performance of different quality criteria and to select the appropriate causal-predictive model from a cohort of competing models in the IS field. Findings The study found that the traditional PLS-SEM criteria (goodness of fit (GoF) by Tenenhaus, R2 and Q2) and model fit have difficulty determining the appropriate causal-predictive model. In contrast, PLSpredict, CVPAT and model selection criteria (i.e. Bayesian information criterion (BIC), BIC weight, Geweke–Meese criterion (GM), GM weight, HQ and HQC) were found to outperform the traditional criteria in determining the appropriate causal-predictive model, because these criteria provided both in-sample and out-of-sample predictions in PLS-SEM. Originality/value This research substantiates the use of the PLSpredict, CVPAT and the model selection criteria (i.e. BIC, BIC weight, GM, GM weight, HQ and HQC). It provides IS researchers and practitioners with the knowledge they need to properly assess, report on and interpret PLS-SEM results when the goal is only causal prediction, thereby contributing to safeguarding the goal of using PLS-SEM in IS studies.

The Challenge of Prediction in Information Systems Research

Predictive Models in Software Engineering: Challenges and Opportunities

Demystifying the role of causal-predictive modeling using partial least squares structural equation modeling in information systems research

Linear Probability Models in Information Systems Research.

Prediction-Oriented Model Selection In Partial Least Squares Path Modeling

PLS-Based Model Selection: the Role of Alternative Explanations in Information Systems Research

Behavioral economics in information systems research: Critical analysis and research strategies

Testing Theories with Big Data : A SuperPower Approach

Prediction and Inference: From Models and Data to Artificial Intelligence

Predictive Model Selection in Partial Least Squares Path Modeling (PLS-PM)

The need for better statistical testing in data-driven energy technology modeling

On Information-Theoretic Measures of Predictive Uncertainty

Explanation Plus Prediction—The Logical Focus of Project Management Research

Practical Problems of Statistical Learning

A Survey on Event Prediction Methods from a Systems Perspective: Bringing Together Disparate Research Areas

Evaluating Predictive Models of Student Success: Closing the Methodological Gap

OR Forum—Tenure Analytics: Models for Predicting Research Impact

Common Mistakes when Applying Computational Intelligence and Machine Learning to Stock Market modelling

The Relative Value of Prediction in Algorithmic Decision Making

Forecasting in social settings: The state of the art

Embracing the Paradigm Shift from Variable-Based to Case-Based Modeling