Big data and predictive analytics: A sytematic review of applications

Amirhossein Jamarani,Saeid Haddadi,Raheleh Sarvizadeh,Mostafa Haghi Kashani,Mohammad Akbari,Saeed Moradi
DOI: https://doi.org/10.1007/s10462-024-10811-5
IF: 9.588
2024-06-19
Artificial Intelligence Review
Abstract:Big data involves processing vast amounts of data using advanced techniques. Its potential is harnessed for predictive analytics, a sophisticated branch that anticipates unknown future events by discerning patterns observed in historical data. Various techniques obtained from modeling, data mining, statistics, artificial intelligence, and machine learning are employed to analyze available history to extract discriminative patterns for predictors. This study aims to analyze the main research approaches on Big Data Predictive Analytics (BDPA) based on very up-to-date published articles from 2014 to 2023. In this article, we fully concentrate on predictive analytics using big data mining techniques, where we perform a Systematic Literature Review (SLR) by reviewing 109 articles. Based on the application and content of current studies, we introduce taxonomy including seven major categories of industrial, e-commerce, smart healthcare, smart agriculture, smart city, Information and Communications Technologies (ICT), and weather. The benefits and weaknesses of each approach, potentially important changes, and open issues, in addition to future paths, are discussed. The compiled SLR not only extends on BDPA's strengths, open issues, and future works but also detects the need for optimizing the insufficient metrics in big data applications, such as timeliness, accuracy, and scalability, which would enable organizations to apply big data to shift from retrospective analytics to prospective predictive if fulfilled.
computer science, artificial intelligence
What problem does this paper attempt to address?
The main aim of this paper is to systematically review and analyze the research progress and applications in the field of Big Data Predictive Analytics (BDPA). Specifically, the authors' objectives include: 1. **Identification and Classification**: Identify and classify current research methods on BDPA and systematically compare these studies. 2. **Evaluation Metrics and Methods**: Explore the evaluation metrics, evaluation methods, as well as tools and environments used in BDPA. 3. **Challenges and Future Trends**: Identify the open challenges faced by BDPA and future development trends. To achieve the above objectives, the authors reviewed 109 relevant papers from 2014 to 2023 and proposed a technical and comprehensive classification system, dividing BDPA applications into seven main categories: industry, e-commerce, smart healthcare, smart agriculture, smart cities, information and communication technology (ICT), and weather. Additionally, the paper discusses the advantages, disadvantages, potential significant changes, unresolved issues, and future directions of each method. In summary, the main purpose of this paper is to fill the gaps in existing research through a systematic literature review, provide researchers with a framework for comprehensively understanding different BDPA methods, and offer guidance to organizations on how to transition from retrospective analysis to forward-looking predictions using big data.