From the evolution of public data ecosystems to the evolving horizons of the forward-looking intelligent public data ecosystem empowered by emerging technologies

Anastasija Nikiforova,Martin Lnenicka,Petar Milić,Mariusz Luterek,Manuel Pedro Rodríguez Bolívar
2024-05-22
Abstract:Public data ecosystems (PDEs) represent complex socio-technical systems crucial for optimizing data use in the public sector and outside it. Recognizing their multifaceted nature, previous research pro-posed a six-generation Evolutionary Model of Public Data Ecosystems (EMPDE). Designed as a result of a systematic literature review on the topic spanning three decade, this model, while theoretically robust, necessitates empirical validation to enhance its practical applicability. This study addresses this gap by validating the theoretical model through a real-life examination in five European countries - Latvia, Serbia, Czech Republic, Spain, and Poland. This empirical validation provides insights into PDEs dynamics and variations of implementations across contexts, particularly focusing on the 6th generation of forward-looking PDE generation named "Intelligent Public Data Generation" that represents a paradigm shift driven by emerging technologies such as cloud computing, Artificial Intelligence, Natural Language Processing tools, Generative AI, and Large Language Models (LLM) with potential to contribute to both automation and augmentation of business processes within these ecosystems. By transcending their traditional status as a mere component, evolving into both an actor and a stakeholder simultaneously, these technologies catalyze innovation and progress, enhancing PDE management strategies to align with societal, regulatory, and technical imperatives in the digital era.
Computers and Society,Artificial Intelligence,Emerging Technologies,Human-Computer Interaction,Information Retrieval
What problem does this paper attempt to address?
### The Problem the Paper Attempts to Solve The paper aims to validate a theoretical model—the Evolutionary Model of Public Data Ecosystems (EMPDE)—and enhance its practical applicability through case studies in five European countries (Latvia, Serbia, Czech Republic, Spain, and Poland). Specifically, the paper empirically examines six generations of PDE and their related elements, including components, relationships, stakeholders, roles, data types, processes and activities, and different stages of the data lifecycle. The focus is on the sixth generation of PDE, namely "Intelligent Public Data Generation," which is driven by emerging technologies such as cloud computing, artificial intelligence, natural language processing tools, generative AI, and large language models, representing a paradigm shift in PDE. ### Main Objectives 1. **Validate the Theoretical Model**: Empirically validate the various generations and characteristics of the EMPDE model. 2. **Explore PDE Dynamics**: Analyze the dynamic changes and implementation differences of PDE in different countries. 3. **Identify Key Characteristics**: Determine the formative characteristics of different evolutionary generations of PDE, especially the key elements of the sixth generation PDE. 4. **Promote Innovation and Progress**: Explore how emerging technologies can catalyze innovation and progress in PDE management strategies, making them better suited to the technological, social, and regulatory demands of the digital age. ### Research Methods - **Expert Assessment Questionnaire**: Conduct expert assessments in the selected five countries to collect data on the existence, importance, time span, and influencing factors of each generation of PDE. - **Literature Review**: Design the questionnaire and assessment criteria based on a systematic literature review. - **Data Analysis**: Validate and revise the EMPDE model through quantitative and qualitative analysis of the collected data. ### Results - **Existence and Importance of Generations**: All six generations exist in the sample countries, but some generations are missing in individual countries (e.g., Serbia and Spain). - **Time Span**: The start and end times of each generation vary across countries, differing from the timelines in the literature. - **Key Characteristics**: Most characteristics are rated as "important" or "very important" across all PDE generations, with the role of stakeholders becoming particularly significant in the fourth generation PDE. ### Conclusion Through empirical research, the paper validates and revises the EMPDE model, providing an in-depth understanding of the evolutionary trajectory of PDE. In particular, the characteristics of the sixth generation PDE and the impact of emerging technologies are discussed in detail, offering valuable insights for future research and practice.