EUPRO - A reference database on project-based R&D collaboration networks

Thomas Scherngell,Michael Barber,Georg Zahradnik,Anna Wolfmayr,Xheneta Bilalli Shkodra
DOI: https://doi.org/10.1038/s41597-024-03129-y
2024-03-14
Scientific Data
Abstract:The EUPRO database enables the analysis of participation patterns of organisations in and across different European R&D funding initiatives and the investigation of resulting collaborative R&D network structures and dynamics. The perimeter of EUPRO is currently more than 600,000 R&D projects funded by European (EU, transnational or national) research funding organisations, comprising systematic information about contents of the R&D projects, their participating organizations (including organisation type and location), and a number of additional characteristics (e.g. underlying policy instrument and programme). This scientific data descriptor serves as illustrative information source for users, both from science as well as from policy. It discusses the conceptual background and derives respective analytical opportunities for different actual, highly relevant debates in innovation studies and related fields. Moreover, the data collection process is described in a compact manner, as well as how the collected data are harmonized and aggregated into a suitable data model for analytical purposes. Finally, we put forward issues of technical validation, data quality and enrichment, and usage notes on how to access EUPRO.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper primarily introduces the EUPRO database, which is a reference database on European Research and Development (R&D) cooperation networks. EUPRO aims to enable users to analyze the changes in organizational participation patterns in different European R&D funding programs and to study the resulting collaborative R&D network structures and their dynamic changes. Specifically, EUPRO covers information on over 600,000 R&D projects funded by European (including EU, transnational, and national-level) research funding agencies. This information includes the specific content of the projects, participating organizations (including organization types and geographical locations), and other characteristics (such as policy tools and project plans). The paper addresses the following key issues: 1. **Content and scope of the database**: Describes the types of data included in the EUPRO database and how this data is collected, standardized, and integrated into a data model suitable for analysis. 2. **Applications of the database**: Discusses how EUPRO can be used for scientific research and policy-making, particularly in the field of innovation research. 3. **Technical validation and data quality**: Explains how to ensure the quality of the data, including data verification measures in the process of standardizing country codes, geolocation, etc. 4. **Future development directions**: Mentions the possible future development directions of the EUPRO database, including adding new modules and sub-modules to meet the evolving data needs. In summary, the focus of this paper is to introduce the design concept, content composition, data processing flow of the EUPRO database, and its application value in academic research and policy analysis. Through EUPRO, researchers can better understand the formation mechanisms and development trends of R&D cooperation networks across Europe.