Advancing Organizational Science Through Synthetic Data: A Path to Enhanced Data Sharing and Collaboration

Pengda Wang,Andrew C. Loignon,Sirish Shrestha,George C. Banks,Frederick L. Oswald
DOI: https://doi.org/10.1007/s10869-024-09997-w
IF: 6.604
2024-12-07
Journal of Business and Psychology
Abstract:The importance of data sharing in organizational science is well-acknowledged, yet the field faces hurdles that prevent this, including concerns around privacy, proprietary information, and data integrity. We propose that synthetic data generated using machine learning (ML) could offer one promising solution to surmount at least some of these hurdles. Although this technology has been widely researched in the field of computer science, most organizational scientists are not familiar with it. To address the lack of available information for organizational scientists, we propose a systematic framework for the generation and evaluation of synthetic data. This framework is designed to guide researchers and practitioners through the intricacies of applying ML technologies to create robust, privacy-preserving synthetic data. Additionally, we present two empirical demonstrations using the ML method of generative adversarial networks (GANs) to illustrate the practical application and potential of synthetic data in organizational science. Through this exploration, we aim to furnish the community with a foundational understanding of synthetic data generation and encourage further investigation and adoption of these methodologies. By doing so, we hope to foster scientific advancement by enhancing data-sharing initiatives within the field.
psychology, applied,business
What problem does this paper attempt to address?