The Landscape of Data Reuse in Interactive Information Retrieval: Motivations, Sources, and Evaluation of Reusability

Tianji Jiang, Wenqi Li, Jiqun Liu
2024-11-23
Abstract: Sharing and reusing research data can effectively reduce redundant efforts in data collection and curation, especially for small labs and research teams conducting human-centered system research, and enhance the replicability of evaluation experiments. Building a sustainable data reuse process and culture relies on frameworks that encompass policies, standards, roles, and responsibilities, all of which must address the diverse needs of data providers, curators, and reusers. To advance the knowledge and accumulate empirical understandings on data reuse, this study investigated the data reuse practices of experienced researchers from the area of Interactive Information Retrieval (IIR) studies, where data reuse has been strongly advocated but still remains a challenge. To enhance the knowledge on data reuse behavior and reusability assessment strategies within IIR community, we conducted 21 semi-structured in-depth interviews with IIR researchers from varying demographic backgrounds, institutions, and stages of careers on their motivations, experiences, and concerns over data reuse. We uncovered the reasons, strategies of reusability assessments, and challenges faced by data reusers within the field of IIR as they attempt to reuse researcher data in their studies. The empirical finding improves our understanding of researchers' motivations for reusing data, their approaches to discovering reusable research data, as well as their concerns and criteria for assessing data reusability, and also enriches the on-going discussions on evaluating user-generated data and research resources and promoting community-level data reuse culture and standards.
Information Retrieval, Digital Libraries
What problem does this paper attempt to address?
This paper aims to **understand how researchers in the field of Interactive Information Retrieval (IIR) reuse data: their motivations, data discovery methods, and strategies for evaluating data reusability, as well as the challenges and concerns they face when attempting to reuse others' data**. Specifically, the study explores the data reuse practices of IIR researchers through the following four questions:

1. **What are the main intentions/motivations for researchers to reuse data?**
2. **How do IIR researchers discover and obtain reusable research data in their research practices?**
3. **How do IIR researchers evaluate the reusability of research data shared by others?**
4. **What are the main issues that make IIR researchers reluctant to reuse others' data?**

The answers to these questions help reveal the state of data reuse within the IIR community and provide a basis for improving the infrastructure that supports data sharing and reuse. In addition, understanding the experiences and challenges of researchers from different backgrounds can offer useful insights for developing an interdisciplinary data-sharing and reuse culture.

### Research Background

Data reuse refers to using data originally collected for one research purpose to address new research questions or to replicate previous research. Although data reuse has many potential benefits, such as improving research efficiency and enhancing the reproducibility of results, researchers face many challenges in practice, including locating, accessing, and understanding data, as well as ensuring its quality and applicability. In the IIR field in particular, data reuse faces unique challenges because of the diversity of methodologies and the need for interdisciplinary cooperation.

### Method

To answer the above questions, the authors conducted 21 semi-structured in-depth interviews with IIR researchers from different professional backgrounds, institutions, and career stages. These interviews provided first-hand information about data reuse behaviors, motivations, discovery methods, and evaluation strategies. The results not only revealed differences between system-oriented and user-oriented researchers but also yielded suggestions for improving the data reuse infrastructure.

### Conclusion

The analysis of the interview data revealed different patterns of data reuse among IIR researchers. System-oriented researchers use external data more frequently, while user-oriented researchers rely more on data from within their own team or collaboration network. Overall, the study found that researchers are motivated to reuse data by exploring new insights and validating hypotheses, and it identified common challenges they face, such as inconsistent data documentation and ethical issues. Through this research, the authors hope to promote data reuse practices in the IIR field and provide a valuable reference for the broader scientific research community.