XRZoo: A Large-Scale and Versatile Dataset of Extended Reality (XR) Applications

Shuqing Li,Chenran Zhang,Cuiyun Gao,Michael R. Lyu
2024-12-11
Abstract:The rapid advancement of Extended Reality (XR, encompassing AR, MR, and VR) and spatial computing technologies forms a foundational layer for the emerging Metaverse, enabling innovative applications across healthcare, education, manufacturing, and entertainment. However, research in this area is often limited by the lack of large, representative, and highquality application datasets that can support empirical studies and the development of new approaches benefiting XR software processes. In this paper, we introduce XRZoo, a comprehensive and curated dataset of XR applications designed to bridge this gap. XRZoo contains 12,528 free XR applications, spanning nine app stores, across all XR techniques (i.e., AR, MR, and VR) and use cases, with detailed metadata on key aspects such as application descriptions, application categories, release dates, user review numbers, and hardware specifications, etc. By making XRZoo publicly available, we aim to foster reproducible XR software engineering and security research, enable cross-disciplinary investigations, and also support the development of advanced XR systems by providing examples to developers. Our dataset serves as a valuable resource for researchers and practitioners interested in improving the scalability, usability, and effectiveness of XR applications. XRZoo will be released and actively maintained.
Software Engineering,Artificial Intelligence,Cryptography and Security,Human-Computer Interaction
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the lack of large - scale, representative, and high - quality datasets in current extended reality (XR, including augmented reality AR, mixed reality MR, and virtual reality VR) application research. The absence of such datasets limits researchers' ability to conduct empirical research and develop new methods to improve the XR software development process. Specifically, the paper points out the following problems: 1. **Lack of datasets**: There is no large - scale XR application dataset in the existing research community. This causes researchers to rely only on limited and possibly outdated datasets, or collect small - scale and unrepresentative samples, thus affecting the universality and reliability of research results. 2. **Challenges brought by platform diversity**: The XR ecosystem is scattered across multiple platforms, and each platform has its own unique technology stack and hardware devices, which makes it difficult to obtain comprehensive data. The differences between different platforms also increase the complexity of data collection and integration. 3. **Data quality and integrity**: Due to technical limitations and platform - specific constraints, there are many challenges in ensuring the quality, integrity, and accuracy of the collected data. For example, some platforms lack API support, while others have strict limits on access frequency. 4. **Consistency of cross - platform data analysis**: When aggregating data from multiple sources, the consistency of metadata is also an important issue. The metadata formats and levels of detail provided by different platforms vary, and additional effort is required for data synthesis and normalization. To solve these problems, the authors constructed a large - scale comprehensive XR application dataset named "XRZ OO". This dataset contains 12,528 free XR applications from nine major application stores and covers all types of XR technologies and application scenarios. By providing such a publicly available dataset, the authors hope to promote repeatable XR software engineering and security research, support interdisciplinary investigations, and help developers develop more advanced XR systems.