SRNI-CAR: A comprehensive dataset for analyzing the Chinese automotive market

Ruixin Ding,Bowei Chen,James M. Wilson,Zhi Yan,Yufei Huang
2023-12-19
Abstract:The automotive industry plays a critical role in the global economy, and particularly important is the expanding Chinese automobile market due to its immense scale and influence. However, existing automotive sector datasets are limited in their coverage, failing to adequately consider the growing demand for more and diverse variables. This paper aims to bridge this data gap by introducing a comprehensive dataset spanning the years from 2016 to 2022, encompassing sales data, online reviews, and a wealth of information related to the Chinese automotive industry. This dataset serves as a valuable resource, significantly expanding the available data. Its impact extends to various dimensions, including improving forecasting accuracy, expanding the scope of business applications, informing policy development and regulation, and advancing academic research within the automotive sector. To illustrate the dataset's potential applications in both business and academic contexts, we present two application examples. Our developed dataset enhances our understanding of the Chinese automotive market and offers a valuable tool for researchers, policymakers, and industry stakeholders worldwide.
General Economics,Artificial Intelligence,Computers and Society,Machine Learning
What problem does this paper attempt to address?
The problem this paper attempts to address is the lack of comprehensive coverage in existing automotive market datasets, which fails to meet the growing demand for more diverse variables. Specifically, existing datasets often lack critical information such as the brand creation date and the model launch date, which limits the understanding of market dynamics. Additionally, there are deficiencies in existing datasets regarding new energy vehicle brands and consumer preference analysis. To solve these issues, the paper introduces a comprehensive dataset named SRNI-CAR, which covers sales data, online reviews, and industry news and information from the Chinese automotive market from 2016 to 2022. This dataset not only integrates industry news, development insights, automotive marketing data, consumer online reviews, and sales information but also introduces some previously unincluded important variables, such as model launch dates and brand creation dates. These improvements enable the dataset to support a wider range of research possibilities, enhance the accuracy and interpretability of analyses, and hold significant commercial value in the automotive industry. Through this dataset, researchers, policymakers, and industry stakeholders can better understand the Chinese automotive market, improve sales forecast accuracy, expand business application scope, formulate relevant policies and regulations, and promote the development of academic research.