EVBattery: A Large-Scale Electric Vehicle Dataset for Battery Health and Capacity Estimation

Haowei He, Jingzhao Zhang, Yanan Wang, Benben Jiang, Shaobo Huang, Chen Wang, Yang Zhang, Gengang Xiong, Xuebing Han, Dongxu Guo, Guannan He, Minggao Ouyang
2023-11-02
Abstract: Electric vehicles (EVs) play an important role in reducing carbon emissions. As EV adoption accelerates, safety issues caused by EV batteries have become an important research topic. In order to benchmark and develop data-driven methods for this task, we introduce a large and comprehensive dataset of EV batteries. Our dataset includes charging records collected from hundreds of EVs from three manufacturers over several years. Our dataset is the first large-scale public dataset on real-world battery data, as existing datasets either include only a few vehicles or are collected in a lab environment. Meanwhile, our dataset features two types of labels, corresponding to two key tasks: battery health estimation and battery capacity estimation. In addition to demonstrating how existing deep learning algorithms can be applied to these tasks, we further develop an algorithm that exploits the data structure of battery systems. Our algorithm achieves better results and shows that a customized method can improve model performance. We hope that this public dataset provides valuable resources for researchers, policymakers, and industry professionals to better understand the dynamics of EV battery aging and support the transition toward a sustainable transportation system.
Machine Learning, Systems and Control
What problem does this paper attempt to address?
The paper addresses battery health estimation and battery capacity estimation for electric vehicles (EVs), with a focus on large-scale, real-world data. As EVs become more widespread, battery safety has become a significant research topic. To support data-driven approaches, the authors introduce EVBattery, a large and comprehensive EV battery dataset. It contains several years of charging records from hundreds of EVs from three manufacturers, and the authors describe it as the first large-scale public dataset of real-world battery data. The dataset includes time-series signals such as charging voltage, current, temperature, and State of Charge (SOC), together with two types of labels, battery health and battery capacity, which correspond to the two key tasks.
The authors demonstrate how existing deep learning algorithms can be applied to these tasks and also develop DyAD, an algorithm that exploits the data structure of battery systems. DyAD achieves better results on the battery health estimation task, which indicates that a method tailored to the structure of the data can improve model performance. The dataset is also intended as a resource for researchers, policymakers, and industry professionals who want to understand the dynamics of EV battery aging and support the transition to a sustainable transportation system.
In summary, the paper makes three main contributions: it provides a large-scale, real-world EV battery dataset for research on battery health and capacity estimation; it benchmarks machine learning and deep learning algorithms on these two tasks; and it designs the DyAD algorithm, which achieves the best results on the battery health task and highlights the value of customized algorithms. Together, these contributions advance research on EV battery management systems, especially in anomaly detection and capacity estimation.
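To make the capacity estimation task concrete, the following is a minimal sketch of Coulomb counting, a standard way to estimate capacity from the kind of charging signals listed above (current and SOC over time). It is not taken from the paper and is not the authors' labeling procedure or the DyAD algorithm. The function name `estimate_capacity_ah`, the units (seconds, amperes, SOC in percent), and the convention that charging current is positive are all assumptions made for illustration.

```python
from typing import Sequence


def estimate_capacity_ah(
    time_s: Sequence[float],
    current_a: Sequence[float],
    soc_percent: Sequence[float],
) -> float:
    """Estimate battery capacity (Ah) from one charging segment.

    Hypothetical helper: integrates the charging current over time
    (trapezoidal rule) and divides by the fraction of SOC gained.
    Assumes charging current is positive and SOC is given in percent.
    """
    if not (len(time_s) == len(current_a) == len(soc_percent)):
        raise ValueError("all input sequences must have the same length")
    if len(time_s) < 2:
        raise ValueError("at least two samples are required")

    # Accumulate charge in ampere-seconds using the trapezoidal rule.
    charge_as = 0.0
    for i in range(1, len(time_s)):
        dt = time_s[i] - time_s[i - 1]
        charge_as += 0.5 * (current_a[i] + current_a[i - 1]) * dt

    # Fraction of full capacity that was charged in this segment.
    delta_soc = (soc_percent[-1] - soc_percent[0]) / 100.0
    if delta_soc <= 0:
        raise ValueError("SOC must increase over a charging segment")

    # Convert ampere-seconds to ampere-hours, then scale to 100% SOC.
    return charge_as / 3600.0 / delta_soc


# Example: a constant 50 A charge for one hour that raises SOC from 20% to 70%.
capacity = estimate_capacity_ah(
    time_s=[0.0, 1800.0, 3600.0],
    current_a=[50.0, 50.0, 50.0],
    soc_percent=[20.0, 45.0, 70.0],
)
print(capacity)  # 100.0
```

In practice the estimate is sensitive to SOC reporting error and to short charging segments, which is one reason learned models are benchmarked on this task rather than relying on a single formula.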