Unsupervised Time Series Extraction from Controller Area Network Payloads

Brent J. Stone,Scott Graham,Barry Mullins,Christine Schubert Kabban
DOI: https://doi.org/10.48550/arXiv.1904.03078
2019-04-05
Abstract:This paper introduces a method for unsupervised tokenization of Controller Area Network (CAN) data payloads using bit level transition analysis and a greedy grouping strategy. The primary goal of this proposal is to extract individual time series which have been concatenated together before transmission onto a vehicle's CAN bus. This process is necessary because the documentation for how to properly extract data from a network may not always be available; passenger vehicle CAN configurations are protected as trade secrets. At least one major manufacturer has also been found to deliberately misconfigure their documented extraction methods. Thus, this proposal serves as a critical enabler for robust third-party security auditing and intrusion detection systems which do not rely on manufacturers sharing confidential information.
Cryptography and Security
What problem does this paper attempt to address?