Car Type Recognition with Deep Neural Networks

Heikki Huttunen, Fatemeh Shokrollahi Yancheshmeh, Ke Chen
DOI: https://doi.org/10.1109/IVS.2016.7535529
2016-04-22
Abstract: In this paper we study automatic recognition of cars of four types: Bus, Truck, Van and Small car. For this problem we consider two data driven frameworks: a deep neural network and a support vector machine using SIFT features. The accuracy of the methods is validated with a database of over 6500 images, and the resulting prediction accuracy is over 97%. This clearly exceeds the accuracies of earlier studies that use manually engineered feature extraction pipelines.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the automatic identification of four vehicle types (Bus, Truck, Van and Small car) and aims to improve identification accuracy. Specifically, the authors aim to replace traditional high-cost hardware and technologies (such as induction ground loops and laser scanners) with low-cost cameras. In addition, they use data-driven methods (deep neural networks and support vector machines) to avoid complex pre-processing steps and to remain robust under varying environmental conditions (such as insufficient lighting, dirt or changes of view angle).

### Main contributions of the paper:

1. **Simplifying hardware requirements**: Using low-cost cameras for vehicle type classification reduces the cost of hardware installation and maintenance.
2. **Improving classification accuracy**: Deep-learning methods (deep neural networks) achieve a classification accuracy of over 97%, exceeding previous methods built on manually designed feature extraction pipelines.
3. **Reducing pre-processing dependence**: Avoiding complex pre-processing steps makes the system simpler and more robust, so that it works stably in various environments.

### Solutions:

The authors compare two data-driven frameworks:

- **Deep Neural Network (DNN)**: It learns features directly from the original images, without a manually designed feature extraction pipeline.
- **Support Vector Machine (SVM) combined with Scale-Invariant Feature Transform (SIFT)**: It extracts SIFT features and then classifies them with an SVM.

### Experimental results:

- With a single camera, the classification accuracy of the DNN is 98.06%, while that of SIFT + SVM is 97.35%.
- With two cameras at different angles, the classification accuracy of the DNN is 97.75%, while that of SIFT + SVM is 96.19%.
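
To make the size of these gaps concrete, the reported accuracies can be converted into error rates and margins. The following is only an illustrative sketch: the `accuracy` dictionary and the `compare` helper are names introduced here, not something taken from the paper, and the only inputs are the four percentages quoted above.

```python
# Accuracies (%) as quoted in the summary above.
accuracy = {
    "single camera": {"DNN": 98.06, "SIFT + SVM": 97.35},
    "two cameras": {"DNN": 97.75, "SIFT + SVM": 96.19},
}


def compare(dnn_acc, svm_acc):
    """Return (margin in percentage points, relative error reduction in %)."""
    dnn_err = 100.0 - dnn_acc  # error rate of the DNN
    svm_err = 100.0 - svm_acc  # error rate of SIFT + SVM
    margin = dnn_acc - svm_acc
    reduction = 100.0 * (svm_err - dnn_err) / svm_err
    return round(margin, 2), round(reduction, 1)


summary = {name: compare(a["DNN"], a["SIFT + SVM"]) for name, a in accuracy.items()}
for name, (margin, reduction) in summary.items():
    print(f"{name}: +{margin} points, {reduction}% fewer errors")
```

For the figures above this gives margins of 0.71 and 1.56 percentage points, which correspond to roughly 27% and 41% fewer misclassifications for the DNN. Expressing the gap relative to the error rate is a presentation choice made here, since small differences between accuracies near 100% are otherwise easy to underestimate.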

These results show that the DNN is slightly more accurate than SIFT + SVM in both settings, and that the margin widens in the two-camera scenario (1.56 percentage points, against 0.71 with a single camera).

### Formula presentation:

The formulas involved in the paper mainly concern the structure and training of deep neural networks. For example, the output of a convolutional layer can be written as:

\[ \text{Output} = \sigma(W \ast X + b) \]

where:

- \( W \) is the convolution kernel weight,
- \( X \) is the input image,
- \( b \) is the bias term,
- \( \sigma \) is the activation function (such as ReLU),
- \( \ast \) represents the convolution operation.

In addition, during the random search over hyper-parameters, the learning rate \( \eta \) is sampled geometrically. For a continuous quantity such as a learning rate, this is most naturally read as sampling uniformly on a logarithmic scale,

\[ \log \eta \sim \mathcal{U}(\log \eta_{\min}, \log \eta_{\max}) \]

rather than as a draw from the discrete geometric distribution \( \text{Geometric}(p) \), whose support is the positive integers. The bounds \( \eta_{\min} \) and \( \eta_{\max} \) are not given in this summary.

With these methods, the authors reach over 97% accuracy on automatic vehicle type identification and demonstrate the potential of data-driven methods in practical applications.
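
The two formulas above can be illustrated with a small standard-library sketch. It makes several assumptions that do not come from the paper: the function names `conv2d_relu` and `sample_learning_rate` are hypothetical, the toy image, kernel and bias are made up, "geometric" sampling is interpreted as log-uniform sampling, and the range from 1e-5 to 1e-1 is only a placeholder. As in most deep-learning libraries, the kernel is slid over the image without flipping (strictly a cross-correlation).

```python
import math
import random


def conv2d_relu(x, w, b):
    """Valid 2-D convolution of image x with kernel w, plus bias b, then ReLU.

    The kernel is not flipped, matching the usual deep-learning convention.
    """
    kh, kw = len(w), len(w[0])
    out_h = len(x) - kh + 1
    out_w = len(x[0]) - kw + 1
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            s = b
            for u in range(kh):
                for v in range(kw):
                    s += w[u][v] * x[i + u][j + v]
            row.append(max(0.0, s))  # sigma = ReLU
        out.append(row)
    return out


def sample_learning_rate(low, high, rng):
    """Draw eta uniformly on a log scale (assumed reading of 'geometric')."""
    return math.exp(rng.uniform(math.log(low), math.log(high)))


# Toy 3x3 "image" and 2x2 kernel; all values are made up for illustration.
X = [[1.0, 2.0, 0.0],
     [0.0, 1.0, 3.0],
     [4.0, 0.0, 1.0]]
W = [[1.0, 0.0],
     [0.0, -1.0]]
feature_map = conv2d_relu(X, W, b=0.5)

# Hypothetical search range; the summary does not state the actual bounds.
rng = random.Random(0)
etas = [sample_learning_rate(1e-5, 1e-1, rng) for _ in range(5)]
```

Sampling on a log scale gives each decade (for example 1e-5 to 1e-4 and 1e-2 to 1e-1) the same probability, which is the usual reason for preferring it over plain uniform sampling when a hyper-parameter spans several orders of magnitude.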