Twelve-layer deep convolutional neural network with stochastic pooling for tea category classification on GPU platform

Yu-Dong Zhang,Khan Muhammad,Chaosheng Tang
DOI: https://doi.org/10.1007/s11042-018-5765-3
IF: 2.577
2018-02-27
Multimedia Tools and Applications
Abstract:Automatic tea-category identification is an important topic in factories and supermarkets. Traditional methods need to extract features from tea images manually, which may not be optimal for tea images classification. To avoid the time consuming efforts of handcrafted features extraction, this study proposed a new method combining convolutional neural network (CNN) with stochastic pooling. We collected 900 tea images of Oolong, green, and black teas, with 300 images for each category. The data augmentation method was used over the training set. We employed stochastic gradient descent with momentum (SGDM) to train the CNN. The experiments showed that a 12-layer CNN gives a good result. The sensitivities of Oolong, green, and black tea are 99.5%, 97.5%, and 98.0%, respectively. The overall accuracy of all three-tea categories is 98.33%. The stochastic pooling gives better results than maximum pooling and average pooling. The optimal number of convolutional layer for this task is 5. In addition, GPU has a 175× acceleration in training set and a 122× acceleration in test set, compared to CPU platform.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?