Ore image classification based on small deep learning model: Evaluation and optimization of model depth, model structure and data size

Yang Liu,Zelin Zhang,Xiang Liu,Lei Wang,Xuhui Xia
DOI: https://doi.org/10.1016/j.mineng.2021.107020
IF: 5.479
2021-10-01
Minerals Engineering
Abstract:<p>The ore image classification technology based on deep learning is an effective way to improve the image sensor-based ore sorting classification capability. However, in practice, the image sensor-based ore sorting technique often has the problem of insufficient data, and has not systematically considered the impact of model structure and dataset size on the modeling efficiency and classification performance of deep learning. Therefore, this paper attempts to explore a more suitable small deep learning model for ore image classification by considering the model depth, model structure, and dataset size. Six Convolutional Neural Networks (CNNs) models are established with different depths based on Alex Net and VGG Net and the model structure is optimized by adding BN layer. Taking the gas-coal image dataset as case study, we systematically explore the influence of model depth, model structure, dataset size on the training process efficiency and classification accuracy. Meanwhile, the operational process of coal image classifiers is analyzed visually through the ways of Channel Visualization maps, Heatmaps, Grad-CAM map, and Guided Backpropagation maps.</p>
engineering, chemical,mining & mineral processing,mineralogy
What problem does this paper attempt to address?
The paper mainly addresses the issues in mineral image classification technology, particularly the problem of insufficient data encountered in the practical application of image sensor-based mineral sorting technology, and the inadequate consideration of the impact of model structure and dataset size on the efficiency of deep learning modeling and classification performance. The core objective of the paper is to explore a smaller deep learning model that is more suitable for mineral image classification. By considering factors such as model depth, model structure, and dataset size, the paper attempts to optimize the model to improve training efficiency and classification accuracy. Specifically, the paper establishes six convolutional neural network (CNN) models of different depths based on AlexNet and VGGNet, and optimizes the model structure by adding Batch Normalization (BN) layers. Additionally, the paper uses a gas image dataset as a case study to systematically explore the impact of model depth, model structure, and dataset size on training process efficiency and classification accuracy. In summary, the paper aims to address the following issues: 1. **The impact of model structure and dataset size on mineral image classification performance**: Study the performance of CNN models of different depths on datasets of varying sizes, and evaluate how these factors affect the training efficiency and classification accuracy of the models. 2. **Optimizing model structure**: Optimize the model structure by introducing methods such as BN layers to improve the training speed and convergence of the model. 3. **Visualization analysis**: Through the analysis of channel visualization maps, heatmaps, Grad-CAM maps, and guided backpropagation maps, visually demonstrate the operation process of the coal image classifier. This research is of great significance for selecting small deep learning classification models suitable for different dataset scales.