Evaluating the impact of tuned pre-trained architectures' feature maps on deep learning model performance for tomato disease detection

Halit Bakır
DOI: https://doi.org/10.1007/s11042-023-17503-2
IF: 2.577
2023-11-14
Multimedia Tools and Applications
Abstract:As global food demands escalate, ensuring optimal crop health has become paramount. Traditional disease detection methods often fall short in terms of speed and accuracy, emphasizing the need for advanced, technology-driven solutions. Thus, this paper explores the impact of fine-tuning pre-trained CNN architectures synchronously with the structure of CNN model, focusing on disease detection in tomato leaves. We suggested utilizing these pre-trained CNN architectures as a feature extraction phase and tuning them alongside the classification phase of the proposed model. We posit that the harmonious tuning of both the feature extraction phase and the classification phase holds the potential to enhance the performance of any deep-learning model. In pursuit of this objective, we extended the concept of hyperparameters to encompass the feature extraction phase, alongside a comprehensive spectrum of hyperparameters that influence deep-learning models. Subsequently, we harnessed the random search algorithm to optimize these hyperparameters and determine the optimal model architecture for enhanced tomato disease detection accuracy. The model refined through the random search algorithm was designated as Xception-CNN. Initially, we trained and evaluated the proposed Xception-CNN model using the tomato leaves dataset. Subsequently, we conducted an experiment by removing the feature extraction phase from the Xception-CNN model and transforming it into an end-to-end scratch-CNN model. This step aimed to both validate the efficacy of our approach and unveil the impact of fine-tuned pre-trained model feature maps on CNN model performance. The outcomes indicated the superior performance of the proposed Xception-CNN model compared to the Scratch-CNN model across all evaluation metrics. Notably, the classification accuracy of the Scratch-CNN model peaked at 76.70%, whereas the Xception-CNN model achieved an impressive accuracy of 99.40%. These findings underscore the significance of meticulous deep-learning model refinement, coupled with the utilization of pre-trained models, within such an optimization and refinement process.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?