CNN-HOG based hybrid feature mining for classification of coffee bean varieties using image processing

DOI: https://doi.org/10.1007/s11042-024-18952-z
IF: 2.577
2024-04-03
Multimedia Tools and Applications
Abstract:Ethiopia, known as the birthplace of coffee, relies on coffee exports as a major source of foreign currency. This research paper focuses on developing a hybrid feature mining technique to automatically classify Ethiopian coffee beans based on their provenance: Harrar, Jimma, Limu, Sidama, and Wellega, which correspond to their botanical origins. A dataset of coffee bean images is collected from various regions through the Ethiopian Commodity Exchange (ECX) in Addis Ababa. The proposed system incorporates preprocessing phases including image resizing, filtering, contrast enhancement, noise removal, grayscale conversion, and segmentation using a combined thresholding and K-means approach for grayscale and RGB images, respectively. Classification is performed using a radial basis function (RBF) kernel function of support vector machine (SVM). To address the color-feature similarity challenge, the study explores merging color and texture features using the histogram of oriented gradients (HOG) local feature descriptor. Performance evaluation is conducted for HOG feature extraction, CNN feature extraction, and a hybrid feature vector (HOG-CNN) using a multi-class SVM classifier, achieving accuracies of 74.17%, 85.83%, and 97.5%, respectively. The deep-shallow-based feature (CNN-HOG) combination demonstrates the highest accuracy of 97.5% in this study. The findings highlight the effectiveness of the proposed hybrid feature mining approach in automatically classifying Ethiopian coffee bean varieties, with potential applications in quality control and traceability within the coffee industry.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?