Discrimination of maturity of Camellia oleifera fruit on-site based on generative adversarial network and hyperspectral imaging technique

Mengmeng Sun,Hongzhe Jiang,Weidong Yuan,Shouxiang Jin,Hongping Zhou,Yu Zhou,Cong Zhang
DOI: https://doi.org/10.1007/s11694-023-02145-7
2023-09-26
Abstract:As a key factor in determining the optimal harvest period of Camellia oleifera fruit maturity is significant in increasing camellia oil yield. Due to the interference of environmental factors, capturing hyperspectral images of Camellia oleifera fruit on-site often suffers from a low spectral signal-to-noise ratio. In this study, the multi-head attention parallel residual block-cycle generative adversarial network (MAPRB-CycleGAN) model was used to reconstruct on-site spectral data of Camellia oleifera fruit. Meanwhile, the characteristics and quality of the reconstructed spectra were analyzed through subjective visual inspection of the curve effect and the principal component analysis (PCA) algorithm. Then, the modeling effect of the reconstructed on-site spectral data after being reconstructed by wavelet transform, PCA, and kernel principal component analysis was compared. After the conventional pretreatment method in chemometrics, the partial least squares discriminant analysis (PLS-DA) model established with the field original spectra of Camellia oleifera fruit achieved the highest classification accuracy of 93.82%. The maturity classification model of Camellia oleifera fruit established by the PLS-DA method using the reconstructed spectra from this paper achieved 99.54% accuracy, and the classification accuracy of this data set in multiple models was higher than that of other reconstructed data sets. The results indicate that hyperspectral imaging technique combined with MAPRB-CycleGAN can realize high-accuracy maturity classification of Camellia oleifera fruit, and the spectral reconstruction can be exploited as a preprocessing method for on-site spectral data collection, which provides important technical support for obtaining high-quality spectral data of Camellia oleifera fruit in field environments. The randomness and unpredictability of outdoor environmental factors lead to the diversity in data collection, and enhancing the robustness of generative models to cope with environmental influences is the most significant challenge in practical applications. It will also be the focal point of our future research.
food science & technology
What problem does this paper attempt to address?