Application of Near-Infrared Hyperspectral Imaging with Variable Selection Methods to Determine and Visualize Caffeine Content of Coffee Beans

Chu Zhang,Hao Jiang,Fei Liu,Yong He
DOI: https://doi.org/10.1007/s11947-016-1809-8
2017-01-01
Food and Bioprocess Technology
Abstract:Hyperspectral imaging covering the spectral range of 874–1734 nm was used to determine caffeine content of coffee beans. Spectral data of 958.24–1628.89 nm were extracted and preprocessed. Partial least squares regression (PLSR) model on the preprocessed full spectra obtained good performance with coefficient of determination of prediction (R 2 p ) of 0.843 and root mean square error of prediction (RMSEP) of 131.904 μg/g. In addition, 10 variable selection methods were applied to select the best optimal wavelengths. The PLSR models on the different optimal wavelengths obtained satisfactory results. The PLSR model on the wavelengths selected by random frog (RF) performed the best, with R 2 p of 0.878 and RMSEP of 116.327 μg/g. The RF wavelength selection combined with the PLSR model also achieved satisfactory visualization of caffeine content between different coffee beans. The overall results indicated that optimal wavelength selection was an efficient method for spectral data preprocessing, and hyperspectral imaging was illustrated as a potential technique for real-time online determination for caffeine content of coffee beans.
What problem does this paper attempt to address?