Lactose prediction in dry milk with hyperspectral imaging: A data analysis competition at the "International Workshop on Spectroscopy and Chemometrics 2024"

Maria Frizzarin,Vicky Caponigro,Katarina Domijan,Arnaud Molle,Timilehin Aderinola,Thach Le Nguyen,Davide Serramazza,Georgiana Ifrim,Agnieszka Konkolewska
DOI: https://doi.org/10.1016/j.chemolab.2024.105279
IF: 4.175
2024-12-02
Chemometrics and Intelligent Laboratory Systems
Abstract:In April 2024, the Vistamilk SFI Research Centre organized the fourth edition of the "International Workshop on Spectroscopy and Chemometrics – Spectroscopy meets modern Statistics". Within this event, a data challenge was organized among workshop participants, focusing on hyperspectral imaging (HSI) of milk samples.Milk is a complex emulsion comprising of fats, water, proteins, and carbohydrates. Due to the widespread prevalence of lactose intolerance, precise lactose quantification in milk samples became necessary for the dairy industry.The dataset provided to the participants contained spectral data extracted from HSI, without the spatial information, obtained from 72 samples with reference laboratory values for lactose concentration [mg/mL]. The winning strategy was built using ROCKET, a convolutional-based method that was originally designed for time series classification, which achieved a Pearson correlation of 0.86 and RMSE of 9.8 on the test set. The present paper describes the approaches and statistical methods adopted by all the participants to analyse the data and develop the lactose prediction models.
automation & control systems,computer science, artificial intelligence,instruments & instrumentation,statistics & probability,mathematics, interdisciplinary applications,chemistry, analytical
What problem does this paper attempt to address?