A Complete Analysis Pipeline for the Processing, Alignment and Quantification of HPLC–UV Wine Chromatograms

Alan Ianeselli,Edoardo Longo,Simone Poggesi,Marco Montali,Emanuele Boselli
DOI: https://doi.org/10.1007/s10337-023-04301-z
2024-01-18
Chromatographia
Abstract:Elucidating the chemistry of wine would help defining its quality, chemical and sensory characteristics and optimise the wine-making processes. High-performance liquid chromatography coupled with UV–Vis spectroscopy (HPLC–UV–Vis) is a common analysis method used to obtain the molecular profile of wine samples. We propose a complete procedure for the analysis of wine chromatograms. Data are pre-processed using standard methods of down-sampling, smoothing and baseline subtraction. Multiple samples are then merged in a three-dimensional tensor, decomposed using parallel factor analysis (PARAFAC2) into three factors: (i) one reduced (rank-one) chromatogram per sample, (ii) an estimate of the samples' spectral UV–Vis profile and (iii) an estimate of the samples' concentrations. If the decomposition is performed on a single peak of the tensor, the second and third factors correspond to the representative wavelength spectrum and to the relative concentrations of the samples, respectively. Otherwise, when multiple peaks are analysed, further processing is required. In the latter case, the decomposed rank-one chromatograms are peak-detected and aligned, clustered and integrated. A table containing the concentration of the peaks at different retention times is obtained. The pipeline proposed in this study is a guideline for a quantitative and reproducible chemical analysis of wine, or other samples, via the HPLC–UV–Vis method.
chemistry, analytical,biochemical research methods
What problem does this paper attempt to address?
The paper aims to address the challenges of processing, alignment, and quantification of chromatograms in high-performance liquid chromatography (HPLC) - ultraviolet-visible spectroscopy (UV) analysis of wine. Existing methods are time-consuming and complex in data preprocessing and analysis, especially for high-dimensional data. The paper presents a complete analysis workflow, which preprocesses the data using standard methods such as down-sampling, smoothing, and baseline correction. Then, it utilizes parallel factor analysis (PARAFAC2) to fuse the data of multiple samples into a three-dimensional tensor, which is decomposed into three factors: reduced chromatograms, estimated UV-visible spectra of the samples, and estimated concentrations of the samples. When analyzing individual peaks, representative wavelength spectra and relative concentrations can be directly obtained. However, when analyzing multiple peaks, further peak detection, alignment, clustering, and integration are required. This workflow simplifies complex data and provides a quantitative and reproducible guide for chemical analysis of wine or other samples, facilitating the understanding of the molecular chemical properties of wine.