Inequality relations for NMR-based polymer homoblock analysis and extended application: Reanalysis of historical data on alginates, chitosans, homogalacturonans, and galactomannans

Xiaohui Xing,Kanglin Xing,Yves S. Y. Hsieh,D. Wade Abbott
DOI: https://doi.org/10.2139/ssrn.4757239
IF: 2.975
2024-06-13
Carbohydrate Research
Abstract:There has been a long-standing bottleneck in the quantitative analysis of the frequencies of homoblock polyads beyond triads using 1 H and 13 C NMR for linear polysaccharides, primarily because monosaccharides within a long homoblock share similar chemical environments due to identical neighboring units, resulting in indistinct NMR peaks. In this study, through rigorous mathematical induction, inequality relations were established that enabled the calculation of frequency ranges of homoblock polyads from historically reported NMR-derived frequency values of diads and/or triads of alginates, chitosans, homogalacturonans, and galactomannans. The calculated homoblock frequency ranges were then applied to evaluate three chain growth statistical models, including the Bernoulli chain, first-order Markov chain, and second-order Markov chain, for predicting homoblock frequencies in these polysaccharides. Furthermore, based on the mathematically derived inequality relations, a novel 2D array was constructed, enabling the graphical visualization of homoblock features in polysaccharides. It was demonstrated, as a proof of concept, that the novel 2D array, along with a 1D code generated from it, could serve as an effective feature engineering tool for polymer classification using machine learning algorithms.
biochemistry & molecular biology,chemistry, organic, applied
What problem does this paper attempt to address?