NMR Based Metabonomic Data Preprocessing

WEN Jin-bo,YANG Shu-yu,XIAO Xian,DONG Ji-yang,LI Xue-jun,CHEN Zhong
DOI: https://doi.org/10.3321/j.issn:0438-0479.2007.06.010
2007-01-01
Abstract:Normalization is one of the most important steps of metabonomic data preprocessing.In this study,on one hand,we compared the effects of three kinds of normalization methods to the pattern recognition results in the data preprocessing,on the other hand,we evaluated evolutionary variable selection methods in improving the quality of the data clustering.Three kinds of normalization methods,i.e.Inf-Norm,1-Norm and 2-Norm,were tested on the metabonomics data sets composed of normal and diabetes I rats' urine NMR spectra data.They were found to greatly affect the outcome of the data analysis.1-Norm method performed better than the other two methods.Besides,parameter R was defined to evaluate the quality of PC scoring plot,and introduced into the fitness function of genetic algorithm(GA).The use of GA for variable selection was found to improve the data clustering quality.After GA,parts of the variables were discarding that was better to identify and recognize characteristic metabolites.
What problem does this paper attempt to address?