The Use of Random Projections for the Analysis of Mass Spectrometry Imaging Data

Andrew D. Palmer, Josephine Bunch, Iain B. Styles
DOI: https://doi.org/10.1007/s13361-014-1024-7
IF: 3.262
2014-12-19
Journal of the American Society for Mass Spectrometry
Abstract: The ‘curse of dimensionality’ imposes fundamental limits on the analysis of the large, information-rich datasets that are produced by mass spectrometry imaging. Additionally, such datasets are often too large to be analyzed as a whole, and so dimensionality reduction is required before further analysis can be performed. We investigate the use of simple random projections for the dimensionality reduction of mass spectrometry imaging data and examine how they enable efficient and fast segmentation using k-means clustering. The method is computationally efficient and can be implemented such that only one spectrum is needed in memory at any time. We use this technique to reveal histologically significant regions within MALDI images of diseased human liver. Segmentation results achieved following a reduction in the dimensionality of the data by more than 99% (without peak picking) showed that histologic changes due to disease can be automatically visualized from molecular images.
chemistry, physical,spectroscopy, analytical,biochemical research methods
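
The workflow summarized in the abstract (random projection of each spectrum, followed by k-means segmentation) can be illustrated with a minimal sketch. This is not the authors' implementation. The Gaussian projection matrix, the farthest-first initialization, the synthetic two-class spectra, and all function names and parameter values are assumptions made for illustration. Spectra are consumed from a generator so that only one full-length spectrum is held at a time, although this sketch keeps the projection matrix itself in memory for simplicity.

```python
import numpy as np

def synthetic_spectra(n_pixels, n_channels, seed=1):
    """Yield hypothetical spectra one at a time: two classes with different peaks."""
    rng = np.random.default_rng(seed)
    for i in range(n_pixels):
        s = rng.random(n_channels) * 0.1      # low-level background noise
        if i % 2 == 0:
            s[100:110] += 10.0                # peak for class A (even pixels)
        else:
            s[1500:1510] += 10.0              # peak for class B (odd pixels)
        yield s

def project_spectra(spectra, n_channels, k, seed=0):
    """Project each spectrum from n_channels down to k dimensions."""
    rng = np.random.default_rng(seed)
    # Assumed choice: dense Gaussian random matrix, scaled so that
    # Euclidean distances are approximately preserved.
    R = rng.standard_normal((k, n_channels)) / np.sqrt(k)
    # Only the current spectrum and its k-dimensional image are needed.
    return np.array([R @ s for s in spectra])

def kmeans(X, n_clusters, n_iter=50):
    """Plain Lloyd's k-means with farthest-first initialization (assumed)."""
    centers = [X[0]]
    for _ in range(1, n_clusters):
        d2 = np.min([((X - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(X[d2.argmax()])
    centers = np.array(centers)
    for _ in range(n_iter):
        # Squared distance from every point to every center.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(n_clusters):
            members = X[labels == j]
            if len(members):
                centers[j] = members.mean(axis=0)
    return labels

n_pixels, n_channels, k = 60, 5000, 50        # 50 / 5000 is a 99% reduction
reduced = project_spectra(synthetic_spectra(n_pixels, n_channels), n_channels, k)
labels = kmeans(reduced, n_clusters=2)
```

In this toy setting the clustering runs entirely on the 50-dimensional projections, and `labels` can be reshaped to the image grid to give a segmentation map. Real data would replace `synthetic_spectra` with a reader that streams spectra from disk.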