Feature selection and classification of leukocytes using random forest

Mukesh Saraswat, K. V. Arya
DOI: https://doi.org/10.1007/s11517-014-1200-8
2014-10-05
Abstract: In automatic segmentation of leukocytes from the complex morphological background of tissue section images, a vast number of artifacts and noise objects are also extracted, generating a large amount of multivariate data. This multivariate data degrades the ability of a classifier to discriminate between leukocytes and artifacts/noise. Selecting the prominent features reduces computational complexity and improves classifier performance compared with using the full high-dimensional feature space. Therefore, this paper introduces a novel Gini importance-based binary random forest feature selection method. Moreover, the random forest classifier is used to classify the extracted objects into artifacts, mononuclear cells, and polymorphonuclear cells. The experimental results establish that the proposed method effectively eliminates irrelevant features while maintaining high classification accuracy compared with other feature reduction methods.
engineering, biomedical; computer science, interdisciplinary applications; mathematical & computational biology; medical informatics
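
The abstract names Gini importance as the ranking criterion but does not spell out the procedure, so the following is only a minimal sketch of the underlying quantity, not the authors' method. It scores each feature by the largest Gini impurity decrease a single threshold split can achieve; a random forest accumulates such decreases over many splits and trees to obtain Gini importance. The feature names (`area`, `texture_noise`), the toy values, and the two-class labels are hypothetical and chosen purely for illustration.

```python
from collections import Counter


def gini(labels):
    """Gini impurity 1 - sum(p_k^2) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def best_gini_decrease(values, labels):
    """Largest impurity decrease from one split of the form value <= t."""
    n = len(labels)
    parent = gini(labels)
    best = 0.0
    # Every distinct value except the largest is a candidate threshold.
    for t in sorted(set(values))[:-1]:
        left = [y for v, y in zip(values, labels) if v <= t]
        right = [y for v, y in zip(values, labels) if v > t]
        child = (len(left) * gini(left) + len(right) * gini(right)) / n
        best = max(best, parent - child)
    return best


def rank_features(X, y, names):
    """Return (name, score) pairs sorted from most to least informative."""
    scores = {
        name: best_gini_decrease([row[j] for row in X], y)
        for j, name in enumerate(names)
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: one row per extracted object.
feature_names = ["area", "texture_noise"]
X = [[1.0, 5.0], [2.0, 1.0], [3.0, 4.0], [4.0, 2.0]]
y = ["artifact", "artifact", "leukocyte", "leukocyte"]

ranked = rank_features(X, y, feature_names)
for name, score in ranked:
    print(f"{name}: {score:.3f}")
```

In this toy set, `area` separates the two classes perfectly and is ranked first, while `texture_noise` yields only a small decrease and would be the first candidate for removal. A full random forest would average these decreases over bootstrapped trees with random feature subsets rather than rely on one split per feature.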