Automatic Incremental Clustering Using Bat-Grey Wolf Optimizer-Based MapReduce Framework for Effective Management of High-Dimensional Data

Ch. Vidyadhari,N. Sandhya,P. Premchand
DOI: https://doi.org/10.4018/ijaci.2020100105
2020-10-01
International Journal of Ambient Computing and Intelligence
Abstract:In this research paper, an incremental clustering approach-enabled MapReduce framework is implemented that include two phases, mapper and reducer phase. In the mapper phase, there are two processes, pre-processing and feature extraction. Once the input data is pre-processed, the feature extraction is done using wordnet features. Then, the features are fed to the reducer phase, where the features are selected using entropy function. Then, the automatic incremental clustering is done using bat-grey wolf optimizer (BAGWO). BAGWO is the integration of bat algorithm (BA) into grey wolf optimization (GWO) for generating various clusters of text documents. Upon the arrival of the incremental data, the mapping of the new data with respect to the centroids is done to obtain the effective cluster. For mapping, kernel-based deep point distance and for centroid update, fuzzy concept is used. The performance of the proposed framework outperformed the existing techniques using rand coefficient, Jaccard coefficient, and clustering accuracy with maximal values 0.921, 0.920, and 0.95, respectively.
What problem does this paper attempt to address?