Object recognition using cognition based decision tree clustering in multi-level artificial neural network classifier and self similarity as feature criterion

Upendra Kumar
DOI: https://doi.org/10.1007/s11042-024-18691-1
IF: 2.577
2024-03-09
Multimedia Tools and Applications
Abstract:This work showed the capability of handling large number of classes for classification with human cognition inspired methods. A cognition based techniques for both feature extraction, (self-similarity feature, Intensity Level Multi Fractal Dimension (ILMFD)) as well as classification purpose (decision tree clustering based multi-level Artificial Neural Network classifier-MLANN-DTC) were employed to implement facial recognition based object detection system. A DTC based approach reduces the search space time and also provides opportunity for very less amount of classes (a smaller part of the large number of classes) to be handled by the respective classifier for classification. It also mimics fast recognition capability of humans. In this work, two different databases were used for experiment, first one is our own collected facial images from rotation based video clips (117 persons and 40 facial images per person) named as NS database, and other is standard ORL database (40 persons and 10 facial images per person). In pre-processing step, the facial images were segmented to obtain facial part using context window based texture of pixels (CWTP) & back-propagation neural network (BPNN) based model and then a scale and rotation independent ILMFD feature was computed from each segmented image. Further, a combination of K-means and hierarchal clustering was used to build super classes. All classes' data were distributed among these 6 super classes (heuristically chosen) for own NS database and 3 for ORL database as per their similarity based on ILMFD features. Multi-level ANNs models were employed for all super classes and further their classification results were fed into decision clustering based model to obtain fine-tuned results, which showed significant improvement in terms of classification efficiency. This approach believes in center tendency of largest cluster to refer the actual class decision from multiple decisions obtain corresponding to multiple input data of the same class. In this work, the MLANN-DTC based proposed model has produced 89.542 ± 1.167% and 87.098 ± 2.066% classification efficiency (± standard deviation) for single input and for group based decision (decision clustering), 95.042 ± 0.719% and 89 ± 2.549% for NS and ORL database respectively. This improved classification results motivate its application for other object recognition and classification problems. The basic idea of this work also supports better handling of classification which deals with a large number of classes.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?