A literature survey on various aspect of class imbalance problem in data mining

Shivani Goswami, Anil Kumar Singh
DOI: https://doi.org/10.1007/s11042-024-18244-6
IF: 2.577
2024-02-05
Multimedia Tools and Applications
Abstract: Data has become much more widely available in recent years. Learning classifiers from imbalanced data is a crucial issue that arises frequently in classification problems. In such cases, the majority of the instances belong to one class while far fewer belong to the other class, which is typically the more significant class for prediction or detection. Because a classifier extracts its knowledge from the data during learning, a skewed ratio between classes degrades its performance. The imbalanced data problem is well known in numerous application areas and has recently become an open research challenge in data mining and learning algorithms. Since typical classifiers aim for high accuracy over the complete range of examples, machine learning techniques are frequently overwhelmed by the majority class and neglect the minority class. This survey first reviewed academic work specifically aimed at the class imbalance problem. It then analyzed numerous solutions at four levels of the learning stages. The purpose of the survey is to present an overview of the class imbalance problem, including its issues, its solutions, and their disadvantages. The survey also covered adaptive approaches. It concluded with suggestions for future investigation, research problems, and developments in the field.
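The abstract's point that accuracy-driven learning can neglect the minority class can be illustrated with a minimal sketch. The 95/5 class split and the always-predict-majority "classifier" below are hypothetical toy assumptions chosen for illustration; they are not taken from the paper.

```python
# Hypothetical toy data (an assumption, not from the paper):
# 95 majority-class labels (0) and 5 minority-class labels (1).
y_true = [0] * 95 + [1] * 5

# A trivial "classifier" that always predicts the majority class.
y_pred = [0] * len(y_true)


def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label matches the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def recall(y_true, y_pred, positive=1):
    """Fraction of actual `positive` examples that were predicted as `positive`."""
    true_pos = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    actual_pos = sum(t == positive for t in y_true)
    return true_pos / actual_pos if actual_pos else 0.0


acc = accuracy(y_true, y_pred)
minority_recall = recall(y_true, y_pred)
print(f"accuracy = {acc:.2f}, minority recall = {minority_recall:.2f}")
# prints: accuracy = 0.95, minority recall = 0.00
```

Under these assumptions the model looks strong on overall accuracy while never detecting a single minority instance, which is why accuracy alone is a poor guide on imbalanced data.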
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering