Detecting Malignant Patients Via Modified Boosted Tree

CaiLing Dong, YiLong Yin, XiuKun Yang
DOI: https://doi.org/10.1007/s11432-010-3107-9
2010-01-01
Science China Information Sciences
Abstract: As one of the most effective ways to extract useful information from medical databases and to support scientific decision-making in the diagnosis and treatment of diseases, medical data mining has become an increasingly active topic in the last few years. Some intrinsic characteristics of medical databases, such as their huge volume, imbalanced samples, and stringent performance standards, make this mining process particularly challenging. By elaborating on the various challenges in Task 1 of the KDD Cup 2008 competition, this paper analyzes some potential solutions to these problems and presents a modified boosted tree as the final classification model. This model ranked fourth among all the solutions to Task 1. We hope that our analysis and solutions to these challenges will contribute to the development of medical data mining applications.