Kernel induced random survival forests

Fang Yang,Jiheng Wang,Guangzhe Fan
DOI: https://doi.org/10.48550/arXiv.1008.3952
2010-08-24
Abstract:Kernel Induced Random Survival Forests (KIRSF) is a statistical learning algorithm which aims to improve prediction accuracy for survival data. As in Random Survival Forests (RSF), Cumulative Hazard Function is predicted for each individual in the test set. Prediction error is estimated using Harrell's concordance index (C index) [Harrell et al. (1982)]. The C-index can be interpreted as a misclassification probability and does not depend on a single fixed time for evaluation. The C-index also specifically accounts for censoring. By utilizing kernel functions, KIRSF achieves better results than RSF in many situations. In this report, we show how to incorporate kernel functions into RSF. We test the performance of KIRSF and compare our method to RSF. We find that the KIRSF's performance is better than RSF in many occasions.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the prediction accuracy in survival data analysis. Specifically, the author proposes a new method named Kernel Induced Random Survival Forests (KIRSF), aiming to improve the traditional Random Survival Forests (RSF) by introducing kernel functions. KIRSF uses kernel functions to map the original data to a high - dimensional space, so that it can better capture the non - linear relationships in the data and further improve the prediction accuracy. The experimental results in this paper through simulated data and actual data sets (bone marrow transplantation data) show that KIRSF is significantly superior to the traditional RSF method in prediction accuracy.