An Improved Random Forests Algorithm with Application to Social Ties Inferring of LBS Users

Chun-lai MA, Hong SHAN, Tao MA, Ying-chun SHI
2016-01-01
Abstract: Inferring social ties from the location information of LBS users, which can provide more information for group discovery and community detection, is becoming a new problem in intelligence mining from location big data. Based on the theory of co-occurrences, four categories of features of co-occurrence regions are selected, summarized, and optimized. Moreover, because Random Forests have difficulty handling high-dimensional data with redundant features, the paper proposes an improved Random Forests algorithm based on a stratified sampling strategy over the feature space. The Fisher ratio, which the algorithm uses to measure feature importance, serves as the basis for partitioning the feature subspace under proportional sampling, and the random forest is then built. The improved algorithm effectively avoids the noise that is easily introduced when subspaces are constructed by purely random sampling. The experimental results show that the improved algorithm is more effective at classifying high-dimensional data with redundant features, so it is more suitable for inferring the social ties of LBS users.
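The abstract does not give the exact formulas, so the following is only a minimal sketch of how Fisher-ratio-based stratified sampling of a feature subspace might look. The function names, the equal-size split into strata, and the allocation proportional to stratum size are all assumptions made for illustration, not the authors' published procedure. Each sampled subspace would then be used to train one tree of the forest.

```python
import random
from statistics import mean, pvariance


def fisher_ratio(column, labels):
    """Between-class scatter divided by within-class scatter for one feature.

    Assumed definition: the abstract names the Fisher ratio but not its form.
    """
    overall = mean(column)
    between = 0.0
    within = 0.0
    for c in sorted(set(labels)):
        values = [v for v, l in zip(column, labels) if l == c]
        between += len(values) * (mean(values) - overall) ** 2
        within += len(values) * pvariance(values)
    # A feature with zero within-class scatter separates the classes perfectly.
    return between / within if within > 0 else float("inf")


def stratify_features(scores, n_strata=2):
    """Split feature indices into near-equal strata, highest score first.

    Hypothetical partition rule: rank by score, then cut into n_strata groups.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    size, extra = divmod(len(order), n_strata)
    strata, start = [], 0
    for s in range(n_strata):
        end = start + size + (1 if s < extra else 0)
        strata.append(order[start:end])
        start = end
    return strata


def sample_subspace(strata, subspace_size, rng):
    """Draw features from every stratum in proportion to the stratum's size."""
    total = sum(len(s) for s in strata)
    chosen = []
    for stratum in strata:
        if not stratum:
            continue
        # At least one feature per stratum, so strong features are never missed.
        k = max(1, round(subspace_size * len(stratum) / total))
        chosen.extend(rng.sample(stratum, min(k, len(stratum))))
    return sorted(chosen)
```

Calling `fisher_ratio` once per feature column, passing the scores to `stratify_features`, and then calling `sample_subspace` once per tree with a seeded `random.Random` gives each tree a subspace that always contains some high-scoring features, in contrast to uniform random sampling, which can produce subspaces made up only of redundant ones.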