An Examination of On-Line Machine Learning Approaches for Pseudo-Random Generated Data

Jia Zhu, Chuanhua Xu, Zhixu Li, Gabriel Fung, Xueqin Lin, Jin Huang, Changqin Huang
DOI: https://doi.org/10.1007/s10586-016-0586-5
2016-01-01
Cluster Computing
Abstract: A pseudo-random generator is an algorithm that generates a sequence of objects that is determined by a truly random seed but is not itself truly random. Such generators are widely used in applications such as cryptography and simulation. In this article, we examine currently popular machine learning algorithms combined with various on-line algorithms on pseudo-random generated data, in order to find out which machine learning approach is most suitable for on-line prediction on this kind of data. To further improve prediction performance, we propose a novel sample-weighted algorithm that takes the generalization error in each iteration into account. We perform an intensive evaluation on real Baccarat data generated by casino machines and on random numbers generated by a popular Java program, two typical examples of pseudo-random generated data. The experimental results show that support vector machines and k-nearest neighbors perform better than the other approaches on the evaluation data sets, both with and without the sample-weighted algorithm.
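The property the abstract relies on, that a pseudo-random sequence is fully determined by its seed, can be illustrated with a minimal Python sketch. It uses Python's standard `random` module as a stand-in; it is not the Java program or the casino data used in the paper, and the helper name `generate` and the seed values are hypothetical choices made only for this illustration.

```python
import random


def generate(seed, n):
    """Return n pseudo-random digits (0-9) from a generator seeded with `seed`.

    Illustrative only: Python's Mersenne Twister stands in for whatever
    generator produced the paper's data.
    """
    rng = random.Random(seed)  # independent generator, fixed starting state
    return [rng.randint(0, 9) for _ in range(n)]


# The same seed reproduces the same sequence, so the output is
# deterministic even though it looks random.
first = generate(42, 10)
second = generate(42, 10)
same_seed_matches = first == second

# A different seed starts the generator from a different state.
other = generate(43, 10)
```

Because the sequence is a deterministic function of the seed, it is at least plausible that a learner could pick up structure in it, which is the premise behind evaluating prediction algorithms on such data.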