A neural network for uncertain data classification.
Wenhua Xu,Zheng Qin,Yang Chang,CC Aggarwal,PS Yu,CC Aggarwal,J Bi,T Zhang,A Bifet,R Kirkby,G Holmes,B Pfahringer,A Bifet,G Holmes,B Pfahringer,R Kirkby,R Gavalda,A Bifet,G Holmes,B Pfahringer,E Frank,P Domingos,G Hulten,J Gama,R Rocha,P Medas,S Garg,RC Jain,L Guang,W Ya-Dong,S Xiao-Hong,G Hulten,L Spencer,P Domingos,S Vijendra,C Liang,Y Zhang,Q Song,MM Masud,J Gao,L Khan,J Han,B Thuraisingham,S Pan,K Wu,Y Zhang,X Li,B Pfahringer,G Holmes,R Kirkby,B Qin,Y Xia,S Prabhakar,Y Tu,B Qin,Y Xia,F Li,M Scholz,R Klinkenberg,WN Street,YS Kim,S Tsang,B Kao,KY Yip,W Ho,SD Lee,T Velmurugan,T Santhanam,H Wang,W Fan,P Yu,J Han,J Ge,Y Xia,CH Nadungodage
2008-01-01
Information Technology Journal
Abstract:During the last decade, classification from data streams is based on deterministic learning algorithms which learn from precise and complete data. However, a multitude of practical applications only supply approximate measurements. Usually, the estimated errors of the measurements are also available and they are valuable supplemental information for the classification process. Therefore, the development of highly efficient algorithms dealing with uncertain examples has emerged as an exciting new direction in stream data mining literature. In this study, an ensemble classification model ECluds is built from data streams having uncertain attribute values. ECluds applies supervised k-means clustering algorithm on uncertain data stream chunks, then extracts sufficient statistics into micro-clusters. An ensemble of micro-clusters performs classification on test examples using nearest neighbor algorithm and majority …