The Application of Data Mining Techniques in Epidemiological Survey Data

LI Jiang-ping,BI Yu-xue,YAN Hong
DOI: https://doi.org/10.3969/j.issn.1001-568x.2011.08.004
2011-01-01
Abstract:OBJECTIVE Introduction our experiences of applying three models of data mining application in the rural health project and choose the best model to analysis.METHODS In the Enterprise Miner module of software SAS 9.13,4 238 observations were sampled from database and built by three models.Split the dataset at 70%,15% and 15% rate into training set,testing set and validation set,to fitting,testing and verification the model.Through Root ASE、Misclassification rate、ROC curve and Diagnose chart to choice the best model.RESULTS BP neural network is the best model of this study,it's Root ASE was 0.372,Misclassification rate was 0.257 and ROC curve area was largest in the three models.CONCLUSION Data mining make more choices when we do data analysis,data set according to the characteristics of their own could choice of a suitable model to analysis,and made the results more reliable.
What problem does this paper attempt to address?