APPLICATION OF PRINCIPAL COMPONENT ANALYSIS AND MULTIVARIATE LINEAR REGRESSION IN THE ANALYSIS OF THE RELATIONSHIP BETWEEN INCIDENCE OF BACILLARY DYSENTERY AND METEOROLOGICAL FACTORS

廖洪秀,张强,杜长慧,蒋小花,邓长飞,陈鑫
2009-01-01
Abstract:[Objective] To study the relationship between incidence of bacillary and meteorological factors, and investigate the application of principal component analysis and multiple linear regression in the analysis of relationship between incidence of bacillary dysentery and meteorological factors. [Methods] The incidence of dysentery in Chengdu was analyzed by descriptive study from 1999 to 2005. Data of monthly incidence of bacillary dysentery and monthly meteorological data during 1999 to 2005 were collected to establish a database for linear correlation, principal component and multiple linear regression analyses. [Results] Incidence of bacillary dysentery presented obvious seasonal fluctuation, and there were much more cases in summer and autumn (May to October) .There was a positive correlation between the incidence of dysentery and temperature and rainfall respectively (r1 = 0.930, P1﹤0.05; r2 = 0.896, P2﹤0.05), while a negative correlation with fog days (r =-0.585, P﹤0.05). Principal component analysis and multiple linear regression set up an estimated equation for the relationship between incidence of bacillary dysentery and meteorological factors, and showed some meteorological factors with greater influence on incidence of bacillary dysentery including wind speed, temperature, rainfall. [Conclusion] The higher incidence of bacillary dysentery is caused by higher air temperature and higher air humidity. An estimated equation based on principal component analysis and multiple linear regression could predict monthly incidence of bacillary dysentery.
What problem does this paper attempt to address?