A Simulation Study for Propensity Score in Dealing with Collinearity Data

Yang Mei, Xiao Jing, Shen Yi, c H
2013-01-01
Abstract: Objective: To determine whether the propensity score (PS) method yields better parameter estimates than the common logistic regression model when dealing with collinear data, across different sample sizes and degrees of collinearity. Methods: A logistic model was used as the standard model because it was applied to data without collinearity. Monte Carlo simulation was used to compare parameter estimates from PS regression and common logistic regression on collinear data under different sample sizes and degrees of collinearity. Results: (1) Given a 14% positive proportion of the outcome variable and a correlation coefficient (r) of 0.92 between the covariates and the exposure factor, the parameter estimates from PS regression, both the regression coefficient and its standard error, were closer to those of the standard model than the estimates from common logistic regression were. These differences gradually disappeared as the sample size increased. (2) Given sample sizes of 1000 and 500 and a 4% positive proportion of the outcome variable, we estimated the regression coefficient and its standard error from the three models across degrees of collinearity. The trend of the PS regression estimates was parallel to that of the standard model, meaning that the difference between these two models was consistent. By contrast, the regression coefficient and standard error from common logistic regression changed in parallel with those of the other two models only when r was low; the trend changed direction at r = 0.5 (n = 1000) or r = 0.3 (n = 500). Conclusion: Parameters estimated from PS regression were more reliable than those from common logistic regression, especially with small sample sizes and severe collinearity. PS regression could therefore be an excellent method for dealing with collinear data.
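The comparison described in the Methods can be illustrated with a minimal Monte Carlo sketch in Python. This is not the authors' simulation design: the data-generating model, the coefficient values, the use of two covariates, and the function names `sigmoid`, `fit_logistic`, `simulate`, and `compare` are all assumptions made for illustration, and the standard (non-collinear) reference model is omitted. Each replicate fits a common logistic regression of the outcome on the exposure and covariates, and a PS regression of the outcome on the exposure and the estimated propensity score.

```python
import numpy as np


def sigmoid(v):
    """Logistic function, with the argument clipped to avoid overflow."""
    return 1.0 / (1.0 + np.exp(-np.clip(v, -30.0, 30.0)))


def fit_logistic(X, y, n_iter=50, tol=1e-8):
    """Fit a logistic regression by Newton-Raphson.

    Returns (coefficients, standard errors); index 0 is the intercept.
    """
    X = np.column_stack([np.ones(len(y)), X])
    beta = np.zeros(X.shape[1])
    ridge = 1e-8 * np.eye(X.shape[1])  # tiny jitter for numerical stability
    for _ in range(n_iter):
        p = sigmoid(X @ beta)
        hessian = X.T @ (X * (p * (1.0 - p))[:, None]) + ridge
        step = np.linalg.solve(hessian, X.T @ (y - p))
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    p = sigmoid(X @ beta)
    hessian = X.T @ (X * (p * (1.0 - p))[:, None]) + ridge
    return beta, np.sqrt(np.diag(np.linalg.inv(hessian)))


def simulate(n, r, rng, beta_t=0.5, intercept=-2.0):
    """Draw one data set (assumed design, not taken from the paper).

    A latent normal z drives the binary exposure t; each of the two
    covariates has correlation r with z, which induces collinearity
    between the covariates and the exposure.
    """
    z = rng.standard_normal(n)
    noise = rng.standard_normal((n, 2))
    x = r * z[:, None] + np.sqrt(1.0 - r**2) * noise
    t = (z > 0).astype(float)
    lin = intercept + beta_t * t + 0.3 * x[:, 0] + 0.3 * x[:, 1]
    y = (rng.random(n) < sigmoid(lin)).astype(float)
    return t, x, y


def compare(n=500, r=0.9, n_rep=200, seed=0):
    """Average the exposure coefficient and its SE over n_rep replicates."""
    rng = np.random.default_rng(seed)
    out = {"common": [], "ps": []}
    for _ in range(n_rep):
        t, x, y = simulate(n, r, rng)
        # Common logistic regression: outcome on exposure and covariates.
        b, se = fit_logistic(np.column_stack([t, x]), y)
        out["common"].append((b[1], se[1]))
        # PS regression: estimate P(t = 1 | x), then use it as the
        # single adjustment variable alongside the exposure.
        g, _ = fit_logistic(x, t)
        ps = sigmoid(g[0] + x @ g[1:])
        b, se = fit_logistic(np.column_stack([t, ps]), y)
        out["ps"].append((b[1], se[1]))
    return {k: np.mean(v, axis=0) for k, v in out.items()}
```

Calling `compare(n=500, r=0.9)` returns, for each model, the mean exposure coefficient and mean standard error across replicates; repeating the call over a grid of `n` and `r` values would mimic the kind of trend comparison reported in the Results, under the assumptions stated above.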