A Cross-cultural Comparison of the Reliability Generalization Studies on the Eysenck Personality Questionnaire and Its Implications
Jiao Can,Zhang Minqiang,Zhang Jieting,Wu Li,Zhang Wenyi
DOI: https://doi.org/10.16719/j.cnki.1671-6981.2011.06.038
2011-01-01
Abstract:EPQ is widely used all around the world,and there is no exception in China.In China,five revised versions of EPQ are available, and all popularly used.Caruso(2001) has conducted a reliability generalization analysis of the EPQ used in other countries and has gained some significant results.However,no such analysis has been made so far in the Chinese context. This paper attempted to study the reliability generalization on the EPQ in China.The analysis was done from the perspectives of descriptive statistics and hierarchical multiple regression with data from seven Chinese major psychology journals from 1998 to 2008. What's more,our findings were compared with those of analogous studies on EPQ in other countries,conducted by Caruso et al. With the results of the reliability generalization on EPQ at home and abroad compared,the following similarities are found:1) the sampling characteristics do affect the reliability coefficients;2) a low proportion of articles reported the reliability coefficient or its range from the data at hand,5.80%and 6.20%respectively;3) P scale owns a low reliability coefficient,probably because its unidimension condition is not met;4) standard deviation of scores in subscales is the main predictor variable for the reliability coefficients of P, N,E and L subscale. However,differences in the studies are also discovered:1) 84.82%of researches in China do not report the reliability coefficient from the data at hand,compared with 62.51%of researches in other countries,which indicates that more scale users in China ignore the reliability from the data at hand;2) Variables such as the number of items,the mean of scores,standard deviation of age,sample type have different predictive functions on the reliability for EPQ in China and in other countries.In China,the prediction from the mean of scores to P,N and L subscale and the prediction from the number of items to P,E and N subscale are found to be statistically significant,which is not found in other countries.On the other hand,the prediction from the standard deviation of age to P,and L subscale and the prediction from the sample type to P subscale are not statistically significant in China,but are all significant in other countries. In summary,the analysis and comparison demonstrate that 1) "reliability induction" is inappropriate when questionnaires available are used;besides the background and size of samples,it is of great necessity to report the reliability coefficient of the samples at hand, which can make the research complete with the information which was mentioned above;2) a certain heterogeneity of samples may enhance the reliability in the use of EPQ;3) the increase of item numbers which does not follow the psychometric rules will not necessarily improve the reliability of scores.