A comparative study on application of data mining technique in human shape clustering: Principal component analysis vs. Factor analysis

J. Niu,Yiling He,Muyuan Li,Xin Zhang,L. Ran,Chuzhi Chao,Bao-dong Zhang
DOI: https://doi.org/10.1109/ICIEA.2010.5515577
2010-06-15
Abstract:Traditional human shape classification usually adopted some key measurements, leading several problems to product ergonomic design. Multivariate analysis is able to supplement the disadvantages of traditional method. Among methods of multivariate analysis, Principal component analysis (PCA) and Factor analysis (FA) have enjoyed widespread popularity. Though both of them are to reduce the dimensions of variances in the sample, there are differences between PCA and FA worth further investigation. The purpose of the paper is to demonstrate the differences between PCA and FA by analyzing the multivariate anthropometric data. K-means cluster analysis was developed to divide samples into groups with homogenous characteristics according to the PCA scores (or FA scores). ANOVA (analysis of variance) was adopted to compare the dimensions in corresponding clusters between PCA and FA. For all the dimensions, the p-value equals to 0.000, indicating there is significant difference for the samples between PCA and FA at the significance level of 0.005. Finally, the regression models of the reference dimensions based on the key dimensions, i.e., stature and waist girth, were investigated for the ease of utilization the FA (or PCA) results into applications such as building a family of digital manikin. In conclusion, the techniques have similarities and differences, and should not be abused. PCA analyzes all variance of the data set, while FA analyzes only common variances. A priori decision on the techniques depends on the domain expertise, and the statistic characteristics of the sample.
Engineering
What problem does this paper attempt to address?