Image Classification Approach of Bag of Visual Words Model Based on Hadoop

Chunping Hou,Qiannan Zhang,Baoliang Wang,Peng Chang,Shaowei Sun
DOI: https://doi.org/10.11784/tdxbz201604045
2017-01-01
Abstract:As the Internet grows and technology of acquiring digital images advances rapidly,problems with the conventional image classification methods gradually arise while dealing with massive digital images,such as being time-consuming and lacking timely update of the file system and processing architecture.To combat this problem,an image classification approach is proposed based on Apache Hadoop,the mainstream open-source distributed process-ing system.Firstly,the bag of visual words(BoVW)model was utilized to achieve simplified image representa-tions.Meanwhile,an improvement was made to the model during the histogram representation period and an adap-tive soft assignment algorithm was proposed.Lastly,the easy-paralleled random forest algorithm was employed as the classifier so as to make full use of the advantages of the platform.Experiments show that the proposed method of image classification based on Hadoop could effectively decrease the computing time compared with single-PC method while dealing with mass images,and at the same time gain good classification results.
What problem does this paper attempt to address?