Detection of Article Qualities in the Chinese Wikipedia Based on C4.5 Decision Tree

Kui Xiao,Bing Li,Peng He,Xi-hui Yang
DOI: https://doi.org/10.1007/978-3-642-39787-5_36
2013-01-01
Abstract:The number of articles in Wikipedia is growing rapidly. It is important for Wikipedia to provide users with high quality and reliable articles. However, the quality assessment metric provided by Wikipedia are inefficient, and other mainstream quality detection methods only focus on the qualities of the English Wikipedia articles, and usually analyze the text contents of articles, which is also a time-consuming process. In this paper, we propose a method for detecting the article qualities of the Chinese Wikipedia based on C4.5 decision tree. The problem of quality detection is transformed to classification problem of high-quality and low-quality articles. By using the fields from the tables in the Chinese Wikipedia database, we built the decision trees to distinguish high-quality articles from low-quality ones.
What problem does this paper attempt to address?