An Improved Parallel Algorithm for Text Categorization

Wenchuan Yang,Yimin Fu,Dong Zhang
DOI: https://doi.org/10.1109/is3c.2016.120
2016-01-01
Abstract:This paper proposes an approach using MapReduce-based Rocchio relevance feedback algorithm, which improved the traditional Rocchio algorithm in the MapReduce paradigm, to resolve the problem of massive information filtering. Traditional text classification algorithms have vital impact on information filtering.
What problem does this paper attempt to address?