Research of Text Categorization Based on Immune Algorithm

ZHANG QIRUI,ZHANG LING,DONG SHOUBIN,TAN JINGHUA
2007-01-01
Abstract:The clonal selection principle and density control mechanism are used by the natural immune system to define the features of an immune response to an antigenic stimulus. It establishes the ideas that only those cells that have higher affinity and lower den-sity are selected to proliferate. A new algorithm,called the Clonal Selection Algorithm Based on Antibody Density (CSABAD),is brought forward and successfully implemented in text categorization. In text categorization,antigen,B cell and antibody are respec-tively corresponded with training text,an individual of classifier and affinity between the individual and training texts. The final clas-sifier is composed with many memory B cells. The method is applied to the 20_newsgroups dataset and we obtains a F1 score of 80.90%. The result shows that CSABAD significantly outperform Rocchio and Naive Bayes.
What problem does this paper attempt to address?