Acquaintance: A Novel Vector-Space N-Gram Technique for Document Categorization

Stephen Huffman,M. Damashek
Abstract:Acquaintance is the name of a novel vector-space n-gram technique for categorizing documents. The technique is completely language independent, highly garble resistant, and computationally simple. An unoptimized version of the algorithm was used to process the TREC database in a very short time.
What problem does this paper attempt to address?