Experimental study of recognition rate in statistical pattern classification based on finite size of design sample set

Tetsuo Takeshita,Shigeyuki Nozawa,Jun‐Ichiro Toriwaki,Jun-Ichiro Toriwaki
DOI: https://doi.org/10.1002/scj.4690270206
1996-01-01
Systems and Computers in Japan
Abstract:When two classes of patterns follow the multidimensional normal distribution, the following procedure is considered. The same number of finite samples is extracted from each set of patterns. The covariance matrices and the average vectors are estimated. Using the estimated parameters, the linear discriminant function or the quadratic discriminant function is assessed by testing how correctly other samples extracted from the same set can be recognized. The authors found a method in which the above method is iterated to examine the deterioration of the recognition ratio due to the finiteness of the learning samples and to calculate the average recognition ratio. Next, the relations among the dimension of the feature patterns, the number of learning samples and the average recognition ratio are examined and compared to the expression for the approximate recognition ratio (theoretical formula) by Raudys and Fukunaga et al. The limit of application of the evaluation formula is indicated. The deterioration of the recognition ratio is examined, and it is shown that the error ratio is higher in the Bayes decision, and the error ratio becomes closer to the ‐distribution when the dimension of the feature vector is increased.
What problem does this paper attempt to address?