Pruning Distorted Images in MNIST Handwritten Digits

Amarnath R,Vinay Kumar V
2023-05-26
Abstract:Recognizing handwritten digits is a challenging task primarily due to the diversity of writing styles and the presence of noisy images. The widely used MNIST dataset, which is commonly employed as a benchmark for this task, includes distorted digits with irregular shapes, incomplete strokes, and varying skew in both the training and testing datasets. Consequently, these factors contribute to reduced accuracy in digit recognition. To overcome this challenge, we propose a two-stage deep learning approach. In the first stage, we create a simple neural network to identify distorted digits within the training set. This model serves to detect and filter out such distorted and ambiguous images. In the second stage, we exclude these identified images from the training dataset and proceed to retrain the model using the filtered dataset. This process aims to improve the classification accuracy and confidence levels while mitigating issues of underfitting and overfitting. Our experimental results demonstrate the effectiveness of the proposed approach, achieving an accuracy rate of over 99.5% on the testing dataset. This significant improvement showcases the potential of our method in enhancing digit classification accuracy. In our future work, we intend to explore the scalability of this approach and investigate techniques to further enhance accuracy by reducing the size of the training data.
Computer Vision and Pattern Recognition,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the issue of reduced handwritten digit recognition accuracy in the MNIST handwritten digit dataset due to image distortion and blurriness. Specifically, the paper proposes a two-stage deep learning approach to detect and remove distorted and blurry images from the training set, thereby improving the overall accuracy and reliability of the handwritten digit classification model. By filtering out these noisy images, the quality of the training data is enhanced, enabling the model to achieve an accuracy of over 99.5% on the test set. This method aims to reduce the problems of overfitting and underfitting and increase the confidence level of predictions.