An Improvement of Data Classification Using Random Multimodel Deep Learning (RMDL)

Mojtaba Heidarysafa,Kamran Kowsari,Donald E. Brown,Kiana Jafari Meimandi,Laura E. Barnes
DOI: https://doi.org/10.18178/ijmlc.2018.8.4.703
2018-08-23
Abstract:The exponential growth in the number of complex datasets every year requires more enhancement in machine learning methods to provide robust and accurate data classification. Lately, deep learning approaches have achieved surpassing results in comparison to previous machine learning algorithms. However, finding the suitable structure for these models has been a challenge for researchers. This paper introduces Random Multimodel Deep Learning (RMDL): a new ensemble, deep learning approach for classification. RMDL solves the problem of finding the best deep learning structure and architecture while simultaneously improving robustness and accuracy through ensembles of deep learning architectures. In short, RMDL trains multiple randomly generated models of Deep Neural Network (DNN), Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) in parallel and combines their results to produce better result of any of those models individually. In this paper, we describe RMDL model and compare the results for image and text classification as well as face recognition. We used MNIST and CIFAR-10 datasets as ground truth datasets for image classification and WOS, Reuters, IMDB, and 20newsgroup datasets for text classification. Lastly, we used ORL dataset to compare the model performance on face recognition task.
Machine Learning,Computer Vision and Pattern Recognition,Information Retrieval,Neural and Evolutionary Computing
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to find the best deep - learning structures and architectures suitable for complex data classification. Specifically, with the exponential growth of complex data sets every year, machine - learning methods need to be continuously improved to provide more powerful and accurate data classification. Although deep - learning methods have achieved remarkable results in data classification such as images and texts, finding the structure that is most suitable for these models remains a challenge. For this reason, this paper proposes the Random Multimodel Deep Learning (RMDL) method. RMDL aims to solve the problem of finding the best deep - learning structure while improving classification accuracy by integrating multiple deep - learning architectures such as Deep Neural Networks (DNN), Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN). Specifically, RMDL trains multiple randomly - generated deep - learning models in parallel and combines their results to produce a better classification effect than a single model. This method not only improves the robustness and accuracy of the model, but also reduces the need for trial - and - error for specific applications and data sets. ### The main contributions of the paper include: 1. **Integrated deep - learning method**: By integrating multiple deep - learning models, the final model is made more robust and accurate. 2. **Optimization techniques**: Different optimization techniques are used to stabilize the classification task. 3. **Feature extraction methods**: Different feature extraction methods are adopted for each Random Deep Learning (RDL) model, especially for text data. 4. **Prevention of over - fitting**: The dropout technique is used in each individual RDL. 5. **Majority - voting mechanism**: By conducting a majority vote on n RDL models, the accuracy and robustness of the results are improved. 6. **Handling multiple data types**: RMDL is able to handle multiple data types such as text, image and video. Through these improvements, RMDL can show higher accuracy and robustness in different types of complex data classification tasks.