Parallel Structure Deep Neural Network Using CNN and RNN with an Attention Mechanism for Breast Cancer Histology Image Classification

Hongdou Yao,Xuejie Zhang,Xiaobing Zhou,Shengyan Liu
DOI: https://doi.org/10.3390/cancers11121901
2019-11-29
Cancers
Abstract:In this paper, we present a new deep learning model to classify hematoxylin–eosin-stained breast biopsy images into four classes (normal tissues, benign lesions, in situ carcinomas, and invasive carcinomas). Our model uses a parallel structure consist of a convolutional neural network (CNN) and a recurrent neural network (RNN) for image feature extraction, which is greatly different from the common existed serial method of extracting image features by CNN and then inputting them into RNN. Then, we introduce a special perceptron attention mechanism, which is derived from the natural language processing (NLP) field, to unify the features extracted by the two different neural network structures of the model. In the convolution layer, general batch normalization is replaced by the new switchable normalization method. And the latest regularization technology, targeted dropout, is used to substitute for the general dropout in the last three fully connected layers of the model. In the testing phase, we use the model fusion method and test time augmentation technology on three different datasets of hematoxylin–eosin-stained breast biopsy images. The results demonstrate that our model significantly outperforms state-of-the-art methods.
oncology
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the classification accuracy in breast cancer histological image classification. Specifically, the author proposes a new deep - learning model for classifying hematoxylin - eosin - stained breast biopsy images into four categories: normal tissue, benign lesions, carcinoma in situ, and invasive carcinoma. The solution to this problem is of great significance for the early diagnosis and treatment of breast cancer and can significantly improve the work efficiency of pathologists and the accuracy of diagnosis. ### Main contributions of the paper 1. **Innovation in model structure**: - **Parallel structure**: The model adopts a parallel structure composed of a convolutional neural network (CNN) and a recurrent neural network (RNN) to extract image features, which is different from the traditional method of first using CNN to extract features and then inputting them into RNN. - **Attention mechanism**: A special perceptron attention mechanism is introduced to unify the features extracted by two different neural network structures. This mechanism draws on techniques in the field of natural language processing (NLP). - **New normalization method**: In the convolutional layer, a new switchable normalization method is used instead of the traditional batch normalization. - **Targeted dropout**: In the last three fully - connected layers of the model, the latest targeted dropout technique is used instead of the traditional dropout method. 2. **Experimental verification**: - The model was tested on three different breast cancer datasets, including BACH2018, Bioimaging2015, and the extended Bioimaging2015 dataset. - The experimental results show that the performance of this model on these datasets is significantly better than the existing state - of - the - art methods. ### Key techniques and methods 1. **Parallel structure**: - **CNN module**: DenseNet is used as the CNN module, which solves the vanishing gradient problem through dense connection operations and reduces network parameters. - **RNN module**: Stacked LSTM (Long - Short - Term Memory network) is used as the RNN module to extract the time - series features of pixels. 2. **Attention mechanism**: - Through the perceptron attention mechanism, the importance of the features extracted by CNN and RNN is dynamically allocated, enabling the model to better balance the two types of features. 3. **New normalization method**: - The switchable normalization method is used, which combines the advantages of instance normalization, layer normalization, and batch normalization, enabling the model to learn the most suitable normalization method for each layer. 4. **Targeted dropout**: - The targeted dropout method is used in the fully - connected layers. Through the pruning strategy, different neurons are purposefully inhibited, improving the robustness and performance of the model. ### Experimental results - **BACH2018 dataset**: The best accuracy on the test set is 0.92, which is 1% higher than the current best result. - **Bioimaging2015 dataset**: The average accuracy on the test set is 98.3%, and the highest accuracy reaches 100%. In conclusion, through the innovative model structure and advanced technical means, this paper significantly improves the accuracy of breast cancer histological image classification and provides strong support for computer - aided diagnosis of breast cancer.