Abstract:In this paper, we present a new deep learning model to classify hematoxylin–eosin-stained breast biopsy images into four classes (normal tissues, benign lesions, in situ carcinomas, and invasive carcinomas). Our model uses a parallel structure consist of a convolutional neural network (CNN) and a recurrent neural network (RNN) for image feature extraction, which is greatly different from the common existed serial method of extracting image features by CNN and then inputting them into RNN. Then, we introduce a special perceptron attention mechanism, which is derived from the natural language processing (NLP) field, to unify the features extracted by the two different neural network structures of the model. In the convolution layer, general batch normalization is replaced by the new switchable normalization method. And the latest regularization technology, targeted dropout, is used to substitute for the general dropout in the last three fully connected layers of the model. In the testing phase, we use the model fusion method and test time augmentation technology on three different datasets of hematoxylin–eosin-stained breast biopsy images. The results demonstrate that our model significantly outperforms state-of-the-art methods.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to improve the classification accuracy in breast cancer histological image classification. Specifically, the author proposes a new deep - learning model for classifying hematoxylin - eosin - stained breast biopsy images into four categories: normal tissue, benign lesions, carcinoma in situ, and invasive carcinoma. The solution to this problem is of great significance for the early diagnosis and treatment of breast cancer and can significantly improve the work efficiency of pathologists and the accuracy of diagnosis. ### Main contributions of the paper 1. **Innovation in model structure**: - **Parallel structure**: The model adopts a parallel structure composed of a convolutional neural network (CNN) and a recurrent neural network (RNN) to extract image features, which is different from the traditional method of first using CNN to extract features and then inputting them into RNN. - **Attention mechanism**: A special perceptron attention mechanism is introduced to unify the features extracted by two different neural network structures. This mechanism draws on techniques in the field of natural language processing (NLP). - **New normalization method**: In the convolutional layer, a new switchable normalization method is used instead of the traditional batch normalization. - **Targeted dropout**: In the last three fully - connected layers of the model, the latest targeted dropout technique is used instead of the traditional dropout method. 2. **Experimental verification**: - The model was tested on three different breast cancer datasets, including BACH2018, Bioimaging2015, and the extended Bioimaging2015 dataset. - The experimental results show that the performance of this model on these datasets is significantly better than the existing state - of - the - art methods. ### Key techniques and methods 1. **Parallel structure**: - **CNN module**: DenseNet is used as the CNN module, which solves the vanishing gradient problem through dense connection operations and reduces network parameters. - **RNN module**: Stacked LSTM (Long - Short - Term Memory network) is used as the RNN module to extract the time - series features of pixels. 2. **Attention mechanism**: - Through the perceptron attention mechanism, the importance of the features extracted by CNN and RNN is dynamically allocated, enabling the model to better balance the two types of features. 3. **New normalization method**: - The switchable normalization method is used, which combines the advantages of instance normalization, layer normalization, and batch normalization, enabling the model to learn the most suitable normalization method for each layer. 4. **Targeted dropout**: - The targeted dropout method is used in the fully - connected layers. Through the pruning strategy, different neurons are purposefully inhibited, improving the robustness and performance of the model. ### Experimental results - **BACH2018 dataset**: The best accuracy on the test set is 0.92, which is 1% higher than the current best result. - **Bioimaging2015 dataset**: The average accuracy on the test set is 98.3%, and the highest accuracy reaches 100%. In conclusion, through the innovative model structure and advanced technical means, this paper significantly improves the accuracy of breast cancer histological image classification and provides strong support for computer - aided diagnosis of breast cancer.

Parallel Structure Deep Neural Network Using CNN and RNN with an Attention Mechanism for Breast Cancer Histology Image Classification

Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example

Classification of Breast Cancer Lesions in Ultrasound Images by using Attention Layer and loss Ensembles in Deep Convolutional Neural Networks

Classification of breast cancer histology images using Convolutional Neural Networks

Multi-stages attention Breast cancer classification based on nonlinear spiking neural P neurons with autapses

Classification of Breast Cancer Lesions in Ultrasound Images by Using Attention Layer and Loss Ensemble in Deep Convolutional Neural Networks

A Multi-Task Convolutional Neural Network for Lesion Region Segmentation and Classification of Non-Small Cell Lung Carcinoma

Neural Network Based Classification of Breast Cancer Histopathological Image from Intraoperative Rapid Frozen Sections

Improved Breast Cancer Classification through Combining Transfer Learning and Attention Mechanism

BCDnet: Parallel heterogeneous eight-class classification model of breast pathology

A deep multi-branch attention model for histopathological breast cancer image classification

Using convolutional neural networks for the classification of breast cancer images

LMHistNet: Levenberg–Marquardt Based Deep Neural Network for Classification of Breast Cancer Histopathological Images

Breast cancer histopathology image classification based on dual-stream high-order network

Using parallel pre-trained types of DCNN model to predict breast cancer with color normalization

Attention-Based Deep Learning Approach for Breast Cancer Histopathological Image Multi-Classification

Comparative Analysis and Ensemble Enhancement of Leading CNN Architectures for Breast Cancer Classification

Interpretable Classification from Skin Cancer Histology Slides Using Deep Learning: A Retrospective Multicenter Study

Automatic approach for breast cancer detection based on deep belief network using histopathology images

Look, Investigate, and Classify: A Deep Hybrid Attention Method for Breast Cancer Classification

Super-resolution and segmentation deep learning for breast cancer histopathology image analysis