Abstract:Introduction: Proteins located in subcellular compartments have played an indispensable role in the physiological function of eukaryotic organisms. The pattern of protein subcellular localization is conducive to understanding the mechanism and function of proteins, contributing to investigating pathological changes of cells, and providing technical support for targeted drug research on human diseases. Automated systems based on featurization or representation learning and classifier design have attracted interest in predicting the subcellular location of proteins due to a considerable rise in proteins. However, large-scale, fine-grained protein microscopic images are prone to trapping and losing feature information in the general deep learning models, and the shallow features derived from statistical methods have weak supervision abilities. Methods: In this work, a novel model called HAR_Locator was developed to predict the subcellular location of proteins by concatenating multi-view abstract features and shallow features, whose advanced advantages are summarized in the following three protocols. Firstly, to get discriminative abstract feature information on protein subcellular location, an abstract feature extractor called HARnet based on Hybrid Attention modules and Residual units was proposed to relieve gradient dispersion and focus on protein-target regions. Secondly, it not only improves the supervision ability of image information but also enhances the generalization ability of the HAR_Locator through concatenating abstract features and shallow features. Finally, a multi-category multi-classifier decision system based on an Artificial Neural Network (ANN) was introduced to obtain the final output results of samples by fitting the most representative result from five subset predictors. Results: To evaluate the model, a collection of 6,778 immunohistochemistry (IHC) images from the Human Protein Atlas (HPA) database was used to present experimental results, and the accuracy, precision, and recall evaluation indicators were significantly increased to 84.73%, 84.77%, and 84.70%, respectively, compared with baseline predictors.

Learning complex subcellular distribution patterns of proteins via analysis of immunohistochemistry images

Automated classification of protein subcellular localization in immunohistochemistry images to reveal biomarkers in colon cancer

Automated classification of protein subcellular location patterns on images of human reproductive tissues

Bioimage-based Protein Subcellular Location Prediction: a Comprehensive Review

ImPLoc: a Multi-Instance Deep Learning Model for the Prediction of Protein Subcellular Localization Based on Immunohistochemistry Images.

Bioimaging-Based Detection Of Mislocalized Proteins In Human Cancers By Semi-Supervised Learning

Learning Protein Subcellular Localization Multi-View Patterns from Heterogeneous Data of Imaging, Sequence and Networks

Deep Learning-Based Classification of Protein Subcellular Localization from Immunohistochemistry Images

HAR_Locator: a novel protein subcellular location prediction model of immunohistochemistry images based on hybrid attention modules and residual units

Image-based Classification of Protein Subcellular Location Patterns in Human Reproductive Tissue by Ensemble Learning Global and Local Features

An image-based multi-label human protein subcellular localization predictor (iLocator) reveals protein mislocalizations in cancer tissues.

Protein Subcellular Localization Prediction by Concatenation of Convolutional Blocks for Deep Features Extraction From Microscopic Images

Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence.

Automated Image-Based Protein Subcellular Location Prediction in Human Reproductive Tissue Based on Ensemble Learning Global and Local Patterns

Image-Based Human Protein Subcellular Location Prediction Using Local Tetra Patterns Descriptor

Consistency and Variation of Protein Subcellular Location Annotations.

Automated identification of protein expression intensity and classification of protein cellular locations in mouse brain regions from immunofluorescence images

Multi-scale Deep Learning for the Imbalanced Multi-Label Protein Subcellular Localization Prediction Based on Immunohistochemistry Images

Identification of Multiple Subcellular Locations for Proteins in Budding Yeast

Incorporating Organelle Correlations into Semi-Supervised Learning for Protein Subcellular Localization Prediction

Human Proteins Characterization with Subcellular Localizations.