Abstract:Our view of proteases has come a long way since P. A. Levene reported his studies on “The Cleavage Products of Proteoses” in the first issue of The Journal of Biological Chemistry published October 1, 1905 (1). Today, after more than 100 years and 350,000 articles on these enzymes in the scientific literature, proteases remain at the cutting edge of biological research. Proteases likely arose at the earliest stages of protein evolution as simple destructive enzymes necessary for protein catabolism and the generation of amino acids in primitive organisms. For many years, studies on proteases focused on their original roles as blunt aggressors associated with protein demolition. However, the realization that, beyond these nonspecific degradative functions, proteases act as sharp scissors and catalyze highly specific reactions of proteolytic processing, producing new protein products, inaugurated a new era in protease research (2). The current success of research in this group of ancient enzymes derives mainly from the large collection of findings demonstrating their relevance in the control of multiple biological processes in all living organisms (3–11). Thus, proteases regulate the fate, localization, and activity of many proteins, modulate protein-protein interactions, create new bioactive molecules, contribute to the processing of cellular information, and generate, transduce, and amplify molecular signals. As a direct result of these multiple actions, proteases influence DNA replication and transcription, cell proliferation and differentiation, tissue morphogenesis and remodeling, heat shock and unfolded protein responses, angiogenesis, neurogenesis, ovulation, fertilization, wound repair, stem cell mobilization, hemostasis, blood coagulation, inflammation, immunity, autophagy, senescence, necrosis, and apoptosis. Consistent with these essential roles of proteases in cell behavior and survival and death of all organisms, alterations in proteolytic systems underlie multiple pathological conditions such as cancer, neurodegenerative disorders, and inflammatory and cardiovascular diseases. Accordingly, many proteases are a major focus of attention for the pharmaceutical industry as potential drug targets or as diagnostic and prognostic biomarkers (12). Proteases also play key roles in plants and contribute to the processing, maturation, or destruction of specific sets of proteins in response to developmental cues or to variations in environmental conditions (13). Likewise, many infectious microorganisms require proteases for replication or use proteases as virulence factors, which has facilitated the development of protease-targeted therapies for diseases of great relevance to human life such as AIDS (12). Finally, proteases are also important tools of the biotechnological industry because of their usefulness as biochemical reagents or in the manufacture of numerous products (e.g. Ref. 14). This outstanding diversity in protease functions directly results from the evolutionary invention of a multiplicity of enzymes that exhibit a variety of sizes and shapes. Thus, the architectural design of proteases ranges from small enzymes made up of simple catalytic units (∼20 kDa) to sophisticated protein-processing and degradation machines, like the proteasome and meprin metalloproteinase isoforms (0.7–6 MDa) (15). In terms of specificity, diversity is also a common rule. Thus, some proteases exhibit an exquisite specificity toward a unique peptide bond of a single protein (e.g. angiotensin-converting enzyme); however, most proteases are relatively nonspecific for substrates, and some are overtly promiscuous and target multiple substrates in an indiscriminate manner (e.g. proteinase K). Proteases also follow different strategies to establish their appropriate location in the cellular geography and, in most cases, operate in the context of complex networks comprising distinct proteases, substrates, cofactors, inhibitors, adaptors, receptors, and binding proteins, which provide an additional level of interest but also complexity to the study of proteolytic enzymes. This work aims at serving as a primer to a minireview series on proteases to be published in forthcoming issues of this Journal. This introductory article will focus on the discussion of the large and growing complexity of proteolytic enzymes present in all organisms, from bacteria to man. We will first show the results of comparative genomic analysis that have shed light on the real dimensions of the proteolytic space. The levels of protease complexity and mechanisms of protease regulation will then be addressed. Finally, we will discuss current frontiers and future perspectives in protease research.

Identification of Proteases and Their Types

ProtIdent: a Web Server for Identifying Proteases and Their Types by Fusing Functional Domain and Sequential Evolution Information.

Comprehensive protease specificity profiling

Identification of Phosphopeptides with Unknown Cleavage Specificity by a De Novo Sequencing Assisted Database Search Strategy.

Bioinformatic approaches for predicting substrates of proteases.

Prediction of Peptidase Category Based on Functional Domain Composition.

Proteases: Multifunctional Enzymes in Life and Disease*

HIV‐1 Protease Cleavage Site Prediction Based on Amino Acid Property

Procleave: Predicting Protease-specific Substrate Cleavage Sites by Combining Sequence and Structural Information

Twenty Years of Bioinformatics Research for Protease-Specific Substrate and Cleavage Site Prediction: a Comprehensive Revisit and Benchmarking of Existing Methods.

Peptide Codes For Multiple Protease Activity Assay Via High-Resolution Mass Spectrometric Quantitation

iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.

Study Of Inhibitors Against Sars Coronavirus By Computational Approaches

Non-prime- and Prime-side Profiling of Pro-Pro Endopeptidase Specificity Using Synthetic Combinatorial Peptide Libraries and Mass Spectrometry

Accelerating Proteomics Using Broad Specificity Proteases

PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy.

Revealing Favorable and Unfavorable Residues in Cooperative Positions in Protease Cleavage Sites.

DNA‐Encoded Noncanonical Substrate Library for Protease Profiling

Purification and Characterisation of a Novel Protease from Cordyceps Sinensis and Determination of the Cleavage Site Motifs Using Oriented Peptide Library Mixtures

The utility of proteases in proteomics, from sequence profiling to structure and function analysis

Sensitive Identification of Known and Unknown Protease Activities by Unsupervised Linear Motif Deconvolution