The Minimum Information about a Tailoring Enzyme/Maturase data standard for capturing natural product biosynthesis

Mitja M. Zdouc, David Meijer, Friederike Biermann, Jonathan Holme, Aleksandra Korenskaia, Annette Lien, Nico L. L. Louwen, Jorge C. Navarro-Muñoz, Giang-Son Nguyen, Adriano Rutz, Anastasia Sveshnikova, Judith Szenei, Barbara Terlouw, Rosina Torres Ortega, Marc Feuermann, Alan J. Bridge, Justin J. J. van der Hooft, Tilmann Weber, Nadine Ziemert, Kai Blin, Marnix H. Medema
DOI: https://doi.org/10.26434/chemrxiv-2024-78mtl
2024-04-09
Abstract: Natural products, also known as specialized or secondary metabolites, show extraordinary chemical diversity and potent biological activities. Their biosynthesis usually first encompasses scaffold generation, followed by additional tailoring and maturation steps, leading to the mature compound. The latter steps are often performed by accessory enzymes known as tailoring enzymes or maturases. While knowledge about reaction and substrate specificities of these enzymes is essential for natural product biosynthesis, it is often scattered in the literature, hampering understanding and computational processing. Here, we conceptualize the Minimum Information about a Tailoring Enzyme/Maturase (MITE) data standard. We envision this data standard to serve in collecting experimentally verified data on reaction, substrate specificity, and other metadata of tailoring enzymes. Closely associated with the previously established Minimum Information about a Biosynthetic Gene Cluster (MIBiG) data standard, MITE will aim to capture tailoring enzyme reaction information that is currently not systemized. We anticipate that MITE will accelerate natural product structure predictions from sequence, evolutionary analyses of biosynthetic pathways and synthetic biology engineering of specialized metabolic pathways.
Chemistry
What problem does this paper attempt to address?
### Problems the paper attempts to solve

This paper addresses the lack of a data standard for natural product (NP) tailoring enzymes and maturases. Although these enzymes play a crucial role in natural product biosynthesis, knowledge about their reaction specificity and substrate specificity is currently scattered across the literature, with no systematic organization or standard data format. This hinders understanding of how these enzymes function and limits computational processing and downstream applications.

To solve this problem, the authors propose a data standard named **Minimum Information about a Tailoring Enzyme/Maturase (MITE)**. MITE is intended to collect reaction specificity, substrate specificity, and other metadata for experimentally verified tailoring enzymes and maturases. It is closely associated with the existing Minimum Information about a Biosynthetic Gene Cluster (MIBiG) standard, but focuses on capturing tailoring enzyme reaction information that is not currently systematized.

### Main objectives

1. **Standardize data formats**: Provide a standardized data format for recording the reaction specificity and substrate specificity of tailoring enzymes and maturases.
2. **Promote data sharing**: Make it easier for researchers to share and exchange data by using a common format.
3. **Support computational analysis**: Provide structured data that computational biology tools can use for natural product structure prediction, evolutionary analysis of biosynthetic pathways, and synthetic biology engineering.
4. **Fill the gaps in existing databases**: Existing databases such as Rhea, BRENDA, and RetroRules have limitations in describing the reaction and substrate specificity of natural product tailoring enzymes. MITE aims to fill these gaps.
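
To make the idea of a "minimum information" record more concrete, the following is a minimal Python sketch of what such an entry and a completeness check could look like. It is not the actual MITE schema: the class names, field names, the `missing_fields` helper, the placeholder reference, and the toy O-methylation example are all assumptions introduced here for illustration only.

```python
from dataclasses import dataclass, field
from typing import List

# NOTE: every class, field, and function name below is hypothetical.
# This is an illustration of the concept, not the real MITE schema.


@dataclass
class ReactionPair:
    """One experimentally verified substrate/product pair (as SMILES)."""
    substrate: str
    product: str


@dataclass
class TailoringEnzymeEntry:
    """Hypothetical minimal record for one tailoring enzyme."""
    enzyme_name: str
    reaction_smarts: str  # generalized reaction rule, "reactants>agents>products"
    validated_pairs: List[ReactionPair] = field(default_factory=list)
    literature_refs: List[str] = field(default_factory=list)
    mibig_accession: str = ""  # optional cross-link to a gene cluster entry


def missing_fields(entry: TailoringEnzymeEntry) -> List[str]:
    """Return the names of required fields that are absent or malformed."""
    problems = []
    if not entry.enzyme_name.strip():
        problems.append("enzyme_name")
    # Crude syntactic check only: a reaction SMARTS has exactly two '>'
    # separators. This does not verify that the chemistry is valid.
    if entry.reaction_smarts.count(">") != 2:
        problems.append("reaction_smarts")
    if not entry.validated_pairs:
        problems.append("validated_pairs")
    if not entry.literature_refs:
        problems.append("literature_refs")
    return problems


# Toy example (assumption, not taken from the paper): a generic aromatic
# O-methylation, illustrated with phenol -> anisole.
example = TailoringEnzymeEntry(
    enzyme_name="hypothetical O-methyltransferase",
    reaction_smarts="[c:1][OH:2]>>[c:1][O:2]C",
    validated_pairs=[ReactionPair(substrate="Oc1ccccc1", product="COc1ccccc1")],
    literature_refs=["PLACEHOLDER-REFERENCE"],
)

print(missing_fields(example))  # prints []
```

Keeping the generalized reaction rule separate from the verified substrate/product pairs reflects the distinction the summary draws between reaction specificity and substrate specificity. A real implementation would likely validate the SMARTS and SMILES strings with a cheminformatics toolkit rather than with the string check used here.
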
### Case study

To demonstrate the application of the MITE data standard, the authors analyze the biosynthetic pathway of the antibacterial peptide microcin J25 as a worked example. The biosynthesis of microcin J25 involves four genes (mcjA, mcjB, mcjC, and mcjD). Of these, mcjB and mcjC are responsible, respectively, for the folding and cleavage of the precursor peptide and for the formation of the characteristic lariat fold. The authors created MITE entries for these two enzymes, detailing the reaction SMARTS and the verified reaction pairs for each.

### Conclusion

The MITE data standard is proposed to advance research on natural product tailoring enzymes and maturases by providing a standardized, systematic data format. The authors expect it to accelerate natural product structure prediction and the analysis of biosynthetic pathways, and to support synthetic biology and drug development.