Bohrium
robot
新建

空间站广场

论文
Notebooks
比赛
课程
Apps
我的主页
我的Notebooks
我的论文库
我的足迹

我的工作空间

任务
节点
文件
数据集
镜像
项目
数据库
公开
DataMol | 分子构象对齐
化学信息学
DataMol
化学信息学DataMol
wangyz@dp.tech
发布于 2023-06-15
1
1
AI4SCUP-CNS-BBB(v1)

DataMol Tutorials - Aligning Molecules

©️ Copyright 2023 @ Authors
作者: 王应泽 📨
日期:2023-06-14
共享协议:本作品采用知识共享署名-非商业性使用-相同方式共享 4.0 国际许可协议进行许可。
快速开始:点击上方的 开始连接 按钮,选择 datamol:06-15镜像及任意节点配置,稍等片刻即可运行。

代码
文本

One of the most important things enabling us to have success in drug discovery is understanding structure-activity relationship (SAR). This is a foundation that explains how the structure of a molecule relates to its biological activity. Here is an excerpt from CDDVault that explains the importance of SAR in drug discovery:

”SAR depends on the recognition of which structural characteristics correlate with chemical and biological reactivity. Thus the ability to draw conclusions about an unknown compound depends upon both the structural features that can be characterized as well as the database of molecules against which they are compared. When combined with appropriate professional judgment, SAR can be a powerful tool for understanding functional implications when similarities are found. For example, in the case of risk assessment of uncharacterized compounds, data from the most sensitive toxicological endpoints should be included in the analysis, such as carcinogenicity or cardiotoxicity.”

This is where aligning molecules can be useful. Given the importance of structure, aligning molecules is generally useful for a few things:

  1. Aligning molecules makes it easier to visualize and highlight substructures
  2. It can help you identify and understand any shared structures between drug candidates
  3. It can help you identify any potential activity patterns
    1. For example, when analyzing activity cliffs - “defined as pairs or groups of structurally similar compounds that are active against the same target but have large differences in potency. Activity cliffs capture chemical modifications that strongly influence biological activity”. An example of some compounds with activity cliffs and their structural differences highlighted are shown below:

Aligning_1.png

Source

Note: The concept of an activity cliff seems simple in this example, however, in practice it is extremely complicated to represent them computationally and derive a systematic approach for identification. Read more about the evolving concept of activity cliffs here.

As an example to show you how alignment makes visualization easier, please see the image below. The alignment of this sample dataset makes it extremely easy to identify the core structure of 3 rings which are common in all compounds.

Aligning_2.png

Source

Tutorial

This tutorial will show you how to:

  1. Partition a list of molecules into clusters sharing a common scaffold of a common core, then align the molecules to that common core.
    1. Note: this function will compute the list of smiles/smarts representative of each cluster first.
  2. Aligning molecules according to a template molecule

Datamol example

代码
文本
[1]
import datamol as dm

data = dm.data.cdk2()[13:15]
smiles = data["smiles"].iloc[:].tolist()
mols = [dm.to_mol(s) for s in smiles]

dm.to_image(mols)
代码
文本
[2]
aligned_list = dm.align.auto_align_many(mols)
dm.to_image(aligned_list)
代码
文本
[3]
# Align the compounds within each cluster in a set of compounds
data = dm.data.cdk2()[0:24]
smiles = data["smiles"].iloc[:].tolist()
mols = [dm.to_mol(s) for s in smiles]

aligned_list = dm.align.auto_align_many(mols, partition_method="cluster")
dm.to_image(aligned_list, mol_size=(200, 150))
代码
文本
[4]
# You can also align a compound to a template compound
# In this example, we want to align the compound mols[0] with the compound mols[7]
mols[0]
代码
文本
[5]
mols[7]
代码
文本
[6]
mol_aligned = dm.align.template_align(mols[0], template=mols[7])
mol_aligned
代码
文本
化学信息学
DataMol
化学信息学DataMol
点个赞吧
推荐阅读
公开
DataMol | 分子信息预处理
化学信息学DataMol
化学信息学DataMol
wangyz@dp.tech
发布于 2023-06-15
2 转存文件
公开
DataMol | 分子描述符计算
化学信息学DataMol
化学信息学DataMol
wangyz@dp.tech
发布于 2023-06-15