A chemical reaction entity recognition method based on a natural language data augmentation strategy

Xiaowen Zhang, Yang Li, Chaoyi Li, Jingyuan Zhu, Zhiqiang Gan, Lei Wang, Xiaofei Sun, Hengzhi You
DOI: https://doi.org/10.1039/d4cc01471e
IF: 4.9
2024-08-18
Chemical Communications
Abstract: Impressive applications of artificial intelligence in chemical reaction prediction depend heavily on abundant, reliable datasets. The automated extraction of reaction procedures to build structured chemical databases is of growing importance. Here, we propose a novel model named DACRER for large-scale reaction extraction, which employs transfer learning and a data augmentation strategy. The model was evaluated on chemical datasets and showed good performance in identifying and processing chemical texts.
chemistry, multidisciplinary