EnvText: A Chinese text mining tool for environmental domain with advanced BERT model

Huaibin Bi, Bing Li, Yong Qiu, Change Miao
DOI: https://doi.org/10.1016/j.simpa.2023.100559
2023-01-01
Software Impacts
Abstract: EnvText is a user-friendly Natural Language Processing (NLP) software package designed to help NLP beginners analyze and mine Chinese text in the environmental domain. It integrates essential NLP resources, including domain-specific word lists, entity lists, corpora, and pre-trained models. It provides an excellent interface to neural network models, such as Bidirectional Encoder Representations from Transformers (BERT) and Recurrent Neural Networks (RNNs), which can automatically perform tasks such as text classification, entity extraction, and relationship extraction. Users can apply the pre-trained models included in EnvText or train their own models and use them for text analysis.