Natural language processing was effective in assisting rapid title and abstract screening when updating systematic reviews

Xuan Qin,Jiali Liu,Yuning Wang,Yanmei Liu,Ke Deng,Yu Ma,Kang Zou,Ling Li,Xin Sun
DOI: https://doi.org/10.1016/j.jclinepi.2021.01.010
IF: 7.407
2021-05-01
Journal of Clinical Epidemiology
Abstract:<p><strong>Objectives</strong>: To examine whether the use of natural language processing (NLP) technology is effective in assisting rapid title and abstract screening when updating a systematic review.</p><p><strong>Study Design:</strong> Using the searched literature from a published systematic review, we trained and tested an NLP model that enables rapid title and abstract screening when updating a systematic review. The model was a light gradient boosting machine (LightGBM), an ensemble learning classifier which integrates four pretrained Bidirectional Encoder Representations from Transformers (BERT) models. We divided the searched citations into two sets (i.e., training and test sets). The model was trained using the training set and assessed for screening performance using the test set. The searched citations, whose eligibility was determined by two independent reviewers, were treated as the reference standard.</p><p><strong>Results:</strong> The test set included 947 citations; our model included 340 citations, excluded 607 citations, and achieved 96% sensitivity, and 78% specificity. If the classifier assessment in the case study was accepted, reviewers would lose 8 of 180 eligible citations (4%), none of which were ultimately included in the systematic review after full-text consideration, while decreasing the workload by 64.1%.</p><p><strong>Conclusions:</strong> NLP technology using the ensemble learning method may effectively assist in rapid literature screening when updating systematic reviews.</p>
public, environmental & occupational health,health care sciences & services
What problem does this paper attempt to address?