Scalable Community Extraction of Text Networks for Automated Grouping in Medical Databases

Tomilayo Komolafe, Allan Fong, Srijan Sengupta
DOI: https://doi.org/10.48550/arXiv.2111.15633
2021-11-28
Abstract: Networks are ubiquitous in today's world. Community structure is a well-known feature of many empirical networks, and many statistical methods have been developed for community detection. In this paper, we consider the problem of community structure in text networks, which is highly relevant to medical error and patient safety databases. We adapt a well-known community extraction method to develop a scalable algorithm for community extraction in large text databases. Applying our method to a real-world patient safety report database demonstrates that the groups generated by community extraction are much more accurate than manual tagging by frontline workers.
Social and Information Networks