Automated Extraction of ABAC Policies from Natural-Language Documents in Healthcare Systems.

Yutang Xia,Shengfang Zhai,Qinting Wang,Huiting Hou,Zhonghai Wu,Qingni Shen
DOI: https://doi.org/10.1109/bibm55620.2022.9995559
2022-01-01
Abstract:The healthcare system is a distributed collaborative system and the sensitivity of the medical data is one of the most important requirements. Preventing unauthorized access to healthcare information and data sharing security in the healthcare environment are critical processes that affect the credibility of the system. To achieve this goal and to meet the requirements of the healthcare system, access control is an important measure to realize the safe sharing of resources. The attribute-based access control (ABAC) model meets the complex security requirements of large and complex systems and provides a dynamic, flexible and scalable solution. The main obstacle to deploying ABAC is the precise development of ABAC policies. Manually developing access control policies is tedious, time-consuming and error prone. Most systems have high-level requirement specifications, which are written in natural language. These natural language (NL) documents have the intended access control policies for the systems. In this paper, we propose a new approach towards extracting policies from natural language documents. By fully taking advantage of Bidirectional Encoder Representations from Transformers (BERT) and Semantic role labeling (SRL), we are able to correctly identify access control policy (ACP) sentences with an average F1 score of 85% and correctly extract rules with an average F1 score of 72%, which outperforms the state-of-the-art and leads to a performance improvement of 7% and 2% respectively over the previously reported results.
What problem does this paper attempt to address?