Identifying Privacy Leakage from User-Generated Content in An Online Health Community - A deep learning approach

Yushan Zhu, Xing Tong, Xi Wang
DOI: https://doi.org/10.1109/ICHI.2019.8904689
2019-01-01
Abstract: Online Health Communities (OHCs) have become a widely used resource for obtaining and sharing health-related information over the past decade. However, health information privacy issues in the OHC domain have not been fully explored. Insufficient attention to personal privacy management may result in intentional or unintentional disclosure of users' sensitive information, and consequently harm the communication environment as well as individuals' safety. Based on user-generated content, this preliminary research applies text mining to identify the different types of information leakage occurring in a breast cancer OHC. The preliminary results indicate that approximately 60% of the OHC users are willing to express their emotional feelings, and 10.86% are motivated to disclose their health information. In addition, longitudinal data from 2007 to 2018 will be analyzed to investigate OHC users' behavior trajectories in private information exposure. The findings of the study have practical implications for OHC users, administrators, and website designers.