Predictive Privacy: Collective Data Protection in the Context of AI and Big Data

Rainer Muehlhoff
Abstract:Big data and artificial intelligence (AI) pose a new challenge for data protection. This is because these techniques are used to make predictions about third parties based on the anonymous data of many people, for example about purchasing power, gender, age, sexual orientation, ethnicity, the course of an illness, etc. The basis for such applications of "predictive analytics" is a comparison of behavioural data (e.g. usage, tracking or activity data) of the individual in question with the potentially anonymously processed data of many others using machine learning models or simpler statistical methods. The article first points out that there is considerable potential for abuse associated with predictive analytics, which manifests itself as social inequality, discrimination and exclusion. These potentials for abuse are not regulated by current data protection law (EU GDPR); in fact, the use of anonymised mass data takes place in a largely unregulated space. Under the term "predictive privacy", a data protection approach is presented that counters the risks of abuse of predictive analytics. The predictive privacy of a person or group is violated when sensitive information about them is predicted based on the data of many other individuals without their knowledge and against their will. Predictive privacy is then formulated as a collectivist protected good of data protection and various improvements of the GDPR with regard to the regulation of predictive analytics are proposed.
Law,Computer Science
What problem does this paper attempt to address?