Provably Beneficial Artificial Intelligence

Stuart Russell
DOI: https://doi.org/10.1145/3490099.3519388
2022-03-22
Abstract:As AI advances in capabilities and moves into the real world, its potential to benefit humanity seems limitless. Yet we see serious problems including racial and gender bias, manipulation by social media, and an arms race in lethal autonomous weapons. Looking further ahead, Alan Turing predicted the eventual loss of human control over machines that exceed human capabilities. I will argue that Turing was right to express concern but wrong to think that doom is inevitable. Instead, we need to develop a new kind of AI that is provably beneficial to humans.
What problem does this paper attempt to address?