QoS-Aware Multi-Armed Bandits

Lenz Belzner, Thomas Gabor
DOI: https://doi.org/10.1109/FAS-W.2016.36
2017-02-28
Abstract: Motivated by runtime verification of QoS requirements in self-adaptive and self-organizing systems that are able to reconfigure their structure and behavior in response to runtime data, we propose a QoS-aware variant of Thompson sampling for multi-armed bandits. It is applicable in settings where QoS satisfaction of an arm has to be ensured with high confidence efficiently, rather than finding the optimal arm while minimizing regret. Preliminary experimental results encourage further research in the field of QoS-aware decision making.
Machine Learning, Software Engineering
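
To make the setting in the abstract concrete, the following is a minimal sketch of one way a QoS-aware stopping rule could be layered on Beta-Bernoulli Thompson sampling. It is an illustration under stated assumptions, not the authors' algorithm: the abstract does not give the procedure, so the Bernoulli reward model, the uniform Beta(1, 1) prior, the Monte Carlo confidence estimate, and the names `prob_meets_qos`, `qos_thompson`, `threshold`, and `confidence` are all hypothetical. The loop stops as soon as the arm just played satisfies the QoS threshold with the requested posterior confidence, instead of continuing to minimize regret.

```python
import random


def prob_meets_qos(successes, failures, threshold, rng, n_samples=2000):
    """Monte Carlo estimate of P(theta >= threshold) under a Beta posterior.

    Assumes a Beta(1, 1) prior, so the posterior is
    Beta(1 + successes, 1 + failures). Hypothetical helper.
    """
    hits = sum(
        rng.betavariate(1 + successes, 1 + failures) >= threshold
        for _ in range(n_samples)
    )
    return hits / n_samples


def qos_thompson(pull, n_arms, threshold, confidence, max_pulls, seed=0):
    """Play arms by Thompson sampling until one is certified.

    pull(arm) must return True (QoS satisfied) or False for one trial.
    Returns (arm, pulls_used), or (None, max_pulls) if no arm reaches
    the requested confidence within the budget.
    """
    rng = random.Random(seed)
    successes = [0] * n_arms
    failures = [0] * n_arms
    for t in range(1, max_pulls + 1):
        # Thompson step: draw one sample per posterior, play the largest.
        draws = [
            rng.betavariate(1 + successes[a], 1 + failures[a])
            for a in range(n_arms)
        ]
        arm = max(range(n_arms), key=draws.__getitem__)
        if pull(arm):
            successes[arm] += 1
        else:
            failures[arm] += 1
        # QoS check (assumed rule): stop once the played arm meets the
        # threshold with at least the requested posterior confidence.
        p = prob_meets_qos(successes[arm], failures[arm], threshold, rng)
        if p >= confidence:
            return arm, t
    return None, max_pulls
```

For example, `qos_thompson(pull, n_arms=2, threshold=0.8, confidence=0.95, max_pulls=500)` returns the first arm whose estimated posterior probability of a success rate of at least 0.8 reaches 0.95, together with the number of pulls used. The Monte Carlo estimate keeps the sketch free of external dependencies; an exact Beta CDF would remove the sampling noise in the stopping decision.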