Abstract:How to incentivize strategic workers using limited budget is a very fundamental problem for crowdsensing systems; nevertheless, since the sensing abilities of the workers may not always be known as prior knowledge due to the diversities of their sensor devices and behaviors, it is difficult to properly select and pay the unknown workers. Although the uncertainties of the workers can be addressed by the standard Combinatorial Multi-Armed Bandit (CMAB) framework in existing proposals through a trade-off between exploration and exploitation, we may not have sufficient budget to enable the trade-off among the individual workers, especially when the number of the workers is huge while the budget is limited. Moreover, the standard CMAB usually assumes the workers always stay in the system, whereas the workers may join in or depart from the system over time, such that what we have learnt for an individual worker cannot be applied after the worker leaves. To address the above challenging issues, in this paper, we first propose an off-line Context-Aware CMAB-based Incentive (CACI) mechanism. We innovate in leveraging the exploration-exploitation trade-off in an elaborately partitioned context space instead of the individual workers, to effectively incentivize the massive unknown workers with a very limited budget. We also extend the above basic idea to the on-line setting where unknown workers may join in or depart from the systems dynamically, and propose an on-line version of the CACI mechanism. Specifically, by the exploitation-exploration trade-off in the context space, we learn to estimate the sensing ability of any unknown worker (even it never appeared in the system before) according to its context information. We perform rigorous theoretical analysis to reveal the upper bounds on the regrets of our CACI mechanisms and to prove their truthfulness and individual rationality, respectively. Extensive experiments on both synthetic and real datasets are also conducted to verify the efficacy of our mechanisms.

K-Level Truthful Incentivizing Mechanism and Generalized K-Mab Problem

Explore Truthful Incentives for Tasks with Heterogenous Levels of Difficulty in the Sharing Economy.

Multi-Armed Bandit with Budget Constraint and Variable Costs.

Socially-Optimal Mechanism Design for Incentivized Online Learning

Auction-Based Combinatorial Multi-Armed Bandit Mechanisms with Strategic Arms

Robust and Performance Incentivizing Algorithms for Multi-Armed Bandits with Strategic Agents

Combination of Auction Theory and Multi-Armed Bandits: Model, Algorithm, and Application

Multi-armed Bandits with Cost Subsidy

Truthful and Dual-direction Combinatorial Multi-Armed Bandit Scheme to Maximize Profit for Mobile Crowd Sensing

Quality-Aware Incentive Mechanisms Under Social Influences in Data Crowdsourcing

Incentive Mechanism for Macrotasking Crowdsourcing: A Zero-Determinant Strategy Approach

Combinatorial Multi-Armed Bandit with General Reward Functions

Incentivizing Massive Unknown Workers for Budget-Limited Crowdsensing: From Off-Line and On-Line Perspectives

Who Should Be Given Incentives? Counterfactual Optimal Treatment Regimes Learning for Recommendation

Truth based three-tier Combinatorial Multi-Armed Bandit ecosystems for mobile crowdsensing

Budget-Feasible Online Incentive Mechanisms for Crowdsourcing Tasks Truthfully

Estimating and Incentivizing Imperfect-Knowledge Agents with Hidden Rewards

End-to-End Cost-Effective Incentive Recommendation under Budget Constraint with Uplift Modeling

Combinatorial Multi-Armed Bandit: General Framework and Applications.

Cost-Effective Incentive Allocation via Structured Counterfactual Inference

On Cost-Effective Incentive Mechanisms in Microtask Crowdsourcing