Abstract:In strategic classification, agents modify their features, at a cost, to ideally obtain a positive classification from the learner's classifier. The typical response of the learner is to carefully modify their classifier to be robust to such strategic behavior. When reasoning about agent manipulations, most papers that study strategic classification rely on the following strong assumption: agents fully know the exact parameters of the deployed classifier by the learner. This often is an unrealistic assumption when using complex or proprietary machine learning techniques in real-world prediction tasks.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is in strategic classification, when agents only have partial information about the classifier used by the learner, how to optimize the model accuracy of the learner through the release of partial information. Specifically, the traditional assumption is that the agent fully understands the classifier parameters used by the learner, but this assumption is often not valid in practical applications, especially when using complex or proprietary machine - learning techniques for high - risk scoring and prediction tasks. Therefore, this paper proposes the problem of studying how the learner can maximize the model accuracy by releasing some real information when the agent has only partial knowledge of the classifier.
### Core problems of the paper
1. **Partial information assumption of agents**: Different from previous studies, this paper assumes that the agent does not fully understand the classifier used by the learner, but has a prior of a common distribution about the classifier.
2. **Release of partial information**: The learner can selectively release some real information about the classifier to help qualified agents pass through the classifier while preventing unqualified agents from using this information for manipulation.
3. **Strategy for optimizing information release**: The learner's goal is to determine how much information to release to maximize the model accuracy, even if this may increase the agent's ability to manipulate.
### Mathematical representation
Let \( \pi \) be the prior distribution of the agent over the classifier \( h\in H \), \( D \) be the distribution on the agent feature space \( X \), and the cost function \( c:X\times X\rightarrow[0,\infty) \) represents the cost for the agent to change features. The learner's goal is to find a subset \( H^*\subseteq H \) that contains its deployed classifier \( h \) such that its expected strategic accuracy is maximized:
\[ U(H)=\Pr_{x\sim D}[h(BR(x, H)) = f(x)] \]
where \( BR(x, H) \) represents the best response of the agent under the given information \( H \), that is, to maximize its utility:
\[ u_x(x', H)=\Pr_{h'\sim\pi|H}[h'(x') = 1]-c(x, x') \]
### Key contributions
- **New model**: Introduced the assumption that the agent has partial knowledge of the classifier and allows the learner to release partial information.
- **Algorithm complexity**: Studied the computational complexity of the agent's best response, proved that in general it is infeasible to calculate the best response, but there are efficient algorithms under specific conditions such as low - dimensional linear classifiers.
- **Time complexity of information release**: Explored how much information the learner should release to maximize the model accuracy, and provided a closed - form solution and an efficient algorithm for the uniform prior distribution.
In summary, this paper provides a new perspective for strategic classification by introducing the partial information assumption and the information release mechanism, aiming to solve the model optimization problem in the case of limited classifier information of agents in practical applications.