Abstract: Optimal transport is a powerful framework for the efficient allocation of resources between sources and targets. However, traditional models often struggle to scale effectively in the presence of large and heterogeneous populations. In this work, we introduce a discrete optimal transport framework designed to handle large-scale, heterogeneous target populations, characterized by type distributions. We address two scenarios: one where the type distribution of targets is known, and one where it is unknown. For the known distribution, we propose a fully distributed algorithm to achieve optimal resource allocation. In the case of unknown distribution, we develop a federated learning-based approach that enables efficient computation of the optimal transport scheme while preserving privacy. Case studies are provided to evaluate the performance of our learning algorithm.
What problem does this paper attempt to address?
This paper addresses discrete optimal transport (DOT) for large-scale, heterogeneous target populations, especially under incomplete information. Specifically, it focuses on two scenarios:
1. **Known target type distribution**: In this case, the paper proposes a fully distributed algorithm to achieve optimal resource allocation.
2. **Unknown target type distribution**: In this case, the paper introduces a method based on federated learning (FL) that can efficiently calculate the optimal transport scheme while protecting privacy.
### Main problems
Traditional optimal transport models often fail to scale to large, heterogeneous target populations because they assume that complete information about all participants is available and that the transport network can be fully characterized. In modern settings such as supply chains and machine learning, however, networks are large and change dynamically, with many target nodes requesting resources, and resource providers often cannot obtain the preferences of every target node. The standard optimal transport framework therefore needs to be extended to large-scale, heterogeneous target populations with incomplete information.
### Solutions
The paper proposes a new optimal transport framework aimed at addressing the challenges brought by a large number of target nodes with different preferences. The framework solves the above problems in the following ways:
- **Representation of target types**: Use the type distribution function to classify target nodes based on their preferences for resources.
- **Two key settings**:
  - When the target type distribution is known, a fully distributed algorithm is proposed for optimally allocating resources among target nodes.
  - When the target type distribution is unknown, a federated learning method is introduced, enabling source nodes to collaboratively and efficiently update the transport scheme while gradually collecting new information about target nodes.
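
To make the type representation concrete, the following minimal Python sketch computes an empirical type distribution as the fraction of target nodes belonging to each type. The function name `type_distribution` and the type labels (`"bulk"`, `"express"`, `"standard"`) are hypothetical and chosen only for illustration; the paper's own classification rule is not reproduced here.

```python
from collections import Counter


def type_distribution(target_types):
    """Return the share of target nodes belonging to each type.

    `target_types` holds one type label per target node (hypothetical input
    format, assumed for this sketch).
    """
    if not target_types:
        raise ValueError("need at least one target node")
    counts = Counter(target_types)   # number of nodes of each type
    total = len(target_types)        # total number of target nodes
    return {t: n / total for t, n in counts.items()}


# Hypothetical population: each entry is the type label of one target node.
targets = ["bulk", "express", "bulk", "bulk",
           "express", "standard", "bulk", "standard"]
dist = type_distribution(targets)
```

When the distribution is unknown, the same computation could be rerun on whatever labels have been observed so far, which matches the idea of gradually collecting information about target nodes.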
### Advantages of federated learning
The federated learning method is particularly suitable for privacy-sensitive scenarios because it allows each target node to compute a local solution without directly sharing private data. These local solutions are then aggregated to form a global transport plan. This is of practical value in privacy-preserving systems and mobile computing environments.
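
The aggregation step can be illustrated with a short Python sketch. This summary does not state the paper's actual aggregation rule, so the weighted average below (in the style of federated averaging) is an assumption, as are the function name `aggregate_plans`, the dictionary representation of a plan, and the example weights. Only the locally computed plans are combined; no raw private data is passed to the aggregator.

```python
def aggregate_plans(local_plans, weights):
    """Combine local transport plans by a weighted average.

    Assumption for this sketch: each plan maps an edge (x, y) to a transported
    amount, and each weight reflects how much a participant's plan should
    count (for example, the number of nodes it represents).
    """
    if not local_plans or len(local_plans) != len(weights):
        raise ValueError("need one weight per local plan")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    # An edge missing from a local plan is treated as carrying zero flow.
    edges = set().union(*(plan.keys() for plan in local_plans))
    return {
        edge: sum(w * plan.get(edge, 0.0)
                  for plan, w in zip(local_plans, weights)) / total
        for edge in edges
    }


# Hypothetical local solutions from two participants.
plan_a = {("t1", "s1"): 2.0, ("t1", "s2"): 1.0}
plan_b = {("t1", "s1"): 4.0}
global_plan = aggregate_plans([plan_a, plan_b], weights=[3, 1])
```

A plain average of feasible plans need not satisfy every constraint of the original problem, so a real implementation would likely add a projection or correction step after aggregation.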
### Case studies
The paper provides case studies to evaluate the performance of the proposed learning algorithm, demonstrating its effectiveness in dealing with large-scale and heterogeneous target groups.
### Formula summary
The key formulas from the paper are summarized below:
1. **Target type distribution**:
\[
P_t(x)=\frac{n_x}{N}, \quad \forall x\in X
\]
where \(n_x\) represents the number of target nodes of type \(x\), and \(N\) is the total number of target nodes.
2. **Optimization problem**:
\[
\max_{\Pi}\sum_{x\in X}\sum_{y\in Y}t_{xy}(\pi_{xy})P_t(x)+\sum_{y\in Y}\sum_{x\in X}s_{xy}(\pi_{xy})P_t(x)
\]
Subject to the constraints:
\[
p_x\leq\sum_{y\in Y_x}\pi_{xy}\leq\bar{p}_x, \quad \forall x\in X,
\]
\[
q_y\leq\sum_{x\in X_y}\pi_{xy}P_t(x)\leq\bar{q}_y, \quad \forall y\in Y,
\]
\[
\pi_{xy}\geq0, \quad \forall \{x, y\}\in E.
\]
3. **Lagrangian function in ADMM algorithm**:
\[
L(\Pi^t, \Pi^s, \Pi, \alpha^t_{xy},