Abstract:In this paper, we consider a multi-stage dynamic assortment optimization problem with multi-nomial choice modeling (MNL) under resource knapsack constraints. Given the current resource inventory levels, the retailer makes an assortment decision at each period, and the goal of the retailer is to maximize the total profit from purchases. With the exact optimal dynamic assortment solution being computationally intractable, a practical strategy is to adopt the re-solving technique that periodically re-optimizes deterministic linear programs (LP) arising from fluid approximation. However, the fractional structure of MNL makes the fluid approximation in assortment optimization highly non-linear, which brings new technical challenges. To address this challenge, we propose a new epoch-based re-solving algorithm that effectively transforms the denominator of the objective into the constraint. Theoretically, we prove that the regret (i.e., the gap between the resolving policy and the optimal objective of the fluid approximation) scales logarithmically with the length of time horizon and resource capacities.

What problem does this paper attempt to address?

The paper primarily focuses on addressing the multi-stage dynamic assortment optimization problem, particularly in the context of the multinomial logit (MNL) choice model under resource knapsack constraints. Specifically, the goal of the paper is to select a subset (i.e., "assortment") from a given set of products to offer to customers, in order to maximize the total profit that the retailer can obtain from these sales activities. Since precisely solving the optimal dynamic assortment is computationally challenging, the paper proposes a re-solving strategy that periodically re-optimizes a deterministic linear program (LP) derived from a fluid approximation. However, the fractional structure in the MNL choice model makes the assortment optimization in the fluid approximation highly nonlinear, presenting new technical challenges. To address this challenge, the authors propose a new cycle-based re-solving algorithm that effectively transforms the denominator of the objective function into part of the constraints. Theoretical analysis shows that the regret (i.e., the gap between the proposed strategy and the fluid approximation optimal solution) grows logarithmically with the length of the time horizon and the resource capacity. The key contributions of the paper can be summarized as follows: 1. **Design of a New Algorithm**: A new cycle-based re-solving algorithm is designed, which simplifies the problem by transforming the denominator of the fluid approximation's objective function into constraints and uses a cycle-based strategy to overcome the nonlinearity of the objective function. 2. **Theoretical Results**: It is proven that the regret between the proposed strategy and the fluid approximation optimal solution grows logarithmically with the length of the time horizon and the resource capacity. This is significantly better than the \(O(\sqrt{T})\) regret achieved in previous work. 3. **Practical Applications**: The work not only makes theoretical advancements but also proposes practical methods to handle the dynamic assortment optimization problem, which has broad application prospects in fields such as online retail and advertising. In summary, this paper addresses the problem of dynamic assortment optimization using the MNL model under resource constraints and proposes an effective re-solving strategy that significantly reduces the gap with the fluid approximation optimal solution.

A Re-solving Heuristic for Dynamic Assortment Optimization with Knapsack Constraints

A Unified Algorithmic Framework for Dynamic Assortment Optimization under MNL Choice

Online Joint Assortment-Inventory Optimization under MNL Choices

Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models.

Assortment and Inventory Planning under Dynamic (Stockout-Based) Substitution in the Presence of Customer Returns: A Fluid Analysis

Dynamic Assortment Selection under Inventory and Limited Switches Constraints

Assortment Optimization under the Multivariate MNL Model

Dynamic Assortment Optimization with Changing Contextual Information

Optimal Policy for Dynamic Assortment Planning under Multinomial Logit Models

Dynamic Assortment Optimization: Beyond MNL Model

Robust Dynamic Assortment Optimization in the Presence of Outlier Customers

An Optimal Policy for Dynamic Assortment Planning Under Uncapacitated Multinomial Logit Models

Dynamic Assortment with Online Learning under Threshold Multinomial Logit Model

Contextual Decision-Making with Knapsacks Beyond the Worst Case

Constrained Assortment Optimization under the Cross-Nested Logit Model

MNL-Bandits under Inventory and Limited Switches Constraints

Dynamic Assortment Optimization for Reusable Products with Random Usage Durations

Dynamic Pricing and Replenishment: Optimality, Bounds, and Asymptotics

Dynamic Assortment Selection under the Nested Logit Models.

Dynamic Assortment Planning Without Utility Parameter Estimation

Dynamic Joint Assortment and Pricing Optimization with Demand Learning.