Abstract: We study in this paper the consequences of using the Mean Absolute Percentage Error (MAPE) as a measure of quality for regression models. We prove the existence of an optimal MAPE model and we show the universal consistency of Empirical Risk Minimization based on the MAPE. We also show that finding the best model under the MAPE is equivalent to doing weighted Mean Absolute Error (MAE) regression, and we apply this weighting strategy to kernel regression. The behavior of the MAPE kernel regression is illustrated on simulated data.

What problem does this paper attempt to address?

This paper examines the consequences of using the Mean Absolute Percentage Error (MAPE) as the quality measure for regression models. The authors focus on the following aspects:

1. **Existence of the Optimal MAPE Model**: The paper proves that an optimal regression model exists when MAPE is used as the loss function.
2. **Consistency of Empirical Risk Minimization (ERM)**: ERM based on MAPE is shown to be universally consistent; that is, as the sample size increases, the MAPE risk of the model obtained through ERM converges to the optimal MAPE risk.
3. **Relationship between MAPE and Weighted MAE**: Finding the best MAPE model is equivalent to performing weighted Mean Absolute Error (MAE) regression, where each observation is weighted by \( 1/|Y| \). This weighting strategy is applied to kernel regression.
4. **Impact of Complexity Control**: The paper analyzes how using MAPE as the loss function affects covering numbers and the Vapnik-Chervonenkis (VC) dimension, and therefore the conditions under which a Uniform Law of Large Numbers (ULLN) holds.
5. **Practical Applications**: The behavior of MAPE kernel regression is illustrated on simulated data.

### Specific Problem Description

The core problem of the paper is to study the theoretical properties of MAPE as an evaluation criterion for regression models, together with its behavior in practice. MAPE is defined as:

\[ L_{\text{MAPE}}(g) = E\left(\frac{|g(X) - Y|}{|Y|}\right) \]

where \( g \) is a regression model, \( X \) is the input variable, and \( Y \) is the target variable. MAPE is widely used in practice because it can be read directly as a relative error, especially in fields such as finance and price calibration.

### Main Contributions

- **Theoretical Contributions**: Prove the existence of the optimal MAPE model and the consistency of ERM.
- **Methodological Contributions**: Recast MAPE minimization as weighted MAE regression and apply it to kernel regression.
- **Practical Applications**: Illustrate the behavior of MAPE regression on simulated data.

Together, these results deepen the understanding of MAPE and provide theoretical support and practical tools for using it.
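The equivalence between MAPE and weighted MAE can be checked numerically. The following is a minimal sketch in plain Python, not the authors' code: the function names (`mape`, `weighted_mae`, `weighted_median`) and the toy data are assumptions made for illustration. It computes the empirical MAPE, confirms that it matches a weighted MAE with weights `1/|y_i|`, and then finds the best constant predictor under MAPE, which is a weighted median of the targets with those same weights (a standard property of weighted absolute-error minimization).

```python
def mape(y_true, y_pred):
    """Empirical MAPE: mean of |g(x_i) - y_i| / |y_i| (targets must be nonzero)."""
    return sum(abs(p - y) / abs(y) for y, p in zip(y_true, y_pred)) / len(y_true)


def weighted_mae(y_true, y_pred, weights):
    """Weighted MAE: mean of w_i * |g(x_i) - y_i|."""
    return sum(w * abs(p - y) for y, p, w in zip(y_true, y_pred, weights)) / len(y_true)


def weighted_median(values, weights):
    """Smallest value whose cumulative weight reaches half of the total weight."""
    pairs = sorted(zip(values, weights))
    half = sum(weights) / 2.0
    cumulative = 0.0
    for value, weight in pairs:
        cumulative += weight
        if cumulative >= half:
            return value
    return pairs[-1][0]


# Hypothetical toy data, chosen only for illustration.
y = [1.0, 2.0, 4.0, 8.0, 100.0]
pred = [1.5, 1.0, 5.0, 6.0, 90.0]

# MAPE weights: each absolute error is scaled by 1 / |y_i|.
w = [1.0 / abs(v) for v in y]

mape_value = mape(y, pred)
wmae_value = weighted_mae(y, pred, w)
print("MAPE:", mape_value)
print("weighted MAE:", wmae_value)

# Best constant model under MAPE: a weighted median of the targets.
best_constant = weighted_median(y, w)
print("best constant under MAPE:", best_constant)
print("MAPE of that constant:", mape(y, [best_constant] * len(y)))
```

Because the weights `1/|y_i|` are largest for targets close to zero, the MAPE-optimal constant in this toy example sits well below the ordinary median of the targets. This illustrates the pull toward small predictions that the weighting introduces.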

Mean Absolute Percentage Error for regression models

Using the Mean Absolute Percentage Error for Regression Models

Empirical risk minimization is consistent with the mean absolute percentage error

A better measure of relative prediction accuracy for model selection and model estimation

On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression

Root-mean-square error (RMSE) or mean absolute error (MAE): when to use them or not

The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation

A simulation study of the model evaluation criterion mmre

Least Absolute Relative Error Estimation

Error Density-dependent Empirical Risk Minimization

Least Product Relative Error Estimation

Optimal estimator for uncertainty-based measurement quality control

Model aggregation: minimizing empirical variance outperforms minimizing empirical error

A New Interpretation on the MMSE As a Robust MEE Criterion

On a Biased Prediction Based on Optimal Mean Square Error Criterion

Mean of Ratios or Ratio of Means: statistical uncertainty applied to estimate Multiperiod Probability of Defaul

Ermma: Expected Risk Minimization for Matrix Approximation-Based Recommender Systems

An Effective Meaningful Way to Evaluate Survival Models

Scoring rule nets: beyond mean target prediction in multivariate regression

Empirical Risk Minimization with Relative Entropy Regularization

The proximal map of the weighted mean absolute error