Mean Absolute Percentage Error for regression models

Arnaud De Myttenaere,Boris Golden,Bénédicte Le Grand,Fabrice Rossi
DOI: https://doi.org/10.1016/j.neucom.2015.12.114
2017-07-10
Abstract:We study in this paper the consequences of using the Mean Absolute Percentage Error (MAPE) as a measure of quality for regression models. We prove the existence of an optimal MAPE model and we show the universal consistency of Empirical Risk Minimization based on the MAPE. We also show that finding the best model under the MAPE is equivalent to doing weighted Mean Absolute Error (MAE) regression, and we apply this weighting strategy to kernel regression. The behavior of the MAPE kernel regression is illustrated on simulated data.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to explore the impact brought by using Mean Absolute Percentage Error (MAPE) as a quality measure for regression models. Specifically, the authors focus on the following aspects: 1. **Existence of the Optimal MAPE Model**: It is proven that when using MAPE as a loss function, there exists an optimal regression model. 2. **Consistency of Empirical Risk Minimization (ERM)**: It is shown that the empirical risk minimization based on MAPE has universal consistency, that is, as the sample size increases, the model obtained through ERM will converge to the optimal model. 3. **Relationship between MAPE and Weighted MAE**: It is pointed out that finding the best MAPE model is equivalent to performing weighted Mean Absolute Error (MAE) regression, and this weighting strategy is applied to kernel regression. 4. **Impact of Complexity Control**: The impact of using MAPE as a loss function on the covering number and Vapnik - Chervonenkis (VC) dimension is analyzed, thereby exploring its performance in the Uniform Law of Large Numbers (ULLN). 5. **Practical Applications**: The behavior of MAPE kernel regression is demonstrated through simulated data, verifying the validity of the theoretical results. ### Specific Problem Description The core problem of the paper is to study the theoretical properties of MAPE as an evaluation index for regression models and its practical application effects. MAPE is defined as: \[ L_{\text{MAPE}}(g) = E\left(\frac{|g(X) - Y|}{|Y|}\right) \] where \( g \) is a regression model, \( X \) is an input variable, and \( Y \) is a target variable. MAPE is widely used in practical applications due to its intuitive interpretation of relative error, especially in fields such as finance and price calibration. ### Main Contributions - **Theoretical Contributions**: Prove the existence of the optimal MAPE model and the consistency of ERM. - **Methodological Contributions**: Propose a method to transform MAPE into weighted MAE and apply it to kernel regression. - **Practical Applications**: Verify the effectiveness of MAPE regression through simulated data. These studies not only deepen the understanding of MAPE but also provide theoretical support and technical means for practical applications.