A non-parametric estimator for Archimedean copulas under flexible censoring scenarios and an application to claims reserving

Marie Michaelides, Hélène Cossette, Mathieu Pigeon
DOI: https://doi.org/10.48550/arXiv.2401.07724
2024-01-15
Abstract: With insurers benefiting from ever-larger amounts of data of increasing complexity, we explore a data-driven method to model dependence within multilevel claims in this paper. More specifically, we start from a non-parametric estimator for Archimedean copula generators introduced by Genest and Rivest (1993), and we extend it to diverse flexible censoring scenarios using techniques derived from survival analysis. We implement a graphical selection procedure for copulas that we validate using goodness-of-fit methods applied to complete, single-censored, and double-censored bivariate data. We illustrate the performance of our model with multiple simulation studies. We then apply our methodology to a recent Canadian automobile insurance dataset where we seek to model the dependence between the activation delays of correlated coverages. We show that our model performs quite well in selecting the best-fitted copula for the data at hand, especially when the dataset is large, and that the results can then be used as part of a larger claims reserving methodology.
Methodology, Applications
### What problems does this paper attempt to solve?

This paper addresses two difficulties in insurance data: complex dependence structures and censored observations. The authors propose a non-parametric estimation method to model dependence within multilevel claims and extend it to flexible censoring scenarios. The main problems the paper attempts to solve are:

1. **Handling complex data structures**:
   - As insurers collect ever-larger and more complex datasets, traditional parametric methods may fail to fully capture the dependence in these data.
   - The authors use a non-parametric estimator to handle this complexity, particularly when large amounts of data are available.
2. **Dealing with censored data**:
   - Censoring is common in insurance data: some values are only partially observed.
   - For example, a claim amount may be censored because of reinsurance treaties, or payments may stop after a given payment period even though the claim is still open.
   - By introducing techniques from survival analysis, the authors obtain a method that applies to several censoring scenarios.
3. **Selecting an appropriate copula model**:
   - A copula describes the dependence between the components of multivariate data; Archimedean copulas in particular are widely used in many fields.
   - The authors propose a graphical selection procedure to choose the copula that best fits the data, and validate the choice with goodness-of-fit methods.
   - The procedure applies to complete data as well as to single-censored and double-censored bivariate data.
4. **Application to a real-world dataset**:
   - The authors apply the method to a recent Canadian automobile insurance dataset to model the dependence between the activation delays of correlated coverages.
   - The results show that the method performs well in selecting the best-fitting copula, especially when the dataset is large.

### Summary

The core contribution of this paper is a non-parametric estimation method for complex dependence in insurance data in the presence of censoring. With it, the authors aim to model dependence within multilevel claims more accurately and to give insurers better loss-prediction tools, with results that can feed into a larger claims reserving methodology.
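
To make the copula selection idea above concrete, here is a minimal sketch of the Genest and Rivest (1993) approach for complete (uncensored) bivariate data only. For an Archimedean copula with generator φ, the Kendall distribution satisfies K(v) = v − λ(v) with λ(v) = φ(v)/φ′(v), so the empirical curve λ_n(v) = v − K_n(v) can be compared with the parametric λ of each candidate family. Everything in the code is an illustrative assumption rather than the paper's implementation: the function names, the two candidate families (Clayton and Gumbel), the moment-type fit through Kendall's tau, and the squared-distance criterion standing in for a visual comparison. The censored extensions developed in the paper are not reproduced here.

```python
import math
import random


def kendall_tau(x, y):
    """Sample Kendall's tau (tau-a); assumes continuous data without ties."""
    n = len(x)
    s = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[i] - x[j]) * (y[i] - y[j])
            s += (prod > 0) - (prod < 0)  # +1 concordant, -1 discordant
    return 2.0 * s / (n * (n - 1))


def pseudo_observations(x, y):
    """V_i = #{j : x_j < x_i and y_j < y_i} / (n - 1)."""
    n = len(x)
    return [
        sum(1 for j in range(n) if x[j] < x[i] and y[j] < y[i]) / (n - 1)
        for i in range(n)
    ]


def kendall_ecdf(v_obs, v):
    """Empirical Kendall distribution K_n(v): share of V_i <= v."""
    return sum(1 for vi in v_obs if vi <= v) / len(v_obs)


def lambda_clayton(v, theta):
    """lambda(v) = phi(v) / phi'(v) for phi(t) = (t**-theta - 1) / theta."""
    return (v ** (theta + 1) - v) / theta


def lambda_gumbel(v, theta):
    """lambda(v) = phi(v) / phi'(v) for phi(t) = (-log t)**theta."""
    return v * math.log(v) / theta


def select_family(x, y, grid_size=19):
    """Pick the candidate whose lambda curve is closest to v - K_n(v)."""
    tau = kendall_tau(x, y)
    if not 0.0 < tau < 1.0:
        raise ValueError("this sketch only handles 0 < tau < 1")
    v_obs = pseudo_observations(x, y)
    grid = [(k + 1) / (grid_size + 1) for k in range(grid_size)]
    # Parameters obtained by inverting Kendall's tau for each family:
    # Clayton tau = theta / (theta + 2), Gumbel tau = 1 - 1 / theta.
    candidates = {
        "clayton": lambda v: lambda_clayton(v, 2.0 * tau / (1.0 - tau)),
        "gumbel": lambda v: lambda_gumbel(v, 1.0 / (1.0 - tau)),
    }
    distances = {
        name: sum((v - kendall_ecdf(v_obs, v) - lam(v)) ** 2 for v in grid)
        for name, lam in candidates.items()
    }
    return min(distances, key=distances.get), distances


def simulate_clayton(n, theta, seed=0):
    """Draw n pairs from a Clayton copula by conditional inversion."""
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n):
        u = 1.0 - rng.random()  # uniform on (0, 1], avoids u == 0
        w = 1.0 - rng.random()
        v = ((w ** (-theta / (1.0 + theta)) - 1.0) * u ** (-theta) + 1.0) ** (-1.0 / theta)
        xs.append(u)
        ys.append(v)
    return xs, ys


if __name__ == "__main__":
    xs, ys = simulate_clayton(300, theta=2.0, seed=42)
    best, distances = select_family(xs, ys)
    print(best, distances)
```

In a graphical version of this procedure, the empirical and parametric λ curves would be plotted against v and compared by eye; the distance dictionary returned by `select_family` is only a numerical stand-in for that comparison.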