Abstract:With insurers benefiting from ever-larger amounts of data of increasing complexity, we explore a data-driven method to model dependence within multilevel claims in this paper. More specifically, we start from a non-parametric estimator for Archimedean copula generators introduced by Genest and Rivest (1993), and we extend it to diverse flexible censoring scenarios using techniques derived from survival analysis. We implement a graphical selection procedure for copulas that we validate using goodness-of-fit methods applied to complete, single-censored, and double-censored bivariate data. We illustrate the performance of our model with multiple simulation studies. We then apply our methodology to a recent Canadian automobile insurance dataset where we seek to model the dependence between the activation delays of correlated coverages. We show that our model performs quite well in selecting the best-fitted copula for the data at hand, especially when the dataset is large, and that the results can then be used as part of a larger claims reserving methodology.

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to solve the problems of complex dependencies and censored data in insurance data. Specifically, the author proposes a non - parametric estimation method to model the dependencies in multi - level claims and extends it especially for flexible censoring scenarios. The following are the main problems that this paper attempts to solve: 1. **Handling complex data structures**: - As insurance companies are able to collect more and more complex data, traditional parametric methods may not be able to fully capture the dependencies in these data. - The author hopes to use non - parametric estimation methods to better handle this complexity, especially in the presence of large amounts of data. 2. **Dealing with censored data**: - In insurance data, censored data is a common problem, that is, some of the observed values may be incomplete or truncated. - For example, in some cases, the claim amount may be censored because of reinsurance treaties, or the payment may stop after a given payment period, even if the claim is still open. - By introducing techniques from survival analysis, the author proposes a method applicable to multiple censoring scenarios. 3. **Selecting an appropriate Copula model**: - Copula is a tool for describing the dependencies between multivariate data, especially the Archimedean Copula has been widely used in many fields. - The author proposes a graphical selection procedure to select the Copula model that best fits the data and uses the goodness - of - fit method for verification. - This process is applicable not only to complete data, but also to singly and doubly censored binary data. 4. **Application to real - world data sets**: - The author applies this method to a recent Canadian auto insurance data set to model the activation - delay dependencies between different insurance coverages. - The results show that this method performs well in selecting the best - fitting Copula, especially when the amount of data is large. ### Summary The core problem of this paper is to develop a non - parametric estimation method to handle the problems of complex dependencies and censored data in insurance data. Through this method, the author hopes to more accurately model the dependencies in multi - level claims and provide insurance companies with better loss - prediction tools.

A non-parametric estimator for Archimedean copulas under flexible censoring scenarios and an application to claims reserving

Parametric estimation of conditional Archimedean copula generators for censored data

Rank-based inference tools for copula regression, with property and casualty insurance applications

Nonparametric estimation of multivariate copula using empirical bayes method

Portfolio credit risk with Archimedean copulas: asymptotic analysis and efficient simulation

Regression for copula-linked compound distributions with applications in modeling aggregate insurance claims

Survival Estimation for Missing not at Random Censoring Indicators based on Copula Models

Dependence Modeling of Frequency-Severity of Insurance Claims Using Waiting Time

Nonparametric Estimation of Conditional Copula Using Smoothed Checkerboard Bernstein Sieves

Copula Sensitivity Analysis for Portfolio Credit Derivatives

Copula-based Semiparametric Nonnormal Transformed Linear Model for Survival Data with Dependent Censoring

Estimates of Marginal Survival for Dependent Competing Risks Based on an Assumed Copula

Adaptive Bernstein Copulas and Risk Management

Estimation and Model Selection of Semiparametric Multivariate Survival Functions under General Censorship

Efficiency gains in value-at-risk and expected shortfall estimation by using copulas and full maximum likelihood

Measuring the Dependence between Non-Gaussian Financial Assets Using Copulae: Risk Management, Option Pricing and Default Risk

Portfolio Risk Assessment using Copula Models

Generating unfavourable VaR scenarios with patchwork copulas

A Copula-Based family of Bivariate Composite Models for Claim Severity Modelling

Dependence Modelling of Frequency-Severity of Insurance Claims Using Waiting Time for Claim

Nonparametric Estimation of Copula Functions for Dependence Modelling