Abstract:Nonparametric estimators for the mean and the covariance functions of functional data are proposed. The setup covers a wide range of practical situations. The random trajectories are, not necessarily differentiable, have unknown regularity, and are measured with error at discrete design points. The measurement error could be heteroscedastic. The design points could be either randomly drawn or common for all curves. The estimators depend on the local regularity of the stochastic process generating the functional data. We consider a simple estimator of this local regularity which exploits the replication and regularization features of functional data. Next, we use the ``smoothing first, then estimate'' approach for the mean and the covariance functions. They can be applied with both sparsely or densely sampled curves, are easy to calculate and to update, and perform well in simulations. Simulations built upon an example of real data set, illustrate the effectiveness of the new approach.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively estimate the mean function and covariance function when dealing with functional data. Specifically, the paper focuses on how to perform non - parametric estimation in the following situations:
1. **Random trajectories may be non - differentiable**: These trajectories have unknown regularity and are measured with errors at discrete design points.
2. **Measurement errors may be heteroscedastic**: That is, the error variances at different observation points may be different.
3. **Design points can be randomly sampled or common to all curves**: This means that the selection methods of observation points can be different.
4. **Data can be sparsely or densely sampled**: That is, the number of observation points for each curve can be very small or very large.
To address the above challenges, the paper proposes a method based on "smoothing first, then estimate", which relies on the estimation of local regularity. Specifically, the main contributions of the paper include:
- **Local regularity estimation**: A simple method is proposed to estimate the local regularity of the random process that generates functional data. This method utilizes the replication and regularization characteristics of functional data.
- **Estimation of mean and covariance functions**: Use the local regularity estimation to adjust the smoothing parameters, thereby obtaining the optimal estimates of the mean function and covariance function.
- **Applicable to different design situations**: This method is applicable to both independent design (the observation time points of each sample are random) and common design (the observation time points of all samples are the same).
- **No need for complex numerical optimization**: The entire process does not require complex numerical optimization steps, and is simple to calculate and easy to update.
Through these methods, the paper aims to provide a general framework for effectively estimating the mean and covariance functions under different conditions. This is of great significance in practical applications, especially in fields such as energy, chemistry, physics, astronomy, and medicine, where the regularity of functional data may be very complex and unknown.