Abstract:The Receiver Operating Characteristic (ROC) curve is a crucial method for evaluating the effectiveness of diagnostic medical indicators and has found extensive applications. However, errors are inevitable in the data acquisition process. Therefore, discussions on error and various methods for improving and handling data have not only become the focus of academic discourse but also hold practical significance. Unlike general statistics, the diversity of error situations, ranges, and impacts in biostatistics often present unique challenges. In practical scenarios, such as drug experiments, limited sample sizes and variations in individual responses to the same drug necessitate the use of error models, data scales, and statistical processing based on historical data, biomedical knowledge, and experimental data. Furthermore, the choice of an appropriate method depends on the specific objectives of the experiment, which is essential for producing compelling conclusions. Importantly, the field of biology has introduced methods to address errors, such as cross-comparison experiments or repeated experiments, and data processing must adapt to changes in experimental designs. This paper presents a statistical approach based on the widely used practice of error reduction through repeated experiments in the context of assessing generic drug consistency. The paper first summarizes the common types of errors encountered in biostatistics and the corresponding analytical, control, and optimization measures. It explores several methods for calculating the Area Under the ROC Curve (AUC) when sampling error is introduced and applies error reduction through repeated experiments. Subsequently, the paper validates the methods under different error scenarios using simulated data, highlighting the suitability of different statistical models and their reasons for selection in cases where the difference between healthy and diseased populations is not substantial. This paper offers valuable insights into handling various types of real-world data to eliminate errors and obtain more accurate statistical conclusions.

Impact of methodological assumptions and covariates on the cutoff estimation in ROC analysis

A robust approach for ROC curves with covariates

Significance Tests for Covariates in the Diagnostic Accuracy Index of a Biomarker Against a Continuous Gold Standard.

Bayesian nonparametric inference for the covariate-adjusted ROC curve

ROC Curve Estimation under Test-Result-dependent Sampling.

A placement-value based approach to concave ROC analysis

ROCnReg: An R Package for Receiver Operating Characteristic Curve Inference with and without Covariate Information

Covariate Adjustment in Continuous Biomarker Assessment

Robust and flexible inference for the covariate-specific ROC curve

Nonparametric Covariate Adjustment for Receiver Operating Characteristic Curves

Defining an Optimal Cut-Point Value in ROC Analysis: An Alternative Approach

A Marginal Model Approach for Analysis of Multi-Reader Multi-Test Receiver Operating Characteristic (ROC) Data

Covariate-specific Evaluation of Continuous Biomarker

Estimation of ROC Curve with Multiple Types of Missing Gold Standard

Methods of determining optimal cut-point of diagnostic biomarkers with application of clinical data in ROC analysis: an update review

A discrete time-to-event model for the meta-analysis of full ROC curves

Comparison of methods for calculating confidence intervals of AUC in ROC curve considering sampling error

Nonparametric receiver operating characteristic curve analysis with an imperfect gold standard

ROC curve analysis: a useful statistic multi-tool in the research of nephrology

Time-dependent ROC curve analysis in medical research: current methods and applications

Decision Curve Analysis: a Technical Note