Abstract:Missing data often result in undesirable bias and loss of efficiency. These become substantial problems when the response mechanism is nonignorable, such that the response model depends on unobserved variables. It is necessary to estimate the joint distribution of unobserved variables and response indicators to manage nonignorable nonresponse. However, model misspecification and identification issues prevent robust estimates despite careful estimation of the target joint distribution. In this study, we modelled the distribution of the observed parts and derived sufficient conditions for model identifiability, assuming a logistic regression model as the response mechanism and generalised linear models as the main outcome model of interest. More importantly, the derived sufficient conditions are testable with the observed data and do not require any instrumental variables, which are often assumed to guarantee model identifiability but cannot be practically determined beforehand. To analyse missing data, we propose a new imputation method which incorporates verifiable identifiability using only observed data. Furthermore, we present the performance of the proposed estimators in numerical studies and apply the proposed method to two sets of real data: exit polls for the 19th South Korean election data and public data collected from the Korean Survey of Household Finances and Living Conditions.

Identification Problem for The Analysis of Binary Data with Non-ignorable Missing

Identification enhanced generalised linear model estimation with nonignorable missing outcomes

Identifiability of Normal and Normal Mixture Models With Nonignorable Missing Data

Identifiability and Estimation of Two-Sample Data with Nonignorable Missing Response

Using auxiliary data for binomial parameter estimation with nonignorable nonresponse

Diagnosing missing always at random in multivariate data

Identifiable Generative Models for Missing Not at Random Data Imputation

Nonstandard Conditionally Specified Models for Nonignorable Missing Data.

Identification And Inference With Nonignorable Missing Covariate Data

Handling Nonmonotone Missing Data with Available Complete-Case Missing Value Assumption

Statistical Inference with Different Missing-data Mechanisms

Identification, Doubly Robust Estimation, and Semiparametric Efficiency Theory of Nonignorable Missing Data With a Shadow Variable

Identification of Graphical Models for Nonignorable Nonresponse in Terms of Information on Odds Ratios

Conditions for Ignoring the Missing-Data Mechanism in Likelihood Inferences for Parameter Subsets

Non-standard conditionally specified models for non-ignorable missing data

A Note on the Robustness of a Full Bayesian Method for Nonignorable Missing Data Analysis

New possibilities in identification of binary choice models with fixed effects

Analysis of Longitudinal Data under Nonignorable Nonmonotone Nonresponse

Simulation-based Sensitivity Analysis for Non-ignorable Missing Data

A Bayesian hybrid method for the analysis of generalized linear models with missing-not-at-random covariates