Three Contributions to Statistical Computing

Donald B. Rubin, Yaming Yu
2005-01-01
Abstract: This thesis contains three papers (which may be read independently) on statistical computing. Paper 1 deals with improving the expectation-maximization (EM) algorithm using vector sequence transformations (VSTs). VSTs, such as reduced rank extrapolation and minimal polynomial extrapolation, are popular in numerical analysis but almost unknown to statisticians. For a vector sequence {s_n, n ≥ 1} that converges linearly to s_∞, these methods achieve increased speed of convergence by making a guess at s_∞ based on several consecutive values of the sequence. These accelerators are easy to implement, take little computer time, and often result in considerably improved speed, although they do not automatically preserve the monotone increase in the likelihood function, which is a great advantage of EM. When fitting hierarchical models such as generalized linear mixed models (GLMMs) using EM or the data augmentation (DA) algorithm, the computing efficiency may depend crucially on the choice of the augmentation scheme. The efficient data augmentation idea of Meng and van Dyk (1997, 1999, 2001) chooses a DA scheme that is both quick to converge and easy to implement. Papers 2 and 3 both deal with finding efficient DA schemes. Paper 2 first studies the convergence rates of EM under two special augmentation schemes, the sufficient augmentation (SA) and the ancillary augmentation (AA), and then proceeds to design, using results on EM, certain optimal conditional augmentation (OCA) schemes, to speed up the DA algorithm. On the theoretical side, we derive general formulas for the OCA, which are often easily obtained or approximated; in the empirical comparisons, we extend the use of efficient DA to complicated situations such as GLMMs, and show that OCA results in considerable gain in efficiency. Paper 3 proposes a combined augmentation approach, which utilizes simultaneously the SA and the AA.
Using a Poisson time series model as a realistic example, we demonstrate that, by utilizing both sufficient and ancillary augmentation schemes, considerable computational efficiency is gained with limited extra human effort, and the improved algorithm is fast for various data configurations. In the theoretical study that complements the empirical investigations, we show that, in addition to being robust, under certain conditions this combined approach is also optimal among a broad class of DA schemes.
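The extrapolation idea behind Paper 1 can be illustrated with a minimal sketch of reduced rank extrapolation. It is not the thesis's implementation: the function names `solve_linear` and `rre_extrapolate`, the normal-equations solver, and the toy linear iteration (a stand-in for an EM update, with fixed point (12, 2)) are all assumptions made for illustration. Given k+2 consecutive iterates, the sketch guesses the limit as s_0 plus a combination of first differences, with weights chosen to minimize the norm of the implied residual.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def dot(p, q):
    return sum(x * y for x, y in zip(p, q))


def rre_extrapolate(seq):
    """Guess the limit from k+2 consecutive iterates s_0, ..., s_{k+1}.

    Minimizes ||u_0 + sum_i xi_i * w_i|| over xi, where u_i are first
    differences and w_i second differences, then returns
    s_0 + sum_i xi_i * u_i.
    """
    k = len(seq) - 2
    dim = len(seq[0])
    u = [[seq[i + 1][j] - seq[i][j] for j in range(dim)] for i in range(k + 1)]
    w = [[u[i + 1][j] - u[i][j] for j in range(dim)] for i in range(k)]
    # Normal equations of the least-squares problem (fine for small k;
    # a QR-based solver would be more stable in general).
    gram = [[dot(w[i], w[j]) for j in range(k)] for i in range(k)]
    rhs = [-dot(w[i], u[0]) for i in range(k)]
    xi = solve_linear(gram, rhs)
    return [seq[0][j] + sum(xi[i] * u[i][j] for i in range(k)) for j in range(dim)]


def toy_update(s):
    """Hypothetical linearly convergent map standing in for an EM step."""
    return [0.9 * s[0] + 0.1 * s[1] + 1.0, 0.5 * s[1] + 1.0]


# Four consecutive iterates (k = 2) starting from the origin.
seq = [[0.0, 0.0]]
for _ in range(3):
    seq.append(toy_update(seq[-1]))

estimate = rre_extrapolate(seq)
print("last plain iterate:", seq[-1])
print("extrapolated guess:", estimate)
```

Because the toy map is exactly linear in two dimensions, the extrapolated guess recovers the fixed point up to rounding error, while the last plain iterate is still far from it. For a genuine EM sequence the map is only approximately linear near the limit, so the guess would be an approximation, and, as the abstract notes, the likelihood is not guaranteed to increase at the extrapolated point.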