Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models

doi:10.1111/J.1467-9868.2010.00749.X

Simon N. Wood

01 Jan 2011, Journal of The Royal Statistical Society..., Vol. 73, Iss: 1, pp 3-36

TLDR

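A Laplace approximation is used to obtain an approximate restricted maximum likelihood (REML) or marginal likelihood (ML) criterion for any generalized linear model (GLM). The criterion is suitable for efficient direct optimization and is used for smoothing parameter selection in semiparametric regression.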

Abstract:

Summary. Recent work by Reiss and Ogden provides a theoretical basis for sometimes preferring restricted maximum likelihood (REML) to generalized cross-validation (GCV) for smoothing parameter selection in semiparametric regression. However, existing REML or marginal likelihood (ML) based methods for semiparametric generalized linear models (GLMs) use iterative REML or ML estimation of the smoothing parameters of working linear approximations to the GLM. Such indirect schemes need not converge and fail to do so in a non-negligible proportion of practical analyses. By contrast, very reliable prediction error criteria smoothing parameter selection methods are available, based on direct optimization of GCV, or related criteria, for the GLM itself. Since such methods directly optimize properly defined functions of the smoothing parameters, they have much more reliable convergence properties. The paper develops the first such method for REML or ML estimation of smoothing parameters. A Laplace approximation is used to obtain an approximate REML or ML for any GLM, which is suitable for efficient direct optimization. This REML or ML criterion requires that Newton–Raphson iteration, rather than Fisher scoring, be used for GLM fitting, and a computationally stable approach to this is proposed. The REML or ML criterion itself is optimized by a Newton method, with the derivatives required obtained by a mixture of implicit differentiation and direct methods. The method will cope with numerical rank deficiency in the fitted model and in fact provides a slight improvement in numerical robustness on the earlier method of Wood for prediction error criteria based smoothness selection. Simulation results suggest that the new REML and ML methods offer some improvement in mean-square error performance relative to GCV or Akaike's information criterion in most cases, without the small number of severe undersmoothing failures to which Akaike's information criterion and GCV are prone. This is achieved at the same computational cost as GCV or Akaike's information criterion. The new approach also eliminates the convergence failures of previous REML- or ML-based approaches for penalized GLMs and usually has lower computational cost than these alternatives. Example applications are presented in adaptive smoothing, scalar on function regression and generalized additive model selection.
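The idea of directly optimizing a Laplace-approximated criterion can be illustrated with a small sketch. It is not the paper's algorithm: it assumes a Poisson model with log link, a piecewise-linear basis with a second-difference penalty, and a plain grid search over a single smoothing parameter in place of the Newton optimization with implicit differentiation that the abstract describes. All function names, the simulated data and the basis are hypothetical choices made for illustration. The criterion coded here is the usual Laplace form, log-likelihood minus penalty, plus half the log generalized determinant of the penalty, minus half the log determinant of the penalized Hessian, plus a constant for the penalty null space. For the canonical log link, Newton–Raphson and Fisher scoring coincide, so the distinction drawn in the abstract only matters for non-canonical links.

```python
import math
import numpy as np

def hat_basis(x, knots):
    """Piecewise-linear 'hat' basis on equally spaced knots (illustrative choice)."""
    h = knots[1] - knots[0]
    return np.clip(1.0 - np.abs(x[:, None] - knots[None, :]) / h, 0.0, None)

def pen_loglik(beta, X, y, S, lam):
    """Penalized Poisson log-likelihood, omitting the constant -sum(log y!)."""
    eta = X @ beta
    return np.sum(y * eta - np.exp(eta)) - 0.5 * lam * beta @ S @ beta

def fit_newton(X, y, S, lam, max_iter=100, tol=1e-9):
    """Newton-Raphson with step halving for the penalized Poisson model."""
    # The hat functions sum to one, so this start gives a constant predictor.
    beta = np.full(X.shape[1], math.log(y.mean()))
    for _ in range(max_iter):
        mu = np.exp(X @ beta)
        grad = X.T @ (y - mu) - lam * (S @ beta)
        hess = X.T @ (mu[:, None] * X) + lam * S  # negative Hessian of pen_loglik
        step = np.linalg.solve(hess, grad)
        if np.max(np.abs(step)) < tol:
            beta = beta + step
            break
        old = pen_loglik(beta, X, y, S, lam)
        slack = 1e-9 * (1.0 + abs(old))  # tolerate rounding noise near the optimum
        t = 1.0
        while pen_loglik(beta + t * step, X, y, S, lam) < old - slack and t > 1e-8:
            t *= 0.5  # halve the step until the objective stops decreasing
        beta = beta + t * step
    return beta

def laml(X, y, S, lam):
    """Laplace approximate marginal likelihood (scale parameter fixed at 1)."""
    beta = fit_newton(X, y, S, lam)
    eta = X @ beta
    mu = np.exp(eta)
    loglik = np.sum(y * eta - mu) - sum(math.lgamma(v + 1.0) for v in y)
    hess = X.T @ (mu[:, None] * X) + lam * S
    ev = np.linalg.eigvalsh(S)
    pos = ev[ev > 1e-8 * ev.max()]       # nonzero eigenvalues of the penalty
    null_dim = S.shape[0] - pos.size     # dimension of the penalty null space
    logdet_S = pos.size * math.log(lam) + np.sum(np.log(pos))
    logdet_H = np.linalg.slogdet(hess)[1]
    value = (loglik - 0.5 * lam * beta @ S @ beta + 0.5 * logdet_S
             - 0.5 * logdet_H + 0.5 * null_dim * math.log(2.0 * math.pi))
    return value, beta

# Simulated data (assumed for illustration only).
rng = np.random.default_rng(0)
n, K = 200, 10
x = np.sort(rng.uniform(0.0, 1.0, n))
y = rng.poisson(np.exp(1.0 + np.sin(2.0 * np.pi * x)))
X = hat_basis(x, np.linspace(0.0, 1.0, K))
D = np.diff(np.eye(K), n=2, axis=0)      # second-difference operator
S = D.T @ D

# Grid search over log(lambda); the paper uses a Newton method instead.
log_lams = np.linspace(-4.0, 10.0, 29)
scores = np.array([laml(X, y, S, math.exp(r))[0] for r in log_lams])
best_lam = math.exp(log_lams[np.argmax(scores)])
_, beta_hat = laml(X, y, S, best_lam)
print(f"selected lambda = {best_lam:.3g}")
```

Because the criterion is a properly defined function of the smoothing parameter, any standard optimizer could replace the grid; the grid is used here only to keep the sketch short and dependency-free.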
