Display options
Share it on

Int J Biostat. 2016 May 01;12(1):179-201. doi: 10.1515/ijb-2014-0060.

Optimal Spatial Prediction Using Ensemble Machine Learning.

The international journal of biostatistics

Molly Margaret Davies, Mark J van der Laan

PMID: 27130244 DOI: 10.1515/ijb-2014-0060

Abstract

Spatial prediction is an important problem in many scientific disciplines. Super Learner is an ensemble prediction approach related to stacked generalization that uses cross-validation to search for the optimal predictor amongst all convex combinations of a heterogeneous candidate set. It has been applied to non-spatial data, where theoretical results demonstrate it will perform asymptotically at least as well as the best candidate under consideration. We review these optimality properties and discuss the assumptions required in order for them to hold for spatial prediction problems. We present results of a simulation study confirming Super Learner works well in practice under a variety of sample sizes, sampling designs, and data-generating functions. We also apply Super Learner to a real world dataset.

MeSH terms

Publication Types