Random forest methodology for model-based recursive partitioning: the mobForest package for R

BMC Bioinformatics. 2013 Apr 11;14:125. doi: 10.1186/1471-2105-14-125.

Abstract

Background: Recursive partitioning is a non-parametric modeling technique widely used in regression and classification problems. Model-based recursive partitioning identifies groups of observations that share similar values of the parameters of a model of interest. The mob() function in the party package for R implements the model-based recursive partitioning method. This method produces predictions based on a single tree model, and predictions obtained from single tree models are very sensitive to small changes in the learning sample. We extend the model-based recursive partitioning method to produce predictions based on multiple tree models, each constructed on a random sample drawn from the learning data either by bootstrapping (random sampling with replacement) or by subsampling (random sampling without replacement).
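The two sampling schemes named above can be illustrated with a minimal Python sketch. It is not taken from the mobForest package (which is written in R); the function names bootstrap_sample and subsample, and the subsampling fraction of 0.632, are assumptions chosen only for illustration.

```python
import random


def bootstrap_sample(n, rng):
    """Draw n row indices WITH replacement (bootstrapping).

    Some rows appear more than once and others not at all.
    """
    return [rng.randrange(n) for _ in range(n)]


def subsample(n, fraction, rng):
    """Draw round(fraction * n) distinct row indices (sampling WITHOUT replacement)."""
    size = max(1, round(fraction * n))
    return rng.sample(range(n), size)


rng = random.Random(0)   # fixed seed so the draws are reproducible
n = 10                   # size of a hypothetical learning sample

boot = bootstrap_sample(n, rng)      # same size as the data, duplicates allowed
sub = subsample(n, 0.632, rng)       # smaller than the data, no duplicates
```

In a forest, each tree would be grown on the rows selected by one such draw, so different trees see slightly different versions of the learning sample.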

Results: Here we present an R package called "mobForest" that implements bagging and random forest methodology for model-based recursive partitioning. The mobForest package constructs a large number of model-based trees and aggregates the predictions across these trees, resulting in more stable predictions. The package also includes functions for computing predictive accuracy estimates and for producing predictive accuracy plots, residual plots, and variable importance plots.
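The idea of aggregating predictions across many model-based trees can be sketched as follows, in Python rather than R and without using the mobForest API. This is a toy under stated assumptions: each "tree" is a single split on one partitioning variable z at the sample median, with a least-squares line y = a + b*x fitted on each side. The mob() function instead chooses its splits with parameter instability tests, and a random forest would typically also sample candidate partitioning variables at each node; neither is attempted here. All function names and the simulated data are hypothetical.

```python
import random
from statistics import mean, median


def fit_line(xs, ys):
    """Least-squares fit of y = a + b*x; falls back to a flat line if x is constant."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx if sxx > 0 else 0.0
    return my - slope * mx, slope


def fit_stump(rows):
    """Toy one-split model-based tree: split on z at its median, fit a line per side."""
    cut = median(z for _, z, _ in rows)
    left = [(x, y) for x, z, y in rows if z <= cut]
    right = [(x, y) for x, z, y in rows if z > cut] or left  # guard: all z equal
    return cut, fit_line(*zip(*left)), fit_line(*zip(*right))


def predict_stump(stump, x, z):
    """Route the observation by z, then predict with that node's fitted line."""
    cut, left_model, right_model = stump
    intercept, slope = left_model if z <= cut else right_model
    return intercept + slope * x


def fit_forest(rows, n_trees, rng):
    """Grow each stump on its own bootstrap sample of the learning data."""
    n = len(rows)
    return [fit_stump([rows[rng.randrange(n)] for _ in range(n)])
            for _ in range(n_trees)]


def predict_forest(forest, x, z):
    """Aggregate by averaging the predictions of all trees."""
    return mean(predict_stump(stump, x, z) for stump in forest)


# Simulated data (an assumption for illustration): the slope of y on x
# changes sign depending on the partitioning variable z.
rng = random.Random(1)
rows = []
for _ in range(200):
    x = rng.uniform(0, 2)
    z = rng.random()
    true_slope = 2.0 if z <= 0.5 else -2.0
    rows.append((x, z, 1.0 + true_slope * x + rng.gauss(0, 0.1)))

forest = fit_forest(rows, n_trees=50, rng=rng)
low = predict_forest(forest, x=1.0, z=0.1)    # observation in the low-z regime
high = predict_forest(forest, x=1.0, z=0.9)   # observation in the high-z regime
```

Because every stump is fitted to a different bootstrap sample, the split points and node models vary from tree to tree, and averaging smooths out that variation. This is the sense in which aggregated predictions are more stable than those of any single tree.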

Conclusion: The mobForest package implements a random forest type approach for model-based recursive partitioning. The R package, along with its source code, is available at http://CRAN.R-project.org/package=mobForest.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Models, Statistical*
  • Software*
  • Statistics, Nonparametric