Format

Send to

Choose Destination
Biometrics. 2015 Dec;71(4):905-17. doi: 10.1111/biom.12355. Epub 2015 Aug 21.

Local-aggregate modeling for big data via distributed optimization: Applications to neuroimaging.

Author information

1
Department of Statistics, Rice University.
2
Dobelman Family Junior Chair Departments of Statistics and Electrical & Computer Engineering, Rice University, Department of Pediatrics-Neurology, Baylor College of Medicine, Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital.

Abstract

Technological advances have led to a proliferation of structured big data that have matrix-valued covariates. We are specifically motivated to build predictive models for multi-subject neuroimaging data based on each subject's brain imaging scans. This is an ultra-high-dimensional problem that consists of a matrix of covariates (brain locations by time points) for each subject; few methods currently exist to fit supervised models directly to this tensor data. We propose a novel modeling and algorithmic strategy to apply generalized linear models (GLMs) to this massive tensor data in which one set of variables is associated with locations. Our method begins by fitting GLMs to each location separately, and then builds an ensemble by blending information across locations through regularization with what we term an aggregating penalty. Our so called, Local-Aggregate Model, can be fit in a completely distributed manner over the locations using an Alternating Direction Method of Multipliers (ADMM) strategy, and thus greatly reduces the computational burden. Furthermore, we propose to select the appropriate model through a novel sequence of faster algorithmic solutions that is similar to regularization paths. We will demonstrate both the computational and predictive modeling advantages of our methods via simulations and an EEG classification problem.

KEYWORDS:

ADMM; Big data; Ensemble learning; Generalized linear models; Multi-subject neuroimaging; Parallel computing; Regularization paths

PMID:
26295449
DOI:
10.1111/biom.12355
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Wiley
Loading ...
Support Center