Biometrics. 2018 Dec;74(4):1311-1319. doi: 10.1111/biom.12888. Epub 2018 May 11.

Order selection and sparsity in latent variable models via the ordered factor LASSO.

Author information

1 Mathematical Sciences Institute, The Australian National University, Acton, ACT 2601, Australia.
2 School of Mathematics and Statistics, University of Sydney, NSW 2006, Australia.
3 School of Mathematics and Statistics, and the Evolution & Ecology Research Centre, UNSW Sydney, NSW 2052, Australia.

Abstract

Generalized linear latent variable models (GLLVMs) offer a general framework for flexibly analyzing data involving multiple responses. When fitting such models, two of the major challenges are selecting the order, that is, the number of factors, and an appropriate structure for the loading matrix, typically a sparse structure. Motivated by the application of GLLVMs to study marine species assemblages in the Southern Ocean, we propose the Ordered Factor LASSO or OFAL penalty for order selection and achieving sparsity in GLLVMs. The OFAL penalty is the first penalty developed specifically for order selection in latent variable models, and achieves this by using a hierarchically structured group LASSO type penalty to shrink entire columns of the loading matrix to zero, while ensuring that non-zero loadings are concentrated on the lower-order factors. Simultaneously, individual element sparsity is achieved through the use of an adaptive LASSO. In conjunction with using an information criterion which promotes aggressive shrinkage, simulation shows that the OFAL penalty performs strongly compared with standard methods and penalties for order selection, achieving sparsity, and prediction in GLLVMs. Applying the OFAL penalty to the Southern Ocean marine species dataset suggests the available environmental predictors explain roughly half of the total covariation between species, thus leading to a smaller number of latent variables and increased sparsity in the loading matrix compared to a model without any covariates.
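
The structure of the penalty described above can be illustrated with a small numerical sketch. The code below is not the authors' implementation, and the exact functional form, the weights, and the names `ofal_style_penalty`, `estimated_order`, `col_weights`, and `elem_weights` are assumptions made for illustration. It combines a nested group term, in which group k contains columns k through d of the loading matrix so that higher-order columns are penalized in more groups, with an element-wise weighted L1 term in the spirit of the adaptive LASSO.

```python
import numpy as np


def ofal_style_penalty(loadings, s, col_weights=None, elem_weights=None):
    """Illustrative OFAL-style penalty (assumed form, not the paper's code).

    loadings     : (p, d) loading matrix, one column per latent variable
    s            : overall tuning parameter (non-negative)
    col_weights  : length-d weights for the nested column groups (assumed)
    elem_weights : (p, d) adaptive weights for individual loadings (assumed)
    """
    loadings = np.asarray(loadings, dtype=float)
    p, d = loadings.shape
    if col_weights is None:
        col_weights = np.ones(d)
    if elem_weights is None:
        elem_weights = np.ones((p, d))

    # Squared Euclidean norm of each column.
    sq_col = (loadings ** 2).sum(axis=0)
    # tail[k] = total squared norm of columns k, k+1, ..., d-1 (nested groups).
    tail = np.cumsum(sq_col[::-1])[::-1]
    # Group term: shrinks whole trailing blocks of columns towards zero.
    group_term = np.sum(col_weights * np.sqrt(tail))
    # Element term: weighted L1 penalty on individual loadings.
    elem_term = np.sum(elem_weights * np.abs(loadings))
    return s * (group_term + elem_term)


def estimated_order(loadings, tol=1e-8):
    """Index (1-based) of the last column with any non-zero loading."""
    loadings = np.asarray(loadings, dtype=float)
    nonzero_cols = np.flatnonzero(np.abs(loadings).max(axis=0) > tol)
    return 0 if nonzero_cols.size == 0 else int(nonzero_cols[-1]) + 1


# Same loadings placed on the first factor versus the second factor.
low_order = np.array([[3.0, 0.0], [4.0, 0.0]])
high_order = np.array([[0.0, 3.0], [0.0, 4.0]])
print(ofal_style_penalty(low_order, s=1.0))   # 12.0
print(ofal_style_penalty(high_order, s=1.0))  # 17.0
print(estimated_order(low_order))             # 1
```

Under these assumptions, identical loadings cost more when they sit on a higher-order factor, which is how a nested group penalty of this kind pushes non-zero loadings onto the lower-order factors and lets the order be read off from the last non-zero column.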

KEYWORDS: Dimension reduction; Factor analysis; Generalized linear latent variable models; LASSO; Loadings; Penalized likelihood; Regularization

PMID: 29750847
DOI: 10.1111/biom.12888
[Indexed for MEDLINE]
