Selection of latent variables for multiple mixed-outcome models

Scand Stat Theory Appl. 2014 Dec;41(4):1064-1082. doi: 10.1111/sjos.12084. Epub 2014 Apr 2.

Abstract

Latent variable models have been widely used for modeling the dependence structure of multiple outcomes data. However, the formulation of a latent variable model is often unknown a priori, the misspecification will distort the dependence structure and lead to unreliable model inference. Moreover, multiple outcomes with varying types present enormous analytical challenges. In this paper, we present a class of general latent variable models that can accommodate mixed types of outcomes. We propose a novel selection approach that simultaneously selects latent variables and estimates parameters. We show that the proposed estimator is consistent, asymptotically normal and has the oracle property. The practical utility of the methods is confirmed via simulations as well as an application to the analysis of the World Values Survey, a global research project that explores peoples' values and beliefs and the social and personal characteristics that might influence them.

Keywords: SCAD penalty; dependence structure; latent variables model; oracle property; selection of latent variables.