Send to

Choose Destination
See comment in PubMed Commons below
Stat Med. 2004 Sep 30;23(18):2911-33.

Regression analysis of multiple source and multiple informant data from complex survey samples.

Author information

  • 1Department of Mathematics, Smith College, College Lane, Northampton, MA 01063, USA.


In this tutorial, we describe regression-based methods for analysing multiple source data arising from complex sample survey designs. We use the term 'multiple-source' data to encompass all cases where data are simultaneously obtained from multiple informants, or raters (e.g. self-reports, family members, health care providers, administrators) or via different/parallel instruments, indicators or methods (e.g. symptom rating scales, standardized diagnostic interviews, or clinical diagnoses). We review regression models for analysing multiple source risk factors or multiple source outcomes and show that they can be considered special cases of generalized linear models, albeit with correlated outcomes. We show how these methods can be extended to handle the common survey features of stratification, clustering, and sampling weights. We describe how to fit regression models with multiple source reports derived from complex sample surveys using general purpose statistical software. Finally, the methods are illustrated using data from two studies: the Stirling County Study and the Eastern Connecticut Child Survey.

Copyright 2004 John Wiley & Sons, Ltd.

[PubMed - indexed for MEDLINE]
PubMed Commons home

PubMed Commons

How to join PubMed Commons

    Supplemental Content

    Loading ...
    Write to the Help Desk