Format

Send to

Choose Destination
Genet Epidemiol. 2011 Apr;35(3):159-73. doi: 10.1002/gepi.20564. Epub 2011 Jan 31.

Phenotype harmonization and cross-study collaboration in GWAS consortia: the GENEVA experience.

Author information

1
Collaborative Health Studies Coordinating Center, Department of Biostatistics, University of Washington, Seattle, Washington 98115, USA. siirib@u.washington.edu

Abstract

Genome-wide association study (GWAS) consortia and collaborations formed to detect genetic loci for common phenotypes or investigate gene-environment (G*E) interactions are increasingly common. While these consortia effectively increase sample size, phenotype heterogeneity across studies represents a major obstacle that limits successful identification of these associations. Investigators are faced with the challenge of how to harmonize previously collected phenotype data obtained using different data collection instruments which cover topics in varying degrees of detail and over diverse time frames. This process has not been described in detail. We describe here some of the strategies and pitfalls associated with combining phenotype data from varying studies. Using the Gene Environment Association Studies (GENEVA) multi-site GWAS consortium as an example, this paper provides an illustration to guide GWAS consortia through the process of phenotype harmonization and describes key issues that arise when sharing data across disparate studies. GENEVA is unusual in the diversity of disease endpoints and so the issues it faces as its participating studies share data will be informative for many collaborations. Phenotype harmonization requires identifying common phenotypes, determining the feasibility of cross-study analysis for each, preparing common definitions, and applying appropriate algorithms. Other issues to be considered include genotyping timeframes, coordination of parallel efforts by other collaborative groups, analytic approaches, and imputation of genotype data. GENEVA's harmonization efforts and policy of promoting data sharing and collaboration, not only within GENEVA but also with outside collaborations, can provide important guidance to ongoing and new consortia.

PMID:
21284036
PMCID:
PMC3055921
DOI:
10.1002/gepi.20564
[Indexed for MEDLINE]
Free PMC Article

Publication types, MeSH terms, Grant support

Publication types

MeSH terms

Grant support

Supplemental Content

Full text links

Icon for PubMed Central
Loading ...
Support Center