Genetic analysis of biological pathway data through genomic randomization

Hum Genet. 2011 May;129(5):563-71. doi: 10.1007/s00439-011-0956-2. Epub 2011 Jan 30.

Abstract

Genome-wide association studies (GWAS) are a standard approach for characterizing common variation at large scale and for identifying single loci that predispose to disease. However, because of moderate sample sizes and, in particular, multiple-testing correction, many variants of smaller effect size are not detected within a single-allele analysis framework. Thus, small main effects and potential epistatic effects are not consistently observed in GWAS using standard analytical approaches that consider only single SNP alleles. Here, we propose a unique methodology that uses GWAS results to aggregate variants of interest (for example, genes in a biological pathway). Multiple-testing and type I error concerns are minimized by using empirical genomic randomization to estimate significance. Randomization corrects for common biases in pathway-based analysis, such as SNP coverage and density, linkage disequilibrium, gene size and pathway size. Pathway Analysis by Randomization Incorporating Structure (PARIS) applies this randomization and in doing so directly accounts for linkage disequilibrium effects. PARIS is independent of the association analysis method and is thus applicable to GWAS datasets of all study designs. Using the KEGG database as an example, we apply PARIS to the publicly available Autism Genetic Resource Exchange GWAS dataset, revealing pathways with a significant enrichment of positive association results.
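The idea of estimating pathway significance by empirical genomic randomization can be illustrated with a minimal sketch. This is not the PARIS implementation. It assumes, for illustration only, that the genome has already been divided into features (for example, linkage disequilibrium blocks), that each feature carries a size bin and a flag for whether it contains at least one nominally significant SNP, and that a random pathway is built by drawing, for each real pathway feature, a genome feature from the same size bin. The function name `empirical_pathway_pvalue` and the `features` layout are hypothetical.

```python
import random
from collections import defaultdict


def empirical_pathway_pvalue(pathway, features, n_perm=1000, seed=0):
    """Estimate an empirical p-value for a pathway by size-matched randomization.

    pathway  : iterable of feature ids belonging to the pathway
    features : dict mapping feature id -> (size_bin, is_significant)
               (hypothetical layout, assumed for this sketch)
    Returns (observed_count, empirical_p).
    """
    rng = random.Random(seed)

    # Group the significance flags of all genome features by size bin,
    # so random draws can be matched on feature size.
    flags_by_bin = defaultdict(list)
    for size_bin, is_significant in features.values():
        flags_by_bin[size_bin].append(is_significant)

    # Observed statistic: number of significant features in the pathway.
    pathway = list(pathway)
    observed = sum(features[fid][1] for fid in pathway)
    pathway_bins = [features[fid][0] for fid in pathway]

    # Null distribution: random "pathways" with the same number of
    # features and the same size-bin composition as the real one.
    at_least_as_extreme = 0
    for _ in range(n_perm):
        null_count = sum(rng.choice(flags_by_bin[b]) for b in pathway_bins)
        if null_count >= observed:
            at_least_as_extreme += 1

    # Add-one correction keeps the estimate strictly above zero.
    return observed, (at_least_as_extreme + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Toy genome: 1000 features in one size bin; only the 10 pathway
    # features are flagged as significant.
    toy = {i: (1, i < 10) for i in range(1000)}
    print(empirical_pathway_pvalue(range(10), toy))
```

Because each random pathway matches the real one in feature count and size composition, a large pathway or one made of large, SNP-dense features is compared only against equally large random collections, which is the kind of bias correction the abstract describes. Only the summary flags are consumed, so the sketch does not depend on how the underlying association p-values were produced.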

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Autistic Disorder / genetics
  • Genome-Wide Association Study / statistics & numerical data*
  • Humans
  • Linkage Disequilibrium
  • Metabolic Networks and Pathways / genetics*
  • Polymorphism, Single Nucleotide
  • Random Allocation