Format

Send to

Choose Destination
Stat Biosci. 2017 Jun;9(1):73-90. doi: 10.1007/s12561-016-9156-x. Epub 2016 Jul 8.

Improving Hierarchical Models Using Historical Data with Applications in High-Throughput Genomics Data Analysis.

Li B1, Li Y1, Qin ZS1,2.

Author information

1
Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA.
2
Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA 30322, USA.

Abstract

Modern high-throughput biotechnologies such as microarray and next generation sequencing produce a massive amount of information for each sample assayed. However, in a typical high-throughput experiment, only limited amount of data are observed for each individual feature, thus the classical 'large p, small n' problem. Bayesian hierarchical model, capable of borrowing strength across features within the same dataset, has been recognized as an effective tool in analyzing such data. However, the shrinkage effect, the most prominent feature of hierarchical features, can lead to undesirable over-correction for some features. In this work, we discuss possible causes of the over-correction problem and propose several alternative solutions. Our strategy is rooted in the fact that in the Big Data era, large amount of historical data are available which should be taken advantage of. Our strategy presents a new framework to enhance the Bayesian hierarchical model. Through simulation and real data analysis, we demonstrated superior performance of the proposed strategy. Our new strategy also enables borrowing information across different platforms which could be extremely useful with emergence of new technologies and accumulation of data from different platforms in the Big Data era. Our method has been implemented in R package "adaptiveHM", which is freely available from https://github.com/benliemory/adaptiveHM.

KEYWORDS:

450K methylation array; Bayesian hierarchical model; bisulphite sequencing; historical data; informative prior

Supplemental Content

Full text links

Icon for PubMed Central
Loading ...
Support Center