Format

Send to:

Choose Destination
See comment in PubMed Commons below
Stat Med. 2010 Feb 20;29(4):489-503. doi: 10.1002/sim.3815.

A Bayesian approach to joint modeling of protein-DNA binding, gene expression and sequence data.

Author information

  • 1Division of Biostatistics, Department of Clinical Sciences, University of Texas Southwestern Medical Center at Dallas, Dallas, TX, USA. yang.xie@utsouthwestern.edu

Abstract

The genome-wide DNA-protein-binding data, DNA sequence data and gene expression data represent complementary means to deciphering global and local transcriptional regulatory circuits. Combining these different types of data can not only improve the statistical power, but also provide a more comprehensive picture of gene regulation. In this paper, we propose a novel statistical model to augment protein-DNA-binding data with gene expression and DNA sequence data when available. We specify a hierarchical Bayes model and use Markov chain Monte Carlo simulations to draw inferences. Both simulation studies and an analysis of an experimental data set show that the proposed joint modeling method can significantly improve the specificity and sensitivity of identifying target genes as compared with conventional approaches relying on a single data source.

(c) 2010 John Wiley & Sons, Ltd.

PMID:
20049751
[PubMed - indexed for MEDLINE]
PMCID:
PMC3341088
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for John Wiley & Sons, Inc. Icon for PubMed Central
    Loading ...
    Write to the Help Desk