Format

Send to

Choose Destination
BMC Bioinformatics. 2017 Nov 29;18(1):530. doi: 10.1186/s12859-017-1935-y.

A proximity-based graph clustering method for the identification and application of transcription factor clusters.

Author information

1
University of Michigan Medical School, 1301 Catherine, Ann Arbor, 48109-5624, USA. maxspad@umich.edu.
2
University of Michigan Department of Computational Medicine and Bioinformatics, 100 Washtenaw Avenue, Ann Arbor, 48109, USA.
3
University of Michigan Medical School Department of Emergency Medicine, 1500 E Medical Center Drive, Ann Arbor, 48109, USA.
4
University of Michigan Department of Genetics, 1241 E Catherine, Ann Arbor, 48109, USA.

Abstract

BACKGROUND:

Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions.

METHODS:

Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF.

RESULTS:

We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions.

CONCLUSION:

The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics.

KEYWORDS:

Genome regulation; Graph clustering; Graph theory; Network analysis; TF clusters; Transcription factors

PMID:
29187152
PMCID:
PMC5706350
DOI:
10.1186/s12859-017-1935-y
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center