Self-organizing neural networks to support the discovery of DNA-binding motifs

Shaun Mahony; Panayiotis V Benos; Terry J Smith; Aaron Golden

doi:10.1016/j.neunet.2006.05.023

Self-organizing neural networks to support the discovery of DNA-binding motifs

Neural Netw. 2006 Jul-Aug;19(6-7):950-62. doi: 10.1016/j.neunet.2006.05.023. Epub 2006 Jul 12.

Authors

Shaun Mahony¹, Panayiotis V Benos, Terry J Smith, Aaron Golden

Affiliation

¹ Department of Computational Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA. shaun.mahony@ccbb.pitt.edu

PMID: 16839740
DOI: 10.1016/j.neunet.2006.05.023

Abstract

Identification of the short DNA sequence motifs that serve as binding targets for transcription factors is an important challenge in bioinformatics. Unsupervised techniques from the statistical learning theory literature have often been applied to motif discovery, but effective solutions for large genomic datasets have yet to be found. We present here three self-organizing neural networks that have applicability to the motif-finding problem. The core system in this study is a previously described SOM-based motif-finder named SOMBRERO. The motif-finder is integrated in this work with a SOM-based method that automatically constructs generalized models for structurally related motifs and initializes SOMBRERO with relevant biological knowledge. A self-organizing tree method that displays the relationships between various motifs is also presented, and it is shown that such a method can act as an effective structural classifier of novel motifs. The performance of the three self-organizing neural networks is evaluated here using various datasets.

Publication types

Comparative Study
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms*
Amino Acid Motifs / physiology*
Animals
Cluster Analysis*
Computational Biology
DNA / metabolism*
Databases, Genetic
Humans
Molecular Sequence Data
Neural Networks, Computer*
Protein Structure, Tertiary
Sequence Analysis, DNA

Substances

DNA