Warning: The NCBI web site requires JavaScript to function. more...
Generate a file for use with external citation management software.
Protein Design Group, Centro Nacional de Biotecnología-CSIC, Cantoblanco, Madrid, Spain. andrade@ebi.ac.uk
Using a SOM (self-organizing map) we can classify sequences within a protein family into subgroups that generally correspond to biological subcategories. These maps tend to show sequence similarity as proximity in the map. Combining maps generated at different levels of resolution, the structure of relations in protein families can be captured that could not otherwise be represented in a single map. The underlying representation of maps enables us to retrieve characteristic sequence patterns for individual subgroups of sequences. Such patterns tend to correspond to functionally important regions. We present a modified SOM algorithm that includes a convergence test that dynamically controls the learning parameters to adapt them to the learning set instead of being fixed and externally optimized by trial and error. Given the variability of protein family size and distribution, the addition of this features is necessary. The method is successfully tested with a number of families. The rab family of small GTPases is used to illustrate the performance of the method.
Your browsing activity is empty.
Activity recording is turned off.
Turn recording back on