Hierarchical classification of G-protein-coupled receptors with data-driven selection of attributes and classifiers

Int J Data Min Bioinform. 2010;4(2):191-210. doi: 10.1504/ijdmb.2010.032150.

Abstract

We address the important bioinformatics problem of predicting protein function from a protein's primary sequence. We consider the functional classification of G-Protein-Coupled Receptors (GPCRs), whose functions are specified in a class hierarchy. We tackle this task using a novel top-down hierarchical classification system where, for each node in the class hierarchy, the predictor attributes to be used in that node and the classifier to be applied to the selected attributes are chosen in a data-driven manner. Compared with a previous hierarchical classification system selecting classifiers only, our new system significantly reduced processing time without significantly sacrificing predictive accuracy.

MeSH terms

  • Computational Biology / methods*
  • Databases, Protein
  • Receptors, G-Protein-Coupled / chemistry
  • Receptors, G-Protein-Coupled / classification*
  • Sequence Analysis, Protein

Substances

  • Receptors, G-Protein-Coupled