Self Organizing Map-Based Classification of Cathepsin k and S Inhibitors with Different Selectivity Profiles Using Different Structural Molecular Fingerprints: Design and Application for Discovery of Novel Hits

Molecules. 2016 Jan 30;21(2):175. doi: 10.3390/molecules21020175.

Abstract

The main step in a successful drug discovery pipeline is the identification of small potent compounds that selectively bind to the target of interest with high affinity. However, there is still a shortage of efficient and accurate computational methods with powerful capability to study and hence predict compound selectivity properties. In this work, we propose an affordable machine learning method to perform compound selectivity classification and prediction. For this purpose, we have collected compounds with reported activity and built a selectivity database formed of 153 cathepsin K and S inhibitors that are considered of medicinal interest. This database has three compound sets, two K/S and S/K selective ones and one non-selective KS one. We have subjected this database to the selectivity classification tool 'Emergent Self-Organizing Maps' for exploring its capability to differentiate selective cathepsin inhibitors for one target over the other. The method exhibited good clustering performance for selective ligands with high accuracy (up to 100 %). Among the possibilites, BAPs and MACCS molecular structural fingerprints were used for such a classification. The results exhibited the ability of the method for structure-selectivity relationship interpretation and selectivity markers were identified for the design of further novel inhibitors with high activity and target selectivity.

Keywords: cathepsin inhibitors; clustering; fingerprints; selectivity; self-organizing map (SOM).

MeSH terms

  • Cathepsin K / antagonists & inhibitors*
  • Cathepsins / antagonists & inhibitors*
  • Classification
  • Computational Biology / methods*
  • Databases, Chemical
  • Drug Discovery
  • Humans
  • Machine Learning
  • Protein Binding
  • Protein Conformation
  • Structure-Activity Relationship

Substances

  • Cathepsins
  • cathepsin S
  • Cathepsin K