Using matched molecular series as a predictive tool to optimize biological activity

J Med Chem. 2014 Mar 27;57(6):2704-13. doi: 10.1021/jm500022q. Epub 2014 Mar 14.

Abstract

A matched molecular series is the general form of a matched molecular pair and refers to a set of two or more molecules with the same scaffold but different R groups at the same position. We describe Matsy, a knowledge-based method that uses matched series to predict R groups likely to improve activity given an observed activity order for some R groups. We compare the Matsy predictions based on activity data from ChEMBLdb to the recommendations of the Topliss tree and carry out a large scale retrospective test to measure performance. We show that the basis for predictive success is preferred orders in matched series and that this preference is stronger for longer series. The Matsy algorithm allows medicinal chemists to integrate activity trends from diverse medicinal chemistry programs and apply them to problems of interest as a Topliss-like recommendation or as a hypothesis generator to aid compound design.

MeSH terms

  • Algorithms*
  • Alkanes / chemical synthesis
  • Alkanes / chemistry
  • Computational Biology
  • Computer Simulation
  • Databases, Chemical
  • Drug Design*
  • Molecular Structure
  • Predictive Value of Tests
  • Structure-Activity Relationship*

Substances

  • Alkanes