An informatic pipeline for managing high-throughput screening experiments and analyzing data from stereochemically diverse libraries

J Comput Aided Mol Des. 2013 May;27(5):455-68. doi: 10.1007/s10822-013-9641-y. Epub 2013 Apr 13.

Abstract

Integration of flexible data-analysis tools with cheminformatics methods is a prerequisite for successful identification and validation of "hits" in high-throughput screening (HTS) campaigns. We have designed, developed, and implemented a suite of robust yet flexible cheminformatics tools to support HTS activities at the Broad Institute, three of which are described herein. The "hit-calling" tool allows a researcher to set a hit threshold that can be varied during downstream analysis. The results from the hit-calling exercise are reported to a database for record keeping and further data analysis. The "cherry-picking" tool enables creation of an optimized list of hits for confirmatory and follow-up assays from an HTS hit list. This tool allows filtering by computed chemical property and by substructure. In addition, similarity searches can be performed on hits of interest and sets of related compounds can be selected. The third tool, an "S/SAR viewer," has been designed specifically for the Broad Institute's diversity-oriented synthesis (DOS) collection. The compounds in this collection are rich in chiral centers and the full complement of all possible stereoisomers of a given compound are present in the collection. The S/SAR viewer allows rapid identification of both structure/activity relationships and stereo-structure/activity relationships present in HTS data from the DOS collection. Together, these tools enable the prioritization and analysis of hits from diverse compound collections, and enable informed decisions for follow-up biology and chemistry efforts.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Combinatorial Chemistry Techniques
  • Databases, Factual
  • Drug Design*
  • High-Throughput Screening Assays*
  • Humans
  • Structure-Activity Relationship*