Bayesian modeling in virtual high throughput screening

Anthony E Klon

doi:10.2174/138620709788489046

Bayesian modeling in virtual high throughput screening

Comb Chem High Throughput Screen. 2009 Jun;12(5):469-83. doi: 10.2174/138620709788489046.

Author

Anthony E Klon¹

Affiliation

¹ Department of Molecular Modeling, Pharmacopeia Drug Discovery Inc., P.O. Box 5350 Princeton, NJ 08543-5350, USA. aklon@locuspharma.com

PMID: 19519326
DOI: 10.2174/138620709788489046

Abstract

Naïve Bayesian classifiers are a relatively recent addition to the arsenal of tools available to computational chemists. These classifiers fall into a class of algorithms referred to broadly as machine learning algorithms. Bayesian classifiers may be used in conjunction with classical modeling techniques to assist in the rapid virtual screening of large compound libraries in a systematic manner with a minimum of human intervention. This approach allows computational scientists to concentrate their efforts on their core strengths of model building. Bayesian classifiers have an added advantage of being able to handle a variety of numerical or binary data such as physicochemical properties or molecular fingerprints, making the addition of new parameters to existing models a relatively straightforward process. As a result, during a drug discovery project these classifiers can better evolve with the needs of the projects from general models in the lead finding stages to increasingly precise models in the lead optimization stages that are of particular interest to a specific medicinal chemistry team. Although other machine learning algorithms abound, Bayesian classifiers have been shown to compare favorably under most working conditions and have been shown to be tolerant of noisy experimental data.

MeSH terms

Algorithms*
Artificial Intelligence*
Drug Discovery*