SAR QSAR Environ Res. 2007 Jan-Mar;18(1-2):141-53.

Predicting activities without computing descriptors: graph machines for QSAR.

Author information

  • 1Laboratoire d'Electronique, Ecole Supérieure de Physique et de Chimie Industrielles de la Ville de Paris (ESPCI-ParisTech), 10 rue Vauquelin, 75005 Paris, France.

Abstract

We describe graph machines, an alternative approach to traditional machine-learning-based QSAR, which circumvents the problem of designing, computing and selecting molecular descriptors. In that approach, which is similar in spirit to recursive networks, molecules are considered as structured data, represented as graphs. For each example of the data set, a mathematical function (graph machine) is built, whose structure reflects the structure of the molecule under consideration; it is the combination of identical parameterised functions, called "node functions" (e.g. a feedforward neural network). The parameters of the node functions, shared both within and across the graph machines, are adjusted during training with the "shared weights" technique. Model selection is then performed by traditional cross-validation. Therefore, the designer's main task consists in finding the optimal complexity for the node function. The efficiency of this new approach has been demonstrated in many QSAR or QSPR tasks, as well as in modelling the activities of complex chemicals (e.g. the toxicity of a family of phenols or the anti-HIV activities of HEPT derivatives). It generally outperforms traditional techniques without requiring the selection and computation of descriptors.
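The construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes acyclic molecules encoded as rooted trees of heavy atoms, one-hot atom labels, and a tiny tanh network as the node function, and it omits training entirely. The names `NodeFunction`, `graph_machine`, `predict`, and the toy molecules are hypothetical. What it shows is the core idea that one set of parameters is reused at every node of every molecule, so each molecule yields a function whose structure mirrors its graph.

```python
import math
import random


class NodeFunction:
    """Small feedforward net; ONE instance is shared by all nodes of all molecules."""

    def __init__(self, n_labels, max_children, n_hidden, state_dim, seed=0):
        rng = random.Random(seed)
        self.max_children = max_children
        self.state_dim = state_dim
        # Inputs: atom label + one state vector per child slot + bias.
        n_in = n_labels + max_children * state_dim + 1
        self.w_hidden = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                         for _ in range(n_hidden)]
        self.w_out = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]
                      for _ in range(state_dim)]

    def __call__(self, label, child_states):
        if len(child_states) > self.max_children:
            raise ValueError("node has more children than max_children")
        # Assumption: unused child slots are padded with zero vectors.
        missing = self.max_children - len(child_states)
        padded = list(child_states) + [[0.0] * self.state_dim] * missing
        x = list(label) + [v for state in padded for v in state] + [1.0]
        hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)))
                  for row in self.w_hidden] + [1.0]
        return [sum(w * h for w, h in zip(row, hidden)) for row in self.w_out]


def graph_machine(node, node_function):
    """Compose the shared node function along the tree, from leaves to root."""
    label, children = node
    child_states = [graph_machine(child, node_function) for child in children]
    return node_function(label, child_states)


def predict(molecule, node_function):
    # Assumption: the first component of the root output is the predicted activity.
    return graph_machine(molecule, node_function)[0]


# Toy one-hot labels and heavy-atom trees, written as (label, [children]).
CARBON, OXYGEN = [1.0, 0.0], [0.0, 1.0]
ethanol = (OXYGEN, [(CARBON, [(CARBON, [])])])
propane = (CARBON, [(CARBON, [(CARBON, [])])])

# The same (untrained) parameters serve both molecules.
shared = NodeFunction(n_labels=2, max_children=4, n_hidden=3, state_dim=2)
for name, molecule in [("ethanol", ethanol), ("propane", propane)]:
    print(name, predict(molecule, shared))
```

In this sketch the number of hidden units (`n_hidden`) plays the role of the node-function complexity that the abstract says is chosen by cross-validation. Training would adjust `w_hidden` and `w_out` by summing gradient contributions over every node of every molecule, which is what sharing the weights implies. The result here depends on child ordering and on the choice of root, simplifications that a real implementation would need to address.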

PMID: 17365965
DOI: 10.1080/10629360601054313
[PubMed - indexed for MEDLINE]