Assessing drug target association using semantic linked data

Bin Chen; Ying Ding; David J Wild

doi:10.1371/journal.pcbi.1002574

Assessing drug target association using semantic linked data

PLoS Comput Biol. 2012;8(7):e1002574. doi: 10.1371/journal.pcbi.1002574. Epub 2012 Jul 5.

Authors

Bin Chen¹, Ying Ding, David J Wild

Affiliation

¹ School of Informatics and Computing, Indiana University, Bloomington, IN, USA.

Abstract

The rapidly increasing amount of public data in chemistry and biology provides new opportunities for large-scale data mining for drug discovery. Systematic integration of these heterogeneous sets and provision of algorithms to data mine the integrated sets would permit investigation of complex mechanisms of action of drugs. In this work we integrated and annotated data from public datasets relating to drugs, chemical compounds, protein targets, diseases, side effects and pathways, building a semantic linked network consisting of over 290,000 nodes and 720,000 edges. We developed a statistical model to assess the association of drug target pairs based on their relation with other linked objects. Validation experiments demonstrate the model can correctly identify known direct drug target pairs with high precision. Indirect drug target pairs (for example drugs which change gene expression level) are also identified but not as strongly as direct pairs. We further calculated the association scores for 157 drugs from 10 disease areas against 1683 human targets, and measured their similarity using a [Formula: see text] score matrix. The similarity network indicates that drugs from the same disease area tend to cluster together in ways that are not captured by structural similarity, with several potential new drug pairings being identified. This work thus provides a novel, validated alternative to existing drug target prediction algorithms. The web service is freely available at: http://chem2bio2rdf.org/slap.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Computational Biology / methods
Data Mining / methods*
Databases, Factual*
Drug Discovery / methods*
Humans
Models, Theoretical
Reproducibility of Results
Semantics*