Send to

Choose Destination
See comment in PubMed Commons below
Proteomics. 2013 Jan;13(2):261-8. doi: 10.1002/pmic.201200326. Epub 2012 Dec 3.

Heterogeneous data integration by tree-augmented naïve Bayes for protein-protein interactions prediction.

Author information

Department of Electrical Engineering and Computer Science, The University of Kansas, Lawrence, KS 66045, USA.


Most proteins execute their functions through interacting with other proteins. Thus, understanding protein-protein interactions (PPIs) is essential to decipher biological functions in a living cell. To predict large-scale PPIs, effective and efficient computational approaches are desirable to integrate heterogeneous data sources provided by advanced technologies. In this paper, we extend our previous work on a Bayesian classifier for human PPI predictions from model organisms, by introducing a tree-augmented naïve Bayes (TAN) classifier. TAN maintains the simplicity and robustness of a naïve Bayes classifier while allows for the dependence among variables. Our empirical results show that by integrating features extracted from microarray expression measurements, Gene Ontology values, and orthologous scores, TAN achieves higher classification accuracy than the manually constructed Bayesian network classifier and naïve Bayes. For human PPI prediction, TAN obtains 88% sensitivity while keeping a reasonable 70% specificity on testing samples.

[Indexed for MEDLINE]
PubMed Commons home

PubMed Commons

How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Wiley
    Loading ...
    Support Center