Send to

Choose Destination
See comment in PubMed Commons below
Pac Symp Biocomput. 2008:279-90.

Integration of microarray and textual data improves the prognosis prediction of breast, lung and ovarian cancer patients.

Author information

Katholieke Universiteit Leuven, Kasteelpark Arenberg 10, Leuven, B-3001, Belgium.


Microarray data are notoriously noisy such that models predicting clinically relevant outcomes often contain many false positive genes. Integration of other data sources can alleviate this problem and enhance gene selection and model building. Probabilistic models provide a natural solution to integrate information by using the prior over model space. We investigated if the use of text information from PUBMED abstracts in the structure prior of a Bayesian network could improve the prediction of the prognosis in cancer. Our results show that prediction of the outcome with the text prior was significantly better compared to not using a prior, both on a well known microarray data set and on three independent microarray data sets.

[Indexed for MEDLINE]
Free full text
PubMed Commons home

PubMed Commons

How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Pacific Sympsium On Biocomputing
    Loading ...
    Support Center