Display Settings:

Format

Send to:

Choose Destination
We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
    BMC Bioinformatics. 2009 Jan 29;10:38. doi: 10.1186/1471-2105-10-38.

    On reliable discovery of molecular signatures.

    Source

    Computational Biology, Department of Physics, Linköping University, SE58183 Linköping, Sweden. rnilsson@broad.mit.edu

    Abstract

    BACKGROUND:

    Molecular signatures are sets of genes, proteins, genetic variants or other variables that can be used as markers for a particular phenotype. Reliable signature discovery methods could yield valuable insight into cell biology and mechanisms of human disease. However, it is currently not clear how to control error rates such as the false discovery rate (FDR) in signature discovery. Moreover, signatures for cancer gene expression have been shown to be unstable, that is, difficult to replicate in independent studies, casting doubts on their reliability.

    RESULTS:

    We demonstrate that with modern prediction methods, signatures that yield accurate predictions may still have a high FDR. Further, we show that even signatures with low FDR may fail to replicate in independent studies due to limited statistical power. Thus, neither stability nor predictive accuracy are relevant when FDR control is the primary goal. We therefore develop a general statistical hypothesis testing framework that for the first time provides FDR control for signature discovery. Our method is demonstrated to be correct in simulation studies. When applied to five cancer data sets, the method was able to discover molecular signatures with 5% FDR in three cases, while two data sets yielded no significant findings.

    CONCLUSION:

    Our approach enables reliable discovery of molecular signatures from genome-wide data with current sample sizes. The statistical framework developed herein is potentially applicable to a wide range of prediction problems in bioinformatics.

    PMID:
    19178740
    [PubMed - indexed for MEDLINE]
    PMCID:
    PMC2646701
    Free PMC Article

    Images from this publication.See all images (4)Free text

    Figure 1
    Figure 3
    Figure 2
    Figure 4

      Supplemental Content

      Icon for BioMed Central Icon for PubMed Central

      Save items

      Recent activity

      Your browsing activity is empty.

      Activity recording is turned off.

      Turn recording back on

      See more...
      Write to the Help Desk