Display Settings:

Format

Send to:

Choose Destination
See comment in PubMed Commons below
Genome Res. 2012 Feb;22(2):375-85. doi: 10.1101/gr.120477.111. Epub 2011 Jun 7.

De novo discovery of mutated driver pathways in cancer.

Author information

  • 1Department of Computer Science and Center for Computational Molecular Biology, Brown University, Providence, Rhode Island 02912, USA.

Abstract

Next-generation DNA sequencing technologies are enabling genome-wide measurements of somatic mutations in large numbers of cancer patients. A major challenge in the interpretation of these data is to distinguish functional "driver mutations" important for cancer development from random "passenger mutations." A common approach for identifying driver mutations is to find genes that are mutated at significant frequency in a large cohort of cancer genomes. This approach is confounded by the observation that driver mutations target multiple cellular signaling and regulatory pathways. Thus, each cancer patient may exhibit a different combination of mutations that are sufficient to perturb these pathways. This mutational heterogeneity presents a problem for predicting driver mutations solely from their frequency of occurrence. We introduce two combinatorial properties, coverage and exclusivity, that distinguish driver pathways, or groups of genes containing driver mutations, from groups of genes with passenger mutations. We derive two algorithms, called Dendrix, to find driver pathways de novo from somatic mutation data. We apply Dendrix to analyze somatic mutation data from 623 genes in 188 lung adenocarcinoma patients, 601 genes in 84 glioblastoma patients, and 238 known mutations in 1000 patients with various cancers. In all data sets, we find groups of genes that are mutated in large subsets of patients and whose mutations are approximately exclusive. Our Dendrix algorithms scale to whole-genome analysis of thousands of patients and thus will prove useful for larger data sets to come from The Cancer Genome Atlas (TCGA) and other large-scale cancer genome sequencing projects.

PMID:
21653252
[PubMed - indexed for MEDLINE]
PMCID:
PMC3266044
Free PMC Article

Images from this publication.See all images (5)Free text

Figure 1.
Figure 2.
Figure 3.
Figure 4.
Figure 5.
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for HighWire Icon for PubMed Central Icon for Faculty of 1000
    Loading ...
    Write to the Help Desk