Integrative genomic data mining for discovery of potential blood-borne biomarkers for early diagnosis of cancer

PLoS One. 2008;3(11):e3661. doi: 10.1371/journal.pone.0003661. Epub 2008 Nov 6.

Abstract

Background: With the arrival of the postgenomic era, there is increasing interest in the discovery of biomarkers for the accurate diagnosis, prognosis, and early detection of cancer. Blood-borne cancer markers are favored by clinicians, because blood samples can be obtained and analyzed with relative ease. We have used a combined mining strategy based on an integrated cancer microarray platform, Oncomine, and the biomarker module of the Ingenuity Pathways Analysis (IPA) program to identify potential blood-based markers for six common human cancer types.

Methodology/principal findings: In the Oncomine platform, the genes overexpressed in cancer tissues relative to their corresponding normal tissues were filtered by Gene Ontology keywords, with the extracellular environment stipulated and a corrected Q value (false discovery rate) cut-off implemented. The identified genes were imported to the IPA biomarker module to separate out those genes encoding putative secreted or cell-surface proteins as blood-borne (blood/serum/plasma) cancer markers. The filtered potential indicators were ranked and prioritized according to normalized absolute Student t values. The retrieval of numerous marker genes that are already clinically useful or under active investigation confirmed the effectiveness of our mining strategy. To identify the biomarkers that are unique for each cancer type, the upregulated marker genes that are in common between each two tumor types across the six human tumors were also analyzed by the IPA biomarker comparison function.

Conclusion/significance: The upregulated marker genes shared among the six cancer types may serve as a molecular tool to complement histopathologic examination, and the combination of the commonly upregulated and unique biomarkers may serve as differentiating markers for a specific cancer. This approach will be increasingly useful to discover diagnostic signatures as the mass of microarray data continues to grow in the 'omics' era.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Biomarkers, Tumor / blood*
  • Biomarkers, Tumor / genetics
  • Breast Neoplasms / blood
  • Breast Neoplasms / diagnosis
  • Breast Neoplasms / genetics
  • Carcinoma / blood
  • Carcinoma / diagnosis
  • Carcinoma / genetics
  • Colonic Neoplasms / blood
  • Colonic Neoplasms / diagnosis
  • Colonic Neoplasms / genetics
  • Databases, Genetic
  • Early Diagnosis
  • Female
  • Gene Expression Profiling*
  • Gene Expression Regulation, Neoplastic
  • Humans
  • Lung Neoplasms / blood
  • Lung Neoplasms / diagnosis
  • Lung Neoplasms / genetics
  • Male
  • Microarray Analysis*
  • Neoplasm Proteins / blood
  • Neoplasm Proteins / genetics*
  • Neoplasms / blood
  • Neoplasms / diagnosis*
  • Neoplasms / genetics
  • Neoplastic Cells, Circulating
  • Ovarian Neoplasms / blood
  • Ovarian Neoplasms / diagnosis
  • Ovarian Neoplasms / genetics
  • Pancreatic Neoplasms / blood
  • Pancreatic Neoplasms / diagnosis
  • Pancreatic Neoplasms / genetics
  • Prostatic Neoplasms / blood
  • Prostatic Neoplasms / diagnosis
  • Prostatic Neoplasms / genetics
  • Up-Regulation

Substances

  • Biomarkers, Tumor
  • Neoplasm Proteins