Format

Send to

Choose Destination
See comment in PubMed Commons below
Clin Lab Med. 2008 Mar;28(1):145-66, viii. doi: 10.1016/j.cll.2007.10.010.

Data mining in genomics.

Author information

1
Division of Biostatistics and Epidemiology, Department of Public Health Sciences, Box 800717, University of Virginia, Charlottesville, VA 22908, USA. jaeklee@virginia.edu

Abstract

This article reviews important emerging statistical concepts, data mining techniques, and applications that have been recently developed and used for genomic data analysis. First, general background and some critical issues in genomic data mining are summarized. A novel concept of statistical significance is described, the so-called "false discovery rate"-the rate of false-positives among all positive findings-which has been suggested to control the error rate of numerous false-positives in large screening biological data analysis. Two recent statistical testing methods are then introduced: significance analysis of microarray and local pooled error tests. Statistical modeling in genomic data analysis is then presented, such as analysis of variance and heterogeneous error modeling approaches that have been suggested for analyzing microarray data obtained from multiple experimental or biological conditions. Two sections then describe data exploration and discovery tools largely termed as supervised learning and unsupervised learning. The former approaches include several multivariate statistical methods to investigate coexpression patterns of multiple genes, and the latter are the classification methods to discover genomic biomarker signatures for predicting important subclasses of human diseases. The last section briefly summarizes various genomic data mining approaches in biomedical pathway analysis and patient outcome or chemotherapeutic response prediction.

PMID:
18194724
PMCID:
PMC2253491
DOI:
10.1016/j.cll.2007.10.010
[Indexed for MEDLINE]
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Elsevier Science Icon for PubMed Central
    Loading ...
    Support Center