Display Settings:

Format

Send to:

Choose Destination
See comment in PubMed Commons below
Nucleic Acids Res. 2001 Jun 15;29(12):E60-0.

Identification and prevention of a GC content bias in SAGE libraries.

Author information

  • 1Department of Human Genetics, University of Michigan Medical School and Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, MI 48109, USA.

Abstract

Serial Analysis of Gene Expression (SAGE) is becoming a widely used gene expression profiling method for the study of development, cancer and other human diseases. Investigators using SAGE rely heavily on the quantitative aspect of this method for cataloging gene expression and comparing multiple SAGE libraries. We have developed additional computational and statistical tools to assess the quality and reproducibility of a SAGE library. Using these methods, a critical variable in the SAGE protocol was identified that has the potential to bias the Tag distribution relative to the GC content of the 10 bp SAGE Tag DNA sequence. We also detected this bias in a number of publicly available SAGE libraries. It is important to note that the GC content bias went undetected by quality control procedures in the current SAGE protocol and was only identified with the use of these statistical analyses on as few as 750 SAGE Tags. In addition to keeping any solution of free DiTags on ice, an analysis of the GC content should be performed before sequencing large numbers of SAGE Tags to be confident that SAGE libraries are free from experimental bias.

PMID:
11410683
[PubMed - indexed for MEDLINE]
PMCID:
PMC55759
Free PMC Article

Images from this publication.See all images (9)Free text

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for HighWire Icon for PubMed Central
    Loading ...
    Write to the Help Desk