Display Settings:

Format

Send to:

Choose Destination
    PLoS One. 2009 Nov 17;4(11):e7862.

    Mismatch and G-stack modulated probe signals on SNP microarrays.

    Source

    Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Leipzig, Germany. binder@izbi.uni-leipzig.de

    Abstract

    BACKGROUND:

    Single nucleotide polymorphism (SNP) arrays are important tools widely used for genotyping and copy number estimation. This technology utilizes the specific affinity of fragmented DNA for binding to surface-attached oligonucleotide DNA probes. We analyze the variability of the probe signals of Affymetrix GeneChip SNP arrays as a function of the probe sequence to identify relevant sequence motifs which potentially cause systematic biases of genotyping and copy number estimates.

    METHODOLOGY/PRINCIPAL FINDINGS:

    The probe design of GeneChip SNP arrays enables us to disentangle different sources of intensity modulations such as the number of mismatches per duplex, matched and mismatched base pairings including nearest and next-nearest neighbors and their position along the probe sequence. The effect of probe sequence was estimated in terms of triple-motifs with central matches and mismatches which include all 256 combinations of possible base pairings. The probe/target interactions on the chip can be decomposed into nearest neighbor contributions which correlate well with free energy terms of DNA/DNA-interactions in solution. The effect of mismatches is about twice as large as that of canonical pairings. Runs of guanines (G) and the particular type of mismatched pairings formed in cross-allelic probe/target duplexes constitute sources of systematic biases of the probe signals with consequences for genotyping and copy number estimates. The poly-G effect seems to be related to the crowded arrangement of probes which facilitates complex formation of neighboring probes with at minimum three adjacent G's in their sequence.

    CONCLUSIONS:

    The applied method of "triple-averaging" represents a model-free approach to estimate the mean intensity contributions of different sequence motifs which can be applied in calibration algorithms to correct signal values for sequence effects. Rules for appropriate sequence corrections are suggested.

    PMID:
    19924253
    [PubMed - indexed for MEDLINE]
    PMCID: PMC2775684
    Free PMC Article

    Images from this publication.See all images (13) Free text

    Figure 10
    Figure 6
    Figure 9
    Figure 8
    Figure 4
    Figure 5
    Figure 12
    Figure 13
    Figure 7
    Figure 11
    Figure 2
    Figure 1
    Figure 3

      Supplemental Content

      Click here to read Click here to read

      Recent activity

      Your browsing activity is empty.

      Activity recording is turned off.

      Turn recording back on

      See more...
      Write to the Help Desk