    Genome Res. 2002 Mar;12(3):458-61.

    Computational detection and location of transcription start sites in mammalian genomic DNA.

    Source

    Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom. td2@sanger.ac.uk

    Abstract

    Transcription, the process whereby RNA copies are made from sections of the DNA genome, is directed by promoter regions. These define the transcription start site, and also the set of cellular conditions under which the promoter is active. At least in more complex species, it appears to be common for genes to have several different transcription start sites, which may be active under different conditions. Eukaryotic promoters are complex and fairly diffuse structures, which have proven hard to detect in silico. We show that a novel hybrid machine-learning method is able to build useful models of promoters for >50% of human transcription start sites. We estimate specificity to be >70%, and demonstrate good positional accuracy. Based on the structure of our learned models, we conclude that a signal resembling the well-known TATA box and flanking regions of C-G enrichment are the most important sequence-based signals marking sites of transcriptional initiation at a large class of typical promoters.

    PMID: 11875034
    [PubMed - indexed for MEDLINE]
    PMCID: PMC155284
    Free PMC Article
