Format

Send to

Choose Destination
Plant J. 2017 May;90(3):587-605. doi: 10.1111/tpj.13511. Epub 2017 Apr 4.

Identification of Arabidopsis genic and non-genic promoters by paired-end sequencing of TSS tags.

Author information

1
United Graduate School of Agriculture, Gifu University, Yanagido 1-1, Gifu City, Gifu, 501-1193, Japan.
2
Faculty of Applied Biological Sciences, Gifu University, Yanagido 1-1, Gifu City, Gifu, 501-1193, Japan.
3
RIKEN Center for Sustainable Resource Science, Suehiro-cho 1-7-22, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan.
4
Institute of Medical Science, University of Tokyo, Shiroganedai 4-6-1, Minato-ku, Tokyo, 108-8639, Japan.
5
Plant Global Education Project, Graduate School of Biological Sciences, Nara Institute of Science and Technology, Takayam-cho 8916-5, Ikoma, Nara, 630-0192, Japan.
6
JST ALCA, Tokyo, Japan.

Abstract

Information about transcription start sites (TSSs) provides baseline data for the analysis of promoter architecture. In this paper we used paired- and single-end deep sequencing to analyze Arabidopsis TSS tags from several libraries prepared from roots, shoots, flowers and etiolated seedlings. The clustering of approximately 33 million mapped TSS tags led to the identification of 324 461 promoters that covered 79.7% (21 672/27 206) of protein-coding genes in the Arabidopsis genome. In addition we identified intragenic, antisense and orphan promoters that were not associated with any gene models. Of these, intragenic promoters exhibited unique characteristics regarding dinucleotide sequences at TSSs and core promoter element composition, suggesting that these promoters use different mechanisms of transcriptional initiation. An analysis of base composition with regard to promoter position revealed a low GC content throughout the promoter region and several local strand biases that were evident for TATA-type promoters, but not for Coreless-type promoters. Most observed strand biases coincided with strand biases of single nucleotide polymorphism rate. Our analysis also revealed that transcription of a gene is supported by an average of 2.7 genic promoters, among which one specific promoter, designated as a top promoter, substantially determines the expression level of the gene.

KEYWORDS:

Arabidopsis thaliana ; TSS-seq; core promoter element; promoter; promoter maturation; transcription start site; transcriptional regulation

PMID:
28214361
DOI:
10.1111/tpj.13511
[Indexed for MEDLINE]
Free full text

Supplemental Content

Full text links

Icon for Wiley
Loading ...
Support Center