| Literature DB >> 28214361 |
Mutsutomo Tokizawa1, Kazutaka Kusunoki1, Hiroyuki Koyama1,2, Atsushi Kurotani3, Tetsuya Sakurai3, Yutaka Suzuki4, Tomoaki Sakamoto5, Tetsuya Kurata5, Yoshiharu Y Yamamoto1,2,3,6.
Abstract
Information about transcription start sites (TSSs) provides baseline data for the analysis of promoter architecture. In this paper we used paired- and single-end deep sequencing to analyze Arabidopsis TSS tags from several libraries prepared from roots, shoots, flowers and etiolated seedlings. The clustering of approximately 33 million mapped TSS tags led to the identification of 324 461 promoters that covered 79.7% (21 672/27 206) of protein-coding genes in the Arabidopsis genome. In addition we identified intragenic, antisense and orphan promoters that were not associated with any gene models. Of these, intragenic promoters exhibited unique characteristics regarding dinucleotide sequences at TSSs and core promoter element composition, suggesting that these promoters use different mechanisms of transcriptional initiation. An analysis of base composition with regard to promoter position revealed a low GC content throughout the promoter region and several local strand biases that were evident for TATA-type promoters, but not for Coreless-type promoters. Most observed strand biases coincided with strand biases of single nucleotide polymorphism rate. Our analysis also revealed that transcription of a gene is supported by an average of 2.7 genic promoters, among which one specific promoter, designated as a top promoter, substantially determines the expression level of the gene.Entities:
Keywords: zzm321990Arabidopsis thalianazzm321990; TSS-seq; core promoter element; promoter; promoter maturation; transcription start site; transcriptional regulation
Mesh:
Substances:
Year: 2017 PMID: 28214361 DOI: 10.1111/tpj.13511
Source DB: PubMed Journal: Plant J ISSN: 0960-7412 Impact factor: 6.417