BACKGROUND: The core promoter, a region of about 100 base-pairs flanking the transcription start site (TSS), serves as the recognition site for the basal transcription apparatus. Drosophila TSSs have generally been mapped by individual experiments; the low number of accurately mapped TSSs has limited analysis of promoter sequence motifs and the training of computational prediction tools. RESULTS: We identified TSS candidates for about 2,000 Drosophila genes by aligning 5' expressed sequence tags (ESTs) from cap-trapped cDNA libraries to the genome, while applying stringent criteria concerning coverage and 5'-end distribution. Examination of the sequences flanking these TSSs revealed the presence of well-known core promoter motifs such as the TATA box, the initiator and the downstream promoter element (DPE). We also define, and assess the distribution of, several new motifs prevalent in core promoters, including what appears to be a variant DPE motif. Among the prevalent motifs is the DNA-replication-related element DRE, recently shown to be part of the recognition site for the TBP-related factor TRF2. Our TSS set was then used to retrain the computational promoter predictor McPromoter, allowing us to improve the recognition performance to over 50% sensitivity and 40% specificity. We compare these computational results to promoter prediction in vertebrates. CONCLUSIONS: There are relatively few recognizable binding sites for previously known general transcription factors in Drosophila core promoters. However, we identified several new motifs enriched in promoter regions. We were also able to significantly improve the performance of computational TSS prediction in Drosophila.
BACKGROUND: The core promoter, a region of about 100 base-pairs flanking the transcription start site (TSS), serves as the recognition site for the basal transcription apparatus. Drosophila TSSs have generally been mapped by individual experiments; the low number of accurately mapped TSSs has limited analysis of promoter sequence motifs and the training of computational prediction tools. RESULTS: We identified TSS candidates for about 2,000 Drosophila genes by aligning 5' expressed sequence tags (ESTs) from cap-trapped cDNA libraries to the genome, while applying stringent criteria concerning coverage and 5'-end distribution. Examination of the sequences flanking these TSSs revealed the presence of well-known core promoter motifs such as the TATA box, the initiator and the downstream promoter element (DPE). We also define, and assess the distribution of, several new motifs prevalent in core promoters, including what appears to be a variant DPE motif. Among the prevalent motifs is the DNA-replication-related element DRE, recently shown to be part of the recognition site for the TBP-related factorTRF2. Our TSS set was then used to retrain the computational promoter predictor McPromoter, allowing us to improve the recognition performance to over 50% sensitivity and 40% specificity. We compare these computational results to promoter prediction in vertebrates. CONCLUSIONS: There are relatively few recognizable binding sites for previously known general transcription factors in Drosophila core promoters. However, we identified several new motifs enriched in promoter regions. We were also able to significantly improve the performance of computational TSS prediction in Drosophila.
Authors: M Scherf; A Klingenhoff; K Frech; K Quandt; R Schneider; K Grote; M Frisch; V Gailus-Durner; A Seidel; R Brack-Werner; T Werner Journal: Genome Res Date: 2001-03 Impact factor: 9.043
Authors: P Carninci; Y Shibata; N Hayatsu; Y Sugahara; K Shibata; M Itoh; H Konno; Y Okazaki; M Muramatsu; Y Hayashizaki Journal: Genome Res Date: 2000-10 Impact factor: 9.043
Authors: Y Suzuki; H Taira; T Tsunoda; J Mizushima-Sugano; J Sese; H Hata; T Ota; T Isogai; T Tanaka; S Morishita; K Okubo; Y Sakaki; Y Nakamura; A Suyama; S Sugano Journal: EMBO Rep Date: 2001-05 Impact factor: 8.807
Authors: Y Suzuki; T Tsunoda; J Sese; H Taira; J Mizushima-Sugano; H Hata; T Ota; T Isogai; T Tanaka; Y Nakamura; A Suyama; Y Sakaki; S Morishita; K Okubo; S Sugano Journal: Genome Res Date: 2001-05 Impact factor: 9.043
Authors: Chin Yan Lim; Buyung Santoso; Thomas Boulay; Emily Dong; Uwe Ohler; James T Kadonaga Journal: Genes Dev Date: 2004-07-01 Impact factor: 11.361
Authors: Trevor J Parry; Joshua W M Theisen; Jer-Yuan Hsu; Yuan-Liang Wang; David L Corcoran; Moriah Eustice; Uwe Ohler; James T Kadonaga Journal: Genes Dev Date: 2010-08-27 Impact factor: 11.361
Authors: Tae Hoon Kim; Leah O Barrera; Ming Zheng; Chunxu Qu; Michael A Singer; Todd A Richmond; Yingnian Wu; Roland D Green; Bing Ren Journal: Nature Date: 2005-06-29 Impact factor: 49.962
Authors: Simon P Duffy; Aaron M Young; Benoit Morin; Christopher J Lucarotti; Ben F Koop; David B Levin Journal: J Virol Date: 2006-07 Impact factor: 5.103
Authors: C J Mungall; S Misra; B P Berman; J Carlson; E Frise; N Harris; B Marshall; S Shu; J S Kaminker; S E Prochnik; C D Smith; E Smith; J L Tupy; C Wiel; G M Rubin; S E Lewis Journal: Genome Biol Date: 2002-12-23 Impact factor: 13.583