| Literature DB >> 35869513 |
Pavel A Makhnovskii1, Oleg A Gusev2,3, Roman O Bokov1, Guzel R Gazizova3, Tatiana F Vepkhvadze1,4, Evgeny A Lysenko1, Olga L Vinogradova1,4, Fedor A Kolpakov5, Daniil V Popov6,7.
Abstract
BACKGROUND: More than half of human protein-coding genes have an alternative transcription start site (TSS). We aimed to investigate the contribution of alternative TSSs to the acute-stress-induced transcriptome response in human tissue (skeletal muscle) using the cap analysis of gene expression approach. TSSs were examined at baseline and during recovery after acute stress (a cycling exercise).Entities:
Keywords: CAGE; Differential TSSs usage; Promoter shift; Transcription factor; Transcription start site
Mesh:
Year: 2022 PMID: 35869513 PMCID: PMC9308330 DOI: 10.1186/s40246-022-00399-8
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 6.481
Fig. 1Annotation of cap analysis of gene expression (CAGE) transcription start site (TSS) clusters. A. CAGE TSS clusters showed the classical distribution to “sharp” and “broad” classes. B. Distribution of distance from CAGE TSS clusters to TSSs annotated in Ensembl and RefSeq (most of our CAGE TSS clusters fall between ± 50 bp from annotated TSSs). C. Location of CAGE TSS clusters (ordered by priority) versus gene annotation. Additionally, putative alternative starts were verified using the coverage and exon–exon junction (RNA sequencing data) and annotated to corresponding genes. D. Number of CAGE TSS clusters annotated to different locations before (left panel) and after (right panel) elimination of low-abundance CAGE TSS clusters. E. Expression (median and interquartile range) of CAGE TSS clusters annotated to different locations. F. Overlap of CAGE TSS clusters from the FANTOM5 and refTSS projects with those in our study and 3764 first defined (probably muscle-specific) CAGE TSS clusters. G. Analysis of the coverage and exon–exon junctions for the first exon (RNA sequencing data; example for the NOS1 gene) allows the annotation, for the first time, of 290 CAGE TSS clusters belonging to 163 genes. CDS, coding sequence
Fig. 2Contribution of alternative TSSs to acute-stress-induced transcriptome response and their functional role. A. Number of differentially expressed genes (DEGs) relative to pre-exercise and their overlap at different time points after acute exercise. Up- and down-regulated genes are shown in red and blue, respectively; the number of time-specific DEGs is underlined. B. Principal component analysis shows the consistency of gene responses in different volunteers at different time points during the first hours after acute exercise. C. Number of DEGs having one (532 genes) or several (733 and 146 genes) TSSs and showing differential TSSs usage (146 genes). Another set of genes (111 genes) shows differential TSSs usage without altering the overall expression of each gene. D. Examples of different TSS regulation patterns. E. Distribution of TSSs per gene and a potential functional role of the alternative TSSs
Fig. 3Individual promoter regions for the CAGE TSSs in skeletal muscle. A. Individual muscle promoter region around the CAGE TSS cluster was identified using overlapped ATAC-seq and DNase-seq data (MACS2 peaks) (example for a bidirectional promoter); additionally, the density of TFBSs was shown (see Additional file1: Fig. S3). B. Open chromatin probability distribution around the CAGE TSS (− 2000 to + 2000 bp) is similar to the density of transcription factor binding sites (TFBSs) (thick and dashed lines are median and interquartile range, respectively). C. Individual muscle promoters (86%) show greater expression level and density of TFBSs than a small fraction of pseudo-promoter—a region − 2000 to + 2000 bp from the CAGE TSS clusters without (> 2000 bp from the CAGE TSS cluster) open chromatin. Median, interquartile range, 1–99% range, and P value (Wilcoxon test) values are shown. D. The individual muscle promoter regions for all promoters and bidirectional promoters are shown (arranged, from bottom to top, according to increasing total length of the promoter; each bidirectional promoter is depicted twice in relation to each TSS). E. The use of individual promoter regions increases the number of predicted TFs associated with changes in gene expression at 1 h, 3 h, and 6 h after exercise, compared with the use of standard regions with a fixed length
Fig. 4TFs associated with co-expressed individual muscle promoter regions induced by exercise. Unsupervised analysis revealed 21 clusters of co-expressed individual promoter regions induced by exercise (b: before exercise). Each cluster includes data of 10 subjects and 4 time points; expression in each time point for each subject expressed by rank (the highest expression level: 4; the lowest: 1) and is presented as median and interquartile range. Denominator: the number of exercise-regulated individual muscle promoter regions at a time point; Numerator: the cluster size. The top enriched TFs are shown for each cluster