| Literature DB >> 16356263 |
Haibo Zhang1, Ju Youn Lee, Bin Tian.
Abstract
BACKGROUND: Alternative polyadenylation is one of the mechanisms in human cells that give rise to a variety of transcripts from a single gene. More than half of the human genes have multiple polyadenylation sites (poly(A) sites), leading to variable mRNA and protein products. Previous studies of individual genes have indicated that alternative polyadenylation could occur in a tissue-specific manner.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16356263 PMCID: PMC1414089 DOI: 10.1186/gb-2005-6-12-r100
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Tissue-specific positional preference of poly(A) site usage. (a) Tissue-specific positional preference of type II genes. (b) Tissue-specific positional preference of type III genes. Distance to the median was calculated as (observed usage - median usage)/median usage. Only tissues with significant p values (<0.05, Chi-squared test) are shown here. 2F, 2M, and 2L are the poly(A) sites closest to the 5' end, middle, and closest to the 3' end in a type II gene, respectively. 3U and 3D are poly(A) sites located upstream of the exon closest to the 3' end and poly(A) sites in the exon closest to the 3' end in a type III gene, respectively.
Figure 2Tissue-specific strong and weak poly(A) site usage. For each tissue, three bars represent the distance to the median usage according to three different cutoffs for the classification of strong and weak sites, for example, 60%, 75% and 90%. For each gene at a given cutoff, the poly(A) site with the percent of supporting ESTs above the cutoff was classified as a strong site. If there was a strong poly(A) site, other sites of the same gene were classified as weak sites. Only values for the weak sites are shown. Significant ones (p value < 0.05, Chi-squared test) are marked with asterisks.
Polyadenylation-related protein factors
| Gene name | Gene ID* | U95Av2 probe-sets | U133A probe-sets | |
| 1 | CPSF-160, cleavage and polyadenylation specificity factor 1, 160 kDa | 29894 | 33132_at | 201638_s_at |
| 2 | CPSF-100, cleavage and polyadenylation specificity factor 2, 100 kDa | 53981 | NA | NA |
| 3 | CPSF-73, cleavage and polyadenylation specificity factor 3, 73 kDa | 51692 | NA | NA |
| 4 | CPSF-30, cleavage and polyadenylation specificity factor 4, 30 kDa | 10898 | 35743_at | 206688_s_at |
| 5 | CFI25, cleavage factor Im, 25 kDa | 11051 | 39142_at | 202697_at |
| 6 | CFI68, cleavage factor Im, 68 kDa | 11052 | 35757_at | 202469_s_at |
| 7 | hFip1, | 81608 | NA | 221007_s_at |
| 8 | CstF50, cleavage stimulatory factor subunit 1, 50 kDa | 1477 | 32723_at | 202190_at |
| 9 | CstF-64, cleavage stimulatory factor subunit 2, 64 kDa | 1478 | 40334_at | 204459_at |
| 10 | CstF-77, cleavage stimulatory factor subunit 3, 77 kDa | 1479 | 41183_at | 203947_at |
| 11 | τCstF-64, cleavage stimulatory factor subunit 2, 64 kDa, tau variant | 23283 | 41248_at | 212901_s_at |
| 12 | CFIIA, HEAB, ATP/GTP binding protein, component of CFIIAm [52] | 10978 | 33149_at | 204370_at |
| 13 | PCF11, pre-mRNA cleavage complex II protein [53] | 51585 | 41665_at | 203378_at |
| 14 | PABPII, poly(A) binding protein, nuclear 1 | 8106 | 39050_at | 201544_x_at |
| 15 | Symplekin [1,2] | 8189 | 32402_s_at | 32402_s_at |
| 16 | HNRPF, heterogeneous nuclear ribonucleoprotein F [5] | 3185 | 38071_at | 201376_s_at |
| 17 | HNRPH, heterogeneous nuclear ribonucleoprotein H1 (H) [6] | 3187 | 41292_at | 201031_s_at |
| 18 | HNRPH', heterogeneous nuclear ribonucleoprotein H2 (H') [6] | 3188 | 41131_f_at | 201132_at |
| 19 | U2AF65, U2 small nuclear RNA auxiliary factor2 [7] | 11338 | 32556_at | 218381_s_at |
| 20 | U1A, U1 small nuclear ribonucleoprotein polypeptide A [8,10] | 6626 | 40842_at | 201770_at |
| 21 | PC4, transcriptional coactivator [3] | 10923 | 36171_at | 212857_x_at |
| 22 | Similar to HSPC182 protein, human HomoloGene of yeast Ssu72 [4] | 286528 | NA | NA |
| 23 | SRp20 [12] | 6428 | 351_f_at | 202899_s_at |
| 24 | PTB, polypyrimidine tract binding protein, also known as hnRNP I [11] | 5725 | 40593_at | 202189_x_at |
| 25 | nPTB, polypyrimidine tract binding protein 2 | 58155 | NA | 218683_at |
| 26 | PAP, poly(A) polymerase | 10914 | 34855_at | 209388_at |
References supporting the role of a given factor in mRNA polyadenylation can be found in review articles Proudfoot et al. [54], Zhao et al. [21], and Edwalds-Gilbert et al. [15], if not otherwise noted. Some genes do not have corresponding probe sets on a microarray, as indicated by NA. *Gene IDs are NCBI Entrez Gene IDs [55].
Figure 3Gene expression of polyadenylation factors. (a) Two-way hierarchical clustering of the U95Av2 data using 21 polyadenylation factors. (b) Two-way hierarchical clustering of the U133A data using 23 polyadenylation factors. See Table 1 for polyadenylation factors in each dataset. Tissues and genes that are consistently clustered together in both datasets are marked by gray lines. (c) Correlation of mRNA expression levels of 21 polyadenylation-related factors across 25 human tissues (upper diagonal, data from U133A; lower diagonal, data from U95Av2). Based on the scale displayed on top of the figure, small squares are colored to represent the extent of correlation between mRNA expression levels of the 21 genes in each pair of human tissues. DRG, dorsal root ganglion.
Figure 4Boxplots of mRNA expression of several factors in brain and other tissues. (a) PTB, (b) U1A, (c) τCstF-64, (d) PC4, and (e) nPTB. All factors except nPTB were present in both the U95Av2 and U133A datasets. Brain tissues include amygdala, thalamus, caudate nucleus, fetal brain, and whole brain. Expression values lower in brain tissues than other tissues are in green; and those higher in brain tissues are in red.
Figure 5Brain-specific over-represented cis elements. (a) Schematic of the four poly(A) regions investigated. (b) Identified cis elements in -100/-41. (c) Identified cis elements in -40/-1. (d) Identified cis elements in +1/+40. Logos are shown for cis elements. Under each logo is the percent of hits for the corresponding cis element in poly(A) sites preferred in brain tissues (red), poly(A) sites not preferred in brain tissues (green), and all poly(A) sites (black). In the graphs, the y-axis is the percent of hits, and the x-axis is the distance to the poly(A) site. Horizontal dotted lines are the average value, and vertical dotted lines are the -100, -40, +40, +100 nt positions.