| Literature DB >> 20805997 |
Fei Wang1, Shi Huang, Long Ma.
Abstract
RNA splicing generates multiple transcript isoforms from a single gene and enhances the complexity of eukaryotic gene expression. In some eukaryotes, operon exists as an ancient regulatory mechanism of gene expression that requires strict positional and regulatory relationships among its genes. It remains unknown whether operonic genes generate transcript isoforms in a similar manner as non-operonic genes do, the expression of which is less likely limited by their positions and relationships with surrounding genes. We analyzed the number of transcript isoforms of Caenorhabditis elegans operonic genes and found that C. elegans operons contain a much higher proportion of genes with multiple transcript isoforms than non-operonic genes do. For genes that express multiple transcript isoforms, there is no apparent difference between the number of isoforms in operonic and non-operonic genes. C. elegans operonic genes also have a different preference of the 20 most common 3' splice sites compared to non-operonic genes. Our analyses suggest that C. elegans operons enhance expression complexity by increasing the proportion of genes that express multiple transcript isoforms and maintain splicing efficiency by differential use of common 3' splice sites.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20805997 PMCID: PMC2929210 DOI: 10.1371/journal.pone.0012456
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1C. elegans operons contain a higher proportion of genes that express multiple transcript isoforms.
(A) C. elegans operonic genes express more transcript isoforms per gene than non-operonic genes do. (B) C. elegans operons contain a higher proportion of genes that express multiple transcript isoforms than non-operonic genes do. (C) Alternatively spliced C. elegans operonic genes and non-operonic genes have a similar number of transcript isoforms per gene. Z-test was performed (Figure 1A and 1C) to evaluate the significance of difference between the means of transcript numbers. Error bars represent standard deviations.
The numbers of genes and transcripts we analyzed.
| Whole-genome | Non-operon | Operon | ||||
| Genes | Transcripts | Genes | Transcripts | Genes | Transcripts | |
|
| 20109 | 20109 | 18369 | 18369 | 1740 | 1740 |
|
| 4248 | 11832 | 3106 | 8732 | 1142 | 3100 |
|
| 24357 | 31941 | 21475 | 27101 | 2882 | 4840 |
Genes and annotated transcripts were downloaded from WormMart and processed with MS Excel.
The proportions and numbers of the 20 most frequently used 3′ splice sites in different groups of genes.
| Proportions and numbers of the top 20 3' splice sites in each group of genes | |||||||
| Whole-genome | Non-operon | Operon | |||||
| 3' splice sites | Ratio | n | Ratio | n | Ratio | n |
|
| ttttcag | 0.261 | 27890 | 0.256 | 23202 | 0.291 | 4688 |
|
| atttcag | 0.14 | 14919 | 0.139 | 12639 | 0.141 | 2280 |
|
| tttccag | 0.0857 | 9149 | 0.0835 | 7567 | 0.0983 | 1582 |
|
| tttgcag | 0.0359 | 3827 | 0.0341 | 3094 | 0.0456 | 733 |
|
| tttttag | 0.0346 | 3695 | 0.0355 | 3220 | 0.0295 | 475 |
|
| gtttcag | 0.034 | 3617 | 0.0348 | 3155 | 0.0287 | 462 |
|
| ctttcag | 0.0329 | 3508 | 0.0337 | 3050 | 0.0285 | 458 |
|
| tttacag | 0.0295 | 3152 | 0.0291 | 2640 | 0.0318 | 512 | 0.18 |
| attccag | 0.0295 | 3147 | 0.0298 | 2703 | 0.0276 | 444 | 0.18 |
| attttag | 0.0218 | 2327 | 0.0223 | 2025 | 0.0188 | 302 |
|
| attacag | 0.0161 | 1721 | 0.0162 | 1471 | 0.0155 | 250 | 0.37 |
| aattcag | 0.0153 | 1633 | 0.0154 | 1395 | 0.0148 | 238 | 0.36 |
| tattcag | 0.0137 | 1457 | 0.0138 | 1255 | 0.0126 | 202 | 0.33 |
| attgcag | 0.0129 | 1378 | 0.0131 | 1188 | 0.0118 | 190 | 0.04 |
| cttccag | 0.0115 | 1230 | 0.0119 | 1080 | 0.00932 | 150 | 0.03 |
| tttctag | 0.00949 | 1012 | 0.00978 | 886 | 0.00783 | 126 | 0.02 |
| ttttaag | 0.00866 | 924 | 0.00887 | 804 | 0.00746 | 120 | 0.23 |
| tcttcag | 0.00838 | 894 | 0.00841 | 762 | 0.00821 | 132 | 0.96 |
| tgttcag | 0.008 | 854 | 0.00822 | 745 | 0.00678 | 109 |
|
| tttatag | 0.00735 | 783 | 0.00737 | 668 | 0.00715 | 115 | 0.96 |
|
|
|
|
|
|
|
| NA |
Total numbers of each 3′ splice sites were calculated as described in Methods. The proportions of the top 20 sites were presented as a percentage of all identified 3′ splice sites in the groups of genes specified. Pairwise Z-test was performed for each 3′ splice site to test the signficance of difference between proportions of operonic and non-operonic genes. p≤0.01 is the confidence level.
Figure 2Common 3′ splice sites are used differentially by C. elegans operonic genes.
The proportions of each 3′ splice site (X axis) of operonic and non-operonic genes were compared to that of all genes of the whole genome and were presented as fold changes (Y axis). Pairwise Z-test was performed (see Table 2) to evaluate the significance of difference between the proportions of each 3′ splice site in operonic genes and non-operonic genes. *: p≤0.01.