| Literature DB >> 20657786 |
David A de Lima Morais1, Paul M Harrison.
Abstract
BACKGROUND: Alternatively-spliced (AS) forms can vary protein function, intracellular localization and post-translational modifications. AS coupled with mRNA nonsense-mediated decay (NMD) can also control the transcript abundance. Here, we have investigated the genome-scale conservation of alternatively-spliced NMD candidates (AS-NMD candidates), in mammals. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2010 PMID: 20657786 PMCID: PMC2908137 DOI: 10.1371/journal.pone.0011695
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of the AS-NMD candidates.
| Genomes | (1) AS events | (2) EST/cDNA transcripts supporting the AS | (3) Genes with an AS-NMD candidate | (4) Number of (1) that are | (5) Proportion of (3) that have orthologs | (6) AS-NMD candidate genes with an orthologous AS-NMD candidate in the human genome |
|
| 2536 | 14343 | 2051 (9.8%) | 12/2536 (0.5%) | -------- | -------- |
|
| 1211 | 1628 | 1069 (4.6%) | 23/1211 (1.9%) | 970/1069 (91%) | 262/970 (27%) ∇ |
|
| 166 | 190 | 149 (0.84%) | 6/166 (3.6%) | 129/149 (87%) | 45/129 (34%) ∇ |
|
| 374 | 600 | 342 (1.8%) | 4/374 (1.1%) | 297/342 (87%) | 105/297 (35%) ∇ |
Percentage of the total number of known genes based on Ensembl build 52.
Percentage of retained introns whose length is a multiple of 3.
Orthology was based on the relationship one-to-one between a human transcript and a transcript from the other genome according to Ensembl annotation.
∇Significant by a hypergeometric probability test with P ≤10−10. We calculated the probability of picking the observed number of AS-NMD candidates in the other mammals that are conserved in human, from the total population of human genes.
Summary of the AS-NMD candidates involving intron retentions (IRs).
| Genomes | (1) EST/cDNA transcripts supporting the IR | (2) Genes with an AS-NMD candidate involving an IR | (3) IRs with lengths that are multiples of 3 | (4) Orthologs | (5) AS-NMD candidate genes with an orthologous AS-NMD candidate in the human genome | (6) Proportion of cases in (5) that are in the same position in the gene |
|
| 2117 | 884 | 37% | -------- | -------- | -------- |
|
| 725 | 475 | 28% | 440 | 153/475 (35%) ∇∇∇ | 20/153 (13%) |
|
| 123 | 95 | 36% | 92 | 32/95 (35%) ∇ | 7/32 (22%) |
|
| 428 | 231 | 30% | 211 | 85/231 (40%) ∇∇ | 17/85 (20%) |
Percentage of the total number of known genes based on Ensembl build 52.
Percentage of retained introns whose length is a multiple of 3.
The percentage is based on the number of AS-NMD candidates with an orthologous AS-NMD candidates in the human genome (Table 1).
Orthology was based on the relationship one-to-one between a human transcript and a transcript from the other genome according to Ensembl annotation.
∇ Significant by a hypergeometric probability test with P ≤10−10. We calculated the probability of picking the observed number of AS-NMD candidates in the other mammals that are conserved in human, from the total population of human genes.
Number of events per genome.
| Events |
|
|
|
|
| IR | 884 (43%) | 475 (44%) | 95 (63%) | 231 (67%) |
| CSE | 444 (21%) | 137 (12%) | 10 (6%) | 27 (8%) |
| ASD | 592 (29%) | 247 (23%) | 26 (17%) | 49(14%) |
| ASA | 616 (30%) | 352 (32%) | 35 (23%) | 67 (19%) |
| Avg. IR size | 259 nt | 171 nt | 189 nt | 188 nt |
| % id. with a human Ortholog | -------- | 84% | 85% | 88% |
IR (intron retained); CSE (Cassette exon); ASD (alternative splice donor); ASA (alternative splice acceptor).
Values based on the number of genes.
The percentage does not add up to 100 since one gene can have more than one event simultaneously.
Values based on protein sequence.
Figure 1Mean GC content of retained introns, exons bordering retained introns and non-retained introns in human AS-NMD candidates.
The data sets was divided into five categories to avoid bias due to their length. The data set of ‘Non-retained introns’ was generated by randomly picking a non-retained intron from the set of genes that contain retained introns, so that the same number of sequences is in each bin. In addition, we included bars in the plot for 'Retained introns that are conserved between human and mouse.
Codon usage comparison of retained introns, exons and non-retained introns in human AS-NMD candidates.
| Retained introns and Non-retained introns | Retained introns and Exons | Non-retained introns and Exons | |
| Codon table (Average χ2) | 1324±282 | 950±86 | 2531±310 |
*Comparison of the entire codon usage table (61 codons, 60 degrees of freedom) among three datasets.
Retained introns (IRs) with a premature stop codon that not target the mRNA for NMD.
|
|
|
|
| |
| Retained introns | 5029 | 970 | 261 | 522 |
| Number of genes with a IR. | 3111 | 770 | 205 | 416 |
| Avg. size of IR. | 421 nt | 180 nt | 210 nt | 202 nt |
| IR (3 | 33.5% | 34% | 36% | 35% |
Percentage of retained introns whose length is multiple of 3.
Most represented GO terms per genome.
|
|
|
|
|
| GO Molecular Function Level 3 | |||
| Hydrolase activity (GO:0016787) Nucleotide binding (GO:0000166) Lyase activity (GO:0016829) Oxidoreductase activity (GO:0016491) GTPase regulator activity (GO:0030695) | Nucleotide binding (GO:0000166) Hydrolase activity (GO:0016787) Transferase activity (GO:0016740) Helicase activity (GO:0004386) | Transferase activity (GO:0016740) | Structural constituent of ribosome (GO:0003735) |
| GO Molecular Function Level 4 | |||
| Transferase activity, transferring phosphorus-containing groups (GO:0016772) Purine nucleotide binding (GO:0017076) Translation factor activity, nucleic acid binding (GO:0008135) Carbon-carbon lyase activity (GO:0016830) RNA binding (GO:0003723) | Purine nucleotide binding (GO:0017076) Transferase activity, transferring phosphorus-containing groups (GO:0016772) Hydrolase activity, acting on acid anhydrides (GO:0016817) Ligase activity, forming carbon-oxygen bonds (GO:0016875) RNA binding (GO:0003723) | Hydrolase activity, acting on glycosyl bonds (GO:0016798) | Purine nucleotide binding (GO:0017076) |
Statistically significant (but for P value non-corrected for multiple hypotheses only).
Comparison between human AS-NMD candidates with and without matches in PRIDE database.
|
| AS events | ESTs/cDNAs/mRNAs supporting the AS | Number Genes | Orthologs in another genome | AS-NMD candidates with an orthologous AS-NMD candidate in another genome | AS-NMD candidates with a IR |
| AS-NMD candidate with match in PRIDE | 205 | 385 | 194 | 159 (mouse) 139 (rat) 156 (cow) | 30 (mouse) 5 (rat) 13 (cow) | 110 |
| AS-NMD candidate without match | 2331 | 13958 | 1857 | 1560 (mouse) 1425 (rat) 1478 (cow) | 232 (mouse) 40 (rat) 92 (cow) | 884 |
Figure 2The RPS3 gene.
(A) Conservation of the gene RPS13 among human, mouse and cow. Yellow bars, UTRs; Blue bars exons; Orange bars, Intron; Green bar, repetitive elements. The conserved retained intron is indicated (braces). (B) Alignment of UTR-Exon1-Intron1-Exon2-Intron2. The start and end of the conserved intron is indicated by vertical bars.
Figure 3Percentage of identity in a window of 20 nucleotides of five pairs of AS-NMD candidates orthologs between human and mouse.
The red bar above each graph represents the retained intron in the mouse AS-NMD candidate (IR = retained intron in the human genome).