| Literature DB >> 26391769 |
Xiang Jia Min1,2, Brian Powell3, Jonathan Braessler3, John Meinken4,3,5, Feng Yu3, Gaurav Sablok6.
Abstract
BACKGROUND: Protein functional diversity at the post-transcriptional level is regulated through spliceosome mediated pre-mRNA alternative splicing (AS) events and that has been widely demonstrated to be a key player in regulating the functional diversity in plants. Identification and analysis of AS genes in cereal crop plants are critical for crop improvement and understanding regulatory mechanisms.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26391769 PMCID: PMC4578763 DOI: 10.1186/s12864-015-1914-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of raw sequence data and assembled data in each organism
| Species | ESTs | mRNAs | Total Sequences | Cleaned Sequences | Total PUTs | Average Length (bp) |
|---|---|---|---|---|---|---|
|
| 987327 | 82451 | 1069868 | 1053842 | 163778 | 783 |
|
| 207012 | 11953 | 219065 | 212768 | 102424 | 751 |
|
| 209835 | 33248 | 243083 | 241690 | 60189 | 1002 |
|
| 2019524 | 91990 | 2111514 | 1822653 | 488243 | 466 |
PUTs putative unique transcripts
Percentage of alternative splicing genes
| Total mapped PUTs (%) | PUT match to gene model | Total unique genes | AS genes | AS (%) | |
|---|---|---|---|---|---|
|
| 104447 (63.8) | 71830 | 26191 | 7883 | 30.1 |
|
| 47843 (46.7) | 36467 | 17402 | 2414 | 13.9 |
|
| 50224 (83.4) | 38654 | 26540 | 3580 | 13.5 |
|
| 207332 (42.5) | 119418 | 28698 | 9689 | 33.8 |
AS Alternative splicing
Alternative splicing events in different cereal species
| Species | IR(%) | AltD(%) | AltA (%) | ES (%) | Complex event (%) | Total events | Total AS genes |
|---|---|---|---|---|---|---|---|
|
| 8288 (42.0) | 1245 (6.3) | 1950 (9.9) | 762 (3.9) | 7447 (37.8) | 19692 | 8734 |
|
| 2193 (55.8) | 332 (8.5) | 576 (14.7) | 161 (4.1) | 665 (16.9) | 3927 | 2657 |
|
| 4448 (23.5) | 1072 (5.7) | 1230 (6.5) | 507 (2.6) | 11681 (61.7) | 18938 | 3971 |
|
| 11048 (40.4) | 2080 (7.6) | 3314 (11.4) | 1568 (5.7) | 5576 (20.4) | 23386 | 10687 |
IR Intron Retention, AltD Alternative donor, AltA Alternative acceptor, ES exon skipping
Exon and intron size in cereal plants
| Exon | Intron | |||||
|---|---|---|---|---|---|---|
| Sample size | Average size (bp) | SD | Sample Size | Average size (bp) | SD | |
|
| 127627 | 180 | 261 | 180575 | 440 | 695 |
|
| 52330 | 133 | 113 | 79735 | 434 | 703 |
|
| 106753 | 179 | 222 | 144860 | 422 | 747 |
|
| 137020 | 142 | 133 | 209139 | 554 | 1057 |
SD Standard deviation
Fig. 1Distribution of internal exon size: The x-axis indicates the size of internal exons. Bin sizes are right inclusive (e.g., bin 100 comprises sequences of lengths 1–100 bp). The y-axis indicates the frequency of internal exons. The inset shows a detailed distribution of small internal exons
Fig. 2Distribution of intron size: The x-axis indicates the size of introns. Bin sizes are right inclusive (e.g., bin 100 comprises sequences of lengths 1 –100 bp). The y-axis indicates the frequency of introns. The inset shows a detailed distribution of small introns
Protein family classification of alternative genes in cereal plants
| PFAM | Domain |
|
|
|
| Putative Functions |
|---|---|---|---|---|---|---|
| pfam00069 | Pkinase | 205 | 228 | 55 | 74 | Protein kinase domain |
| pfam00076 | RRM_1 | 112 | 61 | 32 | 43 | RNA recognition motif |
| pfam07714 | Pkinase_Tyr | 88 | 79 | 12 | 25 | Protein tyrosine kinase |
| pfam13639 | zf-RING_2 | 53 | 28 | 8 | 15 | Ring finger domain |
| pfam00067 | p450 | 45 | 56 | 4 | 37 | Cytochrome P450 |
| pfam00481 | PP2C | 45 | 25 | 11 | 11 | Protein phosphatase 2C |
| pfam00249 | Myb_DNA-binding | 44 | 14 | 7 | 22 | Myb-like DNA-binding domain |
| pfam00179 | UQ_con | 43 | 17 | 10 | 7 | Ubiquitin-conjugating enzyme |
| pfam00010 | HLH | 41 | 5 | 1 | 7 | Helix-loop-helix DNA-binding domain |
| pfam00071 | Ras | 38 | 20 | 11 | 7 | Ras family |
| pfam00141 | peroxidase | 37 | 30 | 11 | 31 | Peroxidase |
| pfam00153 | Mito_carr | 35 | 24 | 7 | 14 | Mitochondrial carrier protein |
| pfam01559 | Zein | 35 | 0 | 0 | 0 | Zein seed storage protein |
| pfam01490 | Aa_trans | 33 | 12 | 1 | 7 | Transmembrane amino acid transporter protein |
| pfam02365 | NAM | 33 | 33 | 8 | 14 | No apical meristem (NAM) protein |
| pfam00125 | Histone | 31 | 9 | 3 | 5 | Core histone H2A/H2B/H3/H4 |
| pfam01370 | Epimerase | 31 | 26 | 8 | 22 | NAD dependent epimerase/dehydratase family |
| pfam00083 | Sugar_tr | 30 | 22 | 7 | 10 | Sugar (and other) transporter |
| pfam00847 | AP2 | 30 | 9 | 3 | 9 | AP2 domain |
| pfam00106 | adh_short | 29 | 25 | 10 | 15 | short chain dehydrogenase |
| pfam00657 | Lipase_GDSL | 29 | 5 | 1 | 16 | GDSL-like Lipase/Acylhydrolase |
| pfam00085 | Thioredoxin | 28 | 14 | 6 | 11 | Thioredoxin |
| pfam00226 | DnaJ | 28 | 18 | 9 | 9 | DnaJ domain |
| pfam03151 | TPT | 27 | 9 | 2 | 6 | Triose-phosphate Transporter family |
| pfam00004 | AAA | 26 | 18 | 6 | 14 | ATPase family associated with various cellular |
| pfam00270 | DEAD | 24 | 21 | 5 | 8 | DEAD/DEAH box helicase |
| pfam00504 | Chloroa_b-bind | 24 | 19 | 11 | 20 | Chlorophyll A-B binding protein |
| pfam02309 | AUX_IAA | 24 | 13 | 5 | 5 | AUX/IAA family |
| pfam00149 | Metallophos | 23 | 9 | 3 | 13 | Calcineurin-like phosphoesterase |
| pfam00134 | Cyclin_N | 22 | 9 | 2 | 4 | Cyclin |
| pfam00450 | Peptidase_S10 | 21 | 18 | 6 | 18 | Serine carboxypeptidase |
| pfam03106 | WRKY | 21 | 22 | 7 | 7 | WRKY DNA -binding domain |
| pfam13041 | PPR_2 | 21 | 30 | 1 | 12 | PPR repeat family |
| Total | 6900 | 4939 | 1362 | 2890 |
Note: a complete list is shown in Additional file 1: Table S1
Classification of biological processes based on Gene Ontology (GO)
Classification of molecular functions based on Gene Ontology (GO)
Fig. 3Conserved alternative splicing genes in rice (Oryza sativa) ssp japonica, rice ssp indica, sorghum (Sorghum bicolor), and maize (Zea mays) plants
Conserved alternative splicing genes among five monocot plants
|
|
| Z. mays | S. bicolor |
| CDD/Pfam | ||
|---|---|---|---|---|---|---|---|
| Osi19962 | Osj954 | Zm92934 | Sb6267 | Bd2565 | pfam03171 | 2OG-FeII_Oxy | 2OG-Fe(II) oxygenase superfamily |
| Osi18787 | Osj44013 | Zm40020 | Sb12294 | Bd28385 | pfam00004 | AAA | ATPase family associated with various cellular |
| Osi6875 | Osj22392 | Zm88316 | 45969421 | Bd7352 | pfam00248 | Aldo_ket_red | Aldo/keto reductase family |
| Osi9356 | Osj41340 | Zm162 | Sb17314 | Bd6214 | pfam00248 | Aldo_ket_red | Aldo/keto reductase family |
| CX100091 | Osj15328 | Zm35072 | Sb10885 | Bd29210 | pfam00439 | Bromodomain | Bromodomain |
| Osi12568 | Osj24409 | Zm100060 | Sb8817 | Bd24009 | pfam05042 | Caleosin | Caleosin related protein |
| CT843009 | Osj14649 | Zm32705 | Sb6709 | Bd10918 | pfam00571 | CBS | CBS domain |
| Osi524 | CT828785.1 | Zm73067 | Sb4586 | Bd10523 | pfam04733 | Coatomer_E | Coatomer epsilon subunit |
| Osi21096 | Osj16673 | FL103380 | 2.42E + 08 | Bd4031 | pfam07876 | Dabb | Stress responsive A/B Barrel Domain |
| Osi8549 | Osj47391 | Zm69871 | Sb334 | Bd7166 | pfam05605 | Di19 | Drought induced 19 protein (Di19) |
| CT833644.1 | CI258157 | Zm20082 | Sb10226 | Bd7036 | pfam05057 | DUF676 | Putative serine esterase (DUF676) |
| Osi21136 | Osj16693 | Zm46142 | Sb13903 | Bd7810 | pfam05623 | DUF789 | Protein of unknown function (DUF789) |
| Osi19974 | Osj14932 | Zm70017 | Sb10575 | Bd3731595 | pfam00676 | E1_dh | Dehydrogenase E1 component |
| Osi1759 | Osj22934 | Zm35625 | Sb15873 | Bd7027 | pfam01370 | Epimerase | NAD dependent epimerase/dehydratase family |
| CT842225 | Osj27697 | Zm91971 | Sb4930 | Bd268 | pfam00316 | FBPase | Fructose-1-6-bisphosphatase |
| Osi20900 | Osj16392 | Zm58947 | Sb3303 | Bd7531597 | pfam00210 | Ferritin | Ferritin-like domain |
| Osi339 | Osj20205 | Zm20714 | Sb12056 | Bd6374 | pfam00762 | Ferrochelatase | Ferrochelatase |
| Osi11082 | Osj491 | Zm59942 | Sb3313 | Bd27405 | pfam00125 | Histone | Core histone H2A/H2B/H3/H4 |
| Osi13655 | Osj36042 | Zm81325 | Sb15256 | Bd28446 | pfam00403 | HMA | Heavy-metal-associated domain |
| Osi11360 | Osj36865 | Zm38497 | Sb20674 | Bd9583 | pfam00447 | HSF_DNA-bind | HSF-type DNA-binding |
| Osi17520 | Osj35947 | Zm27347 | Sb9471 | Bd7833 | pfam01156 | IU_nuc_hydro | Inosine-uridine preferring nucleoside |
| Osi13902 | Osj25885 | Zm23750 | Sb12436 | Bd28318 | pfam00013 | KH_1 | KH domain |
| Osi11280 | Osj28328 | Zm35841 | Sb9907 | Bd13744 | cd00116 | LRR_RI | Leucine-rich repeats (LRRs) |
| CT844279 | CB642464 | Zm3338 | Sb7337 | Bd28467 | pfam01717 | Meth_synt_2 | Cobalamin-independent synthase |
| Osi1437 | Osj37916 | Zm4695 | Sb5119 | Bd7994 | pfam00635 | Motile_Sperm | MSP (Major sperm protein) domain |
| Osi231 | Osj25397 | Zm37411 | Sb10332 | Bd28960 | pfam14360 | PAP2_C | PAP2 superfamily C-terminal |
| Osi8815 | Osj32580 | Zm61468 | Sb11226 | Bd6619 | pfam01195 | Pept_tRNA_hydro | Peptidyl-tRNA hydrolase |
| Osi16666 | Osj19199 | Zm104454 | Sb12015 | Bd16056 | pfam00450 | Peptidase_S10 | Serine carboxypeptidase |
| Osi12736 | Osj14309 | Zm22618 | Sb7927 | Bd8683 | pfam00141 | peroxidase | Peroxidase |
| Osi833 | Osj39350 | Zm92939 | Sb19533 | Bd5931597 | pfam00069 | Pkinase | Protein kinase domain |
| Osi3301 | Osj17780 | Zm29726 | Sb14730 | Bd29285 | pfam00069 | Pkinase | Protein kinase domain |
| Osi6061 | Osj15126 | Zm59883 | Sb673 | Bd15932 | PLN02756 | PLN02756 | S-methyl-5-thioribose kinase |
| Osi6187 | Osj42201 | Zm39790 | Sb2138 | Bd8363 | pfam00348 | polyprenyl_synt | Polyprenyl synthetase |
| Osi13092 | Osj21144 | Zm33939 | Sb5787 | Bd23758 | pfam14299 | PP2 | Phloem protein 2 |
| Osi11891 | NM_001070568.2 | Zm87952 | Sb2001 | Bd2595 | pfam00854 | PTR2 | POT family |
| Osi20788 | Osj19691 | Zm39384 | 30944654 | Bd10083 | pfam07992 | Pyr_redox_2 | Pyridine nucleotide-disulphide |
| CT837906.1 | Osj7689 | Zm101865 | Sb5520 | Bd25885 | pfam00719 | Pyrophosphatase | Inorganic pyrophosphatase |
| Osi21504 | Osj17274 | Zm6058 | Sb11340 | Bd21664 | pfam00072 | Response_reg | Response regulator receiver domain |
| Osi15366 | Osj24220 | Zm5068 | Sb10671 | Bd23705 | pfam02453 | Reticulon | Reticulon |
| Osi9029 | Osj47510 | Zm24118 | Sb11323 | Bd8231593 | pfam03214 | RGP | Reversibly glycosylated polypeptide |
| Osi5643 | Osj25267 | Zm80771 | Sb227 | Bd11010 | pfam01246 | Ribosomal_L24e | Ribosomal protein L24e |
| Osi8310 | Osj36859 | Zm101179 | Sb11303 | Bd6311 | pfam00076 | RRM_1 | RNA recognition motif |
| Osi1456 | Osj43479 | Zm371 | Sb12579 | Bd15819 | pfam00076 | RRM_1 | RNA recognition motif |
| Osi773 | Osj43052 | Zm24001 | Sb2305 | Bd20070 | pfam00464 | SHMT | Serine hydroxymethyltransferase |
| Osi9812 | Osj14203 | Zm39491 | 2.42E + 08 | Bd28258 | pfam01406 | tRNA-synt_1e | tRNA synthetases class I (C) catalytic |
| Osi9653 | Osj44577 | Zm33457 | Sb5144 | Bd6360 | pfam00443 | UCH | Ubiquitin carboxyl-terminal hydrolase |
| Osi2251 | Osj35805 | Zm98577 | 2.42E + 08 | Bd20683 | pfam12076 | Wax2_C | WAX2 C-terminal domain |
| Osi15508 | Osj14495 | Zm95 | Sb10474 | Bd4536 | pfam05495 | zf-CHY | CHY zinc finger |
| Osi21052 | Osj4519 | Zm39479 | Sb9831 | Bd24331 | No Pfam predicted | ||
| Osi8778 | Osj195 | Zm100142 | Sb1070 | Bd2265 | No Pfam predicted | ||
| Osi20728 | Osj16408 | Zm34294 | 57806619 | Bd19455 | No Pfam predicted | ||
| Osi17233 | Osj20996 | Zm100171 | Sb1504 | Bd16477 | No Pfam predicted | ||
| CT830510.1 | Osj3010 | Zm96019 | Sb2210 | Bd12504 | No Pfam predicted | ||