| Literature DB >> 24244432 |
Hongyan Wang1, Jianguo Wen, Chung-Che Chang, Xiaobo Zhou.
Abstract
More and more transcription factors and their motifs have been reported and linked to specific gene expression levels. However, focusing only on transcription is not sufficient for mechanism research. Most genes, especially in eukaryotes, are alternatively spliced to different isoforms. Some of these isoforms increase the biodiversity of proteins. From this viewpoint, transcription and splicing are two of important mechanisms to modulate expression levels of isoforms. To integrate these two kinds of regulation, we built a linear regression model to select a subset of transcription factors and splicing factors for each co-expressed isoforms using least-angle regression approach. Then, we applied this method to investigate the mechanism of myelodysplastic syndromes (MDS), a precursor lesion of acute myeloid leukemia. Results suggested that expression levels of most isoforms were regulated by a set of selected regulatory factors. Some of the detected factors, such as EGR1 and STAT family, are highly correlated with progression of MDS. We discovered that the splicing factor SRSF11 experienced alternative splicing switch, and in turn induced different amino acid sequences between MDS and controls. This splicing switch causes two different splicing mechanisms. Polymerase Chain Reaction experiments also confirmed that one of its isoforms was over-expressed in MDS. We analyzed the regulatory networks constructed from the co-expressed isoforms and their regulatory factors in MDS. Many of these networks were enriched in the herpes simplex infection pathway which involves many splicing factors, and pathways in cancers and acute or chronic myeloid leukemia.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24244432 PMCID: PMC3828332 DOI: 10.1371/journal.pone.0079118
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flowchart of proposed method for constructing regulatory networks.
Flowchart of proposed method for constructing regulatory networks: (A). Process raw RNA-seq data, find out deferentially expressed isoforms using Tophat and Cufflinks and cluster these isoforms to get gene cluster that may be regulated by same TFs and SFs. (B). Construct two dataset, promote region data (PRD) and exon-intron data (EID), for mining the interaction strength of the TF-isoform interactions and SF-isoform interaction. (C). Use interaction strength to predict the expression levels of isoforms in a co-expressed group. (D). Link model-selected TFs and SFs with their target genes.
Regulatory networks found by our model.
| No | Target Genes | Transcription Factors | Splicing Factors |
|
| GNB1, GPATCH4, MRPL51, BRD4, ODC1, ATP5J, HNRNPD, RBBP7, CALM3, ANP32C, PSPC1, SETD3 | Blimp-1, FOX factors, FOXP1a, Sp1, STAT5B | MBNL1, PSF, Sam68, SF2, SRp38, YB-1 |
|
| PARK7, KDM1A, SSR2, DNTT, HNRNPA1L2, SUPT16H, SLIRP, YWHAE, TAF15, TOP2A, SRSF1, ATP5A1, EEF1B2, HMGN1, U2SURP, HSD17B4, CANX, MAP7D3, HNRNPM, BCL11A | Egr-1, Evi-1, LXR direct repeat 4, MEF-2, Nkx2-5, Pax-6, RSRFC4, Sp1 | ETR-3, hnRNPD, hnRNPI(PTB), hnRNPQ, MBNL1, PSF, SF2, SRp38, SRp40, hnRNPC/C1/C2 |
|
| TARDBP, HDAC1, ARHGDIB, ESD, CX-CR4, TTC1, HLA-DPB1, ANP32B, ATP1A1, ARPC5 | ATF6, COUPTF, FOX factors, FOXP1d, IRF, Nkx3-1 | SRp40, hnRNPE1/E2 |
|
| PRDM2,GPBP1L1,ZRANB2,DNTTIP2,SMG7,EIF4G2,PHF21A,RBM4B,GNPTAB,UFM1,ERCC5,SF3B1,CD47,CDV3,BDP1,DDAH2,SNX3,PPIL4,KIT,PCMTD1,RABGAP1,DDX3X,C16orf80 | Blimp-1, Egr-1, Evi-1, FOX factors, KROX, NIT2, Pax-6, POU3F2, RSRFC4, SRF | hnRNPQ, KSRP, SF1, SRp38, SRp54, SRp55, TIA-1, TIAL1, YB-1, hnRNPA1/A2, hnRNPE1/E2 |
|
| SRRM1, JUN, PRDX6, DUSP10, RBM34, HSP90AA1, SRRM2, RPS7, RPL3, RPL32, RPS20 | Blimp-1, HNF-1alpha, KROX, MEF-2, Nkx2-5, Sp1 | SRp20, SRp54, YB-1 |
|
| TMEM50A, DNAJC8, ZNHIT6, MRPL9, DDX21, RPL36, RPL13A, NOP58, YWHAB, SREK1, ZMAT2, EIF2AK1, RBM3, UPF3B, HMGB2, SRF | Blimp-1, Egr-1, ER-alpha, Evi-1, LXR direct repeat 4, Nkx2-5, Pax-6, Pbx1a, PLZF | SC35, SF2, SRp40, TIA-1, hnRNPA1/A2 |
|
| STMN1, ATPIF1, CFL1, APLP2, SARNP, MLEC, POMP, ERH, FUS, NME1, GNAI2, HNRNPAB, DEK, HIST1H2BC, HIST1H2AC, HIST1H4H, SEC61G, HIST1H2BK | AP-2alphaA, Egr-1, HOXA5 (Hox-1.3), MCM1, MTF-1, NF-Y, Pax-6, PLZF, RXR-alpha, STAT1 | hnRNPF, hnRNPI(PTB), MBNL1, SRp30c, SRp55, TIAL1 |
|
| SRSF4, NASP, SERBP1, CSTF3, SSRP1, ERP29, RAN, HMGB1, SRP14, PSMA4, NUTF2, RPS16, VAMP8, HSPE1, CCT5, TAF9, HIST1H2BD, HIST1H4E, TUBB, HNRNPA2B1, PPIA, FKBP3 | AP-2alphaA, Egr-1, ER-alpha, Evi-1, GLI1, IRF, MCM1, NIT2, Nkx2-5, Pbx1a, RSRFC4, STAT1, STAT5A (homotetramer), UF1H3BETA | 9G8, hnRNPI(PTB), HTra2alpha, SRp30c, TIAL1 |
|
| KHDRBS1, PIP4K2A, RPLP2, SMARCC2, RBM25, RFX7, YWHAQ, XRCC5, VDAC1, GNB2L1, SHFM1, SMC1A, HDGF, LSM14A, MAT2A | AP-2alphaA, ATF6, Egr-1, MTF-1, Nkx2-5, STAT5B, ZID | 9G8, HTra2alpha, MBNL1, PSF, SC35, SRp30c |
|
| SFPQ, SRSF11, BCAS2, TFAM, ZFP91, CSDA, NGRN, RPS15A, RPL26, NACA2, EEF2, DDX18, SF3B1, MAPRE1, SATB1, RPL24, SMARCA5, RUFY1, NUP153, DDX39B, RPL10A, HOXA7, RPL7, TCEAL4, RBMX, SYF2 | ER-alpha, GLI1, KAISO, mat1-Mc, MIG1, MTF-1, Nkx2-5, Pax-6, PLZF, RSRFC4, Sp1, SRF, STAT1, ZID | 9G8, DAZAP1, hnRNPF, HTra2beta1, KSRP, MBNL1, SRp30c, TIA-1, hnRNPC/C1/C2, hnRNPE1/E2 |
|
| THRAP3, PSMB4, ARF3, TCF25, GABARAP, TAF15, CHMP2A, GAR1, C9orf78, ACIN1, UBB, BASP1 | Egr-1, Gfi1b, NF-Y, STAT5B, UF1H3BETA | PSF, SC35, SRp40, SRp55 |
|
| GNL2, EIF3M, CD44, FNBP4, OSBPL8, NUFIP2, STK17B, SYPL1, MDM4, ANKRD36B, IER2 | FOXO3a, POU3F2, TBX5 | 9G8, ETR-3, hnRNPF, hnRNPP(TLS), SRp20, hnRNPH1/H2/H3 |
|
| NASP, UQCRH, ZRANB2, NUCKS1, FKBP4, ANP32A, EIF3C, PTMA, SUMO3, HNRNPA2B1, TXN, MYL12B | AIRE, COUP-TF1, Egr-1, Evi-1, GLI1, MTF-1, NF-kappaB, STAT5A (homodimer), UF1H3BETA | Sam68 |
|
| EPS15, RAB13, ERLIN1, CAT, CWC15, RPL6, EID1, SEC11A, CKLF, RPL23, RPL19, FBXO7, EIF3L, POLR2B, RPL34, RPS3A, RPS6, CNTRL, EIF2S3, IK, HMGN4 | AIRE, COUPTF, Egr-1, Evi-1, FOXP1a, HNF-1alpha, LXR direct repeat 4, Nkx2-2, Nkx2-5, PLZF, Sp1, SRF, PPARalpha:RXRalpha, PPARG | hnRNPP(TLS), KSRP, MBNL1, SF2, SRp55 |
|
| FUBP1, UFC1, HNRNPU, CHD4, ETV6, FGFR1OP2, PFDN5, NCKAP1L, NACA, NAP1L1, CTCF, GLOD4, ZNF830, RPL38, SAFB2, CEBPZ, CEP63, USO1, HNRNPD, PPWD1, NSA2, RPS23, HINT1, SKP1, RPS12, EIF3H, CEP350, ARID4A, PRPF40A, ST13, STXBP3, SERTAD2, CNBP | AIRE, AP-2, ATF6, Blimp-1, Egr-1, Evi-1, KROX, LXR direct repeat 4, MEF-2, Nkx2-5, 1-Oct, POU3F2, RSRFC4, Sp1, SRF, ZBRK1 | 9G8, hnRNPK, hnRNPP(TLS), hnRNPQ, hnRNPU, SRp20, SRp30c, SRp38, SRp40, TIA-1, YB-1, hnRNPA1/A2, hnRNPC/C1/C2, hnRNPE1/E2 |
|
| PHTF1, H3F3C, TRIAP1, PMAIP1, KLF2, EEF1B2, IL8, HNRPDL, ARRDC3, FKBP5, NAMPT | HNF4, HSF1, MafA, MIG1, PLZF, Sp1, ZID | 9G8, KSRP |
|
| MAN1A2, IFI16, UPF2, FGFR1OP2, FLT3, TERF2IP, LRRFIP1, EGR1, MLL5, TMEM66, IRF2BP2, ITSN2, PRRC2C, STT3B, CD164, SPRY1 | Egr-1, Nkx2-5, p53, RSRFC4, SRF, STAT1, STAT5B (homodimer), UF1H3BETA | 9G8, MBNL1, TIA-1, hnRNPA1/A2, hnRNPC/C1/C2, hnRNPH1/H2/H3 |
|
| HIST2H2BE, SLC2A3, SRSF5, TEC, PTP4A1, ESYT2, LYN, SMARCA2, JMJD1C, PIM3, KLF10 | Egr-1, Nkx2-5, Pbx1a, Sp1 | PSF |
|
| ANP32E, ILF2, COX4I1, AATF, MPO, SNRPD1, FBL, RPS19, SNRPD2, RPL18, PPM1G, RPS21, GCFC1, CCDC72, H2AFZ, COX6C, DKC1 | LXR direct repeat 4, Nkx2-5, 1-Oct, PPARalpha:RXRalpha, REST, STAT5A (homodimer), STAT5B, TBX5, UF1H3BETA | hnRNPD, hnRNPU, HTra2alpha, SRp40, SRp55 |
|
| GATAD2B, GDI2, APBB1IP, RPS13, HNRNPA1, LTA4H, MFAP1, ME2, MYADM, SEPT2, C21orf59, RUNX1, RPL35A, SNX2, HSPA9, PAIP2, LARS, SOX4, POLR2K, STRBP, RPL7A, SLC25A5, NRD1, LARP1, C19orf77, MRPL3, PNRC2, MTIF3 | ATF6, Egr-1, ER-alpha, HNF-1alpha, IRF-1, MEF-2, MTF-1, Nkx2-5, PLZF, RSRFC4, Sp1, Sp3, UF1H3BETA, ZID | ETR-3, MBNL1, SRp38, SRp54, SRp55, TIAL1, hnRNPA1/A2, hnRNPE1/E2 |
|
| SELL, TMEM123, DDX6, TMBIM6, CASC4, B2M, CHD2, USP34, SETD2, PAK2, HSD17B11, GOLPH3, DFIP1, WRNIP1, UTRN, VAMP2, PCM1, SORL1, PAN3, ATXN3, XRN2, FBXO11, C4orf3, GAPT | E2F-1:DP-2, Egr-1, GLI1, HNF-1alpha, IRF, IRF-3, KROX, mat1-Mc, MEF-2, MyoD, NF-kappaB, NIT2, Nkx2-5, NRSF, RSRFC4, Sp1, ZID | hnRNPI(PTB), KSRP, SF2, SRp20, SRp30c |
|
| RGS1, PTMS, PARP2, C16orf61, CLK1, EAF2, SLU7, RSRC2, ZCCHC7 | AIRE, Evi-1, FOXO3a, IRF-3, MEF-2, VDR, CAR, PXR | hnRNPD |
|
| RGS2, CD34, RSF1, WNK1, CBFB, CSTB, IQGAP2, GNAI1, LAPTM4B, SH3KBP1, HNRNPM, ARCN1, MARCH6 | AP-2, BLIMP1, KROX, MEF-2, Sp3, STAT5B | Sam68, hnRNPA1/A2 |
|
| HNRNPU, SAP18, RBM26, LEO1, UQCRC2, RPL27, RPS9, DUSP11, DARS, ARPC2, ESF1, SERINC3, PITPNB, HNRNPA0, TCERG1, NPM1, C6orf48, IL2RG, RPS4X, API5, SEP15, RPL23A | Blimp-1, Egr-1, Evi-1, GLI1, MTF-1, NIT2, Nkx2-2, Pax-6, PLZF, ZID | DAZAP1, hnRNPD, hnRNPI(PTB), MBNL1, SRp20, SRp30c, YB-1, hnRNPA1/A2, hnRNPE1/E2 |
|
| EIF4EBP2, CELF1, ATXN7L3B, PRPF8, RPSA, SRP72, DDX46, STK38, SRSF3, NGFRAP1, TOP2B | BLIMP1, COUP-TF1, Egr-1, HNF-1alpha, IRF, IRF-1, SRF, ZID | YB-1 |
|
| PDLIM1, LMO2, C11orf67, SLC38A1, DUSP6, ROCK2, KIF2A, PHF3, CD164, TAX1BP1, SEPT6, DDX1 | Egr-1, HNF-1alpha, RXR-alpha | ETR-3, hnRNPK, SRp55 |
|
| PSMA1, NEDD8, ADAM10, PSMC5, UBE2G2, PDCD10, SEC62, CWC27, LCP2, XPO7, PEBP1 | AIRE, ATF6, FOXO1, Gfi1b, 1-Oct, Sp1 | hnRNPF, SRp20, TIA-1 |
|
| NXF1, FAU, TAF1D, PPTC7, PABPC3, ELF1, CHD9, JUND, LAPTM4A, RANBP2, HAT1, UBE2D3, SBDS, FLJ44635, TSC22D3, HSPH1, KPNB1 | ER-alpha, ER-beta, GLI1, KROX, Nkx2-5, NRSF, Staf | HTra2alpha, Sam68, SRp30c, SRp38, SRp55, hnRNPA1/A2 |
|
| CCND2, METAP2, ARGLU1, MIS18BP1, CCNK, PSME3, RPL38, GLTSCR2, RPS11, HADHA, RPL31, HSPD1, RPL37A, NOP56, BRK1, RPL37, UBE2V2 | AIRE, AP-2alphaA, dl, Egr-1, HNF-1alpha-A, 1-Oct, Olf-1, p53, Pax-6, Pbx1a, RSRFC4, Sp1 | SC35, SRp38, hnRNPE1/E2 |
|
| ITM2B, SCAPER, PPIG, MRFAP1L1, RASGEF1B, AIF1, RNPC3, H3F3B, BPTF, AFF1, B3GNT5 | Alx-4, Blimp-1, BLIMP1, HNF-1alpha, KROX, Sp1 | hnRNPF, SRp40, hnRNPA1/A2 |
|
| NIN, ZNF609, ZFP36L2, SOD1, NOL7, LTB, ITGA6, WDR77, PIM1, SEPT7 | ABF, Antp, COUP-TF1, ID1, KAISO, myogenin/NF-1, RORalpha1 | hnRNPK |
The first column is differentially expressed isoform groups in our cases, the second and third columns are the Transcription factors and splicing factors predicted by our regression model. Some cells are blank, which means no corresponding factors for that co-expressed group.
Names and description of splicing factors used in our model.
| Splicing Factors | Gene Name | Description |
|
| SRSF7 | Splicing factor arginine/serine-rich 7. The shuttling protein 9G8 binds TAP and can function as export factors. |
|
| CUGBP1 | CUG triplet repeat RNA binding protein 1. CUGBP1 induces exon 5 inclusion in cTNT gene (PMID: 9563950), induces exon 11 exclusion in IR gene (PMID: 11528389), induces intron 2 retention in CIC-1 gene (PMID: 12150906). |
|
| DAZAP1 | DAZ associated protein 1. |
|
| CUGBP2 | CUG triplet repeat RNA binding protein 2. ETR-3 induces exon 5 inclusion in cTNT gene (PMID: 11931771), induces exon 9 inclusion in CFTR gene (PMID: 15657417), promotes selectively the exclusion of Tau exon 2 (PMID: 16862542). |
|
|
| Heterogeneous nuclear ribonucleoprotein A1. hnRNP A1 carries bidirectional shuttling signals that serve for both nuclear localization and export (PMID: 8521471) |
|
| HNRNPA2B1 | Heterogeneous nuclear ribonucleoprotein A2/B1. hnRNP A2 is involved in cytoplasmic RNA transport (PMID: 11024030). |
|
| HNRNPC | Heterogeneous nuclear ribonucleoprotein C. Tetramer composed of 3 copies of isoform C1 and 1 copy of isoform C2. hnRNP C proteins are restricted to the nucleus because they bear a nuclear retention sequence (NRS) (PMID: 8830767). |
|
| HNRNPC | Heterogeneous nuclear ribonucleoprotein C. Isoform C1 is due to Alternative Splicing. |
|
| HNRNPC | Heterogeneous nuclear ribonucleoprotein C. Isoform C2 is due to Alternative Splicing. |
|
| HNRNPD | Heterogeneous nuclear ribonucleoprotein D. |
|
| HNRNPD | Heterogeneous nuclear ribonucleoprotein D. Isoform D0 is due to Alternative Splicing. |
|
| HNRNPDL | Heterogeneous nuclear ribonucleoprotein D-like. |
|
| PCBP1 | Holy(rC) binding protein 1. |
|
| PCBP2 | Holy(rC) binding protein 2. |
|
| HNRNPF | Heterogeneous nuclear ribonucleoprotein F. |
|
| HNRNPH1 | Heterogeneous nuclear ribonucleoprotein H1. |
|
| HNRNPH2 | Heterogeneous nuclear ribonucleoprotein H2. |
|
| HNRNPH3 | Heterogeneous nuclear ribonucleoprotein H3. |
|
| PTBP1 | Polypyrimidine tract binding protein 1. In the context of CALCA gene, PTB enhances exon 4 inclusion (PMID: 9858533). nPTB functionally compensates for PTB and is up-regulated when PTB is removed (PMID:17679092). |
|
| HNRNPK | Heterogeneous nuclear ribonucleoprotein K. isoform J is due to Alternative Splicing. |
|
| HNRNPK | Heterogeneous nuclear ribonucleoprotein K. hnRNP K carries bidirectional shuttling signals that serve for both nuclear localization and export (PMID: 9218800). |
|
| HNRNPM | Heterogeneous nuclear ribonucleoprotein M. |
|
| FUS | Fusion (involved in t (12, 16) in malignant liposarcoma). |
|
| SYNCRIP | Synaptotagmin binding cytoplasmic RNA interacting protein. |
|
| HNRNPU | Heterogeneous nuclear ribonucleoprotein U (scaffold attachment factor A). |
|
| TRA2A | Transformer-2 alpha. |
|
| SRSF10 | Splicing factor arginine/serine-rich 10. |
|
| KHSRP | KH-type splicing regulatory protein. |
|
| MBNL1 | Muscleblind-like. MBNL proteins can act as activators or repressors of splicing on different pre-mRNAs (PMID: 15257297). MBNLs are dsRNA binding factors that can bind CUG or CCUG repeats (PMID: 14722159). |
|
| SFPQ | Splicing factor proline/glutamine-rich (polypyrimidine tract binding protein associated). |
|
| RBM25 | RNA binding motif protein 25. RBM25 stimulated proapoptotic Bcl-X(s) isoform through weak 5′ss selection in EX2 (PMID: 18663000). |
|
| RBM4 | RNA binding motif protein 4. RBM4 induce exon inclusion of alpha-TM EX9a and EX2b (PMID: 16260624) and tau EX10 (PMID: 16777844). |
|
| RBM5 | RNA binding motif protein 5. |
|
| KHDRBS1 | KH domain containing RNA binding signal transduction associated 1. |
|
| SRSF2 | Splicing factor arginine/serine-rich 2. SC35 accelerates transcriptional elongation (co-transcriptional splicing) (PMID: 18641664). |
|
| SF1 | Splicing factor 1. Gomafu lncRNA UACUAAC repeats bind to mouse SF1 with a higher affinity than the mammalian branch point consensus regulating splicing efficiency by changing the splicing factors nuclear level (PMID: 21463453) |
|
| SRSF1 | Splicing factor arginine/serine-rich 1 (splicing factor 2, alternate splicing factor). The shuttling protein SF2/ASF binds TAP and can function as export factors (18364396). |
|
| SRSF3 | Splicing factor arginine/serine-rich 3. The shuttling protein SRp20 binds TAP and can function as export factors (18364396). |
|
| SRSF9 | Splicing factor arginine/serine-rich 9. |
|
| FUSIP1 | FUS interacting protein (serine/arginine-rich). Dephosphorylation converts SRp38 to a splicing repressor (PMID: 12419250). SRp38 functions as a general splicing repressor when dephosphorylated, but when phosphorylated it functions as a sequence-specific splicing activator (PMID: 18794844). |
|
| SRSF5 | Splicing factor arginine/serine-rich 5. |
|
| SRSF11 | Splicing factor, arginine/serine-rich 11. |
|
| SRSF6 | Splicing factor arginine/serine-rich 6. |
|
| SRSF4 | Splicing factor arginine/serine-rich 4. |
|
| TARDBP | TAR DNA binding protein. It can act as transcriptional repressor (21252238). |
|
| TIA1 | Cytotoxic granule-associated RNA binding protein. |
|
| TIAL1 | TIA1 cytotoxic granule-associated RNA binding protein-like 1. |
|
| YBX1 | Y box binding protein 1. |
|
| ZRANB2 | Zinc finger, RAN-binding domain containing 2. ZRANB2 (ZNF265) is an SR-like protein that induce exclusion of EX2 and EX3 from the Tra2beta1 pre-mRNA in HEK293 cell (PMID: 11448987). |
This table contains 22 splicing factors which are selected to predict the expression levels of differentially expressed isoforms. This table lists their names and some related references. Most of these details are from SpliceAid.
Pseudocode of LARS algorithm.
| LARS algorithm |
| Data: Normalized expression levels of co-expressed isoforms |
| Output: Regression coefficients |
| All coefficients |
| Active set |
| Find predictor |
| Let direction |
| Repeat |
| Adjust the coefficient in the direction |
| Put |
| Let |
| Until |
Target genes and corresponding factors in networks 1.
| TFs | Coefficients | SFs | Coefficients | Target genes | Adjusted R |
|
| 0.0341 | Sam68 | 1.082 | GNB1 PSPC1ATP5J SETD3 | |
|
| −0.303 | SF2 | 0.106 | GPATCH4 HNRNPD | |
|
| 0.023 | SRp38 | −0.6486 | MRPL51 RBBP7 | 0.991 |
|
| −0.387 | MBNL1 | −1.106 | BRD4 CALM3 | |
|
| −0.012 | PSF | −0.681 | ODC1 ANP32C |
This table lists the target genes and factors that regulate them. The regression coefficients are listed on the right side.
Figure 2Expression ratio of SRSF11’s three isoforms (A), motifs in SRSF11’s isoforms and classical SR proteins (B), RT-PCR results (C) and protein Expression of SRSF11.
(A). Expression ratio of SRSF11’s three isoforms in seven disease sample and control: uc009wbj.1 (light green), uc001deu.2 (light blue) and uc001.dev.3 (light red). They have almost the same total expression levels but very different ratios in MDS (four ) and control (average of five controls), which means the splicing patterns of SRSF11 are switched. (B). This figure demonstrates motifs in SRSF11’s isoforms and classical SR proteins. Different motifs have different bio-function. (C). Three isoforms that are over-expressed in our disease samples are picked up for RT-PCR validation. They are isoforms of three splicing factor, one isoform (uc001deu.2, refseq ID: NM_001190987) of SRSF11, one isoform (uc001xlp.3, refseq ID: NM_006925) of SRSF5 and one isoform (uc003jun.2, refseq ID:NM_080743) of SRSF12. Validation demonstrated that their expression levels in MDS disease are higher than in control. (D). Isofrom uc001deu.2 is translated into protein Q05519 and Q05519 is highly expressed in blood disease according to the Model Organism Protein Expression Database (MOPED); COPD: Chronic obstructive pulmonary disease.
Results of enrichment analysis using KEGG database.
| No. | Pathway | Regulatory Network |
|
| Herpes simplex infection | NT2, NT6, NT7, NT8, NT9, NT10, NT11, NT12, NT13, NT15, NT16, NT18, NT20, NT22, NT25, NT31 |
|
| Pathways in cancer | NT8, NT9, NT14, NT15, NT18, NT20 |
|
| Chronic myeloid leukemia | NT9, NT14, NT18, NT22 |
|
| Maturity onset diabetes of the young | NT15, NT25, NT32 |
|
| Acute myeloid leukemia | NT18, NT20 |
|
| PPAR signaling pathway | NT15, NT20 |
|
| Pertussis | NT8, NT22 |
|
| Transcriptional mis-regulation in cancer | NT15 |
|
| Jak-STAT pathway | NT18 |
This table lists top enriched KEGG pathways and corresponding networks number.
Results of enrichment analysis using GO database.
| Network | Biological Process | P value |
|
| mRNA splicing, via spliceosome | 5.8e-05 |
| leading edge cell differentiation | 2.2e-05 | |
| ERK5 cascade | 3.9e-6 | |
|
| mRNA splice site selection | 8.8e-05 |
|
| regulation of RNA splicing | 1.2e-05 |
| positive regulation of mRNA splicing, via spliceosome | 5.5e-05 | |
| positive regulation of transcription initiation from RNA polymerase II promoter | 7.4e-06 | |
| positive regulation of neural precursor cell proliferation | 3.8e-05 | |
|
| mRNA 5′-splice site recognition | 2.9e-05 |
| regulation of muscle cell differentiation | 1.3e-07 | |
|
| transcription from RNA polymerase II promoter | 3.1e-05 |
| termination of RNA polymerase II transcription | 3.8e-05 | |
| mRNA 3′-end processing | 1.0e-07 | |
| multi-organism reproductive process | 4.7e-06 |
Three networks enriched in some GO biological processes. This table lists the details and the P values.
Figure 3Connected regulatory network for network 18.
Connected regulatory network for network 18. Red nodes are transcription factors, blue nodes are splicing factors and gray nodes in the middle are targets. The connection between targets is from IPA and the connection between factors and targets are from our interaction strength matrix.