| Literature DB >> 25707505 |
Yu-Ting Tseng, Wenyuan Li, Ching-Hsien Chen, Shihua Zhang, Jeremy J W Chen, Xianghong Zhou, Chun-Chi Liu.
Abstract
BACKGROUND: Protein-protein interactions (PPIs) are key to understanding diverse cellular processes and disease mechanisms. However, current PPI databases only provide low-resolution knowledge of PPIs, in the sense that "proteins" of currently known PPIs generally refer to "genes." It is known that alternative splicing often impacts PPI by either directly affecting protein interacting domains, or by indirectly impacting other domains, which, in turn, impacts the PPI binding. Thus, proteins translated from different isoforms of the same gene can have different interaction partners.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25707505 PMCID: PMC4331710 DOI: 10.1186/1471-2164-16-S2-S10
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The screenshots of the isoform interaction and module search function in the IIIDB. (A) Users can upload their own gene expression data for III prediction. (B) IIIDB shows top 100 IIIs and uses these interactions to construct an III network. Users can download complete prediction result as a file. (C) In interaction search function, users can input a gene symbol or gene ID to search associated IIIs and isoform modules. The interaction search section provides interface on searching high-confidence (score > 2.575) or low-confidence (score > 1.692) prediction. The resulting III table for interaction search shows full evidence values including Pearson correlations for 19 RNA-seq datasets and domain interaction score. (D) The resulting page for module search includes the user friendly network graph and pathway/GO enrichment analysis result.
Figure 2We performed logistic regression with 19 RNA-seq datasets and the domain-domain interaction database to construct IIIDB. In RNA-seq data processing, Bowtie2 and eXpress were used to calculate isoform expressions. To confirm with PPI, given an III prediction I1 and I2, we only keep this isoform interaction if the gene symbols of I1 and I2 have PPI in IntAct database.
19 RNA-seq datasets from SRA
| ID | SRA ID | # Exp | Title |
|---|---|---|---|
| ERP000546 | 48 | Illumina bodyMap2 transcriptome | |
| SRP005169 | 41 | Widespread splicing changes in human brain development and aging | |
| SRP005408 | 31 | Gene expression profile in postmortem hippocampus using RNAseq for addicted human samples | |
| SRP010280 | 31 | Integrative genome-wide analysis reveals cooperative regulation of alternative splicing by hnRNP proteins | |
| SRP002628 | 30 | Comparative transcriptomic analysis of prostate cancer and matched normal tissue using RNA-seq | |
| ERP000550 | 29 | Complete transcriptomic landscape of prostate cancer in the Chinese population using RNA-seq | |
| SRP005242 | 21 | A Comparison of Single Molecule and Amplification Based Sequencing of Cancer Transcriptomes: RNA-Seq Comparison | |
| SRP002079 | 20 | GSE20301: Dynamic transcriptomes during neural differentiation of human embryonic stem cells | |
| ERP000992 | 18 | The effect of estrogen and progesterone and their antagonists in Ishikawa cell line compared to MCF7 and T47D cells | |
| SRP000727 | 16 | Alternative Isoform Regulation in Human Tissue Transcriptomes | |
| SRP007338 | 16 | GSE30017: Widespread regulated alternative splicing of single codons accelerates proteome evolution | |
| SRP010166 | 16 | GSE34914: Deep Sequence Analysis of non-small cell lung cancer: Integrated analysis of gene expression, alternative splicing, and single nucleotide variations in lung adenocarcinomas with and without oncogenic KRAS mutations | |
| ERP000710 | 12 | Transciptome profiling of ovarian cancer cell lines | |
| SRP005411 | 11 | RNA-Seq Quantification of the Complete Transcriptome of Genes Expressed in the Small Airway Epithelium of Nonsmokers and Smokers | |
| SRP006731 | 11 | GSE29155: RNA-Seq anlalysis of prostate cancer cell lines using Next Generation Sequencing | |
| SRP013224 | 11 | GSE38006: Next-generation sequencing reveals HIV-1-mediated suppression of T cell activation and RNA processing and the regulation of non-coding RNA expression in a CD4+ T cell line | |
| ERP000418 | 10 | Gene expression profiles between normal and breast tumor genomes | |
| ERP000573 | 10 | RNA and chromatin structure | |
| SRP010483 | 10 | GSE35296: The human pancreatic islet transcriptome: impact of pro-inflammatory cytokines |
Figure 3Precision and recall curve for logistic regression model. At recall 15%, logistic regression model achieve 60% precision (High-confidence prediction); at recall 68%, logistic regression model achieve 40% precision (Low-confidence prediction).
The enrichment rate of isoform modules based on GO and pathway enrichments
| Isoform modules | # modules | % modules enriched with GOa | % modules enriched with pathwayb |
|---|---|---|---|
| MODES modules | 1025 | 88.7% | 36.1% |
| Randomization | 1025 | 49.7% | 10.1% |
a The modules enriched with GO term (P-value < 0.001).
b The modules enriched with pathway (P-value < 0.01).
BCL2L1 isoform interaction partners.
| Isoform | RefSeq ID | mRNA length | Protein length | UniProt ID | Interaction partner (high confidence) |
|---|---|---|---|---|---|
| NM_138578 | 2575 | 233 | Q07817-1 | BAK1 BAX NLRP1 | |
| NM_001191 | 2386 | 170 | Q07817-2 | BCL2 |
Figure 4Two isoform modules of RASSF1. (A) RASSF1 has four isoforms (isoform A, B, C and D), of which two isoforms belong to module Mod-162 and Mod-384. (B) The isoform module Mod-162 includes RASSF1 isoform A (NM_007182), RASSF5 and STK4. (C) The isoform module Mod-384 includes RASSF1 isoform C (NM_170713), RASSF3, RASSF4, SAV1, STK3 and STK4.