Literature DB >> 27142340

Genome-Wide Functional Annotation of Human Protein-Coding Splice Variants Using Multiple Instance Learning.

Bharat Panwar1, Rajasree Menon1, Ridvan Eksi1, Hong-Dong Li1, Gilbert S Omenn1, Yuanfang Guan1.   

Abstract

The vast majority of human multiexon genes undergo alternative splicing and produce a variety of splice variant transcripts and proteins, which can perform different functions. These protein-coding splice variants (PCSVs) greatly increase the functional diversity of proteins. Most functional annotation algorithms have been developed at the gene level; the lack of isoform-level gold standards is an important intellectual limitation for currently available machine learning algorithms. The accumulation of a large amount of RNA-seq data in the public domain greatly increases our ability to examine the functional annotation of genes at isoform level. In the present study, we used a multiple instance learning (MIL)-based approach for predicting the function of PCSVs. We used transcript-level expression values and gene-level functional associations from the Gene Ontology database. A support vector machine (SVM)-based 5-fold cross-validation technique was applied. Comparatively, genes with multiple PCSVs performed better than single PCSV genes, and performance also improved when more examples were available to train the models. We demonstrated our predictions using literature evidence of ADAM15, LMNA/C, and DMXL2 genes. All predictions have been implemented in a web resource called "IsoFunc", which is freely available for the global scientific community through http://guanlab.ccmb.med.umich.edu/isofunc .

Entities:  

Keywords:  ADAM15; DMXL2; IsoFunc; LMNA/C; RNA-seq; alternative splicing; functional annotation; gene ontology (GO); multiple instance learning (MIL); protein-coding splice variant (PCSV); support vector machine (SVM)

Mesh:

Substances:

Year:  2016        PMID: 27142340     DOI: 10.1021/acs.jproteome.5b00883

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  13 in total

1.  Annotation of Alternatively Spliced Proteins and Transcripts with Protein-Folding Algorithms and Isoform-Level Functional Networks.

Authors:  Hongdong Li; Yang Zhang; Yuanfang Guan; Rajasree Menon; Gilbert S Omenn
Journal:  Methods Mol Biol       Date:  2017

2.  DeepIsoFun: a deep domain adaptation approach to predict isoform functions.

Authors:  Dipan Shaw; Hao Chen; Tao Jiang
Journal:  Bioinformatics       Date:  2019-08-01       Impact factor: 6.937

3.  Trim33 is required for appropriate development of pre-cardiogenic mesoderm.

Authors:  Sudha Rajderkar; Jeffrey M Mann; Christopher Panaretos; Kenji Yumoto; Hong-Dong Li; Yuji Mishina; Benjamin Ralston; Vesa Kaartinen
Journal:  Dev Biol       Date:  2019-03-30       Impact factor: 3.582

4.  Metrics for the Human Proteome Project 2016: Progress on Identifying and Characterizing the Human Proteome, Including Post-Translational Modifications.

Authors:  Gilbert S Omenn; Lydie Lane; Emma K Lundberg; Ronald C Beavis; Christopher M Overall; Eric W Deutsch
Journal:  J Proteome Res       Date:  2016-09-20       Impact factor: 4.466

5.  IsoResolve: predicting splice isoform functions by integrating gene and isoform-level features with domain adaptation.

Authors:  Hong-Dong Li; Changhuo Yang; Zhimin Zhang; Mengyun Yang; Fang-Xiang Wu; Gilbert S Omenn; Jianxin Wang
Journal:  Bioinformatics       Date:  2021-05-01       Impact factor: 6.937

6.  A High-Resolution Genome-Wide CRISPR/Cas9 Viability Screen Reveals Structural Features and Contextual Diversity of the Human Cell-Essential Proteome.

Authors:  Thierry Bertomeu; Jasmin Coulombe-Huntington; Andrew Chatr-Aryamontri; Karine G Bourdages; Etienne Coyaud; Brian Raught; Yu Xia; Mike Tyers
Journal:  Mol Cell Biol       Date:  2017-12-13       Impact factor: 4.272

7.  Progress in the Chromosome-Centric Human Proteome Project as Highlighted in the Annual Special Issue IV.

Authors:  Young-Ki Paik; Christopher M Overall; Eric W Deutsch; William S Hancock; Gilbert S Omenn
Journal:  J Proteome Res       Date:  2016-11-04       Impact factor: 4.466

8.  The impact of the RBM4-initiated splicing cascade on modulating the carcinogenic signature of colorectal cancer cells.

Authors:  Jung-Chun Lin; Yuan-Chii Lee; Yu-Chih Liang; Yang C Fann; Kory R Johnson; Ying-Ju Lin
Journal:  Sci Rep       Date:  2017-03-09       Impact factor: 4.379

9.  Tissue-specific mouse mRNA isoform networks.

Authors:  Gaurav Kandoi; Julie A Dickerson
Journal:  Sci Rep       Date:  2019-09-27       Impact factor: 4.379

10.  Assessing the functional relevance of splice isoforms.

Authors:  Fernando Pozo; Laura Martinez-Gomez; Thomas A Walsh; José Manuel Rodriguez; Tomas Di Domenico; Federico Abascal; Jesús Vazquez; Michael L Tress
Journal:  NAR Genom Bioinform       Date:  2021-05-22
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.