Literature DB >> 30265558

Structure and Protein Interaction-Based Gene Ontology Annotations Reveal Likely Functions of Uncharacterized Proteins on Human Chromosome 17.

Chengxin Zhang, Xiaoqiong Wei1, Gilbert S Omenn, Yang Zhang.   

Abstract

Understanding the function of human proteins is essential to decipher the molecular mechanisms of human diseases and phenotypes. Of the 17 470 human protein coding genes in the neXtProt 2018-01-17 database with unequivocal protein existence evidence (PE1), 1260 proteins do not have characterized functions. To reveal the function of poorly annotated human proteins, we developed a hybrid pipeline that creates protein structure prediction using I-TASSER and infers functional insights for the target protein from the functional templates recognized by COFACTOR. As a case study, the pipeline was applied to all 66 PE1 proteins with unknown or insufficiently specific function (uPE1) on human chromosome 17 as of neXtProt 2017-07-01. Benchmark testing on a control set of 100 well-characterized proteins randomly selected from the same chromosome shows high Gene Ontology (GO) term prediction accuracies of 0.69, 0.57, and 0.67 for molecular function (MF), biological process (BP), and cellular component (CC), respectively. Three pipelines of function annotations (homology detection, protein-protein interaction network inference, and structure template identification) have been exploited by COFACTOR. Detailed analyses show that structure template detection based on low-resolution protein structure prediction made the major contribution to the enhancement of the sensitivity and precision of the annotation predictions, especially for cases that do not have sequence-level homologous templates. For the chromosome 17 uPE1 proteins, the I-TASSER/COFACTOR pipeline confidently assigned MF, BP, and CC for 13, 33, and 49 proteins, respectively, with predicted functions ranging from sphingosine N-acyltransferase activity and sugar transmembrane transporter to cytoskeleton constitution. We highlight the 13 proteins with confident MF predictions; 11 of these are among the 33 proteins with confident BP predictions and 12 are among the 49 proteins with confident CC. This study demonstrates a novel computational approach to systematically annotate protein function in the human proteome and provides useful insights to guide experimental design and follow-up validation studies of these uncharacterized proteins.

Entities:  

Keywords:  COFACTOR; I-TASSER; chromosome 17; human proteome; neXtProt protein existence levels; structure-based function annotation; uPE1 proteins with unknown function

Mesh:

Substances:

Year:  2018        PMID: 30265558      PMCID: PMC6438760          DOI: 10.1021/acs.jproteome.8b00453

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  42 in total

1.  SPICKER: a clustering approach to identify near-native protein folds.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  J Comput Chem       Date:  2004-04-30       Impact factor: 3.376

2.  How significant is a protein structure similarity with TM-score = 0.5?

Authors:  Jinrui Xu; Yang Zhang
Journal:  Bioinformatics       Date:  2010-02-17       Impact factor: 6.937

3.  pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination.

Authors:  Anna Lobley; Michael I Sadowski; David T Jones
Journal:  Bioinformatics       Date:  2009-05-07       Impact factor: 6.937

4.  COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information.

Authors:  Chengxin Zhang; Peter L Freddolino; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

5.  COFACTOR: an accurate comparative algorithm for structure-based protein function annotation.

Authors:  Ambrish Roy; Jianyi Yang; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2012-05-08       Impact factor: 16.971

6.  STRING v10: protein-protein interaction networks, integrated over the tree of life.

Authors:  Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

7.  A Novel Approach to High-Quality Postmortem Tissue Procurement: The GTEx Project.

Authors:  Latarsha J Carithers; Kristin Ardlie; Mary Barcus; Philip A Branton; Angela Britton; Stephen A Buia; Carolyn C Compton; David S DeLuca; Joanne Peter-Demchok; Ellen T Gelfand; Ping Guan; Greg E Korzeniewski; Nicole C Lockhart; Chana A Rabiner; Abhi K Rao; Karna L Robinson; Nancy V Roche; Sherilyn J Sawyer; Ayellet V Segrè; Charles E Shive; Anna M Smith; Leslie H Sobin; Anita H Undale; Kimberly M Valentino; Jim Vaught; Taylor R Young; Helen M Moore
Journal:  Biopreserv Biobank       Date:  2015-10       Impact factor: 2.300

8.  The Putative SLC Transporters Mfsd5 and Mfsd11 Are Abundantly Expressed in the Mouse Brain and Have a Potential Role in Energy Homeostasis.

Authors:  Emelie Perland; Emilia Lekholm; Mikaela M Eriksson; Sonchita Bagchi; Vasiliki Arapi; Robert Fredriksson
Journal:  PLoS One       Date:  2016-06-07       Impact factor: 3.240

9.  The neXtProt knowledgebase on human proteins: 2017 update.

Authors:  Pascale Gaudet; Pierre-André Michel; Monique Zahn-Zabal; Aurore Britan; Isabelle Cusin; Marcin Domagalski; Paula D Duek; Alain Gateau; Anne Gleizes; Valérie Hinard; Valentine Rech de Laval; JinJin Lin; Frederic Nikitin; Mathieu Schaeffer; Daniel Teixeira; Lydie Lane; Amos Bairoch
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

10.  LOMETS: a local meta-threading-server for protein structure prediction.

Authors:  Sitao Wu; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2007-05-03       Impact factor: 16.971

View more
  12 in total

Review 1.  Progress on Identifying and Characterizing the Human Proteome: 2018 Metrics from the HUPO Human Proteome Project.

Authors:  Gilbert S Omenn; Lydie Lane; Christopher M Overall; Fernando J Corrales; Jochen M Schwenk; Young-Ki Paik; Jennifer E Van Eyk; Siqi Liu; Michael Snyder; Mark S Baker; Eric W Deutsch
Journal:  J Proteome Res       Date:  2018-08-23       Impact factor: 4.466

2.  Detecting Gene Ontology misannotations using taxon-specific rate ratio comparisons.

Authors:  Xiaoqiong Wei; Chengxin Zhang; Peter L Freddolino; Yang Zhang
Journal:  Bioinformatics       Date:  2020-08-15       Impact factor: 6.937

3.  Functions of Essential Genes and a Scale-Free Protein Interaction Network Revealed by Structure-Based Function and Interaction Prediction for a Minimal Genome.

Authors:  Chengxin Zhang; Wei Zheng; Micah Cheng; Gilbert S Omenn; Peter L Freddolino; Yang Zhang
Journal:  J Proteome Res       Date:  2021-01-04       Impact factor: 4.466

4.  Progress on Identifying and Characterizing the Human Proteome: 2019 Metrics from the HUPO Human Proteome Project.

Authors:  Gilbert S Omenn; Lydie Lane; Christopher M Overall; Fernando J Corrales; Jochen M Schwenk; Young-Ki Paik; Jennifer E Van Eyk; Siqi Liu; Stephen Pennington; Michael P Snyder; Mark S Baker; Eric W Deutsch
Journal:  J Proteome Res       Date:  2019-09-13       Impact factor: 4.466

5.  Probing the antioxidant activity of functional proteins and bioactive peptides in Hermetia illucens larvae fed with food wastes.

Authors:  Jiaxin Lu; Yuwen Guo; Atif Muhmood; Bei Zeng; Yizhan Qiu; Pan Wang; Lianhai Ren
Journal:  Sci Rep       Date:  2022-02-18       Impact factor: 4.379

Review 6.  I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction.

Authors:  Xiaogen Zhou; Wei Zheng; Yang Li; Robin Pearce; Chengxin Zhang; Eric W Bell; Guijun Zhang; Yang Zhang
Journal:  Nat Protoc       Date:  2022-08-05       Impact factor: 17.021

7.  Research on the Human Proteome Reaches a Major Milestone: >90% of Predicted Human Proteins Now Credibly Detected, According to the HUPO Human Proteome Project.

Authors:  Gilbert S Omenn; Lydie Lane; Christopher M Overall; Ileana M Cristea; Fernando J Corrales; Cecilia Lindskog; Young-Ki Paik; Jennifer E Van Eyk; Siqi Liu; Stephen R Pennington; Michael P Snyder; Mark S Baker; Nuno Bandeira; Ruedi Aebersold; Robert L Moritz; Eric W Deutsch
Journal:  J Proteome Res       Date:  2020-10-19       Impact factor: 4.466

8.  Blinded Testing of Function Annotation for uPE1 Proteins by I-TASSER/COFACTOR Pipeline Using the 2018-2019 Additions to neXtProt and the CAFA3 Challenge.

Authors:  Chengxin Zhang; Lydie Lane; Gilbert S Omenn; Yang Zhang
Journal:  J Proteome Res       Date:  2019-10-18       Impact factor: 4.466

Review 9.  A high-stringency blueprint of the human proteome.

Authors:  Subash Adhikari; Edouard C Nice; Eric W Deutsch; Lydie Lane; Gilbert S Omenn; Stephen R Pennington; Young-Ki Paik; Christopher M Overall; Fernando J Corrales; Ileana M Cristea; Jennifer E Van Eyk; Mathias Uhlén; Cecilia Lindskog; Daniel W Chan; Amos Bairoch; James C Waddington; Joshua L Justice; Joshua LaBaer; Henry Rodriguez; Fuchu He; Markus Kostrzewa; Peipei Ping; Rebekah L Gundry; Peter Stewart; Sanjeeva Srivastava; Sudhir Srivastava; Fabio C S Nogueira; Gilberto B Domont; Yves Vandenbrouck; Maggie P Y Lam; Sara Wennersten; Juan Antonio Vizcaino; Marc Wilkins; Jochen M Schwenk; Emma Lundberg; Nuno Bandeira; Gyorgy Marko-Varga; Susan T Weintraub; Charles Pineau; Ulrike Kusebauch; Robert L Moritz; Seong Beom Ahn; Magnus Palmblad; Michael P Snyder; Ruedi Aebersold; Mark S Baker
Journal:  Nat Commun       Date:  2020-10-16       Impact factor: 14.919

10.  Crowdsourcing biocuration: The Community Assessment of Community Annotation with Ontologies (CACAO).

Authors:  Jolene Ramsey; Brenley McIntosh; Daniel Renfro; Suzanne A Aleksander; Sandra LaBonte; Curtis Ross; Adrienne E Zweifel; Nathan Liles; Shabnam Farrar; Jason J Gill; Ivan Erill; Sarah Ades; Tanya Z Berardini; Jennifer A Bennett; Siobhan Brady; Robert Britton; Seth Carbon; Steven M Caruso; Dave Clements; Ritu Dalia; Meredith Defelice; Erin L Doyle; Iddo Friedberg; Susan M R Gurney; Lee Hughes; Allison Johnson; Jason M Kowalski; Donghui Li; Ruth C Lovering; Tamara L Mans; Fiona McCarthy; Sean D Moore; Rebecca Murphy; Timothy D Paustian; Sarah Perdue; Celeste N Peterson; Birgit M Prüß; Margaret S Saha; Robert R Sheehy; John T Tansey; Louise Temple; Alexander William Thorman; Saul Trevino; Amy Cheng Vollmer; Virginia Walbot; Joanne Willey; Deborah A Siegele; James C Hu
Journal:  PLoS Comput Biol       Date:  2021-10-28       Impact factor: 4.779

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.