Literature DB >> 32935893

Consistency and variation of protein subcellular location annotations.

Ying-Ying Xu1,2,3, Hang Zhou2, Robert F Murphy3, Hong-Bin Shen2.   

Abstract

A major challenge for protein databases is reconciling information from diverse sources. This is especially difficult when some information consists of secondary, human-interpreted rather than primary data. For example, the Swiss-Prot database contains curated annotations of subcellular location that are based on predictions from protein sequence, statements in scientific articles, and published experimental evidence. The Human Protein Atlas (HPA) consists of millions of high-resolution microscopic images that show protein spatial distribution on a cellular and subcellular level. These images are manually annotated with protein subcellular locations by trained experts. The image annotations in HPA can capture the variation of subcellular location across different cell lines, tissues, or tissue states. Systematic investigation of the consistency between HPA and Swiss-Prot assignments of subcellular location, which is important for understanding and utilizing protein location data from the two databases, has not been described previously. In this paper, we quantitatively evaluate the consistency of subcellular location annotations between HPA and Swiss-Prot at multiple levels, as well as variation of protein locations across cell lines and tissues. Our results show that annotations of these two databases differ significantly in many cases, leading to proposed procedures for deriving and integrating the protein subcellular location data. We also find that proteins having highly variable locations are more likely to be biomarkers of diseases, providing support for incorporating analysis of subcellular location in protein biomarker identification and screening.
© 2020 Wiley Periodicals LLC.

Entities:  

Keywords:  Swiss-Prot database; annotation consistency; human protein atlas; location biomarker; protein subcellular location

Mesh:

Substances:

Year:  2020        PMID: 32935893      PMCID: PMC7790864          DOI: 10.1002/prot.26010

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  30 in total

1.  High performance set of PseAAC and sequence based descriptors for protein classification.

Authors:  Loris Nanni; Sheryl Brahnam; Alessandra Lumini
Journal:  J Theor Biol       Date:  2010-06-15       Impact factor: 2.691

2.  Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms.

Authors:  Kuo-Chen Chou; Hong-Bin Shen
Journal:  Nat Protoc       Date:  2008       Impact factor: 13.491

3.  Proteomics. Tissue-based map of the human proteome.

Authors:  Mathias Uhlén; Linn Fagerberg; Björn M Hallström; Cecilia Lindskog; Per Oksvold; Adil Mardinoglu; Åsa Sivertsson; Caroline Kampf; Evelina Sjöstedt; Anna Asplund; IngMarie Olsson; Karolina Edlund; Emma Lundberg; Sanjay Navani; Cristina Al-Khalili Szigyarto; Jacob Odeberg; Dijana Djureinovic; Jenny Ottosson Takanen; Sophia Hober; Tove Alm; Per-Henrik Edqvist; Holger Berling; Hanna Tegel; Jan Mulder; Johan Rockberg; Peter Nilsson; Jochen M Schwenk; Marica Hamsten; Kalle von Feilitzen; Mattias Forsberg; Lukas Persson; Fredric Johansson; Martin Zwahlen; Gunnar von Heijne; Jens Nielsen; Fredrik Pontén
Journal:  Science       Date:  2015-01-23       Impact factor: 47.728

4.  Hum-mPLoc 3.0: prediction enhancement of human protein subcellular localization through modeling the hidden correlations of gene ontology and functional domain features.

Authors:  Hang Zhou; Yang Yang; Hong-Bin Shen
Journal:  Bioinformatics       Date:  2017-03-15       Impact factor: 6.937

5.  A proposal for validation of antibodies.

Authors:  Mathias Uhlen; Anita Bandrowski; Steven Carr; Aled Edwards; Jan Ellenberg; Emma Lundberg; David L Rimm; Henry Rodriguez; Tara Hiltke; Michael Snyder; Tadashi Yamamoto
Journal:  Nat Methods       Date:  2016-09-05       Impact factor: 28.547

6.  Integration of Heterogeneous Experimental Data Improves Global Map of Human Protein Complexes.

Authors:  Jose Lugo-Martinez; Ziv Bar-Joseph; Jörn Dengjel; Robert F Murphy
Journal:  ACM BCB       Date:  2019-09

7.  Learning complex subcellular distribution patterns of proteins via analysis of immunohistochemistry images.

Authors:  Ying-Ying Xu; Hong-Bin Shen; Robert F Murphy
Journal:  Bioinformatics       Date:  2020-03-01       Impact factor: 6.937

8.  A subcellular map of the human proteome.

Authors:  Peter J Thul; Lovisa Åkesson; Mikaela Wiking; Diana Mahdessian; Aikaterini Geladaki; Hammou Ait Blal; Tove Alm; Anna Asplund; Lars Björk; Lisa M Breckels; Anna Bäckström; Frida Danielsson; Linn Fagerberg; Jenny Fall; Laurent Gatto; Christian Gnann; Sophia Hober; Martin Hjelmare; Fredric Johansson; Sunjae Lee; Cecilia Lindskog; Jan Mulder; Claire M Mulvey; Peter Nilsson; Per Oksvold; Johan Rockberg; Rutger Schutten; Jochen M Schwenk; Åsa Sivertsson; Evelina Sjöstedt; Marie Skogs; Charlotte Stadler; Devin P Sullivan; Hanna Tegel; Casper Winsnes; Cheng Zhang; Martin Zwahlen; Adil Mardinoglu; Fredrik Pontén; Kalle von Feilitzen; Kathryn S Lilley; Mathias Uhlén; Emma Lundberg
Journal:  Science       Date:  2017-05-11       Impact factor: 47.728

9.  Automated Learning of Subcellular Variation among Punctate Protein Patterns and a Generative Model of Their Relation to Microtubules.

Authors:  Gregory R Johnson; Jieyue Li; Aabid Shariff; Gustavo K Rohde; Robert F Murphy
Journal:  PLoS Comput Biol       Date:  2015-12-01       Impact factor: 4.475

10.  Analysis of the Human Protein Atlas Image Classification competition.

Authors:  Wei Ouyang; Casper F Winsnes; Martin Hjelmare; Anthony J Cesnik; Lovisa Åkesson; Hao Xu; Devin P Sullivan; Shubin Dai; Jun Lan; Park Jinmo; Shaikat M Galib; Christof Henkel; Kevin Hwang; Dmytro Poplavskiy; Bojan Tunguz; Russel D Wolfinger; Yinzheng Gu; Chuanpeng Li; Jinbin Xie; Dmitry Buslov; Sergei Fironov; Alexander Kiselev; Dmytro Panchenko; Xuan Cao; Runmin Wei; Yuanhao Wu; Xun Zhu; Kuan-Lun Tseng; Zhifeng Gao; Cheng Ju; Xiaohan Yi; Hongdong Zheng; Constantin Kappel; Emma Lundberg
Journal:  Nat Methods       Date:  2019-11-28       Impact factor: 28.547

View more
  2 in total

1.  Convolutional Neural Network-Based Artificial Intelligence for Classification of Protein Localization Patterns.

Authors:  Kaisa Liimatainen; Riku Huttunen; Leena Latonen; Pekka Ruusuvuori
Journal:  Biomolecules       Date:  2021-02-11

2.  Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information.

Authors:  Ge Wang; Yu-Jia Zhai; Zhen-Zhen Xue; Ying-Ying Xu
Journal:  Biomolecules       Date:  2021-10-29
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.