| Literature DB >> 27173522 |
Olivier Sallou1, Paula D Duek2, Thomas A Darde3, Olivier Collin1, Lydie Lane4, Frédéric Chalmel5.
Abstract
Among the 20 000 human gene products predicted from genome annotation, about 3000 still lack validation at protein level. We developed PepPSy, a user-friendly gene expression-based prioritization system, to help investigators to determine in which human tissues they should look for an unseen protein. PepPSy can also be used by biocurators to revisit the annotation of specific categories of proteins based on the 'omics' data housed by the system. In this study, it was used to prioritize 21 dubious protein-coding genes among the 616 annotated in neXtProt for reannotation. PepPSy is freely available at http://peppsy.genouest.orgDatabase URL: http://peppsy.genouest.org.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27173522 PMCID: PMC4865363 DOI: 10.1093/database/baw070
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.PepPSy layouts. (A) The ‘Filtration’ and (B) the ‘Prioritization’ tab contain menus to filter proteins and sliders to increase (red, from +1 to +10) or decrease (blue, from -10 to -1) contribution of each prioritization module to the final ranking.
Figure 2.PepPSy outputs. (A) The output displays columns for neXtProt IDs, gene symbols, protein description, NCBI Entrez gene IDs, a color-coded evolution of the neXtProt Protein Existence status over time, the current Protein Existence status, the observability status, the prioritized rank of each gene product and the human tissues in which ranked genes are expressed in the different transcriptomic and proteomic datasets.
List of the ten PE5 entries from neXtProt release 2014-09-19 that have expression information in the four transcriptomic datasets
| neXtProt accession | Gene symbol | Observability status | UniGene | Affymetrix U133 Plus 2.0 | Affymetrix All Exon 1.0 | Hum. Protein Atlas RNA-seq | |
|---|---|---|---|---|---|---|---|
| 1 | NX_B1AH88 | TSPO | observable | Intestine > Brain > … | Mouth > Bone marrow > … | Spleen > Intestine > … | Bone marrow > Skin > … |
| 2 | NX_Q9NPU4 | C14orf132 | with special handling | Brain > Testis > … | Brain > Spinal cord > … | Brain | Brain > Oviduct > … |
| 3 | NX_P0CF97 | FAM200B | likely unobservable | Brain > Eye > … | Male germ cell > Brain > … | Brain | Ovary > Kidney > … |
| 4 | NX_Q9BWV7 | TTLL2 | likely unobservable | Testis > … | Male germ cell > Testis | Testis | Testis |
| 5 | NX_Q5T036 | FAM120AOS | likely unobservable | Brain > Lung > … | Placenta > Pituitary gland > … | Intestine > Kidney > … | Placenta > Thyroid > … |
| 6 | NX_Q96SF2 | CCT8L2 | likely unobservable | Testis > Brain > … | Male germ cell > Testis | Intestine > Pancreas > … | Testis |
| 7 | NX_Q8IVY1 | C1orf210 | with special handling | Pancreas > Intestine > … | Female germ cell | Intestine > Kidney > … | Intestine > Stomach > … |
| 8 | NX_P0CB46 | CASP16 | likely unobservable | Spleen > Uterus > … | Female germ cell | Intestine | Intestine |
| 9 | NX_Q8N5Q1 | FAM71E2 | observable | Testis > Thymus > … | Male germ cell | Testis | Testis |
| 10 | NX_Q96HZ7 | C21orf119 | likely unobservable | Intestine > Prostate > … | Testis > Male germ cell > … | Testis > Kidney > … | Skeletal muscle > Thyroid > … |
The list has been prioritized using the default parameters of PepPSy. The four last columns show the two tissues in which the highest expression levels have been reported in each dataset.
Summary of the reannotation of PE5 entries
| neXtProt accession | Gene symbol | New PE | Sample suggestion for further analyses |
|---|---|---|---|
| NX_B1AH88 | TSPO | PE5 | |
| NX_Q9NPU4 | C14orf132 | PE3 | Brain; membrane fraction |
| NX_P0CF97 | FAM200B | PE3 | Brain; will be difficult to distinguish from FAM200A |
| NX_Q9BWV7 | TTLL2 | PE1? (in progress) | Testis |
| NX_Q5T036 | FAM120AOS | PE3? (in progress) | Placenta? |
| NX_Q96SF2 | CCT8L2 | PE2 | Testis; will be difficult to distinguish from CCT8L1P |
| NX_Q8IVY1 | C1orf210 | PE1 | |
| NX_P0CB46 | CASP16 | Deleted | |
| NX_Q8N5Q1 | FAM71E2 | PE1 | |
| NX_Q96HZ7 | C21orf119 | PE5 | |
| NX_Q8IYS8 | BOD1L2 | PE2 | Testis |
| NX_P0CG32 | ZCCHC18 | PE3 | Brain?; will be difficult to distinguish from ZCCHC12 |
| NX_C9J798 | RASA4B | PE3 | Skeletal muscle; quasi undistinguishable from RASA4 |