| Literature DB >> 21769189 |
Singh Jitendra, Ranjana Narula, Shefali Agnihotri, Maneet Singh.
Abstract
UNLABELLED: A hypothetical protein is predicted to be expressed from an open reading frame without known experimental evidence of translation. They constitute a substantial fraction of proteomes. Domain extraction from these hypothetical sequences helps to search for protein coding genes for protein structural and functional annotation. We describe the analysis of prediction data in a sequence dataset of hypothetical protein orthologs of Pongo abelii (orangutan) and Sus scrofa (pig). It should be noted that these orangutan-pig orthologs are also non-homologous to human proteins. These predicted data find application in the genome wide annotation of proteins in poorly understood genomes. ABBREVIATIONS: PDB - Protein Data Bank, DEG - Database of Essential Genes, CDD - Conserved Domain Database, IUCN - International Union for Conservation of Nature.Entities:
Keywords: Pongo abelii; Sus scrofa; functional annotation; hypothetical proteins; structure prediction; subcellular localization
Year: 2011 PMID: 21769189 PMCID: PMC3134776 DOI: 10.6026/97320630006297
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Flowchart describing identification and analysis of hypothetical orthologs in pig and orangutan.
Figure 2Distribution of essential and non-essential hypothetical proteins.
Figure 3Sub-cellular localization prediction using BaCelLo.
Figure 4Sub-cellular localization prediction using CELLO
Figure 5Subcellular localization prediction using WOLF PSORT.