| Literature DB >> 25873935 |
Johny Ijaq1, Mohanalatha Chandrasekharan1, Rajdeep Poddar1, Neeraja Bethi1, Vijayaraghava S Sundararajan1.
Abstract
Hypothetical proteins (HPs) are the proteins predicted to be expressed from an open reading frame, making a substantial fraction of proteomes in both prokaryotes and eukaryotes. Genome projects have led to the identification of many therapeutic targets, the putative function of the protein, and their interactions. In this review we enlist various methods linking annotation to structural and functional prediction of HPs that assist in the discovery of new structures and functions serving as markers and pharmacological targets for drug designing, discovery, and screening. Further we give an overview of how mass spectrometry as an analytical technique is used to validate protein characterisation. We discuss how microarrays and protein expression profiles help understanding the biological systems through a systems-wide study of proteins and their interactions with other proteins and non-proteinaceous molecules to control complex processes in cells. Finally, we articulate challenges on how next generation sequencing methods have accelerated multiple areas of genomics with special focus on uncharacterized proteins.Entities:
Keywords: annotation; drug design research; functional prediction; hypothetical proteins; protein–protein interactions; public repository
Year: 2015 PMID: 25873935 PMCID: PMC4379932 DOI: 10.3389/fgene.2015.00119
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Methods used for protein characterization and annotation.
| 1 | Basic local alignment tool (BLAST) | Used for finding similar sequences in protein databases |
| 2 | ExPASy -- Protparam tool | Used for computation of various physical and chemical parameters like molecular weight, isoelectric point (Pi), amino acid composition, atomic composition, extinction co-efficient, instability index, aliphatic index, and grand average of hydropathy (GRAVY) |
| 3 | signalP | Predicts signal peptide cleavage sites. |
| 4 | secretomeP | Used for identifying proteins involved in non-classical secretory pathway. |
| 5 | PSORT B | Predicts subcellular localization of bacterial proteins. |
| 6 | PSLpred | Predicts subcellular localization of proteins from Gram-negative bacteria. |
| 7 | CELLO | Assign localization to both prokaryotic and eukaryotic proteins |
| 8 | TMHMM | used to authenticate whether the protein is a membrane protein or not. |
| 9 | HMMTOP | Predict transmembrane topology. |
| 10 | Pfam | Collection of multiple protein sequence alignments |
| 11 | SVMprot | SVM (Support vector machine based classification of proteins |
| 12 | SYSTERS | For grouping of proteins on the basis of their functions. |
| 13 | SUPERFAMILY | Hierarchical domain classification of PDB structures. |
| 14 | CATH (Class, Architecture, Topology, Homology) | Used for finding protein similarities across evolutionary |
| I5 | CDART (The conserved domain architecture | comprehensively organized database of protein families and |
| retrieval tool) | sub-families, their evolutionary relationships in the form of | |
| phylogenetic trees | ||
| 16 | PANTHER (Protein analysis through evolutionary relationships) | Identification and annotation of protein domains. |
| 17 | SMART | Automatic hierarchical clustering of the protein sequences |
| 18 | ProtoNet | |
| 19 | InterProScan | Searches interPro for motif discovery. It is the integration of |
| several large protein signature databases. | ||
| 20 | MOTIF | used for Motif discovery. |
| 21 | MEME suite | Database searching for assigning function to the discovered motifs. |
| 22 | S | Used for predicting protein--protein interactions. |
| 1 | Gel filtration chromatography | Separates proteins based on their size (which is closely related to their molecular weight) |
| 2 | Ion- exchange chromatography | Purify proteins according to their overall charge |
| 3 | Affinity chromatography | Separates proteins based on their affinity to bind to a known ligand. |
| 4 | SDS-PAGE | Separates protein according to molecular weight and allows the measurement of the molecular weight in comparison with marker proteins. |
| 5 | Isoelectric focusing | Separates proteins based on their PI on a polyacryl-amide gel with a PH gradient. |
| 6 | 2D-Electrophoresis | Isoelectric focussing is often used in conjunction with SDS-PAGE to give a very powerful method of protein characterization by separating the sample of protein first by isoelectric point and then by molecular weight. |
| 7 | NMR spectroscopy | For determining three dimensional structure of proteins |
| 8 | Mass spectrometry | For protein identification and characterization. |
| 9 | Yeast two hybrid assay | For studying protein--protein interactions. |
| 10 | Phage display method | For studying protein--protein interactions |
| 11 | Microarray analysis | For systems-oriented study of proteins |
| 12 | Next generation sequencing | For high-throughput sequencing of genome and proteome analysis. |