Literature DB >> 22368391

ProPAS: standalone software to analyze protein properties.

Songfeng Wu1, Yunping Zhu.   

Abstract

Physicochemical properties of proteins/peptides are often used to separate proteins/peptides from complex biological samples in proteomics research, which is necessary for the further proteins/peptides identification. So, the analyses of these properties are very important for the design of proteomic experiments and the further analyses of identified results. Current available web-based tools are difficult to analyze large-scale datasets. Here we provide a standalone software ProPAS (Protein Properties Analyses Software) to calculate these properties in local computers. ProPAS could be used to calculate several physicochemical parameters of proteins/peptides, including the isoelectric point (pI), hydrophobicity (Hy) and mass weight (MW). Additionally, ProPAS provides the statistic results to get the number distribution along the parameter values. The results could be copied to other software such as Excel to get the distribution chart. The amino acid composition of all the input proteins/peptides could also be calculated from all input sequences. Detail description, updates and download link for ProPAS could be found at the website of http://bioinfo.hupo.org.cn/tools/ProPAS/propas.htm.

Entities:  

Year:  2012        PMID: 22368391      PMCID: PMC3283891          DOI: 10.6026/97320630008167

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Protein identification is a basic task for proteomics research. Usually, the proteins/peptides need to be separated according to their physicochemical properties before the identification [1]. The common used physicochemical properties include isoelectric point, hydrophobicity and mass weight. So, the calculation of these physicochemical properties is very important and could be used for the design of proteomic experiments. Further analyses of physicochemical properties for the identified proteins might be helpful to discover the experimental bias, which would be important for improving the experiments. The analyses could also be used to discover the different distribution caused by some biological factors, for example the different isoelectric point and hydrophobicity in different sub-cellular localization [2, 3]. Most current available softwares for these physicochemical properties calculation are web-based, including the software in ExPASy [4]. The web-based softwares are difficult for the analyses of large-scale datasets. ProPAS provide an alternative choice to analyze the data in local computer, which could be used to process small or large-scale datasets. Additionally, ProPAS could also provide the statistic results to get the protein/peptide number distribution of these physicochemical properties.

Methodology

Calculation of isoelectric point:

Isoelectric point (pI) was estimated as the pH at which a protein took on a neutral charge (R=0) [5]. (Please see supplementary material for definition of R).

Calculation of hydrophobicity:

Hydrophobicity value for each protein is calculated as the mean value of hydrophobic index for all amino acids. In this software, Kyte-Doolittle scale [6] is used as the hydrophobic index.

Calculation of mass weight:

Protein mass weight (MW) is calculated by the addition of average isotopic masses of amino acids in the protein and the average isotopic mass of one water molecule. The mass weight values for the amino acids are from ExPASy [7], including 20 standard amino acids and 2 non-standard amino acids (Selenocysteine and Pyrrolysine). Additionally, the characters B (Asx: Aspartic acid or Asparagine), Z (Glx: Glutamine or Glutamic acid) and X (Xaa: Any amino acid) are also accepted for the calculation, whose mass values are calculated as the mean values of the corresponding amino acid masses weighted by their respective frequencies. In this program, their mass values are: B (114.6686), Z (128.7531) and X (111.1138).

Calculation of amino acid composition:

Amino acid composition is calculated as the count and the percentage for each amino acid from all the input proteins/peptides. The software was developed by Perl script. Figure 1 shows the interface and its functional modules of the software.
Figure 1

The interface of ProPAS. The different functional modules are showed in the interface: (1) input format options; (2) text box for input data; (3) text box for input file and its directory; (4) parameters that could be analyzed; (5) need statistic or not; (6) range and step of the parameter for the statistic; (7) file and directory that the results will be exported to; (8) text box for output data; (9) button to calculate the properties; (10) Button to save the results showed in text box “8” to file; (11) Button to copy the results showed in text box “8” to clipboard.

Input

The input data for the software includes:

(a) One or more protein sequences with Fasta format (“Fasta” in the software); (b) One protein/peptide with only amino acid sequence (“Single plain sequence” in the software); (c) One or more peptide sequences, one peptide for each line (“Peptide list” in the software). The data could be input with two different ways. Firstly, the data could be copied to the clipboard and then pasted to the text box in the software. Secondly, the data could be stored as a file and then the user would find and select the file by clicking the “File open…” button. If the data is provided by both of the above ways, only the data from the file would be used for the calculation.

Parameter setting:

Sometimes, the user would only need to calculate some of the parameters. It could be set in the parameter plate. De-select any parameter would skip the calculation. For the distribution statistic, the ranges and their steps could also be changed in the parameter plate upon the requirement of the research.

Output

The calculation results could be showed in the right text box of the software, or be exported to the file if there is the file name and its directory in the box following “Save to file:” when running the program.

The output results include the following sections:

(a) Result of the physicochemical properties for each protein/peptide. It would list the values of pI, Hy and MW for each protein. (b) Result of the pI statistic; (c) Result of the MW statistic; (d) Result of the Hy statistic; (e) Result of the amino acid composition statistic; in the statistic results, the number and percent of each step of the parameters are provided to get their distribution.

Caveat & future development

The post-translational modifications are not considered in the calculation, which would be added in the further development. Some other parameters of protein properties would also be considered if needed (for example, the protein stability and so on).
  7 in total

Review 1.  Protein identification and analysis tools in the ExPASy server.

Authors:  M R Wilkins; E Gasteiger; A Bairoch; J C Sanchez; K L Williams; R D Appel; D F Hochstrasser
Journal:  Methods Mol Biol       Date:  1999

2.  Multi-modality of pI distribution in whole proteome.

Authors:  Songfeng Wu; Ping Wan; Jianqi Li; Dong Li; Yunping Zhu; Fuchu He
Journal:  Proteomics       Date:  2006-01       Impact factor: 3.984

3.  Prediction of protein subcellular location using hydrophobic patterns of amino acid sequence.

Authors:  Tongliang Zhang; Yongsheng Ding; Kuo-Chen Chou
Journal:  Comput Biol Chem       Date:  2006-09-11       Impact factor: 2.877

Review 4.  Protein and peptide fractionation, enrichment and depletion: tools for the complex proteome.

Authors:  Linda Ly; Valerie C Wasinger
Journal:  Proteomics       Date:  2011-01-17       Impact factor: 3.984

5.  The focusing positions of polypeptides in immobilized pH gradients can be predicted from their amino acid sequences.

Authors:  B Bjellqvist; G J Hughes; C Pasquali; N Paquet; F Ravier; J C Sanchez; S Frutiger; D Hochstrasser
Journal:  Electrophoresis       Date:  1993-10       Impact factor: 3.535

6.  A simple method for displaying the hydropathic character of a protein.

Authors:  J Kyte; R F Doolittle
Journal:  J Mol Biol       Date:  1982-05-05       Impact factor: 5.469

7.  Evidence for the adaptation of protein pH-dependence to subcellular pH.

Authors:  Pedro Chan; Jim Warwicker
Journal:  BMC Biol       Date:  2009-10-22       Impact factor: 7.431

  7 in total
  9 in total

1.  Continuous Elution Proteoform Identification of Myelin Basic Protein by Superficially Porous Reversed-Phase Liquid Chromatography and Fourier Transform Mass Spectrometry.

Authors:  Daniel A Plymire; Casey E Wing; Dana E Robinson; Steven M Patrie
Journal:  Anal Chem       Date:  2017-10-31       Impact factor: 6.986

2.  Comparative genomic analysis of six bacteria belonging to the genus Novosphingobium: insights into marine adaptation, cell-cell signaling and bioremediation.

Authors:  Han Ming Gan; André O Hudson; Ahmad Yamin Abdul Rahman; Kok Gan Chan; Michael A Savka
Journal:  BMC Genomics       Date:  2013-06-28       Impact factor: 3.969

3.  PHYSICO: An UNIX based Standalone Procedure for Computation of Individual and Group Properties of Protein Sequences.

Authors:  Parth Sarthi Sen Gupta; Shyamashree Banerjee; Rifat Nawaz Ul Islam; Sudipta Mondal; Buddhadev Mondal; Amal K Bandyopadhyay
Journal:  Bioinformation       Date:  2014-02-19

4.  Halophilic microbial community compositional shift after a rare rainfall in the Atacama Desert.

Authors:  Gherman Uritskiy; Samantha Getsin; Adam Munn; Benito Gomez-Silva; Alfonso Davila; Brian Glass; James Taylor; Jocelyne DiRuggiero
Journal:  ISME J       Date:  2019-07-04       Impact factor: 10.302

5.  Enabling QTY Server for Designing Water-Soluble α-Helical Transmembrane Proteins.

Authors:  Fei Tao; Hongzhi Tang; Shuguang Zhang; Mengke Li; Ping Xu
Journal:  mBio       Date:  2022-01-18       Impact factor: 7.867

6.  Effect of polyethylene glycol 20 000 on protein extraction efficiency of formalin-fixed paraffin-embedded tissues in South Africa.

Authors:  Sophia Rossouw; Hocine Bendou; Liam Bell; Jonathan Rigby; Alan Christoffels
Journal:  Afr J Lab Med       Date:  2021-12-17

7.  Metagenomic analysis reveals that modern microbialites and polar microbial mats have similar taxonomic and functional potential.

Authors:  Richard Allen White; Ian M Power; Gregory M Dipple; Gordon Southam; Curtis A Suttle
Journal:  Front Microbiol       Date:  2015-09-23       Impact factor: 5.640

8.  Evaluation of Protein Purification Techniques and Effects of Storage Duration on LC-MS/MS Analysis of Archived FFPE Human CRC Tissues.

Authors:  Sophia C Rossouw; Hocine Bendou; Renette J Blignaut; Liam Bell; Jonathan Rigby; Alan Christoffels
Journal:  Pathol Oncol Res       Date:  2021-05-03       Impact factor: 3.201

9.  Ecogenomics and Adaptation Strategies of Southern Ocean Viral Communities.

Authors:  Tomás Alarcón-Schumacher; Sergio Guajardo-Leiva; Manuel Martinez-Garcia; Beatriz Díez
Journal:  mSystems       Date:  2021-08-10       Impact factor: 6.496

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.