Literature DB >> 24993579

Polar profile of antiviral peptides from AVPpred Database.

Carlos Polanco1, José Lino Samaniego, Jorge Alberto Castañón-González, Thomas Buhse.   

Abstract

Diseases of viral origin in humans are among the most serious threats to health and the global economy. As recent history has shown the virus has a high pandemic potential, among other reasons, due to its ability to spread by air, hence the identification, investigation, containment, and treatment of viral diseases should be considered of paramount importance. In this sense, the bioinformatics research has focused on finding fast and efficient algorithms that can identify highly toxic antiviral peptides and to serve as a first filter, so that trials in the laboratory are substantially reduced. The work presented here contributes to this effort through the use of an algorithm already published by this team, called polarity index method, which identifies with high efficiency antiviral peptides from the exhaustive analysis of the polar profile, using the linear sequence of the peptide. The test carried out included all peptides in APD2 Database and 60 antiviral peptides identified by Kumar and co-workers (Nucleic Acids Res 40:W199-204, 2012), to build its AVPpred algorithm. The validity of the method was focused on its discriminating capacity so we included the 15 sub-classifications of both Databases.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24993579      PMCID: PMC7091296          DOI: 10.1007/s12013-014-0084-4

Source DB:  PubMed          Journal:  Cell Biochem Biophys        ISSN: 1085-9195            Impact factor:   2.194


Introduction

Within the antimicrobial peptides there is a group of particular importance, the antiviral peptides. These peptides are potential drugs to face the increased incidence of chronic viral infections caused by HIV and hepatitis [4, 5]; for that reason there is a need to accelerate the development of synthetic antiviral peptides. Inclusive, the testing of vaccines against HIV and hepatitis [6, 7] that are under development offers no guarantee of success regardless of the millions of infected people. In counterpart, antiviral medications are not oriented to the most acute infections that cause serious diseases, such as hemorrhagic fever and cancer [8-10]; they have limited effectiveness and serious side effects. Perhaps more importantly, the antiviral chemotherapy is producing a rapid development of drug-resistant strains, as a result of the high rate of virus replication, due to the low resistance to replication [11]. In such a scenario, the work in proteomics and bioinformatics should focus on the generation of fast and robust algorithms [12, 13] identifying the antiviral action, from the linear or three-dimensional structure of the peptide. However, the simulation of the three-dimensional structure of the peptide is very complex, without taking into account other factors involved such as: The dynamics of the membrane and toxicity. In this work, we use the Polarity index method [14], already published by our team to identify SCAAP [15-17] and which only uses the linear peptide sequence to identify the same group of antiviral peptides that were identified by AVPpred algorithm [2]. This method [14] generates an exhaustive analysis of the peptide polarity through its polarity matrix. In this sense, the method apparently does not consider other factors, but indirectly it does, because it requires being calibrated by “a set of peptides” that are characteristic of the profile.

Materials and Methods

The identification of antiviral peptides performed by the polarity index method [14] requires the modifications below (Supplementary Material).

Polarity Index method. Updates

Modifications Replacing the Q[i,j] matrix in the source program [14] by the Table 1, which represents the incidences of antiviral sequences with a unique pathogenic action. Table 1 considers 60 AVPpred antiviral peptides extracted from Kumar and co workers [2], and 1 antiviral peptide extracter from APD2 database [1].
Table 1

Q[i,j] Polarity matrix

P+P−NNP
P+0.03548616170.01845280270.03548616170.0681334287
P−0.01632363350.01774308090.02980837410.0709723234
N0.04045422380.03264726700.08729595690.0908445716
NP0.06671398130.06174591930.09155429150.1937544346

Incidences of 60 AVPpred antiviral peptides extracted from Kumar [2], and one antiviral peptide from APD2 [1]

Q[i,j] Polarity matrix Incidences of 60 AVPpred antiviral peptides extracted from Kumar [2], and one antiviral peptide from APD2 [1] Replacing the rule in the source program [14] by P[i,j] + Q[i,j] vector complying with the next rule: polar interactions 15 or 16 are present in the 1st position, polar interaction 14 is not present from 3th to 8th positions, polar interaction 2 is not present in the first seven positions, polar interaction 5 is not present in the first five positions, polar interactions 4 or 8 are not present from 12th to 16th positions, polar interactions 14, 15 or 16 are not present from 11th to 16th positions, polar interaction 9 is not present in the 1st, 4th, 5th or 9th positions, polar interaction 1 is not present in the 10th or 11th positions, polar interaction 11 is not present in the first position, polar interaction 1 is not present in 10th or 11st positions, polar interaction 14 is not present in the 5th position, and polar interaction 4 is not present in the 4th position (Table 2).
Table 2

Polarity index method rules

Position12345678910111213141516
P[i,j] + Q[i,j] vector of study.(1,1)(1,2)(1,3)(1,4)(2,1)(2,2)(2,3)(2,4)(3,1)(3,2)(3,3)(3,4)(4,1)(4,2)(4,3)(4,4)

Rule # 1

Polar interactions 15 or 16 are present in the 1st position

Rule # 2

Polar interaction 14 is not present from 3th to 8th positions

××××××

Rule # 3

Polar interaction 2 is not present in the first seven positions

×××××××

Rule # 4

Polar interaction 5 is not present in the first five positions

×××××

Rule # 5

Polar interactions 4 or 8 are not present from 12th to 16th positions

×××××

Rule # 6

Polar interactions 14, 15 or 16 are not present from 11th to 16th positions

××××××

Rule # 7

Polar interaction 9 is not present in the 1st, 4th, 5th or 9th positions

××××

Rule # 8

Polar interaction 1 is not present in the 10th or 11th positions

××

Rule # 9

Polar interaction 11 is not present in the first position

×

Rule # 10

Polar interaction 1 is not present in 10th or 11st positions

××

Rule # 11

Polar interaction 14 is not present in the 5th position

×

Rule # 12

Polar interaction 4 is not present in the 4th position

×

Identification Rules in Polarity index method for antiviral peptides. (✔) The polar interaction is present in the position. (×) The polar interaction is not present in the position

Polarity index method rules Rule # 1 Polar interactions 15 or 16 are present in the 1st position Rule # 2 Polar interaction 14 is not present from 3th to 8th positions Rule # 3 Polar interaction 2 is not present in the first seven positions Rule # 4 Polar interaction 5 is not present in the first five positions Rule # 5 Polar interactions 4 or 8 are not present from 12th to 16th positions Rule # 6 Polar interactions 14, 15 or 16 are not present from 11th to 16th positions Rule # 7 Polar interaction 9 is not present in the 1st, 4th, 5th or 9th positions Rule # 8 Polar interaction 1 is not present in the 10th or 11th positions Rule # 9 Polar interaction 11 is not present in the first position Rule # 10 Polar interaction 1 is not present in 10th or 11st positions Rule # 11 Polar interaction 14 is not present in the 5th position Rule # 12 Polar interaction 4 is not present in the 4th position Identification Rules in Polarity index method for antiviral peptides. (✔) The polar interaction is present in the position. (×) The polar interaction is not present in the position

APD2 Database Trial Data Preparation

We use 3,636 peptides in the APD2 Database [1] classified by their multiple action against: 149 Gram− ONLY, 1711 Gram +/Gram− ONLY, 315 Gram+ ONLY, 141 cancer cells, 9 sperms, 88 HIV, 744 fungi, 21 insects, 244 mammalian cells, 47 parasites, 3 protists, 39 chemotaxis, 0 SCAAP, 125 virus; and also 1,059 by their unique action against: 111 Gram− ONLY, 213 Gram+ ONLY, 518 Gram+/Gram− ONLY, 20 cancer cells, 0 HIV, 88 fungi, 35 H1N1, 2 insects, 11 mammalian cells, 9 parasites, 1 protists, 0 chemotaxis, 30 SCAAP, 0 sperms and 21 virus.

AVPpred Algorithm Trial Data Preparation

From Kumar and co workers [2] work about antiviral peptides, we took 60 validated and experimental peptides from 1,245 antiviral peptides. His work evaluated these 60 peptides with 25 physicochemical properties (Table 3) out of 144 properties from the same database and called it AAindex database [3]. These peptides were used to build the SVM AVPpred algorithm [2], and we are using them here to validate the Polarity index method
Table 3

Physicochemical properties from AVPpred

#AAindex IDDescriptionPolarityReference
1BEGF750103Conformational parameter of beta-turn[18]
2BULH740102Apparent partial specific volume[19]
3CHAM810101Steric parameter[20]
4CHOP780204Normalized frequency of N-terminal helix[21]
5CHOP780206Normalized frequency of N-terminal non helical region[21]
6CHOP780215Frequency of the 4th residue in turn[21]
7CIDH920104Normalized hydrophobicity scales for alpha/beta-proteins[21]
8COHE430101Partial specific volume[22]
9FASG760105PK-C[23]
10FAUJ880104STERIMOL length of the side chain[24]
11FINA770101Helix-coil equilibrium constant[25]
12FINA910101Helix initiation parameter at position i − 1[26]
13GEIM800101Alpha-helix indices[27]
14GEIM800102Alpha-helix indices for alpha-proteins[27]
15GEIM800104Alpha-helix indices for alpha/beta-proteins[27]
16KARP850101Flexibility parameter for no rigid neighbors[28]
17KARP850102Flexibility parameter for one rigid neighbor[28]
18AURR980101Normalized positional residue frequency at helix termini N4′[29]
19AURR980118Normalized positional residue frequency at helix termini C”[29]
20AURR980120Normalized positional residue frequency at helix termini C4′[29]
21AVBF000107Slopes tripeptide FDPB PARSE neutral[30]
22GEOR030102Linker propensity from 1-linker dataset[31]
23KIDA850101Hydrophobicity-related index[32]
24GUYH850102Apparent partition energies calculated from Wertz-Scheraga index[33]
25CASG920101Hydrophobicity scale from native protein structures[34]

Selected physicochemical properties to build the AVPpred algorithm by Kumar [2]. AAindex amino acid index database [3]. Polarity (✔) physicochemical properties are directly or indirectly related to the polarity

Physicochemical properties from AVPpred Selected physicochemical properties to build the AVPpred algorithm by Kumar [2]. AAindex amino acid index database [3]. Polarity (✔) physicochemical properties are directly or indirectly related to the polarity

Results

Polarity index method is an algorithm that determines the probable antiviral pathogen action of peptides by using the peptide polarity sequence. It was applied to the APD2 database and the experimental peptides from AVPpred algorithm [2] with the following results. From the 25 physicochemical properties used to design the SVM AVPpred algorithm [2], 21 are directly or indirectly (18/25 = 84 %) related to the polarity (Table 3, column Polarity with entries with figure ✔). Polarity index method had over 43/60 = 71 % efficiency detecting the 60 validated and experimental antiviral peptides from Kumar and coworkers [2] (Table 4, entries 1–60 and Table 5 column AVPpred), and 1/1 = 100 % detecting the antiviral peptides from APD2 database (Table 4, entry 61 and Table 5, colum Virus). There are no coincidences between both excluding sets (Table 4, columns #1 and #2).
Table 4

Polarity index matches by linear sequence in virus

NoCodeID PUBMEDSequence#1#2References
1AVP_06186096849GPPISLERLDVGTNLGNAIAKLEAKELLESSDQI[35]
2AVP_06293788062KVLHLEGEVNKIALLSTNKAVVSLSNGVSVLTSN[36]
3AVP_06072893293DFLEENITALLEEAQIQQEKNMYELQKLNSWDVFG[37]
4AVP_01688382405EGPTLGNWAREIWATLFGKAN[38]
5AVP_01798382405NWAREIWATLFKKAN[38]
6AVP_046710390360FAIKWEYVLLLFLL[39]
7AVP_05121848704SWLRDIWDWKCEVLSDFK[40]
8AVP_05141848704SWLRDIWDWLCEVLSDFK[40]
9AVP_03731848704SWLRDIWDWICEVLSDFK[40]
10AVP_037252472831SWLRDIWDWICEVLSD[41]
11AVP_03879223423TWLRAIWDWVCTALTDFK[42]
12AVP_03238822631PPVYTKDVDISSQISSMNQSLQQSKDYIKEAQKILDTVNPSL[43]
13AVP_03283012869VANDPIDISIELNKAKSDLEESKEWIRRSNQKLDSDN[44]
14AVP_021011118300ANTAFVSSAHNTQKIPAGAPFNRNLRAMLADLRQNAAFAG[45]
15AVP_00241695254CGGNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ[46]
16AVP_031216667080EQCREEEDDRN[47]
17AVP_035815893660GGTIFDCGETCFLGTCYTPGCSCGNYGFCYGTNN[48]
18AVP_00197841460GICRCICGKGICRCICGR[49]
19AVP_028421685289GICRCICGRGICRCICGR[50]
20AVP_03977529412GIKEWKRIVQRIKDFLRNLVN[51]
21AVP_036116872274GLPVCGETCVGGTCNTPGCTCSWPVCTRNN[52]
22AVP_028610521339GVCRCLCRRGVCRCICRR[53]
23AVP_030415949629KTCENLADTY[54]
24AVP_06847031661LEAIPCSIPPCFLFGKPFVF[55]
25AVP_06927031661LEAIPISIPPELAFAKPFVF[55]
26AVP_07037031661LEAIPMSIPPEVFFGKPFVF[55]
27AVP_01559777331LSYRCPCRN[56]
28AVP_04091433527WMEWDREIEELAKKAEELAKKAEELAKKAWASLWNWF[57]
29AVP_02223031048YALLIRMIYKNI[58]
30AVP_022419468303YQLLARMIYKNI[59]
31AVP_02259504927YQLLIAMIYKNI[60]
32AVP_05842578615YTSLIHSLIEESQNQQEKNEQELLEFDKWASLWNWFN[61]
33AVP_05913040055YTSLIHSLIEESQNQQEKNEQELLELNKWASLWNWFN[62]
34AVP_03109516047GLFGVLGSIAKHVLPHVVPVIAEKLN[63]
35AVP_01837826699DWHLGQGVSIEWRKK[64]
36AVP_04449784389FLFPLITSFLSKVL[65]
37AVP_045210951191GLFDIIKKIAESW[66]
38AVP_045912089438GWLKKIESIIDAF[67]
39AVP_046011053427HVDKKVADKVLLLKQLRIMRLLTRLN[68]
40AVP_03487826699TTWEAWDRAIAEYAARIEALIRAAQEQQEKNEAILREL[64]
41AVP_03547826699TTWEAWDRAIAEYAARIEALIRASQEQQEKNEAELREL[64]
42AVP_06123012869ELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSSTTN[44]
43AVP_06233012869IELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSSTN[44]
44AVP_03903012465AAHLIDALYAEFLGGRVLTT[69]
45AVP_006810077657HRWRKRWRKHRWRKRWRK[70]
46AVP_006110077657KRWRKRWRKWRWRKRWRK[70]
47AVP_005515095345RTRKRGRKRTRKRGRK[71]
48AVP_019114648299RGGKIAGKIAKIAGKIAKIAGKIA[72]
49AVP_01343012869ISIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSN[44]
50AVP_04822959959RRKKAAVALLPAVLLA[73]
51AVP_04832959959RRKKAAVALLPAVLLAL[73]
52AVP_03022661722KDLLFKN[74]
53AVP_01133788062GPPISLERLDVGTNLGNAIAKLEDAKELLESSDQI[35]
54AVP_01143788062HRIDLGPPISLERLDVGTNLGNAIAKLEDAKELLE[35]
55AVP_01163788062ISLERLDVGTNLGNAIAKLEDAKELLESSDQILRS[35]
56AVP_028812810954CGECGGGHIVGRFCMVVRFLRLVFI[75]
57AVP_02899634230CRCCELKSLCPTLMRVVRLLGLVLLN[76]
58AVP_01386096849LVFPSDEFDASISQVNEKINQSLAFIRKSDELLHN[77]
59AVP_02782833514KWKLFKKIGIGKFLHVAKKF[78]
60AVP_033512886019CDVIALLCHLNT[79]
61APD2_012097744961RICRCICGRRICRCICGR[80]

Subject sequences identified by polarity index method in APD2 database [1] and by AVPpred [2], where peptides have action only on virus. #1: (N) rejected peptide by SVM AVPpred algorithm. #2: (N) rejected peptide by polarity index method. Source: National Center for Biotechnology Information, US National Library of Medicine http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins in database: Swiss-Prot (swissprot), accessed March 20, 2013

Table 5

Polarity index matches by pathogenic action

Number of hitsGram+ onlyGram− onlyGram+/ Gram−Virus%HIVFungiProtistsParasitesInsectsSpermsCancer cellsMammalian cellsChemotaxisSCAAPAVPpred%
Unique action38191341100060100220134270
21311151810881920201105160
Multiple action66274261916163012312881800
3151491,711125187443472191412443900

Polarity index method matches for both APD2 [1] and AVPpred antiviral peptide groups [2]. Unique action peptides with pathogenic action against only one group. Multiple action peptides with pathogenic action against two or more groups. (%): Percentage of hits/total peptides

Polarity index matches by linear sequence in virus Subject sequences identified by polarity index method in APD2 database [1] and by AVPpred [2], where peptides have action only on virus. #1: (N) rejected peptide by SVM AVPpred algorithm. #2: (N) rejected peptide by polarity index method. Source: National Center for Biotechnology Information, US National Library of Medicine http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins in database: Swiss-Prot (swissprot), accessed March 20, 2013 Polarity index matches by pathogenic action Polarity index method matches for both APD2 [1] and AVPpred antiviral peptide groups [2]. Unique action peptides with pathogenic action against only one group. Multiple action peptides with pathogenic action against two or more groups. (%): Percentage of hits/total peptides

Discussion

When reviewing different databases of antimicrobial peptides, we detected a peptide with predominantly toxic action toward a pathogenic group; this allows us to assume that nature considers only small changes in the primary structure of the peptide to induce its possible pathogenic action. In that sense, the peptide linear structure plays an important role in the identification of its pathogenic action, when we use algorithms that use “training sets” with the desired profile. Although the physicochemical property called polarity is involved in most of the algorithms that predict anti-virus peptides, this method has an innovative aspect as it expresses the metric through a polarity matrix that includes 16 interactions. We see this matrix as a picture of the polar dynamics of the peptide. The polarity matrix clearly shows a pattern that led us to achieve an identification efficiency of 70 % on the AVPpred database. The same pattern also rejected other groups of peptides in APD2 database, with the exception of the anti-virus set. We assume that if we built the matrix with one digit, perhaps we did not have enough information to focus the method correctly. We believe the effectiveness of the polarity index method in terms of the computing resources required makes it suitable candidate for a more detailed analysis related to the sub-domains of peptides. In this regards, we have initiated a comprehensive classification of the APD2 database gathering, from published manuscripts, the toxicity values of antimicrobial peptides and thus explore peptide sub-domains with specific and very toxic pathogenic action. Our team is working on this as we consider it of vital importance to strongly support basic scientific research. We have also published work where the same method is used to understand the profile of the “first proteins” from 4 billion years ago. For this task, we are using clusters of GPU coprocessors, which will allow the analysis of 15 amino acids in length peptides. Finally, within the antiviral peptide group, there are two sub-groups not approached in this work as they constitute by themselves an independent topic for the importance they have and the degree of subject matter expertise required: the influenza A type H1N1 and HIV peptides. Given their potential to provoke a world pandemic, these two groups of peptides concentrate the efforts of several research groups, as they can undoubtedly become a problem of enormous proportions. Our team is now directing the method toward the identification of these two sub-groups.

Conclusions

In summary, we report an implementation of a polarity index method in the exhaustive prediction of antiviral peptides from AVPpred and APD2 databases, with high level of discriminative efficiency (44/61 = 72), from the reading of its linear sequence.

Availability

The test files and source code are given as “Supplementary Material” Below is the link to the electronic supplementary material. Supplementary material 1 (GZ 3700 kb) Supplementary material 2 (GZ 3700 kb)
  74 in total

Review 1.  Analytical ancestry: "firsts" in fluorescent labeling of nucleosides, nucleotides, and nucleic acids.

Authors:  Larry J Kricka; Paolo Fortina
Journal:  Clin Chem       Date:  2009-02-20       Impact factor: 8.327

2.  A toy model of prebiotic peptide evolution: the possible role of relative amino acid abundances.

Authors:  Carlos Polanco; Thomas Buhse; José Lino Samaniego; Jorge Alberto Castañón González
Journal:  Acta Biochim Pol       Date:  2013-06-06       Impact factor: 2.149

Review 3.  Prediction of the secondary structure of proteins from their amino acid sequence.

Authors:  P Y Chou; G D Fasman
Journal:  Adv Enzymol Relat Areas Mol Biol       Date:  1978

4.  Amino acid side-chain partition energies and distribution of residues in soluble proteins.

Authors:  H R Guy
Journal:  Biophys J       Date:  1985-01       Impact factor: 4.033

5.  Nucleotide sequence of the gene encoding the fusion (F) glycoprotein of human respiratory syncytial virus.

Authors:  P L Collins; Y T Huang; G W Wertz
Journal:  Proc Natl Acad Sci U S A       Date:  1984-12       Impact factor: 11.205

6.  Protein folding and the genetic code: an alternative quantitative model.

Authors:  M Charton
Journal:  J Theor Biol       Date:  1981-07-07       Impact factor: 2.691

7.  Molecular and biophysical characterization of TT virus: evidence for a new virus family infecting humans.

Authors:  I K Mushahwar; J C Erker; A S Muerhoff; T P Leary; J N Simons; L G Birkenmeyer; M L Chalmers; T J Pilot-Matias; S M Dexai
Journal:  Proc Natl Acad Sci U S A       Date:  1999-03-16       Impact factor: 11.205

8.  The nucleotide sequence of the mRNA encoding the fusion protein of measles virus (Edmonston strain): a comparison of fusion proteins from several different paramyxoviruses.

Authors:  C Richardson; D Hull; P Greer; K Hasel; A Berkovich; G Englund; W Bellini; B Rima; R Lazzarini
Journal:  Virology       Date:  1986-12       Impact factor: 3.616

9.  An infectious molecular clone of an unusual macrophage-tropic and highly cytopathic strain of human immunodeficiency virus type 1.

Authors:  R Collman; J W Balliet; S A Gregory; H Friedman; D L Kolson; N Nathanson; A Srinivasan
Journal:  J Virol       Date:  1992-12       Impact factor: 5.103

10.  Cloning and characterization of zona pellucida genes and cDNAs from a variety of mammalian species: the ZPA, ZPB and ZPC gene families.

Authors:  J D Harris; D W Hibler; G K Fontenot; K T Hsu; E C Yurewicz; A G Sacco
Journal:  DNA Seq       Date:  1994
View more
  1 in total

1.  In silico analyses of conservational, functional and phylogenetic distribution of the LuxI and LuxR homologs in Gram-positive bacteria.

Authors:  Akanksha Rajput; Manoj Kumar
Journal:  Sci Rep       Date:  2017-08-01       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.