| Literature DB >> 32435167 |
Behzad Dehghani1, Tayebeh Hashempour1, Zahra Hasanshahi1, Javad Moayedi1.
Abstract
class="Disease">Hepatitis C virus (HCV) infection is aEntities:
Keywords: Bioinformatics; Core; Domain1; HCV
Year: 2019 PMID: 32435167 PMCID: PMC7223762 DOI: 10.1007/s10989-019-09838-y
Source DB: PubMed Journal: Int J Pept Res Ther ISSN: 1573-3149 Impact factor: 1.931
The accession numbers of all 188 sequences that were used in this study
| 2006(16) | DQ065821-DQ065836 and DQ202323 | ||||||||
| 2012(1) | JN129986 | ||||||||
| 2013(84) | KC118250-KC118333, KC118288 | ||||||||
| 2014(28) | KC285337-KC285362, JQ341409,KF218585,KC118320 | ||||||||
| 2016(59) | KT329291-KT329341, KU736828-KU736837 | ||||||||
Fig. 1Phylogenetic tree based on domain1 sequences and by using neighbor joining method. The phylogenetic tree was constructed by the NJ method. The numbers at the forks show the numbers of occurrences of the repetitive groups to the right out of 100 bootstrap samples. All used reference sequences were showed after accession numbers (1a, 1b, and etc.). Sequences were categorized in five major clusters
Frequency of all identified mutations in HCV-core domain1 sequences
| 2006 | 2012 | 2013 | 2014 | 2016 | Ref | |
|---|---|---|---|---|---|---|
| 11(T) | – | – | 1(P)1.2% | – | 1a(N) | |
| 12(K) | – | – | – | – | – | |
| 23(K) | – | – | – | – | – | |
| 25(P) | – | – | – | – | –– | |
| 39(R) | – | – | – | – | – | |
| 45(G) | –– | – | – | – | – | |
| 49(T) | 2(V)11.7% | – | 2(P) 2.4%, 5(A) 6.1%, 1(V) 1.2% | 1(V) 3.5% | 1(p)1.6% | – |
| 69(R) | – | – | – | – | – | |
| 70(R) | 2(Q)11.7% | – | 3(Q) 3.7%, 2(H) 2.4%, 1(P) 1.2% | 4(Q)14.3% | 25(Q)42.3%,1(H)1.6% | 3a,6a (Q) |
| 74(R) | – | – | – | – | – | |
| 78(Q) | – | – | – | 1 (R) 3.5% | 2a,2b (K) | |
| 90(G) | – | – | 1(L) 1.2%, 1(S)1.2% | – | – | |
| 91(C) | 2(M)11.7% | – | 13(M)16.4%, 3(L) 3.7%, 1(F) 1.2% | – | 43(M)72.8%, 7(L)11.8 | 1b (M); 2a and 5a (L) |
| 102(G) | – | – | – | – | 5a (S) | |
| 110 | 8(T)47%, 9(N)53% | – | 23(N)28.4%, 5(S)6.1%, 53(T)65.4% | 18(T)64.3%, 8(N)28.57% | N (10)17% | 1b, 1a, 2b(T); 1c(S); 4a, 3a, 6a, 2a, 5a(N) |
The majority of mutations happened in amino acid residues 49, 70, 91, 110
Domain1 physicochemical properties computed by “Protparam”
| Amino acid composition | Number | Percentage (%) |
|---|---|---|
| Number of amino acids: 117 | ||
| Molecular weight: 13320.2 | ||
| Theoretical pI: 12.05 | ||
| Ala (A) | 4 | 3.40 |
| Arg (R) | 21 | 17.90 |
| Asn (N) | 5 | 4.30 |
| Asp (D) | 2 | 1.70 |
| Cys (C) | 1 | 0.90 |
| Gln (Q) | 6 | 5.10 |
| Glu (E) | 3 | 2.60 |
| Gly (G) | 16 | 13.70 |
| Ile (I) | 2 | 1.70 |
| Leu (L) | 6 | 5.10 |
| Lys (K) | 7 | 6.00 |
| Met (M) | 1 | 0.90 |
| Phe (F) | 1 | 0.90 |
| Pro (P) | 17 | 14.50 |
| Ser (S) | 7 | 6.00 |
| Thr (T) | 6 | 5.10 |
| Trp (W) | 5 | 4.30 |
| Tyr (Y) | 3 | 2.60 |
| Val(V)4 | 3.40 | |
| Total number of negatively charged residues (Asp + Glu) | 5 | |
| Total number of positively charged residues (Arg + Lys) | 28 | |
| The estimated half-life is | ||
| Mammalian reticulocytes, in vitro | 30 h | |
| Yeast, in vivo | > 20 h | |
| | > 10 h | |
| The instability index (II) | 77.69 unstable | |
| Aliphatic index | 40 | |
| Grand average of hydropathicity (GRAVY) | − 1.424 | |
16 meric conserved B-cell epitopes regions in HCV-core domain1, predicted by ABCpred online software
| Rank | Sequence | Position | Score |
|---|---|---|---|
| 1 | QPGYPWPLYGNEGCGW | 78–93 | 0.92 |
| 2 | TRKTSERSQPRGRRQP | 49–64 | 0.9 |
| 3 | MSTNPKPQKKNKRNTN | 1–16 | 0.89 |
| 4 | PIPKARRPEGRTWAQP | 64–79 | 0.86 |
| 4 | GSRPSWGPTDPRRRSR | 102–117 | 0.86 |
HLA predicted epitopes in HCV-core domain1 sequence (genotype 1a) for HLA’s that were determined by previous researches in Iranian patients
| HLA types | Epitopes positions |
|---|---|
| HLA-A1 | 1_10 |
| HLA-A2 | 29–44, 77–85 |
| HLA-A3 | 2–10,29–59, 96–104 |
| HLA-B*3501 | 6–14, 27–35, 41–49, 57–65,78–91, 99–116 |
| HLA-B*3801 | 29–37, 77–85, 89–97 |
| HLA-Cw*0401 | 23–37, 58–65, 78–93 |
| HLA-Cw*0702 | 27–35, 73–86 |
| HLA-DQA1*05:01 | 2–16,13–27 |
| HLA-DQA1*0201 | NO |
| HLA-DQB1*0602 | 2–16,17–31, 102–116 |
| HLA-DQB1*0301 | 96–104 |
| HLA-G: HLA-G*01:01, HLA-G*01:02 | 4–16,8–21, 22–33, 66–79, 100–112 |
| HLA-DRB1*0301 | 96–104 |
| HLA-DRB1*0305 | 96–104 |
| HLA-DRB1*0309 | 96–104 |
| HLA-DRB1*0701 | No |
| HLA-DRB1*11 | 34–42 |
CTL epitopes in HCV-core domain 1; the high score epitopes are displayed
| Peptide Rank | Position | Sequence | Score | Prediction |
|---|---|---|---|---|
| 1 | 48–56 | ATRKTSERS | 1 | Epitope |
| 2 | 80–88 | GYPWPLYGN | 1 | Epitope |
| 3 | 2–10 | STNPKPQKK | 0.99 | Epitope |
Fig. 2Phosphorylation sites prediction for domain 1 using “NetPhos” online software. Green lines indicate 6 sites for serine, blues lines show 3 sites for threonine, and one purple line shows tyrosine. All sites with scores above the threshold of 0.5 were considered as phosphorylation sites
Percentage of secondary structures in core and domain1
| Alpha helix (Hh) | Extended strand (Ee) | Beta turn (Tt) | Random coil (Cc) | |
|---|---|---|---|---|
| Core | 39, 20.42% | 25, 13.09% | 14, 7.33% | 113, 59.16% |
| Domain 1 | NO | 19, 16.24% | 10, 8.55% | 88, 75.21% |
Fig. 3Secondary structure prediction using SOMPA. Red region is extended strand, blue is the alpha helix, green is beta turn, and purple is the random coil. The majority of core structure belongs to random coil
Fig. 4Secondary structure prediction using PHYRE2. The result of this tool shows that the majority of the core structure (40%) is alpha helix which is indicated with green helix, also the confidence keys of the predicted structure for these regions are high
Fig. 5Secondary structure prediction using (PS)2-v2. C coil, H helix, and E extended strand. The majority of core structure contains coil structure
Fig. 63D structure of the core protein using “I-TASSER” program. The selected model had the highest C score and it was qualified by “Qmean”
Fig. 73D structure of domain1 using I-TASSER program. The selected model had the highest C score and it was qualified by “Qmean”
Fig. 8Ramachandran plot was used to visualize energetically allowed regions for backbone dihedral angles ψ against ϕ of amino acid residues in modeled protein structure (LCC model) for tertiary structure of core protein by RAMPAGE; the majority of amino acids residues were in favored region (119 amino acids) and allowed region (52 amino acids)
Fig. 9Ramachandran plot was used to visualize energetically allowed regions for backbone dihedral angles ψ against ϕ of amino acid residues in modeled protein structure (LCC model) for tertiary structure of domain1 by RAMPAGE; majority of amino acids residues were in favored region (71 amino acids) and allowed region (28 amino acids)
Fig. 10A: the position of the B-Cell epitopes (yellow region) on core tertiary structure and B: the T-cell epitopes (yellow region) on core 3D structure
The results of predicted cleavage positions for 12 common proteases: B-cell, T-cell, and CTL predicted epitopes
| Positions | Caspase | Chymotrypsin | Clostripain | Elastase | Pepsin | Proteinase K | Asp-N | Staphylococcal peptidase I | Thrombin | Proline | Trypsin |
|---|---|---|---|---|---|---|---|---|---|---|---|
| B-cell epitopes | |||||||||||
| 78–93 | 3(8,9,16) | 5(4, 7, 9,15, 16) | 6(4, 6, 8, 9, 12, 16) | 1(11) | 1(12) | ||||||
| 49–64 | 5 (2, 7, 11, 13, 14) | 3(1, 4, 6) | 1(5) | 1(6) | 6(2, 3, 7, 11, 13, 14) | ||||||
| 1_16 | 1(1) | 1(13) | 2(3,15) | 1(7) | 4(9, 10, 12, 13) | ||||||
| 64–79 | 1(13) | 3(6, 7, 11) | 2(5,14) | 1(12) | 6(2, 5, 9, 12, 13, 14) | 1(8) | 1(9) | 1(8) | 3(4, 6, 11) | ||
| 102–117 | 1(6) | 5(3, 12, 13, 14, 16) | 2(6,9) | 1(9) | 1(4) | 3(12, 14, 16) | |||||
| CTL epitopes | |||||||||||
| 48–56 | 2(3,8) | 1(1) | 4(1, 2, 5, 7) | 1(6) | 1(7) | 3(3, 4 ,8) | |||||
| 80–88 | 2(6,7) | 3(2, 5, 7) | 4(2, 4, 6, 7) | ||||||||
| 2_10 | 1(2) | 1(6) | 2(8,9) | ||||||||
| T-cell epitopes | |||||||||||
| HLA-A1 | |||||||||||
| 1_10 | 1(1) | 1(3) | 1(7) | 2(9,10) | |||||||
| HLA-A2 | |||||||||||
| 29–44 | 3(6, 7, 15) | 3(10, 11, 14) | 2(2, 5) | 4(5, 6, 8, 15) | 7(1, 2, 5, 6, 7, 8, 15) | 1(10) | 3(10, 11, 14) | ||||
| HLA-A2 | |||||||||||
| 77–85 | 1(9) | 1(1) | 2(5,8) | 4(1, 5, 7, 9) | |||||||
| HLA-A3 | |||||||||||
| 2_10 | 1(2) | 1(6) | 2(8,9) | ||||||||
| HLA-A3 | |||||||||||
| 29–59 | 1(25) | 3(7,8,16) | 7(11, 12, 15, 19, 22, 27, 31) | 4 (3, 6, 18, 20) | 4(6 7 9 16) | 12(2, 3, 6, 7, 8, 9, 16, 18, 20, 21, 24, 26,) | 1(26) | 1(11) | 8(11 12 15 19 22 23 27 31) | ||
| HLA-A3 | |||||||||||
| 96–104 | |||||||||||
| HLA-B*3501 | |||||||||||
| 6_14 | 1(8) | 1(2) | 4(4, 5, 7, 8) | ||||||||
| HLA-B*3501 | |||||||||||
| 27_35 | 1(9) | 2(5,8) | 2(8,9) | 4(4, 5, 8, 9) | |||||||
| HLA-B*3501 | |||||||||||
| 41–49 | 1(4) | 2(3,7) | 2(6,8) | 1(4) | 4(4 6 8 9) | 2(3,7) | |||||
| HLA-B*3501 | |||||||||||
| 57–65 | 3(3 5 6) | 1(9) | 4(3, 5, 6, 11) | ||||||||
| HLA-B*3501 | |||||||||||
| 78–91 | 2(8,9) | 3(4, 7, 9) | 5(4, 6, 8, 9, 12) | 1(11) | 1(12) | ||||||
| HLA-B*3501 | |||||||||||
| 99–116 | 1(9) | 5(3, 6, 15, 16, 17) | 2(9, 12) | 1(12) | 1(7) | 3(3, 15, 17) | |||||
| HLA-B*3801 | |||||||||||
| 29–37 | 3(7, 8, 9) | 2(3, 6) | 4(6, 7, 8, 9) | 6(2 3 6 7 8 9) | |||||||
| HLA-B*3801 | |||||||||||
| 77–85 | 1(9) | 1(1) | 2(5,8) | 4(1, 5, 7, 9) | |||||||
| HLA-B*3801 | |||||||||||
| 89–97 | 3(5, 8, 9) | 1(6) | 5(4, 5, 7, 8, 9) | 5(1, 5, 6, 8, 9) | 1(1) | ||||||
| HLA-Cw*0401 | |||||||||||
| 23–37 | 3(7, 8, 9) | 2(3, 6) | 4(6, 7, 8, 9) | 6(2, 3, 6, 7, 8, 9) | |||||||
| HLA-Cw*0401 | |||||||||||
| 58–65 | 3(2,4,5) | 3(2,4,5) | |||||||||
| HLA-Cw*0401 | |||||||||||
| 78–93 | 3(8, 9 ,16) | 5(4, 7, 9, 15, 16) | 6(4, 6, 8, 9, 12, 16) | 1(11) | 1(12) | ||||||
| HLA-Cw*0702 | |||||||||||
| 27–35 | 1(9) | 2(5, 8) | 2(8,9) | 4(4, 5, 8, 9) | |||||||
| HLA-Cw*0702 | |||||||||||
| 73–86 | 3(4, 13, 14) | 1(2) | 1(5) | 4(3 9 12 14) | 7(3 4 5 9 11 13 14) | 1(2) | |||||
| HLA-DQA1*05:01 | |||||||||||
| 2_16 | 1(12) | 2(2,14) | 1(6) | 4(8 ,9, 11, 12) | |||||||
| HLA-DQA1*05:01 | |||||||||||
| 13–27 | 3(1,5,6) | 1(10) | 1(12) | 3(3,10,12) | 1(8) | 1(7) | 3(1,5,11) | ||||
| HLA-DQB1*0602 | |||||||||||
| 2_16 | 1(12) | 2(2,14) | 1(6) | 4(8 ,9, 11, 12) | |||||||
| HLA-DQB1*0602 | |||||||||||
| 17–31 | 1(4) | ||||||||||
| HLA-DQB1*0602 | |||||||||||
| 102–116 | 2(1,2) | 2(6,15) | 1(8) | 4(6, 8, 14, 15) | 1(3) | 2(1,7) | |||||
| HLA-DQB1*0301 | 3(1, 2, 3) | 2(6, 9) | 2(1, 2) | 3(1, 2, 3) | 2(6, 9) | ||||||
| HLA-G: HLA-G*01:01, HLA-G*01:02 | |||||||||||
| 4_16 | 1(10) | 1(12) | 1(4) | 4(6 7 9 10) | |||||||
| 8_21 | 3(6,10,11) | 1(8) | 1(13) | 1(12) | 5(2, 3, 5 ,6, 10) | ||||||
| 22–33 | 2(1,10) | 1(3) | 4(1,3,9,10) | 1(2) | |||||||
| 66–79 | 1(11) | 3(4,5,9) | 2(3,12) | 1(10) | 5(3, 7, 10, 11, 12) | 1(6) | 1(7) | 1(6) | 3(2,4,9) | ||
| 100–112 | 1(8) | 2(2,5) | 2(8,11) | 1(11) | 1(6) | 1(2) | |||||
| HLA-DRB1*0301 | |||||||||||
| 96–104 | 3(1,2,3) | 2(6,9) | 2(1,2) | 3(1,2,3) | 2(6,9) | ||||||
| HLA-DRB1*0305 | |||||||||||
| 96–104 | 3(1,2,3) | 2(6,9) | 2(1,2) | 3(1,2,3) | 2(6,9) | ||||||
| HLA-DRB1*0309 | |||||||||||
| 96–104 | 3(1,2,3) | 2(6,9) | 2(1,2) | 3(1,2,3) | 2(6,9) | ||||||
| HLA-DRB1*11 | |||||||||||
| 34–42 | 2(3,4) | 2(7,8) | 1(2) | 3(2,3,5) | 4(2,3,4,5) | 1(7) | 2(7,8) | ||||
The position of each epitope, the number of cleave sites in each epitope as well as the position of the cleave sites in each epitope are mentioned in the table