Literature DB >> 19238246

Implications from predictions of HLA-DRB1 binding peptides in the membrane proteins of Corynebacterium diphtheriae.

Febin Prabhu Dass J1, V L Deepika.   

Abstract

The aerobic gram positive bacterium Corynebacterium diphtheriae causes diphtheria, a respiratory tract illness characterized by symptoms such as sore throat, low fever, and an adherent membrane on the tonsils, pharynx, and/or nasal cavity. Therefore, it is important to develop preventive vaccines for diphtheria. The availability of the 2,488,635 bp long complete sequence for the C. diphtheriae genome provides an opportunity to understand cell mediated immune response using Computational Biology tools from the bacterial proteome sequence data. We selected 355 membrane proteins from the C. diphtheriae proteome using annotation data to identify potential HLA-DRB1 binding short peptide using modeling, simulations and predictions. This exercise identified 30 short peptides in membrane proteins showing binding capability to HLA-DRB1 alleles. These peptides serve as outline for the understanding of cell mediated immune response to C. diphtheriae. It should be noted that the predicted data to be verified using binding assays for further consideration.

Entities:  

Keywords:  C. diphtheriae; HLA-DRB; membrane proteins; peptide epitopes; vaccines

Year:  2008        PMID: 19238246      PMCID: PMC2639685          DOI: 10.6026/97320630004111

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Diphtheria is an epidemic disease that remains as a threat to health in the developing world [1]. The diphtheria epidemic had caused more than 157,000 cases and 5000 deaths according to WHO reports [2]. Diphtheria is caused by Corynebacterium diphtheriae, a non-sporulating, non-encapsulated, non-motile, pleomorphic gram-positive bacillus. The complete genome of the living organism Corynebacterium diphtheriae was sequenced [3]. The sequence and annotation of genome is available in DDBJ/EMBL/GenBank databases with the accession number BX248353. Immunization leads to the disappearance of toxigenic strains, but toxigenicity can be rapidly conferred on non-toxigenic strains via phage conversion. This makes the return of epidemic diphtheria a real threat when there is insufficient immunity. Hence, it is crucial and critical develop vaccines for diphtheria. Identification of potential peptide candidates for vaccine development have been shown using computational methods for pathogenic organisms elsewhere [4]. The application of proteome sequence data and Bioinformatics tool for the development of peptide vaccines for infectious diseases has gained momentum in recent years [5]. C. diphtheriae virulence factors, other than diphtheria toxin can be the potential cause to the disease. Recently, surface proteins of 67 and 72kDa, named 67-72p were isolated and related to the attachment of C. diphtheria to the human erythrocytes [1]. The pathogenicity island of the organism encode vast majority of fimbria and iron uptake system. The fimbrial system show similarities to sortases bound to surface proteins and are considered as pathogenicity factor [3]. It can be drawn that membrane related proteins could have pathogenicity property. The efficacy of immune response is associated with antigen specificity, diversity and human leukocyte antigen (HLA) alleles. [6]. There are two classes of HLA molecules, namely class I and class II that are recognized for two distinct sets of T cells, the CD8+ and CD4+ T cells, respectively [7,8]. There are evidences suggesting the association of diphtheriae antigen with class II alleles. [9] Nonetheless, the HLA molecules are highly polymorphic and there are more than 800 class II HLA alleles known till date [10]. Here, we describe the identification of potential peptides in the membrane proteins of C. diphtheriae binding to HLA DRB1*0101, DRB1*0301, DRB1*0401, DRB1*0701, DRB1*1801, DRB1*1101 and DRB1*1501 using predictions, modeling and simulations. It should be noted that these alleles have an allelic frequency of 20-50¢ [11]

Methodology

C. diphtheriae membrane protein dataset

The whole genome sequence of C. diphtheriae with annotated protein sequences is available at the EMBL/GenBank database with accession number BX248353. The genome is 2,488,635 bp long containing 2320 CDS with about 53.5¢ G+C content [3]. The annotated protein sequences for membrane or associated proteins that are annotated as putative ABC transport membrane protein, integral membrane protein, membrane protein, fimbrial protein, surface anchored protein and hypothetical membrane protein. Thus, we created a membrane protein dataset from C. diphtheriae with 355 sequences (list available from authors upon request)

Prediction of antigenic regions in membrane proteins

It is important to identify antigenic regions in the 355 membrane or associated proteins in C. diphtheriae. The antigenic subsequences from the membrane proteins were identified using the Antigenic server in EMBOSS GUI version 1.12 [12]. The server uses a semi-empirical method for the prediction of antigenic regions in a protein based on the physiochemical properties of amino acids and their frequency of occurrence in experimentally known segmental epitopes as described elsewhere [13]. The stringency level is enhanced by keeping 2.3 as a cut-off value. Peptide sequences with less than 9 residues were ignored as class 2 HLA molecules bind peptides of length 12-15 residues long [14]. Thus, we identified 1448 antigenic regions from the 355 membrane proteins. However, we further reduced the set to 30 based on antigenic score obtained from the server (Table 1 under supplementary material). We used this dataset for HLA binding peptides as described in the next section.

Prediction of T-cell peptides

PROPRED server [15] was used for the prediction of MHC class II HLA DRB1 allele binding regions in the identified antigenic regions (Table 1 under supplementary material). The server employs amino acid position coefficient tables deduced from literature by linear prediction model [16]. A threshold of 3¢ was fixed to reduce the rate of false positives. Prediction were performed for alleles DRB1*0101, 0401 and 1501. We restricted our analysis to these alleles due to availability of structures determined by X-ray crystallography at the protein databank (PDB).

Discussion

The availability of the completely sequenced genome of C. diphtheriae provides an opportunity to understand pathogenicity using a genome wide scanning and analysis. The annotated genome of C. diphtheriae contains 2320 CDS. We identified 355 membrane proteins using annotated protein information the dataset. This accounts for only 15¢ of the genome CDS. The corresponding protein sequences of 355 proteins were used as a dataset for the identification of peptide antigenic regions using ANTIGEN and PROPRED servers. The ANTIGEN server uses physiochemical properties of amino acids to determine antigen region regions in the membrane protein dataset. This exercise resulted in about 1448 short segments of length 9 or more from the 355 membrane proteins. The least score for the predicted antigenic peptide sequence is found to be 0.92 and the maximum score is 3.09. The minimum length of the peptide is kept as 9 residues in the parameter selection. The antigenic peptide sequences up to a maximum length of 54 residues were obtained. The data on immunogenic to C. diphtheriae antigen is limited except for some sporadic data. We further reduced this number to 30 segments using antigenicity score obtained from ANTIGEN (Table 1 in supplementary material). This set of peptides is used for the screening of potential binding to HLA DRB1*0101, 0401 and 1501 using PROPRED. These alleles have coverage of about 20-150¢ among different populations [11].The screening of peptide bindings to HLA DRB1*0101, 0401 and 1501 using PROPRED identified a list of peptides with positive binding scores (Table 1 under supplementary material). This list of peptides provides a framework for further investigation of C. diphtheriae towards the development of vaccine candidates. Subunit vaccines consisting of diphtheriae protein antigens present a safe and specific tool for the prevention of Diphtheria. The identification of promiscuous binding to HLA is an ideal prerequisite for the design of subunit vaccines. It should be noted that these are predicted data requiring validation using binding assays. The current challenge in synthetic vaccine design is the development of a methodology to identify and test short antigen peptides as potential T-cell epitopes. Data driven statistical methods such as PROPRED are generally available for class II DR1, and DR4. Reliable predictions of immunogenic peptides can reduce the experimental effort needed to identify new epitopes, and though reliable predictions of the MHC binding can be used to rank the possible epitopes very accurately. In this paper we have tried to accurately predict antigenic peptides using PROPRED which is a tool used to predict the MHC binding peptides. Class II MHC binding predictions need to develop a greater accuracy level, but new tools have emerged that deliver significantly improved predictions not only in terms of accuracy, but also in MHC specificity coverage. The future development of advanced tools using structured based features will significantly improve the efficiency of prediction for high true positives considerations.

Conclusion

The scanning of immunological relevant regions of the bacterial proteome sequence of C. diphtheriae is essential in the identification of specific HLA binding peptides for potential design of vaccine candidates for diphtheria. Here, we identified 30 segments using ANTIGEN at EMBOSS for further screening of these peptides to PROPRED for DRB1*0101, 0401 and 1501 specific binding. The specific segments showing positive binding score with each of these alleles is presented for further investigations using modeling towards in vitro investigations.
  15 in total

1.  Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices.

Authors:  T Sturniolo; E Bono; J Ding; L Raddrizzani; O Tuereci; U Sahin; M Braxenthaler; F Gallazzi; M P Protti; F Sinigaglia; J Hammer
Journal:  Nat Biotechnol       Date:  1999-06       Impact factor: 54.908

2.  ProPred: prediction of HLA-DR binding sites.

Authors:  H Singh; G P Raghava
Journal:  Bioinformatics       Date:  2001-12       Impact factor: 6.937

3.  The design of Jemboss: a graphical user interface to EMBOSS.

Authors:  Tim Carver; Alan Bleasby
Journal:  Bioinformatics       Date:  2003-09-22       Impact factor: 6.937

4.  Bioinformatic strategies for better understanding of immune function.

Authors:  Nikolai Petrovsky; Christian Schönbach; Vladimir Brusic
Journal:  In Silico Biol       Date:  2003

5.  Poor binding of a HER-2/neu epitope (GP2) to HLA-A2.1 is due to a lack of interactions with the center of the peptide.

Authors:  J J Kuhns; M A Batalia; S Yan; E J Collins
Journal:  J Biol Chem       Date:  1999-12-17       Impact factor: 5.157

6.  A semi-empirical method for prediction of antigenic determinants on protein antigens.

Authors:  A S Kolaskar; P C Tongaonkar
Journal:  FEBS Lett       Date:  1990-12-10       Impact factor: 4.124

Review 7.  Capture and processing of exogenous antigens for presentation on MHC molecules.

Authors:  C Watts
Journal:  Annu Rev Immunol       Date:  1997       Impact factor: 28.527

Review 8.  MHC-dependent antigen processing and peptide presentation: providing ligands for T lymphocyte activation.

Authors:  R N Germain
Journal:  Cell       Date:  1994-01-28       Impact factor: 41.582

9.  The complete genome sequence and analysis of Corynebacterium diphtheriae NCTC13129.

Authors:  A M Cerdeño-Tárraga; A Efstratiou; L G Dover; M T G Holden; M Pallen; S D Bentley; G S Besra; C Churcher; K D James; A De Zoysa; T Chillingworth; A Cronin; L Dowd; T Feltwell; N Hamlin; S Holroyd; K Jagels; S Moule; M A Quail; E Rabbinowitsch; K M Rutherford; N R Thomson; L Unwin; S Whitehead; B G Barrell; J Parkhill
Journal:  Nucleic Acids Res       Date:  2003-11-15       Impact factor: 16.971

Review 10.  Diphtheria remains a threat to health in the developing world--an overview.

Authors:  Ana Luíza Mattos-Guaraldi; Lílian Oliveira Moreira; Paulo Vieira Damasco; Raphael Hirata Júnior
Journal:  Mem Inst Oswaldo Cruz       Date:  2004-03-09       Impact factor: 2.743

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.