| Literature DB >> 30871454 |
Kanwal Naz1, Anam Naz1, Shifa Tariq Ashraf1, Muhammad Rizwan2, Jamil Ahmad2,3, Jan Baumbach4, Amjad Ali5.
Abstract
BACKGROUND: A revolutionary diversion from classical vaccinology to reverse vaccinology approach has been observed in the last decade. The ever-increasing genomic and proteomic data has greatly facilitated the vaccine designing and development process. Reverse vaccinology is considered as a cost-effective and proficient approach to screen the entire pathogen genome. To look for broad-spectrum immunogenic targets and analysis of closely-related bacterial species, the assimilation of pangenome concept into reverse vaccinology approach is essential. The categories of species pangenome such as core, accessory, and unique genes sets can be analyzed for the identification of vaccine candidates through reverse vaccinology.Entities:
Keywords: And therapeutic targets; Core genome; Microbial species; PanRV; Pangenome; Reverse vaccinology; Vaccine targets
Mesh:
Substances:
Year: 2019 PMID: 30871454 PMCID: PMC6419457 DOI: 10.1186/s12859-019-2713-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Tools and databases implemented and integrated into the PanRV modules
| Name | Function | Source |
|---|---|---|
| Prokka 1.12 | Rapid prokaryotic genome annotation tool | [ |
| Roary 1.0 | Rapid large-scale prokaryote pan genome analysis | [ |
| BLAST+ | Local alignment search | [ |
| PSORTb 3.0 | Prediction of protein subcellular localization | [ |
| HMMTOP 2.1 | Prediction of transmembrane topology | [ |
| DEG | Database of essential genes to check essentiality. | [ |
| VFDB | Virulence factors database for virulence identification | [ |
| MvirDB | Microbial virulence database for virulence identification | [ |
| RefSeq (Human Genome Resources) | Human genome database for Homology search | [ |
| ABCPred | B-Cell epitope prediction | [ |
| Propred-I | Prediction of promiscuous major histocompatibility complex (MHC) Class-I binding sites. | [ |
| Propred | Prediction of MHC Class-II binding regions in an antigen sequence. | [ |
| Vaxijen v2.0 | Antigenicity checking | [ |
| UniProt-SwissProt | Manually annotated protein sequences database with information extracted from literature for homology search and functional annotation | [ |
| COG | Functional annotation | [ |
| CARD | antibiotic resistance analysis | [ |
Comparison of PanRV with other available tools on the basis of selected features
| Features | Vaxign | NERVE | Jenner-predict | VacSol | PanRV |
|---|---|---|---|---|---|
| Pangenome Estimation | × | × | × | × | ✓ |
| Protein Localization Prediction | ✓ | ✓ | ✓ | ✓ | ✓ |
| Essential Genes Identification |
|
|
| ✓ | ✓ |
| Virulent Factor Identification |
| ✓ | ✓ | ✓ | ✓ |
| Homology Analysis with Human | ✓ | ✓ | ✓ | ✓ | ✓ |
| Homology Analysis with Gut Flora | ✓ | ✓ | ✓ | × | ✓ |
| Identification of Trans Membrane Helices | ✓ | ✓ | ✓ | ✓ | ✓ |
| Molecular Weight Estimation |
|
|
| × | ✓ |
| Epitope Mapping | ✓ |
| ✓ | ✓ | ✓ |
| Functional Annotation using COG |
|
|
| × | ✓ |
| Antibiotic Resistance Association Analysis |
|
|
| × | ✓ |
| Graphical User Interface | ✓ |
| ✓ | ✓ | ✓ |
| Downloadable Package |
| ✓ |
| ✓ | ✓ |
| Automatic Installer |
|
|
| ✓ | ✓ |
× indicates absence while ✓ indicates the presence of a specific feature in the respective tools
Fig. 1workflow of Pangenome-Reverse Vaccinology Package: Four modules of the pipeline (1) PGM (Pangenome Estimation Module), (2) RVM (Reverse Vaccinology Module), (3) FAM (Functional Annotation Module), (4) ARM (Antibiotic Resistance Association Module). PGM starts with multiple genomes files (.gff). These files are subjected to pangenome estimation pipeline (Roary), generating a pan_genome_reference along with several other supplementary files. Roary commands of query_pan_genome (union, intersection, complement) generate files (pan_genome_results, core_genome_results, accessory_genome_results). These files include gene ID (all isolates) in the respective category (Pan, Core, and Accessory) file. IDs lists (PanIDList, CoreIDList, AccessoryIDList) are picked from these files. IDs are then mapped to pan_genome_reference file and nucleotide FASTA sequences are extracted (Pangenome_Nuc, Coregene_Nuc, Accessor_Nuc, Unique_Nuc). Protein FASTA files (Pan, Core, Accessory, Unique) are generated by running a Perl script (Translator.pl). These predicted sets (Pan, Core, and Accessory) from PGM can be further subjected to RVM to identify putative vaccine candidates. Where selected pangenome category passes through each subfilter of RVM that extracts putative vaccine candidates along with their epitopes using the epitope mapping component. FAM and ARM identify functional annotation/significance and antibiotic resistance association of input FASTA file employing COG/UniProt and CARD databases, respectively. The results are displayed in CSV files
List of vaccine targets prioritized via PanRV
| RVM | ARM | FAM | ||||
|---|---|---|---|---|---|---|
| PanRV ID | Candidate Proteins (COG) | No. B cell Epitope | No. T cell Epitope | Resistance Association | COG ID | Function (UniProt)/annotation |
| 95 | Surface antigen | 4 | 9 | – | COG3942 M | ssaA2_1 |
| 169 | LysM repeat | 3 | 6 | – | COG1388 M | N-acetylmuramoyl-L-alanine amidase sle1alanine amidase sle1 |
| 262 | phosphomethylpyrimidine kinase | 2 | 2 | – | COG0351 H | Putative pyridoxine kinase |
| 323 | LysM repeat | 5 | 6 | – | COG1388 M | Probable autolysin SsaALP |
| 998 | Periplasmic serine protease, S1-C subfamily, contain C-terminal PDZ domain | 2 | 2 | – | COG0265 O | Serine protease Do-like HtrA |
| 1303 | Surface antigen | 3 | 5 | – | COG3942 M | Staphylococcal secretory antigen ssaA2_2 |
| 1306 | Surface antigen | 4 | 5 | – | COG3942 M | Staphylococcal secretory antigen ssaA2_3 |
RVM (Reverse Vaccinology Module) results include PanRV IDs of 7 prioritized proteins along with the protein names and number of B and T cell epitopes. The results of ARM (Antibiotic Resistance Association Module) are illustrated as ARO IDs (if any) and the FAM (Functional Annotation Module) results are shown as COG IDs along with their functional annotations retrieved from UniProt
Fig. 2Comparison of the execution time of PanRV with a variable number of genomes of Staphylococcus aureus. PanRV execution time includes Pangenome analysis and Reverse vaccinology analysis. The graph depicts that with increase in number of genomes execution time steadily rises