| Literature DB >> 24790156 |
Stephen J Goodswen1, Paul J Kennedy1, John T Ellis1.
Abstract
UNLABELLED: We present Vacceed, a highly configurable and scalable framework designed to automate the process of high-throughput in silico vaccine candidate discovery for eukaryotic pathogens. Given thousands of protein sequences from the target pathogen as input, the main output is a ranked list of protein candidates determined by a set of machine learning algorithms. Vacceed has the potential to save time and money by reducing the number of false candidates allocated for laboratory validation. Vacceed, if required, can also predict protein sequences from the pathogen's genome.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24790156 PMCID: PMC4207429 DOI: 10.1093/bioinformatics/btu300
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Vacceed framework. A set hierarchal structure exists for the execution of all Vacceed scripts e.g. startup → master script → resource script → subordinate script (only three resources are shown to maintain clarity)
Fig. 2.Extract of a Vacceed configuration file defined by a header-key format (only one resource, WoLF PSORT, is shown for brevity)
Fig. 3.Schematic of data flow in Vacceed
Programs currently integrated in Vacceed
| Name | Function | URL (last viewed May 2014) |
|---|---|---|
| Part A—Build proteome | ||
| Augustus | ||
| GlimmerHMM | ||
| BLAT | Aligns expressed sequence tags (ESTs) to DNA | |
| GMAP | Aligns expressed sequence tags (ESTs) to DNA | |
| N-Scan | ||
| BLASTN | Finds regions of similarity between nucleotide sequences | |
| BLASTP | Finds regions of similarity between protein sequences | |
| Part B—Run pipeline (vaccine candidate discovery) | ||
| WoLf PSORT | Protein subcellular localization prediction | |
| SignalP | Predicts presence and location of signal peptide cleavage sites | |
| TargetP | Protein subcellular localization prediction | |
| Phobius | Combined transmembrane topology and signal peptide predictor | |
| TMHMM | Prediction of transmembrane helices in proteins | |
| MHC I-binding | Peptide binding to MHC class I molecules | |
| MHC II-binding | Peptide binding to MHC class II molecules | |
Fig. 4.Examples of evidence profiles and a ranked vaccine candidate list (only four proteins out of potentially thousands constituting the target pathogen are shown for brevity)