| Literature DB >> 33999207 |
Muhammad Farhan Khalid1, Kanzal Iman1, Amna Ghafoor1, Mujtaba Saboor1, Ahsan Ali1, Urwa Muaz1, Abdul Rehman Basharat1, Taha Tahir1, Muhammad Abubakar1, Momina Amer Akhter1, Waqar Nabi2, Wim Vanderbauwhede2, Fayyaz Ahmad3, Bilal Wajid4,5,6, Safee Ullah Chaudhary1.
Abstract
PERCEPTRON is a next-generation freely available web-based proteoform identification and characterization platform for top-down proteomics (TDP). PERCEPTRON search pipeline brings together algorithms for (i) intact protein mass tuning, (ii) de novo sequence tags-based filtering, (iii) characterization of terminal as well as post-translational modifications, (iv) identification of truncated proteoforms, (v) in silico spectral comparison, and (vi) weight-based candidate protein scoring. High-throughput performance is achieved through the execution of optimized code via multiple threads in parallel, on graphics processing units (GPUs) using NVidia Compute Unified Device Architecture (CUDA) framework. An intuitive graphical web interface allows for setting up of search parameters as well as for visualization of results. The accuracy and performance of the tool have been validated on several TDP datasets and against available TDP software. Specifically, results obtained from searching two published TDP datasets demonstrate that PERCEPTRON outperforms all other tools by up to 135% in terms of reported proteins and 10-fold in terms of runtime. In conclusion, the proposed tool significantly enhances the state-of-the-art in TDP search software and is publicly available at https://perceptron.lums.edu.pk. Users can also create in-house deployments of the tool by building code available on the GitHub repository (http://github.com/BIRL/Perceptron).Entities:
Year: 2021 PMID: 33999207 PMCID: PMC8262694 DOI: 10.1093/nar/gkab368
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.PERCEPTRON software architecture. PERCEPTRON web architecture is based on Model View Controller (MVC). Users can access this web service by signing up or logging in as a guest via PERCEPTRON graphical user interface (GUI) which is compatible with multiple web browsers. Secured PERCEPTRON web application program interface (API) acquires data from the front-end including (i) user credentials and (ii) parameters for protein search query, using https protocol. This information is stored in the PERCEPTRON database (DB). PERCEPTRON Core, which consists of the algorithmic pipeline, executes search using data stored in PERCEPTRON DB and PERCEPTRON Protein Repository. Processed results for each job are then moved to PERCEPTRON DB where they are stored. API returns these results to the front-end from where users can access them using PERCEPTRON GUI.
Figure 2.PERCEPTRON Workflow. PERCEPTRON pipeline initiates with the input of MS-based top-down proteomics data in specific input file formats. As the first step, mass-based filtering of user-specified protein database is performed. The resulting candidate protein list is further filtered using sequence-derived information via peptide sequence tag (PST) extractor. Next, the updated candidate protein list is subjected to in silico fragmentation. Theoretical spectra obtained using user-specified fragmentation methods are then compared with experimental spectra. The resulting filtered candidate protein list is ranked in light of user-defined weight-based scoring of each algorithm. The final candidate protein list contains ranked proteoforms at 1% false discovery rate (FDR).