| Literature DB >> 31607941 |
Antonios Vekris1, Eleftherios Pilalis2,3, Aristotelis Chatziioannou2,3, Klaus G Petry1.
Abstract
Phage Display is a powerful method for the identification of peptide binding to targets of variable complexities and tissues, from unique molecules to the internal surfaces of vessels of living organisms. Particularly for in vivo screenings, the resulting repertoires can be very complex and difficult to study with traditional approaches. Next Generation Sequencing (NGS) opened the possibility to acquire high resolution overviews of such repertoires and thus facilitates the identification of binders of interest. Additionally, the ever-increasing amount of available genome/proteome information became satisfactory regarding the identification of putative mimicked proteins, due to the large scale on which partial sequence homology is assessed. However, the subsequent production of massive data stresses the need for high-performance computational approaches in order to perform standardized and insightful molecular network analysis. Systems-level analysis is essential for efficient resolution of the underlying molecular complexity and the extraction of actionable interpretation, in terms of systemic biological processes and pathways that are systematically perturbed. In this work we introduce PepSimili, an integrated workflow tool, which performs mapping of massive peptide repertoires on whole proteomes and delivers a streamlined, systems-level biological interpretation. The tool employs modules for modeling and filtering of background noise due to random mappings and amplifies the biologically meaningful signal through coupling with BioInfoMiner, a systems interpretation tool that employs graph-theoretic methods for prioritization of systemic processes and corresponding driver genes. The current implementation exploits the Galaxy environment and is available online. A case study using public data is presented, with and without a control selection.Entities:
Keywords: Galaxy platform; Gene Ontology; Reactome; biological interpretation; enrichment analysis; network analysis; phage display
Year: 2019 PMID: 31607941 PMCID: PMC6769401 DOI: 10.3389/fphys.2019.01160
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.566
FIGURE 1Simplified view of the workflow. (A) Represents the construction of the Mock repertoire, (B) the mapping and scoring processes, and (C) the BioInfoMiner analysis of the proteins/genes of interest.
FIGURE 2PepSimili main input and outputs. (A) The input form of the PepSimili automated workflow. Confidence level refers to the z-score cut-off, (B) output proteins, ranked by m-score, (C) output tabular file reporting the hits, (D) a heatmap visualizing the mapping of prioritized genes (x axis) to systemic processes (y axis).
FIGURE 3Mappings on a small portion of the WASF1 protein. The profile of the mappings of the HUVEC_tcm peptides (HUVEC under tumor conditioned medium) on the WASF1 protein is shown here, between residues 246 and 266. The x-axis presents the sequence, the y-axis the amino acid hits. Insert (a) shows the position of this particular stretch on the complete profile of the protein (red peek). Insert (b) shows in the first row the amino acid sequence, in second row the position of the amino acids and in third row the total aah per residue. Further below are shown the 13 peptides defining this stretch, positioned in frame with their similar sequence on the protein.
Summary of the genes prioritized by BioInfoMiner (GO and Reactome).