| Literature DB >> 30171158 |
Roman Mylonas1,2, Ilan Beer3, Christian Iseli1,2, Chloe Chong4,5, Hui-Song Pak4,5, David Gfeller2,4, George Coukos4,5, Ioannis Xenarios1,2, Markus Müller6,2, Michal Bassani-Sternberg7,2.
Abstract
Spliced peptides are short protein fragments spliced together in the proteasome by peptide bond formation. True estimation of the contribution of proteasome-spliced peptides (PSPs) to the global human leukocyte antigen (HLA) ligandome is critical. A recent study suggested that PSPs contribute up to 30% of the HLA ligandome. We performed a thorough reanalysis of the reported results using multiple computational tools and various validation steps and concluded that only a fraction of the proposed PSPs passes the quality filters. To better estimate the actual number of PSPs, we present an alternative workflow. We performed de novo sequencing of the HLA-peptide spectra and discarded all de novo sequences found in the UniProt database. We checked whether the remaining de novo sequences could match spliced peptides from human proteins. The spliced sequences were appended to the UniProt fasta file, which was searched by two search tools at a false discovery rate (FDR) of 1%. We find that 2-6% of the HLA ligandome could be explained as spliced protein fragments. The majority of these potential PSPs have good peptide-spectrum match properties and are predicted to bind the respective HLA molecules. However, it remains to be shown how many of these potential PSPs actually originate from proteasomal splicing events.Entities:
Keywords: Bioinformatics searching; De novo sequencing; Human Leukocyte Antigen; Immunology; Immunopeptidomics; Mass Spectrometry; Peptidomics; Proteasome-spliced peptides
Mesh:
Substances:
Year: 2018 PMID: 30171158 PMCID: PMC6283289 DOI: 10.1074/mcp.RA118.000877
Source DB: PubMed Journal: Mol Cell Proteomics ISSN: 1535-9476 Impact factor: 5.911
Fig. 1.(A) Motif deconvolution analysis with MixMHCp and GibbsCluster of the 9-mer LM_UniProt and LM_spliced peptides, and comparison to known logos from Immune Epitope Database and Analysis resource (IEDB) for HLA-A03:01, HLA-A23:01, and HLA-B08:01 (HLA-B15:18 has no experimental ligands in IEDB). Motifs found in LM_UniProt peptides are highly reproducible and comparable to the known motifs from IEDB, while this is not the case for motifs found in LM_spliced peptides. (B) Length distribution of the LM_UniProt and LM_spliced peptide. (C) Rate of differing PSMs for the peptides in LM_UniProt and LM_spliced. Only MS/MS scans where both search strategies reported a match at FDR of 1% were considered.
Fig. 2.(A) Example of MS/MS annotation of endogenous HLA-Ip identified as an LM_spliced peptide (DHAQQPYSM), (upper panel) MS/MS of the synthetic counterpart of the LM_spliced (lower panel). (B) MS/MS annotation of the same endogenous HLA-Ip as an alternative UniProt peptide (DHRSEQSSM, upper panel), and MS/MS of the synthetic counterpart of the alternative UniProt peptide (lower panel). (C) The cosine similarity score calculated for the 21 pairs of MS/MS spectra of LM_spliced peptides and their synthetic counterparts and the pairs of the alternative sequences from UniProt and their synthetic peptides.
Fig. 3.(A) Scheme of the de novo based pipeline to identify possible spliced peptides. (B) The splicing gap in relation to the number of TagPep matches. The number of TagPep matches is the number of returned hits with a splicing gap shorter than a given value. Hits with the same splicing position and splicing gap are merged and counted as one hit. (C) Distribution of 8–14-mer peptides predicted by NetMHCpan as binders and nonbinders among the DeNovo_spliced and UniProt peptides identified in Mel16, Mel15, RA957, and Fib samples, and in addition also LM_spliced peptides.
Fig. 4.(A) Histogram of splicing positions within 9-mers for the DeNovo_spliced peptides identified by both Comet and MaxQuant. (B) Histogram of LM_spliced peptides splicing positions. (C) MaxQuant delta scores as a function of the splicing position within the 9-mers DeNovo_spliced peptides identified by both Comet and MaxQuant.
Summary of the level of agreement in scan matching and peptide identifications between Mascot (from Liepe et al.), MaxQuant, and Comet for the different subset of peptides identified in one raw file of immunopeptidomics data form the Fib dataset