| Literature DB >> 35627168 |
Chunyu Liu1,2, Yu Zhang2,3, Xingxing Jian2, Xiaoxiu Tan2,4, Manman Lu1,2, Jian Ouyang2,5, Zhenhao Liu2, Yuyu Li1, Linfeng Xu2,3, Lanming Chen1, Yong Lin3, Lu Xie1,2.
Abstract
A proteogenomics-based neoantigen prediction pipeline, namely ProGeo-neo, was previously developed by our team to predict neoantigens, allowing the identification of class-I major histocompatibility complex (MHC) binding peptides based on single-nucleotide variation (SNV) mutations. To improve it, we here present an updated pipeline, i.e., ProGeo-neo v2.0, in which a one-stop software solution was proposed to identify neoantigens based on the paired tumor-normal whole genome sequencing (WGS)/whole exome sequencing (WES) data in FASTQ format. Preferably, in ProGeo-neo v2.0, several new features are provided. In addition to the identification of MHC-I neoantigens, the new version supports the prediction of MHC class II-restricted neoantigens, i.e., peptides up to 30-mer in length. Moreover, the source of neoantigens has been expanded, allowing more candidate neoantigens to be identified, such as in-frame insertion-deletion (indels) mutations, frameshift mutations, and gene fusion analysis. In addition, we propose two more efficient screening approaches, including an in-group authentic neoantigen peptides database and two more stringent thresholds. The range of candidate peptides was effectively narrowed down to those that are more likely to elicit an immune response, providing a more meaningful reference for subsequent experimental validation. Compared to ProGeo-neo, the ProGeo-neo v2.0 performed well based on the same dataset, including updated functionality and improved accuracy.Entities:
Keywords: bioinformatics; neoantigen; proteogenomic; tumor immunotherapy
Mesh:
Substances:
Year: 2022 PMID: 35627168 PMCID: PMC9141370 DOI: 10.3390/genes13050783
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1Workflow of ProGeo-neo v2.0. Including detection of SNV/INDEL based on tumor/normal WGS/WES data; HLA allele prediction, gene fusion detection, and gene expression detection based on tumor RNA-seq data; neoantigen screening by raw proteomics data (LC-MS/MS); neoantigen prediction (peptides-HLA class I/II); screening and filtering of candidate neoantigens.
Figure 2Schematic diagram of mutant peptides synthesis.
Figure 3Source distribution of candidate neoantigens: (A). HLA class I binding neoantigens (8–11-mer) (B). HLA class II binding neoantigens (15–30-mer).
Figure 4The number of predicted neoantigens bound to each HLA allele.
Performance comparison between ProGeo-neo v2.0 and ProGeo-neo v1.0.
| Mutant Peptides | MHC I Binders | Filtering by Gene Expression | Filtering by MS | Aff ≤ 34 nM TPM ≥ 33 | |
|---|---|---|---|---|---|
| ProGeo-neo v2.0 | 376,671 | 52,514 | 43,169 | 636 | 19 |
|
| 14.31% | 19.18% | 21.05% | ||
| ProGeo-neo v1.0 | 373,046 | 36,835 | 30,142 | 655 | |
|
| 11.45% |
Figure 5Results overview for the neoantigen discovery: (A). MHC class I binding neoantigens (8–11-mer) (B). MHC class II binding neoantigens (15–30-mer).