| Literature DB >> 30510237 |
Jasreet Hundal1, Susanna Kiwala1, Yang-Yang Feng1, Connor J Liu1, Ramaswamy Govindan2,3, William C Chapman4, Ravindra Uppaluri5, S Joshua Swamidass6, Obi L Griffith1,2,3,7, Elaine R Mardis8, Malachi Griffith9,10,11,12.
Abstract
Recent efforts to design personalized cancer immunotherapies use predicted neoantigens, but most neoantigen prediction strategies do not consider proximal (nearby) variants that alter the peptide sequence and may influence neoantigen binding. We evaluated somatic variants from 430 tumors to understand how proximal somatic and germline alterations change the neoantigenic peptide sequence and also affect neoantigen binding predictions. On average, 241 missense somatic variants were analyzed per sample. Of these somatic variants, 5% had one or more in-phase missense proximal variants. Without incorporating proximal variant correction for major histocompatibility complex class I neoantigen peptides, the overall false discovery rate (incorrect neoantigens predicted) and the false negative rate (strong-binding neoantigens missed) across peptides of lengths 8-11 were estimated as 0.069 (6.9%) and 0.026 (2.6%), respectively.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30510237 PMCID: PMC6309579 DOI: 10.1038/s41588-018-0283-9
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1:Overview of the pipeline
The steps required for incorporating and assessing the impact of proximal variants on neoantigen binding prediction are depicted as a flow diagram. There are three main steps. (a) Alignment and variant calling of matched tumor (pink) and normal (green) sequencing data. (b) Phasing of proximal somatic and germline variants: The pink bars represent the tumor sequence reads, with mismatches/sequencing errors shown in small gray rectangles. For a somatic variant of interest (SVOI; labeled with a red flag), we scan 89 bp on either side to assess for proximal germline or somatic SNVs (labeled with blue and orange boxes). These proximal variants are then phased together to determine linkage. Only proximal variants that are in phase (orange box) with the SVOI (red box) are considered for downstream neoantigen analysis. Other (out-of-phase) proximal variants (blue box) are ignored. (c) Neoantigen binding predictions are then assessed after performing proximal variant correction (PVC). The left panel shows the ‘uncorrected’ wildtype and mutant peptides along with their respective binding scores for a single SVOI example. The right panel shows PVC (‘corrected’) peptides and scores for this SVOI.
Figure 2:Mischaracterization of neoantigens before proximal variant correction
The effect of accounting for proximal variants in neoantigen selection is summarized in several ways (n = 380 biologically independent samples with at least one proximal variant). (a) Violin plot (distribution of all data in blue and whiskers indicating max/min values) showing the change in uncorrected neoantigen binding using the existing approach (MTuncorrected) versus PVC (MTcorrected), represented as log10 MT fold change (MTuncorrected / MTcorrected) across 8–11-mers for all variants in phase with the somatic variant of interest. (b) For 8–11-mer peptides, the False Negative Rate (FNR) (shown as orange bars) represents the number of instances when a truly strong-binding peptide was mistaken as a weak-binding peptide (MTuncorrected > 500 nM, and MT fold-change < 1.1 ). The False Discovery Rate (FDR) (shown in blue bars) represents the number of instances where a strong-binder before PVC (MTuncorrected < 500nM) is determined to have an incorrect peptide sequence as a result of a proximal variant. (c) Log10 scaled comparison of corrected versus uncorrected binding scores for 9-mer peptides considering patient-specific MHC Class I alleles. Dotted lines demarcate the binding affinity threshold of 500 nM. (d) Log10 scaled comparison of corrected versus uncorrected binding scores for 10-mer peptides considering patient-specific MHC Class I alleles.