| Literature DB >> 34522901 |
Ting-You Wang1, Rendong Yang1,2.
Abstract
Exitron splicing (EIS) events in cancers can disrupt functional protein domains to cause cancer driver effects. EIS has been recognized as a new source of tumor neoantigens. Here, we describe an integrated protocol for EIS and EIS-derived neoantigen identification using RNA-seq data. The protocol constitutes a step-by-step guide from data collection to neoantigen prediction. For complete details on the use and execution of this protocol, please refer to Wang et al. (2021).Entities:
Keywords: Bioinformatics; Cancer; Genetics; Genomics; Immunology; RNAseq
Mesh:
Substances:
Year: 2021 PMID: 34522901 PMCID: PMC8424586 DOI: 10.1016/j.xpro.2021.100788
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
Figure 1Flow chart showing exitron and exitron-derived neoantigens detection with ScanExitron and ScanNeo
The identified exitrons in the example data set
| chrm:start-end | ao | strand | gene_symbol | Length | splice_site | pso | psi | dp |
|---|---|---|---|---|---|---|---|---|
| chr22:29489329–29489390 | 169 | + | NEFH | 60 | GC-AG | 0.261 | 0.739 | 648 |
| chr22:29489371–29489432 | 80 | + | NEFH | 60 | GT-AG | 0.115 | 0.885 | 696 |
| chr22:29489593–29489618 | 36 | + | NEFH | 24 | GC-AG | 0.0848 | 0.915 | 424 |
Figure 2Three exitron splicing (EIS) events identified in NEFH gene loci by ScanExitron from the example RNA-seq data
The predicted exitron-derived neoantigens in the example data set
| Chrom | Start | Stop | Gene name | HLA allele | Peptide length | MT epitope seq | WT epitope seq | Best MT score method | Best MT score | Corresponding WT score |
|---|---|---|---|---|---|---|---|---|---|---|
| chr22 | 29489329 | 29489389 | NEFH | HLA-B∗07:02 | 9 | SPPEAKSPA | SPPEAKSPE | NetMHCpan | 399.52 | 7247.45 |
Figure 3Tumor-specific exitron (TSE) splicing events detection in PRAD cohort
(A) The proportion of frameshift and inframe TSEs in PRAD tumors.
(B) The proportion of genes with and without exitrons in PRAD tumors.
(C) Exitron size distribution of TSEs identified in PRAD tumors.
(D) PSO distribution of TSEs identified in PRAD tumors.
Figure 4The loads of TSEs, frameshift TSEs, inframe TSEs, neoantigen-yielding TSEs, neoantigen-yielding frameshift TSEs, neoantigen-yielding inframe TSEs, and putative TSE neoantigens in PRAD tumors
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Example data (example.bam file) | This paper | |
| RNA-Seq data from TCGA PRAD cohort | NCI Genomic Data Commons | |
| HLA types for TCGA cohort | ||
| GENCODE human gene annotations | ||
| Human reference genome NCBI build 38, GRCh38 | Genome Reference Consortium | |
| HISAT2 | RRID:SCR_015530; | |
| ScanExitron | ||
| Pyfaidx v0.5.9.2 | ||
| SamTools v1.12 | RRID:SCR_00210; | |
| BEDTools v2.26.0 | RRID:SCR_006646; | |
| RegTools v0.4.2 | ||
| ScanNeo | RRID:SCR_019253; | |
| transIndel v2.0 | ||
| OptiType v1.2 | ||
| Yara aligner v1.0.2 | ||
| Variant Effect Predictor v102.0 | RRID:SCR_007931; | |
| Sambamba v0.8.0 | ||
| IEDB MHC class I peptide binding prediction tools v3.1 | ||
| BWA v0.7.17 | RRID:SCR_010910; | |
| PyVCF v0.6.8 | N/A | |
| Picard v2.24.0 | Broad Institute | |
| HDF5 v1.10.4 | The HDF Group | |
| Tabix v1.12 | RRID:SCR_00210; | |
| PC with 4 CPU cores and 16GB RAM | AMD | N/A |
| HPC system with 16 CPU cores and 64GB RAM | AMD | N/A |