| Literature DB >> 32913912 |
Pooja Ramesh1, Vidhyavathy Nagarajan1, Vartika Khanchandani1, Vasanth Kumar Desai1, Vidya Niranjan1.
Abstract
BACKGROUND: Cancer that arises from epithelial cells of the esophagus is called esophagus squamous cell carcinoma (ESCC) and is mostly observed in developing nations. Evaluation of cancer genomes and its regulation into proteins plays a predominant role in understanding the cancer progressions. Mass-spectrometry-based proteomics is a consequential tool to estimate proteomic variation and posttranslational modifications (PTMs) from standard protein databases. Post-translational modifications play a crucial role in protein folding and PTMs can be accounted for as a biological signal to interpret the structural changes and transition order of proteins. Functional validation of cancer-related mutations can explain the effects of mutations on genes and the identification of Oncogenes and tumor suppressor genes. Therefore, we present a study on protein variations to interpret the structural changes and transition order of proteins in ESCC carcinogenesis.Entities:
Keywords: Bioinformatics; Bottom-up proteomics; Cancer research; Esophageal squamous cell carcinoma; Genetics; Oncology; PTM; Proteogenomics; RNA-seq proteogenomics
Year: 2020 PMID: 32913912 PMCID: PMC7472856 DOI: 10.1016/j.heliyon.2020.e04813
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
High through-put RNA-seq data for ESCC samples obtained from NCBI-SRA repository. Repository information is listed in Table S-1. Abbreviations: poly(A)+: Polyadenylation tail to mRNA.
| Cell Line | SRR ID | RNA-Library Preparation | Sequence Type | Sample Type | Read length (bp) | Read count | Size of Sample (GB) |
|---|---|---|---|---|---|---|---|
| esophageal squamous cell carcinoma | SRR2678176 | Poly(A)+ | Paired | Adult Male | 100 | 6.5x109 | 3.8Gb |
| esophageal squamous cell carcinoma | SRR2678178 | Poly(A)+ | Paired | Adult Female | 100 | 4.2 × 109 | 2.5Gb |
| esophageal squamous cell carcinoma | SRR2678180 | Poly(A)+ | Paired | Adult Male | 100 | 4.8 × 109 | 2.9Gb |
| esophageal squamous cell carcinoma | SRR2678170 | Poly(A)+ | Paired | Adult Male | 100 | 3.9 × 109 | 2.5Gb |
| esophageal squamous cell carcinoma | SRR2678360 | Poly(A)+ | Paired | Adult Male | 100 | 4.1 × 109 | 2.5Gb |
| esophageal squamous cell carcinoma | SRR2678324 | Poly(A)+ | Paired | Adult Male | 100 | 5.5 × 109 | 3.3Gb |
| esophageal squamous cell carcinoma | SRR2678264 | Poly(A)+ | Paired | Adult Male | 100 | 5 × 109 | 3Gb |
| esophageal squamous cell carcinoma | SRR2678182 | Poly(A)+ | Paired | Adult Female | 100 | 5.4 × 109 | 3.2Gb |
| esophageal squamous cell carcinoma | SRR2678172 | Poly(A)+ | Paired | Adult Male | 100 | 5.5 × 109 | 3.3Gb |
| esophageal squamous cell carcinoma | SRR2678162 | Poly(A)+ | Paired | Adult Female | 100 | 4.4 × 109 | 2.6Gb |
Figure 1Overview of the RNA-seq Proteogenomics workflow with G-PTM search strategy for identification of proteomic variants and PTM peptides. RNA-seq is analyzed using Galaxy package for sequence variants and sample-specific database is generated containing SAV peptides, NSJ peptides and uniprot protein sequence with site-specific PTM annotations. Spectral matching is performed using the Metamorephus tool using G-PTM search strategy.
Figure 2Diverse types of PTMs obtained by G-PTM search strategy for 10 ESCC samples. Figure 2.1: Various types of Acetylation and most numbers of PSMs accounted for N-acetylalanine PTM. Figure 2.2 In Phosphorylation, Phosphoserine PTM has the maximum number of PSMs. Figure 2.3: 4-Hydroxyproline shows most of PSMs in Hydroxylation. Figure 2.4: illustrates most of PTMs are recorded for Carbamidomethyl PTM.