| Literature DB >> 33365374 |
Vassiliki Kotoula1,2, Kyriakos Chatzopoulos1,2, Kyriaki Papadopoulou1, Eleni Giannoulatou3,4, Georgia-Angeliki Koliou5, Vasilios Karavasilis6, Elissavet Pazarli7, Stavroula Pervana7, Georgia Kafiri8, Georgios Tsoulfas9, Sofia Chrisafi1, Helen Sgouramali1, Pavlos Papakostas10, Dimitrios Pectasides10, Prodromos Hytiroglou2, George Pentheroudakis11,12, George Fountzilas1,13,14.
Abstract
Genotypic and phenotypic comparisons of tumors in multiple tissue samples from the same patient are important for understanding disease evolution and treatment possibilities. Panel NGS genotyping is currently widely used in this context, whereby NGS variant filtering and final evaluation constitute the basis for meaningful comparisons. Here, we present the genotype data used for genotype / phenotype comparisons between matched primary / metastatic colorectal tumors in the work by Chatzopoulos et al (doi: 10.1016/j.humpath.2020.10.009), as well as the process followed for obtaining these data. We describe key issues while processing routinely formalin-fixed paraffin-embedded (FFPE) tumors for genotyping, NGS application (Ion Torrent), a stringent variant filtering algorithm for genotype analyses in FFPE tissues and particularly in matched tumor samples, and provide the respective datasets. Apart from research, tumor NGS genotyping is currently applied for clinical diagnostic purposes in Oncology. The datasets and method description provided herein (a) are important for comprehending the peculiarities of FFPE tumor genotyping, which is still mostly based on principles of germline DNA genotyping; (b) can be used in pooled analyses, e.g., of primary / metastatic tumors for the investigation of tumor evolution.Entities:
Keywords: Ampliseq panel; Colorectal cancer; Formalin fixed paraffin embedded tissue; Matched sample genotypes; Next generation sequencing; Targeted NGS; Tumor cell content
Year: 2020 PMID: 33365374 PMCID: PMC7749371 DOI: 10.1016/j.dib.2020.106646
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1FFPE tumor processing for molecular studies. A – C: Histologic evaluation of hematoxylin & eosin (H&E) sections with examples of common pitfalls in TCC% evaluation. In A, the tumor area of interest is marked on the H&E section avoiding necrotic, hemorrhagic and normal tissue elements. In B, higher magnification of the section in A is shown; malignant cells occupy 70% of this tumor area. However, the surrounding dense lymphocytic infiltrates by far outnumber malignant cells. Hence, the tumor DNA percentage in the extracted sample will be only 30%. In C, malignant cells appear to occupy 30% of this tumor area (the tumor produces a lot of mucin but also has desmoplastic stroma). However, the malignant cells outnumber non-malignant cells in this case and the tumor DNA percentage in this sample will be 80%. D: The process of microdissection for enrichment of molecular templates in tumor DNA. Unstained sections (5 – 10 micron thick) are oriented on simple sterilized glass slides to the respective H&E section that has been marked for the tumor area of interest → By using a thin scalpel, the tissue corresponding to the marked area is removed from the unstained section (arrow) directly after deparaffinization and partial rehydration → All macrodissected tissue fragments are placed in an Eppendorf tube with lysis buffer → Proteinase K is added to the lysis buffer and the tissue fragments are allowed to dissolve completely.
FFPE tumor DNA content (TCC%) and NGS quality metrics for informative samples
| Tumor DNA % | Mean Depth | Uniformity % | Number of Variants | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Primary | Meta | Primary | Meta | Blood | Primary | Meta | Blood | Primary | Meta | Blood | ||
| N Samples | 69* | 68* | 71 | 70 | 49 | 71 | 70 | 49 | 71 | 70 | 49 | |
| Quantiles | ||||||||||||
| 100.00% | maximum | 90 | 90 | 14391.0 | 10669.0 | 5273.0 | 91.6 | 91.8 | 88.4 | 298.0 | 340.0 | 40.0 |
| 99.50% | 90 | 90 | 14391.0 | 10669.0 | 5273.0 | 91.6 | 91.8 | 88.4 | 298.0 | 340.0 | 40.0 | |
| 97.50% | 90 | 90 | 13235.8 | 9194.2 | 5085.0 | 91.5 | 91.3 | 88.4 | 211.6 | 311.3 | 38.8 | |
| 90.00% | 85 | 80 | 4218.2 | 4169.0 | 1901.0 | 88.9 | 89.5 | 87.7 | 73.6 | 165.8 | 33.0 | |
| 75.00% | quartile | 75 | 70 | 3130.0 | 3432.3 | 1081.0 | 87.4 | 87.9 | 86.6 | 40.0 | 39.0 | 28.5 |
| 50.00% | median | 60 | 50 | 1931.0 | 1570.0 | 857.9 | 85.3 | 85.1 | 84.5 | 35.0 | 32.5 | 25.0 |
| 25.00% | quartile | 40 | 35.25 | 513.4 | 694.5 | 731.7 | 81.6 | 78.9 | 80.2 | 31.0 | 30.0 | 19.0 |
| 10.00% | 35 | 24 | 392.7 | 502.5 | 565.7 | 75.8 | 73.0 | 75.1 | 28.0 | 27.0 | 16.0 | |
| 2.50% | 17.5 | 5 | 362.0 | 399.2 | 509.0 | 45.8 | 56.9 | 70.6 | 15.8 | 18.6 | 12.3 | |
| 0.50% | 10 | 2.5 | 358.7 | 366.8 | 509.0 | 41.5 | 55.2 | 70.5 | 15.0 | 17.0 | 12.0 | |
| 0.00% | minimum | 10 | 2.5 | 358.7 | 366.8 | 509.0 | 41.5 | 55.2 | 70.5 | 15.0 | 17.0 | 12.0 |
| Mean | 58.2 | 51.6 | 2381.3 | 2281.0 | 1136.1 | 82.7 | 82.2 | 83.0 | 48.7 | 52.7 | 24.1 | |
| Std Dev | 19.0 | 22.1 | 2623.5 | 2032.8 | 925.7 | 8.8 | 8.1 | 4.8 | 46.8 | 62.2 | 6.2 | |
Abbreviations: NGS = next generation sequencing; meta = metastasis; Std Dev = standard deviation
Fig. 2Common / rare SNPs and mutations as assessed in this dataset. A. Number of cases (X axis) with tumors mutated in the respective genes. B. Paired comparisons of variant allele frequencies (VAFs or freqVAR1) for common and rare SNPs between primary / metastatic tumors (1081 comparisons; correlation 0.8204; 95%CI 0.80-0.84), primary tumor / blood (695 comparisons; correlation 0.7406; 95%CI 0.70-0.77), metastatic tumor / blood (686 comparisons; correlation 0.7219; 95%CI 0.68-0.76). C. Paired comparisons of shared mutation VAFs between primary / metastatic tumors (107 comparisons; correlation 0.5465; 95%CI 0.39-0.67).
Fig. 3Distribution of mutations in primary and matched metastatic colorectal carcinomas. The map shows shared and private mutations in matched primary / metastatic tumor samples, along with mutation types (color key), number of mutations per sample (top) and prevalence of mutated genes among tumors. Tumor pairs are shown in columns; the lighter color shade stands for the primary, the darker shade for the metastatic tumor. The map was created with the free R software.
Germline variants in the examined samples
| Chromosome:Position | Gene | Protein | Coding | ClinVar (Year) | Sample | TCC% | Genotype | Position Coverage | Variant Coverage | VAF |
|---|---|---|---|---|---|---|---|---|---|---|
| 16:68849520 | CDH1 | p.Val475Met | c.1423G>A | VUS (2018) | ||||||
| blood | G/A | 215 | 116 | 0.54 | ||||||
| primary | 50 | G/A | 1443 | 798 | 0.55 | |||||
| metastatic | 10 | G/A | 1976 | 1,056 | 0.53 | |||||
| 7:116403245 | MET | p.Val854Ile | c.2560G>A | unknown (2020) | ||||||
| blood | G/A | 842 | 355 | 0.42 | ||||||
| primary | 65 | G/A | 1287 | 420 | 0.33 | |||||
| metastatic | 75 | G/A | 589 | 71 | 0.12 | |||||
| 2:212812268 | ERBB4 | p.Arg103His | c.308G>A | unknown (2020) | ||||||
| blood | C/T | 1153 | 582 | 0.5 | ||||||
| primary | 65 | C/T | 1614 | 635 | 0.39 | |||||
| metastatic | 0 | C/T | 1994 | 829 | 0.42 | |||||
| 16:68862107 | CDH1 | p.Arg732Leu | c.2195G>T | VUS (2019) | ||||||
| blood | G/T | 623 | 299 | 0.48 | ||||||
| primary | not evaluable | G/T | 1986 | 993 | 0.5 | |||||
| metastatic | not evaluable | G/T | 797 | 354 | 0.44 | |||||
| 2:48030612 | MSH6 | p.Arg1076Cys | c.3226C>T | likely patho (2018) | ||||||
| blood | C/T | 876 | 431 | 0.49 | ||||||
| primary | 65 | C/T | 131 | 68 | 0.52 | |||||
| metastatic | 75 | NA | failed | failed | failed |
VUS: variant of unknown significance;
: rare SNP (5000 Exomes); patho: pathogenic; TCC%: tumor cell content; VAF: variant allele frequence (variant load)
| Subject | Health and medical sciences |
| Specific subject area | Molecular Pathology; Pathology; cancer genotyping; tumor evolution |
| Type of data | Targeted next generation sequencing; custom panel; multiple matched samples; formalin-fixed paraffin-embedded (FFPE) tissue; germline DNA |
| How data were acquired | Ampliseq panel sequencing (Ion Torrent; Proton sequencer); tissue sectioning and staining; light microscope; macrodissection |
| Data format | Filtered and analysed NGS data; raw microscopy data; processed histology report data |
| Parameters for data collection | FFPE tissue blocks; histology reports; matched primary and metastatic tumors; FFPE and germline DNA |
| Description of data collection | Microscopic evaluation of hematoxylin and eosin stained FFPE tissue sections for determination of histopathologic parameters and tumor cell content (TCC%). Macrodissection, DNA extraction and measurement. FFPE and germline NGS genotyping (Ion Torrent), library construction, library evaluation, sequencing on an Ion Proton sequencer, variant calling, multiple steps for variant filtering in matched tumor samples, and variant annotation. |
| Data source location | Laboratory of Molecular Oncology (Hellenic Foundation for Cancer Research / Aristotle University of Thessaloniki), Thessaloniki, Greece; Hellenic Cooperative Oncology Group (HeCOG), Athens, Greece |
| Data accessibility | Matched genotype datasets suitable for pooled analyses are hosted with the article. Raw data can be publicly accessed through HeCOG server at the link below: |
| Related research article | K. Chatzopoulos, V. Kotoula, G.A. Koliou, E. Giannoulatou, K. Papadopoulou, V. Karavasilis et al. Genotype – phenotype associations in colorectal adenocarcinomas and their matched metastases. Hum Pathol. 2020 Nov 5:S0046-8177(20)30215-X. doi: |