| Literature DB >> 30709382 |
Takahiko Koyama1, Kahn Rhrissorrakrai2, Laxmi Parida2.
Abstract
BACKGROUND: Significant numbers of variants detected in cancer patients are often left labeled only as variants of unknown significance (VUS). In order to expand precision medicine to a wider population, we need to extend our knowledge of pathogenicity and drug response in the context of VUS's.Entities:
Keywords: GENIE; Precision medicine; Recurrent variants; Variant disparity; Variant of unknown significance
Mesh:
Substances:
Year: 2019 PMID: 30709382 PMCID: PMC6359859 DOI: 10.1186/s12885-019-5313-1
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Process flow diagram of filters to remove variants
Fig. 2Variant recurrence in GENIE samples. Histogram indicates the number of variants (y-axis, log-scale) that occur at a given frequency (x-axis). As the frequency of recurrence increases, the number of variants decreases. However, a sizable number of variants observed in over 100 samples, which are listed in Table 1
Hall of fame variants that appear in over 100 samples
| Gene | AA change | CDS change | Counts |
|---|---|---|---|
| KRAS | p.Gly12Asp | c.35G > A | 711 |
| BRAF | p.Val600Glu | c.1799 T > A | 615 |
| KRAS | p.Gly12Val | c.35G > T | 607 |
| PIK3CA | p.Glu545Lys | c.1633G > A | 524 |
| PIK3CA | p.His1047Arg | c.3140A > G | 500 |
| KRAS | p.Gly12Cys | c.34G > T | 449 |
| TP53 | p.Arg175His | c.524G > A | 370 |
| PIK3CA | p.Glu542Lys | c.1624G > A | 332 |
| IDH1 | p.Arg132His | c.395G > A | 323 |
| KRAS | p.Gly13Asp | c.38G > A | 260 |
| TP53 | p.Arg273His | c.818G > A | 243 |
| TP53 | p.Arg273Cys | c.817C > T | 242 |
| TP53 | p.Arg248Gln | c.743G > A | 223 |
| TP53 | p.Arg248Trp | c.742C > T | 211 |
| TP53 | p.Arg282Trp | c.844C > T | 182 |
| TP53 | p.Arg213* | c.637C > T | 179 |
| NRAS | p.Gln61Arg | c.182A > G | 173 |
| EGFR | p.Leu858Arg | c.2573 T > G | 160 |
| EGFR | p.Glu746_Ala750del | c.2236_2250delGAATTAAGAGAAGCA | 159 |
| AKT1 | p.Glu17Lys | c.49G > A | 155 |
| KRAS | p.Gly12Ala | c.35G > C | 152 |
| TP53 | p.Arg342* | c.1024C > T | 135 |
| NRAS | p.Gln61Lys | c.181C > A | 123 |
| TP53 | p.Arg196* | c.586C > T | 110 |
| IDH1 | p.Arg132Cys | c.394C > T | 109 |
| KRAS | p.Gln61His | c.183A > T | 108 |
| TP53 | p.Tyr220Cys | c.659A > G | 108 |
| APC | p.Arg1450* | c.4348C > T | 102 |
| APC | p.Arg876* | c.2626C > T | 100 |
| FGFR3 | p.Ser249Cys | c.746C > G | 100 |
Top mutated genes in GENIE
| Gene | Variants observed |
|---|---|
| TP53 | 8083 |
| KRAS | 2811 |
| PIK3CA | 2693 |
| APC | 2674 |
| KMT2D | 1980 |
| ARID1A | 1494 |
| PTEN | 1313 |
| EGFR | 1212 |
| ATM | 1202 |
| NF1 | 1122 |
| BRAF | 1116 |
| BRCA2 | 949 |
| NOTCH1 | 924 |
| RB1 | 917 |
| ATRX | 887 |
| SETD2 | 845 |
| CREBBP | 840 |
| CDKN2A | 788 |
| ERBB4 | 740 |
| KMT2A | 739 |
| SMAD4 | 733 |
| ARID1B | 719 |
| SMARCA4 | 696 |
| ROS1 | 688 |
| FBXW7 | 687 |
| EP300 | 669 |
| ARID2 | 649 |
| PTPRD | 647 |
| TET2 | 642 |
| DNAPK | 639 |
| Others | 70,732 |
Total number of distinct variants in each classification of interest is shown
| Variant Classification | Distinct Variants | Total Counts |
|---|---|---|
| Novel | 55,192 | 59,544 |
| ExAC | 6907 | 8237 |
| Recurrent | 4561 | 25,378 |
| Intronic | 1039 | 1205 |
| Hotspot indels | 1211 | 1459 |
| Inactivating TSG | 8834 | 10,294 |
| GENIE recurrent | 730 | 2598 |
| VUS | 56,032 | 60,195 |
| Novel VUS | 45,933 | 48,203 |
| GENIE COSMIC recurrent | 2586 | 5372 |
Fig. 3Variants classified according to filters. The percent of variants classified by each of the following filters: ExAC – variants with similar or higher frequencies in ExAC; Recurrent – variants detected in ≥3 samples in COSMIC; Intronic – variants found in introns excluding splice junctions; Inactivating variant in TSG – likely inactivating factors that occur in tumor suppressor gene; GENIE recurrent – variants detected in ≥3 samples in GENIE and < 3 samples in in COSMIC; Potential artifacts – variants occurring only from a single sequencing center; and VUS – all remaining variants are considered variants of unknown significance. Newly retrieved recurrent variants revealed in this study accounts for 3% (GENIE recurrent)
Number of recurrent GENIE variants that are underrepresented in COSMIC (< 3 samples)
| Minimum number of recurrences | GENIE recurrent variants | GENIE and COSMIC combined recurrent variants |
|---|---|---|
| 10 | 4 | 5 |
| 9 | 4 | 7 |
| 8 | 5 | 15 |
| 7 | 15 | 44 |
| 6 | 40 | 100 |
| 5 | 87 | 264 |
| 4 | 236 | 740 |
| 3 | 730 | 2586 |
List of highly recurrent GENIE variants (≥ 6 samples) that are underrepresented in COSMIC (< 3 samples)
| Gene | AA change | CDS change | GENIE count | COSMIC count | Cancer types |
|---|---|---|---|---|---|
|
| p.A179T | c.535G > A | 19 | 1 | Non-Small Cell Lung Cancer(13);Melanoma(2);Colorectal Cancer(4) |
| ERBB3 | p.E928G | c.2783A > G | 14 | 2 | Melanoma(1);Colorectal Cancer(2);Bladder Cancer(2);Hepatobiliary Cancer(1);Small Bowel Cancer(1);Breast Cancer(6);Esophagogastric Cancer(1) |
| CDKN2A | p.P75S | c.223C > T | 13 | 1 | Prostate Cancer(1);Colorectal Cancer(2);Ovarian Cancer(5);Non-Small Cell Lung Cancer(1);Endometrial Cancer(2);Breast Cancer(1);Cervical Cancer(1) |
| SMO | p.L23dup | c.67_69dupCTG | 11 | 2 | Thyroid Cancer(1);Melanoma(1);Glioma(2);Leukemia(2);Non-Small Cell Lung Cancer(2);Cancer of Unknown Primary(1);Endometrial Cancer(1);Breast Cancer(1) |
| SMARCA4 | p.R1189Q | c.3566G > A | 8 | 2 | Glioma(1);Bladder Cancer(3);Renal Cell Carcinoma(1);Cancer of Unknown Primary(1);Breast Cancer(1);Esophagogastric Cancer(1) |
| CDKN2A | p.E69G | c.206A > G | 7 | 0 | Non-Small Cell Lung Cancer(6);Colorectal Cancer(1) |
| ERBB3 | p.M91I | c.273G > A | 7 | 2 | Bladder Cancer(6);Endometrial Cancer(1) |
| ERBB3 | p.K329E | c.985A > G | 7 | 0 | Bladder Cancer(4);Breast Cancer(1);Colorectal Cancer(1);Esophagogastric Cancer(1) |
| ERCC2 | p.N238S | c.713A > G | 7 | 0 | Bladder Cancer(5);Breast Cancer(2) |
| FANCA | p.K1283R | c.3848A > G | 7 | 0 | Bladder Cancer(1);Endometrial Cancer(1);Hepatobiliary Cancer(1);Breast Cancer(3);Leukemia(1) |
| FLT1 | p.R501K | c.1502G > A | 7 | 0 | Melanoma(1);Ovarian Cancer(1);Non-Small Cell Lung Cancer(2);Cancer of Unknown Primary(1);Breast Cancer(1);Cervical Cancer(1) |
| FLT4 | p.P30fs | c.88delC | 7 | 2 | Non-Small Cell Lung Cancer(1);Colorectal Cancer(6) |
| IKZF1 | p.D22N | c.64G > A | 7 | 0 | Colorectal Cancer(1);Non-Small Cell Lung Cancer(1);Melanoma(3);Glioma(1);Embryonal Tumor(1) |
| KDR | p.S265 L | c.794C > T | 7 | 0 | Myelodysplasia(1);Melanoma(1);Colorectal Cancer(1);Sellar Tumor(1);Non-Small Cell Lung Cancer(2);“Skin Cancer, Non-Melanoma”(1) |
| PIK3CB | p.R604fs | c.1809dupC | 7 | 1 | Bladder Cancer(1);“Skin Cancer, Non-Melanoma”(1);Breast Cancer(3);Endometrial Cancer(1);Esophagogastric Cancer(1) |
| APC | p.T1160K | c.3479C > A | 6 | 0 | Ovarian Cancer(1);Salivary Gland Cancer(1);Non-Small Cell Lung Cancer(1);Melanoma(1);Colorectal Cancer(2) |
| ARID1A | p.S735 N | c.2204G > A | 6 | 0 | Bladder Cancer(1);Hepatobiliary Cancer(1);Leukemia(1);Non-Small Cell Lung Cancer(1);Breast Cancer(1);Cervical Cancer(1) |
| ASXL1 | p.S1028 L | c.3083C > T | 6 | 1 | Thyroid Cancer(1);Non-Small Cell Lung Cancer(2);Melanoma(1);Small Cell Lung Cancer(1);Prostate Cancer(1) |
| ATRX | p.G1071R | c.3211G > A | 6 | 0 | Skin Cancer, Non-Melanoma(1);Non-Small Cell Lung Cancer(2);Breast Cancer(1);Colorectal Cancer(1);Myeloproliferative Neoplasm(1) |
| BRCA1 | p.E597K | c.1789G > A | 6 | 1 | Bladder Cancer(2);“Skin Cancer, Non-Melanoma”(1);Non-Small Cell Lung Cancer(1);Breast Cancer(1);Colorectal Cancer(1) |
| CARD11 | p.R377Q | c.1130G > A | 6 | 2 | Non-Hodgkin Lymphoma(1);Colorectal Cancer(1);Uterine Sarcoma(1);Ovarian Cancer(1);Small Bowel Cancer(1);“Skin Cancer, Non-Melanoma”(1) |
| CDKN2A | p.V106 V | c.318G > A | 6 | 2 | Melanoma(1);Non-Small Cell Lung Cancer(2);Breast Cancer(1);Colorectal Cancer(1);Gastrointestinal Stromal Tumor(1) |
| ERBB4 | p.E452K | c.1354G > A | 6 | 2 | Skin Cancer, Non-Melanoma(2);Melanoma(4) |
| ERCC3 | p.R742W | c.2224C > T | 6 | 1 | Glioma(1);Sex Cord Stromal Tumor(1);Renal Cell Carcinoma(1);Non-Small Cell Lung Cancer(1);Leukemia(1);Colorectal Cancer(1) |
| FBXW7 | p.R441W | c.1321C > T | 6 | 1 | Colorectal Cancer(3);Endometrial Cancer(1);Renal Cell Carcinoma(1);Esophagogastric Cancer(1) |
| IGF1R | p.R1246C | c.3736C > T | 6 | 1 | CNS Cancer(1);Endometrial Cancer(2);“Skin Cancer, Non-Melanoma”(1);Breast Cancer(1);Glioma(1) |
| IKZF1 | p.E304K | c.910G > A | 6 | 0 | Melanoma(6) |
| KMT2D | p.K4832 N | c.14496G > T | 6 | 0 | Ovarian Cancer(1);Breast Cancer(1);Colorectal Cancer(2);Head and Neck Cancer(1);Thymic Tumor(1) |
| MET | p.D1028H | c.3082G > C | 6 | 2 | Non-Small Cell Lung Cancer(6) |
| MET | p.D1028Y | c.3082G > T | 6 | 0 | Non-Small Cell Lung Cancer(6) |
| MYC | p.S161 L | c.482C > T | 6 | 0 | Ovarian Cancer(1);Endometrial Cancer(2);Colorectal Cancer(2);Cervical Cancer(1) |
| PDCD1 | p.T36 fs | c.104delC | 6 | 2 | Endometrial Cancer(2);Non-Small Cell Lung Cancer(1);Glioma(1);Colorectal Cancer(2) |
| PDCD1LG2 | p.P81S | c.241C > T | 6 | 0 | Colorectal Cancer(1);Head and Neck Cancer(1);Appendiceal Cancer(1);Mesothelioma(1);Leukemia(1);Pancreatic Cancer(1) |
| PIK3C2B | p.G1435R | c.4303G > C | 6 | 0 | Head and Neck Cancer(1);Ovarian Cancer(1);Sellar Tumor(1);Appendiceal Cancer(1);Non-Small Cell Lung Cancer(1);Breast Cancer(1) |
| PMS2 | p.K651R | c.1952A > G | 6 | 1 | Bladder Cancer(2);Uterine Sarcoma(1);Endometrial Cancer(1);Colorectal Cancer(2) |
| RAF1 | p.S259F | c.776C > T | 6 | 1 | Small Bowel Cancer(1);Bladder Cancer(1);Non-Small Cell Lung Cancer(1);Melanoma(3) |
| ROS1 | p.G1915R | c.5743G > A | 6 | 0 | Melanoma(1);“Skin Cancer, Non-Melanoma”(1);Non-Small Cell Lung Cancer(1);Breast Cancer(2);Cancer of Unknown Primary(1) |
| SMAD4 | p.D351N | c.1051G > A | 6 | 2 | Appendiceal Cancer(1);Bladder Cancer(1);Hepatobiliary Cancer(1);Breast Cancer(1);Colorectal Cancer(2) |
| SMAD4 | p.R361S | c.1081C > A | 6 | 2 | Pancreatic Cancer(1);Hepatobiliary Cancer(1);Colorectal Cancer(4) |
| SMAD4 | p.G419 W | c.1255G > T | 6 | 1 | Colorectal Cancer(6) |