| Literature DB >> 32980694 |
Keijiro Mizukami1, Yusuke Iwasaki1, Eiryo Kawakami2, Makoto Hirata3, Yoichiro Kamatani4, Koichi Matsuda4, Mikiko Endo1, Kokichi Sugano5, Teruhiko Yoshida6, Yoshinori Murakami7, Hidewaki Nakagawa8, Amanda B Spurdle9, Yukihide Momozawa10.
Abstract
BACKGROUND: National Comprehensive Cancer Network (NCCN) recently recommended germline genetic testing for all pancreatic cancer patients. However, the genes targeted by genetic testing and the feasibility of selecting patients likely to carry pathogenic variants have not been sufficiently verified. The purpose of this study was to genetically characterize Japanese patients and examine whether the current guideline is applicable in this population.Entities:
Keywords: ATM; BRCA; Machine learning; Pancreatic cancer; Pathogenic variants; Universal screening for patients
Mesh:
Substances:
Year: 2020 PMID: 32980694 PMCID: PMC7519363 DOI: 10.1016/j.ebiom.2020.103033
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Demographic and clinical characteristics of study participants.
| Characteristic | PC patients No. (%) | Controls No. (%) |
|---|---|---|
| No. of participants | 1009 (100) | 23,780 (100) |
| Mean age at entry ± SD, (range), y | 67.71 ± 9.77 (27 - 96) | 71.11 ± 7.25 (60 - 105) |
| Mean age at diagnosis ± SD, (range), y | 67.17 ± 9.92 (24 - 96) | – |
| Sex | ||
| Male | 626 (62.0) | 12,520 (52.6) |
| Female | 383 (38.0) | 11,260 (47.4) |
| Smoking history | ||
| Yes | 603 (59.8) | 11,155 (46.9) |
| No | 397 (39.3) | 12,523 (52.7) |
| Missing | 9 (0.9) | 102 (0.4) |
| Alcohol history | ||
| Yes | 555 (55.0) | 10,907 (45.9) |
| No | 443 (43.9) | 12,731 (53.5) |
| Missing | 11 (1.1) | 142 (0.6) |
| BMI at diagnosis ± SD, (range), kg/m2 | 20.85 ± 3.31 (12.2 - 37.9) | – |
| Personal history of cancer(s) other than PC | ||
| Yes | 152 (15.1) | 0 (0) |
| No | 857 (84.9) | 23,780 (100) |
| Family history of cancer(s) | ||
| Yes | 575 (57.0) | 0 (0) |
| No | 434 (43.0) | 23,780 (100) |
| Stage of cancer | ||
| 0 | 10 (1.0) | – |
| I | 42 (4.2) | – |
| II | 46 (4.6) | – |
| III | 92 (9.1) | – |
| IVa | 112 (11.1) | – |
| IVb | 88 (8.7) | – |
| Missing | 619 (61.3) | – |
Controls who were 60 years old or over and did not have personal nor family history of cancers were used.
P-value: 4.236 × 10−9 in sex, 2.762 × 10−16 in smoking history, and 5.045 × 10−9 in alcohol history. PC, pancreatic cancer; BMI, body mass index; SD, standard deviation.
Results of the gene-based association test using pathogenic variants.
| Gene | No. of | Case ( | Control ( | P-value | OR (95% CI) |
|---|---|---|---|---|---|
| 34 | 25 (2.49) | 41 (0.17) | 2.723 × 10−18 | 14.7 (8.5 - 24.9) | |
| 35 | 17 (1.69) | 38 (0.16) | 3.168 × 10−11 | 10.7 (5.7 - 19.5) | |
| 18 | 9 (0.90) | 16 (0.07) | 3.333 × 10−7 | 13.4 (5.2 - 32.2) | |
| 2 | 2 (0.20) | 2 (0.01) | 0.009 | 23.6 (1.7 - 324.7) | |
| 8 | 2 (0.20) | 6 (0.03) | 0.039 | 7.9 (0.8 - 44.1) | |
| 1 | 1 (0.10) | 0 (0) | 0.041 | Inf (0.6 - Inf) | |
| 13 | 3 (0.30) | 21 (0.09) | 0.072 | 3.4 (0.6 - 11.3) | |
| 9 | 2 (0.20) | 12 (0.05) | 0.109 | 3.9 (0.4 - 17.7) | |
| 4 | 1 (0.10) | 3 (0.01) | 0.153 | 7.9 (0.1 - 97.8) | |
| 5 | 1 (0.10) | 4 (0.02) | 0.187 | 5.9 (0.1 - 59.7) | |
| 8 | 1 (0.10) | 8 (0.03) | 0.312 | 3.0 (0.1 - 22) | |
| 8 | 1 (0.10) | 8 (0.03) | 0.312 | 3.0 (0.1 - 22) | |
| 5 | 1 (0.10) | 8 (0.03) | 0.312 | 3.0 (0.1 - 22) | |
| 8 | 1 (0.10) | 9 (0.04) | 0.340 | 2.6 (0.1 - 19) | |
| 9 | 0 (0) | 0 (0) | 1 | 0 (0 - Inf) | |
| 7 | 0 (0) | 8 (0.03) | 1 | 0 (0 - 13.8) | |
| 7 | 0 (0) | 7 (0.03) | 1 | 0 (0 - 16.4) | |
| 6 | 0 (0) | 7 (0.03) | 1 | 0 (0 - 16.4) | |
| 5 | 0 (0) | 10 (0.04) | 1 | 0 (0 - 10.5) | |
| 4 | 0 (0) | 4 (0.02) | 1 | 0 (0 - 35.8) | |
| 3 | 0 (0) | 6 (0.03) | 1 | 0 (0 - 20) | |
| 2 | 0 (0) | 2 (0.01) | 1 | 0 (0 - 125.7) | |
| 2 | 0 (0) | 2 (0.01) | 1 | 0 (0 - 125.7) | |
| 1 | 0 (0) | 1 (<0.01) | 1 | 0 (0 - 906.9) | |
| 1 | 0 (0) | 2 (0.01) | 1 | 0 (0 - 125.7) | |
| 0 | 0 (0) | 0 (0) | 1 | 0 (0 - Inf) | |
| 0 | 0 (0) | 0 (0) | 1 | 0 (0 - Inf) | |
| Sum | 205 | 67 (6.67) | 225 (0.95) | 1.525 × 10−31 | 7.5 (5.5 - 9.9) |
Fisher's exact test under a recessive model for MUTYH and dominant model for the other genes.
Genes described in the NCCN guideline. OR, odds ratio; CI, confidence interval.
Fig 1Location and number of frequent pathogenic variants in three genes associated with pancreatic cancer. Locations of frequent pathogenic variants found in patients and of Pfam domains downloaded from the UCSC genome browser table function in proteins are shown with lollipop structures, with the variant type indicated by color. Pink and yellow circles indicate variants of loss of function and nonsynonymous, respectively. The x-axis reflects the number of amino acid residues. ANAPC5 = anaphase-promoting complex subunit 5; BRCT = BRCA1 C terminus; FAT = FRAP, ATM and TRRAP; FATC = FRAP, ATM, TRRAP C-terminal; PI3_PI4_kinase = phosphatidylinositol 3- and 4-kinase; TAN = telomere-length maintenance and DNA damage repair; zf-C3HC4 = zinc finger, C3HC4 type; zf-RING=zing finger, RING type.
Fig 2Kaplan-Meier analysis of overall survival for pancreatic cancer patients with and without pathogenic variants in three genes (BRCA1, ATM, and BRCA2).
Fig 3Predictive performance of machine learning for identifying patients with pathogenic variants. ROC curves derived from six supervised machine learning classifiers and multiple logistic regression classifiers for segregating cancer cases with germline pathogenic variants in four genes (BRCA1, BRCA2, TP53, and PALB2) in breast cancer (A), and in three genes (BRCA1, ATM and BRCA2) in pancreatic cancer (B). ROC curves using random forest and first ten sets from 100 reduced breast cancer data (C). The sensitivity and specificity (Supplementary Table 10) of predicting the carrier status of breast cancer patients based on the NCCN guideline criteria for germline genetic testing (Supplementary Table 9) are indicated as follows: ×, prediction based on criteria described as "Testing is clinically indicated" in the NCCN guideline; ▲, prediction based on criteria described as "Testing may be considered" as well as "Testing is clinically indicated". rf, random forest; cforest, conditional random forest; gbm, gradient boosting machine; nb, naive bayes; nnet, neural network; svmRadial, support vector machine; glm, multiple logistic regression.