| Literature DB >> 32628708 |
Hector Eduardo Sanchez-Ibarra1, Xianli Jiang2, Elena Yareli Gallegos-Gonzalez1, Adriana Carolina Cavazos-González1, Yenho Chen2, Faruck Morcos2, Hugo Alberto Barrera-Saldaña1.
Abstract
Mutations in KRAS, NRAS, and BRAF (RAS/BRAF) genes are the main predictive biomarkers for the response to anti-EGFR monoclonal antibodies (MAbs) targeted therapy in metastatic colorectal cancer (mCRC). This retrospective study aimed to report the mutational status prevalence of these genes, explore their possible associations with clinicopathological features, and build and validate a predictive model. To achieve these objectives, 500 mCRC Mexican patients were screened for clinically relevant mutations in RAS/BRAF genes. Fifty-two percent of these specimens harbored clinically relevant mutations in at least one screened gene. Among these, 86% had a mutation in KRAS, 7% in NRAS, 6% in BRAF, and 2% in both NRAS and BRAF. Only tumor location in the proximal colon exhibited a significant correlation with KRAS and BRAF mutational status (p-value = 0.0414 and 0.0065, respectively). Further t-SNE analyses were made to 191 specimens to reveal patterns among patients with clinical parameters and KRAS mutational status. Then, directed by the results from classical statistical tests and t-SNE analysis, neural network models utilized entity embeddings to learn patterns and build predictive models using a minimal number of trainable parameters. This study could be the first step in the prediction for RAS/BRAF mutational status from tumoral features and could lead the way to a more detailed and more diverse dataset that could benefit from machine learning methods.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32628708 PMCID: PMC7337295 DOI: 10.1371/journal.pone.0235490
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Mutational status of patients with mCRC by genes and country regions.
| Country region | Wildtype | Mutated | Wildtype | Mutated | Wildtype | Mutated |
|---|---|---|---|---|---|---|
| North | 150 (56%) | 118 (44%) | 254 (95%) | 14 (5%) | 198 (96%) | 8 (4%) |
| North Pacific Coast | 68 (60%) | 46 (40%) | 112 (98%) | 2 (2%) | 74 (93%) | 6 (7%) |
| Bajío | 25 (49%) | 26 (51%) | 50 (98%) | 1 (2%) | 35 (97%) | 1 (3%) |
| South | 19 (54%) | 16 (46%) | 34 (97%) | 1 (3%) | 23 (92%) | 2 (8%) |
| Central | 13 (41%) | 19 (59%) | 30 (94%) | 2 (6%) | 22 (96%) | 1 (4%) |
| Total | 275 (55%) | 225 (45%) | 480 (96%) | 20 (4%) | 352 (95%) | 18 (5%) |
Mutations by location and type in RAS/BRAF genes.
| Gene Location | No. Mutated Cases per Gene | ||
|---|---|---|---|
| Codon | |||
| 162 (72%) | 6 (30%) | N/A | |
| G12V | 40 (18%) | 0(0%) | N/A |
| G12D | 73 (32%) | 6 (30%) | N/A |
| G12C | 17 (8%) | 0(0%) | N/A |
| G12A | 17 (8%) | 0(0%) | N/A |
| G12S | 11 (5%) | 0(0%) | N/A |
| G12R | 2 (1%) | 0(0%) | N/A |
| G12V/D/A† | 1 (0.4%) | 0(0%) | N/A |
| G12C/S† | 1 (0.4%) | 0(0%) | N/A |
| 36 (16%) | 2 (10%) | N/A | |
| G13A | 36 (16%) | 0(0%) | N/A |
| G13D | 0(0%) | 1 (5%) | N/A |
| G13R/V† | 0(0%) | 1 (5%) | N/A |
| 2 (1%) | 0(0%) | N/A | |
| A59T/V† | 2 (1%) | 0(0%) | N/A |
| 10 (4%) | 11 (55%) | N/A | |
| Q61H | 2 (1%) | 1 (5%) | N/A |
| Q61K | 2 (1%) | 3 (15%) | N/A |
| Q61L/R† | 6 (3%) | 7 (35%) | N/A |
| 2 (1%) | 1 (5%) | N/A | |
| K117N/R/E† | 2 (1%) | 1 (5%) | N/A |
| 13 (6%) | 0(0%) | N/A | |
| A146P/T/V† | 13 (6%) | 0(0%) | N/A |
| N/A | N/A | 18 (100%) | |
| V600E/D† | N/A | N/A | 18 (100%) |
*Presence of one of these mutations, not concomitant. N/A = Not applicable.
Associations between genetic and clinicopathological features.
| Total | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Variable | Mutated (n = 225, %) | Wild-type (n = 275, %) | p value | Mutated (n = 20, %) | Wild-type (n = 480, %) | p value | Mutated (n = 18, %) | Wild-type (n = 352, %) | p value | (n = 500, %) |
| 57.4 | 56.1 | 0.9233 | 51.9 | 56.9 | 0.1629 | 63.4 | 56.6 | 0.0639 | 56.6 | |
| 0.4662 | 1 | 0.814 | ||||||||
| Female | 107 (47.6%) | 127 (46.2) | 9 (45%) | 225 (46.9%) | 8 (44.4%) | 168 (47.7%) | 234 (46.8%) | |||
| Male | 118 (52.4%) | 148 (53.8%) | 11 (55%) | 255 (53.1%) | 10 (55.6%) | 184 (52.3%) | 266 (53.2%) | |||
| 0.2298 | ||||||||||
| Proximal colon | 35 (29.2%) | 22 (15.1%) | 0 (0%) | 57 (22.4%) | 6 (46.2%) | 38 (20.1%) | 57 (21.4%) | |||
| Distal colon | 35 (29.2%) | 57 (39%) | 4 (36.4%) | 88 (34.5%) | 7 (53.8%) | 71 (37.6%) | 92 (34.6%) | |||
| Rectum | 50 (41.7%) | 67 (45.9%) | 7 (63.6%) | 110 (43.1%) | 0 (0%) | 80 (42.3%) | 117 (44%) | |||
| 0.1894 | 0.4984 | 0.9237 | ||||||||
| Adenocarcinoma | 102 (83.6%) | 140 (88.1%) | 14 (100%) | 228 (85.4%) | 9 (90%) | 189 (85.1%) | 242 (86.1%) | |||
| Mucinous carcinoma | 16 (13.1%) | 11 (6.9%) | 0 (0%) | 27 (10.1%) | 1 (10%) | 23 (10.4%) | 27 (9.6%) | |||
| Signet ring cell carcinoma | 1 (0.8%) | 5 (3.1%) | 0 (0%) | 6 (2.2%) | 0 (0%) | 5 (2.3%) | 6 (2.1%) | |||
| Others | 3 (2.5%) | 3 (1.9%) | 0 (0%) | 6 (2.2%) | 0 (0%) | 5 (2.3%) | 6 (2.1%) | |||
| 0.2758 | 0.7361 | 0.7684 | ||||||||
| Well | 4 (16.9%) | 11 (9.6%) | 2 (20%) | 23 (12.3%) | 2 (22.2%) | 22 (13.8%) | 25 (12.7%) | |||
| Moderate | 56 (67.5%) | 87 (76.3%) | 7 (70%) | 136 (72.7%) | 6 (66.7%) | 115 (71.9%) | 143 (72.6%) | |||
| Poor | 13 (15.7%) | 16 (14%) | 1 (10%) | 28 (15%) | 1 (11.1%) | 23 (14.4%) | 29 (14.7%) | |||
| 0.1538 | 0.1934 | 0.3535 | ||||||||
| 2 | 9 (7.1%) | 4 (2.4%) | 0 (0%) | 13 (4.6%) | 0 (0%) | 10 (4.6%) | 13 (4.4%) | |||
| 3 | 18 (14.2%) | 26 (15.7%) | 0 (0%) | 44 (15.7%) | 3 (27.3%) | 29 (13.3%) | 44 (15%) | |||
| 4 | 100 (78.7%) | 136 (81.9%) | 13 (100%) | 223 (79.6%) | 8 (72.7%) | 179 (82.1%) | 236 (80.6%) | |||
| 0.3794 | 0.9616 | 0.956 | ||||||||
| Liver | 22 (45.8%) | 31 (48.4%) | 2 (66.7%) | 51 (46.8%) | 1 (100%) | 44 (47.8%) | 53 (47.3%) | |||
| Lung | 6 (12.5%) | 5 (7.8%) | 0 (0%) | 11 (10.1%) | 0 (0%) | 8 (8.7%) | 11 (9.8%) | |||
| Liver and lung | 2 (4.2%) | 3 (4.7%) | 0 (0%) | 5 (4.6%) | 0 (0%) | 5 (5.4%) | 5 (4.5%) | |||
| Peritoneum | 0 (0%) | 5 (7.8%) | 0 (0%) | 5 (4.6%) | 0 (0%) | 4 (4.3%) | 5 (4.5%) | |||
| Lymph node | 16 (33.3%) | 16 (25%) | 1 (33.3%) | 31 (28.4%) | 0 (0%) | 26 (28.3%) | 32 (28.6%) | |||
| Ovary | 2 (4.2%) | 4 (6.3%) | 0 (0%) | 6 (5.5%) | 0 (0%) | 5 (5.4%) | 6 (5.4%) | |||
*Significance threshold at p <0.05.
Fig 1t-SNE clustering based on 3 clinical features (tumor site, histological grade, and city).
And blindly colored KRAS mutation. Five selected clusters were annotated. Each data point represents a patient with a specific color indicating the subgroup of a clinical feature.
Fig 2Performance of the neural network model by using continuous age a) ROC curve of the predictive model for KRAS mutation by using continuous age, histological subtype, histological grade, tumor site, and city.
Accuracy (b) and loss (c) curves for both training and validation datasets during model training converge to a more stable configuration.