| Literature DB >> 29462916 |
S Pamela K Shiao1, James Grayson2, Chong Ho Yu3, Brandi Wasek4, Teodoro Bottiglieri5.
Abstract
For the personalization of polygenic/omics-based health care, the purpose of this study was to examine the gene-environment interactions and predictors of colorectal cancer (CRC) by including five key genes in the one-carbon metabolism pathways. In this proof-of-concept study, we included a total of 54 families and 108 participants, 54 CRC cases and 54 matched family friends representing four major racial ethnic groups in southern California (White, Asian, Hispanics, and Black). We used three phases of data analytics, including exploratory, family-based analyses adjusting for the dependence within the family for sharing genetic heritage, the ensemble method, and generalized regression models for predictive modeling with a machine learning validation procedure to validate the results for enhanced prediction and reproducibility. The results revealed that despite the family members sharing genetic heritage, the CRC group had greater combined gene polymorphism rates than the family controls (p < 0.05), on MTHFR C677T, MTR A2756G, MTRR A66G, and DHFR 19 bp except MTHFR A1298C. Four racial groups presented different polymorphism rates for four genes (all p < 0.05) except MTHFR A1298C. Following the ensemble method, the most influential factors were identified, and the best predictive models were generated by using the generalized regression models, with Akaike's information criterion and leave-one-out cross validation methods. Body mass index (BMI) and gender were consistent predictors of CRC for both models when individual genes versus total polymorphism counts were used, and alcohol use was interactive with BMI status. Body mass index status was also interactive with both gender and MTHFR C677T gene polymorphism, and the exposure to environmental pollutants was an additional predictor. These results point to the important roles of environmental and modifiable factors in relation to gene-environment interactions in the prevention of CRC.Entities:
Keywords: colorectal cancer; gene–environment interaction; multi-ethnic groups; predictor
Year: 2018 PMID: 29462916 PMCID: PMC5872084 DOI: 10.3390/jpm8010010
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Comparison of demographic/environmental factors between family control and cancer groups.
| Factors | Control | Cancer | ||
|---|---|---|---|---|
| Gender | Male | 14 (25.9%) | 25 (46.3%) | |
| Marital Status | Married | 33 (61.1%) | 35 (64.8%) | 0.1739 |
| Health Status | Good/Excellent | 40 (74.1%) | 39 (72.2%) | 0.6878 |
| Age | Years (mean ± SD) (range) | 47.04 ± 17.16 | 60.98 ± 10.86 | |
| Body Mass Index | Lean (<20) | 2 (3.7%) | 2 (3.7%) | 0.8082 |
| Vegetable Intake/Day | ≥3 servings | 15 (27.8%) | 12 (22.2%) | 0.6779 |
| Fruit Intake/Day | ≥2 servings | 27 (50.0%) | 24 (44.4%) | 0.7345 |
| Whole Grain Intake/Day | ≥3 servings | 8 (14.8%) | 6 (11.1%) | 0.7821 |
| Liquid Intake/Day | ≥8 cups | 16 (29.6%) | 15 (27.8%) | 0.9645 |
| Sleepy Days/Week | 0 days | 10 (19.6%) | 7 (13.0%) | 0.7355 |
| Physical Activity | Minutes mean ± SD | 48.1 ± 53.9 | 37.4 ± 41.8 | 0.2515 |
| Tobacco Use | Yes | 5 (9.3%) | 4 (7.4%) | 0.7277 |
| Alcohol Use | Yes | 24 (44.4%) | 32 (59.3%) | 0.1478 |
| Stress (0–10) | <5 | 32 (59.3%) | 31 (57.4%) | 0.6671 |
| Nervous or Anxious | Not at all | 26 (48.1%) | 25 (46.3%) | 0.9971 |
| Depressed | Not at all | 36 (66.7%) | 34 (63.0%) | 0.3581 |
| Cognitive Capacity | Good/Excellent | 46 (85.2%) | 45 (83.3%) | 0.7418 |
| Functional Capacity | Good/Excellent | 49 (90.7%) | 45 (83.3%) | 0.7027 |
| Role Functions | Good/Excellent | 49 (90.7%) | 44 (81.5%) | 0.4913 |
| Spiritual Support | Good/Excellent | 39 (72.2%) | 45 (83.3%) | 0.2074 |
| Convenience to Healthcare | Good/Excellent | 50 (92.6%) | 52 (96.3%) | 0.2293 |
| Health Insurance Coverage | Good/Excellent | 44 (81.5%) | 47 (87.0%) | 0.1330 |
| Air Quality in Community | Good/Excellent | 34 (63.0%) | 29 (53.7%) | 0.7790 |
| Air Quality at Home | Good/Excellent | 12 (22.2%) | 13 (24.1%) | 0.6859 |
| Tobacco Use by Family Members | Yes | 5 (9.3%) | 6 (11.1%) | 0.7005 |
| Exposure to Pollutants | Yes | 5 (9.3%) | 14 (25.9%) | |
| Race | White | 16 (29.6%) | 18 (33.3%) | 0.8842 |
The statistically significant values have been highlighted in red. SD: Standard deviation.
Comparison of demographic, lifestyle, and environmental factors across racial groups.
| Factors | White | Asian | Hispanic | African | ||
|---|---|---|---|---|---|---|
| Gender | Male | 15 (44.1%) | 13 (31.0%) | 8 (34.8%) | 3 (33.3%) | 0.6876 |
| Marital Status | Married | 24 (70.6%) | 31 (73.8%) | 9 (39.1%) | 4 (44.4%) | 0.0658 |
| Health Status | Good/Excellent | 27 (79.4%) | 30 (71.4%) | 16 (69.6%) | 6 (66.7%) | 0.3674 |
| Age | Years (mean ± SD) (range) | 57.53 ± 2.73 | 53.55 ± 2.45 | 49.78 ± 3.31 | 53.67 ± 5.30 | 0.3478 |
| Body Mass Index | Lean (< 20) | 1 (2.9%) | 3 (7.1%) | 0 (0%) | 0 (0%) | |
| Vegetable Intake/Day | ≥3 servings | 11 (32.4%) | 13 (31.0%) | 2 (8.7%) | 1 (11.1%) | 0.1414 |
| Fruit Intake/Day | ≥2 servings | 15 (44.1%) | 22 (52.4%) | 10 (43.5%) | 4 (44.4%) | 0.3406 |
| Whole Grain Intake/Day | ≥3 servings | 7 (20.6%) | 4 (9.5%) | 2 (8.7%) | 1 (11.1%) | 0.3985 |
| Liquid Intake/Day | ≥8 cups | 10 (29.4%) | 12 (28.6%) | 7 (30.4%) | 2 (22.2%) | 0.4805 |
| Sleepy Days/Week | 0 days | 7 (20.6%) | 7 (16.7%) | 2 (8.7%) | 1 (11.1%) | 0.8448 |
| Physical Activity/Week | mean ± SD | 39.3±35.2 | 43.9±54.5 | 54.1±59.2 | 21.7±17.7 | 0.1223 |
| Tobacco Use | Yes | 1 (2.9%) | 4 (9.5%) | 3 (13.0%) | 1 (11.1%) | 0.5457 |
| Alcohol Use | Yes | 27 (79.4%) | 13 (31.0%) | 14 (33.3%) | 2 (22.2%) | |
| Stress (0–10) | <5 | 15 (44.1%) | 28 (66.7%) | 14 (33.3%) | 6 (66.7%) | 0.1253 |
| Nervous or Anxious | Not at all | 14 (41.2%) | 20 (47.6%) | 11 (47.8%) | 6 (66.7%) | 0.4130 |
| Depressed | Not at all | 22 (64.7%) | 28 (66.7%) | 13 (56.5%) | 7 (77.8%) | 0.8608 |
| Cognitive Capacity | Good/Excellent | 31 (91.2%) | 34 (81.0%) | 19 (82.6%) | 7 (77.8%) | 0.3889 |
| Functional Capacity | Good/Excellent | 27 (79.4%) | 40 (95.2%) | 19 (82.6%) | 8 (88.9%) | 0.3398 |
| Role Functions | Good/Excellent | 31 (91.2%) | 36 (85.7%) | 18 (78.3%) | 8 (88.9%) | 0.4095 |
| Spiritual Support | Good/Excellent | 27 (79.4%) | 34 (81.0%) | 17 (73.9%) | 6 (66.7%) | 0.5334 |
| Convenience to Healthcare | Good/Excellent | 34 (100%) | 37 (88.1%) | 23 (100%) | 8 (88.9%) | 0.2321 |
| Health Insurance Coverage | Good/Excellent | 32 (94.1%) | 30 (71.4%) | 22 (95.7%) | 7 (77.8%) | 0.1175 |
| Air Quality in Community | Good/Excellent | 17 (50.0%) | 28 (66.7%) | 11 (47.8%) | 7 (77.8%) | 0.4525 |
| Air Quality at Home | Good/Excellent | 7 (20.6%) | 10 (23.8%) | 6 (26.1%) | 2 (22.2%) | 0.2545 |
| Tobacco Use in Family | Yes | 2 (5.9%) | 4 (9.5%) | 4 (17.4%) | 1 (11.1%) | 0.5316 |
| Exposure to Pollution | Yes | 7 (20.6%) | 5 (11.9%) | 4 (17.4%) | 3 (33.3%) | 0.4659 |
The statistically significant values have been highlighted in red.
Comparison of gene polymorphisms between family control and cancer groups.
| Genes | Control | Cancer | ||
|---|---|---|---|---|
| 0 (CC) | 28 (51.9%) | 23 (42.6%) | 0.6285 | |
| 0 (AA) | 32 (59.2%) | 34 (63.0%) | 0.8212 | |
| 0 (AA) | 39 (72.2%) | 36 (66.7%) | 0.3712 | |
| 0 (AA) | 28 (52.4%) | 19 (35.6%) | 0.1842 | |
| Del/Del | 20 (37.0%) | 13 (24.1%) | 0.2188 | |
| Total Polymorphism (0–10) | ≥4 | 16 (29.6%) | 27 (50.0%) |
The statistically significant values have been highlighted in red. Ins: Insertion; Del: Deletion.
Comparison of gene polymorphisms across racial groups.
| Genes | White | Asian | Hispanic | African | ||
|---|---|---|---|---|---|---|
| 0 (CC) | 3 (38.2%) | 21 (50.0%) | 8 (34.8%) | 9 (100.0%) | ||
| 0 (AA) | 18 (52.9%) | 29 (69.1%) | 10 (43.5%) | 9 (100.0%) | ||
| MTHFR Deficiency | 0% | 2 (5.9%) | 11 (26.2%) | 0 (0.0%) | 9 (100.0%) | |
| ≥50% | 10 (29.4%) | 7 (16.7%) | 7 (30.4%) | 0 (0%) | 0.1553 | |
| 0 (AA) | 20 (58.8%) | 30 (71.4%) | 21 (91.3%) | 4 (44.4%) | ||
| 0 (AA) | 7 (20.6%) | 20 (48.8%) | 17 (73.9%) | 3 (33.3%) | ||
| Del/Del | 3 (8.8%) | 24 (57.1%) | 4 (17.4%) | 2 (22.2%) | ||
| Total Polymorphism (0–10) | ≥4 | 16 (47.1%) | 21 (50.0%) | 5 (21.7%) | 1 (11.1%) |
Distribution of gene polymorphisms per control and cancer groups across racial groups.
| Control Group | Cancer Group | |||||||
|---|---|---|---|---|---|---|---|---|
| Genotypes | 0 | 1 | 2 | Population Allele Frequency | 0 | 1 | 2 | |
| CC | CT | TT | % C/T | CC | CT | TT | ||
| Total | 28 (51.9) | 21 (44.4) | 5 (9.3) | NS | 75/25 | 23 (42.6) | 25 (46.3) | 6 (11.1) |
| White | 8 (50.0) | 7 (43.8) | 1 (6.2) | NS | 53/47 | 5 (27.8) | 9 (50) | 4 (22.2) |
| Asian | 12 (52.2) | 8 (34.8) | 3 (13.0) | NS | 70/30 | 9 (47.4) | 9 (47.4) | 1 (5.2) |
| Hispanic | 4 (36.4) | 6 (54.5) | 1 (9.1) | NS | 55/45 | 4 (33.3) | 7 (58.3) | 1 (8.3) |
| Black | 4 (100) | 0 (0) | 0 (0) | - | 91/9 | 5 (100) | 0 (0) | 0 (0) |
| AA | AC | CC | % A/C | AA | AC | CC | ||
| Total | 32 (59.2) | 15 (27.8) | 7 (13) | 0.0314 | 75/25 | 34 (63) | 15 (27.8) | 5 (9.3) |
| White | 7 (43.8) | 6 (37.5) | 3 (18.8) | NS | 85/15 | 11 (61.1) | 6 (33.3) | 1 (5.6) |
| Asian | 16 (69.6) | 5 (21,7) | 2 (8.7) | NS | 78/22 | 13 (68.4) | 4 (21.1) | 2 (10.5) |
| Hispanic | 5 (45.4) | 4 (36.4) | 2 (18.2) | NS | 84/16 | 5 (41.7) | 5 (41.7) | 2 (16.7) |
| Black | 4 (100) | 0 (0) | 0 (0) | - | 85/15 | 5 (100) | 0 (0) | 0 (0) |
| AA | AG | GG | % A/G | AA | AG | GG | ||
| Total | 39 (72.2) | 12 (22.2) | 3 (5.6) | NS | 36 (66.7) | 17 (31.5) | 1 (1.8) | |
| White | 10 (62.5) | 4 (25.0) | 2 (12.5) | NS | 84/16 | 10 (55.5) | 7 (38.9) | 1 (5.5) |
| Asian | 17 (73.9) | 5 (21.7) | 1 (4.3) | NS | 65–91/9–35 | 13 (68.4) | 6 (31.6) | 0 (0) |
| Hispanic | 11 (100) | 0 (0) | 0 (0) | - | 19/81 | 10 (83.3) | 2 (16.7) | 0 (0) |
| Black | 1 (25) | 3 (75) | 0 (0) | NS | 30–37/63–70 | 3 (60.0) | 2 (40.0) | 0 (0) |
| AA | AG | GG | % A/G | AA | AG | GG | ||
| Total | 28 (52.6) | 18 (33.4) | 7 (13) | NS | 64/36 | 19 (35.6) | 25 (46.8) | 10 (18.5) |
| White | 3 (18.8) | 6 (37.5) | 7 (43.8) | NS | 45/55 | 4 (22.2) | 9 (50.0) | 5 (27.8) |
| Asian | 14 (63.6) | 8 (36.4) | 0 (0) | NS | 74/26 | 6 (31.6) | 10 (52.6) | 3 (15.8) |
| Hispanic | 10 (90.9) | 1 (9.1) | 0 (0) | NS | 72/28 | 7 (58.3) | 3 (25.0) | 2 (16.7) |
| Black | 1 (25.0) | 3 (75.0) | 0 (0) | NS | 73/27 | 2 (40.0) | 3 (60.0) | 0 (0) |
| II | ID | DD | % I/D | II | ID | DD | ||
| Total | 20 (37) | 17 (31.5) | 17 (31.5) | 0.0068 | 50/50 | 13 (24.1) | 25 (46.3) | 16 (29.6) |
| White | 2 (12.5) | 6 (37.5) | 8 (50.0) | NS | 45–47/53–55 | 1 (5.6) | 11 (61.1) | 6 (33.3) |
| Asian | 15 (65.2) | 6 (26.1) | 2 (8.7) | NS | 63/37 | 9 (47.4) | 7 (36.8) | 3 (15.8) |
| Hispanic | 2 (18.2) | 4 (36.4) | 5 (45.4) | NS | 58/42 | 2 (16.7) | 3 (25.0) | 7 (58.3) |
| Black | 1 (25.0) | 1 (25.0) | 2 (50.0) | NS | 55/45 | 1 (20.0) | 4 (80.0) | 0 (0) |
HWE: Hardy–Weinberg equilibrium; - not available; NS: Not significant; HWE calculator: http://www.koonec.com/k-blog/2010/06/20/hardy-weinberg-equilibrium-calculator/; http://useast.ensembl.org/index.html; https://www.cdc.gov/genomics/population/genvar/frequencies/mthfr.htm.
Genetic predictors of cancer.
| Term | Number of Splits | Column Contribution | Portion | |
|---|---|---|---|---|
| 46 | 1.09506968 | 0.3082 | ||
| MTHFR Deficiency | 47 | 0.82548898 | 0.2324 | |
| 43 | 0.48910685 | 0.1377 | ||
| 46 | 0.4855324 | 0.1367 | ||
| 42 | 0.41353505 | 0.1164 | ||
| 33 | 0.24403481 | 0.0687 |
Demographic/environmental predictors of cancer.
| Term | Number of Splits | Column Contribution | Portion | |
|---|---|---|---|---|
| Body mass index | 10 | 6.78930886 | 0.3058 | |
| Marital Status | 7 | 4.42559099 | 0.1993 | |
| Race | 7 | 3.76884353 | 0.1698 | |
| Exposure to Pollutants | 3 | 2.12649039 | 0.0958 | |
| Gender | 5 | 1.81587428 | 0.0818 | |
| Insurance Coverage | 3 | 1.17074973 | 0.0527 | |
| Air Quality in the Community | 5 | 1.06350409 | 0.0479 | |
| Convenience of HealthcareAccess | 3 | 0.52529395 | 0.0237 | |
| Air Quality in the Home | 3 | 0.29975415 | 0.0135 | |
| Tobacco Smoker in the Home | 2 | 0.21686301 | 0.0098 |
Lifestyle predictors of cancer.
| Term | Number of Splits | Column Contribution | Portion | |
|---|---|---|---|---|
| Stress | 27 | 3.43552989 | 0.1093 | |
| Physical Activity | 30 | 3.37660068 | 0.1074 | |
| Times Using Alcohol | 31 | 3.13235692 | 0.0996 | |
| Spiritual Support | 25 | 2.91976087 | 0.0929 | |
| Sleepiness | 28 | 2.87042298 | 0.0913 | |
| Functional Role | 22 | 2.53679611 | 0.0807 | |
| Whole Grain Dietary Intake | 17 | 1.92470816 | 0.0612 | |
| Functional Capacity | 16 | 1.81050686 | 0.0576 | |
| Fruits Intake | 20 | 1.52985178 | 0.0487 | |
| Vegetables Intake | 20 | 1.51937688 | 0.0483 | |
| Cognitive Capacity | 16 | 1.36910873 | 0.0435 | |
| Depression | 13 | 1.32859637 | 0.0423 | |
| Health Status Overall | 11 | 1.25492503 | 0.0399 | |
| Nervous and Anxious | 12 | 1.17885473 | 0.0375 | |
| Total Liquid Intake | 17 | 0.80560452 | 0.0256 | |
| Tobacco Smoking | 8 | 0.45134732 | 0.0144 |
Model comparisons between bootstrap forest and logistic regression.
| Misclassification Rates | ||
|---|---|---|
| Factors | Bootstrap Forest | Logistic Regression |
| Demographic–Environmental | 0.1942 | 0.2353 |
| Genetic | 0.2019 | 0.3137 |
| Health Metrics/Lifestyle | 0.1584 | 0.2475 |
All predictors of cancer for gene–environment interactions.
| Term | Number of Splits | Column Contribution | Portion | |
|---|---|---|---|---|
| Body mass index | 73 | 2.34801946 | 0.1604 | |
| Physical Activity | 67 | 1.83265224 | 0.1252 | |
| Sleepiness | 74 | 1.78325631 | 0.1218 | |
| Spiritual Support | 63 | 1.75806876 | 0.1201 | |
| MTHFR Deficiency | 76 | 1.66137349 | 0.1135 | |
| Times Using Alcohol | 63 | 1.46035411 | 0.0998 | |
| Functional Role | 65 | 1.3622703 | 0.0931 | |
| Stress | 63 | 1.32282568 | 0.0904 | |
| 58 | 1.10742696 | 0.0757 |
Predictors of cancer for Asians.
| Term | Number of Splits | Column Contribution | Portion | |
|---|---|---|---|---|
| Sleepiness | 44 | 1.44019311 | 0.2209 | |
| Stress | 35 | 1.32619458 | 0.2034 | |
| 32 | 1.02397504 | 0.1570 | ||
| Physical Activity, Minutes/Week | 38 | 0.9726631 | 0.1492 | |
| Body mass index | 25 | 0.643443 | 0.0987 | |
| MTHFR Deficiency | 28 | 0.46681593 | 0.0716 | |
| Spiritual Support | 21 | 0.29696771 | 0.0455 | |
| Times Using Alcohol | 22 | 0.19012811 | 0.0292 | |
| Functions in Roles | 21 | 0.16033891 | 0.0246 |
Predictors of cancer for Hispanics.
| Term | Number of Splits | Column Contribution | Portion | |
|---|---|---|---|---|
| Spiritual Support | 41 | 2.51879811 | 0.4094 | |
| Body mass index | 18 | 0.82973589 | 0.1349 | |
| Stress | 23 | 0.60238034 | 0.0979 | |
| Functions in Roles | 20 | 0.54764955 | 0.0890 | |
| Times Using Alcohol | 24 | 0.48150149 | 0.0783 | |
| Sleepiness | 24 | 0.4798417 | 0.0780 | |
| Physical Activity, Minutes/Week | 18 | 0.30328675 | 0.0493 | |
| MTHFR Deficiency | 15 | 0.22482419 | 0.0365 | |
| 8 | 0.16489531 | 0.0268 |
Predictors of cancer for Whites.
| Term | Number of Splits | Column Contribution | Portion | |
|---|---|---|---|---|
| Physical Activity, Minutes/Week | 44 | 1.64218014 | 0.1916 | |
| BMI | 40 | 1.46162302 | 0.1705 | |
| Times Using Alcohol | 25 | 1.29832268 | 0.1514 | |
| Functions in Roles | 35 | 1.15435558 | 0.1346 | |
| MTHFR Deficiency | 29 | 1.11010973 | 0.1295 | |
| Sleepiness | 34 | 0.73467524 | 0.0857 | |
| Stress | 21 | 0.61669827 | 0.0719 | |
| 19 | 0.32183586 | 0.0375 | ||
| Spiritual Support | 14 | 0.23328255 | 0.0272 |
Baseline logistic regression model and generalized regression elastic net models on the predictors of colorectal cancer from gene–environment interactions (of total gene polymorphisms).
| Logistic Regression Original Model with Validation | Generalized Regression Elastic Net Model | |||||
|---|---|---|---|---|---|---|
| With AICc Validation | With Leave-One-Out Validation | |||||
| Parameters | Estimate | Estimate | Estimate | |||
| (Intercept) | −0.2875 | 0.6144 | 0.3218 | 0.4096 | 0.3486 | 0.3785 |
| Gender (Male/Female) | 1.5023 | 0.0119 | 1.2972 | 0.0074 | 1.4286 | 0.0018 |
| BMI * Alcohol Use, Interaction | −2.2790 | 0.0367 | −1.9512 | 0.0146 | −1.2376 | 0.0062 |
| Total Polymorphisms | −0.7185 | 0.1865 | −1.1444 | 0.0125 | −2.1202 | 0.0063 |
| BMI | 1.3637 | 0.0602 | 0.7541 | 0.1993 | 0.8991 | 0.1036 |
| Alcohol Use | 0.5468 | 0.4038 | 0 | 1.000 | 0 | 1.000 |
| Misclassification Rate | 0.3714 | - | 0.2963 | - | 0.2804 | - |
| AICc | 56.98 | - | 138.81 | - | - | - |
| AUC | 0.7817 | - | 0.7531 | - | 0.7652 | - |
* Denotes Interaction; - not available; AICc: Akaike’s information criterion with corrections; AUC: Area under the curve.
Figure 1Receiver operating characteristic (ROC) curve and AUC for the baseline logistic regression model (left panel), elastic net with Akaike’s information criteria with correction validation model (middle), and leave-one-out validation model (right panel) on the predictors of colorectal cancer from gene –environment interactions (of total gene polymorphisms).
Baseline logistic regression model and generalized regression elastic net models on the predictors of colorectal cancer from gene –environment interactions (of single genes).
| Logistic Regression Original Model With Validation | Generalized Regression Elastic Net Model | |||||
|---|---|---|---|---|---|---|
| Elastic Net Model With AICc Validation | With Leave-One-Out Cross Validation | |||||
| Parameters | Estimate | Estimate | Estimate | |||
| (Intercept) | 0.5768 | 0.5445 | 1.2292 | 0.0498 | 1.3171 | 0.0487 |
| Gender (Male/Female) | 3.1964 | 0.3465 | 1.4525 | 0.0049 | 1.8934 | 0.0006 |
| Gender (Male/Female) * BMI | −4.2655 | 0.0039 | −1.9736 | 0.0219 | −-2.5539 | 0.0042 |
| −2.3824 | 0.0345 | −0.9065 | 0.0523 | −1.1847 | 0.0174 | |
| 2.2401 | 0.1157 | 1.2404 | 0.0667 | −1.5750 | 0.0253 | |
| Exposure to Pollution | −0.8194 | 0.2853 | −1.2110 | 0.0368 | −1.2466 | 0.0458 |
| −0.8694 | 0.1426 | −0.6792 | 0.0975 | 1.3172 | 0.0800 | |
| BMI | 0.8029 | 0.3465 | 0 | 1.000 | 0 | 1.000 |
| Misclassification Rate | 0.4103 | - | 0.3241 | - | 0.3396 | - |
| AICc | 85.24 | - | 140.69 | - | - | - |
| AUC | 0.5842 | - | 0.7536 | - | 0.7639 | - |
* Denotes Interaction; - not available; AICc: Akaike’s information criterion with corrections; AUC: Area under the curve.
Figure 2Receiver operating characteristic curve and AUC for baseline logistic regression model (left panel), elastic net with Akaike’s information criteria with correction validation model (middle), and leave-one-out validation model (right panel) on the predictors of colorectal cancer –environment interactions (of single genes).