| Literature DB >> 34692549 |
Su Yon Jung1, Eric M Sobel2,3, Matteo Pellegrini4, Herbert Yu5, Jeanette C Papp2.
Abstract
BACKGROUND: Disparities in cancer genomic science exist among racial/ethnic minorities. Particularly, African American (AA) and Hispanic/Latino American (HA) women, the 2 largest minorities, are underrepresented in genetic/genome-wide studies for cancers and their risk factors. We conducted on AA and HA postmenopausal women a genomic study for insulin resistance (IR), the main biologic mechanism underlying colorectal cancer (CRC) carcinogenesis owing to obesity.Entities:
Keywords: African and Hispanic/Latino American women; attributable risk; colorectal cancer; endogenous estrogen; glucose homeostasis; polyunsaturated fatty acid; random survival forest; smoking
Year: 2021 PMID: 34692549 PMCID: PMC8529283 DOI: 10.3389/fonc.2021.760243
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Random survival forest comparing rankings between minimal depth and variable of importance (VIMP). (A) African American women. (Note: The 1 genetic marker within the blue oval was identified as the topmost influential predictor. (B) African American women. (MFA, monounsaturated fatty acid; PFA, polyunsaturated fatty acid; SFA, saturated fatty acid. Note: The 6 lifestyle variables within the orange oval were identified as the topmost influential predictors. (C) Hispanic American women. (Note: The 5 genetic markers within the blue oval were identified as the topmost influential predictors. (D) Hispanic American women. (MFA, monounsaturated fatty acid; PFA, polyunsaturated fatty acid; SFA, saturated fatty acid. Note: The 6 lifestyle variables within the orange oval were identified as the topmost influential predictors).
Predictive measures C-index and AUC of the topmost genetic and lifestyle factors in association with colorectal cancer risk.
| African American women | Hispanic American women | |||||
|---|---|---|---|---|---|---|
| Type of variable | Topmost influential variables* | C-index | AUC (95% CI) | Topmost influential variables* | C-index | AUC (95% CI) |
|
|
| 0.4715 | 0.561 (0.491 – 0.631) |
| 0.7064 | 0.798 (0.688 – 0.907) |
|
| 0.8175 | |||||
|
| 0.8048 | |||||
|
| Years as a regular smoker | 0.5023 | 0.627 (0.566 – 0.689) | % calories from MFA/day | 0.5979 | 0.675 (0.526 – 0.823) |
| % calories from PFA/day | 0.5356 | Number of cigarettes/day | 0.5245 | |||
| Age at menopause | 0.5486 | Age at menopause | 0.5655 | |||
| Age at enrollment | 0.6014 | % calories from SFA/day | 0.5836 | |||
| Duration of OC use | 0.6223 | % calories from PFA/day | 0.5896 | |||
| Dietary total sugars | 0.6301 | Dietary vitamin K | 0.5721 | |||
|
| 1 SNP + | 0.647 (0.586 – 0.708) | 3 SNPs + | 0.830 (0.721 – 0.939) | ||
AUC, area under the receiver operating characteristic curve; CI, confidence interval; C-index, concordance index; MFA, monounsaturated fatty acid; OC, oral contraceptive; PFA, polyunsaturated fatty acid; SFA, saturated fatty acid; SNP, single-nucleotide polymorphism.
*Topmost predictive variables were selected on the basis of random survival forest analysis with a multimodal approach.
Figure 2Out-of-bag concordance index (C-index) and area under the receiver operating characteristic curve (AUC) for the topmost genetic and lifestyle factors (MFA, monounsaturated fatty acid; PFA, polyunsaturated fatty acid; SFA, saturated fatty acid SNP; single-nucleotide polymorphism (A) African American women. (a) C-index of SNPs (Improvement in c-index was observed when the top variable [•] was added to the model, whereas other variables [○] did not further improve the accuracy of prediction. (b) AUC of the top SNP (c) C-index of lifestyles (Improvement in c-index was observed when the top 6 variables [•] were added to the model, whereas other variables [○] did not further improve the accuracy of prediction.) (d) AUC of the top 6 lifesytles (e) AUC of the top 1 SNP and 6 lifestyles (B). Hispanic American women. (a) C-index of SNPs (Improvement in c-index was observed when the top 3 variable [•] was added to the model, whereas other variables [○] did not further improve the accuracy of prediction.) (b) AUC of the top 3 SNPs (c). C-index of lifestyles (Improvement in c-index was observed when the top 6 variables [•] were added to the model, whereas other variables [○] did not further improve the accuracy of prediction.) (d) AUC of the top 6 lifestyles (e) AUC of the top 3 SNP and 6 lifestyles.
Figure 3Cumulative incidence rate of colorectal cancer for the topmost predictive genetic and lifestyle variables selected from a random survival forest analysis (Dashed red lines indicate 95% confidence intervals. (A) African American women: 1 genetic and 6 lifestyle factors. (B) Hispanic American women: 3 genetic and 6 lifestyle factors.
African American women: combined effect of risk genotypes and risk lifestyles on colorectal cancer risk and population-attributable risk percentage.
| Number of risks | n | HR (95% CI) |
| PAR (%)† |
|---|---|---|---|---|
|
| ||||
| 0 | 2,756 | reference | 22.9 | |
| 1 | 1,936 |
|
| |
|
| ||||
| 0 | 3,097 | reference | 33.6 | |
| 1 | 1,595 |
|
| |
|
| ||||
| 0 | 1,859 | reference | 44.9 | |
| Risk genotypes only | 1,238 | 1.51 | 0.2450 | |
| Risk lifestyles only | 897 |
|
| |
| Both risks of genotypes and lifestyles | 698 |
|
| |
|
|
|
CI, confidence interval; HR, hazard ratio; PAR, population attributable risk. Numbers in bold face are statistically significant.
†PAR(%) reflects, in total African American women, a risk of colorectal cancer attributable to the risk genotypes and the risk lifestyles, both singly and in combination.
£The number of risk genotype (PCSK1 rs9285019 TC+CC) was defined as follows: 0 (none) vs. 1 (1 risk allele).
¶The number of lifestyles (≥ 20 years as a regular smoker, ≤ 6.8% of calories from polyunsaturated fatty acid/day, > 42 years old at menopause, 56–79 years old at enrollment, 5–37 years of oral contraceptive use, and > 60.5 g of dietary total sugars) was determined on the basis of analysis for the combined lifestyle factors ( ) and defined as follows: 0 (null/1/2/3/4 risk lifestyles) vs. 1 (5/6 risk lifestyles).
§The combined number of risk genotypes and risk lifestyles was based on risk genotype defined as 0 (none) and 1 (1 risk allele), and risk lifestyles defined as 0 (null/1/2/3/4 risk lifestyles) and 1 (5/6 risk lifestyles). The ultimate number of risk genotypes combined with risk lifestyles was defined as 0 (no risk genotypes and risk lifestyles); and risk genotypes (only risk genotypes) and risk lifestyles (only risk lifestyles), separately and together.
*p values with false discovery rate < 0.05 are shown after multiple comparison corrections via the Benjamini-Hochberg method.
Hispanic American women: combined effect of risk genotypes and risk lifestyles on colorectal cancer risk and population-attributable risk percentage.
| Number of risks | n | HR (95% CI) |
| PAR (%)† |
|---|---|---|---|---|
|
| ||||
| 0 | 1,495 | reference | 66.8 | |
| 1 | 491 |
|
| |
|
| ||||
| 0 | 1,850 | Reference | 26.2 | |
| 1 | 136 |
|
| |
|
| ||||
| 0 | 1,394 | Reference | ||
| Risk genotypes only | 456 |
|
| 73.3 |
| Risk lifestyles only | 101 | 4.97 (0.52 – 47.76) | 0.1653 | |
| Both risks of genotypes and lifestyles | 35 |
|
| |
|
|
|
CI, confidence interval; HR, hazard ratio; PAR, population attributable risk. Numbers in bold face are statistically significant.
†PAR(%) reflects, in total Hispanic African women, a risk of colorectal cancer attributable to the risk genotypes and the risk lifestyles, both singly and in combination.
£The number of risk genotypes (IFT172 rs780104 GG; GCKR rs6753534 CC; and NRBP1 rs704791 TT) was defined as follows: 0 (none/1 risk allele) vs. 1 (2/3 risk alleles).
¶The maximum combined number of lifestyles (> 15.9% of calories from monounsaturated fatty acid [FA]/day, ≥ 25 cigarettes/day, ≤ 38 years old at menopause, > 12.4% of calories from saturated FA/day, ≤ 4.7% of calories from polyunsaturated FA/day, and ≤ 55.6 mg of dietary vitamin K) was 3. The number of lifestyles was determined on the basis of analysis for the combined lifestyle factors ( ) and defined as follows: 0 (null/1/2 risk lifestyles) vs. 1 (3 risk lifestyles).
§The combined number of risk genotypes and risk lifestyles was based on risk genotypes defined as 0 (none/1 risk allele) and 1 (2/3 risk alleles), and risk lifestyles defined as 0 (null/1/2 risk lifestyles) and 1 (3 risk lifestyles). The ultimate number of risk genotypes combined with risk lifestyles was defined as 0 (no risk genotypes and risk lifestyles); and risk genotypes (only risk genotypes) and risk lifestyles (only risk lifestyles), separately and together.
*p values with false discovery rate < 0.05 were shown after multiple comparison corrections via the Benjamini-Hochberg method.
Colorectal cancer attributable risk for the lifestyle factors detected as the topmost predictive variables in both African American and Hispanic American women.
| Overlapped variables:the topmost predictors | African American Women | Hispanic American women |
|---|---|---|
| AR (%) | AR (%) | |
| Smoking† | 61.7 | 87.4 |
| Age at menopause | 28.2 | 57.1 |
| percent calories from PFA/day | 12.7 | 48.9 |
AR, attributable risk; PFA, polyunsaturated fatty acid.
†The modeled variable for smoking factor is years as a regular smoker in African American women and the number of cigarettes smoked daily in Hispanic American women.