| Literature DB >> 16539725 |
Mitchell S Wachtel1, Yan Zhang, Maurizio Chiriva-Internati, Eldo E Frezza.
Abstract
BACKGROUND: Although impacts upon gastric cancer incidence of race, age, sex, and Lauren type have been individually explored, neither their importance when evaluated together nor the presence or absence of interactions among them have not been fully described.Entities:
Mesh:
Year: 2006 PMID: 16539725 PMCID: PMC1479359 DOI: 10.1186/1471-2407-6-65
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Schema for acquisition of data from SEER for this study.
| Statistic: Crude Rates | |
| Case Only: {Site and Morphology.Site recode} = 'Stomach' | |
| Option: Select only malignant behaviour | |
| Row: Year of dx, 92–96, 97–01 [Year of diagnosis] | |
| Men and Women [Sex] | |
| Stomach cancer types [Histologic Type ICD-O-3] | |
| Race, Asian or not, no unknown [Race recode Y] | |
| Column: Age recode with <1 year olds | |
| Stomach cancer types [Histologic Type ICD-O-3] | |
| intestinal = 8144 | |
| non-intestinal = 8142,8145,8490 | |
| Year of dx, 92–96, 97–01 [Year of diagnosis] | |
| 1992–1996 = 1992, 1993, 1994, 1995, 1996 | |
| 1997–2001 = 1997, 1998, 1999, 2000, 2001 | |
| Sex [Sex] | |
| Male = Male | |
| Female = Female | |
| Race, Asian or not, no unknown [Race recode Y] | |
| Asian = Asian or Pacific Islander | |
| Non-Asian = All other except unknown |
Frequency distribution of 7882 persons who developed stomach cancer, by five year period, by sex, by race, and by Lauren type.
| five year period | 1992–1996 | 1997–2001 |
| 3,429 | 4,453 | |
| Sex | Men | women |
| 4,399 | 3,483 | |
| Race | Asian | non-Asian |
| 2,059 | 5,823 | |
| Lauren type | type 1 | Type 2 |
| 1,992 | 5,890 |
Frequency distribution of denominator population by five year period, by sex, and by race.
| five year period | 1992–1996 | 1997–2001 |
| 68,395,787 | 76,759,882 | |
| Sex | men | women |
| 67,831,186 | 77,324,483 | |
| Race | Asian | non-Asian |
| 14,600,067 | 130,555,602 |
Frequency distribution of persons with cancer and persons in denominator by age group.
| Ages | Persons with Cancer | Persons in Denominator | ||
| 40–44 | 353 | 4.5% | 29,331,208 | 20.2% |
| 45–49 | 442 | 5.6% | 25,218,817 | 17.4% |
| 50–54 | 542 | 6.9% | 20,412,681 | 14.1% |
| 55–59 | 631 | 8.0% | 15,655,389 | 10.8% |
| 60–64 | 762 | 9.7% | 13,042,756 | 9.0% |
| 65–69 | 1,056 | 13.4% | 11,913,627 | 8.2% |
| 70–74 | 1,250 | 15.9% | 10,623,230 | 7.3% |
| 75–79 | 1,203 | 15.3% | 8,522,907 | 5.9% |
| 80–84 | 874 | 11.1% | 5,660,379 | 3.9% |
| 85+ | 769 | 9.8% | 4,774,675 | 3.3% |
| Total | 7,882 | 100.0% | 145,155,669 | 100.0% |
Univariate regression of the natural logarithm of the rate of stomach cancer on five year period, on sex, on race, on Lauren type, and on age group.
| Explanatory variable | Intercept | R2 | Estimate | Std Error | t | Pr (> | t|) |
| five year period | -10.40 | 0.002 | 0.16 | 0.26 | 0.61 | 0.54 |
| Sex | -9.98 | 0.042 | -0.68 | 0.26 | -2.62 | 0.01 |
| Race | -11.01 | 0.174 | 1.38 | 0.24 | 5.77 | 4.1 × 10-8 |
| Lauren type | -10.98 | 0.158 | 1.31 | 0.24 | 5.44 | 2.0 × 10-7 |
| age group | -12.54 | 0.491 | 0.40 | 0.03 | 12.35 | < 1 × 10 -10 |
Comparisons of linear regression models, with and without outlier, of the natural logarithm of the rate of stomach cancer on the main explanatory variables. The difference between the residual sum of squares (RSS) before and after each explanatory variable had been added to regression (ΔRSS) was divided by RSS and multiplied by the error df to yield F, whose numerator df was 1 and denominator df was the error df.
| WITHOUT OUTLIER | |||||
| Model Covariates | RSS | ΔRSS | error df | F | |
| Null | 403.26 | ||||
| five year period | 402.96 | 0.30 | 157 | 0.1 | 0.73 |
| five year period + sex | 388.32 | 14.64 | 156 | 5.9 | 0.02 |
| five year period +sex + race | 319.23 | 69.09 | 155 | 33.5 | 3.8 × 10-8 |
| five year period + sex + race + Lauren type | 255.93 | 63.30 | 154 | 38.1 | 5.7 × 10-9 |
| five year period + sex + race + Lauren type + age | 50.64 | 205.29 | 153 | 620.2 | < 1 × 10 -10 |
| WITH OUTLIER | |||||
| Model Covariates | RSS | ΔRSS | error df | F | |
| Null | 437.81 | ||||
| five year period | 436.79 | 1.02 | 158 | 0.4 | 0.54 |
| five year period + sex | 418.50 | 18.29 | 157 | 6.9 | 0.01 |
| five year period +sex + race | 342.38 | 76.12 | 156 | 34.7 | 2.3 × 10-8 |
| five year period + sex + race + Lauren type | 273.23 | 69.15 | 155 | 39.2 | 3.5 × 10-9 |
| five year period + sex + race + Lauren type + age | 58.14 | 215.10 | 154 | 569.8 | < 1 × 10 -10 |
Comparisons of linear regression models, with and without outlier, of the natural logarithm of the rate of stomach cancer on the main explanatory variables and each of ten interaction variables. The main effects (ME) comprised the explanatory variables five year period, sex, race, Lauren type, and age. The difference between the residual sum of squares (RSS) before and after the addition of each interaction variable to ME (ΔRSS) was divided by RSS and multiplied by error d.f. to yield F, whose numerator df was 1 and denominator df was error df.
| WITHOUT OUTLIER | |||||
| Model Covariates | RSS | ΔRSS | error df | F | |
| ME | 50.64 | ||||
| ME + five year period:sex | 50.60 | 0.05 | 152 | 0.1 | 0.71 |
| ME + five year period:race | 50.09 | 0.55 | 152 | 1.7 | 0.20 |
| ME + five year period: Lauren type | 50.51 | 0.13 | 152 | 0.4 | 0.53 |
| ME + five year period:age | 50.04 | 0.60 | 152 | 1.8 | 0.18 |
| ME + sex:race | 50.62 | 0.02 | 152 | 0.1 | 0.82 |
| ME + sex:Lauren type | 43.99 | 6.65 | 152 | 23.0 | 3.9 × 10-6 |
| ME + sex:age | 50.63 | 0.02 | 152 | 0.0 | 0.83 |
| ME + race:Lauren type | 45.00 | 5.64 | 152 | 19.0 | 2.3 × 10-5 |
| ME + race:age | 49.50 | 1.14 | 152 | 3.5 | 0.06 |
| ME + Lauren type:age | 31.62 | 19.03 | 152 | 91.5 | < 1 × 10 -10 |
| WITH OUTLIER | |||||
| Model Covariates | RSS | ΔRSS | error df | F | |
| ME | 58.14 | ||||
| ME + five year period:sex | 57.95 | 0.19 | 153 | 0.5 | 0.48 |
| ME + five year period:race | 57.22 | 0.92 | 153 | 2.5 | 0.12 |
| ME + five year period: Lauren type | 57.80 | 0.34 | 153 | 0.9 | 0.35 |
| ME + five year period:age | 57.88 | 0.25 | 153 | 0.7 | 0.41 |
| ME + sex:race | 58.01 | 0.13 | 153 | 0.3 | 0.57 |
| ME + sex:Lauren type | 50.34 | 7.79 | 153 | 23.7 | 2.8 × 10-6 |
| ME + sex:age | 58.11 | 0.02 | 153 | 0.1 | 0.81 |
| ME + race:Lauren type | 51.44 | 6.70 | 153 | 19.9 | 1.6 × 10-5 |
| ME + race:age | 57.50 | 0.63 | 153 | 1.7 | 0.20 |
| ME + Lauren type:age | 36.88 | 21.25 | 153 | 88.2 | < 1 × 10 -10 |
Final multiple linear regression models, with and without outlier, of the natural logarithm of the rate of stomach cancer.
| WITHOUT OUTLIER | ||||||
| Parameter | Estimate | Std Error | t | df | Pr(>| t|) | 95% Conf Int |
| Intercept | -14.15 | 0.11 | -134.44 | 151 | < 1 × 10 -10 | -14.35 – -13.94 |
| Sex | -1.07 | 0.08 | -13.15 | 151 | < 1 × 10 -10 | -1.23 – -0.91 |
| Race | 1.74 | 0.08 | 21.42 | 151 | < 1 × 10 -10 | 1.58 – 1.90 |
| Lauren type | 2.59 | 0.15 | 17.53 | 151 | < 1 × 10 -10 | 2.30 – 2.89 |
| Age | 0.52 | 0.01 | 36.69 | 151 | < 1 × 10 -10 | 0.49 – 0.55 |
| age:Lauren type | -0.24 | 0.02 | -12.19 | 151 | < 1 × 10 -10 | -0.28 – -0.20 |
| race:Lauren type | -0.77 | 0.11 | -6.72 | 151 | 3.6 × 10 -10 | -0.99 – -0.54 |
| sex:Lauren type | 0.83 | 0.11 | 7.28 | 151 | < 1 × 10 -10 | 0.61 – 1.06 |
| Overall Model | Std Error | R2 | F | df num | df den | |
| 0.36 | 0.95 | 421 | 7 | 151 | < 1 × 10 -10 | |
| WITH OUTLIER | ||||||
| Parameter | Estimate | Std Error | t | df | Pr(>| t|) | 95% Conf Int |
| Intercept | -14.23 | 0.11 | -125.58 | 152 | < 1 × 10 -10 | -14.45 – -14.01 |
| Sex | -1.12 | 0.09 | -12.73 | 152 | < 1 × 10 -10 | -1.29 – -0.94 |
| Race | 1.79 | 0.09 | 20.38 | 152 | < 1 × 10 -10 | 1.62 – 1.96 |
| Lauren type | 2.68 | 0.16 | 16.71 | 152 | < 1 × 10 -10 | 2.36 – 2.99 |
| Age | 0.53 | 0.02 | 34.72 | 152 | < 1 × 10 -10 | 0.50 – 0.56 |
| age:Lauren type | -0.25 | 0.02 | -11.74 | 152 | < 1 × 10 -10 | -0.30 – -0.21 |
| race:Lauren type | -0.82 | 0.12 | -6.59 | 152 | 6.7 × 10 -10 | -1.06 – -0.57 |
| sex:Lauren type | 0.88 | 0.12 | 7.11 | 152 | < 1 × 10 -10 | 0.64 – 1.13 |
| Overall Model | Std Error | R2 | F | df num | df den | |
| 0.39 | 0.95 | 384 | 7 | 152 | < 1 × 10 -10 | |
Figure 1Plot of the natural logarithms of cancer rates, denoted as ln(ca), as a function of age in years for Asian men. Red references Lauren type 1 gastric cancer. Blue references Lauren type 2 gastric cancer. Lines represent predicted values. [see Additional file 1]
Figure 2Plot of the natural logarithms of cancer rates, denoted as ln(ca), as a function of age in years for Asian women. Red references Lauren type 1 gastric cancer. Blue references Lauren type 2 gastric cancer. Lines represent predicted values. [see Additional file 1]
Figure 3Plot of the natural logarithms of cancer rates, denoted as ln(ca), as a function of age in years for non-Asian men. Red references Lauren type 1 gastric cancer. Blue references Lauren type 2 gastric cancer. Lines represent predicted values. [see Additional file 1]
Figure 4Plot of the natural logarithms of cancer rates, denoted as ln(ca), as a function of age in years for non-Asian women. Red references Lauren type 1 gastric cancer. Blue references Lauren type 2 gastric cancer. Lines represent predicted values. [see Additional file 1]
Quantitative assessments of model adequacy.
| Shapiro-Wilks normality test performed on standardized residuals. | ||||
| Without outlier | 0.984 | 0.06 | ||
| With outlier | 0.960 | 0.0001 | ||
| T test to see if mean of standardized residuals was not zero. | ||||
| Mean | T | df | ||
| Without outlier | -0.003 | -0.03 | 158 | 0.97 |
| With outlier | -0.003 | -0.03 | 159 | 0.97 |
| Bartlett's test to see if the standardized residuals, divided into four groups by their corresponding fitted values, lacked constant variances. | ||||
| K2 | Df | |||
| Without outlier | 4.43 | 3 | 0.22 | |
| With outlier | 15.02 | 3 | 0.002 | |