PURPOSE: We examined the degree of exclusion bias that may arise from missing data when prostate cancer cases from the SEER (Surveillance, Epidemiology and End Results) database are grouped into D'Amico clinical risk groups. Exclusion bias may occur because D'Amico classification requires all 3 tumor variables (prostate specific antigen [PSA], clinical T stage and Gleason score) to be known, and data may not be missing at random.

MATERIALS AND METHODS: From the SEER database we identified 132,606 men with incident prostate cancer from 2004 to 2006. We documented age, race, Gleason score, clinical T stage, PSA and geographic region. Men were categorized into D'Amico risk groups. Those with 1 or more unknown tumor variables (PSA, T stage and/or Gleason score) were labeled unclassified. For each tumor variable we compared the values of the other 2 variables between men in whom that variable was known and men in whom it was unknown. Demographics were compared between men with and without missing data. Comparisons used the chi-square test and logistic regression.

RESULTS: Of the men 33% had 1 or more unknown tumor variables, with T stage the most commonly missing variable. There was no clinically significant difference in the values of the other 2 tumor variables when T stage or PSA was missing. Men older than 75 years were more likely to have unknown variables than younger men. There was significant geographic variation in the frequency of unclassified D'Amico data.

CONCLUSIONS: Studies that limit the data set to men who can be classified into a D'Amico risk group exclude 33% of eligible patients from analysis. The excluded men are older and more likely to come from certain SEER registries, but their tumor characteristics are similar to those of men with complete data.
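The classification rule described in the methods can be illustrated with a minimal Python sketch. It is not the authors' code. The cut points are the commonly cited D'Amico criteria (low risk: PSA 10 ng/ml or less, Gleason score 6 or less and stage T1 to T2a; intermediate risk: PSA greater than 10 to 20 ng/ml, Gleason score 7 or stage T2b; high risk: PSA greater than 20 ng/ml, Gleason score 8 to 10 or stage T2c or higher), which the abstract does not state. The function names, the T stage strings and the sample records are hypothetical. As in the abstract, any case with 1 or more unknown variables is labeled unclassified.

```python
from typing import Iterable, Optional, Tuple

# Assumed T-stage spellings for this sketch; real registry coding differs.
LOW_T = {"T1", "T1a", "T1b", "T1c", "T2a"}
INTERMEDIATE_T = {"T2b"}
HIGH_T = {"T2c", "T3", "T3a", "T3b", "T4"}

Case = Tuple[Optional[float], Optional[int], Optional[str]]


def damico_group(psa: Optional[float],
                 gleason: Optional[int],
                 t_stage: Optional[str]) -> str:
    """Return 'low', 'intermediate', 'high' or 'unclassified'."""
    # Rule from the abstract: any unknown variable means unclassified,
    # even if a known variable alone would imply high risk.
    if psa is None or gleason is None or t_stage is None:
        return "unclassified"
    if psa > 20 or gleason >= 8 or t_stage in HIGH_T:
        return "high"
    if psa > 10 or gleason == 7 or t_stage in INTERMEDIATE_T:
        return "intermediate"
    if gleason <= 6 and t_stage in LOW_T:
        return "low"
    # T stage string not recognized by this sketch (e.g. "T2" with no substage).
    return "unclassified"


def unclassified_fraction(cases: Iterable[Case]) -> float:
    """Fraction of cases that cannot be assigned a D'Amico risk group."""
    groups = [damico_group(*case) for case in cases]
    if not groups:
        return 0.0
    return groups.count("unclassified") / len(groups)


# Hypothetical records: (PSA in ng/ml, Gleason score, clinical T stage).
sample = [
    (5.0, 6, "T1c"),    # low
    (12.0, 7, "T2a"),   # intermediate
    (25.0, None, "T1c"),  # unclassified: Gleason score unknown
    (8.0, 6, None),     # unclassified: T stage unknown
]
print(unclassified_fraction(sample))
```

Note that the third sample record would be high risk on PSA alone, yet it is excluded under the complete-case rule. This is the kind of exclusion the study quantifies.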