| Literature DB >> 31694302 |
Chenjin Ma1,2, Yuan Xue2,3, Shuangge Ma1,2.
Abstract
In cancer research, population-based survival analysis has played an important role. In this article, we conduct survival analysis on patients with brain tumors using the SEER (Surveillance, Epidemiology, and End Results) database from the NCI (National Cancer Institute). It has been recognized that cancer survival models have spatial and temporal variations which are caused by multiple factors, but such variations are usually not "abrupt" (that is, they should be smooth). As such, spatially and temporally pooling all data and analyzing each spatial/temporal point separately are either inappropriate or ineffective. In this article, we develop and implement a spatial- and temporal-smoothing technique, which can effectively accommodate spatial/temporal variations and realize information borrowing across spatial/temporal points. Simulation demonstrates effectiveness of the proposed approach in improving estimation. Data on a total of 123,571 patients with brain tumors diagnosed between 1911 and 2010 from 16 SEER sites is analyzed. Findings different from separate estimation and simple pooling are made. Overall, this study may provide a practically useful way for modeling the survival of brain tumor (and other cancers) using population data.Entities:
Keywords: cancer survival analysis; penalized estimation; population data; spatial- and temporal-smoothing
Year: 2019 PMID: 31694302 PMCID: PMC6895900 DOI: 10.3390/cancers11111732
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Representative analysis results using the proposed and separate estimations: estimated coefficient (circle or triangle) and its 95% confidence interval (vertical bar) for race (White) at each time interval and location.
Figure 2Representative analysis results using the proposed (red) and separate (blue) estimations: estimated coefficient and 95% confidence interval (vertical bar) for each variable at each time interval and location = San Francisco-Oakland.
Figure 3Analysis results for each time interval and by pooling data from all locations: estimated coefficient and 95% confidence interval (vertical bar) for each variable.
Figure 4Analysis results for each location and by pooling data from all time intervals: estimated coefficient and 95% confidence interval (vertical bar) for race (White).