| Literature DB >> 24366047 |
Geoffrey M Jacquez1, Janice Barlow, Robert Rommel, Andy Kaufmann, Michael Rienti, Gillian AvRuskin, Jawaid Rasul.
Abstract
Marin County (California, USA) has among the highest incidences of breast cancer in the U.S. A previously conducted case-control study found eight significant risk factors in participants enrolled from 1997-1999. These included being premenopausal, never using birth control pills, lower highest lifetime body mass index, having four or more mammograms from 1990-1994, beginning drinking alcohol after age 21, drinking an average two or more alcoholic drinks per day, being in the highest quartile of pack-years of cigarette smoking, and being raised in an organized religion. Previously conducted surveys provided residential histories; while statistic accounted for participants' residential mobility, and assessed clustering of breast cancer cases relative to controls based on the known risk factors. These identified specific cases, places, and times of excess breast cancer risk. Analysis found significant global clustering of cases localized to specific residential histories and times. Much of the observed clustering occurred among participants who immigrated to Marin County. However, persistent case-clustering of greater than fifteen years duration was also detected. Significant case-clustering among long-term residents may indicate geographically localized risk factors not accounted for in the study design, as well as uncertainty and incompleteness in the acquired addresses. Other plausible explanations include environmental risk factors and cases tending to settle in specific areas. A biologically plausible exposure or risk factor has yet to be identified.Entities:
Mesh:
Year: 2013 PMID: 24366047 PMCID: PMC3924444 DOI: 10.3390/ijerph110100271
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Residential histories as space-time step functions. The axes x and y define a geographic domain (i.e., longitude and latitude decimal degrees), the t axes represents time (i.e., date). The study extends from time t to time t. The residential histories for persons i and j are shown as step functions through space-time. For example, person i begins the study residing at location x. They remain at that geographic coordinate until the instant before time t1, when they move to x. The duration of time they reside at this first place of residence is ω0.
Figure 2Venn diagram illustrating types of space-time clusters that can be identified using Ǫ-statistics. The rectangle represents all Ǫ statistics in a study, significant or not. Each circle represents clusters that are found statistically significant locally (e.g., excess of cases about case i at time t, ), over a cases’ life course (e.g., excess of cases about the residential history of case i, ), and globally at a given time t when all cases are considered together (e.g., large-scale spatial clusters at time t, ). These cluster sets and their intersections (Ã, , , ) can provide insights into, and generate hypotheses regarding, disease etiologies. When the underlying Ǫ-statistics have been adjusted for the risk factors and covariates found significant in the parent case-control study these cluster types identify where, when, and to whom to allocate unexplained (e.g., excess) risk (Table 1).
Description of cluster sets, summary space-time pattern descriptions, and disease etiologies that may give rise to those patterns.
| Cluster set | Description | Pattern | Possible etiology |
|---|---|---|---|
| Cases ( | Cases ( | ||
| Clustering over the life course | Cases ( | ||
| Temporal case clustering | Large scale spatial clustering of cases at time
| ||
|
| Locations and time when cases with significant clustering over their life course are members of a geographically localized cluster. Includes both ephemeral and persistent clusters. | ||
| Cases ( | |||
| Cases that have clustering over their life course and are part of large-scale spatial clusters at times | |||
| Cases that have clustering over their life course, are part of large scale clusters at time t and whose local clusters | Etiology is similar to set |
Statistics to evaluate the overall significance of the cluster types in a manner that does not involve multiple testing.
| Cluster type | Cluster description | Test statistic | Probability of test statistic |
|---|---|---|---|
| Local case-time | |||
| Life course | |||
| Temporal case clustering | |||
|
| |||
Number of possible test statistics including those significant and not significant for each cluster type, using the duration-weighted tests. Here n1t is the number of cases extant in the study area at time t.
| Cluster type | Cluster description | Test statistic | Number of possible elements in each set ( |
|---|---|---|---|
| Local case-time | |||
| Life course |
| ||
| Temporal case clustering |
|
Figure 3The study population is comprised of “movers” and “stayers”. Prior to 1980, the study population was comprised primarily of long-term residents, illustrated by the bimodal distribution of residence months (RESMONTHS, upper left histogram). The time plot (graph lower left) shows how residence time at each participant’s current residence changes through time. Each line corresponds to a study participant. Lines highlighted in orange are current Marin residents who have lived in the same home for at least 240 months (20 years), when the study was conducted. The map of Marin on 10-9-1977 (right) is comprised almost entirely of long-term residents.
Figure 4Life course place of residence for study participants in Marin County (upper left), California (upper right), and from across the United States (bottom). Cases are denoted as purple circles, and controls as gray crosses.
Figure 5The sensitivity analysis using the Global Ǫ-statistic found significant global clustering of cases relative to controls after statistical adjustment for covariates and risk factors at k = 2, 3, and 4, with p < 0.01 at k = 4 nearest neighbors. This analysis was conducted at the spatial scale of Marin County.
Figure 6The probability of the Ǫ test for spatial clustering of cases relative to controls through time. This test assesses whether and when there is spatial clustering of cases when all of the cases and controls are considered simultaneously at a given time t. p = 0.05 is shown by the blue horizontal line. The count of the number of observations below this line is = 122 and is highly significant (p = 0.000001625).
Figure 7Life course clusters in Epoch 1. Locations of cases with significant clustering over their life course at the end of Epoch 1. Five cases had residential history data recorded on 2-1-1968. One case with significant clustering resided in Marin County (upper left), two were in the Bay area (bottom), and two in the Northeast near Long Island (upper right).
Figure 8Locations of cases with significant clustering over their life course at the end of Epoch 3. At that time 9 cases with significant clustering resided in Marin County (upper left), with other clusters found in the upper Midwest (upper right), southern California and Long Island (bottom).
Figure 9Local clusters of breast cancer cases (large red circles) for twenty year (left) and fifteen year (right) residents of Marin County, California. The observed clustering is statistically significant when residential mobility and significant risk factors and covariates are accounted for.