Literature DB >> 18441405

A method to model season of birth as a surrogate environmental risk factor for disease.

Jimmy Thomas Efird1, Susan Searles Nielsen.   

Abstract

Environmental exposures, including some that vary seasonally, may play a role in the development of many types of childhood diseases such as cancer. Those observed in children are unique in that the relevant period of exposure is inherently limited or perhaps even specific to a very short window during prenatal development or early infancy. As such, researchers have investigated whether specific childhood cancers are associated with season of birth. Typically a basic method for analysis has been used, for example categorization of births into one of four seasons, followed by simple comparisons between categories such as via logistic regression, to obtain odds ratios (ORs), confidence intervals (CIs) and p-values. In this paper we present an alternative method, based upon an iterative trigonometric logistic regression model used to analyze the cyclic nature of birth dates related to disease occurrence. Disease birth-date results are presented using a sinusoidal graph with a peak date of relative risk and a single p-value that tests whether an overall seasonal association is present. An OR and CI comparing children born in the 3-month period around the peak to the symmetrically opposite 3-month period also can be obtained. Advantages of this derivative-free method include ease of use, increased statistical power to detect associations, and the ability to avoid potentially arbitrary, subjective demarcation of seasons.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18441405      PMCID: PMC3684403          DOI: 10.3390/ijerph5010049

Source DB:  PubMed          Journal:  Int J Environ Res Public Health        ISSN: 1660-4601            Impact factor:   3.390


Introduction

Cancer researchers have sought to demonstrate whether a link exists between season of birth and childhood diseases such as cancer [1-13]. An early narrow window of susceptibility during prenatal development [14-15] or perhaps infancy is believed to exist for childhood cancer. These periods are characterized by rapid cell growth and division and a yet undeveloped immune system. Oncogenic viruses [16-18] and chemicals [19-20] have been shown under laboratory conditions to readily induce cancers when applied during specific periods in development versus adulthood. Therefore, evidence of an association between childhood cancer and season of birth may suggest a role for a seasonally variable environmental exposure in its etiology. Exposure to infectious agents, pesticides, indoor environmental tobacco smoke and other sources of polycyclic aromatic hydrocarbons, and use of antihistamines are a few examples of environmental factors that conceivably may follow a seasonal pattern. Other factors of interest to consider in the study of childhood cancer and seasonality include harmonic variation in population mixing, diet, temperature, humidity, sunlight/photoperiod, levels of vitamin D3 and endogenous hormones. A number of statistical tests for the analysis of harmonic data have been presented in the literature [21-42]. This paper presents a novel and easy to use adaptation of earlier methods that is suitable for analyzing season of birth as a risk factor for diseases such as childhood cancer.

Methods

Logistic regression is used to estimate the probability for disease in relation to potential risk factors and confounding variables [43]. The technique has been widely used in epidemiologic studies, including case-control studies to examine the etiology of childhood cancer. Letting x denote a study participant’s values for the (r) predictor variables in a logistic regression model, the probability for disease (D) is computed as: where â,b̂1,…,b̂ are the intercept and coefficients estimated from the data using maximum likelihood methods. In a case-control study, the odds ratios (ORs) can be determined from the logistic regression model and are the exponentiated values of by the corresponding estimated regression coefficients b̂1,…,b̂. A p-value for a specific predictor variable may be determined by taking twice the logarithm of the ratio of the likelihood of the data under the model including the variable to the likelihood without the variable. The resulting value is compared to a χ2 statistic with 1 degree of freedom. A predictor variable in the simplest case may be expressed as a dichotomous variable, e.g., whether birth occurred in summer. However, more complex forms may be appropriate. A variable such as date of birth (DOB, coded as an integer from 1 to 365) may be expressed as a trigonometric function [44-45]. In this example, let where ξMAX. is determined iteratively by finding the value from 1 to 365 that maximizes the coefficient b̂1. The maximum 3-month seasonal period of risk is found by taking the 91.25 day-wide interval centered on ξMAX·. Analogously, the minimum risk period is found by taking the symmetrically opposite 3-month interval centered on ξMIN·. The seasonal association is visualized by plotting b̂1x1 (i.e., harmonic displacement) against DOB over the range 1 to 365. A single p-value can be obtained as described above for this predictor variable to test whether a seasonal pattern exists. An OR for disease in the maximum versus minimum 3-month seasonal period and corresponding 95% confidence interval (CI) also may be computed, using standard methods [43]. In the case of a leap year, the 29th day of February is recoded as calendar day 59 so that the respective year consists of 365 days.

Example

Using hypothetical childhood cancer birth-date data from a case-control study (Appendix 1), we conducted analyses using the methods described above, and for comparison, the typical, more basic method to examine whether there is a seasonal pattern in children’s DOB. The identification of an underlying sinusoidal trend would be consistent with the hypothesis of a seasonally varying exposure (e.g., viruses, use of pesticides) as a possible etiologic risk factor for childhood cancer.
Appendix 1:

Hypothetical case-control data (Cases n=134, Controls n=261)

Day of birthNo. of casesNo. of controlsDay of birthNo. of casesNo. of controlsDay of birthNo. of casesNo. of controlsDay of birthNo. of casesNo. of controlsDay of birthNo. of casesNo. of controls
1117710139012120228801
3127810140122130128910
4107911141032141029010
5028010142112150229101
6018101143112160229310
7018301144022170129401
8218501146202211129610
12028611147022232129714
15118701148012240129801
17028802150012250129912
18208902152012263330010
19109010154112270230101
21119110155012280130310
22119202157102300130420
23119311159022311230501
24109421160012320130601
26019601161012341130703
27119711164102350230801
29019910166022360230912
320210002167102371131112
331110212168202390131210
351010301169012400131312
360110710170012420231411
392210801172012450131610
401110911174112460132001
421111010175122482032221
430211104176012490132310
450111211177012512332411
461011301179202520132501
471111411180012531232701
480111501182022550132810
490111610184102561232901
501011710186112570133110
510111801187022621033610
531011902188022631133811
541112010190022640134001
561012211191012650234101
571112301194012661134201
581012402195012670134501
591112510196022691034611
600212620197102700134701
622012701198112730134910
630112901199012740135101
661013010200112760135602
680213102202112770235801
691213210203022780135921
700113301204112790136101
711013510208012821236203
730113602209022840136401
750113801210012870136511
In this example, no significantly increased OR for childhood cancer (all p>0.05) was observed for pair-wise seasonal comparisons when defined in simple categorizations, here as fall (September, October, November), winter (December, January, February), spring (March, April, May), and summer (June, July, August), although the lower confidence limit for ‘winter versus summer’ was just slightly less than unity (Table 1). However, when applying equation (2) to the data in a logistic regression model, a statistically significant (p = 0.0165) seasonal pattern was observed, with peak risk occurring in early February at day 33 (Figure 1). The respective OR for childhood cancer when comparing the maximum versus minimum 3-month seasonal period was 2.2 (95% CI=1.2–4.1).
Table 1:

Odds ratios for childhood cancer by season of child’s birth using hypothetical data (Cases n = 134, Controls n = 261)

Season of birthÞOdds ratio95% Confidence interval
Winter vs. Spring1.2(0.68–2.1)
Spring vs. Fall1.0(0.56–1.8)
Winter vs. Fall1.2(0.67–2.2)
Winter vs. Summer1.8(0.99–3.3)
Spring vs. Summer1.5(0.84–2.7)
Fall vs. Summer1.5(0.82–2.8)

Spring = {March, April, May}; Summer = {June, July, August}; Fall = {September, October, November}; Winter = {December, January, February}.

Figure 1:

Sinusoidal logistic regression model for hypothetical childhood cancer – birth date data.

Discussion

We have presented a simple, iterative logistic regression-based method to analyze seasonal data. The method represents a generalization of earlier trigonometric models yet is easier to apply and interpret. A novel aspect of the technique is its ability to optimally fit a sinusoidal curve to the underlying data by plotting harmonic displacement against calendar time. An additional key feature of this approach is the ability to obtain an overall p-value and an OR for disease in the “maximum versus minimum” 3-month seasonal period and a corresponding 95% CI. Whereas no single method provides a universal solution to handle harmonic data, the current method accommodates varying length of months, different populations at risk, adjustment for potentially confounding variables, and is fairly robust when used for small samples. The associated statistical test inherently will have greater family-wise power to detect a sinusoidal pattern when compared to chi-square methods or performing multiple pair-wise tests for seasonality. Analogous to a dose response relationship based upon a best-fitting monotonic model and a priori mechanism of action, multiplicity correction is not necessary for sinusoidal logistic regression because there is only one parameter and one statistical test. Furthermore, it takes into account the order of events (e.g., consecutively high/low time periods) and in contrast to pair-wise seasonal comparisons, the underlying definition of season in the current model is not arbitrary for a start and end date, but is determined via the model algorithm. Several limitations may apply to the use of sinusoidal logistic regression. For example, parameter estimates may be biased if there is a discrepancy between observed values and values expected under the model. Accordingly, the data should be examined for goodness-of-fit using a standard procedure such as the Hosmer-Lemeshow test [46]. Erroneous results may occur in the case of multiple within-year cycles or competing out-of-phase cycles resulting in a cancelling of effects (e.g., opposing seasonal effects by histologic subgroup). A minor modification can be made to the sinusoidal function to allow for multiple cycles [25–26, 34, 40]. For example, a lunar cycle having multiple peaks per year may be modeled by substituting “365” in the denominator of equation (2) with “29.53” (i.e., the number of days in the lunar cycle). When appropriate, stratification is advised in the latter situation as a means to minimize “cancelling of effects.” Further, the lack of a seasonal effect does not necessarily rule out the etiologic importance of putative risk factors that vary in the environment seasonally. Conversely, the seasonal association of a specific risk factor with childhood cancer does not necessarily imply causality. As with any statistical test, the results of this method should be carefully interpreted in light of underlying limitations and biologic plausibility.
  34 in total

1.  Statistical analysis of the seasonal variation in demographic data.

Authors:  J Fellman; A W Eriksson
Journal:  Hum Biol       Date:  2000-10       Impact factor: 0.553

2.  Studying seasonality by using sine and cosine functions in regression analysis.

Authors:  A M Stolwijk; H Straatman; G A Zielhuis
Journal:  J Epidemiol Community Health       Date:  1999-04       Impact factor: 3.710

3.  Seasonal variation in month of birth and diagnosis of early childhood acute lymphoblastic leukemia.

Authors:  H T Sørensen; L Pedersen; J Olsen; K Rothman
Journal:  JAMA       Date:  2001-01-10       Impact factor: 56.272

4.  Seasonal variation of livebirths, stillbirths, extramarital births and twin maternities in Switzerland.

Authors:  A W Eriksson; J Fellman
Journal:  Twin Res       Date:  2000-12

5.  The recognition and estimation of cyclic trends.

Authors:  J H EDWARDS
Journal:  Ann Hum Genet       Date:  1961-05       Impact factor: 1.670

6.  Seasonal birth frequencies in parameters.

Authors:  H J STUTVOET
Journal:  Acta Genet Stat Med       Date:  1951

7.  Seasonality in birth weight in Israel: new evidence suggests several global patterns and different etiologies.

Authors:  Gabriel Chodick; Varda Shalev; Iris Goren; Peter D Inskip
Journal:  Ann Epidemiol       Date:  2007-02-14       Impact factor: 3.797

8.  Medulloblastoma and birth date: evaluation of 3 U.S. datasets.

Authors:  Edward C Halperin; Marie L Miranda; Dorothy M Watson; Stephen L George; Matthew Stanberry
Journal:  Arch Environ Health       Date:  2004-01

9.  Seasonality of birth for cancer in Northern England, UK.

Authors:  R G Feltbower; M S Pearce; H O Dickinson; L Parker; P A McKinney
Journal:  Paediatr Perinat Epidemiol       Date:  2001-10       Impact factor: 3.980

10.  Winter birth in association with a risk of brain tumor among a Finnish patient population.

Authors:  Arja Mainio; Helinä Hakko; John Koivukangas; Asko Niemelä; Pirkko Räsänen
Journal:  Neuroepidemiology       Date:  2006-07-14       Impact factor: 3.282

View more
  8 in total

1.  Season of birth and other perinatal risk factors for melanoma.

Authors:  Casey Crump; Kristina Sundquist; Weiva Sieh; Marilyn A Winkleby; Jan Sundquist
Journal:  Int J Epidemiol       Date:  2014-01-22       Impact factor: 7.196

2.  Season of birth and risk of Hodgkin and non-Hodgkin lymphoma.

Authors:  Casey Crump; Jan Sundquist; Weiva Sieh; Marilyn A Winkleby; Kristina Sundquist
Journal:  Int J Cancer       Date:  2014-04-25       Impact factor: 7.396

3.  Perinatal risk factors for acute myeloid leukemia.

Authors:  Casey Crump; Jan Sundquist; Weiva Sieh; Marilyn A Winkleby; Kristina Sundquist
Journal:  Eur J Epidemiol       Date:  2015-06-26       Impact factor: 8.082

4.  Perinatal and familial risk factors for acute lymphoblastic leukemia in a Swedish national cohort.

Authors:  Casey Crump; Jan Sundquist; Weiva Sieh; Marilyn A Winkleby; Kristina Sundquist
Journal:  Cancer       Date:  2014-11-21       Impact factor: 6.860

5.  Perinatal and familial risk factors for brain tumors in childhood through young adulthood.

Authors:  Casey Crump; Jan Sundquist; Weiva Sieh; Marilyn A Winkleby; Kristina Sundquist
Journal:  Cancer Res       Date:  2014-12-15       Impact factor: 12.701

Review 6.  Season of birth and risk for adult onset glioma.

Authors:  Jimmy T Efird
Journal:  Int J Environ Res Public Health       Date:  2010-04-28       Impact factor: 3.390

7.  Sinusoidal cox regression-a rare cancer example.

Authors:  Jimmy Thomas Efird
Journal:  Cancer Inform       Date:  2010-11-28

8.  Seasonal Variation in Skin Cancer Diagnosis.

Authors:  Fortunato Bianconi; Giuseppe M Masanotti; Arcangelo Liso; Francesco La Rosa; Emilio Duca; Fabrizio Stracci
Journal:  Front Public Health       Date:  2016-04-28
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.