Elsevier

Annals of Epidemiology

Volume 17, Issue 9, September 2007, Pages 679-688
Annals of Epidemiology

Weather Variability and the Incidence of Cryptosporidiosis: Comparison of Time Series Poisson Regression and SARIMA Models

https://doi.org/10.1016/j.annepidem.2007.03.020Get rights and content

Purpose

Few studies have examined the relationship between weather variables and cryptosporidiosis in Australia. This paper examines the potential impact of weather variability on the transmission of cryptosporidiosis and explores the possibility of developing an empirical forecast system.

Methods

Data on weather variables, notified cryptosporidiosis cases, and population size in Brisbane were supplied by the Australian Bureau of Meteorology, Queensland Department of Health, and Australian Bureau of Statistics for the period of January 1, 1996-December 31, 2004, respectively. Time series Poisson regression and seasonal auto-regression integrated moving average (SARIMA) models were performed to examine the potential impact of weather variability on the transmission of cryptosporidiosis.

Results

Both the time series Poisson regression and SARIMA models show that seasonal and monthly maximum temperature at a prior moving average of 1 and 3 months were significantly associated with cryptosporidiosis disease. It suggests that there may be 50 more cases a year for an increase of 1°C maximum temperature on average in Brisbane. Model assessments indicated that the SARIMA model had better predictive ability than the Poisson regression model (SARIMA: root mean square error (RMSE): 0.40, Akaike information criterion (AIC): −12.53; Poisson regression: RMSE: 0.54, AIC: −2.84). Furthermore, the analysis of residuals shows that the time series Poisson regression appeared to violate a modeling assumption, in that residual autocorrelation persisted.

Conclusions

The results of this study suggest that weather variability (particularly maximum temperature) may have played a significant role in the transmission of cryptosporidiosis. A SARIMA model may be a better predictive model than a Poisson regression model in the assessment of the relationship between weather variability and the incidence of cryptosporidiosis.

Introduction

Cryptosporidiosis is an intestinal illness caused by a microscopic parasite called Cryptosporidium. This organism has been recognized as a parasite of a wide variety of vertebrates, but it was not known to cause disease in humans until 1976. Since then, it has been identified as a cause of sporadic human gastrointestinal disease as well as of large outbreaks, particularly water-borne outbreaks 1, 2. It has been estimated that between 250 million and 500 million humans are infected annually with Cryptosporidium in Asia, Africa and Latin America (3). Over the last 9 years (1996–2004), a total of 5744 laboratory-confirmed cryptosporidiosis cases have been identified in Queensland, Australia (4). The major symptoms are abdominal cramps and watery diarrhea. In immunocompromised hosts, Cryptosporidium can cause severe and life-threatening diarrhea (5). The incubation period for cryptosporidiosis ranges from 2 to 12 days and averages about 7 days (3). There is no effective treatment for the disease, and thus prevention remains the sole public health strategy.

The incidence of cryptosporidiosis has been shown to peak between summer and autumn. However, the patterns reported have been variable, with higher prevalence in hot months, cool months, or rainy seasons (6). The reasons for such seasonal heterogeneity are poorly understood (7). Furthermore, few studies have examined the relationship between weather variables and cryptosporidiosis in literature, although there appears to be seasonal patterns of cryptosporidiosis.

Time series analyses have been increasingly used in epidemiological research.8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 Both generalized linear models (GLMs), in particular Poisson regression, and seasonal auto-regression integrated moving average (SARIMA) models have been widely used in the analyses of time series data 22, 23, 24, 25. Recently, more flexible semiparametric extensions in the form of generalized additive models (GAMs) have also been used for epidemiological time series (15). SARIMA models have been traditionally used for forecasting in economics and have become well established in the commercial and industrial fields 11, 12. We are compelled to use Poisson regression or more traditional SARIMA models because there is a paucity of models for time-series count data and the lack of available software applications that allow the implementation of previously developed theory. These models allow more explicit description of seasonal and other temporal changes in complex data sets. However, both Poisson regression and SARIMA models have not been systematically compared by analyzing the same set of infectious disease data.

The aims of this paper are three-fold. First, it examines the potential impact of weather variability on the transmission of cryptosporidiosis. Second, it explores the difference in the predictive ability between Poisson regression and SARIMA models. Finally, the possibility of developing an empirical forecast system was investigated using time-series models for cryptosporidiosis in Brisbane, Australia.

Section snippets

Study Area

Brisbane, the capital of Queensland State, is a subtropical city situated on the eastern coast of Australia and covers approximately 1326 km2 (Fig. 1). Within the administrative boundaries of Brisbane City Council, which also determined the study area of this investigation, the population size was 883,449 on June 30, 2001 (26). In this study, Brisbane was chosen as the main research site because it has the highest population density with most cases (23.4%) of cryptosporidiosis during the period

Descriptive Analysis

Table 1 shows the summary statistics for each variable. The monthly mean incidence rate of cryptosporidiosis, maximum temperature, rainfall, and relative humidity was 1.45/100,000, 26.40°C, 76.82 mm, and 51.87%, respectively, between 1996 and 2004 in Brisbane. Fig. 2, A shows that there was a striking variation in the monthly incidence rate of cryptosporidiosis in Brisbane. The number of cases was the highest in February 2002 (178 cases; incidence rate: 20.12/100,000). The large peaks of

Discussion

The results of this study suggest that weather variability (particularly maximum temperature) may have played a significant role in the transmission of cryptosporidiosis in Brisbane either directly or through other unmeasured variables. The key determinants of the cryptosporidiosis transmission observed in this analysis included maximum temperature at lags of 1 to 3 months and relative humidity at a lag of 1 month. We obtained similar results from two different time series models: SARIMA and

References (50)

  • R. Catalano et al.

    Time series designs of potential interest to epidemiologists

    Am J Epidemiol

    (1987)
  • U. Helfenstein

    Box-Jenkins modelling of some viral infectious diseases

    Stat Med

    (1986)
  • U. Helfenstein

    The use of transfer function models, intervention analysis and related time series methods in epidemiology

    Int J Epidemiol

    (1991)
  • U. Helfenstein

    Box-Jenkins modelling in medical research

    Stat Methods Med Res

    (1996)
  • L. Clancy et al.

    Effect of air-pollution control on death rates in Dublin, Ireland: an intervention study

    Lancet

    (2002)
  • F. Dominici et al.

    On the use of generalized additive models in time-series studies of air pollution and health

    Am J Epidemiol

    (2002)
  • S. Hajat et al.

    Associations of cold temperatures with GP consultations for respiratory and cardiovascular disease amongst the elderly in London

    Int J Epidemiol

    (2002)
  • C. Pope et al.

    Lung cancer, cardiopulmonary mortality, and long-term exposure to fine particulate air pollution

    JAMA

    (2002)
  • J. Borghi et al.

    Is hygiene promotion cost-effective? A case study in Burkina Faso

    Trop Med Int Health

    (2002)
  • T. Abeku et al.

    Forecasting malaria incidence from historical morbidity patterns in epidemic-prone areas of Ethiopia: a simple seasonal adjustment method performs best

    Trop Med Int Health

    (2002)
  • R. Allard

    Use of time-series analysis in infectious disease surveillance

    Bull World Health Organ

    (1998)
  • F. Nobre et al.

    Dynamic linear model and SARIMA: a comparison of their forecasting performance in epidemiology

    Stat Med

    (2001)
  • A. Tobias et al.

    Use of poisson regression and box-jenkins models to evaluate the short-term effects of environmental noise levels on daily emergency admissions in Madrid, Spain

    Eur J Epidemiol

    (2001)
  • S. Zeger

    A regression model for time series of counts

    Biometrika

    (1988)
  • M. Campbell

    Time series regression for counts: an investigation into the relationship between Sudden Infant Death Syndrome and environmental temperature

    J Royal Stat Soc A

    (1994)
  • Cited by (72)

    • Emerging microbial contaminants in the ocean

      2023, Oceans and Human Health: Opportunities and Impacts
    • Bayesian spatio-temporal modelling to assess the role of extreme weather, land use change and socio-economic trends on cryptosporidiosis in Australia, 2001–2018

      2021, Science of the Total Environment
      Citation Excerpt :

      While our study captures very wet and dry years at a postal area scale, it is also plausible that spikes in reported cryptosporidiosis are driven by localized periods of high intensity rainfall following long dry periods (Kent et al., 2015; King and Monis, 2007; Hu et al., 2007; Hu et al., 2010a) and our analysis looking annual rainfall anomalies was not able to capture this time-lagged temporal structure. Studies in Queensland indicate that monthly rainfall variability was not associated with incidence (Hu et al., 2007), however, a weekly analysis found a 5% (95% CI 8% to 2%) decrease in weekly cryptosporidiosis rates with every 1 mm increase in weekly rainfall at a moving average of lag 0–1 week (Hu et al., 2010a). Future work in Australia investigating the influence of rainfall at a finer temporal resolution at a postal area level would help clarify how short-term rainfall patterns affect dynamics of infectious disease incidence.

    • Understanding the complex seasonality of seasonal influenza A and B virus transmission: Evidence from six years of surveillance data in Shanghai, China

      2019, International Journal of Infectious Diseases
      Citation Excerpt :

      Decomposition procedures are able to describe the trend and seasonal factors in a time-series. In the present study, the systematic seasonal variations in influenza positivity rate were determined using this analysis, and the trend in influenza positivity rate during the overall study period was identified by removing any systematic seasonal variations (Hu et al., 2007; Zhang et al., 2017). Data were analyzed using R 3.5.1 (R Core Team, R: A language and environment for statistical computing.

    • Weather and gastrointestinal disease in Spain: A retrospective time series regression study

      2018, Environment International
      Citation Excerpt :

      Nichols et al., in a study in England and Wales, found that both low rainfall in the three previous weeks (<20 mm per week) or heavy rainfall in the previous week of the outbreak (>40 mm in a week) were significantly associated with the risk of drinking water-related outbreaks (Nichols et al., 2009). In addition, gastroenteritis cases (Hu et al., 2007; Iñiguez et al., 2016; Singh et al., 2001) and outbreaks (Thomas et al., 2006) have been related with high temperatures in several countries, although sometimes cryptosporidiosis (Britton et al., 2010) and Norovirus infections (Lopman et al., 2009) have been associated with cold temperatures. In Valencia (Spain), Iñiguez et al. found increased risk of hospitalizations due to gastrointestinal diseases in days of extreme heat compared to normal days only for short lags (0–1 days) (Iñiguez et al., 2016).

    View all citing articles on Scopus
    View full text