Background
Shigellosis, caused by
Shigella spp which are Gram-negative bacteria, is a fecal-oral transmission disease with a symptom of severe colitis and induced by a very low infectious dose [
1,
2]. In developing countries,
Shigella spp are one of the top four attributable causes of moderate–severe diarrhea in children younger than 5 years. Shigellosis is an illness that kills roughly 750,000 people per year [
3‐
5]. Although the incidence of shigellosis has declined remarkably due to the rapid improvements in water supply and sanitation in China [
6], a considerable disease burden still exists and is unevenly distributed across the country [
7‐
9]. Because of the different transmission conditions resulted from social-economic factors such as water supply and hygiene practices in different areas, the transmission of shigellosis is heterogeneous and is still a major public health problem in many areas in China [
7,
10,
11]. The transmission interruption has a public importance in China as well as many developing countries. Generally, whether an infectious disease spreads or not depends on the size of transmissibility or “transmission force” behind the transmission. However, to our knowledge, there is no study investigating the transmissibility of shigellosis. Therefore, an understanding the transmissibility of shigellosis is of great significance to ensure a better disease prevention and control.
The transmissibility of an infectious disease is commonly quantified by basic reproduction number (
R0), which is defined as the expected number of secondary infections that result from introducing a single infected individual into an otherwise susceptible population [
12‐
15]. However, because of the disease intervention or the decreasing proportion of susceptible individuals due to the spread of the pathogen, it is hard to estimate
R0. Therefore, effective reproduction number (
Reff) is commonly employed instead, with the definition of the number of new cases which was produced by a typical case during the period of infection [
16,
17]. From the definition, it is clear that when
Reff > 1, the disease is able to spread in the population. If
Reff < 1, the infection will be cleared from the population. Therefore,
Reff = 1 is considered as the “epidemic threshold”.
Although statistical model, like Bayesian model, was employed to explore the epidemiological characteristics [
18], there is few research to simulate and forecast the population-based transmission [
19]. To explore the transmissibility, we have performed a study to estimate the relative transmissibility of shigellosis among male and female individuals in Hubei Province, China [
20]. And we found that the transmissibility of shigellosis was different in females was higher than that of males [
20]. However, we still do not know the values of
Reff and their trends. In this study, we used the dataset of reported shigellosis cases collected in Hubei Province, China, developed a mathematical model to fit the data, estimated the values and trends of
Reff, and forecasted the transmission interruption year of the disease.
Methods
Data collection
Hubei Province, located in the center of China and north of the Dongting Lake, has a population of more than 58 million. Hubei is known as the “province of thousands of lakes”. Except the main stream of the Yangtze River and Han River, there are 4228 rivers and 755 lakes in the province. The large water areas provide an easy spread of Shigella spp. Therefore, it is of great value to study shigellosis transmissibility in Hubei Province, China.
In this study, we used the dataset of reported shigellosis cases that was built from January 2005 to December 2017 in the province. The sex, age, illness onset date, and diagnosis date of each case was included in the data. Incidences of 12 urban areas (including 7 central districts in Wuhan City and 5 districts in Yichang City) and 14 rural areas (including 5 outer suburbs districts in Wuhan City, and 8 counties or county-level cities in Yichang City) from 2005 to 2017 were collected to compare the difference between urban and rural areas. All the cases were collected from the China Information System for Disease Control and Prevention. All the cases were identified following the case definitions from “Diagnostic criteria for bacterial and amoebic dysentery (WS287-2008)” announced by the National Health Commission of the People’s Republic of China:
A)Suspected cases: Diarrhea, with purulent or mucous stool or watery stool or sparse stool, accompanied by severe symptoms after internal emergencies, has not yet been identified form other causes of diarrhea.
B)Clinically diagnosed cases: A patient who has the following 4 items: 1) unclean diet and/or contact history with dysentery patients; 2) sudden onset, chills and high fever, followed by abdominal pain, diarrhea and internal urgency, severe defecation 10 to 20 times a day, but not much, purulent stool, and moderate systemic poisoning symptoms; or severe symptoms like convulsions, headache, systemic muscle ache, dehydration and electrolyte disorders, may have left lower abdominal tenderness with bowel ringing hyperactivity; 3) routine stool examination showed that white blood cells or pus cells > = 15/HPF (400 folds), erythrocyte and phagocyte could be seen; 4) excluding diarrhea caused by other causes.
C)Confirmed cases: Shigella spp was tested positive by fecal culture from clinical diagnosis of cases.
In this study, we only included clinically diagnosed cases and confirmed cases for the analysis.
The transmission models
In this study, two models were developed according to the transmission routes of the disease. According to our previous research [
14,
19], we firstly built a susceptible-exposed-symptomatic-asymptomatic-recovered-water/food (SEIARW) model (denoted as Model 1) in which two routes (person–to–person and reservoir–to–person) were considered. For the route of person–to–person, two routes (symptomatic–to–susceptible and asymptomatic–to–susceptible) were both considered. In the model, people were divided into susceptible (
S), exposed (
E), symptomatic (
I), asymptomatic (
A), and recovered (
R) individuals. The reservoir, including water or food, was denoted as
W. In order to normalize the dimensions of people and
Shigella spp population, we set
s =
S/
N,
e =
E/
N,
i =
I/
N,
a =
I/
A,
r =
R/
N,
w =
εW/
μN,
b =
βN, and
bw =
μβWN/
ε.
In the model,
N is assumed to denote the total population. The parameter
β is the transmission relative rate from person to person,
βW is the transmission relative rate from reservoir to person,
k is the relative transmissibility of asymptomatic to symptomatic individuals,
ω is the incubation relative rate,
p is the proportion of asymptomatic individuals,
γ is the infectious period relative rate of symptomatic individuals,
γ’ is the infectious period relative rate of asymptomatic individuals,
ε is the relative rate of the pathogen’s lifetime,
c is the shedding rate of the asymptomatic comparing to the infectious, and
μ is the pathogen shedding coefficient of infectious individuals. The equations of the model are as follows:
$$ \frac{ds}{dt}=- bs\left(i+ ka\right)-{b}_w sw $$
$$ \frac{de}{dt}= bs\left(i+ ka\right)+{b}_w sw-\omega e $$
$$ \frac{di}{dt}=\left(1-p\right) we-\gamma i $$
$$ \frac{da}{dt}= pwe-{\gamma}^{\prime }a $$
$$ \frac{dr}{dt}=\gamma i+{\gamma}^{\prime }a $$
$$ \frac{dw}{dt}=\varepsilon \left(i+ ca-w\right) $$
We also developed a susceptible-exposed-symptomatic-asymptomatic-recovered (SEIAR) model (denoted as Model 2) which only includes the transmission route of person–to–person [
21‐
23]. The equations of the model are as follows:
$$ \frac{ds}{dt}=- bs\left(i+ ka\right) $$
$$ \frac{de}{dt}= bs\left(i+\mathrm{k}a\right)-\omega e $$
$$ \frac{di}{dt}=\left(1-p\right) we-\gamma i $$
$$ \frac{da}{dt}= pwe-{\gamma}^{\prime }a $$
$$ \frac{dr}{dt}=\gamma i+{\gamma}^{\prime }a $$
In the model, the effective reproduction number (
Reff) was calculated by the equation as follows:
$$ {R}_{eff}= bs\left(\frac{1-p}{\gamma }+\frac{kp}{\gamma^{\prime }}\right) $$
Parameter estimation
There are eight parameters (
b,
bw,
k,
ω,
p,
γ,
γ’,
c, and
ε) in the above models. According to our previous research [
19],
k,
ω,
p,
γ,
γ’,
c, and
ε are disease-specific parameters which could be estimated from literatures. The incubation period of shigellosis is 1–4 days [
24,
25], and most commonly 1 day, therefore,
ω = 1.0. The proportion of asymptomatic infection ranges from 0.0037 to 0.27 [
26‐
28], and it can be set
p = 0.1. The infectious period of symptomatic infection is 13.5 days [
19], therefore,
γ = 0.0741. According to our previous research [
19], the infectious period of asymptomatic infection could be simulated 5 weeks in our model, thus
γ’ = 0.0286. Because of that the die-off rate of
Shigella spp is about 24.5 h [
29],
ε = 0.6931 was consequently simulated [
19]. In our previous research [
19], a typical case has diarrhea about 3.2 times (range 3–12 times) per day but an asymptomatic individual only sheds stool once per day, thus
c = 0.3125. Due to reduction of shedding frequency, the relative transmissibility of asymptomatic individual (
k) was modeled to be a reduced quantity as 0.3125 [
19].
However, b and bw are scenario- or area-specific parameters which depend on different outbreaks and periods. Therefore, these two parameters are confirmed by curve fitting by Models 1 and 2 to the collected data. To simulate the contribution of b and bw during the transmission, we performed a “knock-out” simulation which was based on the following four scenarios: A) b = 0 and bw = 0; B) b = 0; C) bw = 0; D) control (no intervention).
Simulation method and statistical analysis
Berkeley Madonna 8.3.18 (developed by Robert Macey and George Oster of the University of California at Berkeley. Copyright©1993–2001 Robert I. Macey & George F. Oster) was employed for model simulation and least root mean square was adopted to assess the goodness of fit. The simulation methods were the same as the previously publications [
12,
14,
19,
30,
31]. The chi-square and trend chi-square tests and
t test were performed by SPSS 13.0 (IBM Corp., Armonk, NY, USA).
Eleven equations (Linear, Logarithmic, Inverse, Quadratic, Cubic, Compound, Power, S, Growth, Exponential, Logistic) were employed to fit the data of
Reff which was calculated by the reported data. The equations of the 11 models were shown as follows:
$$ \mathrm{Linear}:f(x)={b}_0+{b}_1x $$
$$ \mathrm{Logarithmic}:f(x)={b}_0+{b}_1\ln (x) $$
$$ \mathrm{Inverse}:f(x)={b}_0+\frac{b_1}{x} $$
$$ \mathrm{Quadratic}:f(x)={b}_0+{b}_1x+{b}_2{x}^2 $$
$$ \mathrm{Cubic}:f(x)={b}_0+{b}_1x+{b}_2{x}^2+{b}_3{x}^3 $$
$$ \mathrm{Compound}:f(x)={b}_0+{b}_1^x $$
$$ \mathrm{Power}:f(x)={b}_0+{x}^{b_1} $$
$$ \mathrm{S}:f(x)={e}^{\left({b}_0+\frac{b_1}{x}\right)} $$
$$ \mathrm{Growth}:f(x)={e}^{\left({b}_0+{b}_1x\right)} $$
$$ \mathrm{Exponential}:f(x)={b}_0{e}^{b_1x} $$
$$ \mathrm{Logistic}:f(x)=\frac{1}{\frac{1}{u}+{b}_0+{b}_1^x} $$
In the equations, x and f(x) refer to time (year) and dependent variables (Reff), respectively; b0, b1, b2, b3, and u refer to the coefficients of the models which were estimated by curve fitting with the data. Determination coefficient (R2) was employed to evaluate the curve fitting.
Discussion
To our knowledge, this is the first study to estimate the long-term transmissibility and forecast the Reff trend of Shigellosis in China. Located in the lake region, Hubei Province has a high burden of shigellosis and provides a large data for the modelling study. Our model developed in Hubei Province could also be used to study the transmission of the shigellosis in other places. Therefore, our study has an important significance to provide a reference for deeply understanding the transmission characteristics of the disease.
Validity of the model
In our study, the SEIARW model was employed to fit the epidemic curves of the reported data, the results of Chi-square test showed a high good-of-fitness of our model to the reported data, suggesting that the model is suitable for this study and can be used to estimate the transmissibility of the disease.
The epidemiological characteristics of the disease
In this study, we found that the incidence of the disease was still in a high level although with a decreasing trend. This finding is similar to the epidemiological characteristic in many areas in China [
9‐
11,
18].
Our findings of sex and age distribution showed that: 1) male individuals were more likely to be infected by
Shigella spp than females; 2) individuals who were infants or old were more likely to be infected by the pathogen. These findings are similar to the epidemiological characteristics of shigellosis [
24,
32,
33]. The intervention focusing on male, children, newborns and elders might be more effective to prevent and control the disease. It is also important to monitor the disease among those populations.
Our findings of area distribution showed that the incidence of the disease had a high heterogeneity in different sub-regions of Hubei Province. Shigellosis had a high transmission in large cities such as Wuhan City (the capital city of the province) and Yichang City (which has a population more than 400 million) or cities near Wuhan City such as Xiantao City. We also found that the incidence of shigellosis in urban areas was higher than that in rural areas. It might due to the higher population density in urban areas. However, the transmissibility remains unclear in urban and rural areas and further research is needed to explore the force of the transmission in different areas.
About a half of the cases were diagnosed in 1.7 days from their illness onset date. However, this also meant that about 50% of the cases were diagnosed after 1.7 days from their illness onset date. There is a room to improve the surveillance system’s ability of diagnosing the disease immediately and to shorten the DID of Shigellosis cases.
The transmissibility of shigellosis
The “knock-out” simulation showed that parameter
bw had almost no contribution during the transmission which meant that the transmission route of reservoir to person was interrupted in the area. This is mainly attributed to the government’s progress of improving sanitation drinking water and lavatories, food safety, and people’s health behaviors like drinking boiled water. Therefore, the remained major transmission route is person to person which will be the main public health focus to control the disease. This result also provided a mathematical evidence of the epidemiological characteristics of shigellosis that the pathogen was transmitted primarily through person-to-person [
24]. Therefore, a combined strategy of case management would be strongly recommended including case isolation and treatment, environment disinfection, surveillance system improvement, and individuals’ health behaviors improvement like hand hygiene.
Our results showed that the mean value of
Reff of Shigellosis was 1.19, meaning that the expected number of secondary infections that resulted from introducing a single infected individual into an otherwise susceptible population was only 1.19 on average. This revealed that the transmissibility of shigellosis is lower than many other infectious diseases such as influenza, Ebola virus disease, and Norovirus infection [
12,
14,
15,
34].
In addition, the transmissibility of Shigellosis decreased slightly from 2005 to 2017. Although the Linear, Quadratic, and Cubic models had the best goodness of fit, the 95%CIs of Quadratic model and Cubic model were not stable after the year 2019 (Fig.
7). Therefore, the Linear model was recommended to predict the trends of the transmissibility. The Linear model predicted that the
Reff would down to an epidemic threshold (
Reff = 1.00) in 2029. This was an interesting result which meant that the disease would probably be eliminated after the year of 2029. But this result was based on the assumption that the transmission condition would remain stable in the future years. Therefore, this trend might be different in other provinces. Hubei Province is located in a lake region with many cities suffered from floods frequently. The transmissibility of Shigellosis would be increased after the flood and provides a potential transmission route of water to person. On the other hand, the year of the transmission interrupted might be ahead of schedule since the disease control strategy would be strengthened in the future years, especially with the progress of “Health China” action launched by the Chinese government. More researches are needed to simulate the trend of transmissibility of the shigellosis with the interventions considered.
Limitations
Limited by the availability of data, there are several limitations in this study.
Firstly, some environment factors (temperature, humidity, and rainfall) which might affect the spread of the disease were not considered in the model.
Secondly, due to the transmission heterogeneity, the features and trends of the transmissibility of the disease might be different in different areas in Hubei Province.
Thirdly, our study only focused in Hubei Province, the characteristics and the trends of transmissibility of shigellosis might be different in other provinces. The applicability of the mathematical models remains unclear to different datasets of reported shigellosis. However, Hubei Province has a high burden of shigellosis and provides us a large data for the modelling. The ordinary differential equation model has a strong applicability. It could be used in different populations including school or community with small-scale outbreaks [
14,
19], and in different diseases such as shigellosis, norovirus infection, Ebola virus disease, and influenza [
12,
14,
19,
31].
In addition, we did not simulate the sensitivity and specificity improvement of the surveillance system by the model. More researches are needed to perform the simulation of shorten DID in different scenarios.
Conclusions
The disease burden of Shigellosis is still in a high level in Hubei Province, China. Individuals who were children, new-born, elders, and living in urban areas were more likely to be infected by Shigella spp. Most of the cases were diagnosed in 3 days from their illness onset date. The transmission route of reservoir to person had no contribution to the disease transmission in this area. The transmission of the disease would probably be interrupted in the year of 2029.
Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit
http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (
http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.