Background
Plasmodium malariae is one of six
Plasmodium spp. that cause malaria in humans (
Plasmodium falciparum,
Plasmodium vivax,
P. malariae, Plasmodium ovale curtisi,
P. ovale wallikeri [
1], and
Plasmodium knowlesi [
2]).
Plasmodium malariae exhibits unique characteristics; it is the only
Plasmodium species with a 72-h long erythrocytic stage in humans, and it can maintain low parasitaemia in humans for a decade [
3], while still being infectious to
Anopheles mosquito (vector). Although
P. malariae is widely distributed in malaria endemic regions, fewer molecular studies have been conducted in this species compared with those in
P. falciparum and
P. vivax.
Plasmodium malariae often maintains low parasitaemia and commonly co-infects with the highly prevalent species,
P. falciparum and
P. vivax. Consequently, designing experiments to study the epidemiology of
P. malariae is difficult.
Comparative genomics of
Plasmodium spp., including
P. malariae, has been used to elucidate the evolutionary history of
Plasmodium spp. that infect humans [
4,
5]. Population genetic studies of
P. malariae should be conducted for more understanding in genetic diversity of this parasite. Measurement of gene polymorphism might be helpful for more understanding in biology of
P. malariae. The polymorphic genes such as genes encoding for antigen are usually selected for using as genetic markers. One of the prominent surface antigens that is important for sporozoite function and invasion to hepatocyte is the circumsporozoite protein (CSP). CSP has been used as a marker to measure the population diversity of
P. falciparum [
6],
P. vivax [
7], and
P. knowlesi [
8].
CSP is the major surface protein of
Plasmodium sporozoites. The gene encoding for CSP (
csp) comprises 1 central repeat region and 2 nonrepeat end regions (N- and C-terminals). The N-terminal nonrepeat region contains a conserved region I located before the central repeat region. The C-terminal nonrepeat region contains 3 subregions, namely Th2R, conserved region II, and Th3R. Th2R and Th3R are identified as T-cell epitope regions [
9] and are variable in natural
Plasmodium populations [
10].
csp has been studied in
P. malariae samples collected from African countries [
11]. Heterogeneity of sequences was reported among the isolates from sub-Saharan Africa with polymorphism essentially limited to the central repeat region [
11]. A recent survey of
P. malariae in Kenya also revealed high diversity in
csp sequence [
12]. However, polymorphisms in
csp have not been reported in
P. malariae isolates of Asia. The aim of this study is to analyse polymorphisms in
csp of
P. malariae field isolates collected from Thailand, Myanmar, Lao PDR, and Bangladesh. Understanding the sequence diversity within
csp of
P. malariae would contribute to more understanding in nature of this parasite’s distribution in the regions.
Discussion
In this study, perform genetic characterization of
pmcsp in 89 isolates collected from Thailand, Myanmar, Lao PDR, and Bangladesh.
Plasmodium
malariae isolates collected from African countries show heterogeneity in
pmcsp sequence [
10]. Similar results have been reported for 38
P. malariae isolates collected from Kenya [
12]. A total of 143
pmcsp sequences were analysed, including those obtained from 89 isolates collected in this study along with 38 previously published African isolates. The
pmcsp DNA divergence was higher in Thailand and Myanmar compared with that in Kenya, reflecting the overall DNA divergence in Asia which was also higher than that in Africa. This finding is opposed with the divergence that have been studied in
P. falciparum csp from Thailand and Myanmar compared to Kenya [
20]. The nucleotide diversity in
pfcsp was higher in Kenya than that in Thailand and Myanmar [
20]. This is likely related to lower transmission intensity of
P. falciparum in Thailand and Myanmar than Kenya. High diversity in
pmcsp collected from Thailand and Myanmar might related to wide range of collecting sites in each country with various time point of sample collection (year 2002–2016). However, the overall
pmcsp gene characteristic of the sample collected from Asia and Africa is restricted to their continents as demonstrated by the phylogenetic tree analysis. It is referred to the genetic differentiation of
pmcsp in Asia and Africa.
The analysis focused on 2 main parts of
csp: 1 central repeat region and 4 nonrepeat regions, including conserved regions I and II, Th2R, and Th3R. The most characteristic feature of the central repeat region in
pmcsp is the tetrapeptide repeat unit. Previous studies have identified 2 major types of repeat units: NAAG and NDAG. The NAAG repeat was present in all
P. malariae isolates. However, the NAAG repeat found in Asian isolates was highly polymorphic than previously reported isolates [
10,
12].
Plasmodium malariae isolates from African countries carried high copy numbers of the NAAG repeat (40–79 copies), whereas isolates from Asian countries carried either low (0–38) or high (40–51) copy number of the NAAG repeat. By contrast, the NDAG repeat was highly conserved with low diversity, and its copy numbers varied from 2 to 9. The NDAG repeat is likely to be a universal repeat. The average numbers of the repeat units NAAG and NDAG between the Asian and African samples were used to generate the sampling model. The Markov Chain Monte Carlo (MCMC) technique in WinBUGS [
18] was used to fit the data. The NAAG repeat unit revealed a wide range of repeat numbers in Asian countries compared to that of African countries. Additionally, a novel tetrapeptide repeat unit NAPG was identified in this study. More than half of the Asian isolates carried the NAPG repeat. The NAPG repeat is likely to be geographical restricted to Asian samples. To validate this hypothesis, more samples from other Asian countries should be studied. Similar repeat units have been identified in
P. falciparum and
P. vivax. By contrast, more than 46 repeat units have been reported in
P. knowlesi [
21]. The NANP repeat in the central repeat region of
pfcsp has been shown to represent an important target of antibodies isolated from individuals with naturally acquired immunity to malaria [
22]. The central repeat region of
csp is the immunodominant region and each
Plasmodium species carries a unique pattern of tetrapeptide repeats in this region. The type and copy number of tetrapeptide units may be dependent on the immune pressure in different malaria endemic regions.
Analysis of the nonrepeat regions, Th2R and Th3R, which serve as T cell epitopes revealed 6 and 9 amino acid haplotypes among 143
P. malariae isolates, respectively. Only one major haplotype was identified in both Th2R and Th3R, accounting for 91.61 and 74.83% of isolates, respectively. Amino acid haplotypes in Th2R and Th3R in
P. malariae isolates were unrestricted to geographical location and showed low diversity compared with Th2R and Th3R haplotypes of
P. falciparum [
23] and
P. knowlesi [
8]. These data suggest that the central repeat region of
csp may be responsible for the immune response in which the diversity might result in different levels of host immune system between Asia and Africa. The nonrepeat region from both N- and C-terminal parts of
pmcsp were used to estimate genetic differentiation between populations. Pairwise comparison of
P. malariae from Asia and Africa revealed high level of differentiation between the two continents, which is in concordance to previous studies on genetic differentiation in
P. falciparum and
P. vivax from Asia compared to other continents [
10,
20,
24]. To gain a better understanding of the natural distribution of
pmcsp polymorphisms, more number of samples from other malaria endemic regions should be investigated. Analysis of the genetic diversity of
pmcsp will be valuable for understanding the population structure of
P. malariae, which will further help in the development of strategies to eliminate malaria.
Authors' contributions
NS, AD and MI contributed to study design. MM, PNN, FS, FN, and SP collected samples. NS undertook laboratory work. NS, NJW, ND, AD and MI analyzed data. NS drafted the manuscript. All authors read and approved the final manuscript.