Background
Although the incidence of malaria infections and malaria-associated mortality has reduced in many African countries [
1‐
3], transmission continues in endemic regions despite intensified efforts towards prevention, control and eradication [
4,
5]. This is due, in part, to the high genetic diversity of
Plasmodium falciparum that contributes to increased transmission rate and spread of resistant parasites [
6]. Therefore, understanding the extent of genetic diversity, transmission intensity, and parasite population structure in Nigeria—the most malaria burdened country—is essential if the goal of malaria control or elimination is to be achieved.
Molecular techniques play important roles in the analyses of genetic diversity, transmission dynamics, and population structure of
P. falciparum field isolates. Early molecular studies focused mostly on the use of polymorphic markers, such as merozoite surface protein 1 (
msp-1), merozoite surface protein 2 (
msp-2) and glutamate-rich protein (
glurp) to characterize
P. falciparum genetic diversity and structure in Nigeria [
7‐
9]. These markers were also useful in monitoring drug efficacy with regards to classification of recurrent
P. falciparum parasitaemia as re-infection or recrudescent infection [
6,
10,
11]. However, there have been contrasting reports of polymorphisms in MSP-1 and MSP-2 in earlier studies in Nigeria [
6,
12‐
14], which is associated with the fact that these antigenic markers are often under intense immune pressure [
15‐
17]. The genotyping results provided by these markers can, therefore, potentially lead to a masked and distorted view of the population structure and transmission patterns which may account for observed variations across parasite populations circulating in a given environment [
6].
Microsatellite loci have been suggested to be better alternatives to
msp-1, msp-2 and
glurp due to their abundance, putative neutrality and higher levels of polymorphisms [
18]. This molecular technique remains one of the most efficient and reliable methods for analyzing the genetic diversity of falciparum populations for epidemiological and drug efficacy purposes within countries and across continents [
19]. In past studies of using microsatellite analyses, it was observed that parasites from areas of low malaria transmission [
19] (< 1% infection) have less genetic diversity but more population structure and greater linkage disequilibrium (i.e., more non-random association among alleles across multiple loci) [
4,
19‐
21]. Contrary, in regions of high malaria transmission, individuals are more likely to be infected by more than one
P.
falciparum parasite thereby resulting in an increase in the rate of recombination and subsequently, highly diverse population with low linkage disequilibrium [
18,
19,
22]. Although, some studies report a deviation from the norm whereby high levels of heterozygosity (a measure of genetic diversity) is observed in several low transmission countries [
18,
23,
24]. This suggests that a high level of heterozygosity may reflect past human demographic processes as opposed to recent epidemiological factors [
25].
The objective of this study was to investigate the genetic diversity of circulating P. falciparum parasites and their population structures in Nigerian children 6–96 months old with uncomplicated infections, treated with artemisinin-based combination therapy (ACT).
Methods
Study site
This is a retrospective, cross-sectional, community-based study that is part of a larger drug therapeutic efficacy testing (DTET) study for monitoring anti-malarial efficacy of artesunate-amodiaquine (AA), artemether-lumefantrine (AL) and dihydroartemisinin-piperaquine (DHP) in the treatment of uncomplicated P. falciparum infections in children aged 6–96 months old. A cohort of children were enrolled from nine sentinel sites of the National Malaria Elimination Program of the Federal Ministry of Health (located in six geographical zones of Nigeria), namely; Numan (n = 48), Bodinga (n = 50), Kura (n = 100), Barkin Ladi (n = 100), Ilorin (n = 58), Ibadan (n = 50), Otuasegha (n = 45), Agbani (n = 100) and Ogwa (n = 82) in Adamawa, Sokoto, Kano, Plateau, Kwara, Oyo, Bayelsa, Enugu and Imo States, respectively.
Study population
Children aged 6–96 months old were eligible for enrollment in the efficacy study if they had symptoms compatible with uncomplicated malaria such as fever, anorexia, vomiting or abdominal discomfort with or without diarrhea with P. falciparum infections.
Sample collection
Filter papers containing dried blood spots (DBS) obtained from 633 children confirmed as malaria positive by microscopy were randomly selected for this study. All DBS samples were collected in 2014 (Adamawa, Bayelsa, Imo, Kwara, Oyo and Sokoto) and 2018 (Enugu, Kano and Plateau). Samples were collected for a duration of three months (July–September) which represents intense malaria transmission season in Nigeria. Two to three drops of finger-pricked blood samples were blotted on 3 mm Whatman filter paper (Whatman International Limited, Maidstone, UK) before treatment initiation (Day 0). The blood samples impregnated on to filter papers were allowed to air-dry properly at room temperature, and DBS were kept in airtight envelopes with silica gel at room temperature until analysed.
DNA was extracted from DBS for parasite genetic diversity and population structure studies as previously described [
26]. DNeasy Blood and Tissue extraction kit (Qiagen, Germany) was used to extract parasite DNA from DBS following the manufacturer's protocol.
Plasmodium falciparum genotyping by microsatellite loci analysis
Semi-nested PCR amplification of 12
P. falciparum microsatellite loci was done using a previously described protocol [
17]. The 12 microsatellite loci were Poly A, PfG377, TA81, ARA2, TA87, TA40, TA42, 2490, TA1, TA60, TA109 and PfPk2 [
27]. FAM, YAK YELLOW, and ATTO550N-labeled PCR products for the different loci amplified were pooled together for electrophoresis on the ABI 3500XL Genetic Analyzer at the African Centre of Excellence for the Genomics of Infectious Diseases (ACEGID), Redeemer’s University Ede, Osun State, Nigeria. Peakscanner (Applied Biosystems) and GeneMarker (Softgenetics) software were used for normalization across runs and automatic determination of allele length and peak heights in samples containing multiple alleles per locus.
Data analysis
Microsatellite data was retrieved from the Genetic Analyzer 3500XL and formatted in Microsoft excel (version 16.44) as previously described [
28]. Subsequent genetic analysis was only done on samples where all microsatellite markers were successfully amplified. Multiple alleles at a given locus was assumed if minor peaks observed were more than 20% the height of the predominant peak [
6]. Although minor alleles were scored in samples containing them, only the predominant alleles were considered for all population genetic and structure analysis. Multi-clone infections were defined as those that had at least two loci containing multiple alleles (only samples with two alleles were included for analysis), while single clone infections were defined as those containing one allele for all microsatellite loci or when one locus contained multiple alleles [
6]. Haplotypes were computed using ARLEQUIN software version 3.11 [
29] from both single-clone and multi-clone infections. The predominant allele at each locus was used to define twelve-locus parasite haplotype in multiple-clone infections [
30].
Measures of parasite genetic diversity
Number of effective alleles (Ne) and number of different alleles (Na)
The number of effective alleles (Ne) and number of different alleles (Na) were computed per locus for each State involved in the study using GENALEX 6.5 [
28].
Allelic richness
Allelic richness (Ar) was computed using FSTAT (v 3.1) as the average number of alleles per locus [
31].
Expected heterozygosity
The expected heterozygosity (He), a measure of parasite genetic diversity, represents the probability of being infected by two parasites with different alleles at a given locus, was calculated using ARLEQUIN software version 3.11 [
29] with the formula:
$${\text{He}}\; = \;[\left( {{\text{n}}/{\text{n}}\, - \,{1}} \right) \, ({1}\, - \,\Sigma {\text{p}}^{{2}} )]$$
(1)
where n is the number of isolates analysed, and p represents the frequency of each different allele at a locus [
29]. The He values range from 0 to 1. Values closer to 0 indicate little or no genetic diversity while values closer to 1 indicate high genetic diversity.
Measures of parasite population differentiation
Analysis of molecular variance (AMOVA)
Inter- and intra-population variance was determined with analysis of molecular variance (AMOVA, i.e., ΦPT). ΦPT value of zero (0) is considered indicative of no genetic differentiation among populations.
Fixation index (Fst)
The population divergence was measured by calculating the fixation index (Fst) for all pairs of parasite population in each State using the GENALEX 6.5 software. An Fst value between 0–0.05 was classified as little genetic differentiation, 0.05–0.15: moderate genetic differentiation, 0.15–0.25: great genetic differentiation and values greater than 0.25 represented very great genetic differentiation [
32].
Cluster analysis
A Bayesian model implemented in the program STRUCTURE v2.3 [
33] was used to determine the number of populations or genetic clusters present in Nigerian States considered in this study. A linked model with admixture was used with 5 replicates for each value of k (from 2 to 6), and a burn-in period of 50,000 iterations of Monte Carlo Markov chains [
34]. To obtain the optimal number of genetic populations, estimation of ΔK described by Evanno analysis was done using Structure Harvester [
35].
Linkage disequilibrium (LD)
Multilocus linkage disequilibrium measured as the standardized index of association (I
SA) was calculated using the program LIAN version 3.5 [
36] for the whole dataset and a data- subset with haplotypes from only confirmed single-clone infections, as a precaution against the bias that may result from presence of any false dominant haplotypes [
37]. This index was calculated as:
$$\left( {I^{S}_{A} } \right) \, = \, ({1}/{\text{n}}{-}{1 }(\left( {{\text{VD}}/\left( {{\text{VE}}} \right){-}{ 1}} \right)$$
(2)
where VE is the expected variance of the nth number of loci for which two individuals differ. VD is the observed variance. Randomization test was done to determine whether the ratio of VD/VE was significantly higher than 1.
Discussion
Nigeria remains the country with the highest global malaria burden. Hence, molecular studies on
P. falciparum diversity and population structure become essential in monitoring the impact of different intervention strategies in the control of malaria transmission. This study employed the use of 12 microsatellite loci to evaluate
P. falciparum genetic diversity and population structure in nine Nigerian States. Although microsatellite are better alternatives to polymorphic markers, such as
msp-1,
msp-2, and
glurp, there are only a few reports of its use in studies conducted in Nigeria [
6]. Analysis of the microsatellite data generated in this study revealed high parasite genetic diversity across all States. For instance, it was observed that the mean Ne (computed as the average of the sum of Ne values across the 12 microsatellite loci) in the nine States, ranged from 6.2–8.2. This is expected because the number of Ne detected per locus is likely to be high in areas with high malaria endemicity and vice versa [
5,
19].
As such, this study’s Ne values were comparable to those reported in other high-endemic regions of Sub-Saharan Africa [
4,
5,
38]. Similarly, values of allelic richness (Ar) obtained in this study further confirmed the high level of parasite genetic diversity in all nine States (range: 7.09–14.27) and identical to those reported in other malaria endemic countries [
39]. Furthermore, computed expected heterozygosity (He) values observed in all nine States were high ranging from 0.776–0.842. This is similar to those reported in other malaria endemic countries [
5,
6,
18,
40,
41]. This further emphasizes the earlier conclusion from this study, high parasite genetic diversity and parasite transmission within the country [
42]. It is equally important to note that when samples were stratified as Northern States (Adamawa, Kano, Kwara, Plateau and Sokoto) and Southern States (Bayelsa, Enugu, Imo, and Oyo) in a bid to investigate the possible influence of geographic location of observed parasite diversity, there was no significant difference in measures of genetic diversity i.e., Ne, He, and Ar values (P > 0.05). This is equally expected as malaria endemicity continues to be high across the country.
Although parasite genetic diversity was high, further analysis of microsatellite data revealed low parasite population differentiation. It was observed that when all nine States were considered as a single population, the overall association index was 0.0065 (P < 0.01), which is weaker than those typically reported in regions with low transmission [
21,
23]. Studies have associated low LD values such as those reported in this study, to high levels of malaria transmission; which leads to increased cross-breeding and meiotic recombination that results in LD breakdown [
5,
6,
19,
43]. The pairwise genetic differentiation (Fst) among study sites showed low to moderate genetic variation (0.008–0.105; P < 0.001). This is similar to what was reported earlier in Nigeria [
6]. Furthermore, the observed low population differentiation was confirmed by AMOVA (0.039). This implies that only about 3.9% of genetic differentiation exist amongst the nine States investigated. Cluster analysis also showed that only three parasite clusters exist amongst all the nine States. However, the majority of parasite populations from 2014 (Adamawa, Bayelsa, Imo, Sokoto and Kwara) were in the first cluster (blue) with the exception of Oyo (Green). While in 2018, parasites from Enugu were in the third cluster (green), those from Plateau were in the second cluster (red) and those from Kano were distributed between cluster 2 and 3. Although, samples analysed in this work were representative of the country as a whole, a major limitation was that samples from each State were only collected at a single time point (i.e., either 2014 or 2018) thus, a spatio-temporal analysis could not be done. This perhaps would have provided more insights into the variations in clustering patterns observed in this study.
In summary, it has been observed that parasites from areas of low malaria transmission [
19] (< 1% infection) show less genetic diversity, more population structure and greater linkage disequilibrium [
4,
19‐
21]. In this study, the contrary has been observed i.e., high genetic diversity, low population structure and weak linkage disequilibrium. This is typical in regions of high malaria transmission, as individuals are more likely to be infected by more than one
P.
falciparum parasite thereby resulting in an increase in the rate of recombination and subsequently, high diverse population with low linkage disequilibrium [
18,
19,
22]. It is plausible that the low to moderate genetic differentiation between States observed is as a result of immense human migration between these populations as part of the usual socioeconomic activities and indiscriminate vector migration within the country [
6,
37,
44,
45].
Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit
http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (
http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.