Substitution rate and natural selection in parvovirus B19

Stamenković, Gorana G.; Ćirković, Valentina S.; Šiljić, Marina M.; Blagojević, Jelena V.; Knežević, Aleksandra M.; Joksić, Ivana D.; Stanojević, Maja P.

doi:10.1038/srep35759

Download PDF

Article
Open access
Published: 24 October 2016

Substitution rate and natural selection in parvovirus B19

Gorana G. Stamenković¹,
Valentina S. Ćirković²,
Marina M. Šiljić²,
Jelena V. Blagojević¹,
Aleksandra M. Knežević²,
Ivana D. Joksić³ &
…
Maja P. Stanojević²

Scientific Reports volume 6, Article number: 35759 (2016) Cite this article

1422 Accesses
12 Citations
3 Altmetric
Metrics details

Subjects

Abstract

The aim of this study was to estimate substitution rate and imprints of natural selection on parvovirus B19 genotype 1. Studied datasets included 137 near complete coding B19 genomes (positions 665 to 4851) for phylogenetic and substitution rate analysis and 146 and 214 partial genomes for selection analyses in open reading frames ORF1 and ORF2, respectively, collected 1973–2012 and including 9 newly sequenced isolates from Serbia. Phylogenetic clustering assigned majority of studied isolates to G1A. Nucleotide substitution rate for total coding DNA was 1.03 (0.6–1.27) x 10⁻⁴ substitutions/site/year, with higher values for analyzed genome partitions. In spite of the highest evolutionary rate, VP2 codons were found to be under purifying selection with rare episodic positive selection, whereas codons under diversifying selection were found in the unique part of VP1, known to contain B19 immune epitopes important in persistent infection. Analyses of overlapping gene regions identified nucleotide positions under opposite selective pressure in different ORFs, suggesting complex evolutionary mechanisms of nucleotide changes in B19 viral genomes.

Mutational spectrum of SARS-CoV-2 during the global pandemic

Article Open access 27 August 2021

A global analysis of conservative and non-conservative mutations in SARS-CoV-2 detected in the first year of the COVID-19 world-wide diffusion

Article Open access 30 December 2021

Phylogenetic molecular evolution and recombination analysis of complete genome of human parechovirus in Thailand

Article Open access 21 April 2021

Introduction

Human parvovirus B19 (B19) is a widespread member of the family Parvoviridae that causes a variety of clinical manifestations, from asymptomatic to persistent infection associated with different autoimmune diseases^1,2. As all parvoviruses, B19 depends on the S phase of the host cell for replication, resulting in its wider tropism for fetal tissues and much narrower tropism range for adult cells².

B19 virions are nonenveloped icosahedral particles with a linear single-stranded DNA genome of approximately 5600 bp. At both ends of the B19 genome, there are identical inverted terminal repeats of 383 nt in length. Coding sequence of the B19 genome (≈4.8 kb) is divided in two main open reading frames (ORFs), one encoding the nonstructural protein (NS1) and the other encoding both major VP2 and minor VP1 structural proteins^1,3. The only difference between VP1 and VP2 is in the N terminal “unique region” (uVP1) composed of 227 amino acids. VP2 builds 95% of the capsid containing self-assembly domains that lead to formation of highly stable particles. The role of VP1 is not essential for capsid formation, but its uVP1 region is critical for virus entry via phospholipase A2 (vPLA2) domain⁴. NS1 is the main non-structural multi-functional protein, with the central role in controlling viral DNA replication and transcription^3,5,6. In addition, NS1 induces cell cycle arrest, apoptosis and modulation of host innate immunity^7,8,9.

B19 infection induces long-lasting antibody and cellular responses¹⁰. Viremic phase onsets in the first week of infection and reaches extremely high viral concentrations of 10¹⁰ to 10¹³per mL of plasma/serum^3,11. Viremia declines with appearance of IgM antibodies against linear and conformational epitopes of viral capsid proteins VP1 and VP2, with the peak levels during the third weeks after infection. Majority of studies found that, irrespective of the underlying disease, NS1-specific IgG antibodies appear late in infection, principally in patients who develop persisting viremia^10,12.

B19 sequences cluster into three genotypes, further divided to subtypes. Currently, in addition to the worldwide predominant genotype 1, with subgenotypes 1A and 1B, genotypes 2 and 3 with two subtypes 3a and 3b are identified^13,14. All genotypes have similar functional, structural and immunological characteristics and comprise the same serotype¹⁵.

Members of the family Parvoviridae are characterized by high genetic diversity with substitution rates in the range of 1–2 × 10⁻⁴ per site per year, similar to those of ssRNA viruses¹⁶. So far, B19 substitution rate has been estimated on partial NS1 and VP1 gene sequences for genotypes 1 and 3, with two studies investigating near full-length B19 genome, albeit including limited number of sequences^14,17,18,19. Lately, the number of B19 genome sequences deposited in DNA sequence databases has largely increased. We aimed to reevaluate B19 genome variability data and phylogenetic relations in the most prevalent B19 genotype 1, using near complete coding DNA (cDNA) sequences currently present in the GenBank database, together with newly acquired B19 sequences from Serbia, generated for this study. Further, with different codon-based maximum likelihood methods we analyzed the extent of selection pressure on particular genes or codons, aiming to investigate the impact of natural selection to high B19 substitution rate.

Results

Phylogenetic analysis

The results of phylogenetic analysis were consistent, by all the applied methods. Reconstructed phylogenetic tree revealed clustering of genotype 1A isolates into two large lineages, containing 122/133 (93.13%) of all analyzed isolates (Fig. 1), one consisted of 80/122 and another one of 42/122 isolates, corresponding to clusters 1A1 and 1A2, respectively. Remaining 9/133 isolates, sampled in a large time span from 1973 to 2003, formed 4 additional distinct small clusters. Local, Serbian isolates RS2 to RS5 formed a sub-clade in the major cluster of subtype 1A1, whereas isolate RS1 was found separated in second major cluster 1A2.

Average nucleotide distance in the whole analyzed near complete cDNA dataset of 133 B19 genotype 1 isolates was 0.014, s.d. = 0.009. Nucleotide distance between subgenotypes 1A and 1B was 0.055, s.d. = 0.003. Intragroup nucleotide distance for 1A subgenotype was 0.013, s.d. = 0.005.

Substitution rates

Root to tip linear regression analysis revealed sufficient temporal structure of the collected dataset (R2 = 0.15, Correlation coefficient = 0.39 (Fig. 2). Evolutionary rate was calculated on near complete cDNA dataset and on the same dataset partitioned based on open reading frames (ORF1 and ORF2). Results are shown in Table 1.

Table 1 B19 genotype 1 nucleotide substitution rate.

Full size table

The highest substitution rate was observed for VP2, the functional part of VP1, 2.32 × 10⁻⁴ substitutions/site/year, respectively. The values of substitution rates for NS1 and VP1 were very close and significantly lower than VP2, yet higher than the rate of cDNA (Table 1). Marginal distributions of the rates from the different genome partitions for the strict clock data are presented in Fig. 3, showing that indeed they overlap for the regions NS1 and VP1 mutually and with both cDNA and VP2 on either side of the graph. However, this is not the case with marginal distributions for cDNA and VP2. Hence, we conclude that the rate for VP2 is indeed significantly different compared to cDNA. Since the 95% HPD interval for nucleotide substitution rate in uVP1 was found to be rather large, encompassing the values for other partitions, highlighting the uncertainty in the rate estimate in this region, we excluded this region from the comparisons.

Natural selection

Regarding two main ORFs of B19 genome, overall selection pressure, measured as the mean ratio of nonsynonymous (dN) to synonymous substitutions (dS) per site (dN/dS) was 0.150 and 0.087 for NS1 and VP1, respectively.

Majority of variable codons in both NS1 and VP1 genes were found to be under strong negative selective pressure or neutrally evolving (P < 0.1) (Fig. 4). Detection of positively selected sites by different analytical algorithms applied is presented in the Supplementary Table S2. In short, 10 codons in VP1 and 9 codons in NS1 were identified under diversifying selection (Supplementary Table S2, Fig. 4). Of those, 3 sites in VP1 and 1 position in NS1 were identified as positively selected by two or more analysis methods.

Selection pressure analysis on B19 genes coding for small protein products of 7.5 kDa, 9 kDa and 11 kDa estimated mean dN/dS of 0.297, 0.760 and 0.463, respectively, which is substantially higher compared to large proteins (Supplementary Table S2).

In particular, we compared traces of natural selection in overlapping B19 genes that are expressed in different reading frames. Substitution T2276C is non-synonymous in the NS1 gene (F554 → L) with indication of positive selection (FEL and IFEL, P < 0.05), whereas the same substitution is synonymous when expressed as the first nucleotide of codon 65 (TTG → CTG) in the 7.5 kDa gene, where it is found to be subjected to purifying selection force (P < 0.05, Supplementary Table S2). Transition T3061G was positively selected in codon 63 of 9 kDa (TTG → TGG, L63W, FEL P < 0.05), whereas in uVP1 it induces synonymous substitution with negative selection force on codon 146 (GTT → GTG, P < 0.1 in FEL and IFEL). Similarly, non-synonymous substitution G2916A, in the 9 kDa gene (GCA → ACA, A15T) is found under positive selection (FEL P < 0.05), while in the same time, it is found under neutral selective force in the uVP1 gene (AGC → AAC, S98N, Supplementary Table S2).

To elucidate a non-random usage of synonymous codons for specific amino acids we calculated Relative Synonymous Codon Usage (RSCU) values, which are presented as the observed frequency of a codon, divided by its expected frequency under the assumption of equal codon usage²⁰. RSCU values, calculated for two ORFs of B19 genome, showed that some codons were favored in both major genes (Supplementary Table S3). For NS1, this is the case for synonymous substitutions in codons with the highest value of negative selection (normalized dN-dS < -10), such as 116 (GTG → GTA, RSCU are 1.06:1.37), 320 (AAG → AAA, RSCU are 0.51:1.49) and 586 (CAG → CAA, RSCU are 0.86:1.14). In VP1, the highest value of negative selection (normalized dN-dS < -2) for synonymous substitutions was found for codons: 515 (TTC → TTT, RSCU are: 0.05:1.95), 546 (GGT → GGA, RSCU are 0.95:1.81) and 583 (CAG → CAA, RSCU are 0.86:1.14).

Natural selection and substitution rate analyses of B19 genome, as evolutionary parameters, showed that substitution rate of the ORF2 region, coding for VP1 and VP2 structural proteins was higher compared to ORF1, albeit with stronger purifying selection (VP1 = 1.64 vs. NS1 = 1.36 substitutions/site/year x 10–4; mean dN/dS for VP1 = 0.087 vs. NS1 = 0.150). In the partitioned analyses of VP1, uVP1 region was characterized by the highest dN/dS ratio, albeit with the lowest substitution rate, compared to genes coding for large B19 proteins (Table 1, Fig. 4).

Discussion

We studied substitution rate and natural selection in parvovirus B19 genotype 1.

Parvovirus B19 genotype 1, the most frequent worldwide, is known to be divided into subgenotypes: the predominant 1A and rarely found 1B^1,2. So far, only two complete cDNA subgenotype 1B sequences originating from Vietnam are present in the GenBank (DQ357064 and DQ357065). Besides, Barros de Freitas et al. detected several subgenotype 1B isolates in patients with hematological disorders from Brazilian Amazon region, based on 476 nt in ORF2 genome region²¹. The same phylogenetic study described for the first time two clear clades in subgenotype 1A, designated as 1A1 and 1A2. This clustering within subgenotype 1A was confirmed by phylogenetic analysis based on 446 nt in NS1 gene region, also in Brazilian isolates¹⁹, and in isolates collected in the Netherlands based on almost complete cDNA genome sequences²².

Our phylogenetic analysis of B19 genotype 1 is in line with previous findings of two main clades within subgenotype 1A, subtypes 1A1 and 1A2. However, the bootstrap support for the key nodes is less than 70%, indicating insufficient phylogenetic resolution to firmly support the existence separate subtypes. Of note, we found 9 isolates clustering within the 1A genotype, yet outside the two presumed 1A subtypes (Fig. 1). Gathering of additional sequence data would help to elucidate the true topology of B19 genotype 1 phylogeny and the distinct subtypes and clades, as proposed by some investigators^19,22.

Generally, phylogenetic clustering in our analysis did not tend to correlate to either collection time or geographical location of sequence origin. However, some isolates from the same country and close collection date, did cluster together, e.g. isolates from France (FN669503, FN669504, FN669506, 2009), Germany (AJ781031 to AJ781038, 2002), Brazil (KC13327, KC13344, KC13331, KC13333-2009 to 2010) and Japan (AB126262 to AB126264, 1996 to 1998, AB126271-2000, AB030673-1986, AB030693-1992). On the other hand, isolates from the Netherlands, the largest collection of 62 almost complete B19 sequences from the same country, collected in the period 2003 to 2009, were found dispersed along both clades 1A1 and 1A2 (Fig. 1). Similarly, newly obtained isolates in our study were found scattered within the subtype 1A1 (RS4 and RS6 to RS8, collected in 2011 to 2012) and the isolate RS1, collected in 2009, was the only one that clustered in the subtype 1A2. Generally, existing B19 molecular epidemiology studies found subtype 1A1 to be more prevalent than 1A2, although these two subtypes co-exist around the world. However, clear functional (biological) difference between the subtypes in ability to cause different symptoms has not been found^19,22,23.

Our estimation based on near complete cDNA of genotype 1A sequences, isolated in different parts of the world over time span of 40 years revealed the rate of accumulation of nucleotide changes in the B19 cDNA of 1.03 ± 0.1 × 10⁻⁴ substitutions/site/year. Previous studies, based on the alignment of a similar length (4216 nt, positions 654 to 4869 nt) but much smaller in size, estimated B19 evolutionary rate at 1.83 × 10⁻⁴ substitution/site/year for genotype 1, and 2.0 × 10⁻⁴ substitution/site/year for genotype 3^14,17.

Recent studies of viral evolutionary dynamics have shown that evolutionary rate of ssDNA viruses is similar to the one of ssRNA viruses. Besides human parvovirus B19, substitution rate in the order of magnitude of 10⁻⁴ was shown for other ssDNA viruses such as the circovirus SEN-V and plant geminivirus Tomato yellow leaf curl virus^16,24. Substitution rate depends on many factors concerning viral life cycle, such as mutation rate, generation time, transmission and natural selection¹⁶. Mutation rate of B19 virus, in the sense of real time measurement of nucleotide changes in biological systems has not been measured so far. It could be expected to be high, in view of single stranded architecture of B19 genome, of small size and high replication turnover in the acute faze of infection, when viremia frequently reaches 10¹⁰–10¹³ genome equivalents/mL¹¹. Since fixation of mutation is directly influenced by forces of natural selection, we compared substitution rate and selection pressure on discrete gene regions. Our findings (Table 1) are in line with previous reports by Shackelton and Holmes that estimated substitution rates for both ORF1 and ORF2 to be in the same order of magnitude as complete coding DNA, however, substitution rate of ORF2 was higher compared to ORF1¹⁷.

In our study selection analysis revealed negative selection to be in action on the most of the B19 genome. Functional aspects of viral proteins may significantly effect on selection pressure, limiting fixation of new mutations. As a major structural protein accounting for about 95% of the capsid, VP2 also contains the receptor-binding site i.e. globoside-binding motif from 577aa to 677aa^2,3. Involvement of VP2 in capsid composition and cell entry determines numerous strongly negatively selected codons and weak positive selection. We identified codons under strong negative selection in other VP2 positions with known or presumed functional roles: non-synonymous substitution V21A close to the VP2 N-terminal end could be important in protein folding and capsid formation, whereas the basic motif (493KLGPRKATGRW503) found in the VP2 C-terminal region is known to be necessary for nuclear localization of viral particle^25,26,27. Other codons (515, 546, 583 and 709) under highest negative selection pressure in VP2 contain synonymous substitutions, reflecting codon usage pattern as depicted by us or composition and gDNA packaging into the virion, as also speculated by Shackelton et al.²⁰

Complete loss of virus infectivity in uVP1 null mutants exemplifies the crucial role of uVP1 protein in viral life cycle, in particular, enzymatic role of B19 phospholipase A2-like (vPLA2), encoded by uVP1 nt 3011–3208 (codons 130–195). This is reflected by full conservation of uVP1 motifs 130YXGXG134 and 153HDXXY157, found in our analysis^4,28.

NS1 has a conserved architecture consisting of: N-terminal DNA-binding/nickase and endonuclease domain⁵; central region with conserved NTP-binding and helicase/ATPase motifs²⁹; and the unique C-terminal region suggested to interact with different proteins^18,30. We found substantial portion of the NS1 protein to be under purifying selection, in particular regions implicated in its enzymatic functions. C-terminal end of NS1 has been previously shown to be highly polymorphic and involved in host-protein interactions and promoter trans-activation^18,30. In our analysis, positive selection was absent from previously defined transactivation domains (TAD1 to TAD3)³¹, participating in disruption of cell cycle at the S phase.

In our analysis ten codons in VP1 and 9 codons in NS1 were identified under diversifying selection with any of the methods used, whereas only 3 sites in VP1 and 1 position in NS1 were identified as positively selected by two or more analysis methods, reflecting inherent limitations of the methods used to estimate evidence of selective pressure at the codon level. In their analysis of a smaller B19 dataset оf 38 sequences Shackelton and Holmes did not reveal any codons with positive selection¹⁷. However, it was shown that the ability of the employed approaches to identify codons under positive selection pressure greatly depends on the number of sequences used in analyses³².

Positions under positive selection in different parts of the ORF2 could reflect immune response as the main driver of natural selection, since the 3 identified positively selected VP1 codons (4, 12, 107) fall within known VP1 immune dominant epitopes (VP1-F1 (aa 2–100), VP1-F2 (aa 99–227))^33,34,35.

A number of studies dating from the nineties have identified several B cell epitopes in VP2, conformational and linear, implicated mostly in the acute phase immune response^12,35. Many amino acid substitutions in conformational epitopes do not affect antibody binding³³, leading to the increased limit of viability (the so-called error threshold), resulting in selective forces being neutral¹⁶. Consequently, in spite of higher substitution rate in VP2 (uVP1 = 1.11 vs. VP2 = 2.32 substitutions/site/year x 10⁻⁴), stabilizing selection is more expressed in this region (mean dN/dS: uVP1 = 0.325, VP2 = 0.055). Notably, in our analyses, positively selected codons appeared rarely in the complete VP1 and VP2, characterized as episodic selection, contrary to uVP1 with diversifying substitutions, pivotal for long persistent infection. Signals of diversifying pervasive selection identified at NS1 positions 195 and 279 coincide with antigenic determinants known to be implicated in persistent infection^10,12,24.

Genes coding for B19 small proteins (7.5 kDa and 9 kDa) are known to be the least variable in B19 genome^17,18, as also shown in our analysis. So far, only few sequences of the 11 kDa have been deposited to public sequence databases. In this study, we contributed with 6 full sequences of 11 kDa sequences of B19 genotype 1, in addition to 21 previously exiting in the GenBank. Based on very limited dataset, this region is highly conserved with rare and mostly neutral substitutions.

Of note, we identified opposite action of natural selection on the same nucleotide positions in the overlapping genes, expressed in different reading frames. Sustainability of these polymorphisms in overlapping genes could depend on the influence of the nucleotide substitution on adaptive evolution of both proteins and their overall impact on viral fitness.

Conclusion

Here, we present phylogenetic analysis of the largest dataset of 133 near complete coding B19 genotype 1 sequences analyzed so far. Substitution rate analysis confirmed high substitution rate of B19 DNA genome, comparable to RNA viruses, in the range of 10⁻⁴ substitution/site/year. Generally, negative selection was found in action on the most of the B19 genome, with diversifying selection operating at certain codon positions, located mainly in antigenic domains and consequently driven by immune response pressure. Complex mechanism of maintenance of genome variability is demonstrated by codon selection analyses in overlapping gene regions with selection in opposite direction in the same nucleotide positions.

Gathering of additional sequence data would help to elucidate parvo B19 genotype 1 evolution.

Materials and Methods

Patient samples

We collected 10 blood samples (with EDTA) from patients seropositive for B19 IgM and/or IgG, or having symptoms indicative of B19 infection. Four out of ten were serial samples from same individuals. All samples were collected during 2009–2012, after obtaining informed consent from the patients. Plasma was separated immediately (except for RS-1 that was frozen as whole blood), and stored frozen (−20 °C) prior to testing. The study was approved by the institutional ethical committee of the Clinical Center of Serbia. All experimental methods involving human participants were carried out in accordance with the relevant guidelines and regulations.

B19 genome detection and sequencing

Viral DNA was extracted from 200 μL of plasma or blood using QIAamp MinElute Virus Spin Kit or QIAamp DNA Blood Virus Spin kit, respectively (QIAGEN GmbH, Germany), according to manufacturer’s instruction, and eluted with 60 μL of elution buffer. Complete cDNA was PCR amplified with primers listed in Supplementary Table S4. Amplicons were sequenced in both directions by BigDye Terminator v3.1 Cycle Sequencing Kit (PE Applied Biosystems, Foster City, CA) and sequences were basecalled and assembled by ABI softwares: Sequencing Analysis 5.1 and SeqScape software, v 2.5.

Nucleotide sequences were successfully retrieved from 8/10 analyzed patient samples: 5 near complete and 4 partial B19 genome sequences were obtained, designated RS1-10. Obtained B19 sequences were deposited in the GenBank under accession numbers KR005636- KR005644 (Supplementary Table S1). No unusual stop codons, frame-shifts, insertions or deletions were found in the obtained sequences, except for RS1 isolate, lacking the start codon for 11 kDa protein. Two sequence pairs, successively sampled from the same individuals, were recovered: isolates RS1 and RS2, retrieved during an exchange transfusion for fetal hydrops (RS1) and then from the mother after one year (RS2); isolates RS8 and RS9/RS10 collected within an interval of three weeks, from adult patient with migratory arthritis and pericarditis. Notably, there was no nucleotide divergence between isolates of the latter pair, whereas distance between the former pair was 2.6% (s.d. = 0.02), among the highest ones in collected genotype 1A isolates.

Sequence datasets

B19 genotype 1 sequences present in the GenBank database covering ≈95% of genomic cDNA (from nt position 665 to 4851, numbering according to reference B19 isolate NC_000883.2) were collected, resulting in the total of 137 sequences. Only sequences with available collection time/place containing no deletions and insertions were included in the study (Supplementary Table S1A). For the analyses of selection pressure we analyzed codon-based alignments: for ORF1 and ORF2, with the total of 146 and 214 sequences, respectively, and additional 27 sequences for 11 kDa small protein within ORF2, deposited in the GenBank database (Supplementary Table S1B).

Phylogenetic analysis

Multiple nucleotide sequence alignments were created of almost complete cDNA sequences and separately for two reading frames (ORF1 and ORF2), using CLUSTAL W, as implemented in MEGA 6 software³⁶. The best-fit nucleotide substitution model for aligned sequences was determined by jModeltest 2.1.4 software³⁷ using all 88 proposed models. Bayesian information criterion (BIC) was used to determine the model of nucleotide substitution that best fits the data for each of the subsets analyzed. Alignments were screened for recombination using RDP, GENECONV, Bootscan, SiScan recombination detection approaches as implemented in the program RDP4 v.4.36³⁸. Recombination screening of the analyzed dataset of 137 B19 genotype 1 cDNA sequences (Supplementary Table S1A) detected 4 putative recombinants, that were excluded from further phylogenetic analyses (DQ225148, AB126266, KC013312 and AB126270).

Further phylogenetic analyses, construction of trees and nucleotide distance calculation were performed by both neighbor joining and maximum-likelihood approach implemented in Phylogenetic Analysis Using Parsimony (PAUP) version 4.0b10 software package³⁹. Bootstrap support for the tree nodes of the reconstructed phylogenetic trees was calculated with 1000 replicates by IQTREE v. 1.1.0 software⁴⁰. Bootstrap values exceeding 70% were considered significant.

To explore temporal structure of the sequences included in the analysis an exploratory root-to-tip linear regression was performed with TempEst v. 1.5, by importing ML phylogenetic tree constructed in PhyMl v.3.0^41,42. This method performs a linear regression between the time of sampling of each tip and the genetic distance from the root.

Substitution rates

The nucleotide substitution rates were estimated using Bayesian Markov Chain Monte Carlo (MCMC) approach implemented in BEAST v.1.8.3⁴³. In order estimate the best fit evolutionary model, the analyses were initially performed under both strict and relaxed (uncorrelated exponential and uncorrelated lognormal) molecular clocks, with Bayesian skygrid as coalescent tree priors. The MCMC chain was run for 30,000,000 steps with parameter values sampled at every 3000 steps. Log marginal likelihoods were determined by generalized stepping stone sampling. The best fit model was chosen according to Bayes factor. The analysis under the chosen model was performed in two additional runs in 50,000,000 steps each, with sampling at every 1000 steps and the results were combined using LogCombiner 1.8.3 (implemented in BEAST) with 10% burn-ins removed from each run. The resulting log files were further explored in Tracer 1.6 to ascertain convergence of the chain and ESS values >200 for all parameters. The uncertainty in the parameter estimates were assessed by 95% HPD interval (Table 1). Marginal probability distribution test integrated in Tracer v.1.6 was applied for comparison data of mean nucleotide substitution rate for all analyzed genome regions.

Estimation of evolutionary pressure

Selection pressure was analyzed in two B19 major proteins (NS1 and VP1) and three small proteins (7.5 kD, 9 kDa and 11 kDa protein), based on the alignments described above and in Supplementary Table S1B.

Evolutionary pressure was assessed using HyPhy software package implemented by the Datamonkey web-based facility (http://www.datamonkey.org)⁴⁴. Overall selection pressure, measured as the mean ratio of nonsynonymous (dN) to synonymous substitutions (dS) per site (dN/dS), was estimated using four different likelihood approaches for analyzed datasets: the Single Likelihood Ancestor Counting (SLAC), Fixed-Effects Likelihood (FEL) internal branch Fixed-Effects Likelihood (IFEL) and Random-Effects Likelihood (REL) methods⁴⁴. In addition, we used mixed effects model of evolution (MEME) that is capable of identifying instances of both episodic and pervasive positive selection at the level of an individual site⁴⁵. For all the methods, Tamura-Nei model (TrN) or Hasegawa-Kishino-Yano (HKY85) were used as nucleotide substitution model. Separate phylogenetic tree for each analyzed partition was inferred by the neighbor-joining method (NJ) implemented in the HyPhy package available on the Datamonkey webserver. The diversity scores were considered to be significant at a confidence interval of p ≤ 0.1. Relative Synonymous Codon Usage (RSCU) values were determined by MEGA 6 software^36,46,47.

Additional Information

How to cite this article: Stamenković, G. G. et al. Substitution rate and natural selection in parvovirus B19. Sci. Rep. 6, 35759; doi: 10.1038/srep35759 (2016).

References

Servant-Delmas, A., Lefrere, J. J., Morinet, F. & Pillet, S. Advances in Human B19 Erythrovirus Biology. J. Virol. 84, 9658–9665 (2010).
Article CAS Google Scholar
Gallinella, G. Parvovirus B19 achievements and challenges. ISRN Virology 898730, doi: 10.5402/2013/898730 (2013).
Zhi, N. et al. Molecular and functional analyses of a human parvovirus B19 infectious clone demonstrates essential roles for NS1, VP1, and the 11-kilodalton protein in virus replication and infectivity. J. Virol. 80, 5941–5950 (2006).
Article CAS Google Scholar
Deng, X. et al. The Determinants for the enzyme activity of human Parvovirus B19 phospholipase A2 (PLA2) and its influence on cultured cells. Plos One 8, e61440 (2013).
Article CAS ADS Google Scholar
Raab, U. et al. NS1 protein of parvovirus B19 interacts directly with DNA sequences of the p6 promoter and with the cellular transcription factors Sp1/Sp3. Virology 293, 86–93 (2002).
Article CAS Google Scholar
Cotmore, S. F., Gottlieb, R. L. & Tattersall, P. Replication initiator protein NS1 of the parvovirus minute virus of mice binds to modular divergent sites distributed throughout duplex viral DNA. J. Virol. 81, 13015–13027 (2007).
Article CAS Google Scholar
Morita, E. & Sugamura, K. Human parvovirus B19-induced cell cycle arrest and apoptosis. Springer Semin. Immunopathol. 24, 187–199 (2002).
Article Google Scholar
Chen A. Y. et al. The small 11 kDa nonstructural protein of human parvovirus B19 plays a key role in inducing apoptosis during B19 virus infection of primary erythroid progenitor cells. Blood 115, 1070–1080 (2010).
Article CAS Google Scholar
Thammasri, K. et al. Human Parvovirus B19 induced apoptotic bodies contain altered self-antigens that are phagocytosed by antigen presenting cells. Plos One 8, e67179, doi: 10.1371/journal.pone.0067179 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Modrow, S. & Dorsch, S. Antibody responses in parvovirus B19 infected patients. Pathol. Biol. 50, 326–331 (2002).
Article CAS Google Scholar
Koppelman, M., Rood, I., Fryer, J., Baylis, S. & Cuypers, H. Parvovirus B19 genotypes 1 and 2 detection with real-time polymerase chain reaction assays. Vox Sang. 93, 208–215 (2007).
Article CAS Google Scholar
Tolfvenstam, T., Lundqvist, A., Levi, M., Wahren, B. & Broliden, K. Mapping of B-cell epitopes on human parvovirus B19 non-structural and structural proteins. Vaccine 19, 758–763 (2000).
Article CAS Google Scholar
Servant, A. et al. Genetic diversity within human erythroviruses: identification of three genotypes. J. Virol. 76, 9124–9134 (2002).
Article CAS Google Scholar
Parsyan, A., Szmaragd, C., Allain, J. P. & Candotti, D. Identification and genetic diversity of two human parvovirus B19 genotype 3 subtypes. J. Gen. Virol. 88, 428–431 (2007).
Article CAS Google Scholar
Ekman, A. et al. Biological and immunological relations among human parvovirus B19 genotypes 1-3. J. Virol. 81, 6927–6935 (2007).
Article CAS Google Scholar
Duffy, S., Shackelton, L. A. & Holmes, E. C. Rates of evolutionary change in viruses: patterns and determinants. Nat. Rev. Genet 9, 267–276 (2008).
Article CAS Google Scholar
Shackelton, L. & Holmes, E. Phylogenetic Evidence for the Rapid Evolution of Human B19 Erythrovirus. J. Virol. 80, 3666–3669 (2006).
Article CAS Google Scholar
Norja, P., Eis-Hübinger, A. M., Söderlund-Venermo, M., Hedman, K. & Simmonds, P. Rapid sequence change and geographical spread of human parvovirus B19: comparison of B19 virus evolution in acute and persistent infections. J. Virol. 82, 6427–6433 (2008).
Article CAS Google Scholar
Slavov, S. N., Kashima, S., Silva-Pinto, A. C. & Covas D. T. Genotyping of Human parvovirus B19 among Brazilian patients with hemoglobinopathies. Can. J. Microbiol. 58, 200–205 (2012).
Article CAS Google Scholar
Shackelton, L., Parrish, C. & Holmes, E. Evolutionary basis of codon usage and nucleotide composition bias in vertebrate DNA viruses. J. Mol. Evol. 62, 551–563 (2006).
Article CAS ADS Google Scholar
Barros de Freitas, B. R. et al. The “pressure pan” evolution of human erythrovirus B19 in the Amazon, Brazil. Virology 369, 281–287 (2007).
Article CAS Google Scholar
Molenaar-de Backer, M. W. A., Lukashov, V. V., van Binnendijk, R. S., Boot, H. J. & Zaaijer, H. L. Global co-existence of two evolutionary lineages of parvovirus B19 1a., different in genome-wide synonymous positions. Plos One 7, e43206, 10.1371/journal.pone.0043206 (2012).
Article CAS ADS PubMed PubMed Central Google Scholar
Suzuki, M., Yoto, Y., Ishikawa, A. & Tsutsumi, H. Analysis of nucleotide sequences of human parvovirus B19 genome reveals two different modes of evolution, a gradual alteration and a sudden replacement: a retrospective study in Sapporo, Japan, from 1980 to 2008. J. Virol. 83, 10975–10980 (2009).
Article CAS Google Scholar
López-Bueno, A., Villarreal, L. P. & Almendral, J. M. Parvovirus variation for disease: a difference with RNA viruses? Curr. Top. Microbiol. Immunol. 299, 349–370 (2006).
PubMed Google Scholar
Bleker S., Sonntag, F. & Kleinschmidt, J. A. Mutational analysis of narrow pores at the fivefold symmetry axes of adeno-associated virus type 2 capsids reveals a dual role in genome packaging and activation of phospholipase A2 activity. J. Virol. 79, 2528–2540 (2005).
Article CAS Google Scholar
Shackleton, L., Parrish, C., Truyen, U. & Holmes, E. High rate of viral evolution associated with the emergence of carnivore parvovirus. P. Natl. Acad. Sci. USA 102, 379–384 (2005).
Article ADS Google Scholar
Pillet, S., Annan, Z., Fichelson, S. & Morinet, F. Identification of a nonconventional motif necessary for the nuclear import of the human parvovirus B19 major capsid protein (VP2). Virology 306, 25–32 (2003).
Article CAS Google Scholar
Dorsch, S. et al. The VP1 unique region of parvovirus B19 and its constituent phospholipase A2-like activity. J. Virol. 76, 2014–2018 (2002).
Article CAS Google Scholar
Toan, N. L. et al. Phylogenetic analysis of human parvovirus B19, indicating two subgroups of genotype 1 in Vietnamese patients. J. Gen. Virol. 87, 2941–2949 (2006).
Article CAS Google Scholar
Kivovich, V., Gilbert, L., Vuento, M. & Naides, S. J. The Putative metal coordination motif in the endonuclease domain of human Parvovirus B19 NS1 is critical for NS1 induced S phase arrest and DNA damage. Int. J. Biol. Sci. 8, 79–92 (2012).
Article CAS Google Scholar
Luo, Y., Kleiboeker, S., Deng, X. & Qiu, J. Human parvovirus B19 infection causes cell cycle arrest of human erythroid progenitors at late S phase that favors viral DNA replication. J. Virol. 87, 12766–12775 (2013).
Article CAS Google Scholar
Añez G., Morales-Betoulle, M. E. & Rios, M. Circulation of different lineages of dengue virus type 2 in Central America., their evolutionary time-scale and selection pressure analysis. Plos One 6, e27459, doi: 10.1371/journal.pone.0027459 (2011).
Article CAS ADS PubMed PubMed Central Google Scholar
Musiani, M. et al. Immunoreactivity against linear epitopes of parvovirus B19 structural proteins. Immunodominance of the amino-terminal half of the unique region of VP1. J. Med. Virol. 60, 347–352 (2000).
Article CAS Google Scholar
Dorsch, S. et al. The VP1-unique region of parvovirus B19: amino acid variability and antigenic stability. J. Gen. Virol. 82, 191–199 (2001).
Article CAS Google Scholar
Kaikkonen, L. et al. Acute-phase-specific heptapeptide epitope for diagnosis of parvovirus B19 infection. J. Clin. Microbiol. 37, 3952–3956 (1999).
CAS PubMed PubMed Central Google Scholar
Tamura, K., Stecher, G., Peterson, D., Filipski, A. & Kumar, S. MEGA6: Molecular Evolutionary Genetics Analysis Version 6.0. Mol. Bio. Evol. 30, 2725–2729 (2013).
Article CAS Google Scholar
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. jModelTest 2: more models, new heuristics and parallel computing. Nat. Methods 9, 772 (2012).
Article CAS Google Scholar
Martin, D. P., Murrell, B., Golden, M., Khoosal, A. & Muhire, B. RDP4: Detection and analysis of recombination patterns in virus genomes. Virus Evol. 1, 1–5, doi: 10.1093/ve/vev003 (2015).
Article Google Scholar
Swofford, D. L. PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. (ed. Swofford, D. L. ) (Sinauer Associates, 2003).
Minh, B. Q., Nguyen, M. A. T. & von Haeseler, A. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30, 1188–1195 (2013).
Article CAS Google Scholar
Rambaut, A., Lam, T. T., Carvalho, L. M. & Pybus, O. G. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evolution 2(1), vew007, doi: 10.1093/ve/vew007 (2016)
Article PubMed PubMed Central Google Scholar
Guindon, S. New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Article CAS Google Scholar
Drummond, A. J., Suchard, M. A., Xie, D. & Rambaut, A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 29, 1969–1973 (2012).
Article CAS Google Scholar
Delport, W., Poon, A. F., Frost S. D. W. & Kosakovsky Pond, S. L. Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics 26, 2455–2457 (2010).
Article CAS Google Scholar
Kosakovsky Pond, S. L., Frost, S. D. W. & Muse, S. V. HyPhy: hypothesis testing using phylogenies. Bioinformatics 21, 676–679 (2005).
Article Google Scholar
Schneider, B. et al. Simultaneous persistence of multiple genome variants of human parvovirus. B19. J. Gen. Virol. 89, 164–176 (2008).
Article CAS Google Scholar
Murrell, B. et al. Detecting Individual Sites Subject to Episodic Diversifying Selection. PLoS Genet 8, e1002764, doi: 10.1371/journal.pgen.1002764 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to thank Dr. Milica Nešić for assistance in the collection of samples. This study was supported by the Ministry of Education, Science and Technological Development of Republic of Serbia, grant No. 175024.

Author information

Authors and Affiliations

Department of Genetic Research, Institute for biological research “Siniša Stanković”, University of Belgrade, 142 Despot Stephan Blvd, Belgrade, 11060 R, Serbia
Gorana G. Stamenković & Jelena V. Blagojević
Institute for Microbiology and Immunology, School of Medicine, University of Belgrade, 1/1 Dr Subotića St, Belgrade, 11000 R, Serbia
Valentina S. Ćirković, Marina M. Šiljić, Aleksandra M. Knežević & Maja P. Stanojević
Clinic of Obstetrics and Gynecology “Narodni front”, 62 Kraljice Natalije St, Belgrade, 11000 R, Serbia
Ivana D. Joksić

Authors

Gorana G. Stamenković
View author publications
You can also search for this author in PubMed Google Scholar
Valentina S. Ćirković
View author publications
You can also search for this author in PubMed Google Scholar
Marina M. Šiljić
View author publications
You can also search for this author in PubMed Google Scholar
Jelena V. Blagojević
View author publications
You can also search for this author in PubMed Google Scholar
Aleksandra M. Knežević
View author publications
You can also search for this author in PubMed Google Scholar
Ivana D. Joksić
View author publications
You can also search for this author in PubMed Google Scholar
Maja P. Stanojević
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived the study: G.G.S. and M.P.S. Wrote the paper: G.G.S. and M.P.S. Performed the analysis: G.G.S., V.S.Ć., M.P.S. and J.V.B. Carried out the experiments: G.G.S., V.S.Ć., M.M.Š., I.D.J. and A.M.K. Supervised fieldwork and experiments: G.G.S. and M.P.S.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Stamenković, G., Ćirković, V., Šiljić, M. et al. Substitution rate and natural selection in parvovirus B19. Sci Rep 6, 35759 (2016). https://doi.org/10.1038/srep35759

Download citation

Received: 27 April 2016
Accepted: 03 October 2016
Published: 24 October 2016
DOI: https://doi.org/10.1038/srep35759

This article is cited by

The epidemiological and genetic characteristics of human parvovirus B19 in patients with febrile rash illnesses in China
- Haoran Jiang
- Qi Qiu
- Xiaomei Li
Scientific Reports (2023)
Molecular-genetic characterization of human parvovirus B19 prevalent in Kerala State, India
- Dayakar Seetha
- Heera R. Pillai
- Radhakrishnan R. Nair
Virology Journal (2021)
Molecular analysis of goose parvovirus field strains from a Derzsy’s disease outbreak reveals local European-associated variants
- Hakan Isidan
- Turhan Turan
- Alparslan Coskun
Archives of Virology (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Mutational spectrum of SARS-CoV-2 during the global pandemic

A global analysis of conservative and non-conservative mutations in SARS-CoV-2 detected in the first year of the COVID-19 world-wide diffusion

Phylogenetic molecular evolution and recombination analysis of complete genome of human parechovirus in Thailand

Introduction

Results

Phylogenetic analysis

Substitution rates

Natural selection

Discussion

Conclusion

Materials and Methods

Patient samples

B19 genome detection and sequencing

Sequence datasets

Phylogenetic analysis

Substitution rates

Estimation of evolutionary pressure

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

The epidemiological and genetic characteristics of human parvovirus B19 in patients with febrile rash illnesses in China

Molecular-genetic characterization of human parvovirus B19 prevalent in Kerala State, India

Molecular analysis of goose parvovirus field strains from a Derzsy’s disease outbreak reveals local European-associated variants

Comments

Search

Quick links