Introduction
Worldwide, human noroviruses are responsible for 16% of the sporadic cases and outbreaks of acute gastroenteritis [
1]. Although the norovirus illnesses are self-limiting in healthy individuals, the risk they pose to young children, elderly people, and immunocompromised patients cannot be ignored [
2,
3]. Annually, approximately 200,000 children deaths resulted by noroviruses and bring a huge social burden of 60 billion dollars [
4]. A low infectious dose and easily transmissibility make challenging to control the noroviruses burden [
5]. However, efforts to develop noroviruses vaccines were hindered due to multiple factors [
6,
7].
Noroviruses consist of at least 31 genotypes in the two major noroviruses genogroups, GI and GII. For the past two decades, the GII.4 genotype has been dominant worldwide, leading ~ 75% of noroviruses infections [
8,
9]. However, the emergence of novel GII.17 catching publics’ attention, has outcompeted GII.4 as the predominant cause of noroviruses outbreaks in several countries in late 2014 [
10]. The first GII.17 strain reported in GenBank was Hu/GII.17/C142/1978/GUF (GII.17.1978). Then the strain stayed “antigenically static” until sufficient point insertions and deletions in its capsid gene to change the pattern [
11,
12]. Residues substitutions at and around binding sites of histo-blood group antigens (HBGAs) and antigenic features may drive the emergence of a new pandemic variant emergence or immune evasion. Although the novel GII.17 strains seem to infect new target populations continually with altered ligand binding properties and antigenicity [
13,
14], these sites have not been investigated. Findings of extensive antigenic diversity between various noroviruses strains and the interrelationships between herd protective immunity and the virus antigenic heterogeneity are warranted, which assist in developing noroviruses vaccine and revealing the evolutionary mechanisms of the virus.
Researches on virus-like particles (VLPs) which mirror in morphology and antigenicity to natural virions, are the reliable and robust approaches to shed light on the evolutionary mechanisms of noroviruses and control the burden of infections. P domain resides the outmost of the virion, which replicates many details of VLP, including antibody binding sites [
15]. Although the number of P protein and VLP applications has been rapidly grown and advance the vaccine development, the production procedures are time-consuming. Evolutionary mechanism might reach a further step if a rapid screening method is applied.
In the absence of culture methods and a small animal model, mutagenesis analysis is broadly used to explore norovirus’s antigenic features. As mentioned above, P proteins expressed by
E. coli system and VLP expressed by insect cells are time-consuming. To further explore GII.17 antigenic variation, we applied the cell-free protein synthesis (CFPS) system, an alternative to express panels P domain proteins with mutations; this system is widely known for its high productivity and is increasingly used in research. As the platform is an “open” system, the additional components, such as essential energy and modified amino acids can be supplemented conveniently. The most commonly used system, the
E. coli S30 extract-based CFPS system, offers the highest protein yield [
16] and is more suitable for producing mutant strains with which binding assays using monoclonal antibodies (mAbs) [
17,
18].
Herein, we aimed to provide valuable antigenic features of the 2014–2015 GII.17 pandemic using GII.17 P proteins and E. coli S30 extract-based CFPS system. Combined with bioinformatics analyses, functional regions of GII.17 capsid gene were predicted and a panel of mutant GII.17 variants with chimeric regions were generated. An enzyme immunoassay (EIA) was then used to define the potential new antigenic features of GII.17 with mouse selected monoclonal antibodies (mAbs). Finally, the function of region screening by the CFPS system was confirmed by the cell-based system.
Materials and methods
Data collection and preparation for bioinformatics analysis
All full-length nucleotide sequences of the GII.17
VP1 region (> 1560 bp) were obtained from GenBank. The sequences were downloaded with the information of the accession number, collection date and geographical region. Accession numbers of sequences for phylogenetic tree construction were listed in Additional File
1. To minimize the number of the sequences with high similarity, CD-HIT software [
19] was used to remove redundant sequences with a threshold of 99% similarity. Multiple sequences of the constructed dataset were aligned using Multiple Alignment in Fast Fourier Transform (MAFFT) software [
20].
Evolutionary analysis
A best-fit nucleotide substitution model was determined using Akaike information criterion (AIC) as implemented in jModelTest [
21], which was used to reconstruct a maximum-likelihood (ML) tree with a bootstrap analysis of 1000 replicates using PhyML version 3.0 [
22]. Phylogenetic trees were analyzed using Bayesian inference via the Markov Chain Monte Carlo (MCMC) method in the BEAST 1.10.4 software [
23]. Best substitution models (GTR + G) were determined for the selected sequences by comparison of the Bayesian Information Criterion (BIC) values using jModelTest. Molecular clock models calculated the nucleotide substitution rates of norovirus GII.17, as well as the time to the most recent common ancestor (TMRCA). The MCMC runs were performed with chain lengths of 40,000,000 steps, sampling every 4000 steps, for the selected sequences. Convergence of parameters were analyzed in the Tracer version 1.6 (
http://tree.bio.ed.ac.uk/software/tracer/) and effective sample sizes (ESS) values of 200 or more were accepted. 95% highest probability density (HPD) values were used to estimate statistical uncertainty in parameter values. Files were combined in LogCombiner v1.10.4, and the maximum clade credibility trees were generated by discarding the first 10% of trees using TreeAnnotator v1.10.4. The tree was visualized in the FigTree 1.4.2.
Plasmid construction and CFPS reactions
A new emerging GII.17 was obtained from stool sample in a continuous norovirus surveillance study in Guangzhou, China of 2014–2015 [
24]. Norovirus was detected in stool samples by targeting region A and C. Viral RNA was extracted from clarified supernatant using QIAamp Viral RNA MiniKit (QIAgen, Hilden, Germany). Whole-genome sequences of GII.17 were obtained using a newly established “4 + 1 + 1” PCR strategy [
25]. Other GII.17 capsid genes were synthesized chemically through Generay following codon optimization. Then we cloned the GII.17 P domain construct (amino acids 222–540) of GII.17 into the pET-22b ( +) expression vector using Nde I and Xho I restriction sites.
Proteins were expressed from the plasmid using the E. coli S30 Extract System (Promega, USA), incubated at 37℃ for 1 h with shaking at 1200 rpm in a 1.5 mL Eppendorf tube. A standard reaction comprised 18 µL of E. coli S30 lysate, 20 µL of reaction mix with amino acids, and 1 µg of the plasmid template harboring the target gene, made to 50 µL with water.
Protein aggregation analysis and pure P proteins preparation
To quantify protein yield, the cell-free reaction mixture was used to perform an EIA with anti-GII.17 mAbs (mAb 2A11 and mAb 2D11, Genecreat, China). The stool sample of positive norovirus cases were subjected to RT-PCR to obtain the ORF1 region. We then cloned capsid gene of GII.17 and expressed proteins. Mouse were immunized with GII.17 antigen and splenocytes were isolated from the mouse, then fused with myeloma. We generated a dose–response curve by using various concentrations of pure GII.17-L343/P protein binding to mAb. Pure proteins were diluted using “empty” cell-free reaction mixture to create a similar procedure to that of other cell-free proteins. Pure proteins were expressed using a pGEX-4T-1/GII.17 clade D P recombinant in E. coli strain BL21 and purified using affinity chromatography. The GST tag was removed from target proteins using Thrombin (GE Healthcare Life Sciences) at 22℃ for 20 h. Subsequently, the expression level of cell-free proteins was quantified using the “standard curve”.
Enzyme immunoassay
The binding assay was carried out using recombinant P domain proteins (wild-type and mutants) and mAb. In brief, cell-free expressed P proteins were diluted and coated on microtiter plates at 4 °C overnight. After blocking with 1% bovine serum albumin, the two mAb (Genecreat, China) diluted in PBS were incubated with purified P domain proteins at 37 °C for 1 h. Following washing with PBST, HRP-conjugated goat anti-mouse IgG (Bioss, China) was added for incubating 30 min at 37 °C. After color development, read the absorbance at 450 nm. An OD450 of > twofold background after background subtraction was scored positive by the assay.
Antibody blockade of P domain proteins binding assay
P proteins were pretreated with decreasing concentrations of mAbs for 1 h at 37 °C and added to wells coated with pig gastric mucin type III (PGM, Sigma-Aldrich, USA) and incubated for 1 h at 37 °C. Followed by washing steps, mouse anti-GII.17 P domain hyperimmune serum diluted 1:2000 in PBS was added and incubated for 1 h at 37 °C. After washing with PBST, HRP-conjugated goat anti-mouse IgG was added for incubating for 30 min at 37 °C. Blocking rates were calculated by comparing binding levels of in the presence and absence of mAbs. Sera from free P proteins-immunized animals were used as negative controls.
Production of P proteins with mutations
Mutant P domain proteins with single or poly amino acid substitutions were designed and introduced into the GII.17 P domain using a wild-type templates. Site-directed residue mutations were performed using assembly PCR and the corresponding primer pairs (Additional File
2). After sequencing confirmation, mutant P domain proteins were expressed using the CFPS system and analyzed as described above.
Structural modeling
Structural homology models representing the GII.17 capsid P domain were generated using the Swiss-Model server of the Swiss Institute of Bioinformatics [
26], with default settings. Briefly, the Fasta format GII.17 capsid sequence was uploaded into the server, and the “Select Templates” option was selected. Models of P dimer was created using the templates and downloaded in.pdb format. Protein structures were visualized, and the figures were rendered using MacPyMOL version 2.1 (
http://www.pymol.org). Selected of strains accession numbers are summarized in Additional File
3.
Statistical analysis
Statistical analyses were performed using GraphPad Prism 7.04 (GraphPad Software, Inc, La Jolla, CA, USA). An analysis of variance (ANOVA) test was used to determine the mAb binding affinity between wild-type and mutant proteins. P value < 0.05 was considered significant difference.
Discussion
Genome analyses are a powerful tool to demonstrate the molecular evolution of the VP1 regions of GII.17 since its emergence. A GII.17-based MCMC time-scaled evolutionary phylogenetic tree showed that all the selected strains grouped into four clusters (clades A to D) [
30,
31]. Apart from capsid gene, polymerase regions differences also contributed to the epidemiologic characteristics of the novel GII.17 strain [
12,
32]. However, we did not assess the polymerase region of GII.17 because it was beyond our scope. Although predictive tools and empirical evidence continue to explore the dominant GII.4 genotypes, another strain GII.17 causing outbreaks should not be neglected. Unfortunately, limited information has been revealed regarding the cause of GII.17 outbreaks in the winter season of 2014–2015. To better understand the reason for the emergence of epidemic GII.17 clade D, we selected four representative strains, one from each cluster, to determine the effects of sequence change, thereby attempting to identify GII.17 antibody epitopes via P domain proteins.
To the best of our knowledge, this study is the first to report the antigenic epitope of GII.17 using rapid screening method, also providing direct evidence of the evolutionary mechanism in the major capsid gene of noroviruses. The sudden emergence of epidemic GII.17 strains emphasizes the need for understanding the norovirus molecular profile; thereby obtaining a global picture for vaccine development and molecular evolution. Novel GII.17 strains were expected to gain new antigenic features and a wider HBGAs binding profile compared to the previous strains of GII.17, which has been reported to result in the emergence of GII.17 [
13,
14], allowing these variants to continually interact with new target populations [
8].
Based on bioinformatics analyses and the conserved epitopes demonstrated in GII genogroup noroviruses, we previously deduced several potential epitopes of the new GII.17 variant [
33]. However, few studies have reported any relevant evidence. To address these possibilities, we performed mutational analysis. Antibody responses were similar to the P proteins expressed in
E. coli BL21, indicating that the cell-free system had translated the amino acids precisely. In this study, changes in residues 293–300 negatively affected mAb 2D11 binding, supporting their role in ligand binding and antigenicity. To confirm the key residues affecting antibody binding-related conformations, we then constructed a panel of mutants with alanine substitutions in amino acids 293–300. Substitutions of residues 295–300 resulted in a loss of mAb 2D11-binding activity. Furthermore, amino acid 298 appeared to be the key mAb 2D11-binding residue, while amino acids 295, 299, and 300 remained subordinate; but, the roles of subordinate residues should not be ignored. GII.17 clade A and GII.17 clade C strains with substitutions of residues as in GII.17 clade D confirmed this result. Together, these data provide valuable information for developing a vaccine with a possible neutralizing epitope.
Interestingly, two amino acids (I294 and N295) in epitope A did not interact with mAb 2D11 directly, rather mediating mAb 2A11 binding by changing particle conformation. Six continuous residues in the P domain, from amino acids 293 to 300, did not affect binding activity with mAb 2A11. Data of mAb 2A11 binding with reverse mutants GII.17 clade A mirrored this result. For GII.17 clade A, P protein bound to mAb 2A11 but could not react with mAb 2D11. It meant that P protein of GII.17 clade A could be expressed correctly without the specific epitope recognized by mAb 2D11. However, chimeric GII.17 clade A obtained the ability to bind to mAb 2D11 with lower protein yield (less binding to mAb 2A11), demonstrating mAb 2D11 recognized the residues of 293–300 on GII.17 clade D. Then we wondered whether 293-300aa on GII.17 clade D was involved in a ligand binding pocket. Although the data of binding assay did show the positive results, we could not obtain consistent results over three replicate experiments. Self-assemble may be restricted by platform components. Additionally, a previous study investigating CFPS system development for norovirus VLPs synthesis did not apply a binding assay [
29], potentially explaining the same unstable results. To determine whether these corresponding changes associated with ligand recognition, we expressed mutant P domain proteins in a cell-based system. Results of the ligand-binding assay showed that substituted residues 293–300 could bind with PGM, indicating that these charge changes did not correlate well with ligand attachment.
Our site-directed mutagenesis analysis of GII.17 mirrored the findings of a recent report [
34]. We and others’ results both showed that 293-300aa were an important epitope in GII.17 capsid gene, although different techniques were applied. The comparison reflected the advantages of the CFPS system: we used plasmids to perform mutagenesis analysis, while Yi et al
., might spend more time to express VLPs for screening mutants [
34]. The putative antibody recognition site on the P proteins was similar to epitope A of GII.4 [
35,
36]. Changes in GII.4 epitope A may have coincided with the appearance of a pandemic strain [
37]. Furthermore, this epitope is also the main target for blocking antibodies detected in human polyclonal sera [
38]. Several binding patterns were identified in GII.4 epitope A, including conformational restriction and linear sequence dependent [
38], and the latter pattern matched our results for GII.17. Eight residues, with four binding sites, are present in epitope A of GII.4, namely, linear 294–298, 368, 372–373, and a rare group [
39,
40]. Among these positions, we confirmed the linear epitope of residues 295–300, which is in line with GII.4. It has been reported that epitope A is hypervariable, with key residues loss correlating with viral escape from herd immunity [
35]. The GII.17 clade C cluster, as opposed to previous GII.17 clade B and GII.17 clade D strains, could not bind mAb 2D11, confirming the existence of evolving epitopes, namely, those that escaped antibody recognition from mutations in and around surface-exposed regions that varied over time; epitope reemergence revealed evidence as to why the GII.17 pandemic emerged.
After decades of GII.4 pandemics, we learned that such epochal evolution in response to herd immunity leads to severe public health threats. Based on these data, we proposed a model in which several variable regions in the GII.17 sequence are fixed, with mutations in these regions likely restricted to several amino acids; this model was developed in GII.4, but without solid evidence [
40]. Over time, norovirus strains have rapidly evolved, continuously escaping herd immunity. Understanding the specific residues and how noroviruses use them to evolve would greatly contribute to vaccine development. Ideally, a vaccine should contain all possible changes within the epitopes identified. Furthermore, unlike the 2014 epidemic, only a few cases of GII.17 norovirus infections have been reported in the past few years, suggesting that it evolved more rapidly than we thought [
41]. As such, to develop a universal vaccine that covers both existing and future pre-pandemic norovirus strains, it is imperative that the proposed model be confirmed.
Owing to a lack of optimal cell culture system, various purified norovirus VLPs are in progress to better understand high-resolution molecular information related to evolution or vaccine research. Structural characterization of non-infected norovirus VLPs in complex with ligands or receptors is vital for providing a global picture of epitopes [
42]. However, efforts to produce norovirus VLPs in insect cells are further impeded by the complex procedures and low yields of the correct architecture [
29]. As such, mutational analysis of P proteins is well-accepted for mapping norovirus epitopes using cell-based produce systems, such as
E. coli BL21. Considering this time-consuming procedure, it may be more appropriate to first apply the CFPS system to screen a panel of dozens of mutant proteins. Although self-assemble in the CFPS system was limited and protein expressed using the CFPS system could not bind to ligands, confirming the impact of key residues screened by the CFPS system via functional proteins using cell-based production systems could save much time and labor.
Limitations existed in this study. We did not use GII.17-specific mAbs that could map strain-specific motifs. As such, we could not investigate the effects of other epitopes on virus binding and neutralization. Although protein expressed using the CFPS system could not be calibrated accurately, the inability to absolute quantification could be compensated by relative quantitation. Another limitation is that the P proteins expressed using fast-expressing system is inability of binding with HBGAs. Further solutions to promote ligands binding stably with cell-free proteins are warranted.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.