Background
Methods
Data collection
Sequence and phylogenetic analysis
Visualization of amino acid substitutions and glycosylation sites in HA protein using 3D models of influenza a subtypes
Prediction of mutations effect on HA function
Estimation of evolution rate and positive selection
Statistical analysis
Results
Temporal distribution of influenza
Year | Dominant Subtype | pH1N1 | H3N2 | ||||||
---|---|---|---|---|---|---|---|---|---|
Number of positive samples (WHO) | Number of sequences analyzed (NCBI) | Vaccine strain | Number of positive samples (WHO) | Number of sequences analyzed (NCBI) | Vaccine strain | ||||
H1 | N1 | H3 | N2 | ||||||
2009
| pH1N1 | 21,586 | 132 | 97 | A/California/07/2009 | 1093 | 22 | 5 | A/Brisbane/10/2007 |
2010
| pH1N1 | 4136 | 68 | 42 | A/California/07/2009 | 280 | 1 | 1 | A/Brisbane/10/2007 |
2011
| pH1N1 | 3717 | 36 | 36 | A/California/07/2009 | 2183 | 22 | 22 | A/Perth/16/2009 |
2012
| H3N2 | 660 | 27 | 29 | A/California/07/2009 | 2373 | 37 | 32 | A/Perth/16//2009 |
2013
| pH1N1 | 4395 | 33 | 32 | A/California/07/2009 | 1568 | 13 | 8 | A/Victoria//361/2011 |
2014
| pH1N1 | 3559 | 25 | 25 | A/California/07/2009 | 2883 | 40 | 33 | A/Texas/50/2012 |
2015
| pH1N1 | 9742 | 157 | 79 | A/California/07/2009 | 1688 | 47 | 23 | A/Switzerland/9715293/2013 |
2016
| pH1N1 | 8320 | 60 | 2 | A/California/07/2009 | 4784 | 47 | 8 | A/Hong Kong/4801/2014 |
2017
| pH1N1 | 10,143 | – | – | A/Michigan/45/2015 | 3891 | 10 | – | A/Hong Kong/4801/2014 |
Total
| 66,228 | 538 | 342 | 20,743 | 239 | 132 |
Molecular evolution analysis of HA and NA genes
Evolution rates and positive selection
Number of analysed sequences | Nucleotide substitution rate (×10−3) | Positive selection | |||||||
---|---|---|---|---|---|---|---|---|---|
Mean | 95% HPD | SLAC | FEL | ||||||
Mean (dN/dS) | N | AA | Mean (dN/dS) | N | AA | ||||
H1 | 512 | 1.91 | 1.68–2.1 | 0.336 | 3 | 10–239-240 | 0.311 | 8 | 13–202–239-240-305-468-543-546 |
H3 | 239 | 1.47 | 1.27–1.68 | 0.299 | 1 | 239 | 0.267 | 2 | 19–69 |
N1 | 342 | 1.93 | 1.7–2 | 0.352 | – | – | 0.311 | – | – |
N2 | 132 | 3.09 | 2.5–3.1 | 0.238 | – | – | 0.208 | – | – |
Phylogenetic tree analysis
Amino acid sequence variation in HA and NA proteins
Genetic characterization of pH1N1 viruses
Molecular evolution of H1 protein
H1 | Similarity to pH1N1 vaccine strain (%) | ||||||
---|---|---|---|---|---|---|---|
Sa | Sb | Ca | |||||
179 | 180 | 202 | 220 | ||||
California/2009 | S | K | S | S | Sa | Sb | Ca |
2016 |
N
|
Q
|
T
|
T
| 84.6 | 91.7 | 94.7 |
Michigan/2015a | N | Q | T | T |
Localization of potential glycosylation sequons in HA and NA of pH1N1 viruses | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
HA | ||||||||||||
Amino acid position | ||||||||||||
Year | Number of sequences analyzed | Stem | Head | Stem | ||||||||
2009 | 132 |
28
|
40
|
104
|
304
|
498
|
557
| |||||
2010 | 68 |
28
|
40
|
104
|
304
|
498
|
557
| |||||
2011 | 36 |
28
|
40
|
104
| 179 (2%) |
304
|
498
|
557
| ||||
2012 | 27 |
28
|
40
|
104
|
304
|
498
|
557
| |||||
2013 | 33 |
28
|
40
|
104
|
304
| 489* (18%) |
498
|
557
| ||||
2014 | 25 |
28
|
40
|
104
|
304
|
498
|
557
| |||||
2015 | 157 |
28
|
40
|
104
| 179 (85%) |
304
|
498
|
557
| ||||
2016 | 60 |
28
|
40
|
104
| 179 (99%) |
304
| 489* (2%) |
498
|
557
| |||
NA | ||||||||||||
Year | Number of sequences analyzed | Amino acid position | ||||||||||
2009 | 97 | 44 (11%) |
50
|
58
|
63
|
68
|
88
|
146
|
235
|
386
| ||
2010 | 42 | 42 (36%) |
50
|
58
|
63
|
68
|
88
|
146
|
235
|
386
| ||
2011 | 36 | 42 (100%) |
50
|
58
|
63
|
68
|
88
|
146
|
235
|
386*
| ||
2012 | 29 | 28 (7%) | 42 (100%) |
50
|
58
|
63
|
68
|
88
|
146
|
235
|
386
| |
2013 | 32 | 42 (100%) |
50
|
58
|
63
|
68
|
88
|
146
|
235
|
386
| ||
2014 | 25 | 42 (100%) |
50
|
58
|
63
|
68
|
88
|
146
|
235
|
386
| ||
2015 | 79 | 42 (100%) |
50
|
58
|
63
|
68
|
88
|
146
|
235
|
386
| ||
2016 | 3 | 42 (100%) |
50
|
58
|
63
|
68
|
88
|
146
|
235
|
Molecular evolution of N1 protein
Genetic characterization of H3N2 viruses
Molecular evolution of H3 protein
H3 | Similarity to vaccine strains (%) | |||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
A1 | A2 | A3 | B1 | B2 | C1 | C2 | E1 | D | ||||||||||||||||||||||||
121 | 135 | 142 | 144 | 145 | 157 | 158 | 159 | 160 | 189 | 194 | 53 | 278 | 62 | 212 | 215 | |||||||||||||||||
Brisbane/2007 | N | T | R | N | N | L | K | F | K | N | L | D | N | E | T | P |
A
|
B
|
C
|
E
|
D
| |||||||||||
2009 | – | – | – | – | – | – | K |
N
| – | – |
K
| – | – | – | – | – |
A
| – | 100 | 89.5 | 100 | 100 | 97 | |||||||||
Perth/2009 | N | T | R |
K
| N | L |
N
| F | K |
K
| L | D | N |
K
| T |
S
| ||||||||||||||||
2012 | – | – | – | K |
N
| N |
S
| – | – | – | – | – | – | D |
A
|
P
| – |
A
|
P
|
90
|
100
|
84
| 100 |
88
| ||||||||
Victoria/2011 | N | T | R | N | N | L |
N
| F | K |
K
| L | D | N | E |
A
| P | ||||||||||||||||
2014 | – | – |
G
| – |
S
| L |
S
| – | F |
Y
|
S
| – | – | – | – | – | – | – | – | 90 | 89.5 | 84 | 100 | 100 | ||||||||
Texas/2012 | N | T | R | N | N | L |
N
| F | K |
K
| L | D |
K
| E |
A
| P | ||||||||||||||||
2015 | – | – | R |
G
|
S
|
S
| – | – |
Y
|
T
| – | – | – | – | – | – | – | 86.3 | 89.5 | 100 | 100 | 100 | ||||||||||
Switzerland/2013 | N | T |
G
| N |
S
| L |
N
|
S
| K |
K
| L | D |
K
| E |
A
| P | ||||||||||||||||
2016 | N |
K
| – | G |
R
|
S
|
K
| – | – | – |
F
|
Y
| – | – | – | – |
N
| K |
N
| – | – | – | 82 | 89.5 | 66.6 | 100 | 100 | |||||
Hong Kong/2014 | N | T | R |
S
|
S
| L |
N
|
Y
| K |
K
|
P
| D |
K
| E |
A
| P | ||||||||||||||||
2017 |
K
|
K
| R |
G
| S |
K
| – | – | – | – |
T
| – |
L
| – | – | – | – | – | 77 | 89.5 | 100 | 100 | 100 |
Reference strain | AA substitution | Antigenic Site | 2009 (%) | 2011 (%) | 2012 (%) | 2013 (%) | 2014 (%) | 2015 (%) | 2016 (%) | 2017 (%) | ||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Brisbane/2007 | HA1 | L3I | 7 | 5 | 85 | 64 | 100 | |||||
Q33R | 9 | 19 | 80 | 93 | 68 | 100 | ||||||
S45 N | 23 | 5 | 69 | 90 | 96 | 83 | 100 | |||||
T48I | 23 | 5 | 54 | 87.5 | 91 | 72 | 100 | |||||
D53N |
C1
| 9 | 32 | 30.7 | 4 | 4 | 20 | |||||
E50R | 10 | |||||||||||
Q75H | 10 | |||||||||||
N81K |
E2
| 8 | 6 | |||||||||
Y94H | 9 | 38 | 12.5 | 4 | ||||||||
N121K |
A1
| 4 | 25.5 | 80 | ||||||||
T128A | 31 | 72.5 | 6 | 12.7 | ||||||||
R142 | G |
A3
| 9 | 2 | 30 | 72 | 10 | 12.7 | 30 | |||
K | 13.5 | 12.7 | 10 | |||||||||
N144 | S |
A3
| 5 | 87 | 40 | 60 | ||||||
K | 21 | 30 | ||||||||||
N145S |
A3
| 59 | 48.5 | 100 | 100 | 100 | 100 | 100 | ||||
L157S |
B1
| 4 | 4 | 15 | 17.5 | 4 | 6 | |||||
F159 | Y |
B1
| 15 | 87 | 62 | 100 | ||||||
S | 15 | 8 | ||||||||||
K160 T |
B1
| 83 | 53 | 90 | ||||||||
N171K | 6 | 19 | 60 | |||||||||
N189K |
B2
| 73 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |||
A198S | 77 | 43 | 61.5 | 100 | 91 | 83 | 100 | |||||
T212A |
D
| 63.6 | 95 | 100 | 84 | 100 | 91 | 85 | 100 | |||
V223I | 91 | 46 | 84 | 100 | 100 | 85 | 100 | |||||
N225D | 20 | 89 | 68 | 100 | ||||||||
R261Q | 15 | 8 | 20 | |||||||||
N278K |
C2
| 9 | 54 | 97 | 93 | 74 | 100 | |||||
E280A | 4 | 38 | 4 | |||||||||
Q311H | 7 | 85 | 59 | 60 | ||||||||
N312S | 82 | 91 | 62 | 69 | 100 | 100 | 91 | 100 | ||||
HA2 | V347 | M | 9 | 23 | 15 | |||||||
K | 22 | 40 | 4 | |||||||||
G484E
| 2 | 27.5 | 50 | |||||||||
D487N | 59 | 43 | 23 | |||||||||
D489N | 9 | 13 | 15 | 12 | 87 | 64 | 100 | |||||
Total number of substitutions/years |
5
|
17
|
16
|
19
|
23
|
25
|
28
|
23
|
Localization of potential glycosylation sequons in HA and NA of H3N2 viruses | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
HA | ||||||||||||
Amino acid position | ||||||||||||
Year | Number of sequences analyzed | Stem | Head | Stem | ||||||||
2009 | 22 |
8
|
22
|
63
|
126
|
133
|
165
|
246
|
385
|
483
| ||
2011 | 22 |
8
|
22
| 45(23%) |
63
|
126
|
133
|
165
|
246
|
385
|
483
| |
2012 | 37 |
8
|
22
| 45(5%) |
63
|
126
|
133
|
165
|
246
|
385
|
483
| |
2013 | 13 |
8
|
22
| 45(69%) |
63
|
126
|
133
|
165
|
246
|
385
|
483
| |
2014 | 40 |
8
|
22
| 45(92%) |
63
|
126
|
133
| 158(5%) |
165
|
246
|
385
|
483
|
2015 | 47 |
8
|
22
| 45(100%) |
63
|
126
|
133
| 158(81%) |
165
|
246
|
385
|
483
|
2016 | 47 |
8
|
22
| 45(80%) |
63
|
126
|
133
| 158(53%) |
165
|
246
|
385
|
483
|
2017 | 10 |
8
|
22
| 45(100%) |
63
|
126
|
133
| 158(100%) |
165
|
246
|
385
|
483
|
NA | ||||||||||||
Year | Number of sequences analyzed | Amino acid position | ||||||||||
2009 | 5 |
61
|
70
|
86
|
146
|
234
| ||||||
2011 | 22 |
61
|
70
|
86
|
146
|
234
| 367 (90.6%) | |||||
2012 | 32 |
61
|
70
|
86
|
146
|
234
| 367 (100%) | |||||
2013 | 8 |
61
|
70
|
86
|
146
|
234
| 367 (100%) | |||||
2014 | 33 |
61
|
70
|
86
|
146
|
234
| 367 (100%) | |||||
2015 | 23 |
61
|
70
|
86
|
146
|
234
| 367 (100%) | |||||
2016 | 8 |
61
|
70
|
86
|
146
|
234
| 367 (100%) |