Mutational landscape of the surface antigen of hepatitis B virus in patients with hepatocellular carcinoma
- Open Access
- 01.12.2025
- Research
Abstract
Introduction
Results
The detection of three mutations in HBsAg with significantly high frequency in HCC patients
The mutational profile at the three sites was significantly associated with HCC in a multivariate analysis
Evidence of positive selection was detected at the three sites
The regions spanning the mutations are important for viral morphogenesis and recognition by immunity
The structural analysis suggests an effect of N103D on the inter-molecule interactions of HBsAg
N103D is associated with a reduction in the score of predicted B and T cell epitopes
Tables
Age | Sex | BMI | ALT | HBV DNA | |
|---|---|---|---|---|---|
(years) | No. (%) | (IU/l) | (IU/ml) | ||
Inactive | 39 | Male= 119 (66%) | 27.55 | 27.00 | 197.0 |
(n = 180) | (31.0–50.0) | Female= 61 (34%) | (24.77–31.83) | (19.00–41.00) | (19.00–1329) |
Active | 37 | Male= 51 (82.3%) | 28.07 | 61.00 | 1.1x10\(^{6}\) |
(n = 62) | (26–45.25) | Female= 11 (17.7%) | (21.47–31.93) | (40.5–100) | (52932–9.4x10\(^{7}\)) |
LC | 54 | Male= 27 (75%) | 26.15 | 38.50 | 1522 |
(n = 36) | (44–59.50) | Female= 9 (25%) | (22.35–29.36) | (27–66) | (57–5.3x10\(^{4}\)) |
HCC | 62 | Male= 25 (96.2%) | 23.98 | 56.50 | 12967 |
(n = 26) | (52.5–70) | Female= 1 (3.8%) | (21.62–27.35) | (39.50–79.25) | (481.8–7.4x10\(^{5}\)) |
Total | |||||
(n = 304) |
Univariate logistic regression analysis | |||||||||
|---|---|---|---|---|---|---|---|---|---|
HCC | non-HCC | Odds ratio | |Z| | p value | C-statistic | p value | Likelihood | p value | |
(95% CI) | ratio test | ||||||||
Age | 62 | 40 | 1.146 | 5.653 | <0.0001 | 0.882 | <0.0001 | 53.86 | <0.0001 |
(years) | (52.5–70) | (31–50) | (1.097–1.207) | ||||||
ALT | 56.5 | 34 | 1.000 | 0.051 | 0.9592 | 0.697 | 0.0052 | 0.003 | 0.9582 |
(IU/l) | (39.50–79.25) | (21–58) | (0.996–1.001) | ||||||
BMI | 23.98 | 27.44 | 0.8790 | 2.531 | 0.0114 | 0.712 | 0.0047 | 7.227 | 0.0072 |
(21.62–27.35) | (23.88–31.23) | (0.791–0.967) | |||||||
Sex | Male = 25 (96.2%) | Male = 197 (70.9%) | 10.28 | 2.266 | 0.0235 | 0.627 | 0.0330 | 10.51 | 0.0012 |
Female = 1 (3.8%) | Female = 81 (29%) | (2.123 −185.0) | |||||||
No. of | 1 | 0 | 2.695 | 4.003 | <0.0001 | 0.658 | 0.0078 | 15.18 | <0.0001 |
mutations | (0–1.25) | (0–1) | (1.658 −4.430) | ||||||
HBV DNA | 12967 | 782 | 1–2\({\times }\)10−8 | 0.770 | 0.4415 | 0.619 | 0.1334 | 1.134 | 0.2870 |
(IU/ml) | (481.8–7.4x105) | (47.25 to 21788) | (1–1\({\times }\)10−7 to | ||||||
1+2\({\times }\)10−9) | |||||||||
Multivariate logistic regression analysis | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
Odds ratio | |Z| | p value | VIF | C-statistic | p value | AIC | AIC | Pseudo | Reject null | |
(95% CI) | (no model) | (model) | R squared | hypothesis? (simpler model is correct) | ||||||
Age | 1.125 | 4.206 | <0.0001 | 1.060 | ||||||
(years) | (1.070–1.196) | |||||||||
Yes, p | ||||||||||
BMI | 0.821 | 2.561 | 0.0104 | 1.025 | 0.913 | <0.0001 | 116.7 | 78.27 | 0.39 | value |
(0.696–0.945) | <0.0001 | |||||||||
No. of | 2.197 | 1.062 | 0.2883 | 1.052 | ||||||
mutations (1) | (0.5034 −9.859) | |||||||||
No. of | 6.816 | 1.809 | 0.0705 | 1.036 | ||||||
mutations (2) | (0.717–54.28) | |||||||||
No. of | 25.70 | 2.167 | 0.0302 | 1.059 | ||||||
mutations (3) | (1.717 −810.9) | |||||||||
S ORF | Overlapping | ||||||||
|---|---|---|---|---|---|---|---|---|---|
Codon | MEME | FEL | SLAC | FUBAR | P ORF | MEME | FEL | SLAC | FUBAR |
Codon | |||||||||
pre-S | |||||||||
7 | \(\surd\) | 187 | \(\surd\) | \(\surd\) | \(\surd\) | ||||
40 | 220 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | ||||
54 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | 234 | ||||
73 | 253 | ||||||||
98 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | 278 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) |
103 | 283 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | ||||
130 | \(\surd\) | \(\surd\) | 310 | ||||||
136 | \(\surd\) | \(\surd\) | 316 | ||||||
147 | 327 | ||||||||
149 | \(\surd\) | 329 | |||||||
160 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | 340 | ||||
S | |||||||||
8 | \(\surd\) | \(\surd\) | 351 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | ||
30 | 373 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | ||||
110 | \(\surd\) | \(\surd\) | \(\surd\) | 453 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | |
204 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | 547 | ||||
207 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | 550 | ||||
208 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | 551 | ||||
210 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | 553 | ||||
213 | \(\surd\) | \(\surd\) | \(\surd\) | \(\surd\) | 556 | ||||
S ORF | Overlapping | ||
|---|---|---|---|
Codon | Selection | P ORF | Selection |
Codon | |||
pre-S | |||
7 | – | 187 | – |
40 | – | 220 | Positive |
54 | – | 234 | Positive |
73 | Positive | 253 | Negative |
98 | – | 278 | Positive |
103 | Positive | 283 | Positive |
130 | – | 310 | Positive |
136 | – | 316 | Negative |
147 | – | 327 | Negative |
149 | – | 329 | Negative |
160 | – | 340 | Negative |
S | |||
8 | – | 351 | – |
30 | Positive | 373 | Positive |
110 | – | 453 | – |
204 | – | 547 | – |
207 | Positive | 550 | Negative |
208 | Positive | 551 | Negative |
210 | – | 553 | – |
213 | – | 556 | – |
Position in pre-S/S | Region | Function | Reference | Corresponding Position in polymerase | Region | Function | Reference |
|---|---|---|---|---|---|---|---|
103 | Pre-S1 C terminus | aa 94–117 B/T cell epitope, aa 81–105 cytosolic anchorage domain (heat shock protein Hsc70 binding site), aa 103–127 nucleocapsid binding site. | 283 | Spacer domain | Unknown | [39] | |
30 | S (first cytosolic domain CYL-I) | S antigen secretion and incorporation into nascent virions. | 373 | RT domain (N-Terminus) | Part of conserved region involved in pgRNA packaging. | ||
208 | S (TM4) | S antigen secretion. | [31] | 551 | Downstream of the YMDD motif (aa 538–541) | Unknown |