Background
Methods
Data sources and pseudogene acquisition
Establishment of pseudogene pair-based prognostic signature
Validation of the prognostic performance of the pseudogene pair model
Comparison with other clinicopathological features and the novel prognostic model
Identification and enrichment analysis of pseudogene-related protein-coding genes
Statistical analysis
Results
Establishing the pseudogene pair-based signature
Variables | Subgroups | TCGA (N = 370) | ICGC(N = 231) |
---|---|---|---|
Age | < 60 | 169 | 44 |
> = 60 | 201 | 187 | |
Sex | Male | 249 | 179 |
Female | 121 | 62 | |
Stage | I | 171 | 36 |
II | 85 | 104 | |
III | 85 | 72 | |
IV | 5 | 19 | |
NA | 24 | 0 | |
Grade | I | 55 | – |
II | 177 | – | |
III | 121 | – | |
IV | 12 | – | |
NA | 5 | – | |
Survival status | Dead | 130 | 42 |
Living | 240 | 189 | |
Vascular invasion | Positive | 108 | – |
Negative | 206 | – | |
NA | 56 | – | |
Family history | Positive | 112 | 73 |
Negative | 207 | 143 | |
NA | 51 | 15 | |
Prior malignancy | Positive | – | 29 |
Negative | – | 202 | |
NA | – | 0 |
Genepair1 | Full name | Genepair2 | Full name | Coef |
---|---|---|---|---|
ABCC6P2 | ATP binding cassette subfamily C member 6 pseudogene 2 | DSTNP2 | DSTN pseudogene 2 | −0.133577486 |
ANXA2P2 | annexin A2 pseudogene 2 | AZGP1P1 | AZGP1 pseudogene 1 | 0.06815618 |
ANXA2P2 | annexin A2 pseudogene 2 | HLA-J | major histocompatibility complex, class I, J | 0.337854755 |
AQP7P1 | aquaporin 7 pseudogene 1 | HLA-J | major histocompatibility complex, class I, J | 0.433464122 |
AQP7P1 | aquaporin 7 pseudogene 1 | MT1DP | metallothionein 1D, pseudogene | 0.220401079 |
AZGP1P1 | AZGP1 pseudogene 1 | CYP21A1P | cytochrome P450 family 21 subfamily A member 1, pseudogene | −0.171662304 |
AZGP1P1 | AZGP1 pseudogene 1 | GGTA1P | glycoprotein alpha-galactosyltransferase 1, pseudogene | −0.330772998 |
C3P1 | complement component 3 precursor pseudogene | MT1L | metallothionein 1 L, pseudogene | −0.211202632 |
CA5BP1 | carbonic anhydrase 5B pseudogene 1 | LPAL2 | lipoprotein(a) like 2, pseudogene | 0.140891921 |
DSTNP2 | DSTN pseudogene 2 | PLGLA | plasminogen like A | 0.139199981 |
DSTNP2 | DSTN pseudogene 2 | WASH3P | WASP family homolog 3, pseudogene | 0.332685477 |
HLA-J | major histocompatibility complex, class I, J | MSTO2P | misato family member 2, pseudogene | −0.356768111 |
HLA-J | major histocompatibility complex, class I, J | RP9P | RP9 pseudogene | −0.035991571 |
HSPA7 | heat shock protein family A (Hsp70) member 7 (pseudogene) | NAPSB | napsin B aspartic peptidase, pseudogene | 0.384325838 |
LPAL2 | lipoprotein(a) like 2, pseudogene | PLGLA | plasminogen like A | 0.092279424 |
NAPSB | napsin B aspartic peptidase, pseudogene | NSUN5P1 | NSUN5 pseudogene 1 | −0.339252375 |
NUDT16P1 | nudix hydrolase 16 pseudogene 1 | PLGLA | plasminogen like A | 0.20989673 |
PLGLA | plasminogen like A | RP9P | RP9 pseudogene | −0.137033874 |
RP9P | RP9 pseudogene | WASH3P | WASP family homolog 3, pseudogene | 0.424813675 |
Association between signature risk score and clinical characteristics
Validation and assessment of the established signature
Univariate analysis | Multivariate analysis | |||||
---|---|---|---|---|---|---|
HR | 95%CI | P-value | HR | 95%CI | P-value | |
TCGA cohort | ||||||
Age | 1.01 | 0.996–1.025 | 0.174 | 1.01 | 0.996–1.024 | 0.168 |
Sex | 0.776 | 0.531–1.132 | 0.188 | 0.912 | 0.614–1.353 | 0.646 |
Grade | 1.133 | 0.881–1.456 | 0.33 | 0.927 | 0.706–1.219 | 0.588 |
Stage | 1.68 | 1.369–2.062 | < 0.0001 | 1.33 | 1.070–1.654 | 0.01 |
riskScore | 3.583 | 2.726–4.709 | < 0.0001 | 3.416 | 2.551–4.576 | < 0.0001 |
ICGC cohort | ||||||
Sex | 0.515 | 0.270–0.982 | 0.044 | 0.42 | 0.215–0.819 | 0.011 |
Age | 0.998 | 0.966–1.032 | 0.917 | 0.989 | 0.955–1.025 | 0.558 |
Stage | 2.238 | 1.532–3.269 | < 0.0001 | 2.16 | 1.459–3.198 | 0.0001 |
Prior malignancy | 1.658 | 0.692–3.975 | 0.257 | 2.287 | 0.912–5.734 | 0.078 |
Cancer history | 0.794 | 0.404–1.563 | 0.505 | 0.706 | 0.351–1.421 | 0.329 |
riskScore | 2.337 | 1.490–3.664 | 0.0002 | 1.902 | 1.201–3.014 | 0.006 |
Comparison with previous existed prognostic signatures
Functional analysis of co-expression genes
ID | Description | P value | P adjust |
---|---|---|---|
GO:0004896 | cytokine receptor activity | 1.64E-11 | 6.31E-09 |
GO:0001637 | G protein-coupled chemoattractant receptor activity | 4.05E-08 | 3.23E-06 |
GO:0004950 | chemokine receptor activity | 4.05E-08 | 3.23E-06 |
GO:0019955 | cytokine binding | 4.21E-08 | 3.23E-06 |
GO:0016493 | C-C chemokine receptor activity | 1.65E-07 | 9.05E-06 |
GO:0019957 | C-C chemokine binding | 2.54E-07 | 1.30E-05 |
GO:0019956 | chemokine binding | 3.92E-07 | 1.88E-05 |
GO:0023023 | MHC protein complex binding | 6.72E-07 | 3.04E-05 |
GO:0042287 | MHC protein binding | 1.33E-06 | 5.67E-05 |
GO:0032395 | MHC class II receptor activity | 2.54E-05 | 0.001027582 |
GO:0030246 | carbohydrate binding | 0.000143695 | 0.004598232 |
GO:0001608 | G protein-coupled nucleotide receptor activity | 0.000175825 | 0.005193614 |
GO:0045028 | G protein-coupled purinergic nucleotide receptor activity | 0.000175825 | 0.005193614 |
GO:0030695 | GTPase regulator activity | 0.000508907 | 0.012607749 |
KEGG:hsa04662 | B cell receptor signaling pathway | 4.67E-12 | 1.85E-10 |
KEGG:hsa04062 | Chemokine signaling pathway | 2.79E-09 | 4.07E-08 |
KEGG:hsa04660 | T cell receptor signaling pathway | 1.36E-07 | 1.45E-06 |
KEGG:hsa04650 | Natural killer cell mediated cytotoxicity | 3.92E-07 | 4.02E-06 |
KEGG:hsa04060 | Cytokine-cytokine receptor interaction | 1.07E-06 | 1.03E-05 |
KEGG:hsa04064 | NF-kappa B signaling pathway | 0.000577911 | 0.00390442 |
KEGG:hsa05235 | PD-L1 expression and PD-1 checkpoint pathway in cancer | 0.001387015 | 0.008934958 |
KEGG:hsa05231 | Choline metabolism in cancer | 0.008939182 | 0.047618336 |