Skip to main content
Erschienen in: BMC Cancer 1/2018

Open Access 01.12.2018 | Research article

Discovering novel SNPs that are correlated with patient outcome in a Singaporean cancer patient cohort treated with gemcitabine-based chemotherapy

verfasst von: Vachiranee Limviphuvadh, Chee Seng Tan, Fumikazu Konishi, Piroon Jenjaroenpun, Joy Shengnan Xiang, Yuliya Kremenska, Yar Soe Mu, Nicholas Syn, Soo Chin Lee, Ross A. Soo, Frank Eisenhaber, Sebastian Maurer-Stroh, Wei Peng Yong

Erschienen in: BMC Cancer | Ausgabe 1/2018

Abstract

Background

Single Nucleotide Polymorphisms (SNPs) can influence patient outcome such as drug response and toxicity after drug intervention. The purpose of this study is to develop a systematic pathway approach to accurately and efficiently predict novel non-synonymous SNPs (nsSNPs) that could be causative to gemcitabine-based chemotherapy treatment outcome in Singaporean non-small cell lung cancer (NSCLC) patients.

Methods

Using a pathway approach that incorporates comprehensive protein-protein interaction data to systematically extend the gemcitabine pharmacologic pathway, we identified 77 related nsSNPs, common in the Singaporean population. After that, we used five computational criteria to prioritize the SNPs based on their importance for protein function. We specifically selected and screened six candidate SNPs in a patient cohort with NSCLC treated with gemcitabine-based chemotherapy.

Result

We performed survival analysis followed by hematologic toxicity analyses and found that three of six candidate SNPs are significantly correlated with the patient outcome (P < 0.05) i.e. ABCG2 Q141K (rs2231142), SLC29A3 S158F (rs780668) and POLR2A N764K (rs2228130).

Conclusions

Our computational SNP candidate enrichment workflow approach was able to identify several high confidence biomarkers predictive for personalized drug treatment outcome while providing a rationale for a molecular mechanism of the SNP effect.

Trial registration

NCT00695994. Registered 10 June, 2008 ‘retrospectively registered’.
Hinweise

Electronic supplementary material

The online version of this article (https://​doi.​org/​10.​1186/​s12885-018-4471-x) contains supplementary material, which is available to authorized users.
Abkürzungen
CI
Confidence Interval
ddG
Free energy change
dFdCTP
Gemcitabine triphosphate
MAF
Minor allele frequency
NSCLC
Non-small cell lung cancer
nsSNPs
non-synonymous SNPs
OS
Overall survival
PFS
Progression free survival
SNPs
Single Nucleotide Polymorphisms

Background

Gemcitabine (2′-2′ difluorodeoxycytidine) is a deoxycytidine analogue with antitumor activity against a variety of solid tumors such as non-small cell lung cancer (NSCLC), breast cancer [1] and pancreatic cancer [2]. Gemcitabine requires phosphorylation to mono-, di-, and triphosphates (dFdCTP) to be active. This mechanism results in a unique pattern of self-potentiation of the drug and when this drug is incorporated into the DNA during replication, it causes chain termination. Gemcitabine also has multiple intracellular targets. Up- or downregulation of these targets may confer resistance to this drug.
Wider availability and lower costs of genome and expression profile sequencing made application of those techniques in clinical practice feasible; thus, the scientific question of how patient-specific mutations and chromosomal aberrations influence personal clinical outcomes via biomolecular mechanisms has become acute [3, 4]. For example, pharmacogenetics studies published in the last decades have provided evidence that Single Nucleotide Polymorphisms (SNPs) can causally influence patient outcome such as drug response and toxicity after drug intervention [5]. Most SNPs associated with patient outcome have been found in genes involved in the drug pharmacology i.e. affecting drug transport, metabolism and/or activity with drugs. Soo et al. tested 26 SNPs from nine genes that are already known to be directly associated with gemcitabine transport, metabolism and activity [6]. They found several SNPs that were associated with patient outcome in Singaporean NSCLC patients treated with gemcitabine [6]. However, a systematic approach to investigate the relationship between gene variants and patient outcome is still lacking. Therefore, the purpose of this study is to develop a systematic pathway approach to accurately and efficiently predict novel non-synonymous SNPs (nsSNPs) that could be causative to gemcitabine-based chemotherapy treatment outcome in Singaporean NSCLC patients. After detailed SNP analysis, we prioritized the SNPs based on their importance in protein function and molecular mechanism. From the top-ranking SNPs, we specifically selected six final candidate SNPs for clinical validation. We genotyped these six SNPs in a Singaporean patient cohort and have found that three out of the six SNPs correlated with patient outcome.

Methods

Pyrimidine metabolism as a starting pathway to find more genes in the gemcitabine pharmacologic pathway

Pyrimidine metabolism is known to be critical in the pharmacologic pathway of gemcitabine, which is a pyrimidine analogue. Therefore, we used the pyrimidine metabolism pathway in KEGG (hsa00240; http://​www.​genome.​jp/​kegg-bin/​show_​pathway?​org_​name=​hsa&​mapno=​00240&​mapscale=​&​show_​fdescription=​show) [7] which contains 100 genes as a starting point. In addition, we also did extensive literature search to find more genes that are directly associated with gemcitabine transport [6, 8, 9]. Apart from literature review, information from PharmGKB (http://​www.​pharmgkb.​org/​) has also been referenced. As a result, six membrane transporters implicated in the uptake of gemcitabine i.e. SLC28A1 (Entrez GeneID:9154), SLC28A2 (Entrez GeneID:9153), SLC28A3 (Entrez GeneID:64078), SLC29A1 (Entrez GeneID:2030), SLC29A2 (Entrez GeneID:3177), SLC29A3 (Entrez GeneID:55315) and three transporters implicated in the efflux of gemcitabine i.e. ABCC5 (Entrez GeneID:10057), ABCC10 (Entrez GeneID:89845) and ABCG2 (Entrez GeneID:9429) were added to the pathway. In total, 109 genes (the 100 genes in pyrimidine metabolism and the 9 membrane transporter genes) were used as starting proteins to find more interaction partners by using our in-house comprehensive protein-protein interaction (PPI) data.

Adding more potentially related proteins to the pathway using comprehensive PPI data

We used our in-house comprehensive PPI data to find additional proteins that could be related to the gemcitabine pharmacologic pathway. Comprehensive PPI data was consolidated by integrating experimentally-validated PPIs from nine databases, i.e. BIND [10], BioGRID [11], IntAct [12], DIP [13], MINT [14], MPact [15], HPRD [16], GNP (http://​genomenetwork.​nig.​ac.​jp/​index_​e.​html) and MPPI [17], to provide unique PPIs together with the accumulation of evidence such as experimental type and PubMed IDs. The method we used to integrate multiple databases is provided in detail in this website i.e. http://​ipid.​bii.​a-star.​edu.​sg/​annie/​home.​do#ui-tabs-1. The final interaction set contains 1,148,484 unique PPIs including 227,731 human PPIs. We used only human PPIs in this study. We extended the pyrimidine metabolism in KEGG (hsa00240) using the conservative requirement that the new protein must have been reported to interact with at least two out of the 100 proteins that are already in the pathway. After that, we collected nsSNPs from NCBI/dbSNPs build 136 [18] that are linked to these human genes using the NCBI E-utilities tool [19] with search terms “missense”, “nonsense” or “frameshift”. Information from databases “ensembl_mart_66” and “homo_sapiens_variation_66_37” [20] was then used to annotate each of the nsSNPs retrieved from E-utilities i.e. Ensembl’s genotype, Ensembl’s transcription ID, Ensembl’s consequence type, NCBI’s consequence type, HGVS genomic, HGVS coding, HGVS protein, PolyPhen-2 and SIFT prediction for reference. A script was written in python and was run on 1st March 2012.

Finding common SNPs in the Singaporean population

We used allele frequency information from the Singapore Genome Variation Project (SGVP) [21] to find common SNPs in the Singaporean population among the retrieved nsSNPs. The SGVP provides a publicly available resource of 1.6 million SNPs genotyped in 268 individuals from the Chinese, Malay, and Indian ethnicities in the Singaporean population. In this study, a common SNP is defined as one with a minor allele frequency (MAF) of ≥5% in at least one out of three ethnic groups i.e. Chinese, Malays or Indians.

Five criteria to filter candidate SNPs

After common SNPs are selected from the retrieved SNPs, five criteria were used to narrow down the common SNPs to select only those that are likely to affect protein function i.e.
4.1)
The SNP’s MAF from SGVP is specifically higher in the Singaporean Chinese compared to the Singaporean Malay or the Singaporean Indian ethnicity since about 87% of our patients are Singaporean Chinese (refer to Table 1).
 
4.2)
Result of PolyPhen-2 [22] prediction is “possibly damaging” or “probably damaging” by using rsID of each SNP as input. We used the batch query option of PolyPhen-2 with HumDiv classifier model and genome assembly GRCh37/hg19. For those SNPs that could not retrieve result from the batch query option, we input rsID one by one to the PolyPhen-2 website (http://​genetics.​bwh.​harvard.​edu/​pph2/​) to retrieve the result.
 
4.3)
Result of SIFT [23] prediction is “Affect protein function”. SIFT results were first retrieved using “SIFT dbSNP batch tool” which was run on 21 March 2012 to pre-screen the results. After that, orthologue sequences (select only “1:1 orthologs”) were retrieved from either OMA browser [24] or Orthologue search against NCBI Non-redundant protein set on ANNOTATOR [25] and were used to create a multiple sequence alignment with MAFFT (L-INS-I settings) [26]. We deleted those sequences that have large gaps using Jalview [27].
 
4.4)
A SNP is located in the functional domain of a protein. We used the amino acid sequence of the gene that the SNP is located in as input to do “Prim-Seq-An w/Pfam” analysis in ANNOTATOR [25] using default settings which include HMMER against many protein domain databases e.g. SMART, Pfam to retrieve functional domain information of the protein. Later, we annotated whether a SNP is located in any functional domain of the corresponding protein or not.
 
4.5)
Average free energy change (ddG, kcal/mol) of the protein by the SNP as predicted by FoldX from 5 runs [28] is significant i.e. more than 0.5 kcal/mol or less than − 0.5 kcal/mol. The menu option “Mutate residue” in the FoldX plugin for YASARA [29] was used to predict free energy changes of the protein when the wild-type amino acid is mutated to another amino acid to predict the effect of SNPs on protein structure. The structure of the protein associated with the SNP of interest was energy minimized using the “RepairPDB” function in FoldX before mutating the residue from wild-type amino acid to the SNP’s amino acid and calculating stability change. To perform this analysis, a 3D protein structure or homology model is needed, so a template or crystal structure that contains the SNP’s region is retrieved by using either “NCBI-BLAST” of the protein sequence against PDB (E-Value cutoff 0.001 with BLOSUM62 matrix) or HHPRED against PDB (E-Value cutoff 0.001) on ANNOTATOR [25]. If there is a crystal structure available where the SNP is located, we use the crystal structure as an input to FoldX. For SNPs in proteins without crystal structures but found to have appropriate homologous template structures, we model the structure by homology modeling using MODELLER [30] with loop refinement.
 
Table 1
Characteristics of patients who were treated with gemcitabine-based chemotherapy
Characteristics at diagnosis
NSCLC patients (n = 92)a
Ethnicity
 
  Chinese
80
  Malay
9
  Indian
0
  Others
2
  No data
1
Gender
 
  Male
67
  Female
24
  No data
1
Stage of Cancer
 
  Stage III
14
  Stage IV
77
  No data
1
Performance Status (ECOG)
 
  0
58
  1
33
  No data
1
aCould not retrieve any data from one patient and there is another patient who had no survival data
Finally, after consideration of the five criteria in each of the 77 SNPs, we selected only the top-ranking candidate SNPs for genotyping in the Singaporean patient cohort with known clinical trial data.

Study population

The Singapore National Healthcare Group Domain Specific Review Board reviewed and approved the study. All the patients provided written informed consent before study entry. The study was conducted in accordance to Good Clinical Practice guidelines. A total of 92 non-small cell lung cancer (NSCLC) patients were recruited for the study and were analysed. All the patients received their treatments in the Department of Haematology-Oncology at National University Hospital of Singapore. Patients with not more than two lines of prior systemic chemotherapy were recruited to receive gemcitabine (750-1000 mg/m2 on day 1 and day 8) and carboplatin (AUC 5 mg/ml on day 1) every 3 weeks. Radiographic assessments were done to evaluate tumor response every two cycles according to RECIST criteria. Safety assessments were performed at every cycle including weekly full blood counts to monitor haematological toxicities. Demographic profiles of the patients are summarized in Table 1. We could not retrieve any data from one NSCLC patient and there is another stage IV NSCLC Chinese, male with ECOG = 0 patient who had no survival data as well. So in total, 90 NSCLC patients were available for survival and toxicity analysis.

Blood collection and genomic DNA extraction

A total of 8 ml peripheral blood was obtained from each patient. The blood was drawn into heparinized vacutainer tubes (Becton Dickinson) and mononuclear cells isolated by Ficoll-Hypaque density gradient centrifugation according to manufacturer’s instructions (GE Healthcare, Chalfont St Giles, United Kingdom). The DNA in turn was extracted from the mononuclear cells using the Puregene DNA purification kit (Gentra Systems, Minneapolis, MN).

PCR (polymerase chain reaction) and pyrosequencing

First, PCR products were immobilized on streptavidin-coated beads and denatured to produce single-stranded products. Pyrosequencing was performed using the PyroMark Gold Q24 reagent and the PyroMark Q24 system (Qiagen), according to the manufacturer’s protocol. Primers for pyrosequencing were designed with the PyroMark Assay Design Software 2.0. Primers, including biotin-labelled and sequencing primers are represented in Additional file 1: Table S1. Sequencing analysis was performed using PyroMark Q24 version 2.0.6 software in the allele quantification analysis (QA) mode.

Statistical analysis to find correlation between the candidate SNPs and patient outcome

Kaplan-Meier methods and log-rank test were used to analyse results for overall survival and progression-free survival in the NSCLC patient cohort. Grade 3 or 4 haematological toxicities and its association with gene variants were analysed using Chi-squared test. All statistical analyses were two-sided and the SPSS software version 16.0 was used. P value of less than 0.05 were considered to indicate nominal statistical significance. Predictor variables – including gender, age, stage and ECOG), and the 6 SNPs – were initially correlated with categorical outcomes (grade 3/4 neutropenia and thrombocytopenia) using the chi-squared test, and with time-to-event outcomes (overall and progression free survival) using the log-rank test in univariate fashion. Next, clinical variables and SNPs which were found to be significant in the univariate analyses were included in multivariate Cox or logistic regression to obtain adjusted p values and effect sizes.

Results

5046 nsSNPs were found to be linked to the 178 genes in the gemcitabine pharmacologic pathway

The overall workflow and result in each step are described in Fig. 1. We used 100 proteins in the human pyrimidine metabolism metabolic pathway (KEGG:hsa00240) as a starting point and then used our in-house comprehensive PPI data which comprise of unique 227,731 human PPIs integrated from nine public databases (detail in Method section) to extend the pathway by using a conservative requirement that the new protein needs to connect to at least two other out of the 100 proteins that are already in the pyrimidine pathway. By using this criterion, we found an additional 69 proteins from the comprehensive PPI data that can be connected to the pathway. Therefore, 169 genes (100 genes in the pyrimidine metabolism and an additional 69 new genes) together with the 9 membrane transporters from literature review were used to find nsSNPs that are linked to these genes. By using NCBI E-utilities, 5046 nsSNPs were found to be linked to all 178 genes from dbSNPs. 2540 of these nsSNPs (50.34%) came from the newly added proteins. Next, using allele frequency data from SGVP [21], we found that only 77 (in 54 genes) of the retrieved nsSNPs have MAF with more than or equal to 5% in at least one ethnicity in the Singaporean population. We called these 77 nsSNPs as common SNPs in this study. These common SNPs contain 73 missense, 3 nonsense and 1 frameshift mutations. Detailed information of all common nsSNPs are described in Additional file 1: Table S2. Among the 77 common SNPs, eight were found to be previously tested in the Singaporean patient cohort in NSCLC [6] (please refer “*” after rsIDs in Additional file 1: Table S2). Out of the eight, three of them were proven to be associated with patient outcome i.e. SLC28A1 D521N (rs2242046), SLC28A2 P22L (rs11854484) and SLC28A2 S75R (rs1060896) although these SNPs passed only one or none in our criteria (Additional file 1: Table S2). Later, POLA2 G583R (rs487989) which is one of the eight SNPs and passed two of our criteria was proven to be strongly associated with mortality rate and survival time among Singaporean NSCLC patients treated with gemcitabine [31].

15 out of 77 common nsSNPs have significant results in three out of five criteria

We used five criteria (as described in detail in the Method section) to narrow down the common SNPs to select only those that are likely to affect protein function (Fig. 1). The first criterion was to select SNPs that have higher MAF in the Chinese population since the majority of our patients are Singaporean Chinese. 23 out of the 77 nsSNPs were found to match this criterion. The second and third criteria were based on prediction results from PolyPhen-2 and SIFT, respectively, both of which are evolutionary sequence conservation-based approaches. We used the batch-query tool of PolyPhen-2 to parse pre-calculated results of all common nsSNPs. For those SNPs that could not fetch results from the batch-query tool, we used RefSeq amino acid sequence ID of each gene as input to retrieve the result from the PolyPhen-2 website. From the PolyPhen-2 result (i.e. the second criteria), there were 17 common SNPs predicted to be either “probably” or “possibly damaging” (Additional file 1: Table S2). For the SIFT analysis (i.e. the third criteria), we used the “SIFT dbSNP batch tool” to retrieve prediction results for all common nsSNPs and found that only 9 of them were predicted as “Deleterious” with the SIFT score equal to or less than 0.05 (Additional file 1: Table S2). The fourth criterion is to check if the SNPs lie on any functional domain. To do this, we retrieved HMMER results against Pfam and SMART using “Prim-Seq-An w/Pfam” analysis on ANNOTATOR and found that 35 out of the 77 SNPs are located in a functional domain of their proteins. The final criterion was to investigate whether the SNP affects protein structural stability using FoldX. Only 27 common SNPs were in a region with a known structure or high similarity to a known structure which allows us to do homology modeling and FoldX analysis. 17 of them returned significant results from FoldX. Results of the five criteria of all 77 common nsSNPs are described in Additional file 1: Table S2.
Finally, we were able to narrow down the 77 common nsSNPs to 15 which have significant results in at least three out of the five criteria (Additional file 1: Table S2). We also considered some SNPs that retrieved border line results. Lastly, literature review was performed to understand the functional role of the genes associated with the SNPs to further select SNPs that are most likely to affect the gemcitabine pathway. Finally, we identified the following six SNPs, that is, ABCG2 Q141K (c.421C > A, rs2231142), SLC29A3 S158F (c.473C > T, rs780668), HELB T980I (C > T. rs1168312), NT5C2 D549E (c.1647C > T, rs3740387), POLR2A N764K (c.2292C > T, rs2228130) and CTDP1 T221M (c.662C > T, rs2279103) as final candidate SNPs based on the importance of the SNPs for protein function and drug-related molecular mechanisms (Fig. 2). Five out of the six final candidate SNPs passed three out of five criteria and only CTDP1 T221M is selected based on a borderline result of the computational selection criteria because it seemed plausible from our literature study. These SNPs are then genotyped in a Singaporean NSCLC patient cohort with known patient outcome for gemcitabine-based therapy.

Genotyping of the six final candidate SNPs

A total of 90 NSCLC patients that have survival data available were genotyped for the six final candidate SNPs. Genotype information of 90 samples is shown in Additional file 1: Table S3. Genotype of HELB T980I could not be retrieved from 2 out of 90 patients. Therefore, we used 88 NSCLC patient data to perform survival and toxicity analyses in the next step. We could not find any patient who has the TT genotype of POLR2A N764K.

ABCG2 Q141K and SLC29A3 S158F are associated with increased survival in NSCLC

Kaplan-Meier analysis was performed to determine any correlation of the six final candidate SNPs with overall survival (OS) and progression free survival (PFS) in the NSCLC patient cohort (n = 88). ABCG2 Q141K (c.421 C > A, rs2231142) was found to be associated with increased median PFS. Patients with CA/AA genotype were shown to have longer PFS compared to CC genotype i.e. 9.12 months [95% CI 1.83-16.4 months] vs 5.51 months [95% CI 4.31-6.71 months] respectively, HR 0.51 (95% CI 0.31-0.83), adjusted P = 0.007 (Table 2).
Table 2
Univariate and multivariate Cox regression analyses of progression-free survival (PFS) and overall survival (OS) of the six final candidate SNPs with clinical parameters in the NSCLC cohort
 
Factors/Genotype
Number/
Progression Free Survival (PFS)
Number/
Overall Survival (OS)
Median PFS (months)
Univariate analysis
Multivariate analysis
Median OS (months)
Univariate analysis
Multivariate analysis
 
HR (95% CI)
P value
HR (95%CI)
P value (adjusted)
 
HR (95% CI)
P value
HR (95%CI)
P value (adjusted)
Gender
Male
66
0.92 (0.55-1.55)
0.767
66
0.76 (0.43-1.36)
0.357
Female
22
22
Age
< 62
42
0.81 (0.51-1.28)
0.370
42
0.85 (0.51-1.40)
0.523
≥62
46
46
Stage
3
19
1.59 (0.91-2.76)
0.100
19
1.60 (0.84-3.02)
0.151
4
69
69
ECOG
0
27
1.22 (0.73-2.04)
0.451
27
1.95 (1.06-3.57)
0.031
2.13 (1.14-3.97)
0.018
1
61
61
ABCG2 Q141K
CC
5.51
0.48 (0.29-0.78)
0.003
0.51 (0.31-0.83)
0.007
11.57
0.52 (0.31-0.89)
0.017
0.61 (0.35-1.06)
0.080
CA/AA
9.12
20.19
SLC29A3 S158F
CC
5.08
0.56 (0.33-0.96)
0.034
0.64 (0.37-1.10)
0.108
8.43
0.52 (0.30-0.92)
0.024
0.49 (0.27-0.88)
0.017
CT/TT
7.84
17.64
NT5C2 D549Ea
CC
5.25
0.71 (0.41-1.21)
0.206
10.49
0.67 (0.38-1.17)
0.157
CT/TT
7.57
17.31
HELB T980I
CC
8.2
1.42 (0.90-2.25)
0.133
17.05
1.23 (0.75-2.04)
0.412
CT/TT
5.74
11.28
CTDP1 T221M
CC
7.02
1.24 (0.74-2.09)
0.421
13.35
1.10 (0.62-1.96)
0.741
CT/TT
6.56
16.85
POLR2A N764K
CC
7.02
1.15 (0.55-2.42)
0.711
13.34
1.14 (0.54-2.40)
0.734
CT
7.57
14.03
aLatest dbSNP build annotated this SNP as NT5C2 D549D (synonymous snp)
Data in bold are those that have p < 0.05
SLC29A3 S158F (c.473C > T, rs780668) was found to be associated with increased OS. Patients with CT/TT genotype were shown to have longer median OS compared to CC genotype i.e. 17.64 months [95% CI 10.55-24.73 months] vs 8.43 months [94% CI 1.21-15.64 months], HR 0.49 (95% CI 0.27-0.88), adjusted P = 0.017 (Table 2). Association with OS/PFS could not be found in four other variants (Table 2).

ABCG2 Q141K and POLR2A N764K are correlated with gemcitabine cytotoxicity

The ABCG2 Q141K variant (the CA/AA genotype) was not only associated with improved PFS but was also found to be associated with increased toxicity i.e. higher risk of grade 3 or 4 thrombocytopenia (low platelet count) compared to the wild-type genotype (CC) (70.7% vs 44.7% respectively, HR 3.79 (95% CI 1.42-10.1) adjusted P = 0.008) (Table 3). Interestingly, the wild-type CC genotype of POLR2A N764K variant was found to be associated with a higher risk of grade 3 or 4 thrombocytopenia at 61.5% compared to 20.0% of the SNP’s CT genotype, HR 0.18 (95% CI 0.03-0.98), adjusted P = 0.048 (Table 3).
Table 3
Univariate and multivariate analyses by chi square and logistic regression, respectively, of grade 3 or 4 neutropenia and thrombocytopenia of the six final candidate SNPs with clinical parameters in the NSCLC cohort
 
Factors/Genotype
Number/ Percentage of grade 3/4 neutropenia (all cycles)
Grade 3/4 neutropenia
Number/Percentage of grade 3/4 thrombocytopenia (all cycles)
Grade 3/4 thrombocytopenia
Univariate analysis
Multivariate analysis
Univariate analysis
Multivariate analysis
OR (95%CI)
P value
OR (95%CI)
P value (adjusted)
OR (95%CI)
P value
OR (95%CI)
P value (adjusted)
Gender
Male
66
2.74 (0.83-9.04)
0.097
66
4.78 (1.46-15.7)
0.01
4.44 (1.27-15.6)
0.020
Female
22
22
Age
< 62
42
1.03 (0.42-2.51)
0.942
42
0.56 (0.24-1.31)
0.178
≥62
46
46
Stage
3
19
2.21 (0.78-6.24)
0.136
19
2.84 (0.99-9.12)
0.052
4
69
69
ECOG
0
27
0.80 (0.30-2.14)
0.659
27
0.87 (0.35-2.17)
0.758
1
61
61
ABCG2 Q141K
CC
59.60%
2.10 (0.84-5.28)
0.113
44.70%
2.99 (1.23-7.25)
0.015
3.79 (1.42-10.1)
0.008
CA/AA
75.60%
70.70%
SLC29A3 S158F
CC
50.00%
2.67 (0.99-7.22)
0.054
45.60%
1.85 (0.70-4.89)
0.109
CT/TT
72.70%
60.60%
NT5C2 D549E*
CC
57.10%
1.76 (0.64-4.84)
0.272
57.10%
0.98 (0.37-2.65)
0.973
CT/TT
70.20%
56.70%
HELB T980I
CC
70.00%
0.73 (0.30-1.80)
0.499
56.00%
1.08 (0.46-2.53)
0.859
CT/TT
63.20%
57.90%
CTDP1 T221M
CC
66.70%
1.07 (0.38-3.01)
0.090
54.60%
1.46 (0.54-3.94)
0.457
CT/TT
68.20%
63.60%
POLR2A N764K
CC CT
70.50 40.00%
0.28 (0.07-1.08)
0.065
61.50 20.00%
0.16 (0.03-0.79)
0.024
0.18 (0.03-0.98)
0.048
Data in bold are those that have p < 0.05

Discussion

In this study, three out of the six candidate SNPs were confirmed to be associated with NSCLC patient outcome i.e. OS, PFS and side effect. To the best of our knowledge, this is the first study showing association of ABCG2 Q141K (rs2231142), SLC29A3 S158F (rs780668) and POLR2A N764K (rs2228130) with NSCLC patient outcome treated with gemcitabine-based chemotherapy. ABCG2 belongs to the ABCG subfamily and ABC transporter superfamily. The ABCG family has five members i.e. ABCG2, ABCG1, ABCG4, ABCG5 and ABCG8. ABCG2 consists of a nucleotide-binding domain (NBD) in the amino terminus followed by six putative transmembrane domains (Fig. 3a). The ABCG2 Q141K SNP is located at the NBD in the cytoplasmic part of the protein. The c.421A allele frequency of ABCG2 Q141K is known as one of the common SNPs in Asian people (about 26-35%) [32]. Moreover, this SNP has been shown to be associated with increased risk of gout [33]. When we created our own detailed multiple sequence alignment using all members in the ABCG family, we found that glutamine in this position is well conserved in ABCG2 orthologs but not in other members in the family, therefore Q141 can be considered as an ABCG2-subfamily specific conserved residue (Fig. 3b). ABCG2 is the only member in this family that is not involved in cholesterol efflux but it mediates the efflux of a wide range of xenobiotics including gemcitabine, using ATP as an energy source [34]. There is in vitro evidence that ABCG2 Q141K decreases efflux activity and increases intracellular gemcitabine levels and it has been known to be associated with impaired ABCG2 activity by lowering protein expression level or decreasing ATPase activity [35]. The study supports the observation that ABCG2 itself plays a role in decreasing intracellular concentration of gemcitabine. Another in vitro study demonstrates significantly worse overall survival for carriers of the ABCG2 421A-allele treated with platinum-based drugs [36]. Mizuarai et al. described that the ATPase activity of the Q141K variant was reduced approximately 1.3-fold compared to the activity of the wild type ABCG2 in polarized LLC-PK1 cell lines, resulting in increased drug accumulation and decreased drug efflux in the variant ABCG2-expressing cells [36]. According to BLAST against PDB, a crystal structure of Malk, the ATP subunit of the maltose transporter from E.coli (PDB:1Q12 chain A) [37] was the top hit with a E-value of 2.0E-17. We used this template to do homology modeling of the NBD region (position 41-299) of ABCG2 using MODELLER. The ABCG2 model is shown in Fig. 3c. We used this model to calculate the stability change upon mutation by FoldX and found the average free energy changes (ddG) when mutating Q to K at position 141 of ABCG2 to be 1.93 kcal/mol with a standard deviation (SD) of 0.10 kcal/mol. This suggests that the SNP has a destabilizing effect on the protein structure which is in agreement with a recent finding that Q141 causes instability in the NBD [38]. The SNP is located in the loop region which is relatively near the ATP binding site of the dimer and changing the neutral side-chain glutamine to positively-charged side-chain lysine may affect the scaffold of the neighboring ATP binding site formed by the homodimer (Fig. 3c). Therefore, it can be proposed that if a patient has this variant and is treated with gemcitabine, efflux of gemcitabine can be diminished resulting in an increase in the intracellular concentration of gemcitabine in cancer cells and it is thus more effective at killing cancer cells. However, since normal cells also have this SNP which causes accumulation of the drug and other substrates exported by this protein, this SNP is also linked to increased toxicity in normal cells (Fig. 4).
Our study also showed for the first time that patients who were carrying either the CT or TT of SLC29A3 473 C > T (rs780668) were associated with increased OS. SLC29A3 belongs to the equilibrative nucleoside transporter (ENT) family, responsible for passive nucleoside transport and has 11 transmembrane helices (TMs) within the nucleoside transporter domain (Pfam:PF01733) (Fig. 5a). SLC29A3 S158 is likely to be a subfamily specific residue since serine at this position is fully conserved from human to fish among SLC29A3 orthologs but cannot be seen in other members in the family (Fig. 5b). This residue may relate to its unique function compared to other members in that it seems to function in the inner membrane of mitochondria and/or in the lysosome which requires an acidic pH environment and the position of the SNP seems to localize outside of the inner membrane of mitochondria [39]. For homology modeling of SLC29A3, we had a problem retrieving correct TMs models using MODELLER with loop refinement. The template used was a crystal structure of the glycerol-3-phosphate transporter from E.Coli (PDB:1PW4) chain A which has only 12% identity to our query. This problem is common when we used MODELLER which is more suitable for modeling soluble proteins than membrane proteins. Therefore, we used another software called Memoir [40] which is a homology modelling algorithm designed specifically for membrane proteins. A homology model from this software retrieved 11 TMs with long N-terminus and long loop regions between TM6 and 7 and the correct SNP’s position on the 3D structure (Fig. 5c). According to FoldX, SLC29A3 S158F was predicted to have a significant destabilizing effect with average ddG of 2.18 kcal/mol which could be explained by the strong change from the polar side chain serine to the larger and more hydrophobic side chain phenylalanine (Fig. 5d). Besides potential involvement of the conserved wildtype serine in the transport process, increasing the hydrophobicity through the mutation at the outside interface with the membrane may result in deeper insertion of the affected helix in the membrane. SLC29A3 can transport gemcitabine into organelles, e.g., mitochondria [39]. Moreover, SLC29A3 could be involved in the mitochondrial toxicity of nucleoside drugs [8]. Since SLC29A3 S158F is most likely to affect protein function, it may have a significant impact on transporting gemcitabine into mitochondria.
POLR2A (DNA-directed RNA polymerase II subunit RPB1) encodes the largest subunit (out of 12 subunits) of RNA polymerase II (Pol II) which catalyzes the RNA synthesis from DNA. POLR2A contains a carboxy terminal domain (CTD) which is composed of 52 heptapeptide repeats that are necessary for the polymerase activity (Fig. 6a). POLR2A N764K is located in the RNA polymerase’s domain 4 (Pfam:PF05000) which is also known as the funnel domain. The N764 is highly conserved among orthologs (Fig. 6b) and both PolyPhen-2 and SIFT analyses predicted that the SNP affects protein function (Additional file 1: Table S2). We created a homology model of POLR2A without the CTD using a crystal structure of yeast RNA polymerase II (PDB:1I3Q chain A) as a template (%identity = 50.3%) (Fig. 6c). The SNP is in the loop region and it was predicted to have a destabilizing effect by FoldX (average ddG 1.13 kcal/mol) (Fig. 6d). This could be due to the longer and charged lysine side chain causing a change in the conformation of the local loop structure. In our study, interestingly, wild type (CC) is found to be associated with higher grade 3 or 4 thrombocytopenia when compared to the CT variant (Table 3). Further analysis is needed to understand the mechanism for this. There is an evidence that dFdCTP is incorporated into RNA which is concentration- and time-dependent, resulting in inhibition of RNA synthesis [41]. In human parental NSCLC cells with a different inherent gemcitabine resistance, sensitivity to gemcitabine was related to differences in RNA corporation [42]. Since the SNP is found to be strongly deleterious from our analyses, it would be interesting to investigate whether POLR2A itself plays a role in the gemcitabine pharmacologic pathway and whether POLR2A N764K SNP has any implications on RNA synthesis.
NT5C2 (5′-nucleotidase, cytosolic II) encodes a hydrolase that serves a crucial role in cellular purine metabolism by acting primarily on inosine 5′-monophosphate (IMP) or guanosine monophosphate (GMP). The 5′-nucleotidase is a huge family of enzymes that catalyze the dephosphorylation of deoxy- and ribonucleoside monosphosphates to nucleoside analogues and inorganic phosphates [43, 44]. NT5C2 is known to dephosphorylate monophosphorylated gemcitabine. The NT5C2 D549 is the first charged amino acid in the last 13 acidic residues on the C-terminus of the protein (Fig. 7a). A study in 1999 proved the importance of the highly acidic C-terminus in NT5C2 protein using cDNA constructs encoding proteins lacking either N- or C-terminus and obtained the kinetic and molecular characteristics of the recombinant proteins [45]. When the last 13 acidic residues on the C-terminus were eliminated, there was a drastic reduction in the catalytic competence of the enzyme by lowering both the substrate affinity and the specific productivity. Furthermore, the capability of the protein to form a tetramer was significantly compromised. From the experiment above, it was concluded that the region of glutamic and aspartic acid residues in the C-terminus of the enzyme is necessary for the complete function of NT5C2. Another study, on bovine NT5C2 found that the C-terminus is perhaps involved with the modulation of enzyme function [46]. In our detailed SNP analysis, the D549 is well conserved among orthologs of NT5C2 (Fig. 7b) but is not conserved throughout the NT5C enzyme family (data not shown). D549E was predicted as deleterious by both PolyPhen-2 and SIFT. However, D and E are both negatively charged amino acids and the following acidic C-terminus shows a mixed pattern of the two so it is not mechanistically clear how this mutation could alter the enzyme activity of NT5C2. In agreement with this, we do not find significant correlations with survival or toxicity for NT5C2 D549E.
HELB T980I and CTDP1 T221M, unlike the other four SNPs that we predicted to directly affect the gemcitabine pharmacologic pathway, were selected because they could potentially affect the pathway indirectly in order to establish if more remotely related SNPs are still useful candidates for further studies. According to our criteria, HELB T980I passed three criteria i.e. significant results in SIFT, PolyPhen-2 and high MAF in Singaporean Chinese (Additional file 1: Table S2) while CTDP1 T221M passed two criteria (borderline) i.e. the SNP is in a domain region and has significant FoldX stability change (Additional file 1: Table S2). HELB T980I lies close to the phosphorylation sites in the phosphorylation regulated subcellular localization (PSLD) domain at the C-terminus of the protein. The PSLD domain has been suggested to play a significant role in regulating the subcellular localization of HELB [47]. The same study also suggested a possible role of HELB in DNA repair. Another study gives further evidence that HELB is recruited by Replication Protein A to mitigate replication stress [48]. So, we hypothesized that the SNP could impair HELB’s DNA replication stress mitigating effect of gemcitabine-induced DNA damage which could lead to reduced recovery from gemcitabine-induced replication stress, and hence to a better anti-cancer activity of gemcitabine. CTDP1 dephosphorylates a phosphorylated C-terminal domain of POLR2A to facilitate Pol II recycling for transcription [49]. There is evidence suggesting that CTDP1 may play a role in DNA damage response as well [50]. However, from our clinical results, we could not see any correlation between these two SNPs and the patient outcome. Therefore, it can be suggested that when we choose SNPs in the final step, we should expect a high chance for correlation with drug response only for those that are affecting the pharmacologic pathway directly.

Conclusions

Overall, our bioinformatics approach can be used to select a small number of potential causative SNPs that can be rationalized with molecular mechanisms of their effects. Our workflow can also be applied to any pathway of interest that could affect other phenotypes. Nowadays, using GWAS to find SNPs associated with common diseases or adverse drug reactions are the norm but even though thousands of case samples have been used, one review found that about 30% of the results lead to a null finding [51]. Furthermore, significant GWAS hits are often not easily explainable to have functional effects e.g. when they are intronic or synonymous SNPs. Although our study may be limited in sample size, we managed to find that three out of six final candidate SNPs are associated with patient outcome. However, more association studies and possibly molecular and cellular studies are needed to further establish the value of these three SNPs as biomarkers of patient outcome. While we have to acknowledge that this approach of selective filters does not guarantee to find all SNPs involved in a studied phenotype, the main benefit is that one can reduce the space of possible candidates to a small experimentally tractable number with higher chance of being relevant. We do believe that other candidate SNPs that passed three out of five criteria would also be potential candidates to be tested further. We hope our approach and findings will pave the way to more meaningful biomarkers and personalized treatment options in the future.

Acknowledgements

We are thankful to Hong Sain Ooi who helped us on collecting protein-protein interaction data from different public databases.

Funding

This work was supported by the Agency for Science, Technology and Research (A*STAR) Joint Council Office (JCO) Grant JCOAG04_FG03_2009. The clinical trial was supported by the BMRC grant 06/1/26/19/482. RAS is supported by the National Research Foundation, Singapore and the Singapore Ministry of Education under its Research Centers of Excellence initiative. The funders had no role in the design of the study, data collection and analysis, interpretation of data or preparation of the manuscript.

Availability of data and materials

All data analysed during this study are included in this published article and Additional file.
The Singapore National Healthcare Group Domain Specific Review Board reviewed and approved the study. All the patients provided written informed consent before study entry. The study was conducted in accordance to Good Clinical Practice guidelines.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://​creativecommons.​org/​licenses/​by/​4.​0/​), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://​creativecommons.​org/​publicdomain/​zero/​1.​0/​) applies to the data made available in this article, unless otherwise stated.
Literatur
1.
Zurück zum Zitat Blackstein M, Vogel CL, Ambinder R, Cowan J, Iglesias J, Melemed A. Gemcitabine as first-line therapy in patients with metastatic breast cancer: a phase II trial. Oncology. 2002;62:2–8.CrossRefPubMed Blackstein M, Vogel CL, Ambinder R, Cowan J, Iglesias J, Melemed A. Gemcitabine as first-line therapy in patients with metastatic breast cancer: a phase II trial. Oncology. 2002;62:2–8.CrossRefPubMed
2.
Zurück zum Zitat Burris HA 3rd, Moore MJ, Andersen J, Green MR, Rothenberg ML, Modiano MR, Cripps MC, Portenoy RK, Storniolo AM, Tarassoff P, Nelson R, Dorr FA, Stephens CD, Von Hoff DD. Improvements in survival and clinical benefit with gemcitabine as first-line therapy for patients with advanced pancreas cancer: a randomized trial. J Clin Oncol. 1997;15:2403–13.CrossRefPubMed Burris HA 3rd, Moore MJ, Andersen J, Green MR, Rothenberg ML, Modiano MR, Cripps MC, Portenoy RK, Storniolo AM, Tarassoff P, Nelson R, Dorr FA, Stephens CD, Von Hoff DD. Improvements in survival and clinical benefit with gemcitabine as first-line therapy for patients with advanced pancreas cancer: a randomized trial. J Clin Oncol. 1997;15:2403–13.CrossRefPubMed
3.
4.
Zurück zum Zitat Eisenhaber F. A decade after the first full human genome sequencing: when will we understand our own genome? J Bioinforma Comput Biol. 2012;10:1271001.CrossRef Eisenhaber F. A decade after the first full human genome sequencing: when will we understand our own genome? J Bioinforma Comput Biol. 2012;10:1271001.CrossRef
5.
Zurück zum Zitat Ma Q, Lu AYH. Pharmacogenetics, pharmacogenomics, and individualized medicine. Pharmacol Rev. 2011;63:437–59.CrossRefPubMed Ma Q, Lu AYH. Pharmacogenetics, pharmacogenomics, and individualized medicine. Pharmacol Rev. 2011;63:437–59.CrossRefPubMed
6.
Zurück zum Zitat Soo RA, Wang LZ, Ng SS, Chong PY, Yong WP, Lee SC, Liu JJ, Choo TB, Tham LS, Lee HS, Goh BC, Soong R. Distribution of gemcitabine pathway genotypes in ethnic Asians and their association with outcome in non-small cell lung cancer patients. Lung Cancer. 2009;63:121–7.CrossRefPubMed Soo RA, Wang LZ, Ng SS, Chong PY, Yong WP, Lee SC, Liu JJ, Choo TB, Tham LS, Lee HS, Goh BC, Soong R. Distribution of gemcitabine pathway genotypes in ethnic Asians and their association with outcome in non-small cell lung cancer patients. Lung Cancer. 2009;63:121–7.CrossRefPubMed
8.
Zurück zum Zitat Errasti-Murugarren E, Pastor-Anglada M. Drug transporter pharmacogenetics in nucleoside-based therapies. Pharmacogenomics. 2010;11:809–41.CrossRefPubMed Errasti-Murugarren E, Pastor-Anglada M. Drug transporter pharmacogenetics in nucleoside-based therapies. Pharmacogenomics. 2010;11:809–41.CrossRefPubMed
9.
Zurück zum Zitat Fukunaga AK, Marsh S, Murry DJ, Hurley TD, McLeod HL. Identification and analysis of single-nucleotide polymorphisms in the gemcitabine pharmacologic pathway. Pharm J. 2004;4:307–14. Fukunaga AK, Marsh S, Murry DJ, Hurley TD, McLeod HL. Identification and analysis of single-nucleotide polymorphisms in the gemcitabine pharmacologic pathway. Pharm J. 2004;4:307–14.
11.
Zurück zum Zitat Stark C, Breitkreutz B-J, Reguly T, Boucher L, Breitkreutz A, Tyers M. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006;34(Database issue):D535–9.CrossRefPubMed Stark C, Breitkreutz B-J, Reguly T, Boucher L, Breitkreutz A, Tyers M. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006;34(Database issue):D535–9.CrossRefPubMed
12.
Zurück zum Zitat Kerrien S, Alam-Faruque Y, Aranda B, Bancarz I, Bridge A, Derow C, Dimmer E, Feuermann M, Friedrichsen A, Huntley R, Kohler C, Khadake J, Leroy C, Liban A, Lieftink C, Montecchi-Palazzi L, Orchard S, Risse J, Robbe K, Roechert B, Thorneycroft D, Zhang Y, Apweiler R, Hermjakob H. IntAct--open source resource for molecular interaction data. Nucleic Acids Res. 2007;35(Database issue):D561–5.CrossRefPubMed Kerrien S, Alam-Faruque Y, Aranda B, Bancarz I, Bridge A, Derow C, Dimmer E, Feuermann M, Friedrichsen A, Huntley R, Kohler C, Khadake J, Leroy C, Liban A, Lieftink C, Montecchi-Palazzi L, Orchard S, Risse J, Robbe K, Roechert B, Thorneycroft D, Zhang Y, Apweiler R, Hermjakob H. IntAct--open source resource for molecular interaction data. Nucleic Acids Res. 2007;35(Database issue):D561–5.CrossRefPubMed
13.
Zurück zum Zitat Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D. The database of interacting proteins: 2004 update. Nucleic Acids Res. 2004;32(Database issue):D449–51.CrossRefPubMedPubMedCentral Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D. The database of interacting proteins: 2004 update. Nucleic Acids Res. 2004;32(Database issue):D449–51.CrossRefPubMedPubMedCentral
14.
Zurück zum Zitat Chatr-aryamontri A, Ceol A, Palazzi LM, Nardelli G, Schneider MV, Castagnoli L, Cesareni G. MINT: the molecular INTeraction database. Nucleic Acids Res. 2007;35(Database issue):D572–4.CrossRefPubMed Chatr-aryamontri A, Ceol A, Palazzi LM, Nardelli G, Schneider MV, Castagnoli L, Cesareni G. MINT: the molecular INTeraction database. Nucleic Acids Res. 2007;35(Database issue):D572–4.CrossRefPubMed
15.
Zurück zum Zitat Güldener U, Münsterkötter M, Oesterheld M, Pagel P, Ruepp A, Mewes H-W, Stümpflen V. MPact: the MIPS protein interaction resource on yeast. Nucleic Acids Res. 2006;34(Database issue):D436–41.CrossRefPubMed Güldener U, Münsterkötter M, Oesterheld M, Pagel P, Ruepp A, Mewes H-W, Stümpflen V. MPact: the MIPS protein interaction resource on yeast. Nucleic Acids Res. 2006;34(Database issue):D436–41.CrossRefPubMed
16.
Zurück zum Zitat Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, Balakrishnan L, Marimuthu A, Banerjee S, Somanathan DS, Sebastian A, Rani S, Ray S, Harrys Kishore CJ, Kanth S, Ahmed M, Kashyap MK, Mohmood R, Ramachandra YL, Krishna V, Rahiman BA, Mohan S, Ranganathan P, Ramabadran S, Chaerkady R, Pandey A. Human protein reference database--2009 update. Nucleic Acids Res. 2009;37(Database issue):D767–72.CrossRefPubMed Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, Balakrishnan L, Marimuthu A, Banerjee S, Somanathan DS, Sebastian A, Rani S, Ray S, Harrys Kishore CJ, Kanth S, Ahmed M, Kashyap MK, Mohmood R, Ramachandra YL, Krishna V, Rahiman BA, Mohan S, Ranganathan P, Ramabadran S, Chaerkady R, Pandey A. Human protein reference database--2009 update. Nucleic Acids Res. 2009;37(Database issue):D767–72.CrossRefPubMed
17.
Zurück zum Zitat Pagel P, Kovac S, Oesterheld M, Brauner B, Dunger-Kaltenbach I, Frishman G, Montrone C, Mark P, Stümpflen V, Mewes H-W, Ruepp A, Frishman D. The MIPS mammalian protein-protein interaction database. Bioinformatics. 2005;21:832–4.CrossRefPubMed Pagel P, Kovac S, Oesterheld M, Brauner B, Dunger-Kaltenbach I, Frishman G, Montrone C, Mark P, Stümpflen V, Mewes H-W, Ruepp A, Frishman D. The MIPS mammalian protein-protein interaction database. Bioinformatics. 2005;21:832–4.CrossRefPubMed
18.
Zurück zum Zitat Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM. Sirotkin K: dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2001;29:308–11.CrossRefPubMedPubMedCentral Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM. Sirotkin K: dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2001;29:308–11.CrossRefPubMedPubMedCentral
20.
Zurück zum Zitat Flicek P, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D, Clapham P, Coates G, Fairley S, Fitzgerald S, Gil L, Gordon L, Hendrix M, Hourlier T, Johnson N, Kähäri AK, Keefe D, Keenan S, Kinsella R, Komorowska M, Koscielny G, Kulesha E, Larsson P, Longden I, McLaren W, Muffato M, Overduin B, Pignatelli M, Pritchard B, Riat HS, et al. Ensembl 2012. Nucleic Acids Res. 2012;40:D84–90.CrossRefPubMed Flicek P, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D, Clapham P, Coates G, Fairley S, Fitzgerald S, Gil L, Gordon L, Hendrix M, Hourlier T, Johnson N, Kähäri AK, Keefe D, Keenan S, Kinsella R, Komorowska M, Koscielny G, Kulesha E, Larsson P, Longden I, McLaren W, Muffato M, Overduin B, Pignatelli M, Pritchard B, Riat HS, et al. Ensembl 2012. Nucleic Acids Res. 2012;40:D84–90.CrossRefPubMed
21.
Zurück zum Zitat Teo Y-Y, Sim X, Ong RTH, Tan AKS, Chen J, Tantoso E, Small KS, Ku C-S, Lee EJD, Seielstad M, Chia K-S. Singapore genome variation project: a haplotype map of three southeast Asian populations. Genome Res. 2009;19:2154–62.CrossRefPubMedPubMedCentral Teo Y-Y, Sim X, Ong RTH, Tan AKS, Chen J, Tantoso E, Small KS, Ku C-S, Lee EJD, Seielstad M, Chia K-S. Singapore genome variation project: a haplotype map of three southeast Asian populations. Genome Res. 2009;19:2154–62.CrossRefPubMedPubMedCentral
23.
Zurück zum Zitat Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 2009;4:1073–81.CrossRefPubMed Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 2009;4:1073–81.CrossRefPubMed
24.
Zurück zum Zitat Schneider A, Dessimoz C, Gonnet GH. OMA browser—exploring orthologous relations across 352 complete genomes. Bioinformatics. 2007;23:2180–2.CrossRefPubMed Schneider A, Dessimoz C, Gonnet GH. OMA browser—exploring orthologous relations across 352 complete genomes. Bioinformatics. 2007;23:2180–2.CrossRefPubMed
25.
Zurück zum Zitat Eisenhaber B, Kuchibhatla D, Sherman W, Sirota FL, Berezovsky IN, Wong W-C, Eisenhaber F. The recipe for protein sequence-based function prediction and its implementation in the ANNOTATOR software environment. Methods Mol Biol. 2016;1415:477–506.CrossRefPubMed Eisenhaber B, Kuchibhatla D, Sherman W, Sirota FL, Berezovsky IN, Wong W-C, Eisenhaber F. The recipe for protein sequence-based function prediction and its implementation in the ANNOTATOR software environment. Methods Mol Biol. 2016;1415:477–506.CrossRefPubMed
26.
Zurück zum Zitat Katoh K, Toh H. Recent developments in the MAFFT multiple sequence alignment program. Brief Bioinform. 2008;9:286–98.CrossRefPubMed Katoh K, Toh H. Recent developments in the MAFFT multiple sequence alignment program. Brief Bioinform. 2008;9:286–98.CrossRefPubMed
27.
Zurück zum Zitat Waterhouse AM, Procter JB, DMA M, Clamp M, Barton GJ. Jalview version 2--a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25:1189–91.CrossRefPubMedPubMedCentral Waterhouse AM, Procter JB, DMA M, Clamp M, Barton GJ. Jalview version 2--a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25:1189–91.CrossRefPubMedPubMedCentral
28.
Zurück zum Zitat Schymkowitz J, Borg J, Stricher F, Nys R, Rousseau F, Serrano L. The FoldX web server: an online force field. Nucleic Acids Res. 2005;33(Web Server issue):W382–8.CrossRefPubMedPubMedCentral Schymkowitz J, Borg J, Stricher F, Nys R, Rousseau F, Serrano L. The FoldX web server: an online force field. Nucleic Acids Res. 2005;33(Web Server issue):W382–8.CrossRefPubMedPubMedCentral
29.
Zurück zum Zitat Van Durme J, Delgado J, Stricher F, Serrano L, Schymkowitz J, Rousseau F. A graphical interface for the FoldX forcefield. Bioinformatics. 2011;27:1711–2.CrossRefPubMed Van Durme J, Delgado J, Stricher F, Serrano L, Schymkowitz J, Rousseau F. A graphical interface for the FoldX forcefield. Bioinformatics. 2011;27:1711–2.CrossRefPubMed
30.
Zurück zum Zitat Eswar N, Eramian D, Webb B, Shen M-Y, Sali A. Protein structure modeling with MODELLER. Methods Mol Biol. 2008;426:145–59.CrossRefPubMed Eswar N, Eramian D, Webb B, Shen M-Y, Sali A. Protein structure modeling with MODELLER. Methods Mol Biol. 2008;426:145–59.CrossRefPubMed
31.
Zurück zum Zitat Mah TL, Yap XN, Limviphuvadh V, Li N, Sridharan S, Kuralmani V, Feng M, Liem N, Adhikari S, Yong WP, Soo RA, Maurer-Stroh S, Eisenhaber F, Tong JC. Novel SNP improves differential survivability and mortality in non-small cell lung cancer patients. BMC Genomics. 2014;15(Suppl 9):S20.CrossRefPubMedPubMedCentral Mah TL, Yap XN, Limviphuvadh V, Li N, Sridharan S, Kuralmani V, Feng M, Liem N, Adhikari S, Yong WP, Soo RA, Maurer-Stroh S, Eisenhaber F, Tong JC. Novel SNP improves differential survivability and mortality in non-small cell lung cancer patients. BMC Genomics. 2014;15(Suppl 9):S20.CrossRefPubMedPubMedCentral
32.
Zurück zum Zitat de Jong FA, Marsh S, Mathijssen RHJ, King C, Verweij J, Sparreboom A, McLeod HL. ABCG2 pharmacogenetics: ethnic differences in allele frequency and assessment of influence on irinotecan disposition. Clin Cancer Res. 2004;10:5889–94.CrossRefPubMed de Jong FA, Marsh S, Mathijssen RHJ, King C, Verweij J, Sparreboom A, McLeod HL. ABCG2 pharmacogenetics: ethnic differences in allele frequency and assessment of influence on irinotecan disposition. Clin Cancer Res. 2004;10:5889–94.CrossRefPubMed
33.
Zurück zum Zitat Dong Z, Guo S, Yang Y, Wu J, Guan M, Zou H, Jin L, Wang J. Association between ABCG2 Q141K polymorphism and gout risk affected by ethnicity and gender: a systematic review and meta-analysis. Int J Rheum Dis. 2015;18:382–91.CrossRefPubMed Dong Z, Guo S, Yang Y, Wu J, Guan M, Zou H, Jin L, Wang J. Association between ABCG2 Q141K polymorphism and gout risk affected by ethnicity and gender: a systematic review and meta-analysis. Int J Rheum Dis. 2015;18:382–91.CrossRefPubMed
34.
Zurück zum Zitat Furukawa T, Wakabayashi K, Tamura A, Nakagawa H, Morishima Y, Osawa Y, Ishikawa T. Major SNP (Q141K) variant of human ABC transporter ABCG2 undergoes lysosomal and proteasomal degradations. Pharm Res. 2009;26:469–79.CrossRefPubMed Furukawa T, Wakabayashi K, Tamura A, Nakagawa H, Morishima Y, Osawa Y, Ishikawa T. Major SNP (Q141K) variant of human ABC transporter ABCG2 undergoes lysosomal and proteasomal degradations. Pharm Res. 2009;26:469–79.CrossRefPubMed
35.
Zurück zum Zitat de Wolf C, Jansen R, Yamaguchi H, de Haas M, van de Wetering K, Wijnholds J, Beijnen J, Borst P. Contribution of the drug transporter ABCG2 (breast cancer resistance protein) to resistance against anticancer nucleosides. Mol Cancer Ther. 2008;7:3092–102.CrossRefPubMed de Wolf C, Jansen R, Yamaguchi H, de Haas M, van de Wetering K, Wijnholds J, Beijnen J, Borst P. Contribution of the drug transporter ABCG2 (breast cancer resistance protein) to resistance against anticancer nucleosides. Mol Cancer Ther. 2008;7:3092–102.CrossRefPubMed
36.
Zurück zum Zitat Mizuarai S, Aozasa N, Kotani H. Single nucleotide polymorphisms result in impaired membrane localization and reduced atpase activity in multidrug transporter ABCG2. Int J Cancer. 2004;109:238–46.CrossRefPubMed Mizuarai S, Aozasa N, Kotani H. Single nucleotide polymorphisms result in impaired membrane localization and reduced atpase activity in multidrug transporter ABCG2. Int J Cancer. 2004;109:238–46.CrossRefPubMed
37.
Zurück zum Zitat Chen J, Lu G, Lin J, Davidson AL, Quiocho FA. A tweezers-like motion of the ATP-binding cassette dimer in an ABC transport cycle. Mol Cell. 2003;12:651–61.CrossRefPubMed Chen J, Lu G, Lin J, Davidson AL, Quiocho FA. A tweezers-like motion of the ATP-binding cassette dimer in an ABC transport cycle. Mol Cell. 2003;12:651–61.CrossRefPubMed
38.
Zurück zum Zitat Woodward OM, Tukaye DN, Cui J, Greenwell P, Constantoulakis LM, Parker BS, Rao A, Köttgen M, Maloney PC, Guggino WB. Gout-causing Q141K mutation in ABCG2 leads to instability of the nucleotide-binding domain and can be corrected with small molecules. Proc Natl Acad Sci U S A. 2013;110:5223–8.CrossRefPubMedPubMedCentral Woodward OM, Tukaye DN, Cui J, Greenwell P, Constantoulakis LM, Parker BS, Rao A, Köttgen M, Maloney PC, Guggino WB. Gout-causing Q141K mutation in ABCG2 leads to instability of the nucleotide-binding domain and can be corrected with small molecules. Proc Natl Acad Sci U S A. 2013;110:5223–8.CrossRefPubMedPubMedCentral
39.
Zurück zum Zitat Govindarajan R, Leung GPH, Zhou M, Tse C-M, Wang J, Unadkat JD. Facilitated mitochondrial import of antiviral and anticancer nucleoside drugs by human equilibrative nucleoside transporter-3. Am J Physiol Gastrointest Liver Physiol. 2009;296:G910–22.CrossRefPubMedPubMedCentral Govindarajan R, Leung GPH, Zhou M, Tse C-M, Wang J, Unadkat JD. Facilitated mitochondrial import of antiviral and anticancer nucleoside drugs by human equilibrative nucleoside transporter-3. Am J Physiol Gastrointest Liver Physiol. 2009;296:G910–22.CrossRefPubMedPubMedCentral
40.
Zurück zum Zitat Ebejer J-P, Hill JR, Kelm S, Shi J, Deane CM. Memoir: template-based structure prediction for membrane proteins. Nucleic Acids Res. 2013;41(Web Server issue):W379–83.CrossRefPubMedPubMedCentral Ebejer J-P, Hill JR, Kelm S, Shi J, Deane CM. Memoir: template-based structure prediction for membrane proteins. Nucleic Acids Res. 2013;41(Web Server issue):W379–83.CrossRefPubMedPubMedCentral
41.
Zurück zum Zitat Ruiz van Haperen VW, Veerman G, Vermorken JB, Peters GJ. 2′,2′-Difluoro-deoxycytidine (gemcitabine) incorporation into RNA and DNA of tumour cell lines. Biochem Pharmacol. 1993;46:762–6.CrossRefPubMed Ruiz van Haperen VW, Veerman G, Vermorken JB, Peters GJ. 2′,2′-Difluoro-deoxycytidine (gemcitabine) incorporation into RNA and DNA of tumour cell lines. Biochem Pharmacol. 1993;46:762–6.CrossRefPubMed
42.
Zurück zum Zitat Bergman AM, Pinedo HM, Peters GJ. Determinants of resistance to 2′,2′-difluorodeoxycytidine (gemcitabine). Drug Resist Updat. 2002;5:19–33.CrossRefPubMed Bergman AM, Pinedo HM, Peters GJ. Determinants of resistance to 2′,2′-difluorodeoxycytidine (gemcitabine). Drug Resist Updat. 2002;5:19–33.CrossRefPubMed
43.
44.
Zurück zum Zitat Bogan KL, Brenner C. 5′-Nucleotidases and their new roles in NAD+ and phosphate metabolism. New J Chem. 2010;34:845–53.CrossRef Bogan KL, Brenner C. 5′-Nucleotidases and their new roles in NAD+ and phosphate metabolism. New J Chem. 2010;34:845–53.CrossRef
45.
Zurück zum Zitat Spychala J, Chen V, Oka J, Mitchell BS. ATP and phosphate reciprocally affect subunit association of human recombinant high km 5′-nucleotidase. Role for the C-terminal polyglutamic acid tract in subunit association and catalytic activity. Eur J Biochem. 1999;259:851–8.CrossRefPubMed Spychala J, Chen V, Oka J, Mitchell BS. ATP and phosphate reciprocally affect subunit association of human recombinant high km 5′-nucleotidase. Role for the C-terminal polyglutamic acid tract in subunit association and catalytic activity. Eur J Biochem. 1999;259:851–8.CrossRefPubMed
46.
Zurück zum Zitat Allegrini S, Scaloni A, Careddu MG, Cuccu G, D’Ambrosio C, Pesi R, Camici M, Ferrara L, Tozzi MG. Mechanistic studies on bovine cytosolic 5′-nucleotidase II, an enzyme belonging to the HAD superfamily. Eur J Biochem. 2004;271:4881–91.CrossRefPubMed Allegrini S, Scaloni A, Careddu MG, Cuccu G, D’Ambrosio C, Pesi R, Camici M, Ferrara L, Tozzi MG. Mechanistic studies on bovine cytosolic 5′-nucleotidase II, an enzyme belonging to the HAD superfamily. Eur J Biochem. 2004;271:4881–91.CrossRefPubMed
47.
Zurück zum Zitat Gu J, Xia X, Yan P, Liu H, Podust VN, Reynolds AB, Fanning E. Cell cycle-dependent regulation of a human DNA helicase that localizes in DNA damage foci. Mol Biol Cell. 2004;15:3320–32.CrossRefPubMedPubMedCentral Gu J, Xia X, Yan P, Liu H, Podust VN, Reynolds AB, Fanning E. Cell cycle-dependent regulation of a human DNA helicase that localizes in DNA damage foci. Mol Biol Cell. 2004;15:3320–32.CrossRefPubMedPubMedCentral
48.
Zurück zum Zitat Guler GD, Liu H, Vaithiyalingam S, Arnett DR, Kremmer E, Chazin WJ, Fanning E. Human DNA helicase B (HDHB) binds to replication protein a and facilitates cellular recovery from replication stress. J Biol Chem. 2012;287:6469–81.CrossRefPubMed Guler GD, Liu H, Vaithiyalingam S, Arnett DR, Kremmer E, Chazin WJ, Fanning E. Human DNA helicase B (HDHB) binds to replication protein a and facilitates cellular recovery from replication stress. J Biol Chem. 2012;287:6469–81.CrossRefPubMed
50.
Zurück zum Zitat Jeong S-J, Kim H-J, Yang Y-J, Seol J-H, Jung B-Y, Han J-W, Lee H-W, Cho E-J. Role of RNA polymerase II carboxy terminal domain phosphorylation in DNA damage response. J Microbiol. 2005;43:516–22.PubMed Jeong S-J, Kim H-J, Yang Y-J, Seol J-H, Jung B-Y, Han J-W, Lee H-W, Cho E-J. Role of RNA polymerase II carboxy terminal domain phosphorylation in DNA damage response. J Microbiol. 2005;43:516–22.PubMed
51.
Zurück zum Zitat Chan SL, Jin S, Loh M, Brunham LR. Progress in understanding the genomic basis for adverse drug reactions: a comprehensive review and focus on the role of ethnicity. Pharmacogenomics. 2015:1–19. Chan SL, Jin S, Loh M, Brunham LR. Progress in understanding the genomic basis for adverse drug reactions: a comprehensive review and focus on the role of ethnicity. Pharmacogenomics. 2015:1–19.
Metadaten
Titel
Discovering novel SNPs that are correlated with patient outcome in a Singaporean cancer patient cohort treated with gemcitabine-based chemotherapy
verfasst von
Vachiranee Limviphuvadh
Chee Seng Tan
Fumikazu Konishi
Piroon Jenjaroenpun
Joy Shengnan Xiang
Yuliya Kremenska
Yar Soe Mu
Nicholas Syn
Soo Chin Lee
Ross A. Soo
Frank Eisenhaber
Sebastian Maurer-Stroh
Wei Peng Yong
Publikationsdatum
01.12.2018
Verlag
BioMed Central
Erschienen in
BMC Cancer / Ausgabe 1/2018
Elektronische ISSN: 1471-2407
DOI
https://doi.org/10.1186/s12885-018-4471-x

Weitere Artikel der Ausgabe 1/2018

BMC Cancer 1/2018 Zur Ausgabe

Update Onkologie

Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.