Introduction

Acute myeloid leukemia (AML) is the most common acute leukemia affecting adults. It arises from the accumulation of abnormal myeloid cells in the bone marrow that disrupts the production of normal blood cells.1 Cytogenetic characterization of blast cells is the most important prognostic factor for predicting remission rate, relapse and overall survival (OS). Based on cytogenetic findings, AML patients can be categorized into three different risk subgroups: favorable, intermediate and poor.2, 3 Approximately 60% of patients with AML harbor abnormal karyotypes (AK) that are non-random, somatically acquired chromosomal translocations and inversions.3, 4 Among the remaining, as many as 40% of patients with AML have no cytogenetic abnormality, representing normal karyotype (NK), and are classified into the intermediate-risk group.3, 5 For patients in the favorable and poor cytogenetic risk groups, treatment choice is well defined. However, for patients in the intermediate-risk group having no cytogenetic abnormalities, the most optimal treatment is controversial because this group is very heterogeneous about treatment response, survival and risk of relapse.6, 7 The FLT3-ITD, NPM1 and CEBPA mutations have been reported to predict OS and first complete remission duration (CR1D) in NK-AML.8, 9, 10, 11 The inherited genetic polymorphisms are also reported in SULT1C2, XPA and MDR1 in intermediate-risk group AML patients. These genes encode drug-metabolism enzyme, drug transport protein and DNA repair protein.12 Whether these genetic markers classify a clinically relevant subgroup in NK-AML patients remains uncertain. For individualizing treatment, there is still need for additional genetic markers in NK-AML patients.

The nucleoside analog cytosine arabinoside (AraC) is a major component of the most common treatment for AML patients. It is metabolized by a number of pharmacogenetic enzymes (for example, cytidine deaminase, deoxycytidine kinase, dCMP deaminase and 5′-nucleotidase).13, 14, 15 Among these drug-metabolizing enzymes, cytidine deaminase (CDA) is related with pyrimidine salvaging and catalyzes the irreversible hydrolytic deamination of cytidine to uridine nucleosides.16 Therefore, CDA is responsible for the inactivation of AraC during chemotherapy of AML. One nonsynonymous single-nucleotide polymorphism (SNP), rs2072671 (79A>C), in CDA changes lysine to glutamine, resulting in decreased enzyme activity, reported several in vitro studies.17, 18 Previous Chinese AML association study reported that rs2072671 was associated with decreased sensitivity to AraC used in chemotherapy of childhood leukemias.19 More recently, it was reported that the 79 C/C and −451T/T genotypes in CDA were associated with decreased global DNA methylation level in cells and shorter OS in FLT3–ITD-positive/NPM1-positive patients in NK-AML.20

The goal of this study was to investigate the potential roles of common genetic polymorphisms in CDA gene as prognostic factors in AML through the comparative analysis of 95 AMLs (50 NK-AMLs and 45 AK-AMLs) and 241 normal controls (NC) in Korean population. Our findings indicate that polymorphisms of CDA could be the potential diagnostic and prognostic markers in the NK-AML patients.

Materials and methods

Patient characteristics and treatment protocol

A total of 95 newly diagnosed AML patients (56 males and 39 females) who had marrow samples available for DNA isolation were enrolled in the study from Seoul National University Hospital, Seoul, Korea between December 2001 and September 2009. The patient characteristics were as follows: the median age was 49 years (range 16–76 years). The FAB (French–American–British) classification showed the following distribution of subtypes: M0, 2 patients (2%); M1, 13 (14%); M2, 43 (45%); M4, 30 (32%); M5, 5 (5%); and M7, 2 patients (2%). Patients diagnosed with acute promyelocytic leukemia (M3 FAB subtype) were excluded from the study. The criteria used to describe a cytogenetic clone and karyotype followed the recommendations of the International System for Human Cytogenetic Nomenclature.21 The cytogenetic risk groups were classified according to the MRC10 criteria, as described previously22 (favorable risk: AML associated with t(8;21), t(15;17), or inv(16); unfavorable risk: the presence of a complex karyotype, −5, del(5q), −7 or abnormalities of 3q; intermediate risk: the remaining group of patients, including those with 11q23 abnormalities, +8, +21, +22, del(9q), del(7q) or other miscellaneous structural or numerical defects not included in the favorable or unfavorable risk groups). Information of first induction chemotherapy outcome was available in 90 patients. All patients were treated with standard induction chemotherapy (idarubicin 12 mg m−2 per day intravenous for 3 consecutive days and AraC 200 mg m−2 per day intravenous for 7 consecutive days). In patients over 60 years of age, the dose of idarubicin was modified to 10 mg m−2 per day. Consolidation therapies were performed based on two more cycles of high-dose AraC (3 g m−2 per day intravenous twice a day on days 1, 3 and 5). The institutional review board of Seoul National University Hospital approved the study protocol, and all subjects provided informed consent.

SNP genotyping

Among 46 HapMap SNPs (release #27) detected in CDA, 18 tagging SNPs were selected based on their minor allele frequencies (>5%) and linkage disequilibrium in the HapMap database. Genotyping was performed at a multiplex level using the Illumina GoldenGate genotyping system (Illumina Inc., San Diego, CA, USA).23 The genotype quality score for retaining data was set to 0.25 and all SNPs were successfully genotyped.

Statistics

To determine whether the individual variant was in equilibrium at each locus in the population (Hardy–Weinberg equilibrium), χ2 tests were performed. We examined a widely used measure of linkage disequilibrium between all pairs of biallelic loci, D' (the correlation coefficient [Delta, |D'|]) and r2 using Haploview.24 Association with AML was analyzed under a logistic model by adjusting for age and sex as a covariate using the SAS (Cary, NC, USA). In order to correct for multiple comparisons, Bonferroni correction was applied based on the number of independent SNPs (n=16) and three different models (n=3). To determine the ethnic difference of CDA polymorphisms, χ2 test and Fisher’s exact test were performed. Cox proportional hazard model was applied to determine the significance of difference in OS times and CR1D. Haplotypes were inferred using PHASE algorithm ver. 2.0.25 Subsequently, statistical analysis was performed using SAS version 9.1.

Results

In order to examine the relationship between common CDA polymorphisms and AML patients, 18 SNP markers were genotyped using the Illumina GoldenGate genotyping assay. Pairwise comparisons between the SNPs revealed two sets of absolute linkage disequilibriums (|D’|=1 and r2=1) (Figure 1c). The position and frequency of the genetic variants genotyped in the CDA gene are shown in Figure 1a. A total of 16 SNPs (3 SNPs in the promoter: rs1253904, rs532545 and rs603412; 1 nonsynonymous SNP: rs2072671; and 12 SNPs in intron: rs471760, rs818199, rs577042, rs4655226, rs10799647, rs818194, rs10916827, rs580032, rs527912, rs1689924, rs477155 and rs12404655) were analyzed. Among the 14 haplotypes constructed, 5 common haplotypes (frequency >5%) were used for further analysis (Figure 1b).

Figure 1
figure 1

Gene map of cytidine deaminase (CDA). (a) Map of CDA on chromosome 1p36.2–p35. Coding exons are marked by black blocks and 5′ and 3′ untranslated regions (UTRs) by white blocks (Ref. NM#. NM_001785.2). The frequencies in parentheses were based on the Korean acute myeloid leukemia (AML) patients (n=95). Asterisks indicate single-nucleotide polymorphisms (SNPs) that were used in statistical analysis. (b) Haplotypes in CDA. Sixteen CDA polymorphisms were used for haplotype construction. Five haplotypes with over 5% of frequency were used in statistical analyses. (c) Linkage disequilibrium (LD) coefficient (r2) among CDA polymorphisms. Scores in the boxes mean r2.

A total of 336 samples were used for the association study, consisting of 95 AMLs (50 NK-AML and 45 AK-AML) and 241 NC. The observed genotypes and allele frequencies for the CDA SNPs were in the Hardy–Weinberg equilibrium (P>0.05). Several CDA SNPs and haplotypes manifested a marginal association with AML when the entire AMLs and NC were analyzed (P=0.01–0.04; Supplementary Table 1). Four SNPs were also marginally associated with CR1D after chemotherapy was applied to the AMLs (Supplementary Table 2). However, when the patients were stratified according to karyotype status, the most significantly associated variant was found between NK-AML and NC (Table 1). In contrast, none of SNPs and haplotypes manifested any association between AK-AML and NC (Supplementary Table 3). The rs532545, rs2072671, rs471760, rs4655226, rs818194 and ht3 were strongly associated with the risk of NK-AML (P=0.03–0.0007; Table 1). The most significant association was discovered in the intronic SNP, rs4655226 (P=0.0007, Pcorr=0.03 and odds ratio=2.5; Table 1). Although the marginal association of the nonsynonymous SNP, rs2072671 (K27Q), completely disappeared after Bonferroni correction, rs4655226 maintained the significance (Pcorr=0.03) that was strongly linked with rs2072671 (r2=0.86). In addition, rs2072671 was reported as having significant associations with enzyme activity in previous studies.20, 26 Therefore, in subsequent statistical analysis, we selected the nonsynonymous SNP, rs2072671, as a most promising candidate.

Table 1 Allele test of 16 CDA SNPs and haplotypes with risk of NK-AML compared with NCs

In the logistic analysis, rs2072671 was found to be significantly associated with the risk of NK-AML compared with NC (P=0.009 and odds ratio=2.44). The frequencies of the AC and CC genotypes were higher in the NK-AML than that of the NC genotype (0.50 vs 0.28 in NK-AML vs NC; Table 2). Taking the subjects carrying the AA genotype for rs2072671 as a reference, the subjects carrying genotype AC showed an increased risk of NK-AML (P=0.008 and odds ratio=2.46).

Table 2 Association analysis of rs2072671 with NK-AML compared with NC and AK-AML

In the Cox regression analysis, rs2072671 (K27Q) was shown to have significant correlation with clinical outcomes of NK-AML after the first course of chemotherapy (Figure 2). NK-AML patients with AC or CC genotypes had worse OS (P=0.03, hazard ratio=1.84) in Figure 2a and CR1D (P=0.007, hazard ratio=3.24) outcomes compared with patients with the common homozygous AA genotype in Figure 2b. In contrast, rs2072671 genotypes were not associated with treatment response in AK-AML patients (Supplementary Figure 1).

Figure 2
figure 2

Differences in overall survival (OS) and first complete remission duration (CR1D) depending on rs2072671 (79 A>C). In normal karyotype acute myeloid leukemia (AML) patients, heterozygous AC and rare homozygous CC genotypes were significantly associated with a shorter OS and CR1D. (a) The mean OS was 5.5 vs 19 vs 29 months for CC vs AC vs AA, P=0.03. (b) The mean CR1D was 1.9 vs 13 vs 33 months for CC vs AC vs AA, P=0.007.

Discussion

CDA is an important enzyme responsible for catalyzing the irreversible hydrolytic deamination of cytidine into uridine nucleoside, one of the enzymes that regulate the homeostasis of the cellular pyrimidine pool.16 Therefore, CDA plays a crucial role in the metabolism of AraC, the pyrimidine analog drug for AML treatments, and variants of CDA may influence subsequent treatments through the alteration of gene expression and function. Our results showed several CDA SNPs and haplotypes have significant correlations with the increased risk of NK-AML.

The nonsynonymous SNP, rs2072671 (K27Q), is highly associated with the risk of NK-AML and has a strong relationship with the clinical outcomes of chemotherapy for NK-AML. The heterozygotes (AC) and minor allele homozygotes (CC) of rs2072671 A>C are more common in NK-AML than in AK-AML and NC. The AC and CC were also significantly associated with a shorter OS and CR1D in NK-AML patients. A study by Falk et al.20 reported that there was no significant difference in rs2072671 and rs532545 genotype distribution between NK-AML and HapMap-CEU reference (P>0.8). Moreover, these two SNPs were not associated with the treatment response of the entire NK-AMLs in the Swedish population. This discrepancy between the Swedish study and this study may be attributed to the allele frequency differences between the Korean and Swedish populations (Supplementary Table 4). In order to identify the ethnic differences between the 16 CDA SNPs, we analyzed the genotype distribution used in Fisher’s test among four ethnicities, including Korean, Japanese, Han Chinese and European-American populations, using Hapmap data (Supplementary Table 5). The distribution of rs532545, rs603412, rs2072671, rs471760, rs580032 and rs1689924 genotypes was significantly different between the Korean and European-American populations. These frequency differences between the CDA SNPs may reflect a distinct ethnic background between the Asian and European-American patients, and thus might affect the clinical phenotype of AML.

Karyotype abnormalities are appeared to be associated with multidrug resistance (MDR) gene regulation. MDR-positive phenotype is more often observed in patients with abnormal karyotype.27, 28 In AML patients with abnormal karyotypes, highly expressed MDR gene may be correlated with response to chemotherapy. In contrast, in NK-AML patients, without MDR overexpression, CDA gene may be one of major regulating factor.

A study by Vasile et al.29 indicated that the AA genotype of rs2072671 was associated with an improved progression-free survival in non-small-cell lung cancer patients treated with gemcitabine, and this was similar to our results. In previous studies, the AA variant of the major allele homozygote in rs2072671 was found to increase CDA enzyme activity and in vitro deamination of AraC.17, 18 The samples with heterozygous (AC) and minor homozygote (CC) genotypes in rs2072671 had significantly reduced CDA enzyme activity compared with samples for the common homozygous (AA) genotype.30 The decreased CDA activity may not maintain the cellular pyrimidine pool.31 It could instead increase the chance of mismatch during DNA replication. As such, point mutations would occur more frequently in patients with the AC and CC genotypes. Although NK-AML patients have normal karyotypes at the chromosomal level, they may have increased point mutations at the DNA level. The whole-exome sequencing method would be useful in verifying this hypothesis. However, we were unable to apply exome sequencing in this study. On the other hand, variants of CDA might influence to reduce the degree of DNA methylation. Farrell et al.32 suggested that pharmacological inhibition of CDA affected the lower level of DNA methylation. Other study group reported that NK-AML patients with C allele of rs2072671 and T allele of rs532545 showed a significantly less methylated pattern.20 Together, variants of CDA may influence the decreased DNA methylation level and alter the normal gene expression and downstream signaling in NK-AML patients. Another possible mechanism explaining the poor prognosis in NK-AML by the CDA variant is that the chemotherapeutic agent may be remaining at high concentrations in the blood as a result of the decreased CDA activity, thereby inducing the apoptosis of normal blood cells and increasing the cytotoxicity.33

In our study, the insufficient number of NK-AML cases is a major limitation. Further study will be required to validate the association in an independent population. In conclusion, our results strongly support CDA SNP as a potential prognostic marker for NK-AML patients. Among the several possibilities, the precise functional mechanisms underlying the harmful effects of the heterozygotes and the rare allele homozygotes of CDA rs2072671 on the clinical outcome of NK-AML still remain to be elucidated.