Background
Colorectal cancer (CRC) is one of the most common cancers and is the third leading cause of cancer-related death worldwide [
1,
2]. In Europe and the USA, CRC represents one of the main causes of cancer deaths [
1,
3]. In Asia, CRC is the fourth leading cause of mortality by cancer, and its incidence is increasing [
4,
5]. In recent years, the incidence of CRC is increasing in China, which accounts for about 6.5% of total cancers in urban areas and 4.6 % in rural areas [
6]. Previous epidemiological studies have identified dietary factors, such as consumption of meat, especially red meat, alcohol consumption, and cigarette smoking as possible risk factors for the development of CRC [
7,
8]. However, most individuals exposed to these known dietary risk factors never develop CRC while many CRC cases develop among individuals without those known risk factors, suggesting that other factors such as genetic factors also play important roles in colorectal carcinogenesis.
The xeroderma pigmentosum complementation group C (XPC) is one of the key members in the nucleotide excision repair (NER) pathway [
9]. The NER pathway is the primary mechanism for removal of adducts from DNA, and thus is an important part of the cellular defense against a large variety of structurally unrelated DNA lesions. The XPC binds to HR23B and forms the XPC-HR23B complex, which is involved in the DNA damage recognition and DNA repair initiation in the NER pathway and the binding of XPC to damaged DNA is the rate-limiting step for NER [
10‐
12]. The
XPC gene is located at chromosome 3p25 and contains 16 exons and 15 introns. There are at least 687 reported single nucleotide polymorphisms (SNPs) in the
XPC gene region (
http://www.ncbi.nlm.nih.gov/snp/). Among all the identified SNPs, Lys939Gln polymorphism has received much attention in recent years. It is a substitution of lysine for glutamine in exon 15 of the
XPC gene [
13], and the variant 939Gln allele have been reported to correlated with reduced DNA repair activity and increased cancer risk [
14‐
16].
Over the last two decades, several molecular epidemiological studies have evaluated the association between
XPC Lys939Gln polymorphism and CRC risk, but the results remain controversial and inconclusive. For genetic association studies that check candidate polymorphisms, sample size is an important influencing factor for study accuracy. Small sample size might have insufficient power to explore a true association of modest effect [
17], especially for complex multifactorial disease such as CRC [
18]. Combining data from all eligible studies by meta-analysis has the advantage of increasing statistical power and reducing random error and obtaining precise estimates for some potential genetic associations. Therefore, in this study, we conducted a quantitative meta-analysis including all eligible studies.
Methods
Search strategy
We searched Pubmed, Embase and Cochrane library databases for all articles on the association between
XPC Lys939Gln polymorphism and CRC risk using the following combined keywords: ‘xeroderma pigmentosum group C’, ‘XPC’, ‘colon cancer’, ‘rectal cancer’ and ‘colorectal cancer’. The latest search was done in December 2013, without any language restriction. Additional articles were identified through the references cited in the first series of articles selected. Articles included in the meta-analysis were in any language, with human subjects, published in the primary literature and had no obvious overlap of subjects with other studies. Among overlapping reports, only the studies with more information on origin of cases/controls were retained. The study was performed according to the proposal of Meta-analysis of Observational Studies in Epidemiology group (MOOSE) [
19].
Selection criteria
The following inclusion criteria were used for literature selection: (i) Case–control or cohort studies which evaluated the association between XPC Lys939Gln polymorphism and CRC risk; (ii) sufficient genotype data were presented to calculate the odds ratios (ORs) and 95% confidence intervals (95% CIs); (iii) control population did not contain malignant tumor patients. Major reasons for exclusion of studies were (i) review, or meta-analysis, or letter, or comment; (ii) duplicated studies, or studies without raw data we need; and (iii) studies that focused on HNPCC or FAP. Family–based studies of pedigrees with several affected cases per family were also excluded, because their analysis is based on linkage considerations.
Two authors (Qiliu Peng and Xianjun Lao) independently reviewed and extracted data from all eligible studies. Data extracted included the first author, year of publication, country of origin, ethnicity, genotyping method, matching criteria, source of control, CRC ascertainment, total numbers of cases and controls and genotype frequencies of cases and controls. Ethnic backgrounds were categorized as Caucasian, and Asian. Smoking status (smoker or nonsmoker) was additionally recorded for stratified analysis. Smokers included current smokers and former smokers. Nonsmokers had never smoked. Cancer location was divided into colon cancer and rectum cancer and was also additionally recorded for the stratified analysis. To ensure the accuracy of the extracted information, the two authors checked the data extraction results and reached consensus on all of the data extracted. If different results were generated, they would check the data again and have a discussion to come to an agreement. A third reviewer (Weizhong Tang) was invited to the discussion if disagreement still existed.
Quality score assessment
The quality of eligible studies was evaluated independently by two authors (Qiliu Peng and Xue Qin) according to a set of predefined criteria (Table
1) based on the scale of Thakkinstian et al. [
20]. The revised criteria cover the representativeness of cases, source of controls, ascertainment of CRC, total sample size, quality control of genotyping methods, and Hardy-Weinberg equilibrium (HWE) in the control population. Disagreements were resolved by consensus. Scores ranged from 0 (lowest) to 10 (highest). Articles with scores equal to or less than 6 were considered “low-quality” studies, whereas those with scores higher than 6 were considered “high-quality” studies.
Table 1
Scale for quality assessment
Representativeness of cases | |
Selected from cancer registry or multiple cancer center sites | 2 |
Selected from oncology department or cancer institute | 1 |
Selected without clearly defined sampling frame or with extensive inclusion/exclusion criteria | 0 |
Source of controls | |
Population or community based | 2 |
Both population-based and hospital-based/healthy volunteers/blood donors | 1.5 |
Hospital-based controls without colorectal cancer | 1 |
Cancer-free controls without total description | 0.5 |
Not described | 0 |
Ascertainment of colorectal cancer | |
Histological or pathological confirmation | 2 |
Diagnosis of colorectal cancer by patient medical record | 1 |
Not described | 0 |
Sample size | |
>1000 | 2 |
200-1000 | 1 |
<200 | 0 |
Quality control of genotyping methods | |
Clearly described a different genotyping assay to confirm the data | 1 |
Not described | 0 |
Hardy-Weinberg equilibrium | |
Hardy-Weinberg equilibrium in controls | 1 |
Hardy-Weinberg disequilibrium in controls | 0.5 |
No checking for Hardy-Weinberg disequilibrium | 0 |
Statistical analysis
Crude odds ratios (ORs) with 95% confidence intervals (CIs) were used to assess the association between the
XPC Lys939Gln polymorphism and CRC risk. We evaluated the
XPC Lys939Gln polymorphism and CRC risk using co-dominants (Gln/Gln vs. Lys/Lys and Gln/lys vs. Lys/Lys), recessive model (Gln/Gln vs. Gln/lys + Lys/Lys), and dominant model (Gln/Gln + Gln/lys vs. Lys/Lys). The Chi-square-based Q statistic test [
21,
22] was used to evaluate the between-study heterogeneity. If the result of the heterogeneity test was
P
Q
< 0.10, the pooled ORs were analyzed using the random-effects model [
23]. Otherwise, the fixed-effects model [
24] was selected. Subgroup analyses were performed by ethnicity, source of control, cancer location, smoking, and study quality. Sensitivity analysis was conducted by sequential omission of individual study to assess the robustness of the results. Publication bias was assessed using a Begg’s funnel plot and Egger’s regression asymmetry test [
25]. If publication bias existed, the Duval and Tweedie non-parametric “trim and fill” method was used to adjust for it [
26]. The distribution of the genotypes in the control population was tested for HWE using a goodness-of-fit Chi-square test. All analyses were performed using Stata software, version 12.0 (Stata Corp., College Station, TX). All
P values were two-sided. To ensure the reliability and the accuracy of the results, two authors entered the data into the statistical software programs independently with the same results.
Discussion
Maintenance of genomic integrity by DNA repair genes is an essential component of normal cell homeostasis necessary to cell growth, differentiation, and apoptosis [
37,
38]. Increasing evidence indicating that reduced DNA repair capacity, due to various DNA repair gene polymorphisms, is associated with increased risk and susceptibility to human solid tumors [
16,
39,
40]. XPC is a key member in the NER pathway. It is involved in the recognition and initiation of the genome repair of NER pathway [
10,
41,
42]. Polymorphisms in the
XPC gene may alter DNA repair capacity of the NER pathway, which further play a critical role in carcinogenesis [
43]. To date, several epidemiological studies have evaluated the association between
XPC Lys939Gln polymorphism and CRC risk, but the results remain inconclusive. Meta-analysis has been recognized as an important tool to more precisely define the effect of selected genetic polymorphisms on the risk for disease and to identify potential important sources of between-study heterogeneity [
2]. To derive a more precise estimation of the relationship, we performed this meta-analysis. Our meta-analysis based on 8 case–control studies suggested that the
XPC Lys939Gln polymorphism contributes to increased CRC susceptibility.
In subgroup analysis by ethnicity, statistical significant increased CRC risk was detected in Asians. However, no significant association was found in Caucasians. Actually, it might not be uncommon for the same polymorphism playing different roles in cancer susceptibility among different ethnic populations, because cancer is a complicated multi-genetic disease, and different genetic backgrounds may contribute to the discrepancy. Nevertheless, owing to the limited number of relevant studies included in the meta-analysis, the observed association between XPC Lys939Gln polymorphism and CRC risk is likely to be caused by chance because study with small sample sizes may have insufficient statistical power to detect a slight effect or may have generated a fluctuated risk estimate. Currently there were only four studies on XPC Lys939Gln polymorphism and CRC risk for Asian populations, and Caucasian populations, respectively. Therefore, the observed results of our study should be interpreted with caution.
When stratified according to the quality score of the articles, statistical significant increased CRC risk was observed in high quality studies but not in low quality studies. The possible reason for the discrepancy may be that the existence of selection bias and recall bias in the low quality studies. In addition, genotyping methods without quality control in studies of low quality should be also considered when deciphering these inconsistent results. In subgroup analysis according to the source of control, statistical significant increased CRC risk was found in both population-based studies and hospital-based studies. However, the ORs and 95% CIs differ largely in these two subgroups. The reason may be that the hospital-based studies have a high risk of producing unreliable results because hospital-based controls may not always be truly representative of the general population. Therefore, a methodologically preferable design, such as using a proper and representative population-based high quality study, is of great value in case–control studies.
It is well established that the carcinogenesis of CRC is a result of the interaction between environmental factors and genetic background [
18,
44]. Besides the role of genetic variants, smoking behavior shows an important effect on the CRC susceptibility. It has been reported that smoking increased CRC risk threefold [
36]. It is thought that smoking increased cancer risk due to chemicals such as hydrocarbons, arylamines, nitrosamines, and the formation of reactive oxygen species as by-products of the above compounds [
45] that are known to induce bulky adducts, base damage, and DNA strand breaks. DNA repair mechanisms are paramount in correcting the changes on DNA and provide unmutated DNA while replication goes on [
46]. The XPC protein plays a crucial role in repairing the DNA damage caused by tobacco smoke. Individuals with the
XPC variant genotype may possess deficient DNA repair capability. Accordingly, the XPC protein product may be less efficient in repairing the DNA lesions induced by tobacco smoke, and thereby could enhance the susceptibility, favoring the development of CRC. Therefore, we carried out subgroup analysis according to smoking status. Our results showed an increased CRC risk in nonsmokers but not in smokers, which was inconsistent with the hypothesis above. The results should be interpreted with caution because of the limited numbers of the original studies. Therefore, further studies concerning stratification for smoking are needed to increase power for the association estimation.
Some possible limitations in this meta-analysis should be acknowledged. First, in subgroup analysis by ethnicity, the included studies regarded only Asians and Caucasians. Data concerning other ethnicities such as Africans were not found. Thus, additional studies are warranted to evaluate the effect of this functional polymorphism on CRC risk in different ethnicities, especially in Africans. Second, our results were based on unadjusted estimates. We did not perform the analysis adjusted for other covariates such as age, drinking status, environment factors, and so on, because of the unavailable original data of the eligible studies. Third, in subgroup analyses by ethnicity, cancer location, and smoking status, the sample size of the subgroups was relatively small for stratified analyses, which may lead to relatively weak power to detect the real relationship.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Competing interest
All authors declare that they have no competing interest in relation to this study.
Authors' contributions
QP, XQ performed the literature search, data extraction, statistical analysis and drafted the manuscript. QP, XL, WT, ZC, and RL participated in data extraction. QP, XQ, ZC, SL supervised the literature search, data extraction, statistical analysis and drafted the manuscript. All authors read and approved the final manuscript.