Introduction

The incidence rates of nasopharyngeal carcinoma (NPC) are low throughout most of the world, typically less than 1 per 100 000.1 However, NPC is common in Southeast Asia and Northern African countries. Moreover, NPC is among the leading causes of death in Cantonese and several other southern Chinese populations.2 In fact, the incidence rate per 100 000 for males is higher than 40 in some Cantonese-speaking populations in Guangdong province.

Familial clustering of NPC has been widely documented in the Chinese population.3,4,5,6,7,8,9 One rare documented example7 is one family living in Nanhai County of Guangdong Province having 15 NPC affected family members, two family members affected with hepatic cell carcinomas and one with breast cancer. In southern Chinese populations, several case–control studies have confirmed that the proportion of NPC cases with a positive first-degree family history is greater than 5%, with similar proportions found in high-risk areas such as Hong Kong (7.2%), Yulin (6.0%), and Guangzhou (5.9%).3,4 All of these data suggest an important role for familial patterns of inheritance in NPC pathogenesis. Complex segregation analysis has been successfully used to define the model of inheritance in esophageal and colorectal cancers.10,11 However, to date, there have been no large-scale complex segregation analyses to elucidate the genetic mechanisms underlying carcinogenesis in NPC patients.

This study aims to investigate the fit of single gene, polygenic and multifactorial models to the observed pattern of transmission of NPC in a hospital-based family study conducted by Cancer Center of Sun Yat-Sen University (CCSYSU). A total of 1903 Cantonese pedigrees ascertained through probands treated at the CCSYSU were enrolled, and complex segregation analysis was performed using the program POINTER.12

Subjects and methods

From January 1999, we interviewed Cantonese NPC patients receiving treatment in the Department of NPC at CCSYSU, which is the largest center for cancer prevention and treatment in southern China. After we received permission from their physicians, we contacted the patients regarding participation in the study and we then interviewed all consented patients. By June 2001, 1903 Cantonese cases were enrolled in the study.

Each patient's diagnosis was pathologically confirmed as NPC. Each consenting NPC patient was regarded as a proband, and all families were ascertained through a single proband, with the exception of two families for which two probands were independently ascertained. In these two cases, where more than one family member was being treated for NPC, a single questionnaire was administered to avoid duplication of family information.

A structured questionnaire was administered to the proband of each family, where patients were asked to provide verbal answers. Information was gathered on the probands themselves, as well as the family history of three successive generations (first- and second-degree relatives). Whenever probands were unsure of their answers, we interviewed additional family members to ensure the accuracy of the information. Questions included the proband's contact information, date of birth, sex, date of diagnosis, birthplace, birthplace of grandparents, his or her physician, dialect and other general characteristics. The family history included any incidence of cancer in the proband's relatives, including cause and date of death for deceased relatives, status of cancer occurrence, type of cancer, date of diagnosis, hospital and records of diagnosis. The age and gender information of the probands are summarized in Table 1.

Table 1 Descriptive information for the NPC probands

Complex segregation analysis was performed using the unified mixed model.12 The mixed model assumes that a phenotype is influenced by the additive and independent effect of a major gene, together with a multifactorial component (genetic and environmental) and a random environmental effect. Pedigrees were partitioned into nuclear families and analyzed using the program POINTER under joint likelihood.12 Nuclear families, including all first- and second-degree relatives in the sample, were coded and totaled 3747 families.

The ascertainment probability, as used in POINTER, is ∼0 if the probability of ascertaining a family increases in proportion to the number of affected offspring (single selection) and close to 1 if the probability of ascertaining a family is independent of the number of affected offspring (complete selection). Where appropriate, first-degree relatives of the proband were partitioned into nuclear families containing the proband as a parent (complete selection) and as a child (incomplete selection). Nuclear families of second-degree relatives containing affected individuals were coded for truncate selection (ascertainment probability=1). For incomplete selection only two families in this large sample had more than one independently ascertained proband, suggesting a relatively low ascertainment probability, but not single selection. We therefore used a value of 0.1 but we also examined all models while varying the ascertainment probability over the range 0.001–0.2, and the results were found to be highly robust to changes in the specified ascertainment model (data not shown).

All individuals with known age (at the time of ascertainment) were assigned to liability classes according to age and gender. Liability classes and their frequencies were based on the observed cumulative risk of NPC over 10-year age intervals from a Hong Kong cancer database over the period 1990–1994.13 Such information is not available in the Cantonese population in GuangDong, and the Hong Kong Cantonese population is an appropriate representative population to GuangDong Cantonese.

The POINTER program permits the construction of four male and four female liability classes, which describe age-specific risks. Therefore, eight classes were formed according to the age ranges given in Table 2. Cumulative incidence figures, to the mid-point of each class, were calculated given the rates per 100 000 as described in Lee et al (2003).14 As any gene causing specific mortality must decrease in frequency with age, we determined risks using the approach described by Iselius et al.14 Using the mortality figures from Lee et al,13 we calculated cumulative mortalities (Table 2) and risk, R, attributed to the jth liability class as: Rj=(Ij–Mj−1)/(1–Mj−1), where Ij is the cumulative incidence to the mid-point and Mj−1 is the cumulative specific mortality to the end of the preceding class. Although some data on age at the onset of disease are available, the data are incomplete and possibly subject to error because of the difficulty in defining onset. Further studies might usefully distinguish early and late onset as a route to discriminating between genetic and environmental etiology.

Table 2 Gender-specific cumulative incidence, mortality and risk

The parameters estimated in the analysis are major locus dominance (d, where d=0 refers to a recessive major gene, d=0.5 additive and d=1 dominant), difference on the liability scale between homozygotes (t), and gene frequency (q).12 There are two parameters to test multifactorial heritability. The parameter H represents polygenic heritability in children, where H=Ck/V, in which Ck is the multifactorial component and V is the overall variance. The second parameter is Z for which HZ represents the multifactorial heritability in parents, where Z=Ca/Ck, the ratio of the multifactorial component in adults and children. Significant deviation of Z from 1 suggests a generational difference in multifactorial heritability. To test competing hypotheses, minus twice log likelihood (−2 ln L+C), calculated under a more general model, is subtracted from that for a reduced model. The difference in large samples is distributed as a χ2 with a number of degrees of freedom equal to the number of free parameters. For non-nested models, Akaike's Information Criterion (AIC) is used. AIC is computed as −2 ln L+C plus twice the number of estimated parameters. The model with the lowest AIC has the most parsimonious fit to the data.

Results

The results in Table 2 show gender-specific cumulative incidence, mortality and risk of NPC in the study population. The results from POINTER (Table 3) provide overwhelming support for a model that does not include major gene inheritance but reflects a polygenic/multifactorial pattern. The difference in AIC between the best-fitting single gene model (additive) and the polygenic model is 81.4. Furthermore, support for a multifactorial model over a polygenic model has a χ2 (1 df) of 35.4. The difference in AIC between the multifactorial model and the best-fitting single gene model (additive) is 114.8. Attempts to fit the full mixed model (major genes plus a polygenic component) always resulted in elimination, through bounding, of the major gene so that only the polygenic component remained.

Table 3 Results of POINTER segregation analysis of NPC

The results of the present study strongly support a multifactorial mode of inheritance with no evidence for a substantial role of major genes in the inheritance of NPC. Our results are consistent with various epidemiological and genetic studies. Below, we will briefly review a body of evidence for both environmental and genetic factors for NPC based on previous findings.

Discussion

EBV infection

Epstein–Barr virus (EBV) is a ubiquitous herpes virus that infects more than 90% of the human population and establishes a lifelong viral persistence in the host. EBV has been consistently identified as an important risk factor for NPC, with a dose–response relationship between EBV antibodies and NPC risk.15,16,17,18

Cigarette smoking

A recent IARC monograph (2002)19 reviewed existing evidence and established cigarette smoking as a risk factor for NPC based on the consistent findings of increased risk in both high- and low-risk geographic regions for NPC, the presence of a dose–response relationship with duration or amount of smoking, and a reduction in risk after quitting.

Salted fish consumption

Cantonese-style salted fish, which contains high levels of nitrosamines, has long been thought to be a risk factor for NPC.8,20,21,22,23,24,25,26 Together, these studies have suggested earlier first exposure, longer duration of consumption, higher frequency of consumption and steaming the fish instead of frying or grilling increases the risk for NPC. More recently, Ward et al27 confirmed that nitrosamine and nitrite intake during childhood may play a role in the development of NPC.

Fruits and vegetables

The association between NPC and fruit and vegetable intake depends strongly on whether the fruits and vegetables are fresh or preserved. As reviewed by Yu and Yuan,8 a number of studies have reported a protective effect of higher intake of fresh fruits and vegetables. However, the findings are not always consistent. On the contrary, increased risk of NPC was observed in association with frequent adult consumption of preserved fruits and vegetables.

Genetic studies

HLA Bw46 has been shown to be associated with increased risk of NPC.28,29 Hildesheim et al30 reported an association of HLA class I and II alleles (HLA-A*0207 and HLA-B*4601) and extended haplotypes with NPC in a well-designed case–control study conducted in Taiwan. Further, a meta-analysis of 13 HLA association studies in southern China performed by Goldsmith et al31 reported evidence for positive associations between NPC and the HLA alleles A2, B14 and B46 and negative associations for the alleles A11, B13 and B22. Genetic linkage study has been used as one of the most common approaches for identification of cancer susceptibility genes.32 More recently, a genome-wide search was conducted on a series of high-risk NPC families from Cantonese-speaking populations in the Pearl River area, Guangdong Province. Both multipoint parametric and nonparametric linkage analysis supported evidence for linkage.33 The inclusion of EBV antibody titer as a covariate in the nonparametric linkage analysis gave stronger linkage signals.

The results presented here concord with those reported by Chen et al.34 They conducted a community-based case–control study with 347 cases and controls in Taiwan and suggested that EBV infection is a risk factor for NPC and the effect of cigarette smoking on NPC is modified by age. They estimated the heritability of NPC as 0.60 based on a multifactorial inheritance model.

Results from this study suggest that NPC is a multifactorial malignancy with complex etiology involving both genetic and environmental factors. The penetrance of the susceptibility genes could be low, just like in breast cancer.35 It has become apparent that, for a multifactorial malignancy, family-based genome-wide linkage studies may have their limitations, particularly in the absence of accurately measured relevant environmental factors. One of our future goals is to obtain more complete epidemiological data including EBV infection, smoking, consumption of salted fish, and preserved vegetables on NPC families in addition to the biological material that we have collected. We will also take advantage of newly developed genetic epidemiological tools such as a covariate-based linkage analysis method36 to test for linkage between the NPC risk and genetic markers that are biologically plausible, allowing for potential interaction between these genes and environmental factors. It has been shown by Goddard et al37, Olson et al38 and Devlin et al39 that power to detect linkage increases when a covariate that measures heterogeneity is included in the analysis model. We are in the process of setting up a long-term collaboration among clinicians, radiologists, epidemiologists, molecular biologists as well as statistical geneticists to achieve a collection of a large number of families, documented with confirmed diagnosis, age of diagnosis, history of EBV infection, cigarette smoking and dietary factors. We remain hopeful that dissecting the complex etiology for a multifactorial trait such as NPC is feasible.