Study participants and design
The Shanghai Breast Cancer Study is a population-based case-control study conducted in urban Shanghai. Detailed study design and data collection procedures have been described elsewhere [
9]. Briefly, cases were permanent Shanghai residents between the ages of 25 and 64 years who were newly diagnosed with breast cancer between August 1996 and March 1998. Through a rapid case ascertainment system, supplemented by the population-based Shanghai Cancer Registry, 1,602 eligible breast cancer patients were identified, and 1,459 (91.1%) completed in-person interviews using a structured questionnaire. The initial cancer diagnoses for all patients were confirmed by two senior pathologists through a review of pathological slides. Information about clinical cancer characteristics, including TNM (tumor, node, metastasis) stage, treatment for cancer, and estrogen receptor (ER) and progesterone receptor (PR) status, was obtained by medical record review using a standard protocol. The major reasons for nonparticipation were refusal (109 cases; 6.8%), death before the patient could be interviewed (17 cases; 1.1%), and our inability to locate the patient (17 cases; 1.1%)
Eligible controls were randomly selected from the Shanghai Resident Registry, which contains demographic information for all residents of urban Shanghai, and were frequency-matched on age by 5-year intervals to the predetermined age distribution of the cases reported to the Shanghai Cancer Registry from 1990 to 1993. Of the 1,734 eligible controls, 1,556 (90.3%) completed interviews. The major reasons for nonparticipation of the eligible controls were refusal (166 controls, 9.6%) or death or a prior cancer diagnosis (2 controls, 0.1%).
The structured questionnaire used for this study included information on demographic factors, menstrual and reproductive history, hormone use, previous disease history, family history of cancer, physical activity, tobacco and alcohol use, and a quantitative food-frequency questionnaire. All participants were measured for current weight, circumferences of the waist and hips, and sitting and standing height. In addition to the in-person interviews and anthropometric measurement, 10 ml blood samples were collected from 1,193 (82%) cases and 1,310 (84%) controls. These samples were processed on the same day and stored at -70°C.
The methodology for the follow-up of cancer cases was described previously [
10]. All 1,459 cancer patients were followed through January 2003 with active follow-up and record linkage to the death certificates of the Vital Statistics Unit of the Shanghai Center for Disease Control and Prevention. In all, 1,290 (88.4%) patients successfully completed the follow-up interview either in person (
n = 1,241; 85%) or by telephone (
n = 49; 3.4%) between March 2000 and December 2002. Among them, 197 patients were deceased. Through interviews with patients – or, for deceased patients, next of kin – information on disease progress, recurrence of cancer, quality of life, and cause of death (if the patient had died) was obtained. For the remaining 169 participants, who could not be contacted in person or by phone, linkage to the death certificates was completed in June 2003. Forty deaths were identified through the linkage, and information on the date of death and cause of death was obtained. The remaining 126 subjects who had no match in the death registry were assumed to be alive on December 30, 2002, 6 months before the linkage in order to allow for a possible delay of entry of the death certificates into the registry. Four subjects had insufficient information for the record linkage and were excluded from survival analysis. Finally, 1,129 cases and 1,229 controls were included in the case-control comparison and and 1,125 cases were included in the survival analysis
Genotyping methods
Genomic DNA was extracted from buffy coat fractions using a Puregene® DNA Purification kit (Gentra Systems, Minneapolis, MN, USA) following the manufacturer's protocol. DNA concentration was measured by PicoGreen® dsDNA Quantitation Kit (Molecular Probes, Eugene, OR, USA). The allelic discrimination of the MMP12 gene A-82G and A1082G polymorphisms were assessed with the ABI PRISM 7900 Sequence Detection Systems (Applied Biosystems, Foster City, CA, USA), using the fluorogenic 5' nuclease assay with primers and probes obtained from ABI (Assay ID: C_15880589_10 and C_785907_10). PCR was performed in a total volume of 5 μl, which contained 2.5 ng DNA, 1 × TaqMan Universal PCR Master Mix, each primer at 900 nM, and each probe at 200 nM. The thermal cycling conditions were as follows: 95°C for 10 min to activate the AmpliTaq Gold enzyme, followed by 40 cycles of 92°C for 15s and 60°C for 1 min. The fluorescence level was measured with an ABI PRISM 7900HT Sequence Detector (Applied Biosystems), resulting in clear identification of three genotypes.
The laboratory staff was blind to the identity of the subjects. Quality control samples were included in the genotyping assays. Each 384-well plate contained four water, eight CEPH 1347-02 DNA, eight blinded quality control samples, and eight unblinded quality control samples. The concordances for the blinded samples were 98% for A-82G and 100% for A1082G polymorphisms, respectively. Genotypes for polymorphisms of A-82G in the MMP12 gene were successfully determined for 1,118 cases and 1,223 controls and those of A1082G for 992 cases and 976 controls.
Statistical methods
The χ2 test and t-test were used for comparing characteristics of cases and controls. Minor genotypes AG or GG of A-82G and A1082G were combined in stratified analysis because of the small number of subjects in each category. Odds ratios and 95% confidence intervals (CIs) were derived using unconditional logistic regression models. To evaluate the association of MMP12 with survival, Cox proportional hazard models were applied after adjusting for age, TNM stages, and ER/PR status. The proportional hazard assumption of the Cox model was examined by graphic evaluation of Schoenfeld's residual plot. All P values presented in this paper are two-sided. SAS software was used for statistical analysis (version 9.1; SAS Institute, Cary, NC, USA).