Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The Identification and Characterization of Novel N-glycan-based Biomarkers in Gastric Cancer

  • Long Liu ,

    Contributed equally to this work with: Long Liu, Bing Yan, Junlong Huang

    Affiliation Department of Traditional Chinese Medicine, Changhai Hospital, Second Military Medical University, Shanghai, China

  • Bing Yan ,

    Contributed equally to this work with: Long Liu, Bing Yan, Junlong Huang

    Affiliation Department of Traditional Chinese Medicine, Changzheng Hospital, Second Military Medical University, Shanghai, China

  • Junlong Huang ,

    Contributed equally to this work with: Long Liu, Bing Yan, Junlong Huang

    Affiliation Department of Nautical Medicine, Laboratory of Stress Medicine, Faculty of Naval Medicine, Second Military Medical University, Shanghai, China

  • Qunhao Gu,

    Affiliation Department of General Surgery, Yueyang Hospital, Shanghai University of Traditional Chinese Medicine, Shanghai, China

  • Lina Wang,

    Affiliation Department of Traditional Chinese Medicine, Changhai Hospital, Second Military Medical University, Shanghai, China

  • Meng Fang,

    Affiliation Department of Laboratory Medicine, Eastern Hepatobiliary Surgery Hospital, Second Military Medical University, Shanghai, China

  • Jianpeng Jiao,

    Affiliation Department of Traditional Chinese Medicine, Changzheng Hospital, Second Military Medical University, Shanghai, China

  • Xiaoqiang Yue

    yuexiaoqiang@163.com

    Affiliation Department of Traditional Chinese Medicine, Changhai Hospital, Second Military Medical University, Shanghai, China

Abstract

Background and Aims

To identify and validate N-glycan biomarkers in gastric cancer (GC) and to elucidate their underlying molecular mechanism of action.

Methods

In total, 347 individuals, including patients with GC (gastric cancer) or atrophic gastritis and healthy controls, were randomly divided into a training group (n=287) and a retrospective validation group (n=60). Serum N-glycan profiling was achieved with DNA sequencer-assisted/fluorophore-assisted carbohydrate electrophoresis (DSA-FACE). Two diagnostic models were constructed based on the N-glycan profiles using logistic stepwise regression. The diagnostic performance of each model was assessed in retrospective, prospective (n=60), and follow-up (n=40) cohorts. Lectin blotting was performed to determine total core-fucosylation, and the expression of genes involved in core-fucosylation in GC was analyzed by reverse transcriptase-polymerase chain reaction.

Results

We identified at least 9 N-glycan structures (peaks) and the levels of core fucose residues and fucosyltransferase were significantly decreased in GC. Two diagnostic models, designated GCglycoA and GCglycoB, were constructed to differentiate GC from control and atrophic gastritis. The areas under the receiver operating characteristic (ROC) curves (AUC) for both GCglycoA and GCglycoB were higher than those for CEA, CA19-9, CA125 and CA72-4. Compared with CEA, CA19-9, CA125 and CA72-4, the sensitivity of GCglycoA increased 29.66%, 37.28%, 56.78% and 61.86%, respectively, and the accuracy increased 10.62%, 16.82%, 25.67% and 28.76%, respectively. For GCglycoB, the sensitivity increased 27.97%, 35.59%, 55.09% and 60.17% and the accuracy increased 21.26%, 24.64%, 31.40% and 34.30% compared with CEA, CA19-9, CA125 and CA72-4, respectively. After curative surgery, the core fucosylated peak (peak 3) and the total core fucosylated N-glycans (sumfuc) were reversed.

Conclusions

The results indicated that the diagnostic models based on N-glycan markers are valuable and noninvasive alternatives for identifying GC. We concluded that decreased core-fucosylation in both tissue and serum from GC patients may result from the decreased expression of fucosyltransferase.

Introduction

Gastric cancer (GC) is the fourth most prevalent cancer and the third leading cause of cancer-related death, with an incidence of approximately 930,000; annually, it is responsible for over 700,000 deaths worldwide, and the five-year survival rate is 20-30% [1]. For patients with GC, survival is dictated by the pathologic stage of the disease at the time of diagnosis. Unfortunately, the common symptoms of GC are not specific to the disease, and early stage GC may not cause noticeable symptoms. Despite increasing knowledge of the molecular mechanisms that regulate malignant transformation and metastasis, the overall survival rate of patients with GC has not improved significantly. Several molecules have been recommended as GC biomarkers, including carcinoembryonic antigen (CEA) for postoperative surveillance, carbohydrate antigen 19-9 (CA19-9), carbohydrate antigen 125 (CA125) and carbohydrate antigen 72-4 (CA72-4) [2-6]. However, none of these tumor markers has demonstrated sufficient sensitivity or specificity for diagnosing GC at an early stage. Alternatively, gastroscopy increases the rate of definitive diagnoses, but the diagnostic value of gastroscopy is limited by cost, risks, and inconvenience. Therefore, there is an urgent need for the development of noninvasive biomarkers that enable the early detection of GC.

Increasing evidence suggests the alteration of N-linked glycan could be regarded as a potential biomarkers for diagnosing cancer. Previous studies observed a significant changes of N-linked glycan in different cancers and cancer cell lines which includes pancreatic cancer, breast cancer, prostate cancer, ovarian cancer and liver cancer [7-11]. Additional analysis in GC also uncovered that free complex-type N-glycans accumulated in MKN7 and MKN45 cell lines [12]. However, the fluctuation and variation of N-linked glycan in GC patients are still largely unknown.

In our previous studies using DNA sequencer-assisted/fluorophore-assisted capillary electrophoresis (DSA-FACE), we demonstrated that a branching α-1,3-fucosylated triantennary glycan and a biantennary glycan were highly specific and sensitive candidate HCC biomarkers [13]. In the current study, we utilized DSA-FACE to retrospectively profile serum N-glycans in samples from patients with GC or atrophic gastritis and from healthy individuals. We characterized the identified GC N-glycan markers with receiver operating characteristic (ROC) curves and validated the markers in prospective and follow-up cohorts. Our objective was to identify a promising biomarker for predicting and detecting GC with improved specificity and sensitivity.

Materials and Methods

1.1: Retrospective cohort

The study protocol was approved by The Chinese Ethics Committee of Human Resources at the Second Military Medical University. Written informed consent was obtained from the patients and the healthy controls.

In total, 247 patients with GC (n=138) or atrophic gastritis (n=109) were recruited between 2010 and 2012. The diagnoses for all the enrolled patients were histopathologically confirmed by 2 pathologists at Changhai Hospital, Second Military Medical University (Shanghai, China). In the control group, 128 age- and sex-matched healthy volunteers (disease-free) were enrolled during the same time period. The mean age and the sex distribution were matched in the 3 groups. A summary of the patient data is provided in Table 1. GC patients who received preoperative radiotherapy, chemotherapy, or chemoradiotherapy were excluded from the study.

Mean ± SD or No. (%)
CharacteristicControl (n=128)Atrophic Gastritis (n=109)GC (n=138)
Age, y50.10±5.9552.11±6.2951.01±6.01
Men75 (58.59)60 (55.05)70 (50.72)
CEA-positive 5 (3.91)23 (21.10)62 (44.93)
CA19-9-positive 11 (8.59)20 (18.35)53 (38.41)
CA125-positive 7 (5.47)10 (9.17)28 (20.29)
CA72-4-positive 8 (6.25)12 (11.01)22 (15.94)
TNM stage
I22 (15.94)
II28 (20.29)
III63 (45.65)
IV25 (18.12)

Table 1. Characteristics of the training group.

Abbreviations: GC, gastric carcinoma; SD, standard deviation.
CSV
Download CSV

A total of 60 patients (20 in each group) were randomly selected from the 3 groups described above for retrospective verification; the remaining patients (n=315) were included in the training set to construct the diagnostic model. During the 2 years of the study, 40 of the 138 patients with GC in the training group were monitored before and after curative surgery, based on availability. The training group included 118 patients with GC, 89 patients with atrophic gastritis, and 108 healthy controls.

Laboratory and clinical data for all the participants were obtained from clinical medical records, pathology reports, and personal interviews. The data collected included sex, age, and gastric cancer features (such as tumor location, histologic grade, depth of invasion, and lymph node metastasis). Serum samples were obtained before surgical resections; these samples were collected from whole blood using a standard protocol, centrifuged at 10,000xg for 20 minutes, and stored at -80°C. Disease progression in the GC patients was classified according to the seventh edition of The American Joint Committee [14]: 20 patients (16.95%) had stage I disease, 25 (21.17%) had stage II disease, 54 (45.76%) had stage III disease, and 19 (16.10%) had stage IV disease.

1.2: Prospective cohort

To evaluate the predictive value of the models established in the retrospective study described above, we prospectively investigated an additional cohort (n=60) of patients with GC (n=20) or atrophic gastritis (n=20) and of healthy individuals (n=20) from May 2012 to November 2012 at the same hospital. The procedures and strategies were the same as those described above.

1.3: Tissue samples

Tissue samples were obtained from 20 of the 138 GC patients; 2 slices, a tumor and an adjacent tissue sample, were taken. The tissue samples (approximately 1 cm3) were immediately frozen at -80°C and subjected to reverse transcriptase-polymerase chain reaction (RT-PCR) and lectin blotting. All the tissues were used in accordance with the Institutional Review Board Regulations of the Second Military Medical University.

1.4: Routine detection of tumor markers

Routine tumor marker assays were performed using standard methods and reagents. CEA and CA19-9 levels were determined on an Abbott I2000, and CA72-4 and CA125 levels were measured using a Roche Cobas E601. The cut-off levels recommended by the manufacturer for CEA, CA19-9, CA125 and CA72-4 were 5.0 μg/L, 39 U/L, 40 U/ml and 9.8 U/ml, respectively. The assays were performed in the Department of Laboratory Medicine, Changhai Hospital, the Second Military Medical University, Shanghai.

1.5: Serum protein N-Glycan profiling

Serum protein N-glycan analyses were performed as previously described [13]. Briefly, the N-glycans in 2 μL of serum were released from the proteins with peptide-N-glycosidase F (PNGase F) (New England Biolabs, Boston, MA) labeled with 8-aminonaphtalene-1,3,6-trisulphonic acid (Invitrogen, Carlsbad, CA). Sialic acid was removed using arthrobacter ureafaciens sialidase (Roche Bioscience, Palo Alto, CA), and the processed samples were analyzed using DSA-FACE technology on a capillary electrophoresis-based ABI3130 Genetic Analyzer (Applied Biosystems, Foster City, CA). The 9 highest peaks that were detected in all the samples (accounting for >90% of the total serum N-glycans) were analyzed using GeneMapper (Applied Biosystems). Each N-glycan structure was described numerically by normalizing its height to the sum of the heights of all the peaks, and we analyzed these data using SPSS 18.0 statistical software (SPSS Inc., Chicago, IL).

1.6: Tissue protein extraction and lectin blotting

The tissues were homogenized using a mortar and pestle and resuspended in lysis buffer containing a protease inhibitor cocktail (Roche Diagnostics, Meylan, France). The unlysed fraction was removed by centrifugation (twice at 12000xg for 10 minutes at 4°C). The concentration of soluble protein was determined using the BioRad assay (BioRad, Marnes-la-Coquette, France), and the samples were stored at -80°C.

In total, 25 μg of serum protein or 50 μg of protein extracted from frozen tissue was separated by 10% sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The gels were stained with CBB G250, or the proteins in the gel were transferred to a nitrocellulose membrane (Whatman/Schleicher & Schuell, Versailles, France) to detect core-fucosylated proteins. The membranes were blocked overnight at 4°C with 5% bovine serum albumin in Tris-buffered saline (TBS: 140 mM NaCl and 10 mM Tris-HCl) and then incubated for 1 hour at room temperature with 5 μg/mL of biotinylated lens culinaris agglutinin A (LCA) (Vector Laboratories, Burlingame, CA) in TBS containing 0.05% Tween-20 (TBST). After 4 washes of 10 minutes each with TBST, the membranes were incubated with IRDye 800CW Streptavidin (1:10,000; LI-COR Biosciences, Lincoln, NE) for 1 hour at room temperature, washed 4 times with TBST, and developed using the Odyssey Infrared Imaging System (LI-COR Biosciences). Purified albumin (Sigma, St. Louis, MO) was used as a negative control for the lectin blot.

1.7: Total RNA extraction from tissue and quantitative real-time PCR

RNA was extracted from frozen tissues using a Qiagen RNeasy mini kit according to the manufacturer’s instruction (QIAGEN GmbH, Hilden, Germany). The purity and concentration of the RNA were determined using a spectrophotometer (Eppendorf, Hamburg, Germany). cDNA was synthesized from 2 μg of total RNA using a reverse transcription reagent (Toyobo, Osaka, Japan). The primers were designed using Primer Express (Applied Biosystems), and the sequences are presented in Table 2.

GeneForward Primer (5’-3’)Reverse Primer (5’-3’)
Fut85'CCTGGCGTTGGATTATGCTCA 3'5'CCCTGATCAATAGGGCCTTCT 3'
GDP-Tr5'CTGCCTCAAGTACGTCGGTG 3'5'CCGATGATGATACCGCAGGTG 3'
GAPDH5'ATGGGGAAGGTGAAGGTCG 3'5'GGGGTCATTGATGGCAACAATA 3'

Table 2. Polymerase chain reaction primer pair sequences.

Abbreviations: A, adenosine; C, cytidine; G, guanosine; T, thymidine; Fut8, fucosyltransferase; GDP-Tr, guanosine diphosphate-fucose transporter
CSV
Download CSV

The cDNAs were amplified in an Applied Biosystems 7300 Real-Time PCR machine in a total reaction volume of 20 μL that contained 10 μL of 2X Fast SYBR Green Master Mix (Applied Biosystems, includes Fast-Start Taq DNA polymerase reaction buffer), the deoxynucleotide triphosphate mix (including deoxyuridine triphosphate, SYBR Green I dye, and MgCl2), and 2 μL of the primers for each gene (at a final concentration of 0.5 μΜ each). Each reaction was performed in triplicate. The PCR cycling conditions were as follows: denaturation at 95°C for 5 minutes followed by 40 cycles of 95°C for 15 seconds, 59°C or 55°C for 15 seconds, and 72°C for 45 seconds.

The relative expression of α-1,6-fucosyltransferase (Fut8) and guanosine diphosphate (GDP)-fucose transporter (GDP-fuc-Tr) in each sample was normalized to the expression of the GAPDH housekeeping gene by subtracting the threshold cycle (Ct) value of GAPDH from that of Fut8 or GDP-Tr (∆Ct). The fold difference was calculated by subtracting the ∆Ct of the test sample from that of the control sample to obtain the ∆∆Ct and subsequently the 2^-∆∆Ct. Threshold cycle values beyond 40 cycles were considered below the detectable level. Melt curves were obtained for each reaction to guarantee that a single product was amplified.

1.8: Statistical analysis

All the quantitative variables were expressed as the mean ± standard deviation, unless otherwise indicated. The quantitative variables were compared using Student’s t-test, analyses of variance, or nonparametric tests. Pearson’s correlation coefficients and the associated probabilities (P) were used to evaluate the correlations between the parameters; Spearman’s correlation coefficients were calculated for ordinal categorical variables. Novel glycan biomarkers were identified and characterized based on a forward stepwise logistic regression analysis. The diagnostic performance of individual biomarkers and of the diagnostic models was evaluated using ROC curve analysis. The sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy were calculated using optimal cutoff values selected from the ROC curves. All the reported P values are 2-tailed, and P values <0.05 were considered statistically significant. The statistical analyses were performed using SPSS 18.0 for Windows (SPSS Inc.).

Results

2.1: Different N-Glycan profiles in patients with GC or atrophic gastritis and in healthy controls

Using DSA-FACE technology, we examined the N-glycan profiles in patients with GC (n=118) or atrophic gastritis (n=89) and in healthy individuals (n=108). We quantified and statistically compared the peaks in the 3 groups. At least 9 N-glycan structures (peaks) were identified in all the samples (Figure 1).

thumbnail
Figure 1. A typical desialylated N-glycan profile of serum protein.

At least 9 peaks can be identified. Peaks 1, 2, 5 and 9 increased (red arrows), and peaks 3, 6 and 7 decreased (green arrows) in gastric carcinoma compared with normal controls. The structures of the N-glycan peaks are shown below the chart. The open circles indicate b-linked galactose; the triangles, a/b-1,3/6-linked fucose; and the solid circles, a/b-linked mannose.

https://doi.org/10.1371/journal.pone.0077821.g001

Callewaert et al and Liu et al previously published a structural analysis of these N-glycans [15,16]. The average relative abundance of these N-glycan structures is presented in Table 3. The abundance of the structures in peaks 1, 2, 3, 5, 6, 7, and 9 was significantly different in the GC, atrophic gastritis, and healthy control groups, indicating that different N-glycan patterns existed under various pathophysiological conditions. Compared with the healthy control group, peaks 1, 2, 5 and 9 were increased (P<0.05) and peaks 3, 6, and 7 were decreased in the GC group (P<0.001). The abundance of the structures in peaks 3, 5, 6, 7, and 9 was significantly different in the GC group compared with the atrophic gastritis group (Figure 2).

Means±SD
VariableControl (n=128)Atrophic Gastritis (n=108)GC (n=138)FP
Age, ya50.10±5.9552.11±6.2951.01±6.010.270.766
Peak 1a6.96±1.677.64±2.028.20±2.7510.09<0.001
Peak 2a1.09±0.341.20±0.381.31±0.493.120.045
Peak 3a6.28±1.636.47±1.385.76±1.2810.48<0.001
Peak 4a5.76±0.935.61±0.815.75±0.821.120.327
Peak 5a40.02±3.7539.60±3.7642.16±4.3314.94<0.001
Peak 6a21.40±2.6120.51±2.8516.99±2.9888.36<0.001
Peak 7a5.87±1.416.02±1.715.21±1.2111.15<0.001
Peak 8a7.94±1.587.59±2.247.46±1.822.240.108
Peak 9a2.33±0.922.59±1.554.00±1.7748.12<0.001
sumfucab47.37±4.4047.47±5.3043.21±5.9427.01<0.001

Table 3. N-Glycan profiling by DNA sequencer-assisted/fluorophore-assisted capillary electrophoresis.

Abbreviations: GC, gastric carcinoma; SD, standard deviation.
a analyses of variance
b Sumfuc represents the total abundance of α-1,6-fucosylated structures (the sum of peaks 1, 2, 3, 4, 6, and 7).
CSV
Download CSV
thumbnail
Figure 2. The changes in N-glycan levels for the 3 groups.

In the gastric cancer (GC) group, the levels (represented as 95% confidence intervals [CIs]) of agalacto, core-α-1,6-fucosylated biantennary glycan (NGA2F,peak1), core-α-1,6-fucosylated bisecting biantennary glycans (NGA2FB, peak2) and branching α-1,3-fucosylated triantennaries (NA3FB, peak 9) were modestly elevated (P<0.05) and those of bigalacto core-α-1,6-fucosylated biantennary glycans (NA2F, peak 6) were decreased.

https://doi.org/10.1371/journal.pone.0077821.g002

2.2: Construction and evaluation of a diagnostic model based on N-Glycan markers to differentiate gastric cancer patients from healthy controls

We evaluated the GC-related N-glycan alterations based on a logistic regression analysis. Logistic regression coefficients were utilized to estimate the odds ratios for each of the independent variables. The GCglycoA mathematic formula was constructed to differentiate the GC patients from the healthy controls (GCglycoA=-1.072+0.957*peak4-0.331*peak6+0.646*peak9). To assess the ability of GCglycoA, CEA, CA19-9, CA125 and CA72-4 to discriminate GC patients, we characterized the area under the ROC curves (AUC). Compared with CEA (AUC=0.74), CA19-9 (AUC=0.76), CA125 (AUC=0.72) and CA72-4 (AUC=0.67), GCglycoA more effectively distinguished GC patients from normal controls (AUC=0.88) in the training group (Figure 3A). Table 4 lists the sensitivity, specificity, PPV, NPV, and accuracy for the prediction of GC by CEA, CA19-9, CA125, CA72-4 and GCglycoA. CEA at the recommended value of 5.0 ng/mL had a sensitivity of 45.76%, 37 U/mL CA19-9 had a sensitivity of 38.14%, 40 U/mL CA125 had a sensitivity of 18.64%, and 9.8 U/mL CA72-4 had a sensitivity of 13.56%. An optimal cutoff value of -0.772 was selected for GCglycoA based on the ROC curve analysis. At this cutoff value, GCglycoA had a sensitivity of 75.42%, which is an increase in sensitivity of 29.66%, 37.28%, 56.78% and 61.86% compared with CEA, CA19-9, CA125 and CA72-4, respectively. The diagnostic accuracy of GCglycoA in differentiating GC patients from healthy controls increased by 10.62%, 16.82%, 25.67% and 28.76% compared with CEA, CA19-9, CA125 and CA72-4, respectively. When the model was applied to the retrospective validation group, the sensitivity increased 45.00%, 45.00%, 55.00% and 55.00% and the accuracy increased 15.00%, 17.50%, 20.00% and 20.00% compared with CEA, CA19-9, CA125 and CA72-4, respectively (Table 5).

thumbnail
Figure 3. Receiver operating characteristic (ROC) curve analyses for the prediction of gastric carcinoma (GC).

(A) The ROC analysis for distinguishing between GC and control subjects using an N-glycan marker-based GC diagnostic model (GCglycoA), CEA, CA19-9, CA125 or CA72-4. The areas under the ROC curve (AUCs) indicate the diagnostic power: CEA (0.74), CA19-9 (0.76), CA125 (0.72), CA72-4 (0.67) and GCglycoA (0.88). The diagnostic model was constructed by using forward stepwise logistic regression analysis:

GCglycoA = -1.072+0.957peak4-0.331peak6+0.646peak9. (B) The ROC analysis for distinguishing between GC and atrophic gastritis using the GCglycoB diagnostic model, CEA, CA19-9, CA125 or CA72-4. The AUCs indicate the diagnostic power: GCglycoB (0.82), CEA(0.65), CA19-9 (0.63), CA125 (0.69) and CA72-4 (0.64). The diagnostic model was constructed by using forward stepwise logistic regression analysis: GCglycoB=5.273-1.371peak2+0.781peak4-0.453peak6+0.221peak9.

https://doi.org/10.1371/journal.pone.0077821.g003

Cutoff valueActual status, No. of subjects
TestGC+GC-Sensitivity, %Specificity, %  PPV, %NPV, %Accuracy, %
CEA (5 ng/mL)GC+54545.7695.3791.5361.6869.47
GC-64103
CA19-9 (37 U/mL)GC+451038.1490.7481.8257.3163.27
GC-7398
CA125 (40 U/mL)GC+22718.6493.5275.8651.2754.42
GC-96101
CA72-4 (9.8 U/mL)GC+16813.5692.5966.6749.5151.33
GC-102100
GCglycoA (-0.77)GC+891675.4285.1984.7676.0380.09
GC-2992

Table 4. The Diagnostic Power for Differentiating Gastric Carcinoma from the Controls in the Training Group.

Abbreviations: + positive; - negative; GC, gastric carcinoma; GCglycoA, N-glycan marker-based gastric cancer diagnostic model A; NPV, negative predictive value; PPV, positive predictive value.
CSV
Download CSV
Cutoff valueActual status, No. of subjects
TestGC+GC-Sensitivity, %Specificity, %  PPV, %NPV, %Accuracy, %
CEA (5 ng/mL)GC+8040.00100.00100.0062.5070.00
GC-1220
CA19-9 (37 U/mL)GC+8140.0095.0088.8961.2967.50
GC-1219
CA125 (40 U/mL)GC+6030.00100.00100.0058.8265.00
GC-1420
CA72-4 (9.8 U/mL)GC+6030.00100.00100.0058.8265.00
GC-1420
GCglycoA (-0.77)GC+17385.0085.0085.0085.0085.00
GC-317

Table 5. The Diagnostic Power for Differentiating Gastric Carcinoma from the Controls in the Retrospective Verification Group.

Abbreviations: + positive; - negative; GC, gastric carcinoma; GCglycoA, N-glycan marker-based gastric cancer diagnostic model A; NPV, negative predictive value; PPV, positive predictive value.
CSV
Download CSV

The prospective evaluation indicated that the sensitivity improved 65.00%, 65.00%, 85.00% and 80.00% and the accuracy improved 30.00%, 27.50%, 37.50% and 37.50% compared with CEA, CA19-9, CA125 and CA72-4, respectively (Table 6).

Cutoff valueActual status, No. of subjects
TestGC+GC-Sensitivity,%Specificity,%PPV, %NPV, %Accuracy,%
CEA (5 ng/mL)GC+6130.0095.0085.7157.5862.50
GC-1419
CA19-9 (37 U/mL)GC+6030.00100.00100.0058.8265.00
GC-1420
CA125 (40 U/mL)GC+2010.00100.00100.0052.6355.00
GC-1820
CA72-4 (9.8 U/mL)GC+3115.0095.0075.0052.7855.00
GC-1719
GCglycoA -0.77GC+19295.0090.0090.4894.7492.50
GC-118

Table 6. The Diagnostic Power for Differentiating Gastric Carcinoma from the Controls in the Prospective Verification Group.

Abbreviations: + positive; - negative; GC, gastric carcinoma; GCglycoA, N-glycan marker-based gastric cancer diagnostic model A; NPV, negative predictive value; PPV, positive predictive value.
CSV
Download CSV

2.3: Construction and evaluation of a diagnostic model to differentiate gastric cancer from atrophic gastritis

Using a logistic regression model, another diagnostic model (GCglycoB) was established to differentiate between GC and atrophic gastritis: GCglycoB=5.273-1.371*peak2+0.781*peak4-0.453*peak6+0.221*peak9. The optimal cutoff value for GCglycoB (0.594) was selected based on the ROC curve analysis. In the training group, the AUC for GCglycoB was 0.82, whereas the AUCs for CEA, CA19-9, CA125 and CA72-4 were 0.65, 0.63, 0.69 and 0.64, respectively (Figure 3B). The diagnostic accuracy of GCglycoB in differentiating GC from atrophic gastritis increased 21.26%, 24.64%, 31.40% and 34.30% and the sensitivity increased 27.97%, 35.59%, 55.09% and 60.17% compared with CEA, CA19-9, CA125 and CA72-4, respectively (Table 7). In the validation group, the 85.00% accuracy of GCglycoB represented an increase of 40.00%, 45.00%, 65.00% and 55.00% and the 85.00% sensitivity indicated an increase of 22.50%, 22.50%, 30.00% and 30.00% compared with CEA, CA19-9, CA125 and CA72-4, respectively (Table 8).

Cutoff valueActual status, No. of subjects
TestGC+GC-Sensitivity, %Specificity, %  PPV, %NPV, %Accuracy, %
CEA (5 ng/mL)GC+541945.7678.6573.9752.2459.90
GC-6470
CA19-9 (37 U/mL)GC+451738.1480.9072.5849.6656.52
GC-7372
CA125 (40 U/mL)GC+22818.6491.0173.3345.7649.76
GC-9681
CA72-4 (9.8 U/mL)GC+16813.5691.0166.6744.2646.86
GC-10281
GCglycoB (0.594)GC+88873.7391.0191.5872.3281.16
GC-3081

Table 7. The Diagnostic Power for Differentiating Gastric Carcinoma from Atrophic Gastritis in the Training Group.

Abbreviations: + positive; - negative; GC, gastric carcinoma; GCglycoB, N-glycan marker-based gastric cancer diagnostic model B; NPV, negative predictive value; PPV, positive predictive value.
CSV
Download CSV
Cutoff valueActual status, No. of subjects
TestGC+GC-Sensitivity, %Specificity, %  PPV, %NPV, %Accuracy, %
CEA (5 ng/mL)GC+9445.0080.0069.2359.2662.50
GC-1116
CA19-9 (37 U/mL)GC+8340.0085.0072.7358.6262.50
GC-1217
CA125 (40 U/mL)GC+4220.0090.0066.6752.9455.00
GC-1618
CA72-4 (9.8 U/mL)GC+6430.0080.0060.0053.3355.00
GC-1416
GCglycoB (0.594)GC+17385.0085.0085.0085.0085.00
GC-317

Table 8. The Diagnostic Power for Differentiating Gastric Carcinoma from Atrophic Gastritis in the Retrospective Verification Group.

Abbreviations: + positive; - negative; GC, gastric carcinoma; GCglycoB, N-glycan marker-based gastric cancer diagnostic model B; NPV, negative predictive value; PPV, positive predictive value.
CSV
Download CSV

In the prospective group, GCglycoB was more accurate (85.00% vs. 30.00%, 30.00%, 10.00% and 15.00%) and sensitive (90.00% vs. 62.50%, 62.50%, 55.00% and 55.00%) than CEA, CA19-9, CA125 and CA72-4, respectively (Table 9).

Cutoff valueActual status, No. of subjects
TestGC+GC-Sensitivity, %Specificity, %PPV, %NPV, %Accuracy, %
CEA (5 ng/mL)GC+6130.0095.0085.7157.5862.5
GC-1419
CA19-9 (37 U/mL)GC+6130.0095.0085.7157.5862.5
GC-1419
CA125 (40 U/mL)GC+2010.00100.00100.0052.6355.00
GC-1820
CA72-4 (9.8 U/mL)GC+3115.0095.0075.0052.7855.00
GC-1719
GCglycoB (0.594)GC+17185.0095.0094.4486.3690.00
GC-319

Table 9. The Diagnostic Power for Differentiating Gastric Carcinoma from Atrophic Gastritis in the Prospective Verification Group.

Abbreviations: + positive; - negative; GC, gastric carcinoma; GCglycoB, N-glycan marker-based gastric cancer diagnostic model B; NPV, negative predictive value; PPV, positive predictive value.
CSV
Download CSV

2.4: Decreased levels of total core-fucosylated proteins in gastric cancer

The total level of core α-1,6-fucose residues (the sum of peaks 1, 2, 3, 4, 6, and 7) was significantly lower (P<0.001; Table 2) in GC patients than in atrophic gastritis patients and normal controls. To confirm this finding, we evaluated the concentration of core fucosylated proteins in the serum and tissue of patients with GC using LCA because it specifically recognizes glycoproteins with α-1,6-fucosylated-N-acetyl-D-glucosamine-asparagine (GlcNAc-Asp) in the trimannosyl core. In serum, there were fewer LCA-binding core fucose residues in the GC group than in the atrophic gastritis and normal groups (Figure 4A). The total core fucose abundance was lower in GC tumors than in paired adjacent tissue (Figure 4B). To determine whether the change in total core fucosylation in GC tumors was related to altered glycosylation biosynthesis, the abundance of mammalian α-1,6-fucosyltransferase (Fut8) and guanosine diphosphate transporter (GDP-Tr) in GC tumors and adjacent tissue was analyzed by RT-PCR. The results revealed that Fut8 mRNA expression was lower in tumors than in adjacent tissue (Figure 4C). However, there was no significant difference in guanosine diphosphate (GDP)-fucose transporter (GDP-fuc-Tr) gene expression between tumors and adjacent tissue (Figure 4D).

thumbnail
Figure 4. The abundance of total core fucosylated residues, α-1,6-fucosyltransferase (Fut8) and guanosine diphosphate (GDP)-fucose transporter (GDP-fuc-Tr) using lectin blotting and reverse transcription-polymerase chain reaction (RT-PCR).

(A) Lectin blots of serum proteins probed with lens culinaris agglutinin A (LCA). The horizontal axis represents the experimental groups: control (n=20), atrophic gastritis (n=20), and gastric carcinoma (GC) (n=20); each pool consists of 3 homogenous samples. The vertical axis indicates the ratio of fucosylated protein to total protein. The difference between the groups was statistically significant (P <0.001). (B) Lectin blotting of tissue proteins probed with LCA. The horizontal axis represents the experimental groups: tumor tissue (n=20) and adjacent tissue (n=20). The vertical axis indicates the ratio of fucosylated protein to total protein. The difference between the groups was not statistically significant (P >0.05). (C) The relative messenger RNA (mRNA) expression of Fut8 in tissue as measured by RT-PCR. The horizontal axis represents the experimental groups: tumor tissue (n=20) and adjacent tissue (n=20). The vertical axis indicates the relative expression of Fut8. The difference between the groups was statistically significant (P <0.001). (D) The relative mRNA abundance of GDP-fuc-Tr in tissue as measured by RT-PCR. The horizontal axis represents the experimental groups: tumor tissue (n=20) and adjacent tissue (n =20). The vertical axis indicates the relative abundance of GDP-fuc-Tr. The difference between the groups was not statistically significant (P >0.05).

https://doi.org/10.1371/journal.pone.0077821.g004

2.5: Changes in N-Glycan markers before and after curative surgery

The abundance of the structures in peaks 1, 4, and 9 was significantly different after surgery compared with before surgery (Table 10). The results indicated that the increases in the core fucosylated peak (peak 3) and total core fucosylated N-glycans (sumfuc) were reversed after surgery. Ten weeks after surgery, the GCglycoA value became negative in 11 of the 40 patients with GC, as did the GCglycoB value in 22 of the 40 patients with GC.

Mean±SD or Median (interquartile range)T -Test
VariableBefore SurgeryAfter SurgeryaT or ZP
Peak 1b8.92±3.748.49±3.342.0380.048
Peak 2b1.24±0.591.52±1.74-1.0510.300
Peak 3b5.96±1.526.14±1.33-0.6310.531
Peak 4b5.73±0.695.21±1.012.6020.013
Peak 5b40.81±4.8742.27±5.27-1.5130.138
Peak 6b16.74±2.9817.30±2.61-0.8090.423
Peak 7b5.28±1.505.50±1.14-0.7160.478
Peak 8b7.60±2.077.82±1.85-0.5860.562
Peak 9b4.11±1.853.29±1.692.0470.047
sumfucc43.88±7.7744.16±6.10-0.2170.830
GCglycoAd1.07 (0.49, 3.12)0.28 (-0.90, 1.14)-2.7820.005
GCglycoBd1.31 (0.56, 2.41)0.41 (-0.54, 1.19)-2.3790.017

Table 10. N-Glycan Profiling in Patients With Gastric Cancer Before and After Curative Surgery.

Abbreviations: GCglycoA and GCglycoB, N-glycan marker-based gastric cancer diagnostic models A and B, respectively; SD, standard deviation.
a Ten weeks after surgery.
b Paired-sample T test
c Sumfuc indicates the total abundance ofα-1,6-fucosylated structures.
d Wilcoxon signed-rank test.
CSV
Download CSV

2.6: Correlations between the diagnostic models and the clinical parameters

Currently, the tumor biomarkers CEA, CA19-9, CA125 and CA72-4 are widely used for the screening and monitoring of GC. Therefore, we determined the correlations between the individual N-glycan markers (including the diagnostic models), the tumor biomarkers and the TNM stage.

The results indicated that only the TNM stage was positively associated with GCglycoA (r=0.355, P <0.001) and GCglycoB (r=0.345, P <0.001) (Table 11). The abundance of the structures in peaks 6 and 9 was significantly different between the TNM stages (Table 12).

CorrelationCEACA19-9CA125CA72-4TNM StageGCglycoAGCglycoB
Peak 1
r0.210a0.127a0.006a-0.023a0.088a0.250b0.055b
P0.0160.1480.9440.7930.3160.0040.532
Peak 2
r0.009a0.038a0.011a-0.067a0.010a0.224b-0.099b
P0.9170.6670.9040.4450.9060.0100.258
Peak 3
r0.083a0.094a0.066a0.039a-0.115a-0.216b-0.346b
P0.3480.2860.4550.6540.1930.013<0.001
Peak 4
r-0.045a0.160a0.047a-0.006a-0.091a0.122b0.074b
P0.6130.0680.5940.9430.3010.1670.403
Peak 5
r-0.029a-0.156a-0.020a0.097a0.108a0.299b0.478b
P0.7430.0760.8160.2700.2180.001<0.001
Peak 6
r0.009a0.020a0.044a-0.114a-0.348a-0.757b-0.787b
P0.9210.8170.6180.197<0.001<0.001<0.001
Peak 7
r-0.039a0.089a-0.075a-0.161a-0.245a-0.368b-0.517b
P0.6580.3110.3940.0660.005<0.001<0.001
Peak 8
r-0.135a-0.058a0.002a-0.087a-0.035a-0.215b0.011b
P0.1240.5130.9840.3210.6920.0140.897
Peak 9
r0.028a-0.072a-0.034a0.095a0.315a0.794b0.591b
P0.7510.4130.7030.280<0.001<0.001<0.001
GCglycoA
r-0.033b-0.042b-0.012b0.046b0.355b10.869b
P0.7100.6370.8960.605<0.001<0.001
GCglycoB
r-0.052b-0.045b0.027b0.105b0.345b0.869b1
P0.5580.6110.7600.232<0.001<0.001
Sumfucc
r0.061a0.151a0.040a-0.108a-0.170a-0.353b-0.580b
P0.4860.0850.6510.2210.052<0.001<0.001

Table 11. The Correlation Between N-Glycan Markers and Pathobiochemical Parameters.

Abbreviations: r, correlation coefficient.
a Pearson’s correlation analysis
b Spearman’s correlation analysis
c Sumfuc indicates the total abundance ofα-1,6-fucosylated structures.
CSV
Download CSV
Mean ± SD
VariableStage I (n=22)Stage II (n=28)Stage III (n=63)Stage IV (n=25)FP
Peak 1a7.94±1.877.83±3.218.21±2.548.82±3.440.6000.616
Peak 2a1.21±0.381.24±0.511.23±0.421.67±0.491.1080.348
Peak 3a6.09±1.165.75±1.255.61±1.125.83±1.790.7700.513
Peak 4a5.81±0.855.97±0.745.63±0.795.78±0.961.1160.345
Peak 5a40.36±4.1042.25±4.4543.12±4.2141.31±4.282.6460.052
Peak 6a19.66±3.0917.13±2.4516.33±2.6815.99±2.789.258<0.001
Peak 7a5.72±1.175.30±1.055.14±1.254.82±1.222.3120.079
Peak 8a7.49±1.747.43±1.887.41±1.737.58±2.180.0490.986
Peak 9a2.67±1.583.97±1.804.17±1.334.84±2.276.834<0.001
Sumfucb46.43±4.7743.21±5.5542.14±6.9442.91±6.942.9440.036

Table 12. N-Glycan Profiling in Patients With Gastric Carcinoma Based on TNM Stage.

Abbreviations: SD, standard deviation.
a analyses of variance
b Sumfuc indicates the total abundance of α-1,6-fucosylated structures.
CSV
Download CSV

Discussion

Glycosylation is one of the most common post-translational modifications. Changes in N-linked glycosylation occur during the development of various diseases [17]. Among the various methods for identifying novel serum glycobiomarkers [18,19], DSA-FACE is a simple and efficient technology for measuring changes in N-glycans in biological fluids, such as serum. We previously used this technology to support the diagnosis of HCC and colorectal cancer [13]. In the current study, we used DSA-FACE to analyze N-linked glycans and to identify the specific N-glycan profile during the development of gastric cancer.

Clinically, there are no reliable tumor biomarkers for gastric cancer, but CEA, CA19-9, CA125 and CA72-4 are commonly used to screen for this disease. However, they do not provide sufficient sensitivity or reliability for early detection. CEA is a highly heterogeneous glycoprotein that contains 60% carbohydrate. Individual differences in glycoprotein patterns are expected. Therefore, we hypothesized that N-glycan profiling would be more effective than the detection of individual glycosylated molecules.

Based on the statistical analyses and forward stepwise logistical regression after performing serum DSA-FACE, we constructed 2 diagnostic models: GCglycoA (for distinguishing between GC and control) and GCglycoB (for distinguishing between GC and atrophic gastritis). In the training group, both models had a better diagnostic performance (accuracy >80.0%) than CEA, CA19-9, CA125 or CA72-4. We confirmed that the diagnostic values of GCglycoA and GCglycoB were greater than those of CEA, CA19-9, CA125 and CA72-4 in a retrospective validation group and in an independent prospective cohort.

In analyzing the abundance of individual N-glycan structures, we determined that the abundance of core-fucosylated structures decreased significantly in gastric cancer. This is similar to what occurs in colorectal cancer [20] but differs from what has been reported for other cancers, such as the increased core fucosylation of alpha-fetoprotein that was observed in HCC [21,22]. To validate our findings, we performed lectin blotting of tissue and serum samples. Using HCC serum as a positive control, we observed decreased N-glycan fucosylation in the lectin blots of GC serum samples. A similar trend was present in lectin blots of GC tissue.

Protein fucosylation is regulated by several fucosyltransferases, the GDP-fucose synthetic pathway, and GDP-fuc-Tr [23]. GDP-fuc-Tr is important for GDP-fucose transport. Core fucosylation is catalyzed by Fut8. We determined the expression of Fut8 and GDP-fuc-Tr to analyze the relevant steps for altering core fucosylation. The results indicated that Fut8 gene expression was significantly lower in tumors than in adjacent tissue and that GDP-fuc-Tr gene expression did not differ significantly between tumor and adjacent tissue. The results indicated that decreased Fut8 expression could be responsible for the reduced core 1,6-fucosylation in GC.

Previously, we determined that peak 9 (a branching α-1,3-fucosylated triantennary glycan, NA3Fb) was associated with the development of HCC and colorectal cancer [9,21]. Notably, peak 9 was also significantly increased in patients with gastric cancer. N-acetylglucosaminyltransferase V (GnT-V) is responsible for the increase in multiantennary glycans (NA3Fb, peak 9) [24]. This suggested that GnT-V might be involved in malignant transformation, and the molecular mechanism by which N-glycosylation patterns are altered in gastric cancer remains a critical challenge for future studies.

Conclusions

We concluded that serum N-glycan profile was altered during the development of gastric cancer. The serum N-glycan might aid clinicians to diagnose gastric cancer in its earliest stages and would also contribute to distinguish it from other similar disorders. Our study provided additional insight into the molecular and cellular biology of gastric cancer by confirming that glycosylation is involved in the pathogenesis of cancer. A future glycobiological approach will focus on the changes in glycosylation that accompany each step of cancer development and progression to improve the diagnosis, monitoring, and screening of gastric cancer.

Author Contributions

Conceived and designed the experiments: LL. Performed the experiments: LL BY JH QG LW MF JJ. Analyzed the data: LL BY JH XY. Contributed reagents/materials/analysis tools: XY. Wrote the manuscript: LL XY.

References

  1. 1. Parkin DM, Bray F, Ferlay J, Pisani P (2005) Global cancer statistics, 2002. CA Cancer J Clin 55: 74-108. doi:https://doi.org/10.3322/canjclin.55.2.74. PubMed: 15761078.
  2. 2. Ychou M, Duffour J, Kramar A, Gourgou S, Grenier J (2000) Clinical significance and prognostic value of CA724 compared with CEA and CAl9-9 in patients with gastric cancel. Dis Markers16: 105-110. doi:https://doi.org/10.1155/2000/595492. PubMed: 11381189.
  3. 3. Tocchi A, Costa G, Lepre L, Liotta G, Mazzoni G et al. (1998) The role of serum and gastric juice levels of carcinoembryonic antigen, CA19-9 and CA72-4 in patients with gastric cancer. J Cancer Res Clin Oncol 24: 450-455.
  4. 4. Gaspar MJ, Arribas I, Coca MC, Díez-Alonso M (2001) Prognostic value of carcinoembryonic antigen, CA19-9 and CA 72-4 in gastric carcinoma. Tumour Biol 22: 318-322. doi:https://doi.org/10.1159/000050633. PubMed: 11553862.
  5. 5. Carpelan-Holmström M, Louhimo J, Stenman UH, Alfthan H, Haglund C (2002) CEA, CA19-9 and CA72-4 improve the diagnostic accuracy in gastrointestinal cancers. Anticancer Res 22: 231l-2316.
  6. 6. Haga Y, Sakamoto K, Egami H, Yoshimura R, Mori K et al. (1986) Clinical significance of serum CA125 values in patients with cancers of the digestive system. Am J Med Sci 292: 30-34. doi:https://doi.org/10.1097/00000441-198607000-00006. PubMed: 3459357.
  7. 7. Nakano M, Nakagawa T, Ito T, Kitada T, Hijioka T et al. (2008) Site-specific analysis of N-glycans on haptoglobin in sera of patients with pancreatic cancer: a novel approach for the development of tumor markers. Int J Cancer 122: 2301-2309. doi:https://doi.org/10.1002/ijc.23364. PubMed: 18214858.
  8. 8. Saldova R, Reuben JM, Abd Hamid UM, Rudd PM, Cristofanilli M (2011) Levels of specific serum N-glycans identify breast cancer patients with higher circulating tumor cell counts. Ann Oncol 22: 1113-1119. doi:https://doi.org/10.1093/annonc/mdq570. PubMed: 21127012.
  9. 9. de Leoz ML, An HJ, Kronewitter S, Kim J, Beecroft S et al. (2008) Glycomic approach for potential biomarkers on prostate cancer: profiling of N-linked glycans in human sera and pRNS cell lines. Dis Markers 25: 243-258. doi:https://doi.org/10.1155/2008/515318. PubMed: 19126968.
  10. 10. Kim YG, Jeong HJ, Jang KS, Yang YH, Song YS et al. (2009) Rapid and high-throughput analysis of N-glycans from ovarian cancer serum using a 96-well plate platform. Anal Biochem 391: 151-153. doi:https://doi.org/10.1016/j.ab.2009.05.015. PubMed: 19457428.
  11. 11. Lattová E, McKenzie EJ, Gruwel ML, Spicer V, Goldman R et al. (2009) Mass spectrometric study of N-glycans from serum of woodchucks with liver cancer. Rapid Commun Mass Spectrom 23: 2983-2995. doi:https://doi.org/10.1002/rcm.4202. PubMed: 19685473.
  12. 12. Ishizuka A, Hashimto Y, Naka R, Kinoshita M, Kakehi K et al. (2008) Accumulation of free complex-type N-glycans in MKN7 and MKN45 stomach cancer cells. Biochem J 413: 227-237. doi:https://doi.org/10.1042/BJ20071562. PubMed: 18399796.
  13. 13. Fang M, Zhao YP, Zhou FG, Lu LG, Qi P et al. (2010) N-glycan based models improve diagnostic efficacies in HBV-related hepatocellular carcin AJCC cancer staging manual oma. Int J Cancer 127: 148-159. doi:https://doi.org/10.1002/ijc.25030. PubMed: 19904744.
  14. 14. Edge SB, Byrd DR, Compton CC et al. 7th1 ed. . New York: Springer-Verlag, 2009: 117-126.
  15. 15. Callewaert N, Van Vlierberghe H, Van Hecke A, Laroy W, Delanghe J et al. (2004) Noninvasive diagnosis of liver cirrhosis using DNA sequencer based total serum protein glycomics. Nat Med 10: 429-434. doi:https://doi.org/10.1038/nm1006. PubMed: 15152612.
  16. 16. Liu XE, Desmyter L, Gao CF, Laroy W, Dewaele S et al. (2007) N-glycomic changes in hepatocellular carcinoma patients with liver cirrhosis induced by hepatitis B virus. Hepatology 46: 1426-1435. doi:https://doi.org/10.1002/hep.21855. PubMed: 17683101.
  17. 17. Kobata A, Amano J (2005) Altered glycosylation of proteins produced by malignant cells, and application for the diagnosis and immunotherapy of tumors. Immunol Cell Biol 83: 429-439. doi:https://doi.org/10.1111/j.1440-1711.2005.01351.x. PubMed: 16033539.
  18. 18. Turnbull JE, Field RA (2007) Emerging glycomics technologies. Nat Chem Biol 3: 74-77. doi:https://doi.org/10.1038/nchembio0207-74. PubMed: 17235338.
  19. 19. Miyamoto S (2006) Clinical applications of glycomic approaches for the detection of cancer and other diseases. Curr Opin Mol Ther 8: 507-513. PubMed: 17243486.
  20. 20. Zhao YP, Ruan CP, Wang H, Hu ZQ, Fang M et al. (2012) Identification and assessment of new biomarkers for colorectal cancer with serum N-glycan profiling. Cancer 1: 1-12. PubMed: 21853445.
  21. 21. Miyoshi E, Moriwaki K, Nakagawa T (2008) Biological function of fucosylation in cancer biology. J Biochem 143: 725-729. PubMed: 18218651.
  22. 22. Li D, Mallory T, Satomura S (2001) AFP-L3: a new generation of tumor marker for hepatocellular carcinoma. Clin Chim Acta 313: 15-19. doi:https://doi.org/10.1016/S0009-8981(01)00644-1. PubMed: 11694234.
  23. 23. Okuyama N, Ide Y, Nakano M, Nakagawa T, Yamanaka K et al. (2006) Fucosylated haptoglobin is a novel marker for pancreatic cancer: a detailed analysis of the oligosaccharide structure and a possible mechanism for fucosylation. Int J Cancer 118: 2803-2808. doi:https://doi.org/10.1002/ijc.21728. PubMed: 16385567.
  24. 24. Pinho SS, Reis CA, Paredes J, Magalhães AM, Ferreira AC et al. (2009) The role of N-acetylglucosaminyltransferase III and V in the post-transcriptional modifications of E-cadherin. Hum Mol Genet 18: 2599-2608. doi:https://doi.org/10.1093/hmg/ddp194. PubMed: 19403558.