Background
Lung cancer is the leading cause of cancer-related death and the global five-year survival rate is only 10% to 15%. Lung cancer is also more variable in its biological behavior and can be divided into two histological groups: small-cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC). NSCLC which accounts for approximately 80% of all lung cancers, includes adenocarcinoma (AC), squamous cell carcinoma (SCC) and large-cell carcinoma. The incidence of adenocarcinoma appears to be increasing globally.
Cancer is the result of the accumulation of genetic and epigenetic alterations, which in turn means there are different profiles of gene expression in various lung cancers [
1]. Studying differentially expressed genes between cancer and normal tissues is crucial to understanding carcinogenesis and identifying novel biomarkers for cancer [
2,
3]. Several technologies are available to obtain profiles of the differentially expressed genes: representational difference analysis, serial analysis of gene expression, oligonucleotide microarrays, or suppression subtractive hybridization (SSH) [
4], among others. SSH is a polymerase chain reaction (PCR)-based cDNA subtraction technique that allows selective amplification of target cDNA while simultaneously suppressing non-target cDNA amplification. The cDNA library generated by hybridization and subtraction techniques reduces abundantly expressed housekeeping genes or genes commonly expressed in both control and treated individuals, thereby normalizing the cDNA expression profiles during library construction [
4]. As a result, this technique significantly enhances the chances of differentially expressed genes [
5]. SSH has been successfully applied to a wide variety of malignant diseases including lung cancer for the generation of cDNA libraries [
6‐
10].
In previous studies, samples were obtained from either culture cells or tissues from different individuals. The inherent problem in this sampling was that the SSH library generated using cultured cells may provide some incorrect information, because genes could have varied or mutated. Similarly, the SSH library constructed using tissues of different individual leads to the problem that the differentially expressed genes in various individuals could not be neutralized during hybridization, and these genes could be incorrectly deemed as being cancer-related. To correct these shortfalls, we used lung AC tissue and its adjacent nonmalignant lung tissue to establish two cDNA libraries by SSH, to obtain more accurate information of differentially expressed genes in lung ACs.
After genome BLAST, 177 up-regulated and 59 down-regulated genes in lung ACs were obtained from the forward-subtracted library (FSL) and the reverse-subtracted library (RSL), respectively. Further bioinformatic analysis demonstrated that these genes were involved in a wide range of cellular functions. The vast majority of these genes were newly identified to be abnormally expressed in lung cancer. Subsequently, we selected 16 differentially expressed genes to investigate their mRNA levels on lung cancer tissue samples as the first stage of screening. According to real-time RT-PCR analysis,
DDR1,
HSP90B1,
SDC1,
RPSA,
ERGIC3, and
LPCAT1 were up-regulated significantly in NSCLCs, while
GPX3,
TIMP3 were down-regulated significantly. ERGIC3 is located in endoplasmic reticulum and Golgi apparatus of NRK cells [
11], however, the function of ERGIC3 is unclear in lung cancer. Therefore, expression of ERGIC3 in NSCLCs was further confirmed at the protein level by western blot and immunohistochemistry analysis and we studied the pathophysiological functions of
ERGIC3.
Methods
Patients and tissue samples
The primary tumors and adjacent nonmalignant lung tissues were obtained at the time of surgery and quickly frozen in liquid nitrogen. No patients were treated before undergoing surgical resection. The adjacent nonmalignant lung tissues which were away from the cancer tissues at least 5 cm, did not contain cancer cells but usually appeared inflammatory response and fibrosis. Pathological diagnosis was based on light microscopy according to the World Health Organization classification [
12]. Tumors were staged according to TNM criteria published by the International Union Against Cancer in 1997 [
13]. Tumor regions selected for RNA and protein isolation contained a tumor cellularity greater than 60%. The use of all of the human tissue samples and the experimental procedures for this study were reviewed and approved by the Tumor Hospital of Yunnan Province and Kunming Institute of Zoology. All researches were carried out according to the Helsinki Declaration.
Cell culture
Six lung cancer cell lines and an immortalized human bronchial epithelial cell line (BEAS-2B) were used. A549 (AC), 801-D (large cell carcinoma), NCI-H446 (SCLC), and BEAS-2B were obtained from Cell Bank of Kunming Institute of Zoology, Chinese Academy of Sciences (CAS, Kunming, China); SPCA-1 (AC) was purchased from the Cell Bank of Type Culture Collection, CAS (Shanghai, China); EPLC-32M1 (SCC) and GLC-82 (AC) were obtained from German Cancer Research Center (Heidelberg, Germany) and Chinese National Cancer Institute, Chinese Academy of Medical Sciences (Beijing, China), respectively. The BEAS-2B cell line was fed with DMEM, while the other cell lines were cultured with RPMI 1640 containing 10% fetal bovine serum (FBS) and maintained in a humidified incubator with 5% CO2 at 37°C.
Construction of the subtractive cDNA library
To construct the SSH library, we used a tumor tissue and its adjacent nonmalignant lung tissue derived from one patient with well-differentiated lung adenocarcinoma. The tumor tissue contained a tumor cellularity greater than 80%. The adjacent nonmalignant lung tissue was away from the cancer tissues at least 10 cm and appeared completely normal histological structure. Total RNA was isolated using TRIzol (Invitrogen Corp., Carlsbad, CA, USA). Poly(A)+ RNA was isolated using the PolyATtract® mRNA isolation systems (Promega Corp., Madison, WI, USA). The cDNA synthesis and subtraction were performed using the PCR-select™ cDNA subtraction kit (Clontech, Palo Alto, CA, USA). Using the cDNA of lung adenocarcinoma tissue as tester and its adjacent nonmalignant lung tissue as driver, we generated the FSL which represented up-regulated transcripts. Using the cDNA of lung adenocarcinoma tissue as driver and its adjacent nonmalignant lung tissue as tester, the RSL which represented down-regulated transcripts, was constructed. The subtracted cDNA fragments obtained in each experiment were cloned into the pGEM-T Easy vector (Promega Corp.) and used to transform E.coli strain DH5α cells. All the transformants were isolated from white colonies on X-gal/isopropyl-beta-D-thio-galatopyranoside agar plates.
Quantitative real-time polymerase chain reaction (q-RT-PCR)
Total RNA was isolated from lung cancer tissues and adjacent nonmalignant tissues using TRIzol (Invitrogen Corp.). RNA was used as a template to synthesize the first strand cDNA with M-MLV reverse transcriptase (Promega Corp.). The q-RT-PCR reactions were performed on an ABI stepone™ real-time RT-PCR system using the SYBR Green dye method. The q-RT-PCR reactions were performed in 25 μl volumes that included 0.5 μl of SYBR green, 8 ng of cDNA template and 1.0 μl each of the forward and reverse primers (10 μM) (see Additional file
1). The PCR was done under the conditions at 94°C for 30 seconds, then 40 cycles of amplification at 94°C for 30 seconds, 60°C for 30 seconds. Each gene was normalized to the internal β-actin levels. Each sample was run in triplicate to ensure quantitative accuracy, and the threshold cycle numbers (Ct) were averaged. The results were reported as tumor tissues: normal tissues (T:N) ratios and calculated using the 2
−ΔΔCt method [
14].
Western blot
Total protein was extracted from cultured cells and tissues, separated by electrophoresis, and then transferred into PVDF membrane according to the routine protocol. The blotted membrane was incubated with the rabbit anti-ERGIC3 polyclonal antibody (Sigma-Aldrich, St. Louis, MO, USA) followed by horseradish peroxidase-conjugated anti-rabbit secondary antibody (AbMART, Shanghai, China), and then proteins were detected with chemiluminescence reagents (Thermo Fisher Scientific Inc., Waltham, MA, USA). The membrane was reprobed with an antibody against β-actin (Abcam, Cambridge, UK) as a control for equivalent protein loading.
Immunohistochemical staining
Immunohistochemical staining was done on 4-μm-thick paraffin sections cut from the formalin fixed tissues. Heat-induced epitope retrieval was performed in Tris-EDTA Buffer (10 mM Tris Base, 1 mM EDTA Solution, 0.05% Tween 20, pH 9.0). The sections were then incubated in 3% H2O2 for 10 minutes to eliminate endogenous peroxidase activity. The primary rabbit anti-ERGIC3 polyclonal antibody (Abcam) and horseradish peroxidase-conjugated anti-rabbit secondary antibody were used. Color development was accomplished with 3,3'-Diaminobenzidin. The nuclei were then counterstained with hematoxylin. Only manifest cytoplasmic staining was defined as a positive reaction. Negative controls were incubated with normal rabbit serum instead of the polyclonal antibody.
Immunofluorescence staining and confocal microscopy
Immunofluorescence analysis was done on the cultured cells. Briefly, the cells grown on coverslips were fixed in −20°C acetone for 10 minutes and then incubated with the rabbit anti-ERGIC3 polyclonal antibody (Abcam), and anti-MUC1 mouse monoclonal antibody (mAb) A76-A/C7 (Glycotope, Berlin, Germany), anti-ST mAb [
15], anti-calreticulin mAb (Abcam), and anti-58K Golgi protein mAb (Abcam) at 4°C overnight. Following the washes, the cells were incubated with FITC-coupled anti-rabbit antibody (BD sciences, Franklin Lakes, NJ, USA) or with Cy3-coupled anti-mouse antibody (Millipore, Billerica, MA, USA) for 1 hour. Subsequently, the nuclei were finally counterstained with diamidinophenylindole. The slides were analyzed under confocal laser scanning microscopy.
Construction of the expression vector and the cell transfection
To generate the plasmid expressing ERGIC3, we cloned the open reading frame of ERGIC3 and used the following specific primers (upper case, restriction enzyme sequences): forward, 5'-GGTGGTGAATTCatgaggcgctggggaagct-3'; reverse,5'-GGTGGTGGATCCaccgaggagggtgactacgttgtctt-3'. After running PCR, the product was cut by EcoR I and BamH I. The purified DNA was ligated into the pLXSN expression vector (Clontech Corp.). The empty vector served as a control. The vectors were transfected into BEAS-2B cell by lipofectamine LTX and PLUS (Invitrogen Corp.) according to the instructions. The over-expression of ERGIC3 was validated by q-RT-PCR and western blot.
Gene silencing by RNA interference
For gene silencing of ERGIC3, pGPU6/GFP/Neo-ERGIC3-shRNA and its control vector, pGPU6/GFP/Neo-NC-shRNA, were used (Jima Corp., Shanghai, China). Briefly, the oligonucleotide GGAGGACTATCCAGGCATTGT was designed to interfere specifically with ERGIC3 gene expression. As a negative control, we used the oligonucleotide GTTCTCCGAACGTGTCACGT, which has no significant match in a BLASTn search (human NCBI nr database). The vectors were transfected into GLC-82 cells by lipofectamine LTX and PLUS. The gene silencing of ERGIC3 was validated by q-RT-PCR and western blot.
Cell proliferation assay
Cell proliferation analysis was based on the capacity of mitochondrial enzymes to transform MTT to MTT formazan. More succinctly, after being transfected for 48 hours, GLC-82 and BEAS-2B were collected and transferred into 96-well plates (4*103 cells/well), then all cells were treated with MTT (5 mg/ml) every 24 hours. 10 μl of MTT was added to each well and cells were incubated at 37°C for 2 hours. Then, the culture medium with dye was removed and dimethylsulfoxide was added at 100 μl per well for formazan solubilization. The absorbance of converted dye was measured at a wavelength of 490 nm using a 96-well microplate reader (model 680, Bio-Rad Laboratories).
Cell migration assay
The transfected GLC-82 and BEAS-2B cells were used for the cell migration assay. The transwell migration assay was conducted in 24 well plates with membrane inserts (8 mm pore size; Millipore). The cells were then seeded in the upper chamber of transwells (105 cells/well) without FBS. The lower chambers were loaded with 500 μl of medium with 10% FBS. After incubation for 24 hours, filters were washed and the cells on the upper surface were gently removed with a cotton swab. The cells were fixed with 95% ethanol, and were stained by Giemsa I (Jiancheng Corp. Nanjing, China) and Giemsa II. The cells were then counted under microscope. The experiments were repeated three times.
Statistical analysis
Measurement data (levels of mRNA and protein, the data of cellular migration and proliferation) and enumeration data (the data of immunohistochemical staining) were respectively analyzed using the paired t-test and the Fisher’s exact test (two-tailed). All of the values were evaluated using IBM SPSS 19 (SPSS inc., Chicago, Illinois). Differences were considered significant if P < 0.05.
Discussion
Differentially expressed genes are the fundamental driver of both the diversity and complexity of tumor phenotypes. Generating subtracted cDNA libraries is the first step in identifying genes differentially expressed in cancer cells. This construction is made easier by the simple and highly effective suppression subtractive hybridization (SSH) technique. SSH has been successfully applied to a wide variety of malignant diseases for the generation of cDNA libraries [
6‐
10].
In this study, we constructed two libraries of differentially expressed genes using the lung AC tissue and its adjacent nonmalignant lung tissue from a single patient. We compared and analyzed the results from all the lung cancer SSH libraries, including our own. Surprisingly, most of the differentially expressed genes were unequal in these libraries. This could be explained that diverse cancer tissues and cell lines lead to different gene expression profiles. The genetic abnormality of cancer appears a great diversity. Accordingly, using various samples to generate new cDNA libraries in order to find novel differentially expressed genes in lung cancer can play a significant role in advancing research into lung cancer. In this regard, our SSH libraries add useful data that can further be used in the discovery of lung cancer-associated genes. Indeed, the vast majority of the differentially expressed genes of our libraries were not known to be abnormally expressed in lung cancer.
Bioinformatics analysis was subsequently performed to interpret the differentially expressed genes of our FSL and RSL libraries. The functions of these genes were involved in several aspects: molecular transport; cellular signaling and interaction; cellular polarization; cell cycle; DNA replication, recombination, and repair; cell apoptosis; cellular growth and proliferation; cellular movement. Here we would like to emphasize several interesting facts observed during this study. 1) 12.4% (22/177) of our FSL was ribosomal protein genes but these proteins only made up 1.7% (1/59) of the genes found in our RSL. The large amount of ribosomal proteins up-regulated in the lung AC tissues from this data seems to indicate that thriving cancer cells need plenty of ribosomal proteins. 2) Cancer is a multi-genetic disease. Individual tumors accumulate an average of 90 mutant genes [
16]. In our study, the results based on the two libraries of differentially expressed genes demonstrated that in a given tumor more than 200 genes simultaneously exhibited abnormal expression and the number of the up-regulated genes was more than that of the down-regulated genes, indicating that even in a tumor the genetic abnormality is greatly extensive and complex. Although the case which was used to construct the libraries is very limited, the study would also have provided clues to the trend of gene expression alterations.
A large fraction of the differentially expressed genes found in cancer were likely to be “passenger” genes that are not to be integral to neoplasia. However, screening of “non-passenger” genes (functionally important alterations) among the differentially expressed genes and studying their functions are helpful for the elucidation of tumorigenesis mechanisms and the discovery of new diagnostic and therapeutic targets. As a first stage analysis, 16 genes were selected among the differentially expressed genes from our FSL and RSL libraries for further study. Through q-RT-PCR analysis, we found six genes (
DDR1,
HSP90B1,
SDC1,
RPSA,
ERGIC3, and
LPCAT1) significantly up-regulated and two genes (
GPX3 and
TIMP3) significantly down-regulated in the lung cancer tissues. Among the eight genes significantly up- or down-regulated in the lung cancers in this study, the six genes have been noted in previous studies for the connection between their expression and lung cancer: up-regulated
DDR1[
17],
HSP90B1[
18],
SDC1[
19], and
RPSA[
20], as well as down-regulated
GPX3[
21] and
TIMP3[
22]. Importantly, this means that for the first time, over-expression of
ERGIC3 and
LPCAT1 have been linked to lung cancer.
LPCAT1 may contribute to total choline metabolite accumulation via phosphatidylcholine remodeling, thereby altering the colorectal cancer lipid profile, a characteristic of malignancy [
23], but this was investigated in a separate work as the current study focuses on ERGIC3.
ERGIC3 (previously labeled Erv46, ERp43) is an integral membrane protein that cycles between the ER and Golgi [
24,
25]. ERGIC3 has large domains in the ER lumen, and its short N- and C-terminal tail sequences expose to the cytosol and the two transmembrane segments each, which would be available for protein-protein interactions in the ER lumen or membrane [
26]. Through the systematic and serial screening, we found that the ERGIC3 mRNA and protein were highly over-expressed in lung cancer cells. In the cultured cells, ERGIC3 was mainly located at the Golgi apparatus and ER, in agreement with previous descriptions [
11]. We observed that the distribution of ERGIC3 was associated with the cellular shape. In the round cells, ERGIC3 was located around the nucleus, but it was at the side of the nucleus in the fusiform cells. This phenomenon indicates that the localization of ERGIC3 is identical to the distribution of the Golgi apparatus and ER. ERGIC3 may be a well marker of the Golgi apparatus and ER in cancerous cells.
We also found that ERGIC3 was closely co-localized with MUC1 and ST. MUC1 is a high molecular weight transmembrane glycoprotein. The protein backbone of MUC1 is synthesized at the ER and glycosylated at the Golgi apparatus. ST is a key enzyme that sialylates lactosaminyl termini of complex type oligosaccharides in an α2,6 linkage. ST is located predominantly in the trans-Golgi network, and could be secreted by some cells [
27]. ERGIC3 is involved in the protein transport between the ER and the Golgi apparatus. We guess that ERGIC3 could participate in the transport of MUC1 and ST from the ER to the Golgi apparatus. Further studies will hopefully address this issue.
In our study, ERGIC3 was found positive in 89% specimens of lung cancer by immunohistochemical staining. In contrast, ERGIC3 was not expressed in normal bronchial epithelial cells and alveolar cells. These results from our pilot study suggested that ERGIC3 may be a potential biomarker for lung cancer. However, more studies must be performed to permit final conclusions. Our laboratory has already begun the relevant research.
We further investigated pathophysiological functions of the altered expression of ERGIC3 in lung cancer cells. In our study, over-expressed ERGIC3 promoted the cellular proliferation. A recent study demonstrated that ERGIC3 played important roles in cell growth and ER stress-induced apoptosis [
28]. Additionally, we also found that up-regulation of ERGIC3 facilitated cellular migration. The cellular proliferation and migration are essential events during carcinogenesis and cancerous invasion. ERGIC3 may play an active role in the development and progression of lung cancer. At present, the full mechanisms by which ERGIC3 promotes cellular proliferation and migration are not understood. Previous studies demonstrated that Erv41p-Erv46p complex interacts with glucosidase II and modulates glucosidase I activity, as well as cells lacking a cycling Erv41p-Erv46p complex display a mild glycoprotein processing defect [
26], and the mutation of ERGIC3 could reduce the transport between the ER and the Golgi apparatus [
29]. It is then tempting to speculate that abnormally expressed ERGIC3 could affect the cellular proliferation and migration through the disruption of glucosidase activity and protein intracellular transport.
Competing interests
The authors declare no conflict of interest.
Authors’ contributions
MW carried out the molecular biology studies, participated in the design of study and drafted the manuscript. TT carried out the immunoassays, participated in the molecular biology studies. YH provided the clinical materials, participated in the analysis of the data. YC conceived of the study, participated in its design and coordination, and helped to draft the manuscript. All authors read and approved the final manuscript.