Background
There is a significant amount of ongoing work aimed at defining the role of circulating tumor cells (CTC) in peripheral blood (PBL) and disseminated tumor cells (DTC) in bone marrow (BM) of breast cancer patients. However, due to a variety of available tumor cell detection methods and use of different gene-markers, recently published studies show a wide range of results that are often contradictory and difficult to compare to one another. The main tumor cell detection methods have been immunocytochemistry (ICC) with cytokeratin-specific antibodies [
1‐
11] and RT-PCR analysis based on overexpression of cancer-associated gene-markers [
4,
6,
12‐
29]. PCR methodology for detection of breast cancer has most frequently employed mammaglobin (
mam) and cytokeratin 19 (
CK19) genes. Some studies have also used a new CellSearch System technology that employs immunomagnetic separation of epithelial cells based upon expression of cytokeratins or
EpCAM and visualization of the tumor cells by immunoflorescent microscopy [
30].
Our laboratory has extensive experience in detection of cancer cells using multi-marker real-time RT-PCR methodology [
31‐
35]. To address the clinical relevance of molecular detection of occult breast cancer, we initiated a multi-institutional prospective cohort study. The primary objective of the study was to determine whether the molecular detection of occult breast cancer by multi-marker real-time RT-PCR in patients with pathology-negative axillary lymph nodes (ALN) is a clinically relevant predictor of disease recurrence. An interim analysis of 489 patients enrolled in the study showed a statistically significant association between molecular detection of occult breast cancer in the ALN and traditional predictors of poor prognosis in subjects with pathology-negative ALN [
33]. In addition, in a separate publication we show that the sensitivity of sentinel lymph node (SLN) analysis to predict pathologic status of ALN was significantly increased by the addition of molecular analysis [
34].
There are several cancer-associated gene markers used in the detection of breast cancer cells. Based on the heterogenous nature of the breast cancer, the multi-marker panel approach has shown to increase the sensitivity of molecular assay to detect the presence of disseminated cancer cells. However, the prognostic value of each individual marker is not known and therefore the ultimate goal would be to identify genes that are capable of differentiating patients with poor prognosis from the patients with a more favorable prognosis. Having a tool to recognize the subset of patients with unfavorable molecular characteristics could potentially translate into a better clinical outcome. In this interim analysis we examine the detection rate of cancer cells in PBL and in BM using an established 7-gene marker panel and evaluated whether there were any definable associations of any individual gene with the traditional predictors of prognosis.
Methods
MIMS Trial Study Design
A prospective cohort study design was adopted where, upon recruitment, eligible participants with Stage I, IIa, or IIb breast cancer were requested to consent to tissue sampling from axillary lymph nodes (ALN), sentinel nodes (SLN), bone marrow (BM), and peripheral blood (PBL). Tissue sampling was accomplished at the time of surgical intervention. The study was carried out in compliance with the Helsinki Declaration ethical principles in medical research involving human subjects. All specimens were collected through the Medical University of South Carolina Institutional Review Board for Human Research approved protocols (HR 9551, HR 8374, HR 8903, HR 8432). Informed consent was obtained in accordance with each participating center's Institutional Review Board guidelines. The design, enrollment criteria, tissue acquisition protocols, and determination of gene expression values for patients enrolled in the MIMS trial are described in more detail in a separate publication [
33]. The current study focuses on the subset of 215 patients with PBL samples and the subset of 177 patients with BM samples. Real-time RT-PCR analyses for cancer-associated genes were performed on all specimens at the Central Molecular Diagnostics Laboratory at the Medical University of South Carolina (MUSC). The Clinical Innovation Group (TCIG, Charleston, SC) (later known as the Data Coordination Unit (DCU) in the Department of Biostatistics, Bioinformatics and Epidemiology at MUSC) served as the coordinating center, and all study data were collected, processed and analyzed at this central facility.
Blood and bone marrow samples from breast cancer subjects
Bone marrow aspirates were obtained from patient's left and or right anterior or posterior iliac crests under anesthesia at the time of operation. A 10 or 20 cc syringe with a 16–18 gauge bone marrow aspirate needle was used to aspirate 3–6 ml of bone marrow into a syringe and then immediately transferred to a sterile EDTA vacutainer. Peripheral blood samples were obtained before surgery or following the induction of anesthesia. A total of 5–10 ml of blood was drawn from a peripheral vein into a sterile EDTA vacutainer. Blood and bone marrow samples were then shipped at room temperature to the Central Molecular Diagnostics Laboratory at the MUSC for immediate processing by Ficoll density gradient centrifugation (Ficoll-Paque Plus; Amersham Biosciences). All the specimens inside US arrived in 24 hours and international shipments arrived in 48 hours. One mL of bone marrow was used for Cytospin preparation and stained for ICC analysis. These bone marrow samples were evaluated by a cytopathologist for the presence of micrometastases using cytokeratin AE1/AE3. Please note that the specimen acquisition protocol was amended after the initiation of the MIMS trial and for that reason only a subset of patients was included in this analysis.
Blood and bone marrow samples from control subjects without evidence of malignancy
In order to define baseline expression levels for the molecular markers used in this study, PBL and BM samples from control subjects were procured. Informed consent was obtained for BM aspiration from 49 patients undergoing orthopedic surgery at MUSC and for PBL drawn from 49 healthy volunteers. None of the control subjects had any history or clinical evidence of malignancy. Four to six ml of BM aspirate or 5–10 ml of PBL was transferred to an EDTA vacutainer and sent to the Central Molecular Diagnostics Laboratory to be processed by Ficoll density gradient centrifugation and analyzed by real-time RT-PCR.
RNA isolation and cDNA synthesis
Buffy coats were obtained by Ficoll density gradient centrifugation, and total cellular RNA was isolated using a guanidinium thiocyanate-phenol-chloroform solution (RNA STAT-60™; TEL-TEST, Friendswood, TX). Briefly, cells were re-suspended in 1 ml of RNA STAT-60™. Total RNA was isolated as per the manufacturer's instructions with the exception that 1 μL of a 50 mg/mL solution of glycogen (Sigma, St. Louis, MO) was added to the aqueous phase prior to addition of isopropanol. Glycogen was used as a nucleic acid carrier to enhance RNA precipitation. The RNA pellet was dissolved in 50 μl of 1x RNA secure buffer (Ambion, Austin, TX). RNA was quantified by spectrophotometry at 260 nm. cDNA was made from 5 μg of total RNA using 200 U of M-MLV reverse transcriptase (Promega, Madison, WI) and 0.5 μg Oligo (dT)12–16 in a reaction volume of 20 μl (10 min at 70°C, 50 min at 42°C, 15 min at 70°C).
Real-time RT-PCR
The real-time RT-PCR primers have been previously reported [
31,
36,
37]:
mglo: F 5'-GCCGTGTGAACCATGTGACTTT, R 5'-CCAAATGCGGCATCTTCAAA;
PDEF: F 5'-AGTGCTCAAGGACATCGAGACG, R 5'-AGCCACTTCTGCACATTGCTG;
mam: F 5'-CGGATGAAACTCTGAGCAATGT, R 5'-CTGCAGTTCTGTGAGCCAAAG;
CK19: F 5'-CATGAAAGCTGCCTTGGAAGA, R 5'-TGATTCTGCCGCTCACTATCAG;
muc1: F 5'-ACCATCCTATGAGCGAGTACC, R 5'-ACCATCCTATGAGCGAGTACC;
PIP: F 5'-GCCAACAAAGCTCAGGACAAC, R 5'-GCAGTGACTTCGTCATTTGGAC;
EpCAM: F 5'-CGCAGCTCAGGAAGAATGTG, R 5'-TGAAGTACACTGGCATTGACGA;
ErbB2: F 5'-CTGGTGACACAGCTTATGCCCT, R 5'-ATCCCCTTGGCAATCTGCA. Analyses were performed on a PE Biosystems Gene Amp
® 5700 Sequence Detection System (Foster City, CA). All reaction components were purchased from PE Biosystems. The standard reaction volume was 10 μl and contained 1X SYBR Green PCR Buffer; 3.5 mM MgCl
2; 0.2 mM each of dATP, dCTP, and dGTP; 0.4 mM of dUTP; 0.25 U AmpliTaq Gold
®; 0.1 U AmpErase
® UNG enzyme; 0.7 μl cDNA template; and 0.25 mM of both forward and reverse primer. The initial step of PCR was 2 min at 50°C for AmpErase
® UNG activation, followed by a 10-min hold at 95°C. Cycles (n = 40) consisted of a 15 sec denaturation step at 95°C, followed by a 1 min annealing/extension step at 60°C. The final step was a 60°C incubation for 1 min. All reactions were performed in triplicate. The cycle of threshold (C
t) analysis was set at 0.5 relative fluorescence units.
Primary data analysis
Real-time RT-PCR data were quantified as Ct values that are inversely related to the amount of starting template: high Ct values correlate with low levels of gene expression, whereas low Ct values correlate with high levels of gene expression. Each gene was analyzed in triplicate. Results were normalized to an internal control reference gene, β2-microglobin, by subtracting the mean Ct value of β2-microglobin from the mean Ct value of each respective gene (ΔCt value). Samples for which Ct values for β
2
-microglobin were equal or higher than 22 were considered to contain inadequate RNA and were excluded from the analysis. Approximately 10% of samples we rejected from the analysis based on this criterion. If the mean Ct value for a gene of interest was higher or equal to 38, the gene expression was considered to be undetectable. In order to define baseline levels of gene expression and to define thresholds for marker positivity, 49 specimens of PBL and 49 specimens of BM obtained from patients with no evidence of malignancy were analyzed. To be consistent with the previous molecular analyses of lymph nodes, threshold values for each individual marker were set at three standard deviations from the mean ΔCt value in the control group. A subject was considered to be positive for the molecular analysis if at least one marker in the panel was above the defined threshold. Data from real-time RT-PCR analyses were compiled in a Microsoft Access database and submitted to the DCU at MUSC for statistical analyses. The molecular analysis was generated blinded to clinical outcome and patients' clinicopathologic data.
Bone marrow cytopathology and cytokeratin ICC staining
Specimens were collected, washed in CytoLyt
® (Cytyc, Boston, MA) and then resuspended in PreservCyt
® (Cytyc). Two ThinPrep (TP) slides were prepared and stained with Papanicolaou stain, and one slide was used for immunocytochemistry (ICC). A monoclonal antibody for cytokeratin (AE1/AE3) was used in conjunction with an automated immunostaining system (DAKO Autostainer, DAKO Cytomation, Carpeteria, CA) and a Nexus immunohistochemistry slide staining apparatus (Ventana Medical Systems Inc, Tuscon, AZ). Immunostaining was performed with the avidin-biotin immunoperoxidase (ABC-peroxidase) method of Hsu et al [
38]. Briefly, the slides were incubated with primary antibody for 30 minutes and then incubated with secondary biotinylated antibody for 4 minutes. To visualize the antibody, the TP was treated with diaminobenzidine (0.05%) in 0.05 M Tris-HCL buffer (pH 7.8) with 0.03% H
2O
2 for 6 minutes and then washed in H
2O. TP was counterstained with hematoxylin, dehydrated, cleared in xylene, and mounted in Permount. The specimens were analyzed by a skilled cytopathologist.
Statistical analysis
SAS Version 9.1 Software (SAS Institute Inc., SAS Campus Drive, Cary, North Carolina) was used for the analysis of pathological and molecular outcome. Chi-square analyses were conducted to explore the association between pre-defined baseline covariates that have been associated with pathological outcome in prior studies and PBL and BM RT-PCR positivity/negativity status. Pre-defined baseline covariates were tumor size, histological grade, estrogen receptor status, progesterone receptor status, her2neu status, and St. Gallen risk category (
minimal/low risk: tumor size ≤ 1 cm, positive ER and/or PR status, grade I and age ≥ 35;
intermediate risk: tumor size >1 or 2 cm, positive ER and/or PR status, and grade I; and
high risk: lymph node positive, tumor size > 2 cm, negative ER and/or PR status, grade II or III, or age <35) [
39]. Statistical significance was defined as p-values < 0.05.
Discussion
This paper describes molecular analyses of PBL and BM samples from a subgroup of breast cancer patients who were enrolled into a prospective multi-institutional study with the primary goal to establish the clinical relevance of micrometastatic disease detected by RT-PCR in pathology negative axillary lymph nodes. Our previous reports from this study strongly suggest that over-expression of cancer-associated gene-marker is a valid surrogate for occult micrometastatic breast cancer [
33,
34]. Using these gene markers [
mam, CEA, CK19, PIP, muc1, PSE, Erb (BM only) and
EpCAM (PBL only)] we analyzed 215 PBL samples and 177 BM samples from patients with T1-T3 primary breast cancer without clinical evidence of metastatic disease.
Using a predetermined rigorous threshold level (three standard deviations from the mean expression in normal PBL), 136 patients out of 215 (63.3%) had a positive signal in at least one cancer-associated marker in their PBL sample. According to the other studies, the incidence of CTC in PBL detected by RT-PCR ranged from 5% to 62% for one-marker analyses [
13,
15,
16,
19‐
24,
26‐
29] and from 31% to 83% for analyses by multi-marker gene-panels [
25‐
29]. The most frequently used markers were
CK19 and
mam. Our study, in contradiction, suggested that
CK19 has high expression level in normal control samples and is therefore not reliable detector of CTC. Although the
CK19 primers were designed to avoid the amplification of
CK19 pseudogenes [
40], we recognize that we cannot entirely exclude this possibility. In addition, we are aware of the limitations of using Ficoll density gradient cell separation methodology. Because of the low tumor cell burden in PBL and BM, the accuracy of tumor cell detection is greatly affected by the gene background expression levels. The genes like
CK19,
muc1,
PSE and
EpCAM that show significant background expression in normal samples, loose its accuracy in tumor cell detection when Ficoll density gradient cell separation methodology is used. In fact, in a separate publication we have demonstrated that using OncoQuick tumor cell enrichment method significantly reduces the background gene expression and therefore increases the sensitivity of tumor cell detection compared to the methodology employing Ficoll density gradient[
27].
The
mam gene on the other hand, because of its exquisite tissue specificity, did not show any expression in normal PBL. We observed a positive
mam signal in 29 (13.5%) patients, which is comparable to studies by Roncella et al [
20] and Benoy et al [
13] who reported mam positivity in 12% (16/137) and 14% (16/116; M0) of patients, respectively. Other studies have showed
mam-based CTC detection ranging from 41% to 62% [
24,
26,
28,
29].
Positivity thresholds for cancer-associated gene-expression in BM were also set at three standard deviations from the mean in normal BM. Based on this cut-off, 19 patients out of 177 (10.7%) were positive by RT-PCR. All 19 samples were positive for one cancer-associated marker. Additionally, in a subgroup of 83 BM samples analyzed by ICC, five (6%) resulted in a positive staining for cytokeratins. Two out of these five samples were also positive by RT-PCR (one for
mam and another for
PIP). Reports from other investigators on the incidence of DTC in BM detected by RT-PCR ranged from 12% to 53% [
4,
12‐
18] and as high as 80% [
6] in metastatic disease. DTC detection by ICC for cytokeratins ranged from 13.2% to 62% (review by Braun et al [
1]; [
6]). In comparison to these reports the detection of DTC in our study appears to be relatively low. Although our study population contained mainly early stage breast cancer patients (55% in Stage I, 27% in Stage IIA,14% in Stage IIB and 5% in Stage IIIA), we also suspect that the limited volume of bone marrow (average of 3–4 ml) in combination of Ficoll density gradient methodology may not have been sufficient to achieve optimal sensitivity.
One of our goals in this study was to evaluate whether the expression of any individual gene was associated with poor prognostic indicators. Although the follow-up data for the breast cancer patients in this study is not yet available, we looked at the possible association of the detection of CTC and DTC with traditional clinicopathologic prognostic indicators employing Chi-Square and/or Fisher's exact tests. Among tumor size, histologic grade, ER-, PR-, Her2neu-status, lymph node status and high risk category, we observed a statistically significant association between marker positivity in PBL and histologic grade (grade II-III vs. grade I; p = 0.0083). There were no associations between marker positivity in PBL and pathologic (H&E) and/or molecular (multi-marker RT-PCR) status of axillary lymph nodes. Interestingly, overexpression of the mammaglobin gene alone had also statistically significant association with high grade (p = 0.0315) and showed a trend towards ER-negative tumors (22% vs. 11%) and a high risk category (15% vs. 6%), suggesting that
mam gene may be a poor prognostic indicator (Table
3). Although we are not aware of other studies showing similar results on
mam, there are reports of statistically significant association between
mam-based CTC detection and tumor size [
28], clinical stage [
24,
41], nodal status [
42] and distant metastases [
42‐
44] supporting the concept of
mam gene being a poor prognostic indicator.
In our study, marker positivity in BM had no statistically significant association with any of the traditional prognostic indicators, however, the results of
mam and
PIP gene expression in PBL had statistically significant association with the
mam and
PIP gene expression in BM (p = 2.5E-04 and p = 0.0188, respectively; Table
4). We suggest that this result shows the close connection of PBL and BM compartments and that
mam and
PIP overexpression is not random but truly indicate the presence of tumor cells. Overall concordance between PBL and BM results were 90.6% for
mam and 87.0% for
PIP, which is mainly due to the concordance of double negative findings. Concordance for gene-panel was 46.4%. In comparison, Benoy et al demonstrated 68% of concordance between PBL and BM samples using
CK19 and 75% concordance between PBL and BM samples using
mam gene [
13].
Clinical relevance of CTC in PBL and DTC in BM can only be studied with sufficient follow-up data. The most comprehensive study has been reported on detection of bone marrow micometastases published by Braun et al in
the New England Journal of Medicine [
1]. They performed a pooled analysis of a total of nine separate studies involving more than 4,500 breast cancer patients. Braun et al concluded that patients with BM micrometastases had poor overall survival (OS), breast-cancer-specific survival and poor disease-free survival (DFS) and distant-disease-free survival. A prospective, multi-center study by Cristofanilli et al used a new CellSearch System (Veridex) to determine if circulating tumor cells can predict survival in metastatic breast cancer. They tested 177 patients and found that patients with 5 or more tumor cells per 7.5 ml before the therapy and at the first follow-up visit had shorter median progression-free survival and OS compared to the patients with fewer than 5 circulating cells [
30]. Benoy et al (CK19
PCR, mam
PCR) showed worse OS in patients with
CK19 and
mam expression in BM but not in PBL [
13]. Median OS was reported to be shorter in patients with
CK
ICC positive cells in PBL according to Bauernhofer et al [
9]. Detection of
CK19 positive cells by RT-PCR in PBL in stage I and II was associated with reduced disease-free interval and OS [
45].
Acknowledgements
Source of Support: Department of Defense N00014-99-1-0784
MIMS Study Group: Marshall M. Urist MD, University of Alabama, Birmingham, AL; Bruce G. Mann MD, Royal Melbourne Hospital, Victoria, Australia; Gerard Doherty MD, Virginia M. Hermann MD, Washington University, St. Louis, MO; Arnold D. Hill MD, St Vincent's University Hospital, Dublin, Ireland; Oleg Eremin MD, Mohamed El-Sheemy MD, Ph.D, Lincoln & Louth County Hospital, Lincoln, England; Richard K. Orr MD, Spartanburg Regional Medical Center, Spartanburg, SC; Alvaro A. Valle MD, University of Tennessee, Chattanooga, TN; Michael A. Henderson MD, St Vincent's Hospital, Victoria, Australia; Lucile Adams-Campell MD, Howard University Cancer Center, Washington, D.C.; Sonia L. Sugg MD, Medical College of Wisconsin, Milwaukee, WI; Eric Frykberg MD, University of Florida Hospital, Jacksonville, FL; Karen Yeh MD, Medical College of Georgia, Augusta, GA; Richard M. Bell MD, Palmetto Health Alliance, Columbia, SC
We acknowledge the HCC tissue procurement bank and specifically thank Dr. Debra Hazen-Martin and Margaret Romano.
Competing interests
The author(s) declare that they have no competing interests.
Authors' contributions
KM contributed to the study design and data interpretation, performed and supervised RNA extraction and real-time RT-PCR, drafted the manuscript. RHM contributed to the study design, performed the statistical analysis, contributed to data interpretation, drafted the manuscript. MBR contributed to the study design and data interpretation, performed RNA extraction and real-time RT-PCR. WEG contributed to the study design and data interpretation, drafted the manuscript. RH interpreted the bone marrow cytology and immunocytochemistry results. DHS provided normal bone marrow specimens. KC coordinated and carried out the protocols for obtaining normal peripheral blood and normal bone marrow specimens. MM contributed to the study design and data interpretation, supervised RNA extraction and real-time RT-PCR, drafted the manuscript. DJC designed the study, contributed to data interpretation, drafted the manuscript and served as a mentor for the entire project. All authors read and approved the final manuscript.