Introduction
Metastasis of breast cancer to the brain represents an area of high un-met medical need; 15–30% of patients with metastatic breast cancer will develop brain metastases, and the brain is the first site of metastasis in 7–16% of metastatic patients [
1]. Survival upon a brain metastasis diagnosis is 4 months to 2 years, with worse prognosis for patients with triple negative (TN) breast cancer or multiple brain lesions, and better prognosis for those with oestrogen receptor-positive (ER
+) or human epidermal growth factor receptor 2-positive (HER2
+) disease or a single brain metastasis [
2,
3]. The incidence of brain metastasis for breast and other cancers has been increasing, likely due to improved imaging and other diagnostic technologies and more effective systemic and targeted therapies which prolong patient survival by controlling extra-cranial disease. However, agents that control systemic disease often have poor brain penetrance resulting in failure to control disease progression [
4,
5].
Over the past decade there has been significant progress in understanding the mechanisms underpinning brain metastasis, including the identification of key molecular events required for successful tumour cell transmigration across the blood-brain barrier and for avoiding apoptosis once the cells have reached the brain parenchyma [
1,
5‐
8]. However, these studies have relied heavily on directly inoculating tumour cells into the left ventricle of the heart and on the use of human cell lines in immunocompromised mice. Such a scenario fails to recapitulate the clinical setting where low numbers of circulating tumour cells will encounter different organs and have to avoid immune attack [
7]. To better mimic human disease, we used a model of spontaneous metastasis of 4T1 mouse mammary carcinoma cells in syngeneic, immunocompetent BALB/c mice. We have profiled 4T1 sublines derived from primary tumours and from tumour cells that had disseminated to the lungs and brain and performed in-vivo validation experiments and assessment of clinical datasets. Using this approach, we identified a role for
ID2 and
ALDH3A1 in promoting metastatic colonisation and for
ID2 in promoting brain-specific metastasis.
Methods
Cells and reagents
4T1 cells were obtained from the American Type Culture Collection (ATCC), tagged with luciferase using lentiviral particles expressing Firefly luciferase (Amsbio), and grown in Dulbecco’s modified Eagle’s medium (DMEM) supplemented with 10% fetal bovine serum (FBS). MDA-MB-231-Luc cells were obtained from Sibtech and grown in DMEM supplemented with 10% FBS. Where indicated, 4T1-Luc cells were transduced with lentiviral particles expressing H2B-mRFP as previously described [
9] and RFP
+ cells enriched by fluorescence-activated cell sorting (FACS). Cells were short tandem repeats (STR) tested regularly using the StemElite ID system (Promega). Both cell types were routinely tested for mycoplasma and used within 10 passages after resuscitation. Mouse astrocytes were purchased from ScienCell and maintained in astrocyte basal medium supplemented with FBS and astrocyte growth supplement. Recombinant human transforming growth factor (TGF)β-1 and bone morphogenetic protein (BMP)7 were purchased from R&D systems. Details of short hairpin RNA (shRNA) lentiviruses, full length open reading frame (ORF) clone expression systems, quantitative reverse-transcription polymerase chain reaction (RT-qPCR) reagents, and antibodies used in this study are provided in Additional file
1 (Tables S1–S4).
For shRNA knockdown of Id2 or Aldh3a1, 5 × 104 4T1-Luc cells were transduced with lentiviral particles (Sigma; Mission transduction particles) at a multiplicity of infection (MOI) of 2. At 24 h post-transduction, the medium was replaced with fresh medium containing 10% FBS. Stably transduced cells were selected in 2.5 μg/mL puromycin for two passages.
For ectopic expression of Id2 or Aldh3a1, 8 μg of bicistronic mammalian expression vector pReceiver-Lv166 mCherry vector with or without full length ORF for mouse Id2 (EX-Mm03201-Lv166) or Aldh3a1 (EX-Mm28326-Lv166-GS) purified plasmid, 4 μg of packaging plasmid psPAX2, and 1.5 μg envelope plasmid pMD2.G were co-transfected into the HEK293T cells using OptiMEM and Lipofectamine 2000. At 48 h post-transfections, virus-containing medium was collected and used to directly infect 4T1-Luc or MDA-MB-231-Luc cells. At 72 h post-infection, cells were FACS sorted to enrich for mCherry-positive cells.
In-vivo experiments
All animals were monitored on a daily basis by staff from the ICR Biological Service Unit for signs of ill health.
To isolate tumour cells disseminated to metastatic sites for gene expression profiling, 1 × 104 4T1-Luc cells in 50 μL phosphate-buffered saline (PBS) were inoculated subcutaneously into 6- to 8-week-old female BALB/c mice. Once primary tumours reached the maximum (mean diameter ≥ 15 mm) allowable size, the mice were sacrificed. Primary tumours, lungs, and brains were harvested at necropsy. Primary tumours were individually cut into small pieces, homogenized using a McIlwain Tissue Chopper (Campden Instruments), and digested in L-15 medium containing 3 mg/mL collagenase type I at 37 °C for 1 h, followed by digestion with 0.025 mg/mL DNase1 at 37 °C for 5 min. After erythrocyte lysis using Red Blood Cell Lysing Buffer (Sigma), the cell suspension was plated into a 10-cm dish in 10 mL DMEM plus 10% FBS. Individual lungs and brains were placed in 1 mL PBS on a 40-μm sieve in a 6-cm plate, mechanically dissociated by pushing through the sieve, and cultured in 2 mL DMEM plus 10% FBS in 6-cm dishes. When primary tumour-, brain- and lung-derived 4T1 colonies were visible, cells were passaged 3–4 times before RNA was extracted from individual sublines for gene expression profiling.
For experimental metastasis assays, 6- to 8-week-old female BALB/c or NOD SCID gamma (NSG) mice were inoculated with 4T1-Luc or MDA-MB-231 cells. For intracranial inoculations, mice were anaesthetised with isoflurane and injected with 1 × 104 4T1-Luc cells in 5 μL PBS into the brain at a rate of 2.5 μL tumour cells/min using a stereotaxic frame with pre-defined co-ordinates relative to bregma (x = −2 mm, y = 1 mm, z = −2 mm). At post-mortem, brains were in-vivo imaging system (IVIS) imaged ex-vivo, fixed in 4% paraformaldehyde for 24 h, and paraffin embedded. For intracardiac inoculation, mice were anaesthetized with isoflurane and 5 × 104 4T1 (BALB/c mice) or 3 × 105 MDA-MB-231 cells (NSG mice) were injected into the left ventricle of the heart in 100 μL PBS. At the end of the experiment, post-mortem tissues were IVIS imaged ex-vivo, fixed in 4% paraformaldehyde for 24 h, and either paraffin embedded or frozen.
For RNA expression analysis of freshly isolated cells, 4T1-Luc-RFP cells were inoculated either subcutaneously (5 × 105 cells), intravenously via the lateral tail vein (1 × 105 cells) or, as described above, intracranially (1 × 104 cells). Then, 9–13 days later, primary tumours, lungs, and brains were collected. Primary tumours were dissociated using the MACS mouse tumour dissociation kit (Miltenyi Biotec), and lungs and brain were dissociated using the MACS lung dissociation kit. RFP-positive 4T1-Luc cells were FACS sorted directly into RLT lysis buffer (Qiagen) for RNA extraction.
For fluorescent imaging of brain sections, whole 4% paraformaldehyde-fixed brains were submerged in 30% sucrose in PBS at 4 °C before moulding in OCT and freezing in dry ice plus isopentane. The frozen brain was cryostat sectioned at 20-μm intervals. For imaging of mCherry-positive cells, sections were defrosted, washed in PBS, DAPI stained, mounted, and scanned using the Vectra 3.0 automated quantitative pathology imaging system (Perkin Elmer).
For histological and immunohistochemical analysis, formalin-fixed paraffin-embedded (FFPE) brain sections were haematoxylin and eosin (H&E) or antibody stained and scanned on the NanoZoomer digital slide scanner (Hamamatsu). Tumour burden was quantified using ImageJ in a coronal section taken at the median level through each brain.
Gene expression profiling
RNA extracted (RNeasy Mini kit) from independently isolated 4T1 sublines derived from primary tumour (T,
n = 3), brain metastases (B,
n = 4), and lung metastases (L,
n = 3) was subjected to microarray analysis on Mouse WG-6 v2.0 expression BeadChips (Illumina, San Diego, CA, USA). RNA amplification, labelling, and hybridization were performed according to the manufacturer’s instructions at Cambridge Genomic Services. The raw data were extracted using GenomeStudio Software and was processed in R using the lumi package (
http://www.bioconductor.org). In brief, data were: 1) filtered to remove any non-expressed probes (detection
p > 0.01) across samples involved in a given group comparison; 2) transformed using the variance-stabilising transformation; and 3) normalised using the robust spline normalisation method.
Sample relations were estimated using unsupervised hierarchical clustering (Euclidean distance, average linkage) based on 17,550 probes. Two-sample
t tests (with random variance model) were used to identify differentially expressed genes between 1) L and T, 2) B and L, and 3) B and T sublines using the BRB-Array Tools (
https://brb.nci.nih.gov/BRB-ArrayTools) with a threshold of parametric
p value < 0.001. When multiple probes were mapped to the same gene, the most variable probe measured by interquartile range (IQR) across the samples was selected to represent the gene. Gene expression data are deposited at GEO with the accession number GSE110101.
RT-qPCR
RNA from cultured cells or whole mouse tissue or from freshly isolated tumour cells was extracted using the RNeasy Mini kit or the RNeasy Plus Micro kit, respectively, according to the manufacturer’s instructions. RNA was eluted in 10–30 μL nuclease-free water. The RNA concentration was measured in a 1-μL sample using the Qubit2.0 Fluorometer (Invitrogen) or the ND-1000 Spectrophotometer (Nanodrop). cDNA was produced by reverse transcribing 150–500 ng RNA using the QuantiTect reverse transcription kit (Qiagen) or SuperScript IV First-Strand Synthesis System (Invitrogen) according to the manufacturer’s instruction. qPCR was performed on 11.25 ng cDNA (4.5 μL) with 0.5 μL Taqman Gene Expression Assay probe and 5 μL 2× qPCR Master mix per well. Relative quantification was performed using QuantStudio Real-time PCR software or on an ABI Prism 7900HT sequence detection system. Each reaction was performed in triplicate. Data were analysed using QuantStudio Real-time PCR or SDS 2.2.1 software, and relative expression levels were normalised, unless otherwise stated, to B2m/B2M or Gapdh endogenous control, with a confidence interval of 95% for all assays.
Cell based assays
For spheroid growth assays, 7.5 × 102 cells/well were sorted into U-bottom low adherence 96-well plates (Corning) in DMEM containing 2% FBS. At 7 days post-seeding, the viability of the cells in the three-dimensional tumour spheroids was assessed using CellTiter-Glo (Promega) with luminescence quantified using a Victor X5 plate reader.
For the anoikis assay, 5 × 104 cells/well were seeded into low-adherence six-well plates (Costar) in DMEM containing 2% FBS. At 24 h post-seeding, cells were stained with Annexin V-APC/PI Apoptosis Detection Kit (eBioscience) and analysed using a BD Biosciences LSRII flow cytometer with FACSDIVA and FlowJo software. Cell viability was measured as a proportion of healthy (Annexin-negative, PI-negative) cells.
Human and mouse datasets
The expression levels of
ID2 and
ALDH3A1 and their relation to the receptor status of ER, progesterone receptor (PR), and HER2 were assessed for breast cancer samples in The Cancer Genome Atlas (TCGA) [
10]. The expression level of
BMP7 in non-tumour-bearing mice was assessed in 1) brain astrocytes, neurons, and microglia using the Srinivasan et al. RNAseq dataset [
11], and 2) brain microglia and astrocytes using the Kamphuis et al. microarray dataset [
12]. Clinical significance (distant metastasis-free survival) of
ID2 and
ALDH3A1 expression in ER
– breast cancers was assessed using publicly available data from Gyorffy et al. [
13]. Associations of
ID2 and
ALDH3A1 mRNA levels and brain metastasis were tested in four breast cancer datasets (GSE2034, GSE2603, GSE12276, and GSE14020), normalized by MAS5.0, log
2 transformed, and batch corrected. The tumour subtype information was published in a previous study [
14]. The datasets contained 104 ER
– breast cancer patients who either had no metastatic relapse (
n = 71) or brain-only metastatic relapse (
n = 33). Brain metastasis relapse-free survival analysis was performed using the upper tertile of gene expression to dichotomise the breast cancers.
To assess the expression of
ID2 in primary breast cancers and breast cancer brain metastases, the datasets described in Schulten et al. (GSE100534) [
15] and Harrell et al. (GSE26338) [
16] were retrieved. GSE26338 contains data deposited from seven different platforms. Samples run on the GPL5325 platform were enriched for metaplastic breast cancers and were therefore excluded.
Statistical analysis
Statistical analyses were performed using GraphPad Prism 7. Unless stated otherwise, data represent the mean values ± standard error of the mean (SEM). Where the groups followed normal distribution and had equal variances, the significance of the differences of the groups was tested using either unpaired Student’s t test (two groups) or one-way analysis of variance (ANOVA; multiple groups) followed by Bonferroni post-hoc testing for correcting multiple comparisons. If groups did not follow a normal distribution, non-parametric Mann-Whitney (two groups) or Kruskal-Wallis (multiple groups) tests were used. Statistical significance was defined as *p < 0.05, **p < 0.01, and ***p < 0.001.
Discussion
In this study, we generated a set of 4T1 sublines derived from primary tumours and from tumour cells that had spontaneously disseminated to either the brain or lungs of BALB/c mice. By pair-wise gene-expression profiling, we sought to identify differentially expressed transcripts specifically associated with metastasis to the brain. Moreover, as these sublines were derived in the context of a spontaneous metastasis model where the number of circulating tumour cells reaching secondary sites is very limited, we reasoned that differentially expressed genes were more likely to be involved in survival of cells in the brain and the early stages of metastatic colonisation rather than the expansion of established macrometastatic disease.
Two differentially expressed genes were selected for in-vivo validation, Aldh3a1 and Id2. Downregulation of either gene resulted in reduced tumour burden following intracranial inoculation whereas ectopic expression in either mouse or human tumour cells resulted in increased metastatic burden in the brain following intracardiac inoculation but did not promote increased metastasis to the lung.
Tumour cells colonising the brain face unique challenges. First, the tumour cells have to navigate across the blood-brain barrier that separates the brain from the general circulation [
32]. Second, the cellular and non-cellular composition of the brain is distinct from that found both in the primary tumour and at other metastatic sites. For example, tumour cells metastasising to the brain have to avoid apoptosis mediated by astrocyte-secreted Fas ligand [
26,
33] and detection and elimination by reactive microglia [
6]. Similarly, the absence of stromal fibroblasts and fibrillar collagen [
31] means that infiltrating tumour cells have to adapt to the foreign extracellular matrix composition. Finally, successful metastasising cells have to adapt to an altered metabolic environment [
7,
34]. The brain consumes ~ 20% of the body’s glucose-derived energy but, when blood glucose is low, the brain can adapt its metabolism to use acetate, ketone bodies, or fatty acids as alternative fuels. Metastasising tumours cells need to have metabolic flexibility to survive in this environment.
As has been reported in other studies [
25‐
28], in the models used here it was observed that tumour cells that disseminated to the brain parenchyma remained closely located to the brain capillaries (see Fig.
3). It is now well established that cancer stem cells reside in niche microenvironments, including perivascular niches in the brain [
35], and that these niches serve to maintain the tumour cell ‘stemness’ [
36]. As both ID2 and ALDH3A1 have been implicated in promoting stem cell-like features in tumour cells [
18,
24,
37], their elevated expression may provide an advantage in the early stage of metastatic colonisation by retaining cells in the niche until they are ready to face the full challenges of the brain microenvironment. In addition, among the ALDH family, ALDH3A1 has a specific role in peroxidic aldehyde metabolism [
24,
38]. In metabolically challenging conditions, particularly in conditions of high oxidative stress, production of reactive oxygen species (ROS) leads to peroxidation of lipids, which in turn gives rise to cytotoxic lipid aldehydes. The ability of ALDH3A1 to function as lipid aldehyde scavenger [
39] would provide a survival advantage for disseminating tumour cells encountering adverse conditions. However, it was notable in this study that, when expression of
ALDH3A1 was examined in breast cancer clinical datasets, high expression was significantly associated with an increased risk of distant metastatic relapse but not with brain-specific relapse (Fig.
4c, d). Consequently, although the models used in this study identified increased
Aldh3a1 expression to be associated with increased brain but not lung metastasis, the clinical data point to a more general pro-metastatic function. In contrast,
ID2 expression in clinical samples correlated with increased distant metastatic relapse and brain-specific metastatic relapse.
ID proteins are negative regulators of basic HLH (bHLH) transcription factors and, additionally, ID2 can bind and over-ride the tumour suppressor function of retinoblastoma (RB) tumour suppressor. In normal cells, these activities underpin the role of ID proteins in inhibiting cell differentiation and promoting cell proliferation. In cancers, ID proteins are found at a higher level than in normal adult tissues where they function to sustain self-renewal of stem or stem-like cancer cells and inhibit apoptosis and entry of tumour cells into senescence [
17,
18]. For ID2 it has been shown that expression is required to maintain glioma [
40], glioblastoma [
41], head and neck [
42], and colorectal cancer stem/stem-like cells. Previously it has been demonstrated in normal mouse mammary cells and other non-transformed epithelial cell types that exposure to TGFβ resulted in decreased
ID2 mRNA and protein levels, whereas BMP7 could both over-ride the TGFβ-mediated
ID2 repression and, on its own, promote increased
ID2 expression [
29,
30,
43]. BMPs are members of the TGFβ superfamily but, whereas TGFβs bind to the TGFβ type 1 receptors ALK1 and ALK5, BMPs bind to BMP-specific ALK receptors, with BMP7 binding to ALK2 [
44].
Given the role reported here for ID2 in promoting breast cancer metastasis to the brain, it was striking that, although levels of Tgfb1 mRNA were approximately two-fold higher in the normal mouse lung compared with the brain, levels of Bmp7 mRNA were > 100-fold higher in the brain compared with the lung. Moreover, whereas treatment with TGFβ1 had little impact on ID2 expression in either mouse mammary carcinoma or human breast cancer cells, BMP7 treatment resulted in a > 50-fold increase in ID2 mRNA levels and this level of expression was sustained for at least 5 days following BMP7 withdrawal. Together with the demonstration that increased Id2 expression protects tumour cells from loss of attachment-induced anoikis and promotes growth of three-dimensional tumour spheroids, the data presented here support a model in which spontaneously metastasising breast cancers cells with higher ID2 expression will have a survival advantage when they disseminate to the brain and that, once in the brain, cells that are receptive to BMP7 signalling to maintain high-level ID2 expression will be better able to progress to macrometastatic disease by tolerating the harsh pro-apoptotic brain microenvironment.
Conclusions
The incidence of brain metastasis in breast and other cancers is increasing, yet patient prognosis after a diagnosis of metastatic relapse in the brain remains dismal. This situation is further compounded by the lack of prospective clinical trial data to assess the efficacy of current system therapies in patients with brain metastases [
1]. In addition, there is an urgent need to develop methodologies for the identification of patients at high risk of developing brain metastasis and to better understand the unique biology associated with tumour cell colonisation of the brain to identify potential new therapeutic strategies. Using a model that more faithfully recapitulates the dissemination of tumour cells to secondary sites, combined with in-vivo and in-silico validation studies, we identified ID2 as a promising brain metastasis promoter. Future studies will be required to assess whether monitoring ID2 mRNA or protein levels in primary tumours or in plasma samples could provide a prognostic biomarker for patients at higher risk of relapse in the brain. Finally, it is well documented that ID family proteins play a key role in developmental processes and that expression is downregulated in most normal adult tissues but can be re-activated in cancer cells [
17,
18]. Although this pattern of expression makes ID proteins potential therapeutic targets, helix-loop-helix proteins are typically described as ‘undruggable’. However, in recent years there have been rapid technological advances in developing inhibitors of protein-protein interaction as well as approaches for targeted protein degradation such as proteolysis targeting chimaeras (PROTACs), bifunctional molecules for hijacking E3 ligases or small molecules that redirect E3 ligase activity [
45]. The development of reagents that can inhibit ID2 dimerisation or promote its degradation will, in the future, allow robust assessment of ID2 as a potential therapeutic target to prevent or limit the development of breast cancer brain metastasis.