Background
Breast cancer is heterogenous in its presentation, showing marked diversities spanning its morphological features, clinical outcomes, intrinsic subtypes and prevalence patterns. In Western countries, breast cancer is the most common female cancer and the leading cause of cancer mortality [
1]. In the Kingdom of Saudi Arabia (KSA), although its incidence is lower than in Western countries [
2], it is ranked highest amongst all the malignancies seen in Saudi females, comprising 21.8% [
3]. Breast cancer in Saudi women displays features and characteristics that make it distinct from what is seen in Western nations [
4]. Breast cancers in Saudi women are generally of a high grade, are locally advanced at the time of diagnosis, and affect predominantly females between 46-50 years of age, which is noticeably different from the median of 60-65 years seen in industrialized Western nations [
5], where locally advanced disease is much less common.
The existing histological classification systems for breast cancer are far from being accurate in predicting the prognosis of a given patient [
6]. Morphologically identical tumors can display divergent clinical outcomes. This can predominantly be attributed to molecular class differences that exist amongst the histologically similar types. A detailed knowledge about these molecular classes and their specific identification could lead to the development of a diagnostically more advanced and prognostically more beneficial reporting system that could offer much more precise prediction scores [
7]. For this reason, there is an urgent need to learn more about the underlying biology of this disease [
8].
The development of molecular analytical methods dates back to a quarter of a century, when immunohistochemistry (IHC) first allowed us to segregate these tumors into two main classes: ER+ and ER-. A decade later, the next step forward was the emergence of nucleic acid
in situ hybridization. This led to the identification of two new categories, dependent on whether HER2 was amplified or not. Many other single gene molecular markers were assessed, but failed to prove clinically relevant [
9]. Further progress in this area led to development of gene expression profiling, providing the simultaneous quantification of multiple genes; the final results could then be combined and used as a more reliable prognostic and predictive score [
7].
Perou
et al. were the first to draft a classification system based on gene expression analysis, and this consisted of four major molecular classes of breast cancer: luminal-like, basal-like, normal-like, and HER-2 positive [
10]. Subsequent studies suggested the existence of more molecular classes [
11‐
13] and this ultimately led to addition of a fifth subtype, with the molecular analytic spectrum now expanding to luminal A (LUMA), luminal B (LUMB), human epidermal growth factor receptor-2 (HER2) overexpressing, basal-like, and normal-like [
12].
A further advancement in this field was the use of IHC as a surrogate marker for gene expression profiling. Studies confirmed that it could reliably identify the major molecular subgroups of invasive breast carcinoma [
14‐
16]. Recently published studies have used five IHC surrogate markers (ER, PR, HER2, CK5/6, and EGFR) for molecular class distinction [
17‐
19], with luminal tumors being categorized by hormone receptor (HR) positivity, HER2 expression a feature of HER 2 tumors, and CK5/6 and/or EGFR indicative of basal-like tumors. The molecular dimensions were further diversified by the introduction of a hybrid category comprising the HER2 + luminal tumors [
20].
Diversities in the classification systems continue to emerge, but current studies still lack the potential to formulate a simple, practical, and easily applicable classification system for gene expression analysis that can completely and concisely encompass all the divergent facets of breast cancer [
20]. In addition, the major studies conducted so far are predominantly reflective of breast cancer patterns, subtypes, and behaviour in Western populations; there remains the need to explore these issues in different regions.
We initially conducted a pilot study in which gene expression profiling by real time quantitative PCR was used to determine the "intrinsic" Saudi breast cancer subtypes and its prevalence was compared to the more commonly profiled Caucasian population [
19]. A comparison between the IHC profiles of these breast cancers was correlated to the gene expression analysis and a discrepancy rate of 39% was identified, most conspicuously in the luminal type. That work suggested that breast cancer from Saudi and Caucasian populations may have a similar biology, but show variability in the subtype distributions. It was concluded that analysis of a larger cohort of patients was needed to precisely determine the molecular taxonomy of breast cancer in the Saudi population.
The present study used a larger regional based cohort, and the spectrum of molecular patterns seen in the Saudi population was analyzed by combining and modifying the current and upcoming molecular trends. The prevalence patterns obtained were compared and contrasted with Western patterns and other regionally based studies. A correlation of each subtype with Ki-67 proliferation index, p53 mutation status, histological type and grade of the tumor was also carried out.
Discussion
In this study we have demonstrated that by using IHC as a surrogate for gene expression profiling, there is a unique molecular class prevalence pattern in our population, a relative paucity of luminals and preponderance of unclassified cases being its highlighting features. In addition, other than a strong correlation between invasive lobular carcinoma and hormone receptor expression, there was no association between molecular classes and the histologic type of the cancer. This perfectly matches with the general belief that differences in prognosis and response to treatment within morphologically identical tumors can largely be attributed to genetic/molecular differences [
6]. It also emphasizes a justifiable need for replacing current morphology based classification systems of breast cancer by a molecular classification that may be more relevant prognostically [
7].
Several issues in our study need to be discussed. The unclassified category in our study comprised 42.86% of cases, which is significantly higher than in studies by Carey
et al. (6.25%) [
17], Tamimi
et al. (4.9%) [
18], Cheang
et al. (8%) [
26], and Bhargava
et al. (4.87%) [
20] (p = < 0.001). It is also significantly higher than that reported by Kim
et al. [
27] in Korean women (15.9%, p = < 0.001). Adebamowo
et al., in their study on Nigerean women [
28] did not report any cases in the unclassified category. The presence of a dominant unclassified category merits further investigation to gain further insight into its nature. It highlights the inability of currently used molecular analytic spectrum to adequately represent all the molecular types and stresses on the immense need for the development of new prognostic and predictive markers in breast cancer diagnosis, as has also been suggested by Brennan
et al. [
29].
Unclassified cases were initially considered to be synonymous with '
normal-like' breast cancers. These tumors cluster with non-tumoral breast cancer cells and exhibit over-expression of PIK3R1 and AKR1C1, in addition to other genomic alterations [
30]. The current concept states that the
'normal-like' subtype is absolutely different from the unclassified (penta negative) "ER-, PR-, HER2-, CK5/6, and EGFR-" group, as absent or decreased expression of basal markers is not a feature compatible with the '
normal-like' molecular class [
31]. They are very good prognostically [
30] and are grouped with the luminals, both of which exhibit low pathologic complete remission rates of 6% [
32].
According to Huo
et al., the unclassified category comprises two contrasting branches, a bad prognostic branch, characterized by the expression of vascular endothelial growth factor, B-cell lymphoma extra-large protein, and Cyclin E, and the a good prognosis branch, characterized by expression of B-cell lymphoma protein 2 and Cyclin D1 as its distinguishing features [
33]. The unclassified and '
normal-like' are completely separate entities and IHC surrogates for these categories have not yet been developed. Associating these with a particular set of negative or absent markers may lead to misinterpretations of their intrinsic biological characteristics [
33].
In this study 35.35% of the unclassified tumors exhibited a moderate to high Ki-67 proliferation index, and 16.1% demonstrated p53 mutations, suggesting the presence of two divergent patterns of unclassified tumors in our population. This hypothesis has not yet been validated, but further analysis of this finding is essential.
Hormone receptor positive tumors are usually diagnosed irrespective of stratification of receptor staining intensity or distinction between ER and PR positivity. Studies demonstrate that the benefit of endocrine therapy is directly related to estrogen receptor levels [
34], with responses beginning in tumors with > 10% positive nuclei, and increasing gradually with increased receptor expression [
35]. Strong ER+ cases benefit from endocrine therapy alone, in contrast to those with low to moderate ER positivity ( by Alred score) that demonstrate a better outcome with the addition of chemotherapy [
36]. Therefore, the practice of assigning all strata of ER positivity to one group should be revised.
The prognostic significance of PR presence, in ER + tumors, with regards to endocrine (tamoxifen) treatment is controversial [
37]. Studies suggest that PR has no prognostic predictive role in these cases [
34,
38]. The level of PR positivity is an independent prognostic marker, with Stendahl
et al. demonstrating that the response to tamoxifen begins to be seen in tumors with more than 75% positive nuclei [
35]. With these separate prognostic implications, the concept of a conjoined ER/PR group also merits re-evaluation.
Luminal tumors in our study comprised 19.9% of all cases, again, a prevalence pattern that is unique to our population. This is in stark contrast to the prevalence pattern seen in others, specifically, the more profiled Caucasian populations [
17,
18,
20,
26], as well as the populations from regional based studies in Korea [
27] and Nigeria [
28]. LUMA represented merely 3.9% of all cases, which is significantly less than those reported in the other studies, which ranged from 54%-77.6% (p = < 0.001). Analysis with hybrid "luminal-HER2" and "HER2-basal" cases included among luminal and HER2 classes, respectively [
17,
18,
20] still yielded a significantly low prevalence of luminal tumors including LUMA in our population. According to studies in Western populations, Luminal-A tumors are often the most prevalent and have a relatively good prognosis. Adelaide
et al. [
39] showed luminal-A class to be characterized by a gain of 1q, 8p12-ter, loss of 1p36-ter, and an amplification in 5q35, along with other genetic alterations [
30]. It is possible that the low representation of luminal tumors, specifically the LUMA, seen in our population may be reflective of a different set of genetic alterations. This possibility requires investigation, but such queries provide the framework for future research.
LUMB breast cancers are characterized by expression of HR along with HER2 associated genes (i.e., ERBB2 and GRB7) and a cell proliferation pattern designated by expression of MK167, CCNBI and MYBL2 [
26]. They are associated with a poorer clinical outcome than LUMA tumors. Identification of luminal B tumors at the protein level is a point of controversy. Some authors have used the co-expression of HR and HER2 to define this group, based on the fact that the HER2 associated genes (i.e ERBB2 and GRB7) are expressed in 30-50% of LUMB tumors, and realizing that this will not identify all of the tumors in this category, with a number of them ultimately being classified as LUMA tumors [
17]. Bhargava
et al. defined LUMA and LUMB as pure hormone receptor positive, the differentiating feature between them being the intensity of receptor positivity [
20]. Cheang
et al. added that a Ki67 proliferation index of more than 13.25% is a hallmark of LUMB tumors [
26]. Ki67 is a nuclear marker of cell proliferation, and its expression correlates proportionally to poorer clinical outcomes [
40‐
42]. Although it is not currently included in routine clinical decisions, recent studies [
43,
44], have indicated that alterations in its levels after neo-adjuvant endocrine treatment simultaneously alters the long-term patient outcome. As suggested from gene expression profiling, only 30% of LUMB tumors are HER2 positive, indicating that this clinical marker is not sensitive enough to identify most LUMB breast cancers [
26]. These tumors are also endocrine (tamoxifen) resistant and require estrogen deprivation in addition to blockage of HER2 pathways [
34]. Therefore, including them as an integral component of endocrine sensitive tumors may not be justified. Due to these complexities, the HER2+ tumors need to be considered separately from pure luminal tumors, which should be further categorized as luminal A and luminal B, with those showing co-positivity of HER2 grouped into a separate hybrid category Luminal-HER2 hybrids [
20].
The prevalence of LUMB in our study (16%) was similar to those of other studies [
17,
18,
20]. The only study with which this group showed a striking disparity was that of Cheang
et al. (16 vs. 32%) [
26]. The reason may be that they have used a Ki-67 proliferation index of 13.25% as a cutoff between LUMA and LUMB, as compared with no cutoff used by us. They demonstrated that although gene expression profiling remains the most sensitive method, Ki-67 proliferation added to ER, PR, and HER2 status can be used to identify additional LUMB tumors that would not otherwise be identified by these three markers alone [
26]. This may have led to increased expression of LUMB in their study. Ki-67 is a well established proliferation marker in cancer and an excellent biomarker for LUMB tumors. Ki-67 assessment has been a matter of controversy as some studies have used 10% [
45,
46] or 20% [
47,
48] as cutoff points, while others reported use of values around the mean [
49] or median [
50,
51]. In our study we did not use a cutoff point but used 10% as a starting point for moderate proliferative activity and demonstrated that 22.2% of LUMA and 43.24% of LUMB tumors had a moderate to high Ki-67 labeling index. As a matter of fact, LUMA tumors showed the least prevalence of moderate to high Ki-67 proliferation index in our study. This is in agreement with Bhargava
et al [
20] who reported the lowest average Ki-67 labeling index in their LUMA group.
The prevalence of the HER2+ class in our study was almost identical to that observed by Kim
et al. [
27] in North Korean women (17.31% vs 17.1%). However, it was significantly higher than the prevalence reported in each of the other five studies that we used for the comparison (17.31% vs. a mean of 5.33%) (p = <0.001) [
17,
18,
20,
26,
28]. This difference re-emphasizes that there are regional variations in prevalence, a concept further supported Al-Kuraya
et al. who demonstrated that a markedly higher frequency of HER2 and MYC amplifications are seen in Saudi as compared to Swiss breast cancers [
2].
HER2 over-expression is prognostically unfavorable but is associated with better responsiveness to trastuzumab (Herceptin) therapy and anthracycline-based chemotherapy [
45]. Thus its accurate analysis and precise quantification is essential [
46] to identify patients who are eligible for trastuzumab therapy [
47].
Studies have demonstrated that the most common mechanism (90-96%) of HER2 over-expression is by gene amplification [
45,
48‐
50]. Both IHC and FISH have been validated as methods for its analysis [
46]. There has recently been the introduction of bright field
in situ hybridization techniques such as chromogenic
in situ hybridization (CISH) and silver enhanced
in situ hybridization (SISH). Here, a peroxidase enzyme labeled probe with chromogenic detection is used instead of a fluorescent labeled probe, hence the results can be visualized by standard bright field microscopy. Advantages over FISH are that simultaneous histological and HER2 assessment can be carried out; moreover the signals do not decay over time [
46]. We employed SISH for confirmation of the HER2 positivity on all our cases.
HER2 positivity in our study was seen in two sets of cases, exclusive HER2 positive cases and HER2 positivity in association with luminal and basal markers as part of the spectrum of the hybrid class. Bull
et al. have demonstrated increased frequency of p53 mutations in carcinomas with neu/erbB2 amplifications [
51]. In our study we also observed that 45% of HER2+ tumors demonstrated focal or diffuse positivity, thus making HER2 the major class exhibiting p53 mutations. This level for HER2 was significantly higher than the other types (p = 0.001). The combination of HER2 and p53 mutations is associated with an increased risk of disease recurrence and overall mortality in comparison with patients who have only one, or neither, of the mutations [
51]. p53 over-expression was not encountered in the LUMA class which is consistent with the fact that p53 over-expression is known to be associated with high proliferation rates and a poor clinical outcome [
52].
In addition to HER2 positive LUMB cases categorized as LBHH, we also encountered LABH, LBBH and HBH cases (Table
2). In an attempt to define which of the molecular classes these cases may be most similar to, we compared the Ki-67 proliferative index and p53 expression levels in LUMB-basal hybrid and LUMB-HER2 hybrid cases to those of LUMB, basal, and HER2 classes. In this context, LUMB-HER2 hybrid cases seem to represent a biological overlap between its two constituent classes. Their p53 expression appeared at par with that of LUMB but was significantly lower than that of HER2. On the other hand, their Ki67 proliferation index (moderate to high) was significantly higher than that of both the HER2 and the LUMB molecular classes. This varied, distinctive pattern merits them an attribution of a separate molecular class rather than a subtype of the luminal class as conceived by several authors [
17,
18]. In this we agree with Bhargava
et al. [
20]. On the other hand, LUMB-basal hybrid cases seem to be closer to the LUMB class. Consequently, they may be considered as part of a "spectrum" of the LUMB class rather than a separate subtype. This is again in agreement with Bhargava
et al. [
20]. A similar analysis was not possible for LUMA-basal and HER2-basal hybrid cases because of their limited number (3 cases each). These preliminary observations, however, need to be validated through clinical investigation of a large number of patients. But it will still be important to determine if this grouping will be valid therapeutically, and whether the co-existence of basal markers in luminal and HER2 categories should become the basis for a different treatment approach.
The basal category, with a frequency of 9.95%, represented a pattern harmonious with Bhargava
et al. (10.7%) [
20], Cheang
et al. (9%) [
26], and Tamimi
et al.(10.9%) [
18]. The only study with a higher basal prevalence was Carey
et al., with 20.1% [
17]. Tan
et al [
53] demonstrated that increased levels of Ki 67 show a correlation with immuno-negativity for estrogen receptors, high tumor grade and increased mitotic activity, while in another study Kuroda
et al [
54] showed high levels of Ki-67, p-53 and P-glycoprotein more in basal than in non basal types of breast cancer. In our study too the Ki-67 index was highest in the basal type followed by HER2, hybrids, LUMB, UC and finally LUMA. The significantly high Ki67 proliferation index in our basal-like class of breast cancer is in agreement with Carey
et al [
17], Siziopikou & Colbeigh [
55] and Bhargava
et al [
20].
Triple negative breast cancer is a type of aggressive breast cancer lacking the expression of ER, PR and HER2. It was found to be much more frequent in BRCA 1 and BCRA 2 positive than in BRCA-negative patients by Atchley
et al [
56]. The currently available ER-targeted and HER-2-based therapies are not effective for treating triple negative breast cancer. Recent studies have revealed a number of novel molecular features of triple negative breast cancer. Gaining better insights into these molecular pathways may lead to identification of novel biomarkers and targets for development of diagnostic and therapeutic approaches for its prevention and treatment [
57]. The term "triple negative" is a term frequently used, sometimes interchangeably with basal type, by both researchers and clinicians. It is now accepted that not all triple negative cancers are basal [
58] and that they should not be taken as synonymous. There is an overlap with the basal group in 60-90% of cases and they are a component of the unclassified penta negative type. Triple negative tumors not exhibiting basal markers are the unclassified or the penta negative group in our study.
Authors' contributions
DT, MS, and AA, AO were responsible for the design of the study, selection of the cases, collection of information, and TMA. DT, MS, AA were also responsible for IHC and SISH staining and interpretation of the results. DT and MS were evaluating and reporting the results of the stains. DT, MS, AA were involved in the writing up of the manuscript. AH helped in the statistical analysis. All authors have read and approved the final manuscript.