Introduction
High-grade serous ovarian carcinoma (HGSOC) is the most common histotype of epithelial ovarian carcinoma (EOC), accounting for nearly 75% of all cases. With a 5-year overall survival (OS) rate of approximately 40%, it is also the most lethal histotype [
1]. About 80% of HGSOC patients are diagnosed with advanced-stage disease, classified as stage III or IV according to the International Federation of Gynecology and Obstetrics (FIGO) criteria [
2,
3]. Despite an initial high chemotherapy response rate of 85%, the 5-year OS rate of these patients with advanced-stage disease is below 30%, primarily due to the development of chemotherapy resistance [
1,
2,
4‐
6]. In contrast, patients with early-stage disease (FIGO stages I and II) have a 5-year OS rate exceeding 70% [
1,
3,
5].
Even when accounting for clinical characteristics such as age, outcome of debulking surgery, and FIGO stage, variations exist in chemotherapy responsiveness, recurrence rates, and OS rates [
6‐
10]. Within HGSOC, four specific molecular subtypes were identified through microarray gene expression profiling and further delineated as the mesenchymal, immunoreactive, proliferative, and differentiated subtype [
8,
9]. With a median OS of less than 30 months, the mesenchymal subtype had the poorest prognosis, whereas the immunoreactive subtype showed the best prognosis, with a median OS extending beyond 45 months [
7,
10]. Moreover, retrospective studies showed varying treatment responses among these subtypes. For example, the proliferative subtype seemed to have the best response to carboplatin, while the mesenchymal subtype was particularly receptive to taxane-based chemotherapy [
11]. Additionally, both the proliferative and mesenchymal subtypes appeared to have the greatest benefit of bevacizumab [
12]. As such, these molecular subtypes may not only serve as potential prognostic indicators but also contribute to a more personalized treatment approach. However, the high costs, difficult sample quality assurance, and time-intensive nature of gene expression analysis limit routine clinical application.
To overcome this limitation, a histopathological approach for HGSOC subtyping has been proposed by Murakami et al. (2016) [
13]. They introduced a classification system of four histological subtypes based on the initial molecular classification and observed significant differences in OS between these subtypes. Tumors with a destructive desmoplastic reaction and a mesenchymal transition or labyrinthine pattern were classified as mesenchymal, exhibiting the lowest OS rates. Tumors displaying a smooth invasive front and an immunoreactive pattern, defined as a median of more than 100 surrounding and 50 infiltrating lymphocytes from five visual fields at × 400 magnification, were labeled immunoreactive, showing the longest progression-free survival (PFS) and OS. The remaining tumors were categorized into the solid and proliferative subtype or the papilloglandular subtype, corresponding to the gene expression-based proliferative and differentiated subtypes, respectively. However, the authors reported that the reproducibility of the subtypes was considered suboptimal for clinical practice, with an average consistency rate of 74% between six observers and an average Cohen’s
κ coefficient of 0.64 [
13,
14].
In their next study, Miyagawa et al. (2023) observed a similar low concordance rate, with a Fleiss’
κ coefficient of only 0.348, when applying these criteria to whole slide images (WSI). Consequently, they further delineated the criteria for the mesenchymal, solid and proliferative, and papilloglandular subtypes. Moreover, given the challenges of observing cellular details on WSIs, they adapted the definition of the immunoreactive subtype to require at least 25% of the stromal area to be infiltrated by lymphocytes. These refined and adjusted criteria improved the Fleiss’
κ coefficient to 0.549, but differences in survival between the subtypes based on these criteria were not evaluated [
14].
The objective of the current study was to assess the clinical utility of HGSOC histological subtypes within a cohort of patients with HGSOC. To achieve this, we examined the interobserver reliability to ascertain reproducibility and analyzed the prognostic variances in survival among histopathological subtypes of HGSOC, as defined by Murakami et al. (2016) and Miyagawa et al. (2023) [
13,
14]. The findings of this study may contribute to a more personalized treatment approach for HGSOC by improving the prediction of prognosis and forming the basis of future research into treatment response differences between the subtypes.
Discussion
In this study, we classified 208 HGSOC cases into mesenchymal, immunoreactive, solid and proliferative, or papilloglandular subtypes, based on histopathological criteria established by Murakami et al. and Miyagawa et al. [
13,
14]. We observed only a fair interobserver agreement for each method. Importantly, no differences in 5-year progression-free and overall survival rates were observed between the subtypes, irrespective of the classification method employed. Consequently, subtyping HGSOC based on the currently available histopathological classification methods offers no clear advantage in clinical practice.
To make well-informed treatment decisions based on the predicted prognosis and treatment response of the histological subtypes, it is crucial that these subtypes are consistent and reproducible. In the study of Murakami et al. (2016), six observers classified the cases, resulting in an average concordance rate of 74% (range: 61–89%, average Cohen’s
κ = 0.65, range: 0.46–0.85), a rate they regarded as suboptimal for use in clinical practice [
13]. In their subsequent study by Miyagawa et al. (2023), the application of the Murakami et al. criteria yielded a Fleiss’
κ of 0.348 between four observers. After refining the definitions of the mesenchymal, solid and proliferative, and papilloglandular subtypes, and adjusting the definition of the immunoreactive subtype due to challenges in observing cellular details on WSI, the coefficient increased to 0.549 [
14]. In our study, two specialized gynecological pathologists reached a 62.5% concordance rate (Cohen’s
κ = 0.34), utilizing the refined criteria of Miyagawa et al., except for the immunoreactive subtype, which was based on the criteria of Murakami et al. It was not possible to calculate Cohen’s
κ for the cases classified using Miyagawa et al. criteria due to reclassification of tumors jointly classified as immunoreactive, solid and proliferative, or papilloglandular under the Murakami et al. criteria. However, using the immunoreactive definition of Miyagawa et al. likely has a limited influence on the determined Cohen’s
κ, as only 13 out of 86 cases changed subtype. We argue that this reproducibility rate is not robust enough for clinical utilization in individual patients [
16,
17].
The observed low concordance rate in subtype classification may be the underlying reason for the absence of differences in PFS and OS between the subtypes in our study. To minimize misclassification, each case was rigorously reviewed by two specialized gynecological pathologists. However, a possible overrepresentation of the mesenchymal subtype was observed, with 122 of 208 cases (58.7%) being classified as mesenchymal. This contrasts with the 36% in the study of Murakami et al., and 44% in the study of Miyagawa et al. [
13,
14]. Cohen’s
κ for distinguishing mesenchymal versus non-mesenchymal subtype was 0.352 (69.2% concordance rate) in our study. In comparison, the study of Murakami et al. reported a Cohen’s
κ of 0.70 (85% concordance rate), while the study of Miyagawa et al. reported a Fleiss’
κ of 0.703. This difference might contribute to the contrasting survival outcomes for the mesenchymal subtype, as this study could not replicate the lower OS rate for the mesenchymal subtype observed in the study of Murakami et al. Moreover, Murakami et al. reported a higher PFS and OS rate for the immunoreactive subtype compared to the other subtypes. Our study showed a similar, although not statistically significant, trend. A potential explanation for this might be the limited number of cases we classified as immunoreactive using Murakami et al. criteria (
n = 10, 4.8%). Using the revised definition of Miyagawa et al., this number increased to 23 (11%). However, this did not lead to differences in survival outcomes.
Khashaba et al. (2022) also investigated the correlation between histopathological subtypes of HGSOC and survival outcomes [
18]. However, their classification algorithm differed from that of Murakami et al., encompassing variations in both the histopathological subtypes and the criteria used for the classification [
13,
18]. Cases featuring a cellular stromal reaction in > 10% of tumor tissue were classified as mesenchymal. All other cases with a mitotic count of > 30/10 high power fields were classified as proliferative. Cases showing a lower mitotic count, but > 20 lymphocytes infiltrating tumor nests per high power field, were categorized as immunoreactive. The remaining cases were categorized as differentiated, and further divided into those with solid endometrioid transitional (SET) or (micro)papillary features [
18]. These criteria were derived from different studies and were not compared with the gene expression-based subtypes [
13,
19‐
21]. Therefore, we did not apply this classification method to our cohort. Nevertheless, Khashaba et al. identified the histopathological subtype as a significant factor for PFS (
p = 0.008) in a univariate analysis, although no significant correlation was found with OS. The mesenchymal subtype demonstrated the shortest median PFS, whereas the differentiated subtype with SET features exhibited the longest median PFS. Additionally, primary debulking surgery, compared to interval debulking surgery, and administration of paclitaxel and carboplatin chemotherapy, compared to other chemotherapeutic agents, also showed correlation with a better PFS (
p = 0.004 and
p = 0.001, respectively) [
18]. However, multivariate analysis on PFS was not conducted, probably due to the small sample size of the study (
n = 85). Consequently, the correlation between the histopathological subtype based on the criteria of Khashaba et al. and PFS remains uncertain [
18].
Research has also indicated that treatment responses might vary among the subtypes. Specifically, the mesenchymal subtype has been reported to have the most favorable response to taxane-based chemotherapy and potentially to dose-dense chemotherapy as well [
11,
13]. Furthermore, bevacizumab is thought to be more efficacious in the mesenchymal and proliferative subtypes [
12,
14,
22]. However, since the standard treatment regimen for our patients included carboplatin and paclitaxel, without the addition of bevacizumab, our study could not explore these potential differential responses to therapy.
One of the strengths of our study is that all cases were independently reviewed by two gynecological pathologists. In instances of discordance, a joint review was undertaken, ensuring consistent and accurate subtype classification. While the initial agreement between pathologists was relatively low, consensus was eventually reached for all cases. Furthermore, subtyping was performed by the same two pathologists for both classification systems. This approach effectively eliminated variance between classification systems that might arise from discrepancies in individual pathologist evaluations.
A limitation of our study was that we did not perform RNA sequencing. Consequently, we could not validate whether the subtypes, as identified by the two gynecological pathologists, aligned with those determined by gene expression analysis. In the study of Murakami et al., concordance between the gene expression-based subtypes and histopathological subtypes was examined but limited to a subset of samples (
n = 59). They observed agreement in approximately 70% of the cases, showing a significant association between the subtypes (
p < 0.001) [
13]. However, given the observed low concordance rate between pathologists in our study, there is a clear need for more straightforward subtyping criteria, particularly for the mesenchymal subtype, as the presence of this subtype may impact the choice of treatment [
11‐
14,
22]. Future investigations should prioritize refining these criteria to improve interobserver consistency, and subsequently examine survival outcomes.
We suggest further exploration of the application of artificial intelligence (AI) or machine learning. The feasibility of histotype classification using machine learning has already been demonstrated [
23]. For HGSOC subtyping, AI or machine learning could support the refinement of HGSOC subtype criteria by identifying novel or alternative histopathological features. Moreover, these computational approaches may have the capacity to determine HGSOC subtypes autonomously. However, to facilitate such research, high-quality RNA-sequencing data is necessary, considering the current challenges in achieving adequate reproducibility rates for histopathological subtypes. Once AI or machine learning is refined through training with robust data, it could play an important role in subtyping HGSOC, enhancing diagnostic precision, and informing treatment strategies.
In conclusion, the current histopathological subtypes of HGSOC as defined by Murakami et al. and Miyagawa et al. demonstrate only fair reproducibility. Furthermore, in our cohort of 208 HGSOC patients, no differences in overall and progression-free survival between the subtypes were observed. As such, the implementation of these subtypes in clinical practice remains premature. Further research is warranted to establish criteria that are more straightforward and to investigate their association with survival outcomes.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.