1 Introduction
It is estimated that in 2020, breast cancer in females surpassed lung cancer, becoming the primary cause of new cancer cases globally. The number of new cases reported was 2.3 million, and 685,000 fatalities occurred [
1]. Breast cancer subtypes are characterized by the presence of hormone receptors (HR) and the human epidermal growth factor receptor 2 (HER2). These subtypes display varying therapeutic sensitivities and clinical prognoses [
2]. Triple-negative breast cancer (TNBC) comprises 15–20% of total breast cancers and is known for its aggressive progress, high incidence of recurrence, and poor prognosis. TNBC is defined by the absence of HR expression as well as the absence of HER2 overexpression or amplification [
3,
4]. Unlike other breast cancer subtypes that utilize therapeutic targets like ER or HER, TNBC currently lacks approved targeted treatments. As a consequence, systemic chemotherapy remains the accepted standard of care for patients with TNBC [
3]. Due to the limited treatment options available for TNBC, it is crucial to urgently investigate new targets that can enhance the prognosis of this condition. The identification of effective target genes is vital to making targeted therapy for TNBC more feasible. Recently, researchers have used bioinformatics techniques, such as single-cell analysis and RNA-seq analysis, to explore the mechanisms underlying TNBC [
5‐
7]. To gain a deeper understanding of the molecular mechanisms underlying TNBC, it is crucial to integrate bioinformatics approaches with Mendelian randomization for the exploration of TNBC-associated biomarkers. Weighted gene co-expression network analysis (WGCNA) is a method for detecting hub genes related to TNBC, but few studies have been done in this regard. Furthermore, there has been no application of Mendelian randomization to validate the results of transcriptome analysis in TNBC.
In oncology, microarray analysis is used for various clinical purposes, including molecular cancer classification, tumor response prediction, and prediction of prognosis [
8]. Using the WGCNA algorithm, highly correlated genes are systematically integrated into multiple modules [
9]. WGCNA is a powerful tool for discovering the relationship between genes and clinical phenotypes and has been used to identify cancer markers like gastric cancer [
10] and ovarian cancer [
11]. As a result, identification of the expression of the appropriate biomarkers for identification and therapeutic evaluation is crucial for understanding the mechanisms of diseases such as TNBC [
12,
13]. It was the goal of this study to identify core genes, novel biomarkers, or possible mechanisms associated with TNBC.
An epidemiological method, Mendelian randomization (MR), can be used to reinforce causal inference by using instrumental variables from genetic variants of an exposure [
14]. As genetic variants are distributed randomly at conception and, consequently, uncorrelated with significant confounders, this approach minimizes any residual confounding [
15]. MR requires the selection of genetic variants that are highly related to the exposure under investigation. As alleles are inherited randomly, individuals are assigned to different levels of exposure dosage [
16]. In this study, the hub gene, CCNB1, was examined with two samples of MR data to determine if it is associated with the risk of TNBC.
In this work, differentially expressed genes (DEGs) between normal ductal cells of the breast and TNBC were examined. Using WGCNA, the most relevant modules were identified and intersected with DEGs, leading to the discovery of five potential diagnostic biomarkers, namely CDC2, CCNB1, CCNA2, TOP2A, and CCNB2. These biomarkers have the potential to contribute to the investigation of the mechanism of TNBC and serve as targets for immune therapy. Additionally, the causal relationship between CCNB1 expression and TNBC was verified through Mendelian randomization.
4 Discussion
TNBC is a heterogeneous cancer from both biological and clinical perspectives, posing an unmet need due to its aggressive features and unfavorable prognosis [
28]. Its chemoresistance, rapid invasion, atypical symptoms, and limited treatment options in clinical settings are major factors responsible for its poor outcome [
29]. In this study, WGCNA and DEGs were used to obtain core genes, and we conducted analyses on immune infiltration and immune cell correlation. Our findings for the first time confirm the positive causal role of the CCNB1 gene in TNBC through Mendelian randomization.
Disease-related genes and biomarkers are valuable tools for detecting, diagnosing, prognosing, and monitoring therapeutic responses [
30]. In a recent study, PPP1R14B was upregulated in TNBC tissues and correlated with paclitaxel resistance [
31]. In breast carcinoma, TRPS1 was identified as a highly specific marker, particularly for TNBC based on TCGA database analysis and immunochemistry [
32]. Another study identified four other genes as prognostic signatures for the disease-free interval by using DEG and PPI analysis [
33]. Furthermore, through DEGs, WGCNA and PPI, our study discovered that the hub gene associated with TNBC was CCNB1, along with four other genes (CDC2, CCNA2, TOP2A, and CCNB2). The performance of our nomogram model in predicting triple-negative breast cancer was satisfactory, with CCNB1 being the most significant gene. By calculating the ROC curves, the efficacy of the five hub genes in distinguishing between TNBC and the normal group was assessed. The nomogram exhibited satisfactory AUC values, validating its potential as a reliable diagnostic tool. Importantly, CCNB1 demonstrated the highest discriminatory power. Therefore, it is essential to investigate the mechanism by which CCNB1 facilitates TNBC and increases its incidence.
CCNB1, one crucial molecule regulating the progression of the G2/M phase, is crucial for the cell cycle in mitosis [
34]. Due to the significance of cell division and the cell cycle for tumor development, CCNB1 is crucial for tumor development. Overexpression of CCNB1 has been found in various tumors and is related to poor outcomes compared to the control group [
35,
36]. CCNB1 expression is elevated in breast cancer tissue, and the expression of this biomarker demonstrates a significant correlation with patient survival time, tumor burden, methylation, infiltration of immune cells, as well as the absence of estrogen receptor expression [
37]. Previous research has demonstrated notable links between CCNB1 and the absence of hormonal receptors, as well as the presence of HER2 receptors [
38]. Additionally, CCNB1 has been related to TNBC in previous studies. Overexpression of CCNB1 is an unfavorable prognostic factor for TNBC patients compared to the normal group [
39]. The decrease in cell viability at the G2/M phase in TNBC cells was observed upon the knockdown of PNO1, which was accompanied by the downregulation of CCNB1 and CDK1 protein expression [
40]. Deregulated PNO1 also inhibited tumor growth in vivo and decreased the number and confluency of TNBC cells in vitro [
40]. In this study, CCNB1 was found to be overexpressed in the TNBC group and exhibited strong performance in both the nomogram and the ROC curve. These findings align with those of previous research, thus further confirming our results. Our study provides additional evidence supporting CCNB1 as a promising therapeutic target for TNBC.
The involvement of immune cells in TNBC was investigated using cibersort's immune infiltration analysis in this study. A significant disparity in the expression patterns of diverse immune cell subsets was observed, including naive B cells, CD8 T cells, resting CD4 memory T cells, follicular helper T cells, resting NK cells, monocytes, macrophages M1, resting dendritic cells, mast cells resting, activated mast cells, and eosinophils. These findings are in line with previous research conducted in the field of cutaneous melanoma, which showed higher levels of activated CD4 + T cell infiltration in metastatic samples, indicating their potential contribution to cancer metastasis [
41]. The CCNB1-specific CD4 T cell response has been studied insufficiently. However, T cell assay analysis demonstrated that CCNB1 has many CD4 T cell epitopes that are recognized differently by naive and memory CD4 T cells [
42]. Notably, there was a positive correlation observed between CCNB1 expression in TNBC and activated CD4 memory T cells, while an inverse association was noted with T follicular helper cells and memory B cells. Furthermore, it has been shown that immune checkpoint therapy enables T follicular helper cells to enhance B immune cell activity, supporting the anti-tumor response [
43]. The activation of B cells in T cells and the generation of antibodies play a vital role in the immune reaction. Therefore, these findings highlight the significance of tumor-infiltrating lymphocytes as a clinically relevant and reproducible biomarker that could impact the prognosis and treatment response of TNBC.
GWASs have significantly impacted the field of genetics in the last decade, particularly in complex disease research. They offer an unbiased method for exploring the genetic foundation of complex diseases [
44]. The current investigation is the first to employ a two-sample MR analysis using numbers of GWASs to explore the causal relationship between CCNB1 levels and TNBC risk. MR is a comparable methodology to prospective randomized controlled trials (RCTs), which mitigates systematic biases impacting observational studies like confounding and reverse causality [
45]. In this study, MR was creatively employed to authenticate the transcriptomics analysis findings. The findings suggested a possible causal link between serum CCNB1 levels and an increased risk of TNBC. In order to effectively minimize the regression dilution resulting from detection errors, highly accurate genotyping was used. To ensure the reliability of the findings, the MR-Egger regression test showed no indications of horizontal pleiotropy or heterogeneity.
Although this study revealed meaningful findings, certain limitations should not be ignored. Firstly, to increase the convincingness of the results, we should have included more TNBC datasets. Unfortunately, we were only able to analyze three datasets due to the lack of microarray data in the TNBC field. Secondly, although we employed bioinformatics analysis to examine the candidate hub genes and their potential functions related to TNBC development and used Mendelian randomization for validation, further biological experiments and clinical validation are necessary. These additional experiments and validations will help us confirm the exact mechanisms underlying the identified hub genes contributing to TNBC.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.