Introduction
Chronic obstructive pulmonary disease (COPD) and lung cancer are the two respiratory diseases with the highest mortality rates globally [
1]. There is a significant comorbidity relationship between COPD and lung cancer, with COPD patients having a markedly higher risk of developing lung cancer compared to the general population [
2]. Relevant studies have shown that the relationship between COPD and lung cancer may originate from the chronic inflammatory state caused by COPD itself [
3], as well as the resultant immune dysfunction [
4,
5].
In recent years, Mendelian randomization studies have been widely used to explore causal relationships between complex diseases [
6,
7]. As a genetic matching approach, it can effectively control for confounding from environmental and behavioral factors, providing more reliable causal inference. However, this method has not been extensively adopted in research on the relationship between COPD and lung cancer. Additionally, advances in single-cell sequencing technologies have enabled in-depth profiling of various cell subpopulations [
8]. This technique has seen widespread application in cancer research, revealing the important roles of immune cells in tumor initiation and progression. Nevertheless, there have been fewer studies incorporating immune factors to investigate the link between COPD and lung cancer thus far. Regulatory T cells (Tregs) are thought to be involved in the pathogenesis of both COPD and lung cancer. Tregs can suppress immune responses, and their excessive activation has been associated with development and poor prognosis of COPD and lung cancer [
9,
10]. However, the causal relationship between COPD and lung cancer as well as the mechanistic role of Tregs remains unclear.
In the present study, we utilized Mendelian randomization to explore the causal relationship between these two complex diseases [
11]. Additionally, we leveraged single-cell sequencing technology to analyze infiltration and expression patterns of different cell types at the single-cell resolution [
12]. Our study provides an in-depth investigation into the causal link between COPD and lung cancer as well as the mediating role of Tregs, laying a theoretical foundation concerning the connection between the two respiratory illnesses and immunotherapies.
Methods
Immunity-wide GWAS data
The summary statistics for each immunophenotype can be publicly accessed from the GWAS Catalog (accession numbers from GCST90001391 to GCST90002121, encompassing a total of 731 immunophenotypes, including absolute cell (AC) counts (n = 118), median fluorescence intensity (MFI) reflecting surface antigen levels (n = 389), morphological parameters (MP) (n = 32), and relative cell (RC) counts (n = 192). These features encompass various developmental stages and cell types of immune cells. The original GWAS for immunophenotypes utilized data from 3,757 European individuals, with non-overlapping cohorts. The significance level for instrumental variables (IV) for each immunophenotype was set at 1 × 10^(-5). We pruned these SNPs (linkage disequilibrium [LD] r2 threshold of < 0.1 within a 500 kb distance).
Lung cancer and COPD GWAS data
The lung cancer (n = 492,803) and COPD (n = 468,475) GWAS data were obtained from the GWAS Catalog (accession numbers ebi-a-GCST90018875 and ebi-a-GCST90018807). The significance level for each instrumental variable (IV) was set at 5 × 10^(-8). We pruned these SNPs (linkage disequilibrium [LD] r2 threshold of < 0.001 within a 10,000 kb distance)
scRNA-seq data
The scRNA-seq data of human lung cancer patients were downloaded from the GEO database with the accession number: GSE131907 and GSE173896. We utilized the Seurat R package to identify distinct cell types and examine variations in immune cell infiltration. Cells that met specific criteria, such as having fewer than 200 genes, over 5,000 genes, or more than 20% mitochondrial expression, were excluded from the analysis. Raw counts were normalized using the ‘NormalizeData’ function, and variable genes were identified using the ‘FindVariableGenes’ function. Subsequently, dataset expression values were scaled and centered using the ‘ScaleData’ function to reduce dimensionality. Principal component analysis (PCA) and the uniform manifold approximation and projection (UMAP) methods were employed to visualize the data in lower dimensions, with the first two dimensions chosen for plotting. Cell clustering was performed using the ‘FindClusters’ function, and highly expressed genes within each cell cluster were determined using the ‘FindAllMarkers’ function. Additionally, the ‘FindMarkers’ function was used to identify differentially expressed genes (DEGs) between two cell populations.
Statistical analysis
Our Mendelian randomization (MR) analysis was primarily conducted using the inverse variance-weighted (IVW) method with the R package TwoSampleMR. To account for the potential influence of horizontal pleiotropy, we employed a commonly used method, MR-Egger, which identifies the presence of horizontal pleiotropy if its intercept term is significant. Heterogeneity in the effect size of SNP-specific causal effects during two-sample MR was assessed using Cochran’s Q-test. Finally, we performed a leave-one-out sensitivity analysis to evaluate the impact of individual SNPs on the overall estimates. We defined the presence of heterogeneity when the P-value for Cochran’s Q-statistic was less than 0.05. Although heterogeneity was detected, it did not impact the results of the IVW analysis, and our conclusions remain reliable. We incorporated a Colorful-clouds two-stepMR analysis to validate the mediating effect of regulatory T cells in the association between COPD and lung cancer. The bioinformatics analysis is done through R software and bioinformatics tools.
Discussion
We identified a causal association between the genetic liability to Treg infiltration and lung cancer, as well as a causal association between the genetic liability to COPD and Treg infiltration. Further validation through single-cell transcriptome sequencing analysis and multivariable Mendelian randomization confirmed Treg cells as intermediate factors in the causal association between COPD and lung cancer. Multiple studies have demonstrated a correlation between chronic obstructive pulmonary disease (COPD) and lung cancer, with COPD patients exhibiting a markedly higher risk for developing lung cancer compared to the general population [
13‐
15]. Cigarette smoking is a shared risk factor for both COPD and lung cancer [
16]. Additionally, the chronic inflammatory state induced by COPD itself also increases the likelihood of lung carcinogenesis [
17], through several potential mechanisms: (1) COPD-associated chronic inflammation stimulates tumor emergence; (2) COPD destroys alveolar architecture, enabling easier access of carcinogens into the lungs; (3) COPD impairs systemic immunosurveillance, heightening possibilities for malignant transformation. The close association between the development of COPD and lung cancer has been extensively reported, with numerous studies suggesting that the progression of COPD may contribute to the occurrence of lung cancer. Our Mendelian randomization analysis provided causal evidence supporting this association between COPD and lung cancer.
The initiation and progression of lung cancer is profoundly impacted by the modulation of immune cells within the tumor microenvironment. At different stages of lung carcinogenesis, fluctuations in the abundance and functional status of various immune cell subsets lead to dampened immune surveillance and cytotoxicity against malignant cells, thereby fostering tumor growth and progression. Specifically, decreased tumor-infiltrating CD8 + cytotoxic T lymphocytes in the lung cancer microenvironment fail to effectively contain tumor expansion [
18]. In contrast, increased proportions of tumor-associated macrophages [
19], regulatory T cells [
20], tumor-associated neutrophils [
21], and other cell types assist tumor immune evasion through angiogenesis promotion, immunosuppressive factor secretion, and related mechanisms, conferring survival and proliferative advantages. Additionally, diminished natural killer cell activity weakens cytotoxic responses against lung cancer [
22].
Prior to our study, there was a lack of research predicting the association between immune cells and the occurrence of lung cancer from a genetic perspective. Through Mendelian randomization analysis, we identified 51 immune cell phenotypes with causal associations with lung cancer. CD4 + and CD8 + T cell subsets play vital roles in the pathological process of COPD. The proportions of CD8 + T cells are elevated in the peripheral blood and airway inflammation of COPD patients, positively correlating with the severity of airway limitation and disease progression [
23,
24]. CD8 + T cells can directly kill alveolar and airway epithelial cells, releasing proinflammatory cytokines like TNF-α and resulting in airway and lung tissue damage [
25]. In contrast, CD4 + T lymphocytes mainly participate in COPD pathology by secreting cytokines including IL-17 and IFN-γ [
26]. In summary, the major pathogenic effect of CD8 + T cells is direct cytotoxicity against structural cells, whereas CD4 + T lymphocytes contribute to COPD development via release of inflammatory mediators.
Our findings demonstrated an increase in CD4 + T lymphocytes along with a decrease in CD8 + T lymphocytes among patients with chronic obstructive pulmonary disease (COPD). This is consistent with existing literature reports, such as the study by Barcelo et al. showing elevated peripheral blood CD4 + T cell percentages combined with reduced CD8 + T cell proportions in COPD patients [
9].We observed a causal association between COPD and both CD4 + and CD8 + T cells. Subsequent validation through single-cell sequencing analysis confirmed that the development of COPD is accompanied by an increase in CD4 + T cell infiltration and a decrease in CD8 + T cell infiltration. Multiple previous studies have demonstrated that regulatory T cells (Tregs) play significant roles in the pathogenesis of both COPD and lung cancer. The quantity of Tregs is elevated in the peripheral blood and bronchoalveolar lavage fluid (BALF) of COPD patients, positively correlating with the degree of pulmonary function impairment [
27,
28]. Tregs can inhibit effector T lymphocyte activity and mitigate inflammatory responses in lung tissue, but excessive long-term activation may also lead to increased risks of bacterial and viral infections [
29]. In lung cancer, higher infiltrative proportions of Tregs in tumor tissue suppress anti-tumor immune responses and associate with poor prognosis [
20]. Within the subtypes of CD4 + T cells, we found a significant increase in Treg cell infiltration in both COPD and lung cancer, with Treg cells acting as intermediate factors mediating the causal association between COPD and lung cancer.
Several limitations exist in our study. Firstly, the GWAS data were derived from European databases, representing only the European population and not encompassing all ethnicities. Secondly, our single-cell data were based on a small sample size of 15 patients, and further validation with a larger dataset is warranted.
In conclusion, our analysis demonstrates that Treg cells act as intermediate factors in mediating the causal association between COPD and lung cancer. This provides important insights for the prevention of lung cancer in COPD patients in the future.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit
http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (
http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.