Background
Overall survival (OS) and progression-free survival (PFS) are the primary study endpoints in randomized controlled trials (RCTs) testing novel treatments in oncology [
1]. Particularly, the assessment of OS is commonly recognized as the best instrument to measure the efficacy of novel therapeutic strategies, compared to standard available treatments. PFS improvements, although suggestive of clinical efficacy of anticancer treatments, do not necessarily translate into OS advantage [
2‐
4]. Hence, PFS should be considered solely as a surrogate estimate of efficacy. However, despite this awareness among clinicians and researchers, some drugs have received regulatory approval based on improvements in PFS, non-reinforced by positive OS results [
5].
In addition to increasing OS, the aim of new treatments should be to improve the quality of life (QoL) of patients with cancer [
6]. In recent years, the oncology community devoted more attention to the assessment of QoL outcomes in RCTs as well as in routine clinical practice. Indeed, this recognition led to the publication of guidelines by the European Society for Medical Oncology (ESMO) for a comprehensive assessment of patient-reported outcomes (PROs) in clinical practice [
7]. Furthermore, the ESMO Magnitude of Clinical Benefit Scale (ESMO-MBCS), developed to measure thoroughly the weight of clinical benefit of new anticancer therapies, assigns positive scores for treatments that demonstrate improvement in QoL [
8,
9], thus encouraging their use with a stronger recommendation.
In patients with cancer, QoL is certainly influenced by symptoms caused by disease burden, side effects of therapies, and daily life domains, such as emotional, physical, and social functions. However, although the development of new anticancer treatments aims to prolong the survival of patients, such progresses are not always associated with improvements in QoL. Previous studies analyzed data from RCTs in oncology and revealed that, for systemic anticancer treatments, improvements in PFS did not correlate with QoL benefits [
10,
11]. More recently, by investigating a selected list of RCTs in oncology published in 2019, Samuel and colleagues found a positive correlation between improvements in OS and QoL benefits for experimental cancer therapies [
12]. For patients, as well as medical oncologists, understanding the potential benefits of treatments in terms of both survival and QoL is crucial to take a shared decision regarding the most appropriate therapeutic approach. Of note, previous studies revealed that integration of patient-reported symptom monitoring in clinical practice might promote benefits on both global QoL and OS [
13].
Assessment of QoL is particularly relevant in patients with metastatic non-small cell lung cancer (NSCLC), due to symptoms that have a strong negative impact on QoL, such as dyspnea, cough, hemoptysis, shortness of breath, pain, and possible neurological symptoms [
14]. Furthermore, because of disease extent and treatment-related adverse events, in addition to the aforementioned symptoms, these patients frequently experience deterioration of psychological dimension, characterized by anxiety, depression, and malnutrition [
15]. In recent years, the treatment landscape of patients with metastatic NSCLC has been revolutionized by immune checkpoint inhibitors (ICIs) and novel selective inhibitors for oncogene-addicted disease. In addition to better survival outcomes, some of these novel treatments demonstrated improvements in QoL as well. However, it is unclear whether a correlation between QoL outcomes and survival outcomes exists and the extent of such correlation for various drug classes.
Despite the well-known relevance of QoL assessment in patients undergoing treatments for cancer, we and other groups have previously revealed that QoL results are poorly reported in publications of RCTs in oncology, hindering a complete evaluation of the effects of new anticancer drugs [
16‐
19]. Furthermore, some publications of trial results describe QoL data with an inappropriate positive framing, overestimating the real benefits obtained with experimental treatments [
12].
Herein, we report the results from our study aiming to investigate QoL outcomes in publications of phase III RCTs testing novel systemic treatments in patients with metastatic NSCLC, published between 2012 and December 2021. We selected phase III RCTs because they explore new therapies that may receive approval from regulatory agencies. In addition, RCTs should include QoL assessment among study endpoints. We evaluated whether QoL results correlate with PFS and OS outcomes, exploring such correlation in different drug classes. Finally, we analyzed how QoL results were described in these manuscripts and if favorable framing was enriched in profit studies, compared to non-profit trials.
Methods
Data source and search strategy
We analyzed publications of phase III RCTs investigating novel systemic therapies for the treatment of metastatic NSCLC, published between January 2012 and December 31, 2021. The following terms were searched in PubMed: “NSCLC” OR “non-small cell lung cancer” OR “non small cell lung cancer” OR “lung cancer.” The following PubMed filters were applied: “Clinical Trial, Phase III”, “Randomized controlled trials,” and “English Language.” The research string can be found in Additional file
1: Table S1. The PubMed database was interrogated in October 2022. All information about trials was collected from the article or through the
https://www.clinicaltrials.gov website.
Inclusion and exclusion criteria
The titles and abstracts were examined. Only phase III RCTs in metastatic NSCLC that reported QoL results and at least one survival outcome between OS and PFS were considered eligible for further analysis. The following RCTs were excluded from the analysis: (1) trials testing surgery and/or radiotherapy; (2) trials non-including QoL among study endpoints; (3) trials not reporting QoL results in primary or secondary publications; (4) trials not reporting results of at least one survival outcome between PFS and OS; (5) trials evaluating multiple schedules of administration of the same drug; (6) trials of supportive care or behavioral approaches; (7) not phase III RCTs; (8) trials in locally advanced, neo-adjuvant, or adjuvant settings; (9) study protocols; (10) subgroup or post hoc analysis of previously published trials; (11) brief reports or case studies; (12) research of screening methodologies; and (13) pooled analysis of multiple trials. The remaining full texts were downloaded for further analysis.
Data collection
Data were extracted from publications and reported in an electronic database. These included first author, digital object identifier (DOI), name of the trial, journal, date of publication, and class of therapy investigated (chemotherapy, immunotherapy, or target therapy). Data were analyzed by two independent investigators. Discrepancies were resolved by consensus.
The following data were also extracted from trials: histological subtype (squamous cell carcinoma (SCC), non-squamous cell carcinoma (NSCC), “non-specified histology”); if required, mandatory mutations for patients enrolment (“EGFR mutations,” “ALK translocations,” or “KRAS mutations”); funding (“profit,” when the trial was sponsored by a pharma company, or “no profit,” when the trial was designed and conducted by academic institution/s); results (“positive,” when statistically significant advantage in the primary endpoint was reached in experimental arm over control arm, or “negative”, when primary endpoint was not met. For RCTs with PFS and OS as co-primary endpoints, results were considered positive only if both endpoints were met); study design (“superiority” when the aim of the trial was to detect an advantage in experimental drug over standard treatment or “non-inferiority” when the aim was to show a same efficacy between the experimental and the control arm); masking (“blinded,” when neither researchers nor patients were aware of the assigned treatment, or “open-label”); involved countries (“multi,” involving institutions of two or more different countries, or “single” when they belonged to a single country); and primary endpoints (OS, PFS, safety, QoL, and ORR).
Assessment of QoL in RCTs was investigated by examining the methods sections of the articles or study protocols. When protocols were not available as supplementary material of publications, they were searched on the
https://www.clinicaltrials.gov/ website. We interrogated study protocols to find out whether QoL was assessed among endpoints (primary, secondary, exploratory endpoint, or non-analyzed) and the types of tests utilized in the trial to measure QoL.
For trials not reporting QoL results in primary publication, we assessed their disclosure in secondary manuscripts. Secondary articles were searched among the list of articles obtained with our research strategy (Additional file
1: Table S1). Only for one trial, a secondary manuscript with QoL results was published later than December 2021 [
20]. We also searched potential secondary publications in PubMed using the name of the drug and the study’s acronym, but this research did not produce any additional results. Five trials of non-reporting QoL results in primary or secondary publications disclosed QoL data at international conferences and were included in the analysis. For trials reporting QoL results in both primary and secondary QoL-focused manuscripts, QoL data were extracted from the secondary articles, supposed to be more accurate.
For RCTs with non-available or immature OS data at the time of primary publication, we evaluated the final OS results reported in secondary publications.
Data interpretation
Global QoL results were evaluated based on the comparison between the experimental arm and the control arm, as previously reported [
12]. Hence, we did not consider comparisons of “before and after treatment” in each arm. Based on these premises, QoL outcomes were classified as follows: (1) “superior,” when a statistically significant improvement in global health status (GHS)/global QoL results were recorded in the experimental arm, compared to control arm; (2) “inferior,” when in the experimental arm, a statistically significant decline in GHS/global QoL was found, compared to control arm; (3) “no statistically significant difference” in terms of GHS/global QoL between the experimental and control arms. When only for some symptoms or some domains a statistically significant improvement was found, but with no improvement in GHS/global QoL in the trial, QoL outcomes were classified in the “no statistically significant difference” category.
We analyzed abstracts and full texts to assess how authors described and interpreted QoL results. The description was considered “favorable” when (i) authors claimed that global QoL “did not worsen” upon treatment, because a novel therapy should aim to guarantee an improvement in QoL, and not only a reduced worsening; (ii) authors emphasized the effect of treatment on QoL outcomes with superior scores only in some symptoms or functions but without a concomitant improvement in GSH/global QoL; and (iii) authors declared an improvement in QoL results in the same arm, comparing pre- and post-treatment outcomes, without an appropriate comparison between the two arms. The description of QoL data was considered “neutral” when the description and discussion of QoL results were coherent with the data reported in the manuscript or at international conferences. Two investigators independently assessed the description of QoL results in RCTs. Discrepancies were resolved by consensus.
Statistical analysis
Statistical differences between the analyzed groups were calculated using the chi-square test or Fisher’s exact test in the Prism – GraphPad software v.9.
Discussion
In the last years, the prognosis of patients with metastatic NSCLC has certainly improved. Indeed, target therapies for oncogene-addicted NSCLC, as well as immunotherapy in non-oncogene-addicted tumors, have produced a remarkable extension of survival, in comparison with the previously available standard treatments. However, despite this considerable improvement in survival outcomes, less information is available about the effect of novel treatments on QoL. Previous publications revealed that new oncology therapies approved by the Food and Drug Administration (FDA) and European Medicines Agency (EMA) often lack published QoL data [
2,
5]. Furthermore, only a small fraction of FDA- and EMA-approved anticancer drugs demonstrated significant improvements in QoL, compared to the standard of care [
21,
22]. Herein, we investigated the effects of new treatments on QoL, in RCTs testing novel therapies in metastatic NSCLC published in a 10-year time span (2012–2021). Furthermore, we assessed whether QoL results correlated with survival outcomes.
First, we observed a high rate (
n = 40/158, 25.3%) of trials non-including QoL assessment among study endpoints (Fig.
1). In addition, 37/158 (23.4%) RCTs, although declaring assessment of QoL among endpoints in methods and/or in the study protocol, did not report QoL data in primary or secondary publications or at international conferences (Fig.
1). These data are consistent with previous studies revealing an inadequate assessment and reporting of QoL data in RCTs in lung cancer [
17,
18].
We found a positive correlation between QoL results and PFS outcomes (Table
2). Instead, a positive correlation between QoL and OS outcomes was not found (Table
2). Several reasons may explain this difference. First, we observed that 30/60 (50%) trials with positive PFS results did not register concomitant OS improvements with the experimental treatment (Table
2). Discordance between PFS and OS was previously reported in trials of NSCLC, as well as other solid tumors [
3,
23,
24]. The effects of patients’ crossover in RCTs, as well as the different treatments used after the failure of experimental therapy in the trial, may certainly influence the OS results [
25]. However, we think that results from this analysis should generate further confrontation and discussion in communities of clinicians and scientists. Indeed, novel treatments leading to longer OS but without concomitant improvements in QoL should be carefully evaluated by regulatory agencies. This is particularly relevant for treatments that guarantee only modest prolongation of OS.
Interestingly, we found that the correlation between QoL improvement and PFS was statistically significant only for target therapies, particularly for EGFR and ALK inhibitors (Table
3). We believe that the size of this correlation could be even larger than what we observed. Indeed, our analysis included some of the first trials with EGFR inhibitors that enrolled patients regardless of
EGFR mutational status. The correlation between QoL improvements and PFS prolongation is probably due to the well-known efficacy of EGFR and ALK TKIs in NSCLC harboring
EGFR mutations or
ALK rearrangements. In addition, many of these drugs have better tolerability than chemotherapy, thus influencing the results reported in QoL questionnaires by the patients. We believe that these results may help clinicians in predicting response and treatment adherence. Indeed, an early improvement in QoL, which can be easily measured in clinical practice with QoL questionnaires, may be useful to predict response to treatments. Hence, we believe that an accurate assessment of the patients’ symptoms, as well as evaluation of QoL using specific questionnaires, should be routinely performed in clinical practice, in order to identify patients who are benefitting from treatment with target therapies and, more importantly, to recognize signs and symptoms evocative of treatment failure and disease progression. Future studies investigating a correlation between changes in QoL at early time points and response to treatment may further elucidate a predictive role for QoL questionnaires.
In our study, we evaluated the experimental treatments as “superior” in terms of QoL only when they were directly compared to the control arm. Indeed, assuming that novel treatments should guarantee an improved QoL compared to the standard of care, an “intra-arm” comparison between “before” and “after” treatment is not acceptable [
26]. However, trials testing novel treatments in the adjuvant setting, compared to placebo, may require further thinking. Indeed, due to the absence of macroscopic disease-causing symptoms, for novel adjuvant treatments, compared to placebo, promoting improvements in disease-free survival (DFS) and OS, a non-deterioration of QoL, instead of real improvement, could be acceptable.
We observed that several publications reported QoL results in a favorable manner. As also previously reported, there is a correlation between sponsorship by pharma companies and favorable interpretation of QoL results (Table
5) [
12]. Furthermore, we found that a favorable description of QoL results was frequent in trials with positive results (Additional file
4: Table S4). In this context, the various and non-uniform ways of reporting QoL data in manuscripts are certainly one of the causes of such misinterpretation. We found that several publications, in the description of QoL results, only reported details about improvements in lung cancer symptoms, such as cough, pain, and dyspnea, or specific functions. However, the same authors did not provide a concomitant description of GHS/global QoL, even if the main or supplementary tables of the manuscript reported a non-improvement in GHS/global QoL, hindering an appropriate and comprehensive evaluation of QoL results.
Assessment of survival outcomes, such as PFS and OS, relies on well-established methods. Instead, several different tools are used to assess QoL in trials and clinical practice. The absence of a uniformly accepted methodology impedes a homologous and systematic assessment of QoL in clinical trials, as well as a comprehensive evaluation of the effects of novel treatments in NSCLC. We found that EORTC, EuroQoL, LCSS, and FACT were the most used tools among the 81 selected RCTs (Table
1), assessing different aspects of the health-related QoL. Of note, in some cases, these tests were only listed among exploratory endpoints (data not shown), thus lacking a pre-specified hypothesis [
26]. Hence, the results from these analyses should be used with caution, particularly in terms of statistical evaluation.
We acknowledge that our work has some limitations. First, the data regarding the correlation between QoL results and PFS/OS outcomes are influenced by the fact that a large fraction of RCTs (n = 77/158, 48.7%) did not disclose QoL results. Therefore, our analysis was limited to only 81 RCTs, thus influencing a global evaluation in all trials of metastatic NSCLC. Next, even in studies reporting QoL data, during the course of the trial, a variable percentage of patients did not compile QoL tests, further reducing the amount of available data for a correct assessment. Next, it has to be considered that the open-label design of trials may influence the way patients perceive the treatment, ultimately affecting the compilation of QoL questionnaires. Furthermore, trials included in our analysis were heterogeneous, including patients with various NSCLC histologies, different lines (first, second, or later line) of treatment, and varied tumor mutational status. However, the results from our study might stimulate further investigation in the field of QoL in NSCLC as well as other solid cancers.
Declarations
Competing interests
AS1 reports honoraria from Eli Lilly, MSD, and Janssen and travel support from Bristol-Myers Squibb and AstraZeneca. MDM reports the following COIs: AstraZeneca, advisory board, personal, and consultancy about the role of osimertinib as adjuvant treatment of NSCLC; Boehringer Ingelheim, invited speaker, personal, and compensation for a talk and a document about the role of patient-reported outcomes in clinical trials and in clinical practice, with a specific focus on lung cancer; Janssen, advisory board, personal, and advisory board about the role of chemotherapy and hormonal treatment in hormone-sensitive prostate cancer; Merck Serono, advisory board, personal, consultant for the new indication of avelumab in urothelial cancer; Merck Sharp & Dohme, advisory board, personal, participation in advisory boards about olaparib in pancreatic cancer, about olaparib in prostate cancer, and about immunotherapy in lung cancer; Novartis, advisory board, personal, and consultancy about clinical trial methodology and clinical trial results interpretation; Pfizer, advisory board, personal, and consultancy about the results obtained with lorlatinib and dacomitinib in advanced non-small cell lung cancer; Roche, advisory board, personal, and consultancy about immunotherapy in SCLC; Takeda, advisory board, personal, and consultancy about role and interpretation of the patient-reported outcomes and quality of life in clinical trials; Beigene, local PI, onstitutional, and local PI of trial with tislelizumab in hepatocellular carcinoma; Exelixis, local PI, institutional, and local PI of a trial with cabozantinib and atezolizumab in advanced HCC; Merck Sharp & Dohme, local PI, institutional, and local PI of trials with pembrolizumab in hepatocellular carcinoma; Pfizer, local PI, institutional, and local PI of a trial with sasanlimab in NMI bladder cancer; Roche, local PI, institutional, and local PI of a trial with atezolizumab and bevacizumab in advanced HCC; Tesaro—GlaxoSmithKline, research grant, institutional, and financial support and drug supply for the Meet-URO12 trial (niraparib as maintenance treatment of urothelial carcinoma after first-line treatment with platinum-based chemotherapy). RB reports honoraria from BMS, MSD, AstraZeneca, Lilly, Roche, and Novartis.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.