Background
Central in the development of new immunotherapies for patients with acute myeloid leukemia (AML) lies the specific targeting of AML-associated antigens [
1,
2]. A specific interest goes to Wilms’ tumor protein 1 (WT1) which has an acknowledged role as a tumor oncogene in a variety of malignancies, including AML [
2,
3]. As such, WT1 has been identified as a nearly universal tumor-associated antigen (TAA) overexpressed in numerous solid and hematological cancers [
4‐
7], and has been listed as the most interesting cancer antigen for immune therapies [
8]. However, as a self-antigen also expressed in healthy tissues (e.g. cells located within the kidney and reproductive glands [
9]), T-cell clones of high affinity are usually eliminated after negative selection in the thymus. Therefore, the frequency of high-affinity T-cell receptors (TCRs) towards WT1-epitopes in circulating T cells is low. It is hypothesized that WT1-targeted immunotherapies might increase the frequency of anti-AML T cells activity resulting in higher survival rates. Currently, multiple therapies are under investigation for AML, including dendritic cell (DC)-based vaccination [
10‐
12], peptide therapy [
13‐
15], CAR-T cell therapy [
16], TCR-T cell therapy [
17‐
20], and (bispecific) antibody therapy [
21]. Many of these therapies are based on activation of WT1-specific T cells targeting cancer cells overexpressing the WT1 antigen. Hence, the success of these immune therapies relies on recognition of the WT1 antigen by the TCR of these WT1-specific T cells. Despite the large interest in WT1, the WT1-specific TCR repertoire in AML patients has not been fully investigated so far.
TCRs are heterodimeric membrane proteins consisting most often of an alpha and a beta chain (αβ TCRs), each containing three hypervariable domains called complementary determining region 1 (CDR1), CDR2, and CDR3. The CDRs are in contact with the peptide and/or the major histocompatibility complex (MHC) molecule. Especially the CDR3 alpha and beta regions are important in identifying its epitope partner as these largely interact with the epitope surface [
22,
23]. Due to the high diversity of these CDR3 sequences in each TCR repertoire, the immune system is able to recognize a broad spectrum of peptides. This TCR diversity is achieved through TCR gene rearrangement, a process in which discrete segments composing the TCR genes (V and J in the alpha chain, and V, D and J in the beta chain) randomly join by somatic recombination. The introduction of short insertions and deletions at the connections between rearranged genes further increases TCR diversity [
24].
TCR sequencing on blood samples and biopsies facilitates the study of the composition of this repertoire. In this sense, to analyze the TCR repertoire of a sample, both bulk and single-cell sequencing can be used. In general, bulk sequencing is less expensive per T cell and thus allows a higher number of T cells to be sequenced. However, it does not enable the recovery of paired TCR alpha-beta chains, as in single cell sequencing, restricting repertoire analyses to the beta chain. In addition, antigen-specific T cells can be isolated by sorting tetramer-positive T cells specific for certain peptide-MHC complexes [
25] with or without concomitant selection of T cells that upregulate the expression of activation markers such as CD137 [
26,
27]. T-cell sorting is followed by TCR sequencing, aiding the study of antigen-specific TCR repertoires. Due to the increasing availability of public epitope-specific TCR data, new methods were designed to circumvent these laborious experiments by focusing on the computational identification of epitope-specific TCRs. This progress was possible since TCRs with similar CDR3 beta sequences are known to often recognize the same epitope [
28]. Thus, previously identified epitope-specific TCRs can be used to identify common patterns that underlie the recognition of epitopes by TCRs. This is the basis of tools such as TCRex [
29], which use machine learning methods to extract epitope-specific recognition patterns in the TCR CDR3 beta sequence to predict the epitope specificity for new TCRs. In addition, the finding of shared similar patterns in the sequences of epitope-specific TCRs has led to the development of clustering methods such as clusTCR [
30] and ALICE [
31]. These allow the identification of potentially epitope-specific clusters by grouping TCRs based on their CDR3 sequence. Importantly, the binding of a TCR with one epitope does not exclude the binding with other epitopes. T cells can recognize more than one epitope and thus be cross-reactive. Hence, epitope-specific clusters might contain T cells specific for various epitopes.
In this study, we applied recently developed computational techniques to study the TCR repertoire for two human leukocyte antigen (HLA)-A*02:01-restricted WT1-derived peptides: WT1
37 − 45 and WT1
126 − 134, hereafter referred to as WT1-37 and WT1-126, respectively [
32,
33]. These two epitopes have a strong binding activity for HLA-A*02:01, an allele that is highly frequent in different human populations, are immunogenic and can induce T-cell responses. Moreover, compared to other WT1 epitopes, WT1-126 and WT1-37 epitopes elicit more high-avidity T-cell responses [
34]. Therapeutic TCRs for these two epitopes have been described and tested in clinical trials, highlighting the relevance of WT1-126 and WT1-37 epitopes in the context of anti-cancer TCR-T cell therapies. Given all these criteria, the objective was to train machine learning methods for the identification of TCRs recognizing either the WT1-37 or the WT1-126 peptide and to investigate the ability of trained machine learning methods to identify new WT1-specific TCR β sequences in the repertoires of patients with AML. In this paper, the term WT1-specific TCRs is used to refer to both WT1-126 and WT1-37 epitope-specific TCRs. We first generated a data set of specific TCR β sequences isolated from WT1-37 or WT1-126 peptide-stimulated peripheral blood T cells of healthy donors [
33] for establishing an in-house WT1-TCR database (DB) and subsequent training of individual TCRex models [
29]. The usability of the trained computational models was tested by analyzing the TCR repertoires of AML patients from a previously published study [
35].
Discussion
WT1 is an acknowledged immunogenic tumor-associated antigen and, hence, the target of many immunotherapies under development. To improve these antigen-specific immunotherapies, prior studies have sought to interrogate the T cell repertoires of cancer patients [
46‐
48]. Given that tumor-specific T-cell responses are initiated upon recognition of a cancer epitope by TCRs, there is considerable interest in identifying T cells that are specific to these epitopes. Identification of epitope-specific T cells is usually done using multimer assays on peripheral blood or tissue samples of cancer patients. However, this requires separate multimers for every epitope of interest increasing cost and time. The greater availability of bulk TCR repertoire sequencing has sparked interest in using the TCR repertoire as a multiplexed diagnostic assay [
49]. Specificities against a large collection of epitopes can be assessed from the same TCR repertoire using
in silico annotation models predicting epitope-TCR pairings. Such annotation models require training on previously identified epitope-specific TCRs for any target epitope, as unseen epitope prediction remains an unsolved problem [
50]. However, the limited availability of patient samples for rare malignancies complicates the collection of TCR data for different epitopes. To address this issue, here, we propose a workflow to efficiently isolate WT1-specific TCRs from healthy individuals and train WT1-specificity prediction models. A clear difference in the number of unique TCR beta sequences for the two studied epitopes was observed. These numbers can partially be explained by the fact that, after a second IVS, the frequency of WT1-37/HLA-A2 tetramer-positive T cells was approximately 8-times higher compared to WT1-126/HLA-A2 tetramer-positive T cells. Therefore, a higher frequency of tetramer-positive T cells would appear to be linked to an increase in the number of unique TCR sequences. A report from the phase II WIN trial, in which HLA-A2-positive CML and AML patients were vaccinated with two DNA vaccines containing either the WT1-37 or WT1-126 epitope, showed similar results. In this trial, T-cell responses towards the WT1-37 epitope were detected using a tetramer assay in 6 out of 10 of the vaccinated patients and in 2 out of 10 towards the WT1-126 peptide [
51]. These results seem to indicate that the WT1-37 peptide may potentially be more immunogenic than the WT1-126 peptide. In this regard, the Immune Epitope Database (IEDB)’s online tool “T cell class I pMHC immunogenicity predictor” scores the WT1-37 peptide as more immunogenic than the WT1-126 peptide (immunogenicity score: 0.1631 versus 0.04585; peptide positions masked for HLA-A*02:01) [
52]. However, whether peptide immunogenicity correlates with the observed discrepancies in TCR repertoires and number of unique TCR sequences is still a question that needs to be investigated.
The resulting TCR sequences from in vitro expanded T cells from healthy donors showed similarities across donors, which allowed the training and use of WT1-specific models for the identification of WT1-specific TCRs. Previous studies have already demonstrated the possibility of expanding and isolating WT1-specific T cells from healthy individuals [
33,
53‐
55]. These T cells exhibit antitumor activity and thus hold great promise for the identification of TCRs in the context of developing adoptive T-cell immunotherapies [
55]. However, the extent to which this data represented WT1-specific TCRs from cancer patients, which is crucial for monitoring and biomarker identification, remained unknown. It remained unclear whether WT1-specific TCRs derived from healthy donors could be mapped to the repertoires of cancer patients. Therefore, in this study, we trained WT1-specific prediction models from TCR data derived from healthy donors and assessed the performance of the models on AML patients from a previously published independent study [
35]. This allowed tracking of WT1-specific TCRs in AML patients. By using this approach, we identified more WT1-specific TCRs in the repertoires of the AML patients compared to healthy individuals (12 versus 2). A similar result was found when healthy individuals were vaccinated against Yellow Fever Virus (YFV). Here, the vaccinated repertoires contained more unique YFV-specific TCRs than the pre-vaccinated repertoires [
29]. Similar to [
56] and [
57], we believe that the contact with AML cells stimulates the expansion of WT1-specific TCRs. This is further supported by other studies demonstrating a higher level of WT1-specific T cells in leukemic patients in contrast to healthy individuals [
34,
58]. Since no HLA background information was given in the original study, we could not assess if the identified difference is a consequence of a varying HLA background in the AML and healthy volunteers as HLA-A*02:01 is needed to present the studied epitopes. In general, our results demonstrate that the assessment of WT1-TCR repertoire research as a diagnostic tool for AML might be considered [
49]. However, it is important to keep in mind that these results solely rely on in-silico prediction models. No experimental validation was performed to prove the specificity of the identified TCRs for the studied epitopes. In addition, these models give no info regarding the binding affinity. T cells binding one of the WT1 epitopes with low affinity, might not be activated properly and thus fail to initiate tumor destruction [
59]. Therefore, the use of these prediction models facilitates the identification of potential WT1-specific TCR patterns in the TCR repertoires of AML patients, but further research is needed to validate these patterns.
Along with the annotation of tumor-specific T cells on an individual TCR basis, computational methods are being developed to discover groups of TCRs recognizing the same epitope [
30,
31,
60,
61]. These so-called clustering methods were established after the observation that TCRs with similar amino acid sequences frequently show similar epitope specificities [
28,
62,
63]. Big TCR clusters in a repertoire are indicative of a convergent proliferation against a limited set of epitopes and can therefore capture an ongoing or past tumor-specific immune response [
31,
64]. During the analysis of potential WT1-specific TCR clusters, we observed that TCRs from relapsed AML patients were more restricted to a limited number of clusters when compared to AML patients in complete remission. Therefore, it is suggested that displaying more diverse WT1-specific patterns, as seen in patients in complete remission, may be associated with a greater protection against relapse. In this sense, it is already known that TCR repertoire diversity plays a role in treatment response. Various studies have shown a link between TCR diversity and response to immune checkpoint inhibitors [
65‐
67]. Moreover, a study on DC-based vaccination has demonstrated an increase in the diversity of the melanoma neoantigen-specific TCR repertoire following treatment [
68]. In addition, Rezvani et al. demonstrated that the WT1-specific CD8 + T cells of patients with chronic myelogenous leukemia or AML targeted more WT1-epitopes than healthy volunteers [
34]. In that line, our group made an association between the levels of WT1-specific CD8 + T cells targeting different WT1-epitopes and the clinical response of AML patients following a WT1-based dendritic cell vaccine [
11], while Hoffman et al. discovered that patients responding to donor lymphocyte infusion (DLI) targeted more leukemia associated epitopes than those who did not respond [
69]. Taken together, these data underscore the impact of TCR repertoire diversity in targeting different tumor epitopes. In this study, we identified a relation between the diversity of epitope-specific TCR signatures for individual WT1-epitopes, as defined by the presence of TCRs in WT1-specific clusters, and the response to HSCT. This signal is based on the two WT1-epitopes for which we generated data from healthy individuals, and thus represents a fraction of the overall cellular immunity against the WT1 antigen. Further investigation of the full TCR repertoire is warranted, with an extended coverage of WT1-associated epitopes. Hence, the same protocol can be repeated for other WT1-epitopes resulting in additional prediction models expanding the identification of WT1-specific TCRs.
Overall, we demonstrate that the WT1-specific T-cell repertoire of AML patients holds information regarding response to HSCT therapy. This might be explained by the graft-versus-leukemia effect as reported earlier [
70]. Indeed, similar results were found by Rezvani et al. [
57], who demonstrated that in patients with acute lymphoblastic leukemia, a drop in the expression of WT1 in the blood was supported by the identification of WT1-specific CD8 + T cells after transplantation. Importantly, no WT1-specific CD8 + T cells were identified before transplantation and patients lacking WT1 expression did not gain these WT1-specific T cells. Thus, the WT1-specific TCR repertoire has potential as a promising biomarker for response prediction to cancer treatment. Although TCR sequencing might be more expensive than a specific biomarker assay, TCR repertoire analysis could provide invaluable data for the allocation of personalized immunotherapies to patients that would most likely respond to therapy, especially considering the high costs of personalized immunotherapy. In addition to the absolute count of unique WT1-specific TCRs and the diversity of the WT1-specific TCR repertoire, the activity and function of these T cells needs to be compared between the different patient groups. Furthermore, no baseline data was available, thus the presence of the WT1-signal repertoires prior to treatment could not be assessed. This information could provide additional insight into the existence of biomarkers in the baseline repertoires and their evolution over time, potentially aiding in patient stratification [
71,
72]. In conclusion, we have developed a workflow that starts from a robust antigen-specific primary T-cell expansion platform to collect WT1-specific TCRs from the blood of healthy individuals. The collected TCRs were used successfully to train computational models for the identification of WT1-specific TCRs in independent data sets from AML patients. Our workflow revealed differences in the number of unique WT1-specific TCRs and cluster diversity between patients in complete remission and those experiencing relapse. The described platform is extrapolatable to other cancer antigens, enabling the identification of tumor-specific clonotypes within full TCR repertoires. This can aid the research of biomarkers for cancer immunotherapies.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.