Abstract
Therapy-resistant leukemia stem and progenitor cells (LSC) are a main cause of acute myeloid leukemia (AML) relapse. LSC-targeting therapies may thus improve outcome of patients with AML. Here we demonstrate that LSCs present HLA-restricted antigens that induce T-cell responses allowing for immune surveillance of AML. Using a mass spectrometry–based immunopeptidomics approach, we characterized the antigenic landscape of patient LSCs and identified AML- and AML/LSC-associated HLA-presented antigens absent from normal tissues comprising nonmutated peptides, cryptic neoepitopes, and neoepitopes of common AML driver mutations of NPM1 and IDH2. Functional relevance of shared AML/LSC antigens is illustrated by the presence of their cognate memory T cells in patients. Antigen-specific T-cell recognition and HLA class II immunopeptidome diversity correlated with clinical outcome. Together, these antigens shared among AML and LSCs represent prime targets for T cell–based therapies with the potential to eliminate residual LSCs in patients with AML.
The elimination of therapy-resistant leukemia stem and progenitor cells (LSC) remains a major challenge in the treatment of AML. This study identifies and functionally validates LSC-associated HLA class I and HLA class II–presented antigens, paving the way to the development of LSC-directed T cell–based immunotherapeutic approaches for patients with AML.
See related commentary by Ritz, p. 430.
This article is featured in Selected Articles from This Issue, p. 419
INTRODUCTION
A major challenge in treatment of acute myeloid leukemia (AML) is the elimination of leukemia stem and progenitor cells (LSC). LSCs represent therapy-resistant cells that persist after treatment and are considered a main cause of relapse (1, 2). Relapse occurs in a large proportion of patients despite recent treatment advances and initially high remission rates, resulting in the still very high mortality of AML (3).
Evidence that LSCs can be targeted and eliminated by the immune system arises from graft-versus-leukemia effects after allogeneic stem cell transplantation (4). However, immunotherapeutic approaches such as immune checkpoint inhibition, bispecific antibodies, and chimeric antigen receptor T cells that revolutionized treatment of other malignant diseases in recent years (5–8) have shown only limited success in AML so far (9). One reason might be that these therapies are directed against bulk AML cells and do not specifically target LSCs. Thus, the identification of novel LSC-specific immune targets is of paramount importance to develop therapies with the potential to eradicate those cells. The target structures for such T cell–mediated immune responses are tumor-associated antigenic peptides that are presented on the surface of cancer cells by human leukocyte antigen (HLA) molecules. Hematopoietic stem and progenitor cells (HSPC), in health and disease, constitutively present antigens not only on HLA class I but also via HLA class II molecules allowing for direct interaction with antigen-specific CD4+ T cells, suggesting an immune surveillance mechanism that may effectively suppress leukemia onset upon LSC transformation (10). Promiscuous HLA class II–presented peptides, which bind to multiple HLA alleles (11, 12), might represent universally applicable targets for T cell–based immunotherapies to eliminate LSCs in AML. However, a systematic characterization of the immunopeptidome, that is, the entirety of naturally presented HLA-restricted peptides, and a definition of the specific antigens that allow for LSC-directed immune surveillance are lacking.
Mass spectrometry–based analyses of the immunopeptidome from different tumor entities enabled the identification of different groups of naturally presented tumor antigens: (i) neoepitopes from tumor-specific mutations (13, 14), (ii) cryptic neoepitopes originating from noncoding regions such as 5′ and 3′ untranslated region (UTR), noncoding RNAs (ncRNA), intronic and intergenic regions, or shifted reading frames in annotated protein coding regions (off-frame; refs. 15–20) and (iii) nonmutated tumor-associated antigens arising through differential gene expression or protein processing in tumor cells (21–25).
Here, we characterized the antigenic landscape of LSCs and AML bulk cells using mass spectrometry–based immunopeptidomics. We identified AML/LSC-associated (defined as shared among AML bulk cells and LSCs) HLA-restricted peptides that mediate immune surveillance in AML and constitute broadly applicable antigens for T cell–based immunotherapeutic approaches to specifically target LSCs in patients with AML.
RESULTS
Mass Spectrometry–Based Immunopeptidomics Uncovers the Antigenic Landscape of Primary LSCs
To investigate the immunopeptidomic landscape of primary LSCs (phenotypically defined as CD34+CD38− cells throughout this study), we isolated them from peripheral blood mononuclear cells (PBMC) of patients with AML. We screened samples of 26 patients with AML for the presence of CD34+CD38− LSCs (26), which revealed a median LSC frequency of 0.2% (Supplementary Table S1; Supplementary Fig. S1), and selected 11 samples for sorting. LSC enrichment resulted in a median frequency of 92.1% CD34+CD38− cells postsorting (Fig. 1A and B; Supplementary Table S1). Stemness features of the sorted LSCs were validated by transplantation assays in NOD/SCID/IL2Rγnull (NSG) mice as reflected by in vivo leukemic engraftment of human CD33+ and CD33+CD117+ cells in the bone marrow, peripheral blood, spleen, and liver (Fig. 1C).
As T cell–based immunotherapy requires sufficient HLA expression on target cells, we quantified HLA surface expression of CD34+CD38− LSCs and CD34+CD38+ AML cells as well as CD34+ HSPCs using PBMCs from patients with AML and healthy volunteers (HV), respectively. HLA surface expression on LSCs was comparable with CD34+CD38+ AML cells and CD34+ HSPCs for HLA class I and slightly decreased for HLA-DR (Fig. 1D and E).
Mass spectrometry–based analysis of the naturally presented HLA-restricted peptides, the so-called immunopeptidome, of LSC and corresponding bulk AML samples revealed a total of 16,342 and 32,961 unique HLA class I (comprising HLA-A, -B, and -C) ligands, respectively (Fig. 1F). 16,638 and 25,128 different HLA class II- (comprising HLA-DR, -DP, and -DQ) presented peptides were identified for LSCs and corresponding bulk AML samples, respectively (Fig. 1G).
HLA class I- and HLA class II-presented peptides of LSCs and corresponding AML bulk cells showed comparable specificities in terms of amino acid compositions and peptide length distributions (Fig. 1H and I; Supplementary Fig. S2A and S2B). The amino acid composition of the LSC- and AML bulk cell-presented peptides, that is, the abundance of certain amino acids, revealed no differences (Fig. 1H), even with respect to specific positions within the peptides (Supplementary Fig. S2A and S2B). The peptide length distribution is also comparable between the cell types, with the majority of HLA class I- and HLA class II–restricted peptides being 9 and 15 to 16 amino acids long, respectively (Fig. 1I). LSC- and corresponding AML bulk cell–derived immunopeptidomes revealed an overlap of HLA-presented peptides with 39.4% and 35.1% shared HLA class I and HLA class II peptides, respectively. 6.8% and 18.7% of the total identified HLA class I- and HLA class II-restricted peptides, respectively, showed exclusive presentation on LSCs (Fig. 1J).
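The overlap figures above amount to simple set arithmetic on peptide sequences. The following minimal Python sketch illustrates the calculation under the assumption that percentages are expressed relative to the union of both immunopeptidomes ("the total identified" peptides); the function name and the toy sequences are hypothetical and not taken from the study.

```python
def overlap_summary(lsc_peptides, bulk_peptides):
    """Percentages of shared and exclusive peptides relative to the union.

    Assumption: the denominator is the union of both peptide sets.
    """
    lsc, bulk = set(lsc_peptides), set(bulk_peptides)
    total = len(lsc | bulk)
    if total == 0:
        raise ValueError("both peptide sets are empty")
    return {
        "shared_pct": 100.0 * len(lsc & bulk) / total,
        "lsc_exclusive_pct": 100.0 * len(lsc - bulk) / total,
        "bulk_exclusive_pct": 100.0 * len(bulk - lsc) / total,
    }


# Toy sequences for illustration only (not study data)
summary = overlap_summary({"AAA", "BBB", "CCC"}, {"BBB", "CCC", "DDD", "EEE"})
print(summary)
```

Working on exact sequence identity keeps the comparison transparent; length variants of the same core sequence would count as distinct peptides in this sketch.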
Together, these data demonstrate that LSCs present HLA class I- and HLA class II–restricted antigens comparable but not identical to AML bulk cells, highlighting the importance of integrating LSC-specific targets when selecting AML-associated antigens for immunotherapeutic approaches.
Comparative Immunopeptidome Profiling Identifies AML/LSC-Associated HLA Class I–Restricted Peptides
To explore antigens shared between LSCs (sorted CD34+CD38−) and bulk AML cells (unsorted PBMCs) that might be suitable for the dual immunotherapeutic targeting of AML bulk cells and LSCs (termed AML/LSC antigens), we comprehensively mapped the HLA class I immunopeptidome of 47 primary AML samples (Supplementary Tables S2 and S3) including the above-described samples sorted for LSCs. We identified a total of 72,042 unique HLA class I ligands that are derived from 10,609 source proteins (Supplementary Table S3). Thereby, 97% coverage of the estimated maximal attainable number of distinct source proteins was obtained (Fig. 2A). The AML dataset comprised 48 different HLA class I allotypes. 99.9% of the world population carry at least one of these allotypes (Fig. 2B). Curated AML/LSC immunopeptidomes were compared with a benign immunopeptidome dataset (25, 27), which comprises among others PBMCs, CD34+-enriched HSPCs, and various solid organ tissues and contains 72,129 unique HLA class I ligands (Fig. 2C), to identify AML/LSC-associated antigens absent from benign tissues. The distribution of HLA allotypes in the AML cohort was comparable with the benign immunopeptidome cohort (Supplementary Fig. S3A). For the identification of broadly applicable AML-associated antigens, we aimed for the selection of target antigens that not only fulfill the criterion of AML exclusivity, but also exhibit a high prevalence within the AML cohort and are presented by common HLA class I allotypes. Therefore, allotype-specific comparative profiling of AML and benign immunopeptidome datasets was performed for the common allotypes HLA-A*01 (30% frequency in AML cohort), -A*02 (49%), -B*07 (26%), -B*08 (23%), and -C*07 (55%, Supplementary Fig. S3A), which achieve 71% coverage within the world population (28). This revealed, respectively, 48, 28, 185, 161, and 11 AML-exclusive antigens with high allotype-specific frequencies ranging between 20% and 58% within the AML cohort (Fig. 2D; Supplementary Fig. S3B and S3C). 99.5% (433/435) of AML antigens were not presented on PBMC samples obtained from patients with AML in molecular remission (Supplementary Table S3), thereby further confirming AML specificity (Supplementary Table S4). AML-associated peptides showed different intensity ranks in the patients’ specific immunopeptidomes, with most ranking within the second and third quartiles of the intensity distribution (Supplementary Fig. S4).
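The logic of allotype-specific comparative profiling can be sketched in a few lines of Python: keep peptides that never occur in the benign dataset and rank them by their frequency among AML samples carrying the allotype of interest. This is a simplified illustration, not the pipeline used in the study; in particular, the assignment of each peptide to its presenting allotype (typically done by binding prediction) is omitted, and the function name, data layout, and frequency cutoff are assumptions.

```python
def aml_exclusive_targets(aml_samples, benign_peptides, allotype, min_frequency=0.2):
    """Return {peptide: frequency} for AML-exclusive peptides.

    aml_samples: list of (allotypes, peptides) tuples, one per AML sample.
    Frequency is computed among samples carrying `allotype` only.
    The cutoff `min_frequency` is a hypothetical parameter.
    """
    benign = set(benign_peptides)
    matched = [set(peps) for allotypes, peps in aml_samples if allotype in allotypes]
    if not matched:
        return {}
    counts = {}
    for peps in matched:
        # Peptides detected on any benign sample are discarded up front.
        for pep in peps - benign:
            counts[pep] = counts.get(pep, 0) + 1
    frequencies = {pep: n / len(matched) for pep, n in counts.items()}
    return {pep: f for pep, f in frequencies.items() if f >= min_frequency}


# Toy data for illustration only (placeholder peptide names)
aml = [
    ({"A*02", "B*07"}, {"PEP1", "PEP2", "SELF"}),
    ({"A*02"}, {"PEP1", "SELF"}),
    ({"B*08"}, {"PEP3"}),
]
targets = aml_exclusive_targets(aml, {"SELF"}, allotype="A*02", min_frequency=0.6)
print(targets)
```

Restricting the denominator to allotype-matched samples is what makes the resulting frequencies "allotype-specific" rather than cohort-wide.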
By comparing the AML bulk and benign datasets to the immunopeptidomics data of the LSC-sorted samples, we identified 2,322 AML/LSC shared antigens, absent from all benign tissues including CD34+-enriched HSPCs (Fig. 2E; Supplementary Table S5). Within these AML/LSC-exclusive antigens, 181 peptides represented high-frequency AML-associated targets identified for the common HLA allotypes HLA-A*01 (30/48, 62.5% of A*01 targets are shared on LSCs), A*02 (4/28, 14.3%), B*07 (95/185, 51.4%), B*08 (44/161, 27.3%), and C*07 (8/11, 72.7%, Fig. 2F; Supplementary Table S4). In total, 41.8% (181/433) of the high-frequency AML-exclusive antigen targets were shared by LSCs (Fig. 2F; Supplementary Table S4) and thus constitute promising targets for the dual T cell–based targeting of AML bulk cells and LSCs.
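The per-allotype proportions and the overall proportion of LSC-shared targets follow directly from the counts stated above. The short Python check below re-derives them; only the counts are taken from the text, and the variable names are illustrative.

```python
# (shared on LSCs, AML-exclusive targets) per allotype, as stated in the text
shared_on_lsc = {
    "A*01": (30, 48),
    "A*02": (4, 28),
    "B*07": (95, 185),
    "B*08": (44, 161),
    "C*07": (8, 11),
}

for allotype, (shared, total) in shared_on_lsc.items():
    print(f"{allotype}: {shared}/{total} = {100 * shared / total:.1f}%")

total_shared = sum(shared for shared, _ in shared_on_lsc.values())
total_targets = sum(total for _, total in shared_on_lsc.values())
overall_pct = round(100 * total_shared / total_targets, 1)
print(f"overall: {total_shared}/{total_targets} = {overall_pct}%")
```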
Identification of AML/LSC-Associated HLA Class II–Restricted Antigens
Because CD4+ T cells also play important direct and indirect roles in anticancer immunity, we aimed to identify HLA class II–presented AML and AML/LSC antigens. A total of 61,205 unique HLA class II peptides originating from 5,922 source proteins and obtaining 85% of the estimated maximum attainable source protein coverage were identified by mass spectrometry–based immunopeptidomics (Supplementary Tables S2 and S3; Fig. 3A). Utilizing a previously established immunopeptidome profiling platform (25), we delineated three groups of AML-associated antigens: (i) peptide targets, (ii) protein targets, and (iii) hotspot targets (Supplementary Table S6). Overlap analysis and comparative profiling with a benign dataset on peptide level revealed 10,931 AML-exclusive HLA class II–restricted peptides (Fig. 3B) of which 5 were found in at least 15% of samples without detection of any length variants on benign samples (Fig. 3C; Supplementary Fig. S5A; Supplementary Table S6). HLA peptide source protein profiling revealed 311 AML-exclusive proteins, of which CCL23 and RRS1 show frequent (17% and 15% of AML samples) and significant AML-associated presentation with 7 and 2 proteotypic peptides, that is, uniquely derived from the respective proteins, respectively (Supplementary Fig. S5B–S5D; Supplementary Table S6). Hotspots of antigen presentation reflect distinct regions of proteins, which are more prone to produce HLA peptides. These hotspots depend on proteasomal cleavage, peptide processing, and HLA-binding rules (29). In particular for HLA class II–presented antigens, which are characterized by the presentation of different length variants in distinct patients, hotspot analysis represents a suitable tool for antigen identification. We here identified 5 AML-associated hotspots by peptide clustering, based on mapping identified peptides to their positions within the source protein, with representation frequencies of at least 15% within KIT, FLT3, AP2B1, HPRT, and IL1AP (Fig. 3D; Supplementary Fig. S5E; Supplementary Table S6).
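Hotspot analysis as described here rests on mapping peptides to their positions in the source protein and asking which residues are covered in a sufficient fraction of samples. The Python sketch below illustrates that idea with exact substring matching and a per-residue frequency cutoff; it is a minimal sketch under these assumptions rather than the clustering procedure of the study, and the function name and the toy protein sequence are hypothetical.

```python
def hotspot_regions(protein_seq, peptides_by_sample, min_frequency=0.15):
    """Return 1-based inclusive (start, end) regions covered in at least
    `min_frequency` of samples. Each sample counts once per residue,
    regardless of how many length variants cover that residue."""
    n_samples = len(peptides_by_sample)
    if n_samples == 0:
        return []
    coverage = [0] * len(protein_seq)
    for peptides in peptides_by_sample:
        covered = set()
        for pep in peptides:
            start = protein_seq.find(pep)
            while start != -1:  # handle repeated occurrences of a peptide
                covered.update(range(start, start + len(pep)))
                start = protein_seq.find(pep, start + 1)
        for pos in covered:
            coverage[pos] += 1
    regions, region_start = [], None
    for pos, count in enumerate(coverage):
        hot = count / n_samples >= min_frequency
        if hot and region_start is None:
            region_start = pos
        elif not hot and region_start is not None:
            regions.append((region_start + 1, pos))
            region_start = None
    if region_start is not None:
        regions.append((region_start + 1, len(protein_seq)))
    return regions


# Toy protein and peptides for illustration only (not study data)
protein = "MKTAYIAKQRQISFVKSHFSRQ"
samples = [["AKQRQISF"], ["KQRQISFVK"], ["MKTA"], []]
print(hotspot_regions(protein, samples, min_frequency=0.5))
```

Counting each sample at most once per residue is what lets different length variants from distinct patients reinforce the same hotspot.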
AML-associated peptide, protein, and hotspot targets were screened for their presentation on LSC samples (Fig. 3E and F; Supplementary Table S7), which identified 66.7% (8/12) as AML/LSC shared antigens (Fig. 3G; Supplementary Table S6) and thus prime targets for dual T cell–based targeting of LSCs and AML bulk cells. Of note, one high-frequency LSC-exclusive peptide (on 27% of LSC samples) as well as 4 LSC-associated peptides and 3 LSC-associated protein targets (on 27% of LSC samples with dual presentation on AML bulk cells) were identified in the HLA class II immunopeptidome data, which constitute further highly interesting candidates for LSC immune targeting (Fig. 3H; Supplementary Fig. S5F; Supplementary Table S6). The abundance of AML- and AML/LSC-associated antigens differs in the patient-individual immunopeptidomes with the majority ranking in the second and third quartiles of the intensity distribution (Supplementary Fig. S6).
Mutation-Derived and Cryptic Neoantigens Are Represented in the AML Immunopeptidome
The HLA class I and HLA class II AML/LSC immunopeptidome datasets were further screened for naturally presented, mutation-derived neoepitopes from common AML-specific mutations (195 mutations within 23 genes representing 107 different mutation sites; Supplementary Table S8). Two naturally presented NPM1 mutation–derived HLA-A*11- and HLA-A*03- (NPMmut_A*11 AVEEVSLRK, NPMmut_A*03 LAVEEVSLR) and one IDH2 R140Q mutation–derived HLA class II- (IDH2mut_II KLKKMWKSPNGTIQNILGGTVF) restricted neoepitope(s) were identified on AML bulk cells, but not on LSCs, respectively. The three neoepitopes were identified in 100% (2/2), 33% (1/3), and 100% (1/1) of HLA-matched samples with the appropriate mutation profile, respectively. Mass spectrometric identification was validated by comparative measurement of synthetic peptides and verification of the respective mutations for each patient (Fig. 3I; Supplementary Figs. S7 and S8). In total, 96% (22/23) of mutation-bearing proteins are represented by HLA peptides that are identified in AML and/or benign immunopeptidomes. In contrast, only 19% (20/107) of the specific mutation sites within these proteins were directly covered by wild-type peptides, as most recurrent AML-specific mutations are located in “dark spots” of the immunopeptidome, defined as protein regions without any detectable HLA-presented peptides (Fig. 3D). Besides classical neoepitopes derived from AML-specific mutations, 623 cryptic AML-associated HLA class I peptides derived from noncanonical gene products were identified in the AML immunopeptidomes using the established algorithm Peptide-PRISM developed for the detection of cryptic peptides (30). AML-associated cryptic peptides were mainly derived from 5′ UTR and off-frame regions (Fig. 4A). 109 of these peptides were identified on LSC samples with 26 LSC-exclusive peptides (Fig. 4B) mainly originating from off-frame regions (Fig. 4C and D). 
High-frequency AML/LSC-associated cryptic neoepitopes were validated using isotope-labeled synthetic peptides (Fig. 4E).
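Whether a recurrent mutation site lies in a "dark spot" can be checked by asking if any identified wild-type peptide spans that position. The following Python sketch shows one way to do this with exact substring matching; the function name, the toy protein sequence, and the example sites are hypothetical and serve only to illustrate the coverage calculation.

```python
def covered_sites(protein_seq, peptides, sites):
    """Map each 1-based site to True if at least one peptide spans it."""
    spans = []
    for pep in peptides:
        start = protein_seq.find(pep)
        while start != -1:
            # Store 1-based inclusive start and end of the mapped peptide.
            spans.append((start + 1, start + len(pep)))
            start = protein_seq.find(pep, start + 1)
    return {site: any(lo <= site <= hi for lo, hi in spans) for site in sites}


# Toy protein, peptide, and sites for illustration only (not study data)
protein = "MKTAYIAKQRQISFVKSHFSRQ"
coverage = covered_sites(protein, ["AKQRQISF"], sites=[10, 20])
fraction_covered = sum(coverage.values()) / len(coverage)
print(coverage, fraction_covered)
```

Sites that map to False across all AML and benign samples would correspond to dark spots in the sense used above.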
Preexisting and De Novo Inducible AML/LSC Antigen-Specific T-cell Responses in Patients with AML and HVs
To evaluate whether the immunopeptidome-defined antigens comprising neoepitopes, cryptic peptides and AML/LSC-associated antigens, mediate T-cell effector functions in vitro and in vivo, two cohorts of patients with AML and HVs (Supplementary Tables S9 and S10) were investigated for antigen-specific T cells and corresponding immune responses (Table 1; for spectra validation see Supplementary Figs. S7 and S8). Artificial antigen-presenting cell (aAPC)-based in vitro priming experiments were used as an in vitro model for the feasibility of vaccination-induced de novo T-cell priming of naïve T cells and ELISpot assays were utilized for the detection of preexisting memory T-cell responses demonstrating an in vivo peptide-specific T-cell activation and active antitumor immune response. Using in vitro priming of naïve CD8+ T cells of HLA-matched HVs, de novo induction, and effective expansion of antigen-specific CD8+ T cells was observed for 14 of 15 HLA class I peptides including neoepitopes and cryptic peptides in 66% to 100% of analyzed HV samples (Fig. 5A; Table 1; Supplementary Fig. S9A). De novo induction of antigen-specific T cells was also achieved using samples of patients with AML without preexisting immune responses (Fig. 5B; Supplementary Fig. S9A). Antigen-specific T cells showed multifunctionality with IFNγ and TNF cytokine production as well as upregulation of the degranulation marker CD107a upon peptide stimulation (Fig. 5C; Supplementary Fig. S9B). NPMmut_A*11-directed CD8+ T cells specifically lysed NPMmut_A*11-loaded autologous cells in vitro with up to 82% target cell lysis compared with unspecific effector cells at various effector-to-target ratios (Fig. 5D).
IFNγ ELISpot assays after peptide-specific 12-day in vitro expansion revealed preexisting HLA class I peptide–specific memory T-cell responses in up to 30% and 8% of HLA-matched AML patient and HV samples, respectively, targeting 26.3% (5/19) of HLA class I–restricted peptides (Fig. 5E; Table 1), with immune responses mainly mediated by CD8+ T cells (Fig. 5F; Supplementary Fig. S10). No preexisting memory T-cell responses targeting cryptic peptides (UTR5_CHRFAM7A_A*02, Off-frame_TSPAN2_B*07) were observed in patients with AML or HVs, whereas neoepitope-specific memory responses were frequently detectable in AML patient samples (up to 30%, Table 1). Moreover, strong preexisting CD4+ T cell–mediated immune responses targeting HLA class II–restricted peptides (86.7%; 13/15) were observed using IFNγ ELISpot assays after 12-day in vitro expansion in up to 15% and 33% of patients with AML and HVs, respectively (Fig. 6A–C; Supplementary Fig. S10, Table 1). The FLT3-derived peptide FLT3_II (SPGPFPFIQDNISFYA) elicited an additional CD8+ T cell–mediated immune response (Supplementary Fig. S11A and S11B). In silico prediction with the respective patient's HLA allotype revealed four potential HLA class I–restricted peptides embedded in the long HLA class II sequence (Supplementary Fig. S11C).
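A search for HLA class I peptides embedded in a longer HLA class II sequence starts by enumerating all shorter substrings, which are then scored by a binding predictor for the patient's allotypes. The Python sketch below covers only the enumeration step for the FLT3-derived sequence given above; the length range of 8 to 11 residues is an assumption, the function name is hypothetical, and the prediction step itself is not reproduced.

```python
def embedded_kmers(long_peptide, lengths=(8, 9, 10, 11)):
    """Return (1-based start, substring) for every substring of the given lengths."""
    candidates = []
    for k in lengths:
        for i in range(len(long_peptide) - k + 1):
            candidates.append((i + 1, long_peptide[i:i + k]))
    return candidates


# Sequence taken from the text; the 8-11 residue range is an assumption
candidates = embedded_kmers("SPGPFPFIQDNISFYA")
ninemers = [pep for _, pep in candidates if len(pep) == 9]
print(len(candidates), ninemers[0])
```

Each candidate would subsequently be passed to an HLA class I binding predictor of choice, and only predicted binders would be considered as potential embedded epitopes.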
Overall, 20% (15/76) of patients with AML and 16% (14/89) of HVs showed preexisting immune responses targeting the AML and AML/LSC antigens (Fig. 6D).
HLA Class II AML/LSC-Specific T-cell Responses Are Mediated by Clonally Expanded Th1 Memory CD4+ T Cells
We further characterized T-cell responses to HLA class II AML/LSC peptides by single-cell RNA sequencing (scRNA-seq) in combination with T-cell receptor (TCR) V(D)J sequencing (TCR-seq) analysis of peptide-stimulated and IFNγ secretion-based sorted CD4+ T cells. IFNγ+ antigen-specific CD4+ T cells showed an activated and cytotoxic effector memory phenotype with expression of Th1-specific TNF and TBX21 (encoding T-bet), the cytotoxicity markers GZMB (encoding granzyme B), and PRF1 (encoding Perforin-1), paired with an absence of the naïve or central memory markers SELL (encoding CD62L) and CCR7, the exhaustion marker TOX, and the Th2 cytokines IL4 and IL13 (Fig. 6E and F; Supplementary Fig. S12A and S12B). Single-cell TCR sequencing showed a high clonality of the IFNγ+ antigen-specific activated and cytotoxic CD4+ T cells (Fig. 6G; Supplementary Fig. S12C). A comparable phenotype of AML/LSC peptide-stimulated CD4+ T cells was shown in an ex vivo performed multicolor flow cytometry assay with a Th1-directed CD45RO+CD62L− effector memory T-cell population (Supplementary Fig. S12D and S12E).
Presentation and Immune Recognition of AML and AML/LSC Antigens Associates with Improved Survival of Patients with AML
Finally, the association of immunopeptidome-defined antigen presentation and corresponding peptide-specific immune recognition with clinical characteristics, disease control, and outcome of patients with AML was investigated. Presentation of AML-exclusive antigens, in terms of uniquely presented peptides per AML sample, that is, the diversity of the immunopeptidome, did not differ significantly according to demographics and AML disease characteristics including age, sex, ELN risk classification, karyotype, FLT3-ITD, and NPM1 mutation (AML immunopeptidome cohort; Supplementary Table S11), nor did these variables themselves affect patient survival (Supplementary Table S12). Whereas the diversity of the HLA class I AML-exclusive immunopeptidome did not show an impact on event-free (EFS) and overall (OS) survival in a retrospective analysis of the AML immunopeptidome cohort (Supplementary Fig. S13A; Supplementary Table S13), diverse presentation of HLA class II–restricted AML-exclusive antigens was associated with significantly longer OS (Fig. 6H; Supplementary Fig. S13B), suggesting a central role of these antigens for immune control in AML. This is further underscored by a correlation of preexisting memory T-cell responses with patient survival revealing improved EFS in a retrospective analysis for patients with AML showing spontaneous memory CD4+ T-cell responses assessed by IFNγ ELISpot assays after 12-day in vitro expansion targeting the HLA class II–restricted AML/LSC-associated antigens as compared with patients without AML/LSC-specific immune responses (Fig. 6I; Supplementary Table S14). Demographics such as sex and age did not affect patient survival (Supplementary Table S15). Within the patient group, OS was better in patients who underwent hematopoietic stem cell transplantation (Supplementary Table S15). 
A trend for improved EFS could also be observed in a correlation of preexisting memory T-cell responses with patient survival for AML patients after allogeneic stem cell transplantation (Supplementary Table S14) showing HLA class II–restricted AML/LSC-associated antigen-specific T-cell responses in ex vivo IFNγ ELISpot assays (Supplementary Fig. S13C).
DISCUSSION
In this study, we demonstrate that primary AML progenitor cells present HLA-restricted cancer antigens that induce T-cell responses and presumably mediate immune surveillance in human AML. These antigens comprise cancer-associated cryptic neoepitopes as well as unmutated self-antigens. Of note, neoantigens derived from cancer-specific mutations, described as main specificities of T-cell responses induced by immune checkpoint inhibition in solid tumors with high mutational burden (31, 32), were only identified for single AML-specific mutations, limited to specific HLA allotypes and only identified on AML bulk cells but not specifically on LSCs. The sensitivity of shotgun mass spectrometric discovery approaches, even in the context of immense technical improvements in the last decades, is limited as the immunopeptidome is a highly dynamic, rich, and complex assembly of peptides. Therefore, low-level presentation of these neoepitopes on LSCs cannot be excluded. The infrequent detection of neoepitopes on AML bulk cells is in line with several reports that show a distorted correlation of mRNA expression and limited or even lacking immunopeptidome presentation for mutated and unmutated tumor antigens (13, 33–35). This further highlights the immunopeptidome as an independent, complex layer formed by the antigen presentation machinery that does not necessarily mirror the transcriptome or proteome, calling for direct and unbiased methods of HLA-restricted antigen identification as realized by our mass spectrometry–based immunopeptidomics approach. However, it is a well-known issue that immunoprecipitation-based HLA peptide isolation cannot distinguish HLA-restricted peptides presented on the cell surface from intracellular HLA:peptide complexes that might never reach the cell surface and are thus not suitable candidates for immunotherapy approaches (36, 37). 
Therefore, potential tumor antigen candidates have to be validated and their cell surface presentation has to be proven prior to their clinical application. The proof of peptide immunogenicity, in particular the detection of preexisting memory T-cell responses in patients, which we were able to detect for the majority of the analyzed AML/LSC-associated antigens, validates both the surface presentation of the respective antigens and their capacity to activate T cells.
Besides mutation-derived neoepitopes, noncanonical cryptic peptides were recently suggested as potential tumor antigens (15–19). Such peptides arise through rapid degradation (38) of noncoding translation products from novel open-reading frames (ORF), 5′ and 3′ UTRs, ncRNA, intronic and intergenic regions, or shifted reading frames in annotated protein-coding regions (off-frame; refs. 18–20, 39–42). Recent large-scale pan-cancer screens (15–17) revealed the average proportion of noncanonical peptides within the immunopeptidome to be roughly 1% to 3%. Interestingly, some noncanonical peptides were confirmed to be shared across different individuals as well as tumor entities, whereas others are tumor-specific. We identified 623 AML-associated noncanonical cryptic peptides, accounting for 0.9% of the total AML-derived immunopeptidome. The main sources of noncanonical peptides in our study were 5′ UTR and off-frame regions, which is in line with other studies (16, 17, 20). 5′ UTRs, translated through non-AUG start codons in upstream ORFs (42), contributed 41% of noncanonical peptides. Exonic regions translated in alternative frames were the source of 35% of noncanonical peptides, which might arise from initiation codon readthrough (43), novel ORFs (40), or ribosomal slippage (39) during translation. Immune responses targeting cryptic neoepitopes were reported to be infrequent. Chong and colleagues screened more than 500 noncanonical peptides; however, immune recognition was only detected for a single peptide (15). In line with this, we could not detect any preexisting T-cell responses in patients with AML or HVs targeting cryptic peptides.
AML arises from molecular alterations at the HSPC level. As in healthy hematopoiesis, LSCs are the origin of clonal growth and therefore considered responsible for maintenance of the leukemic population (44, 45). Given the central role of LSCs in development and pathogenesis of leukemic disease (1, 2), therapeutic elimination of this population is central to prevent relapses. However, the biological similarity between LSCs and normal HSPCs as well as the specific biological properties of LSCs hamper the development of therapeutic strategies for their effective destruction (46). With regard to antigen presentation, we observed no significant differences in HLA surface expression between HV-derived HSPCs and patient-derived LSCs and bulk AML cells, in particular no loss or significant downregulation previously postulated as mechanism by which AML cells escape immune surveillance (47, 48). HLA ligands presented by LSCs and AML bulk cells showed similar features in terms of physicochemical properties. Comparative immunopeptidome profiling delineated a distinct population of LSC-exclusive antigens not presented on benign HSPCs. Within the entirety of AML-exclusive antigens, a relevant proportion of peptides was shared between LSCs and bulk AML cells.
Beyond CD8+ T cells, CD4+ T cells also play a central role in the development and maintenance of effective antitumor immunity (49). Previous reports showed that spontaneous and vaccine-induced tumor-specific T-cell responses are predominantly mediated by CD4+ T cells (24, 50–53). Furthermore, HLA class II–restricted tumor antigen presentation and antigen-specific CD4+ T-cell responses correlate with clinical outcome of patients with cancer (54, 55). HSPCs present MHC class II–restricted antigens that interact with antigen-specific CD4+ T cells in mice (10). This immune surveillance mechanism effectively eliminates transformed progenitor cells, thereby preventing leukemia onset. We detected highly frequent spontaneous memory T cells targeting HLA class II–restricted antigens in patients with AML and HVs and showed that this antigen-specific T-cell recognition and HLA class II immunopeptidome diversity correlate with clinical outcome in patients with AML, consistent with a role in immune surveillance. This observation, which has to be validated in future prospective studies of larger cohorts, provides first evidence for the impact of immunopeptidome diversity and peptide-specific T-cell responses on patient survival. It also elucidates the pathophysiologic role of HLA class II antigen presentation and underscores its relevance for immune control of malignant disease (47, 56). The latter is based on (i) activation of CD4+ T cells that mediate direct and indirect effector functions in anticancer immunity, (ii) additional direct activation of CD8+ T cells by embedded HLA class I T-cell epitopes, and (iii) the promiscuous binding to multiple different HLA allotypes, which qualifies these AML/LSC antigens as highly promising novel targets for broadly applicable immunotherapeutic approaches like vaccines, adoptive T-cell transfer, or TCR-engineered therapies.
Together, our results unraveled the immunopeptidomic landscape of LSCs and delineated AML/LSC-associated antigens that mediate immune surveillance in AML and may facilitate the development of novel LSC-directed immunotherapeutic approaches for patients with AML. We are currently preparing a multicentric, open-label, phase I clinical study that will address two central issues of former peptide-based vaccination trials (57–60) by selection of a personalized multipeptide vaccine from a peptide warehouse comprising our naturally presented AML/LSC-associated antigens and adjuvating the vaccine with the novel TLR1/2 agonist XS15 (61) that enables the induction of superior, long-lasting T-cell responses and thus will provide an in vivo evaluation of the AML/LSC antigens directly in humans.
METHODS
Patients and Blood Samples
For immunopeptidome analysis, PBMCs or bone marrow mononuclear cells from patients with AML (Supplementary Table S3) at the time of diagnosis, at relapse (n = 52), or in molecular remission (n = 8) as well as hematopoietic stem cell apheresis from G-CSF–mobilized blood donations (n = 8) of HVs (n = 1) and patients with nonhematologic malignancies (n = 7) were collected at the Departments of Hematology and Oncology in Tübingen and Dresden, Germany as well as at the Department of Medicine, Divisions of Hematology and Medical Oncology at the University of California, San Francisco. For T cell–based assays, PBMCs from HVs (Supplementary Table S10, n = 92) and patients with AML (Supplementary Table S9, n = 78) after allogeneic stem cell transplantation or in complete remission at different time points after standard treatment were collected. Cells were isolated by density gradient centrifugation and stored at −80°C. Clinical and survival data were collected within a follow-up phase of up to 48 months after the date of diagnosis. Written informed consent was obtained in accordance with the Declaration of Helsinki protocol. The study was performed according to the guidelines of the local ethics committees (373/2011B02, 454/2016B02, EK 20805217). HLA typing was carried out by the Department of Hematology and Oncology, Tübingen, Germany. Patient and HV demographic and clinical characteristics are provided in Supplementary Tables S3, S9, and S10.
HLA Surface Quantification
HLA surface expression was determined using the QIFIKIT bead-based quantification flow cytometric assay (Dako, catalog no. K007811–8) according to the manufacturer's instructions. In brief, cells were stained either with the pan-HLA class I-specific W6/32 mAb, the HLA-DR–specific L243 mAb (produced in-house), or IgG isotype control (BioLegend, catalog no. 400202, RRID: AB_2927399, clone MOPC-173), respectively. Polyclonal goat FITC anti-mouse antibody (Dako, catalog no. F047902, RRID: AB_578665) was used as a secondary antibody. After washing with normal mouse serum (eBioscience, catalog no. 24–5544–94) surface marker staining was performed using PE/Cy7 anti-human CD38 (BioLegend, catalog no. 356608, RRID: AB_2561903, clone HB-7), APC anti-human CD34 (BD Biosciences, catalog no. 555824, RRID: AB_398614, clone 581), and Pacific Blue anti-human CD45 (BD Biosciences, catalog no. 642275, RRID: AB_1645755, clone 2D1) antibodies. Aqua fluorescent reactive dye (Invitrogen, catalog no. L34957) was used as viability marker. Analyses were performed on a FACS Canto II cytometer (BD Biosciences). Only cell populations with ≥100 cells were analyzed for their HLA surface expression.
LSC Enrichment
Enrichment of LSCs from AML samples was performed either by fluorescence-activated cell sorting (FACS) at the Institute for Stem Cell Biology and Regenerative Medicine, Stanford, CA (UPN3–8, UPN11), or by magnetic-activated cell sorting (MACS) at the Institute for Cell Biology, Department of Immunology, University of Tübingen, Tübingen, Germany (UPN01, UPN02, UPN09, UPN10). For FACS, PBMCs were stained with APC anti-human CD34 (BD Biosciences, catalog no. 340441, RRID: AB_400514, clone 8G12), PE/Cy7 anti-human CD38, and PerCP/Cy5.5 anti-human CD3 (BioLegend, catalog no. 300328, RRID: AB_1575008, clone HIT3a), CD19 (BioLegend, catalog no. 302229, RRID: AB_2275547, clone HIB19), CD20 (BioLegend, catalog no. 302325, RRID: AB_893285, clone 2H7), and CD56 (BioLegend, catalog no. 318321, RRID: AB_893391, clone HCD56) mAbs and sorted on a FACSAria II or FACSAria III (BD Biosciences). MACS was performed with the human CD34 MultiSort (Miltenyi Biotec, catalog no. 130–056–701) and CD38 MicroBead Kits (Miltenyi Biotec, catalog no. 130–092–263). Sorted cells were stained with PE/Cy7 anti-human CD38, APC anti-human CD34, and Pacific Blue anti-human CD45 mAbs to determine the purity. Aqua fluorescent reactive dye was used as viability marker. Analyses were performed on a FACSCanto II cytometer. CD34+ HSPCs were magnetically enriched (CD34 MicroBead Kit, Miltenyi Biotec, catalog no. 130–046–702) from hematopoietic stem cell apheresis from G-CSF–mobilized blood donations of HVs and patients with nonhematologic malignancies.
Mice and Xenotransplantation Assays
Xenotransplantation assays were performed at the Department of Biomedicine, University of Basel and University Hospital Basel, Switzerland. NOD.Cg-Prkdcscid IL2rgtmWjl/Sz mice (NSG, The Jackson Laboratory, strain # 005557) were maintained under pathogen-free conditions according to the Swiss federal and state regulations. All animal experiments were approved by the Veterinäramt Basel-Stadt (24981). Xenotransplantation assays were performed as previously described (62). In brief, 6 × 10^5 primary human sorted CD34+CD38− LSCs were transplanted via intrafemoral injection into 8-week-old female NSG mice (n = 4). Engraftment was monitored as previously described (62) via routine bone marrow punctures or assessment of peripheral blood. Engraftment was defined as ≥1% human leukemic cells in murine peripheral blood or bone marrow as analyzed by multicolor flow cytometry using antibodies against human leukemic antigens. The panel includes fluorescent antibodies against human CD33 (BD Biosciences, catalog no. 555450, RRID: AB_395843, clone WM53), CD34 (BD Biosciences, catalog no. 340441, RRID: AB_400514, clone 8G12), CD133 (BD Biosciences, catalog no. 566595, RRID: AB_2739755, clone 293C3), CD117 (BD Biosciences, catalog no. 339195, RRID: AB_647418), CD45 (BD Biosciences, catalog no. 561865, RRID: AB_10896120, clone HI30), CD14 (eBioscience, catalog no. 17–0149–42, RRID: AB_10669167, clone 61D3), CD13 (eBioscience, catalog no. 12–0138–42, RRID: AB_10853031, clone WM15), CD3 (BioLegend, catalog no. 317318, RRID: AB_1937212, clone OKT3), and CD19 (BioLegend, catalog no. 302208, RRID: AB_314238, clone HIB19). All mice underwent final bone marrow, peripheral blood, and organ assessment by multicolor flow cytometry.
Isolation of HLA Ligands
HLA class I and HLA class II molecules were isolated from snap-frozen cell pellets by standard immunoaffinity purification (63) using the pan-HLA class I–specific W6/32, the pan-HLA class II–specific Tü-39, and the HLA-DR–specific L243 mAbs (produced in-house) cross-linked to CNBr-activated Sepharose (Sigma-Aldrich) to extract HLA ligands. Cells were lysed in lysis buffer [CHAPS (Panreac AppliChem), cOmplete protease inhibitor cocktail tablet (Roche) in PBS] for 1 hour on a shaker at 4°C, sonicated, and centrifuged (45 minutes, 4,000 rpm) and incubated again for 1 hour. Lysates were cleared by sterile filtration (5-μm filter unit; Merck Millipore) and cyclically passed through a column-based setup overnight at 4°C. Columns were washed with PBS (30 minutes) and ddH2O (1 hour). Peptides were eluted by 0.2% trifluoroacetic acid (TFA), isolated by ultrafiltration (Amicon filter units; Merck Millipore), lyophilized, and desalted using ZipTip pipette tips with C18 resin (Merck).
Mass Spectrometric Data Acquisition
For the mass spectrometric analysis (64), peptides were loaded on a 75 μm × 2 cm PepMap Nanotrap Column (Thermo Fisher Scientific) at a flow rate of 4 μL/minute for 10 minutes. Subsequent separation was performed by nanoflow high-performance liquid chromatography (RSLCnano, Thermo Fisher Scientific) using a 50 μm × 25 cm PepMap rapid separation column (Thermo Fisher Scientific, particle size of 2 μm) and a linear gradient ranging from 2.4% to 32.0% acetonitrile at a flow rate of 0.3 μL/minute over the course of 90 minutes. Eluting peptides were analyzed in technical replicates in an online-coupled Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher Scientific) equipped with a nanoelectrospray ion source in data-dependent acquisition mode using a top speed collision-induced dissociation (CID, normalized collision energy 35%, HLA class I peptides) or higher-energy collisional dissociation (HCD, normalized collision energy 30%, HLA class II peptides) fragmentation method. MS1 and MS2 spectra were detected in the Orbitrap with a resolution of 120,000 and 30,000, respectively. The maximum injection time was set to 50 ms and 150 ms for MS1 and MS2, respectively. The dynamic exclusion was set to 7 and 10 seconds for HLA class I and HLA class II, respectively. Mass range for HLA class I peptide analysis was set to 400–650 m/z with charge states 2+ and 3+ selected for fragmentation. For HLA class II peptide analysis, mass range was limited to 400–1,000 m/z with charge states 2+ to 5+ selected for fragmentation.
Data Processing
Data processing was performed as described previously (64). In brief, the SEQUEST HT search engine (University of Washington, Seattle, WA; ref. 65) was used to search the human proteome as comprised in the Swiss-Prot database (20,279 reviewed protein sequences, September 27, 2013) without enzymatic restriction. Precursor mass tolerance was set to 5 ppm, and fragment mass tolerance to 0.02 Da. Oxidized methionine was allowed as a dynamic modification. The FDR was estimated using the Percolator algorithm (66) and limited to 5% for HLA class I and 1% for HLA class II. Peptide lengths were limited to 8 to 12 amino acids for HLA class I and to 8 to 25 amino acids for HLA class II. Protein inference was disabled, allowing for multiple protein annotations of peptides. HLA class I annotation was performed using NetMHCpan 4.0 (67, 68) and SYFPEITHI (69) annotating peptides with percentile rank below 2% and ≥60% of the maximal score, respectively. Comparative profiling approaches, that is, overlap analysis and frequency-based comparisons of AML- and benign-derived immunopeptidomes, were performed with curated AML and LSC immunopeptidome data excluding peptides that are presented on only one single sample with PSM counts ≤3 (“one hit wonders”).
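The curation step described above can be illustrated with a short Python sketch that applies the class-specific peptide length limits and removes "one hit wonders", that is, peptides presented on only one sample with a PSM count of at most 3. The data layout (a mapping from peptide to per-sample PSM counts), the function names, and the toy peptides are assumptions made for this example; they are not taken from the original pipeline.

```python
# Hypothetical data layout: {peptide: {sample_id: psm_count}}.
LENGTH_LIMITS = {"I": (8, 12), "II": (8, 25)}  # amino acids, as stated in the text


def within_length(peptide, hla_class):
    """Check the class-specific peptide length window."""
    lo, hi = LENGTH_LIMITS[hla_class]
    return lo <= len(peptide) <= hi


def is_one_hit_wonder(sample_psms, max_psms=3):
    """True if the peptide was found on exactly one sample with <= max_psms PSMs."""
    if len(sample_psms) != 1:
        return False
    (count,) = sample_psms.values()
    return count <= max_psms


def curate(peptides, hla_class):
    """Keep peptides inside the length window that are not one hit wonders."""
    return {
        pep: psms
        for pep, psms in peptides.items()
        if within_length(pep, hla_class) and not is_one_hit_wonder(psms)
    }


# Toy input with invented sequences and sample counts.
example = {
    "AAAAAAAAA": {"UPN01": 2},              # one sample, 2 PSMs: removed
    "CCCCCCCCC": {"UPN01": 5},              # one sample, 5 PSMs: kept
    "DDDDDDDDD": {"UPN01": 1, "UPN02": 1},  # two samples: kept
    "EEEEEEE": {"UPN01": 9, "UPN02": 4},    # 7 residues: removed (too short)
}
curated = curate(example, "I")
```

In this reading, a peptide seen on a single sample is retained only if it is supported by more than three PSMs.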
Screening for Neoepitopes
For neoepitope screening, we used a non-patient–individual mutFASTA, which includes the TOP100 recurrent AML-associated missense mutations specified in the COSMIC database (https://cancer.sanger.ac.uk/cosmic; ref. 70) supplemented with the most common NPM1 frame-shift mutations (type A, B, C, D, and E; ref. 71) as well as FLT3-ITD (72) and FLT3-TKD (73–77) mutations (Supplementary Table S8). Data processing of AML immunopeptidomics data with the mutFASTA was performed as described above. To minimize false positive identifications, more stringent filter criteria with XCorr ≥1 and ΔCn ≥0.2 were applied. After manual spectrum validation, candidate neoepitopes were produced as isotope-labeled synthetic peptides and used for spectral comparison and validation.
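To make the mutFASTA idea concrete, the following Python sketch applies a missense mutation to a protein sequence, formats the result as a FASTA record, and encodes the stringent PSM filter (XCorr ≥1 and ΔCn ≥0.2). The toy sequence, the record name, and all function names are hypothetical; how the original mutFASTA was actually assembled is not described beyond the text above.

```python
import re


def apply_missense(sequence, mutation):
    """Return `sequence` with a missense mutation such as 'R5Q' applied (1-based)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"not a missense mutation: {mutation}")
    ref, pos, alt = match.group(1), int(match.group(2)), match.group(3)
    if pos < 1 or pos > len(sequence):
        raise ValueError(f"position {pos} outside sequence of length {len(sequence)}")
    if sequence[pos - 1] != ref:
        raise ValueError(f"expected {ref} at position {pos}, found {sequence[pos - 1]}")
    return sequence[: pos - 1] + alt + sequence[pos:]


def fasta_record(name, sequence, width=60):
    """Format one FASTA entry with fixed-width sequence lines."""
    lines = [sequence[i : i + width] for i in range(0, len(sequence), width)]
    return ">" + name + "\n" + "\n".join(lines) + "\n"


def passes_filter(xcorr, delta_cn, min_xcorr=1.0, min_delta_cn=0.2):
    """Stringent PSM filter from the text: XCorr >= 1 and deltaCn >= 0.2."""
    return xcorr >= min_xcorr and delta_cn >= min_delta_cn


toy_protein = "MKTARLLQW"  # invented toy sequence, not a real protein
mutant = apply_missense(toy_protein, "R5Q")
record = fasta_record("TOY_R5Q", mutant)
```

Checking the reference residue before substitution guards against off-by-one errors when mutation coordinates and the sequence come from different database versions.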
Peptide Synthesis and Spectrum Validation
Peptides were produced by the peptide synthesizer Liberty Blue (CEM) using the 9-fluorenylmethyl-oxycarbonyl/tert-butyl strategy (78). Spectrum validation of the experimentally eluted peptides was performed by computing the similarity of the spectra with corresponding isotope-labeled synthetic peptides measured in a complex matrix. The spectral correlation was calculated between the MS/MS spectra of the eluted and the synthetic peptide (79).
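As a rough illustration of comparing an eluted with a synthetic MS/MS spectrum, the sketch below bins both peak lists and computes a normalized dot product (cosine similarity). This is an assumed stand-in and not necessarily the correlation measure of ref. 79. The bin width, function names, and peak values are invented, and the mass shift of fragments carrying the isotope label is ignored here, so labeled fragments would need to be shifted back before comparison.

```python
import math
from collections import defaultdict


def bin_spectrum(peaks, bin_width=0.02):
    """Sum intensities of (m/z, intensity) peaks into fixed-width m/z bins."""
    binned = defaultdict(float)
    for mz, intensity in peaks:
        binned[round(mz / bin_width)] += intensity
    return binned


def cosine_similarity(peaks_a, peaks_b, bin_width=0.02):
    """Normalized dot product of two binned spectra (0 = no overlap, 1 = identical shape)."""
    a = bin_spectrum(peaks_a, bin_width)
    b = bin_spectrum(peaks_b, bin_width)
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Invented peak lists: same fragment masses, synthetic measured at twice the intensity.
eluted = [(175.12, 100.0), (288.20, 50.0), (401.29, 25.0)]
synthetic = [(175.12, 200.0), (288.20, 100.0), (401.29, 50.0)]
score = cosine_similarity(eluted, synthetic)
```

Because the score is normalized, it depends only on relative fragment intensities, not on the absolute signal of either measurement. Fixed bins can split peaks that lie close to a bin edge, which a tolerance-based peak matching would avoid.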
Identification of Cryptic Peptides
Cryptic HLA class I peptides were identified using Peptide-PRISM as described recently (30). De novo peptide sequencing was performed with PEAKS X (Bioinformatics Solutions Inc; ref. 80). Raw data refinement was performed with the following settings: (i) merge options: no merge; (ii) precursor options: corrected; (iii) charge options: no correction; (iv) filter options: no filter; (v) process: true; (vi) default: true; and (vii) associate chimera: yes. De novo sequencing was performed with parent mass error tolerance set to 10 ppm. Fragment mass error tolerance was set to 0.15 Da, and enzyme was set to none. The following variable modifications have been used: oxidation (M), pyro-Glu from Q (N-term Q), and carbamidomethylation (C). A maximum of three variable posttranslational modifications were allowed per peptide. Up to 10 de novo sequencing candidates were reported for each identified fragment ion mass spectrum, with their corresponding average local confidence score. Because we applied the chimeric spectra option of PEAKS X, two or more TOP10 candidate lists could be assigned to a single fragment ion spectrum. Two tables (“all de novo candidates” and “de novo peptides”) were exported from PEAKS for further analysis. All de novo sequence candidates were matched against the six-frame translated human genome (hg38) and the three-frame translated human transcriptome (ENSEMBL 90) using Peptide-PRISM. Results were filtered to 10% FDR for each category (CDS, UTR5, OffFrame, ncRNA, UTR3, intronic, and intergenic). NetMHCpan 4.0 was used to predict binding affinities for all identified HLA class I peptides for all HLA alleles of the corresponding sample. Shown AML-associated cryptic peptides are AML-exclusive peptides never identified on any benign tissue sample.
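The idea behind matching de novo candidates against a six-frame translation can be sketched in a few lines of Python. This is only a conceptual illustration using the standard genetic code; it is not the Peptide-PRISM implementation, and the function names and the short DNA string are invented for the example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate from the first base; codons with unknown bases become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i : i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translation(dna):
    """Return translations of the three forward (+) and three reverse (-) frames."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


def find_peptide(peptide, dna):
    """Return the frames whose translation contains the peptide."""
    return [f for f, protein in six_frame_translation(dna).items() if peptide in protein]


toy_dna = "ATGGCCAAGTAA"  # invented sequence; frame +1 reads M-A-K-stop
frames = six_frame_translation(toy_dna)
hits = find_peptide("WPS", toy_dna)
```

A three-frame translation of transcript sequences corresponds to using only the forward frames of this function.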
Amplification of Peptide-specific T Cells and IFNγ ELISpot Assay
PBMCs from patients with AML and HVs were pulsed with 1 μg/mL (HLA class I) or 5 μg/mL (HLA class II) per peptide and cultured for 12 days adding 20 U/mL IL2 (Novartis) on days 3, 5, and 7 (64, 81). Peptide-stimulated PBMCs were analyzed by ELISpot assay on day 12 (82). Spots were counted using an ImmunoSpot S5 analyzer (CTL) and T-cell responses were considered positive when >10 spots/500,000 cells were counted and the mean spot count was at least three-fold higher than the mean spot count of the negative control. The intensity of T-cell responses is depicted as calculated spot counts, which were calculated as the mean spot count of duplicates normalized to 5 × 10^5 cells minus the normalized mean spot count of the respective negative control.
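The positivity criteria and the calculated spot count described above translate directly into a small Python function. The function and parameter names are assumptions for illustration, and the spot counts in the usage lines are invented.

```python
from statistics import mean


def normalized_mean(spot_counts, cells_per_well, reference_cells=500_000):
    """Mean spot count of replicate wells, normalized to 5 x 10^5 cells."""
    return mean(spot_counts) * reference_cells / cells_per_well


def evaluate_elispot(test_spots, negative_spots, cells_per_well=500_000,
                     min_spots=10, min_fold=3.0):
    """Apply the two positivity criteria and compute the calculated spot count."""
    test = normalized_mean(test_spots, cells_per_well)
    negative = normalized_mean(negative_spots, cells_per_well)
    # Positive if >10 spots per 500,000 cells AND at least three-fold over the negative control.
    positive = test > min_spots and test >= min_fold * negative
    return {"positive": positive, "calculated_spot_count": test - negative}


# Invented duplicate wells: peptide-stimulated versus negative control peptide.
clear_response = evaluate_elispot([40, 50], [4, 6])
weak_response = evaluate_elispot([12, 14], [5, 7])
```

The second example exceeds 10 spots but fails the three-fold criterion, so it is not scored as a positive response even though its calculated spot count is above zero.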
Refolding
Biotinylated HLA–peptide complexes were manufactured as described previously (83) and tetramerized using PE-conjugated streptavidin (Invitrogen) at a 4:1 molar ratio.
Induction of Peptide-specific CD8+ T Cells with aAPCs
Priming of peptide-specific CTLs was conducted using aAPCs as described before (23, 84). In detail, 800,000 streptavidin-coated microspheres (5.6-μm diameter, Bangs Laboratories) were loaded with 200 ng biotinylated peptide–HLA complexes and 600 ng biotinylated anti-human CD28 antibody (clone 9.3, in-house production). MACS-sorted CD8+ T cells (CD8 Microbeads, Miltenyi Biotec, catalog no. 130–045–201) were cultured with 4.8 U/μL IL2 (R&D Systems) and 1.25 ng/mL IL7 (PromoKine). Weekly stimulation with aAPCs (200,000 aAPCs per 1 × 10^6 CD8+ T cells) and 5 ng/mL IL12 (PromoKine) was performed four times. Induction of peptide-specific T cells was analyzed by tetramer staining.
Cytokine and Tetramer Staining
The frequency and functionality of peptide-specific CD8+ T cells were analyzed by tetramer staining and ICS as described previously (82, 85). For ICS, cells were pulsed with 10 μg/mL of individual peptide and incubated with 10 μg/mL Brefeldin A (Sigma-Aldrich) and 10 μg/mL GolgiStop (BD Biosciences) for 12–16 hours. Staining was performed using Cytofix/Cytoperm (BD Biosciences), PE/Cy7 anti-human CD8 (BioLegend, catalog no. 344711, RRID: AB_2044007, clone SK1), APC/Cy7 anti-human CD4 (BioLegend, catalog no. 300518, RRID: AB_314086, clone RPA-T4), Pacific Blue anti-human TNF (BioLegend, catalog no. 502920, RRID: AB_528965, clone MAb11), FITC anti-human CD107a (BioLegend, catalog no. 328606, RRID: AB_1186036, clone H4A3), and PE anti-human IFNγ (BioLegend, catalog no. 506507, RRID: AB_315440, clone B27) mAbs. Aqua fluorescent reactive dye (Invitrogen, catalog no. L34957) was used as viability marker. PMA and ionomycin (Sigma-Aldrich) served as positive control. The following peptides were used as negative control peptides: GSEELRSLY, POL_HV1BR, HLA-A*01; YLLPAIVHI, DDX5_HUMAN, HLA-A*02; RLRPGGKKK, GAG_HV1BR, HLA-A*03; TPGPGVRYPL, NEF_HV1BR, HLA-B*07; DIAARNVL, FAK1_HUMAN, HLA-B*08; ASEDYVAPPK, MKX_HUMAN, HLA-A*11; ETVITVDTKAAGKGK, FLNA_HUMAN, HLA class II. The frequency of peptide-specific CD8+ T cells after aAPC-based priming was determined by PE/Cy7 anti-human CD8 mAb and HLA:peptide tetramer-PE staining. Aqua fluorescent reactive dye (Invitrogen, catalog no. L34957) was used as viability marker. Cells of the same donor primed with an HLA-matched control peptide were used as negative control. The priming was considered successful if the frequency of peptide-specific CD8+ T cells was >0.1% of CD8+ T cells within the viable single cell population and at least three-fold higher than the frequency of peptide-specific CD8+ T cells in the negative control.
The frequency of tetramer+, IFNγ+, TNF+, and CD107a+ T cells is depicted as calculated frequency, which is the frequency in the test well minus the frequency of the respective negative control. The same evaluation criteria were applied for ICS results. Samples were analyzed on a FACSCanto II cytometer.
Cytotoxicity Assay
Cytolytic capacity of peptide-specific CD8+ T cells was analyzed using the flow cytometry-based VITAL assay as described before (86, 87). Autologous CD8-depleted PBMCs were loaded with the test peptide or an HLA-matched control peptide and labeled with CFSE (Invitrogen) or FarRed (Invitrogen), respectively. Effector cells were added in the indicated effector-to-target ratios. Specific lysis of peptide-loaded target cells was calculated relative to control targets.
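The text states only that specific lysis was calculated relative to control targets. One commonly used formulation for VITAL-type assays compares the ratio of surviving test to control targets with and without effector cells; the Python sketch below implements that formulation as an assumption, not as the confirmed calculation of this study. Function and argument names and the event counts are invented.

```python
def specific_lysis(test_with, control_with, test_without, control_without):
    """Percent specific lysis from viable target-cell counts.

    test_*    : peptide-loaded (e.g., CFSE-labeled) targets
    control_* : control-peptide-loaded (e.g., FarRed-labeled) targets
    *_with    : counted in wells containing effector cells
    *_without : counted in wells without effector cells
    """
    if min(control_with, control_without, test_without) <= 0:
        raise ValueError("control and no-effector counts must be positive")
    ratio_with = test_with / control_with
    ratio_without = test_without / control_without
    return (1.0 - ratio_with / ratio_without) * 100.0


# Invented event counts: half of the test targets are lost in the presence of effectors.
lysis = specific_lysis(test_with=2500, control_with=5000,
                       test_without=5000, control_without=5000)
```

Normalizing to the no-effector well corrects for unequal mixing of the two labeled target populations.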
In-depth Phenotyping by Multicolor Flow Cytometry
For multicolor flow cytometry–based phenotyping of peptide-specific CD4+ T-cell responses, cells were stimulated with the pool of HLA class II–restricted AML/LSC-associated peptides (10 μg/mL of each peptide) and incubated with 10 μg/mL Brefeldin A and 10 μg/mL GolgiStop for 12–16 hours. Staining was performed using Cytofix/Cytoperm, PE/Cy7 anti-human CD8 (BioLegend, catalog no. 344711, RRID: AB_2044007, clone SK1), APC/Cy7 anti-human CD4 (BioLegend, catalog no. 300518, RRID: AB_314086, clone RPA-T4), Pacific Blue anti-human TNF (BioLegend, catalog no. 502920, RRID: AB_528965, clone MAb11), PE anti-human IFNγ (BioLegend, catalog no. 506507, RRID: AB_315440, clone B27), APC anti-human CD45RO (BioLegend, catalog no. 304210, RRID: AB_314426), PE-Dazzle 594 anti-human IL4 (BioLegend, catalog no. 500832, RRID: AB_2564036), and Brilliant Violet 650 anti-human CD62L (BioLegend, catalog no. 304831, RRID: AB_2561461) mAbs. Zombie Aqua (BioLegend, catalog no. 423101) was used as viability marker. PMA and ionomycin served as positive control. The peptide ETVITVDTKAAGKGK (FLNA_HUMAN) was used as negative control. Samples were analyzed on an LSR Fortessa cytometer (BD Biosciences).
Single-Cell Immune Profiling
In vitro amplified and HLA class II peptide pool-stimulated memory T cells of patients with AML were enriched by the IFNγ Secretion Assay – Cell Enrichment and Detection Kit (PE) (Miltenyi Biotec, catalog no. 130–054–201) and prepared according to the 10x Genomics cell preparation protocol. Single cells were partitioned into Gel Beads-in-Emulsion (GEMs) together with 10x barcoded Gel Beads and reverse transcriptase enzymatic reaction using the Chromium X instrument (10x Genomics). Single-cell gene expression libraries and single-cell TCR (VDJ) libraries were then prepared using the Chromium Next GEM Single Cell 5′ Kit v2 (10x Genomics, catalog no. PN-1000263), the Library Construction Kit (10x Genomics, catalog no. PN-1000190), and the Chromium Single Cell Human TCR Amplification Kit (10x Genomics, catalog no. PN-1000252) according to the manufacturer's instructions. Libraries were pooled and sequenced on a NovaSeq 6000 (Illumina) at 37,773, 80,897, and 46,925 mean reads per cell, respectively. Samples were demultiplexed using bcl-convert version 3.9.3 (Illumina). Barcode processing, alignment, VDJ annotation, and single-cell 5′ gene counting were performed using Cell Ranger Software version 7.1.0 (10x Genomics). Further data processing, visualization, and analysis were performed using scanpy version 1.9.1 and scirpy version 0.12.2 (88, 89) for all samples combined. Cells with unique gene counts <500 and cells without an associated VDJ sequence, as well as cells with >20% of mitochondrial genes, were removed from the analysis. Data were log-normalized to a size factor of 10,000. Only highly variable genes were considered for dimensional reduction. The effect of total counts was regressed out and counts were scaled to unit variance and zero mean for each gene. The dimensionality reduction was done using principal component analysis (PCA).
The neighborhood graph and UMAP embedding were computed using the UMAP algorithm (90) for 30 neighbors and the first 60 principal components (n_neighbors = 30, n_PC = 60). Unsupervised clustering was performed using the Leiden algorithm (91) resulting in 15 clusters with 14,697 cells in total. CD3+ clusters were selected for downstream analysis yielding 9 clusters with 3,058 cells (UPN114), 1,826 cells (UPN120) and 5,144 cells (UPN82), respectively. CD4 expressing clusters with 1,676 cells (UPN114), 616 cells (UPN120), 3,814 cells (UPN82) were then extracted from the previous clustering for further downstream analysis.
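The quality-control thresholds and the log-normalization described above can be written out in plain Python to make the arithmetic explicit. The study itself used scanpy and scirpy; this dependency-free sketch only illustrates the filtering logic. The per-cell data layout, the function names, the use of the "MT-" gene symbol prefix to identify mitochondrial genes, and the reading that each of the three criteria removes a cell on its own are assumptions.

```python
import math


def passes_qc(cell, min_genes=500, max_mito_fraction=0.20):
    """Keep cells with >= min_genes detected genes, a VDJ sequence, and <= 20% mitochondrial counts."""
    counts = cell["counts"]
    total = sum(counts.values())
    if total == 0:
        return False
    n_genes = sum(1 for c in counts.values() if c > 0)
    # Assumption: mitochondrial genes are identified by the 'MT-' symbol prefix.
    mito = sum(c for gene, c in counts.items() if gene.startswith("MT-"))
    return n_genes >= min_genes and cell["has_vdj"] and mito / total <= max_mito_fraction


def log_normalize(counts, size_factor=10_000):
    """Scale counts so each cell sums to size_factor, then apply log(1 + x)."""
    total = sum(counts.values())
    return {gene: math.log1p(c * size_factor / total) for gene, c in counts.items()}


# Invented toy cell with three detected genes (min_genes is lowered for the demo).
toy_cell = {"counts": {"CD4": 3, "CD3E": 5, "MT-CO1": 1}, "has_vdj": True}
kept = passes_qc(toy_cell, min_genes=3)
normalized = log_normalize(toy_cell["counts"])
```

Normalizing every cell to the same total before the log transform removes differences in sequencing depth between cells, so that downstream steps such as PCA compare expression profiles rather than library sizes.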
Software and Statistical Analysis
Overlap analysis was performed using BioVenn (92). The population coverage of HLA allotypes was calculated by the IEDB population coverage tool (www.iedb.org; ref. 28). For saturation analysis, the mean number of unique source proteins for a given cohort size (number of samples) has been calculated by 1,000 random samplings from the entirety of AML immunopeptidomes, that is, the number of source proteins of randomly picked samples was summed up for each cohort size and this process was repeated 1,000 times before calculating the average. An in-house Python script was used for the calculation of FDRs of AML-associated peptides at different presentation frequencies (64). Hotspot analysis (hotspot length ≥8 amino acids) of HLA class II immunopeptidomes was performed using an in-house R script that maps identified peptides according to their sequence onto their source proteins and calculates representation frequencies of single amino acid positions within the respective cohorts. Flow cytometric data were analyzed using FlowJo 10.0.8 (Treestar). For survival analysis investigating the impact of the immunopeptidome diversity, peptide yields of AML-exclusive peptides were normalized to the cell number applied for immunopeptidome analysis. OS and EFS were depicted for low and high immunopeptidome diversity according to the median peptide yields in the AML immunopeptidome cohort and calculated by Kaplan–Meier method. The log-rank test was performed to test the difference of survival between the groups. For survival analysis investigating the impact of preexisting antigen-specific immune responses against HLA class II-restricted AML- and LSC-associated peptides as detected by IFNγ ELISpot assays, patients were dichotomized into the group of responders showing a peptide-specific T-cell response and nonresponders without any detectable peptide-specific T-cell response. All figures and statistical analyses were generated using GraphPad Prism 9.4.1 (GraphPad Software).
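The saturation analysis described above amounts to a resampling loop, sketched here in Python. The input layout (one set of source proteins per sample), the function name, and the fixed random seed are assumptions for illustration; the original analysis code is not part of the text.

```python
import random


def saturation_curve(sample_proteins, n_iterations=1000, seed=0):
    """Mean number of unique source proteins for each cohort size.

    sample_proteins: list of sets, one set of source proteins per sample.
    Returns a list whose element i is the mean for cohort size i + 1.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    curve = []
    for cohort_size in range(1, len(sample_proteins) + 1):
        total = 0
        for _ in range(n_iterations):
            picked = rng.sample(sample_proteins, cohort_size)
            total += len(set().union(*picked))  # unique proteins in this draw
        curve.append(total / n_iterations)
    return curve


# Invented toy cohort of three samples with overlapping source proteins.
toy_samples = [{"A", "B"}, {"B", "C"}, {"C", "D"}]
curve = saturation_curve(toy_samples, n_iterations=200)
```

A curve that flattens with increasing cohort size indicates that additional samples contribute few new source proteins.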
Data are displayed as mean with SD; box plots show the median with 25th and 75th percentiles and min/max whiskers. Continuous data were tested for distribution, and individual groups were compared using the χ2 test, unpaired t test, unpaired Mann–Whitney U test, Kruskal–Wallis test, or paired Wilcoxon signed rank test, all performed as two-sided tests. If applicable, adjustment for multiple testing was performed. P values of <0.05 were considered statistically significant.
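For readers unfamiliar with the survival analysis described in this section, the following Python sketch shows a median-based dichotomization and a basic Kaplan–Meier product-limit estimate. The study performed these analyses in GraphPad Prism; the code is only an illustration. Function names, the handling of values equal to the median (assigned to the low group), and the follow-up data are assumptions.

```python
from statistics import median


def dichotomize_by_median(values):
    """Label each value 'high' if above the cohort median, otherwise 'low'."""
    cutoff = median(values)
    return ["high" if v > cutoff else "low" for v in values]


def kaplan_meier(times, events):
    """Return [(time, survival probability)] at each time with at least one event.

    events[i] is True for an observed event and False for a censored observation.
    """
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(set(times)):
        n_events = sum(1 for ti, e in zip(times, events) if ti == t and e)
        n_leaving = sum(1 for ti in times if ti == t)
        if n_events:
            survival *= 1.0 - n_events / at_risk
            curve.append((t, survival))
        at_risk -= n_leaving  # events and censored cases both leave the risk set
    return curve


# Invented follow-up data (months) for five patients.
groups = dichotomize_by_median([1.0, 2.0, 3.0, 4.0])
km = kaplan_meier([5, 8, 12, 12, 20], [True, False, True, True, False])
```

Censored patients reduce the number at risk without lowering the survival estimate, which is what distinguishes the Kaplan–Meier estimate from a simple proportion of surviving patients.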
Data Availability
The mass spectrometry immunopeptidomics data generated in this study have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE (93) partner repository with the dataset identifier PXD038691. The mass spectrometry immunopeptidomics raw data (.raw files) can be viewed by the Thermo Xcalibur Qual Browser or other .raw file viewers. The processed search engine output (.msf files) can be viewed by the Proteome Discoverer software. The scRNA-seq data generated in this study have been deposited in the NCBI's Gene Expression Omnibus database with the dataset identifier GSE235080.
Authors’ Disclosures
A. Nelde reports a patent for EP22206337.2 pending. H. Schuster reports personal fees from Immatics Biotechnologies GmbH outside the submitted work; and is currently employed by Immatics. J.S. Heitmann reports other support from Synimmune GmbH outside the submitted work. R. Majeti reports personal fees and other support from Kodikaz Therapeutic Solutions, Orbital Therapeutics, and Pheast Therapeutics; personal fees from 858 Therapeutics; and other support from Myelogene outside the submitted work. H.-G. Rammensee reports grants from DKTK, EXC2180, Ernst-Jung-Preis, and grants from Landesforschungspreis Baden-Württemberg during the conduct of the study; in addition, H.-G. Rammensee has a patent for “AML associated peptides” pending. J.S. Walz reports grants from German Research Foundation, German Research Foundation under Germany's Excellence Strategy, German Cancer Consortium, Wilhelm Sander Stiftung, grants from José Carreras Leukämie-Stiftung, German Cancer Aid, and grants from Fortüne Program of the University of Tübingen during the conduct of the study; in addition, J.S. Walz has a patent for EP22206337.2 pending. No disclosures were reported by the other authors.
Authors’ Contributions
A. Nelde: Conceptualization, data curation, formal analysis, investigation, visualization, writing–original draft, project administration, writing–review and editing. H. Schuster: Conceptualization, formal analysis, investigation, writing–review and editing. J.S. Heitmann: Resources, data curation, formal analysis, writing–review and editing. J. Bauer: Investigation, writing–review and editing. Y. Maringer: Investigation, writing–review and editing. M. Zwick: Investigation, writing–review and editing. J.-P. Volkmer: Investigation, writing–review and editing. J.Y. Chen: Investigation, writing–review and editing. A.M. Paczulla Stanger: Formal analysis, investigation, writing–review and editing. A. Lehmann: Formal analysis, visualization, writing–review and editing. B. Appiah: Formal analysis, funding acquisition, writing–review and editing. M. Märklin: Investigation, writing–review and editing. E. Rücker-Braun: Resources, writing–review and editing. H.R. Salih: Resources, writing–review and editing. M. Roerden: Resources, data curation, funding acquisition, writing–review and editing. S.M. Schroeder: Resources, data curation, writing–review and editing. M.-F. Häring: Resources, writing–review and editing. A. Schlosser: Formal analysis, writing–review and editing. J. Schetelig: Resources, writing–review and editing. M. Schmitz: Resources, funding acquisition, writing–review and editing. M. Boerries: Formal analysis, funding acquisition, writing–review and editing. N. Köhler: Formal analysis, funding acquisition, investigation, writing–review and editing. C. Lengerke: Resources, funding acquisition, investigation, writing–review and editing. R. Majeti: Investigation, writing–review and editing. I.L. Weissman: Resources, investigation, writing–review and editing. H.-G. Rammensee: Formal analysis, funding acquisition, writing–review and editing. J.S. Walz: Conceptualization, resources, formal analysis, supervision, funding acquisition, project administration, writing–review and editing.
Acknowledgments
We thank Ulrike Schmidt, Claudia Falkenburger, Beate Pömmerl, and Ulrich Wulle for technical support. This work was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation, WA 4608/1–2, to J.S. Walz; CRC1479 Project ID 441891347-P03, to N. Köhler, CRC/TRR167-Project ID 259373024-Z01, to M. Boerries and B. Appiah; CRC 1160-Project ID 256073931-Z02, to M. Boerries; CRC1479 Project ID 441891347-S1, to M. Boerries), the Deutsche Forschungsgemeinschaft under Germany's Excellence Strategy (EXC2180-390900677, to J.S. Walz and H.-G. Rammensee and CIBSS – EXC 2189 project ID 390939984, to N. Köhler), the Federal Ministry of Education and Research (03ZU1111LB, to M. Schmitz; MIRACUM-FKZ 01ZZ1801B and PM4Onco-FKZ 01ZZ2322A, to M. Boerries), the German Cancer Consortium (DKTK, to C. Lengerke, H.-G. Rammensee, and J.S. Walz), the Ernst Jung Prize for Medicine (to H.-G. Rammensee), the Landesforschungspreis of Baden-Württemberg (to H.-G. Rammensee), the Wilhelm Sander Stiftung (2016.177.3, to J.S. Walz), the José Carreras Leukämie-Stiftung (DJCLS 05 R/2017, to J.S. Walz), the Deutsche Krebshilfe (German Cancer Aid, 70114948, to J.S. Walz), the Swiss National Science Foundation (310030_179239, to C. Lengerke), the European Research Council (HemStem Consolidator Grant, to C. Lengerke), the Else Kröner-Fresenius-Stiftung (2019_A74, to N. Köhler), and the Fortüne Program of the University of Tübingen (2451–0-0 and 2581–0-0, to J.S. Walz and M. Roerden).
The publication costs of this article were defrayed in part by the payment of publication fees. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.
Note: Supplementary data for this article are available at Blood Cancer Discovery Online (https://bloodcancerdiscov.aacrjournals.org/).