SF3B1 mutations, which occur in 20% of patients with myelodysplastic syndromes (MDS), are the hallmarks of a specific MDS subtype, MDS with ringed sideroblasts (MDS-RS), which is characterized by the accumulation of erythroid precursors in the bone marrow and primarily affects the elderly population. Here, using single-cell technologies and functional validation studies of primary SF3B1-mutant MDS-RS samples, we show that SF3B1 mutations lead to the activation of the EIF2AK1 pathway in response to heme deficiency and that targeting this pathway rescues aberrant erythroid differentiation and enables the red blood cell maturation of MDS-RS erythroblasts. These data support the development of EIF2AK1 inhibitors to overcome transfusion dependency in patients with SF3B1-mutant MDS-RS with impaired red blood cell production.
MDS-RS are characterized by significant anemia. Patients with MDS-RS die from a shortage of red blood cells and the side effects of iron overload due to their constant need for transfusions. Our study has implications for the development of therapies to achieve long-lasting hematologic responses.
Cancer-causing mutations can dysregulate transcriptional programs and cause mutant cells to become dependent on certain molecular regulators or signaling pathways. The regulators or pathways that cancer cells become reliant upon vary based on the cell type and the dynamics of the cell states affected by the mutations (1).
Some of the best examples of hematologic alterations that can induce specific cell identity–based morphologic and clinical phenotypes are hotspot mutations in the splicing factor SF3B1 (SF3B1MT). SF3B1MT are highly recurrent initiating events in myelodysplastic syndromes (MDS), arising in the most primitive hematopoietic stem cells (HSC; ref. 2). Although found in different hematopoietic cell types (3), SF3B1MT preferentially deregulate erythroid progenitors and are hallmarks of a subtype of MDS characterized by the accumulation of ringed sideroblasts (RS) in the bone marrow (BM), MDS-RS (4–7).
Several studies have dissected the molecular mechanisms by which SF3B1MT modulate RNA splicing in MDS-RS. The SF3B1MT-induced aberrant splicing of genes involved in heme biosynthesis and mitochondrial iron transport leads to the abnormal deposition of iron in erythroid cells, resulting in dysfunctional hemoglobin synthesis and RS formation (8–12). However, given that few studies have elucidated how SF3B1MT affect distinct transcriptional states in erythroid progenitors and precursors, the dysregulated transcriptional programs induced by iron accumulation and the ways by which these programs arrest erythroid differentiation are largely unknown.
Here, using single-cell technologies, we sought to elucidate how SF3B1MT disrupt the transcriptional states of MDS-RS hematopoietic stem and progenitor cells (HSPC) and downstream differentiating erythroblasts to find disease dependencies that are not predicted by SF3B1MT-induced aberrant splicing.
SF3B1MT Do Not Impair Erythropoiesis at the Level of HSPCs
To dissect at the single-cell level the cellular and transcriptomic changes induced by SF3B1MT in cells undergoing erythroid differentiation, we performed single-cell RNA-sequencing (scRNA-seq) analysis of lineage-negative (Lin−) CD34+ HSPCs isolated from the BM of five untreated SF3B1-mutant MDS-RS patients and two age-matched healthy donors (HD; Supplementary Table S1). Our analyses identified 16 cellular clusters driven by cellular differentiation profiles (Fig. 1A) that we defined based on the differential expression of validated lineage-specific transcriptional factors (TF) and cellular markers (refs. 13–16; Supplementary Fig. S1A; Supplementary Table S2). To dissect the changes in lineage specification induced by SF3B1MT, we compared the distribution of HD and SF3B1-mutant HSPCs in the distinct hematopoietic clusters and found that HSPCs from MDS-RS patients had a predominant erythroid/megakaryocytic (Ery/Mk) differentiation (Fig. 1B). Importantly, MDS-RS Ery/Mk cells had higher expression of KLF1 (P = 0) and lower expression of FLI1 (P = 4.68−36), the two major regulators of Ery and Mk differentiation, respectively (17), which suggests that these cells were more erythroid-biased than those from HDs were (Supplementary Fig. S1B). Differential expression analyses revealed that SF3B1-mutant HSPCs undergoing Ery/Mk differentiation, compared with their healthy counterparts, exhibited upregulation of genes in the MYC signaling and oxidative phosphorylation pathways (Fig. 1C; Supplementary Fig. S1C and S1D), which underscores the state of metabolic activation in SF3B1-mutant cells. Similar results were observed when we analyzed the expression profile of most immature hematopoietic cells (cluster 1; Supplementary Fig. S1E and S1F).
To further evaluate whether the fate of SF3B1-mutant Ery/Mk cells was biased toward erythroid differentiation at the expense of megakaryopoiesis, we performed single-cell assays for transposase-accessible chromatin with high-throughput sequencing (scATAC-seq) to profile the chromatin accessibility landscape in Lin− CD34+ HSPCs isolated from three SF3B1-mutant MDS-RS patients and two age- and gender-matched HDs (Supplementary Table S1). We identified 8 cellular clusters with distinct TF binding motif enrichment in open chromatin regions (Fig. 1D; Supplementary Table S3). Cells in cluster 5 were in the earliest differentiation state and were characterized by the highest activities of TFs involved in retinoid and nuclear factor family signaling (Supplementary Fig. S1G). Conversely, cells in clusters 2 and 3 showed the highest activities of TFs involved in Ery/Mk specification, including many members of the GATA family of TFs (Fig. 1E). Cells in clusters 1 and 4 were myeloid-biased and had the highest activities of the TFs SPI1 (PU.1) and CEBPA (Supplementary Fig. S1H). Differential analysis of chromatin accessibility among the samples showed that MDS-RS HSPCs at the latest stage of Ery/Mk differentiation before lineage specification (cluster 2) had significantly decreased open chromatin peaks at the distal elements of genes involved in platelet production (hemostasis; Fig. 1F; Supplementary Table S4). Because distal elements are better than promotors in defining how cells are poised for further differentiation (18), these data demonstrate that SF3B1MT do not arrest erythropoiesis at the HSPC level but instead induce HSC metabolic activation and promote differentiation toward the erythroid lineage at the expense of megakaryopoiesis.
SF3B1MT Arrest Erythroblasts’ Terminal Differentiation and Activate the EIF2AK1-Induced Response Pathway to Heme Deficiency
To evaluate whether SF3B1MT affects terminal erythroid differentiation, we performed scRNA-seq analysis of mononuclear cells (MNC) isolated from the BM of the same five SF3B1-mutant MDS-RS patients and two HDs whose HSPC compartments we analyzed previously. Our analysis identified 20 cellular clusters inclusive of all major BM cell types, which we defined according to known cellular surface markers and lineage potential (ref. 19; Fig. 2A; Supplementary Fig. S2A; Supplementary Table S5). As expected, differential analysis of BM cell type composition revealed that SF3B1-mutant MNCs, compared with HD MNCs, were characterized by an erythroid predominance (Supplementary Fig. S2B), which is consistent with the accumulation of RS in the BM of these patients (Supplementary Table S1). To determine the mechanistic basis by which SF3B1MT affect erythroid maturation and impair the release of red blood cells into the peripheral blood (PB), we analyzed the frequency and expression profile of each erythroid cluster within the erythroblastic population. We identified 8 cellular clusters representing 7 different stages of erythroblast differentiation (Fig. 2B), which we defined based on the expression of previously validated surface markers, TFs, and effector molecules involved in erythropoiesis (20, 21). Cells in the early stages of erythropoiesis, including phenotypic burst-forming unit-erythroid cells (BFU-E; cluster 18) and colony-forming unit-erythroid cells (CFU-E; cluster 9), were characterized by variable expression of the TF GATA2 and the surface markers CD34, CD36, CD117 (cKit), and CD105 (ENG), whereas cells undergoing terminal differentiation, including pro-erythroblasts (Pro-E; cluster 4), basophilic erythroblasts (Baso-E; cluster 12), polychromatophilic erythroblasts (Poly-E; cluster 7), and orthochromatic erythroblasts (Ortho-E; clusters 1 and 11), were characterized by progressively increased expression of the TFs GATA1 and KLF1, the erythropoietin and transferrin receptors (EPOR and TFRC, i.e., CD71), the membrane protein glycophorin A (GYPA, i.e., CD235a), and hemoglobin subunits. Pre-erythrocytes (Pre-E; cluster 13)—erythroblasts in the reticulocyte maturation phase, which occurs before the release of erythrocytes into the PB—expressed genes involved in hemoglobin synthesis most highly and surface markers that characterize the upstream stages of erythroblast differentiation most lowly (Supplementary Fig. S2C). Differential analysis of the cluster distribution between HD and SF3B1-mutant samples showed that SF3B1-mutant erythroblasts had an increased frequency of cells at the latest stages of erythroid differentiation, mainly at the Ortho-E stage (Fig. 2C). To exclude the possibility that the arrest of the SF3B1-mutant erythroblasts in the late stages of differentiation was the result of an increase in EPO in the BM of MDS-RS patients, we analyzed the changes induced by EPO administration in the BM MNCs of one MDS-RS patient carrying the SF3B1K700E mutation before and after treatment with EPO as supportive therapy (Supplementary Fig. S2D; Supplementary Table S6). EPO administration did not induce any changes in either the frequencies of the erythroid populations (Supplementary Fig. S2E) or these cells’ transcriptional profile (Supplementary Table S7).
Together, these data suggest that SF3B1MT arrest erythropoiesis at the final stage of erythroblast maturation by inhibiting the transition from Ortho-E to Pre-E, which results in the accumulation of terminally differentiated cells in the BM. Consistent with these observations, immunophenotypic analysis showed that, compared with those from HDs, terminally differentiated erythroblasts from SF3B1-mutant MDS-RS patients had a significantly higher frequency of cells in the orthochromatic stage V of differentiation (Supplementary Fig. S2F), as reported previously (22, 23).
To dissect the molecular mechanisms underlying the SF3B1MT-induced arrest of terminal erythroid differentiation, we analyzed the differential expression profiles between HD and SF3B1-mutant cells at each stage of erythroblast differentiation. These analyses revealed that SF3B1-mutant Pro-E, Baso-E, and Poly-E had significantly upregulated genes involved in the response of eukaryotic translation initiation factor 2 alpha kinase 1 (EIF2AK1) to heme deficiency, including EIF2AK1, EIF2S1, ATF4, and DDIT3 (Fig. 2D; Supplementary Fig. S2G). EIF2AK1 is a metabolic stress-sensing kinase that in homeostasis binds to its natural ligand, heme (24). In conditions that limit heme production, EIF2AK1 phosphorylates its main target, EIF2S1, and leads to decreased translation of globin mRNAs and increased expression of the stress response effector ATF4 (25). ATF4 and its downstream effector, DDIT3, are major regulators of autophagy (26) and activate a set of targets involved in autophagy to maintain cell survival in response to different types of stress stimuli, including those affecting mitochondrial functions (27–29). Accordingly, we observed that SF3B1-mutant Ortho-E exhibited a significant upregulation of genes involved in mitochondrial autophagy (Fig. 2E), including PINK1, SQSTM1 (p62), and LAMPTOR5 (Supplementary Fig. S2H). These data are consistent with an attempt by Ortho-E to remove dysfunctional mitochondria, alleviate the iron overload caused by SF3B1MT (8, 30), and maintain their survival.
To functionally validate these findings, we analyzed the morphology of erythroid cells isolated from the BM of SF3B1-mutant and SF3B1-wild-type (SF3B1WT) MDS-RS patients to assess differences in the ultrastructural features of autophagy. Transmission electron microscopy showed that SF3B1-mutant erythroblasts had significantly more autophagic vesicles than SF3B1WT erythroblasts do (Fig. 2F). To determine whether SF3B1MT instigated autophagy activation, we used CRISPR/Cas9 genome editing technology to introduce the SF3B1 missense mutation c.2098A>G (K700E) into the erythroblastic cell line K562. This mutation results in the K700E amino acid substitution, which is recurrently detected in patients with SF3B1-mutant MDS-RS (30). Consistent with our findings in primary samples, SF3B1-mutant cells in physiologic conditions or after hemin-induced erythroid differentiation had EIF2AK1 pathway upregulation (Supplementary Fig. S2I) and had significantly more autophagic vesicles (Supplementary Fig. S2J and S2K) and higher expression of LC3B protein (Supplementary Fig. S2L) than their parental counterparts did. Together, these data demonstrate that SF3B1MT arrest erythroid terminal differentiation and activate the EIF2AK1-induced response pathway to maintain erythroblasts’ survival in response to heme deficiency.
Hypomethylating Agent Therapy Inhibits the EIF2AK1-Induced Response Pathway to Heme Deficiency
We hypothesized that if EIF2AK1 pathway activation plays a role in arresting the maturation of terminally differentiated SF3B1-mutant erythroblasts, it should progressively decrease once patients achieve hematologic remission and become transfusion-independent following hypomethylating agent (HMA) therapy. To test this hypothesis, we dissected at the single-cell level the cellular and transcriptomic changes induced by HMA therapy in Lin−CD34+ HSPCs and BM MNCs, which were isolated from two SF3B1-mutant MDS-RS patients at the time of diagnosis and at the time of hematologic response following HMA therapy (Supplementary Table S1). Although HMA therapy did not overcome the aberrant differentiation of HSCs toward the erythroid lineage (Fig. 3A; Supplementary Fig. S3A and S3B) or significantly decrease the SF3B1MT burden in BM MNCs (Supplementary Fig. S3C; Supplementary Table S8), it did reduce the accumulation of erythroblasts in the BM (Fig. 3B; Supplementary Fig. S3D and S3E; Supplementary Table S9) by inducing these cells’ differentiation. Indeed, primary cells derived from SF3B1-mutant CD34+ MDS-RS cells cultured in erythroid differentiation media for 8 days and then treated with 5-azacitidine for 3 days had a higher percentage of terminally differentiated CD235a (GYPA)+ erythroblasts than those treated with vehicle (21.9% vs. 7.62%; Supplementary Fig. S3F).
Differential analysis of the cluster distribution in the erythroid compartment showed that, compared with those from untreated patients, erythroblasts from HMA-treated patients who achieved hematologic remission had a decreased frequency of Ortho-E (Fig. 3C). Differential expression profiles of untreated and HMA-treated cells at different stages of terminal erythroid differentiation revealed that HMA-treated Pro-E, Baso-E, and Poly-E (clusters 3, 6, and 5, respectively) had significant downregulation of the major effectors of the EIF2AK1 response (Fig. 3D; Supplementary Fig. S3G). Accordingly, the expression of autophagy genes was significantly decreased in HMA-treated Ortho-E (clusters 2 and 12; Fig. 3E; Supplementary Fig. S3H). Together, these data suggest that HMA therapy can overcome EIF2AK1 pathway activation in terminally differentiated erythroblasts in patients who have had hematologic remission and have become transfusion-independent.
Inhibition of EIF2AK1 Overcomes the Accumulation of RS and Enables Red Blood Cell Production
We hypothesized that the inhibition of EIF2AK1 pathway activation induces the maturation of SF3B1-mutant erythroblasts. To test this hypothesis, we used CRISPR/Cas9 genome editing technology to delete EIF2AK1 in primary erythroblasts derived from a three-phase erythroid differentiation culture system of SF3B1-mutant CD34+ cells (31). EIF2AK1 depletion in MDS-RS cells using previously validated single-guide RNAs (sgRNA; ref. 32; Fig. 4A) significantly increased terminally differentiated CD71 (TFRC)+CD235a (GYPA)+ and CD71−CD235a+ erythroblasts at the expense of less-differentiated CD71+CD235a− cells after 13 days of culture (Fig. 4B; Supplementary Fig. S4A). In contrast, EIF2AK1 depletion in HD CD34+ cells had no significant effect on erythroid differentiation (Supplementary Fig. S4B and S4C). To validate these results, we performed an scRNA-seq analysis of three MDS-RS samples treated with nontargeting (NT) or EIF2AK1 sgRNAs at day 13 of erythroid differentiation (Fig. 4C). Differential analysis of the cluster distribution showed that, compared with those treated with the NT sgRNAs, cells treated with EIF2AK1 sgRNAs had a significantly higher frequency of terminally differentiated cells (clusters 1 and 7; Fig. 4D; Supplementary Fig. S4D). These cells were characterized by the highest expression of both CD71 and CD235a (cluster 1) or CD235a (cluster 7; Supplementary Fig. S4E) and genes involved in heme metabolism (Supplementary Fig. S4F; Supplementary Table S10). Differential expression analysis of these clusters revealed that, as expected, EIF2AK1 sgRNA–treated MDS-RS erythroblasts were characterized by the downregulation of genes involved in the response of EIF2AK1 to heme deficiency (Fig. 4E), including the major effectors ATF4 and DDIT3 (Supplementary Fig. S4G). EIF2AK1-mediated pathway downregulation was associated with the significantly increased expression of genes that regulate heme metabolism and are involved in erythroid differentiation (Fig. 4F; Supplementary Fig. S4H). Similar results were obtained from bulk RNA-seq analysis of three other SF3B1-mutant MDS-RS samples treated with NT or EIF2AK1 sgRNAs at 13 days of erythroid differentiation (Supplementary Fig. S4I). Notably, downregulation of EIF2AK1 signaling was associated with an increased expression of genes involved in heme biosynthesis and mitochondrial iron transport, including all regulators of protoporphyrin IX biosynthesis, such as ALAD, ALAS2, HMBS, UROD, ABCB6, CPOX, TMEM14C, PPOX, and FECH (Supplementary Fig. S4J). Given that the missplicing of TMEM14C diminishes porphyrin abundance within the mitochondria, resulting in iron accumulation and heme deficiency, and that the increased expression of TMEM14C partially overcomes RS formation (12), these results may explain why EIF2AK1-depleted erythroblasts undergo terminal differentiation. As the SF3B1MT burden did not differ between NT sgRNA– and EIF2AK1 sgRNA–treated MDS-RS cells (Supplementary Fig. S4K), these data suggest that EIF2AK1 inhibition overcomes the arrest of erythroid differentiation induced by SF3B1MT.
Despite mechanistic insights into SF3B1MT-induced tumorigenesis, effective therapies targeting SF3B1MT-induced aberrations in MDS have not yet been developed, and therapeutic approaches based on global splicing inhibition have fallen short of expectations. Results from a phase I clinical trial of the oral SF3B1 modulator H3B-8800 in patients with myeloid neoplasms did not recapitulate the results of the preclinical studies performed using mouse models (33). These studies demonstrate that the widespread inhibition of splicing activity does not yield significant clinical improvements (34). Taken together, these data suggest that alternative therapeutic approaches are needed and that targeting specific downstream effectors of SF3B1MT in erythroblasts may represent a novel strategy to overcome ineffective erythropoiesis before other secondary genetic lesions lead to MDS progression.
In our study, we used single-cell technologies to elucidate the biological mechanisms of the SF3B1MT-induced arrest of erythropoiesis. We demonstrated that SF3B1MT significantly inhibit the differentiation of Ortho-E to Pre-E, thus impairing the release of red blood cells into the PB. We analyzed the molecular pathways that were transcriptionally upregulated in Ortho-E independently of splicing alterations and identified the EIF2AK1 response to heme deficiency as a potential driver of the SF3B1MT-induced arrest of erythroid differentiation. EIF2AK1 pathway activation in terminally differentiated SF3B1-mutant erythroblasts resulted in increased ATF4 expression, which in turn upregulated the expression of genes involved in autophagy and RS survival. We would, therefore, hypothesize that EIF2AK1 signaling exacerbates RS formation and anemia in SF3B1-mutant MDS-RS patients by reducing oxidative stress in response to heme deficiency through the EIF2AK1-induced inhibition of globin protein and biosynthetic heme enzyme expression and translation (25, 35). Consistent with this hypothesis, EIF2AK1 inhibition significantly increased the expression of both mitochondrial heme biosynthetic enzymes and iron transporters and improved the terminal differentiation of SF3B1-mutant MDS-RS erythroblasts (Fig. 5). We could not directly assess RS formation in our functional validation studies owing to the limited number of primary cells available for our assays.
Growing evidence suggests that the EIF2AK1 pathway affects erythropoiesis in response to several types of insults. The CRISPR/Cas9-mediated deletion of EIF2AK1 in sickle erythroblasts increases fetal hemoglobin production and reduces sickling in cultured erythroid cells (31, 32). EIF2AK1-mediated fetal hemoglobin regulation proceeds via the modulation of the fetal hemoglobin repressor BCL11A (36). A clinical study based on the transplantation of autologous HSCs transduced by lentiviral vectors delivering shRNAs targeting BCL11A in patients with sickle cell disease demonstrated a significant improvement of anemia and other disease manifestations (37), suggesting that pharmacologically inhibiting of EIF2AK1 may lead to similar clinical outcomes. Thus, EIF2AK1 inhibition represents one of the most promising therapeutic approaches for sickle cell anemia to date.
The EIF2AK1 effector DDIT3 is overexpressed in SF3B1WT MDS HSCs (38). DDIT3 overexpression delays the erythroid differentiation of MDS CD34+ cells by affecting the activation of erythroid transcriptional programs. The inhibition of DDIT3 in CD34+ cells isolated from MDS patients with anemia restores proper erythroid differentiation (38). Our study showed that the activation of the EIF2AK1 pathway occurs in the later stages of erythroblast maturation, as SF3B1MT-induced aberrant splicing primarily affected the function of terminally differentiated erythroblasts with the highest mitochondrial content.
Although the EIF2AK1-mediated response to heme deficiency is not completely conserved in mice, mouse models in which the gene Eif2ak1 has been knocked out have normal hematologic indices (36). Moreover, EIF2AK1 depletion is well tolerated in human erythroid cells in steady-state conditions (31, 32), as also confirmed in our study. These data suggest that the EIF2AK1 pathway could be a therapeutic target for not only sickle cell disease but also MDS-RS or other types of disorders with impaired red blood production.
In conclusion, our study highlights the importance of dissecting the conserved molecular mechanisms of the erythroid stress response to develop novel approaches that achieve lasting hematologic remission in patients with different types of ineffective erythropoiesis or aberrant erythroblast functions.
Human Primary Samples
BM aspirates from 17 patients with SF3B1-mutant MDS-RS who were referred to the Department of Leukemia at MD Anderson Cancer Center, the Cleveland Clinic Foundation, the University of California San Diego, or the Josep Carreras Leukaemia Research Institute were obtained after the approval of the corresponding Institutional Review Boards and in accordance with the Declaration of Helsinki. Diagnosis of MDS-RS was assigned according to the World Health Organization criteria (39). SF3B1MT were determined by a targeted amplicon-based next-generation sequencing (40). All available samples from patients with MDS-RS and SF3B1MT were included in the study. Baseline BM aspirates were collected from patients before any treatment. For patients for whom baseline samples were available, sequential BM samples were collected during HMA treatment. The clinical characteristics of the individuals with SF3B1-mutant MDS-RS are shown in Supplementary Table S1. BM samples from HDs were obtained from AllCells, the Department of Stem Cell Transplantation at MD Anderson Cancer Center, and the Cleveland Clinic Foundation. Written informed consent was obtained from all donors.
MNCs were collected from each BM sample immediately after BM aspiration using the standard gradient separation approach with Ficoll-Paque PLUS (catalog number #45-001-752, Thermo Fisher Scientific). MNCs were cryopreserved and stored in liquid nitrogen until use. For cell-sorting applications, MNCs were enriched in CD34+ cells using magnetic-activated cell sorting (MACS) with the CD34 Microbead Kit (catalog number #130-046-702, Miltenyi Biotec) and further purified by fluorescence-activated cell sorting (FACS) as described below.
Generation of the SF3B1K700E Cell Line
The K562 cell line was obtained by ATCC in 2017. Mycoplasma testing was routinely performed on the cell line, and cell identity was validated by short tandem repeat DNA fingerprinting before performing any experiment. Cells were cultured in 10% FBS (catalog number #S11150, Atlanta Biologicals) Iscove's modified Dulbecco's medium (catalog number #12440053, Thermo Fisher Scientific). Cells for genome editing were used after 4 passages from reception. The CRISPR/Cas9-based extreme genome editing system developed by Biocytogen (https://biocytogen.com/gene-editing) was used to generate the heterozygous SF3B1K700E cell line. The nonconserved region of the SF3B1 intron 14 (near exon 15) was targeted by Cas9 endonuclease using the following guide RNA sequence: 5′-GGACAGCTGTCCTAAAATTT GGG-3′. The repaired template DNA included at least 1-kb 5′ and 3′ homologous arms, the puromycin gene, and the K700E mutation in exon 15 (the sequence of the repaired DNA template can be provided upon request). CRISPR plasmids and template DNA (Biocytogen) were introduced into the K562 cells by electroporation. Single clones were selected by puromycin (catalog number #194539, Sigma-Aldrich) treatment (1.5 μg/mL). Mutant clones were screened by PCR using the following primers: 5′-GCATCAAGCTTGGTACCGATATTCTTATGGGCTGTGCCATCTTGC-3′ and 5′-ACTTAATCGTGGAGGATGATAGGTAATTGGTGGATTTACCTTTCCTCT3′.
SF3B1K700E mutation was confirmed by DNA sequencing. To induce erythroid differentiation, parental and SF3B1K700E K562 cells were treated with 30 μmol/L hemin (catalog number # 51280, Sigma-Aldrich) for 3 days, as described previously (23).
Fluorescence-Activated Cell Sorting
FACS of Lin−CD34+ cells was performed using antibodies against CD2 (RPA-2.10), CD3 (SK7), CD14 (MφP9), CD19 (SJ25C1), CD20 (2H7), CD34 (581), CD56 (B159), and CD235a (HIR2; all from BD Biosciences); CD4 (S3.5), CD11b (ICRF44), and CD33 (P67.6; all from Thermo Fisher Scientific); CD7 (6B7; BioLegend); and CD10 (SJ5-1B4; Leinco Technologies), as we described previously. Samples were acquired with a BD Influx Cell Sorter (BD Biosciences).
MACS-purified CD34+ cells were subjected to a three-phase in vitro culture system as described previously (36). Briefly, for phase I medium, StemSpan SFEM II (catalog number #09605, Stem Cell Technologies) was supplemented with 100 ng/mL of human stem cell factor (catalog number #300-07, PeproTech), 1 ng/mL of IL3 (catalog number #200-03, PeproTech), 3 units per mL of erythropoietin (catalog number #100-64, PeproTech), 200 μg/mL of holotransferrin (catalog number #T4132, Sigma-Aldrich), 10 μg/mL of insulin (catalog number # I9278, Sigma-Aldrich), 5% human male A/B plasma (catalog number #H4522, Sigma-Aldrich), and 10 μg/mL of heparin (catalog number #7980, Stem Cell Technologies). For phase II medium, IL3 was withdrawn after 9 days of culture. For phase III medium, the cells were cultured with StemSpan SFEM II supplemented with 3 units per mL of erythropoietin, 1 mg/mL of holotransferrin, 10 μg/mL of insulin, 5% human A/B plasma, and 10 μg/mL of heparin. Cells were maintained in phase I from the day of collection until day 8, at which time the cells underwent ribonucleoprotein-based gene editing and were transitioned to phase II medium. On day 13 of culture, the cells were transitioned to phase III medium. Samples for western blot and flow cytometry analyses were harvested on day 13 or 15 of culture, respectively.
To evaluate whether HMA therapy induces erythroblast differentiation, erythroblasts derived from SF3B1-mutant CD34+ MDS-RS cells were cultured in phase I differentiation medium for 8 days and then treated with vehicle or 0.1 μmol/L 5-azacitidine (catalog number #A2385, Sigma-Aldrich) for 3 days. Samples for flow cytometry analysis were harvested on day 13 of culture.
Ribonucleoprotein Electroporation in CD34+ Cells
Ribonucleoprotein-based gene editing was performed as described previously (32). Briefly, CD34+ cells were electroporated with the previously validated EIF2AK1-targeting sgRNAs TTGTTGGCTATCACACCGCG and ATAGTCGAGAGAAACAAGCG or the nontargeting sgRNAs GCACTACCAGAGCTAACTCA and GTACGTCGGTATAACTCCTC, together with the Cas9 Nuclease V3 (catalog number #1081060, Integrated DNA Technologies) using the P3 Primary Cell 4D-Nucleofector X Kit S (catalog number # V4XP-3032, Lonza). Chemically modified sgRNAs were purchased from Synthego.
For erythroblast staging analysis, HD and SF3B1-mutant MDS-RS BM MNCs were analyzed as described previously (22). Briefly, MNCs were stained with the following antibodies: PE-CD45 (clone HI30, BD Biosciences), PE-CY7-CD235a (clone HI264, BioLegend), APC-CD49d (clone 9F10, BioLegend), and FITC–band 3 (a generous gift from Dr. Xiuli An, New York Blood Cancer Center).
For erythroid differentiation analyses, cells on day 15 of culture were stained with antibodies against PE-CD45, APC-CD71 (clone mA712, BD Biosciences), and FITC-CD235a (clone HIR2, BD Biosciences). Dead cells were excluded with DAPI staining (catalog number D1306, Thermo Fisher Scientific). Samples were acquired with a BD LSRFortessa Cell Analyzer, and the cell populations were analyzed using FlowJo software (version 10.5.3, FlowJo, LLC).
Cells were washed with a 10% FBS/phosphate-buffered saline solution (catalog numbers #16-140-07 and #14-190-250, respectively, Thermo Fisher Scientific), and pellets were resuspended in the Mammalian Cell and Tissue Extraction Kit buffer (catalog number #K269-500, BioVision Incorporated) and incubated for 15 minutes with gentle shaking. Lysates were then collected after centrifugation at 12,000 rpm for 20 minutes at 4°C. The amount of protein was quantified using the Qubit Protein Assay Kit (catalog number #Q33212, Thermo Fisher Scientific) and a Qubit Fluorometer (Thermo Fisher Scientific). Sodium dodecyl sulfate–polyacrylamide gel electrophoresis and western blotting were performed following standard protocols. Blotted membranes were incubated with primary polyclonal antibodies against EIF2AK1 (MBS2538144, Mybiosource), phospho-EIF2S1 (ab32157, Abcam), LC3B (2775, Cell Signaling), and vinculin (clone hVIN-1, Sigma-Aldrich), and with secondary anti-mouse and anti-rabbit digital antibodies (R1005 and R1006, respectively, Kindle Biosciences LLP). Membranes were developed using the SuperSignal West Pico PLUS Chemiluminescent Substrate (catalog number #34580, Thermo Fisher Scientific) in a KwikQuant Imager (Kindle Biosciences LLP).
Lin−CD34+ and MNC samples were enriched by FACS. Sample preparation and sequencing were performed at MD Anderson Cancer Center's Advanced Technology Genomics Core. Sample concentration and cell suspension viability were evaluated using a Countess II FL Automated Cell Counter (Thermo Fisher Scientific). Samples were normalized for input onto the Chromium Single-Cell A Chip Kit (catalog number #PN120236, 10X Genomics), and single cells were lysed and barcoded for reverse transcription. Equal amounts of each uniquely indexed sample library were pooled together. Pooled libraries were sequenced using a NovaSeq6000 SP 100-cycle flow cell (Illumina). Sequencing analysis was carried out using 10X Genomics’ Cell Ranger software, version 3.0.2.
The adapter trimming was done automatically using the Cell Ranger software. Alignment was performed using “Star” as a part of the Cell Ranger analysis. Cell Ranger default settings were used for all quality thresholds. The sequencing was run as a paired-end using the following run format: 28 cycles Read 1, 8 cycles Index 1, 0 cycles Index2, and 91 cycles Read2. Barcodes and unique molecular identifiers were sequenced in Read1, and the transcriptional information was sequenced in Read2. The samples were single-indexed.
After sequencing, raw reads were aligned to the human genome (hg38), and the digital expression matrix was generated using cellranger count. Individual samples were merged to generate the digital expression matrix using cellranger aggr. The R package Seurat was used to analyze the digital expression matrix. Cells with fewer than 100 genes and fewer than 500 unique molecular identifiers detected were removed from further analysis. The Seurat function NormalizeData was used to normalize the raw counts. Variable genes were identified using the FindVariableGenes function. The Seurat ScaleData function was used to scale and center expression values in the data set for dimensional reduction. Default parameters were used for the Seurat functions. When needed, samples were integrated using the Seurat functions FindIntegrationAnchors and IntegrateData. Principal component analysis (PCA) and uniform manifold approximation and projection (UMAP) were used to reduce the dimensions of the data, and the first two dimensions were used in plots. To cluster the cells and determine the marker genes for each cluster, we used the FindClusters and FindAllMarkers functions, respectively. Differential expression analysis of the samples was performed using the FindMarkers function and the Wilcoxon rank-sum test. The Benjamini–Hochberg procedure was applied to adjust the false discovery rate.
Total RNA-seq Analysis
Total RNA from NT sgRNA– or EIF2AK1 sgRNA–treated SF3B1-mutant MDS-RS CD34+ cells at 13 days of erythroid differentiation was purified using the PicoPure RNA Isolation Kit (catalog number #KIT0204, Thermo Fisher Scientific). Quality checks and quantifications were performed with the 2100 Expert Bioanalyzer (Agilent). Illumina low input total RNA libraries were prepared using the SMARTer Stranded Total RNA-seq Kit v2–Pico Input Mammalian (catalog number #634412; Takara Bio). Briefly, 10 ng of total RNA was converted to cDNA using Takara's SMART (Switching Mechanism At 5′ End of RNA Template) technology. Illumina adapters with specific barcodes (Supplementary Table S11) were added to the double-stranded cDNA using 12 cycles of PCR. The PCR products were purified using Ampure Beads (Agencort); then, ribosomal cDNA was specifically depleted using ZapR V2. The remaining cDNA fragments were enriched by 12 cycles of PCR amplification using Illumina universal library primers. The enriched library was purified to yield the final cDNA library. Libraries were combined and sequenced on the Illumina NextSeq500 sequencer using the High Output 150 cycle flow cell with the 75nt PE format. The mapping of the RNA-seq reads to the human reference genome hg19 was performed with Tophat2. The quality of the reads was verified using the FASTQC software (v0.11.8). HTseq software (v0.11.2) was used to summarize the gene-expression counts from the Tophat2 alignment data after sorting the BAM files. Unsupervised analysis was performed using PCA and hierarchical clustering to verify any outliers and the overall similarities or differences among all samples. Differential expression analysis was performed using the DEseq2 package, as we described previously (42).
The scATAC Low Cell Input Nuclei Isolation protocol (10X Genomics) was used to isolate nuclei from FACS-purified cells. The trypan blue exclusion assay was performed to check for intact nuclei. The remaining extracted nuclei were used for the consecutive steps of the scATAC-seq library preparation protocol following 10X Genomics guidelines. Equal molar concentrations of uniquely indexed samples were pooled together. Pooled libraries were sequenced using a NextSeq500 150-cycle flow cell (Illumina). Reads were aligned to the human genome (hg38), and peaks were called using the cellranger-atac count pipeline. Individual samples were merged using the cellranger-atac pipeline to generate the peak-barcode matrix and TF-barcode matrix. To identify specific TF activity for each cell cluster, we used the R package Seurat to analyze the TF-barcode matrix. The raw counts were normalized by the sequencing depth for each cell and scaled for each TF using the NormalizeData and ScaleData functions. PCA and UMAP were applied to reduce the dimensions of the data, and the first two dimensions were plotted. The FindClusters function was used to cluster the cells. The FindAllMarkers function was used to determine the TF markers for each cluster. Differential analysis of TF activity in the samples was performed using the FindMarkers function and the Wilcoxon rank-sum test. Cluster identity was determined based on the activity of master regulators of lineage commitment, as we (43) and others (44, 45) described previously.
Briefly, the HSC cluster was defined based on the lowest activity of lineage-specific markers and highest activity of TFs that regulate stemness, such as the retinoic acid receptor, NRF, and EGR families of TFs. Lymphoid and myeloid bipotent-biased clusters were defined based on the coactivities of major regulators of lymphoid differentiation (e.g., POU4F, TBX families of TFs) and myeloid differentiation (RUNX1 and CEBPA); myeloid clusters were defined based on the highest activities of the TFs SPI1 and CEBPA (master regulators of myelopoiesis); erythroid/megakaryocyte clusters were defined based on the highest activity of the GATA family of TFs.
Cluster-specific peaks were determined using the FindAllMarkers function, and differentially accessible peaks between the samples were determined using the FindMarkers function. Each peak was associated with a specific gene based on its distance to that gene's transcription start site (TSS). Peaks overlapping with a promoter region (−1,000 bp, +100 bp) of any TSS were annotated as peaks in promoters, whereas peaks not in promoter regions but within 200 kb of the closest TSS were annotated as peaks in the distal elements. Peaks not mapped in either the promoters or distal elements were annotated as peaks in intergenic regions.
Targeted Deep Sequencing
Selected samples were sequenced at MD Anderson's Advanced Technology Genomics Core facility. We used a custom SureSelect panel of 200 genes (Agilent Technologies) that are recurrently mutated in hematologic malignancies. DNA from MNCs was fragmented and bait-captured in solution according to the manufacturer's protocols. Captured DNA libraries were then sequenced using a NovaSeq sequencer (Illumina) with 76-bp paired-end reads. Raw sequencing data from the Illumina platform were converted to fastq format and aligned to the hg19 reference genome using the Burrows-Wheeler Aligner. The aligned BAM files were subjected to mark duplication, realignment, and recalibration using Picard and GATK (https://www.broadinstitute.org/gatk/guide/best-practices?bpm=DNAseq). Preprocessed BAM files were then analyzed to detect single-nucleotide variants and small indels using the MuTect and Pindel algorithms, respectively, against a virtual normal sequence developed in-house.
Transmission Electron Microscopy
Fixed MNC samples were washed in 0.1 mol/L sodium cacodylate buffer (catalog number #RT12310, Electron Microscopy Sciences) and treated with 0.1% Millipore-filtered cacodylate buffered tannic acid (catalog number #RT21700, Electron Microscopy Sciences), postfixed with 1% buffered osmium tetroxide (catalog number #RT19172, Electron Microscopy Sciences), and stained with 1% Millipore-filtered uranyl acetate (catalog number #RT22400, Electron Microscopy Sciences). The samples were dehydrated in increasing concentrations of ethanol, and then infiltrated and embedded in LX-112 medium (catalog number #LX112-21212, Ladd Research Industries). The samples were polymerized in a 60°C oven for approximately 3 days. Ultrathin sections were cut in an Ultracut microtome (Leica), stained with uranyl acetate and lead citrate (catalog number #RT17800, Electron Microscopy Sciences), and examined with a JEM 1010 transmission electron microscope (JEOL) at an accelerating voltage of 80 kV. Digital images were obtained using an AMT Imaging System (Advanced Microscopy Techniques Corp).
Flow cytometry data were analyzed with Prism 8 software (GraphPad). The figure legends indicate the statistical test(s) used in each experiment. Statistical significance was represented as *, P < 0.05; **, P < 0.01; ***, P < 0.001; ****, P < 0.0001. Figure 5 was made using Biorender.com.
data sets generated by scRNA-seq are accessible at GEO under accession number GSE188367. data sets generated by bulk RNA-seq are accessible at GEO under accession number GSE202722.
J.P. Maciejewski reports grants from the Leukemia and Lymphoma Society, the U.S. Department of Defense, and NIH during the conduct of the study. R. Bejar reports other support from Aptose Biosciences, personal fees from Gilead and Epizyme outside the submitted work. J.S. Carew reports other support from Majestic Therapeutics, LLC outside the submitted work; in addition, J.S. Carew has a patent 9926326 issued. S. Halene reports grants and other support from NIH and other support from The Frederick A DeLuca Foundation during the conduct of the study; personal fees from FORMA Therapeutics outside the submitted work; in addition, S. Halene has a pending patent. V. Santini reports personal fees from BMS, AbbVie, Geron, Gilead, Takeda, Novartis, and Menarini outside the submitted work. K. Clise-Dwyer reports grants from NCI during the conduct of the study. No disclosures were reported by the other authors.
V. Adema: Data curation, formal analysis, methodology. F. Ma: Data curation, software, methodology. R. Kanagal-Shamanna: Methodology. N. Thongon: Methodology. G. Montalban-Bravo: Data curation. H. Yang: Methodology. S.A. Peslak: Supervision. F. Wang: Software, formal analysis. P. Acha: Methodology. F. Sole: Supervision. P. Lockyer: Methodology. M. Cassari: Methodology. J.P. Maciejewski: Supervision. V. Visconte: Methodology. I. Ganan-Gomez: Methodology. Y. Song: Resources, formal analysis. C. Bueso-Ramos: Supervision. M. Pellegrini: Supervision. T.M. Tan: Methodology. R. Bejar: Supervision. J.S. Carew: Methodology. S. Halene: Supervision. V. Santini: Supervision. G. Al-Atrash: Supervision. K. Clise-Dwyer: Methodology. G. Garcia-Manero: Resources, supervision. G.A. Blobel: Supervision. S. Colla: Conceptualization, resources, supervision, investigation, writing–original draft, writing–review and editing.
This work was supported by philanthropic contributions to The University of Texas MD Anderson AML/MDS Moon Shot, by the NIH through MD Anderson's Leukemia SPORE grant (P50 CA100632), by a Ladies Leukemia League grant to R. Kanagal-Shamanna, and by an NIH/NIDDK 5U54DK106857 grant to the Animal Modeling Core of the Yale Cooperative Center of Excellence in Hematology. S. Colla is a Scholar of the Leukemia and Lymphoma Society. N. Thongon was supported by the 2020 ASH restart award. This work used MD Anderson's South Campus Flow Cytometry and Cellular Imaging Core, the Advanced Technology Genomics Core, and the High-Resolution Electron Microscopy Facility, which are supported in part by the NIH (NCI) through MD Anderson's Cancer Center Support Grant (P30 CA16672). The authors thank Joseph Munch and Kelly Soltysiak for assistance with manuscript editing.
Note: Supplementary data for this article are available at Blood Cancer Discovery Online (https://bloodcancerdiscov.aacrjournals.org/).