Plasmablastic lymphoma (PBL) is an aggressive B-cell non-Hodgkin lymphoma associated with immunodeficiency in the context of human immunodeficiency virus (HIV) infection or iatrogenic immunosuppression. While a rare disease in general, the incidence is dramatically increased in regions of the world with high HIV prevalence. The molecular pathogenesis of this disease is poorly characterized. Here, we defined the genomic features of PBL in a cohort of 110 patients from South Africa (15 by whole-exome sequencing and 95 by deep targeted sequencing). We identified recurrent mutations in genes of the JAK–STAT signaling pathway, including STAT3 (42%), JAK1 (14%), and SOCS1 (10%), leading to its constitutive activation. Moreover, 24% of cases harbored gain-of-function mutations in RAS family members (NRAS and KRAS). Comparative analysis with other B-cell malignancies uncovered PBL-specific somatic mutations and transcriptional programs. We also found recurrent copy number gains encompassing the CD44 gene (37%), which encodes for a cell surface receptor involved in lymphocyte activation and homing, and was found expressed at high levels in all tested cases, independent of genetic alterations. These findings have implications for the understanding of the pathogenesis of this disease and the development of personalized medicine approaches.

Significance:

Plasmablastic lymphoma is a poorly studied and extremely aggressive tumor. Here we define the genomic landscape of this lymphoma in HIV-positive individuals from South Africa and identify pervasive mutations in JAK–STAT3 and RAS–MAPK signaling pathways. These data offer a genomic framework for the design of improved treatment strategies targeting these circuits.

See related commentary by Küppers, p. 23.

This article is highlighted in the In This Issue feature, p. 5

Plasmablastic lymphoma (PBL) is a highly aggressive lymphoma of preterminally differentiated B cells, which predominantly occurs in patients with human immunodeficiency virus (HIV)-related or iatrogenic immunodeficiency (1). As an AIDS-defining illness with a dismal prognosis, PBL is a particularly compelling problem in the Sub-Saharan African region, which accounts for approximately 54% of the estimated 37.9 million people living with HIV.

Despite the national rollout offering combination anti-retroviral therapy (cART) in South Africa since 2004, the prevalence of HIV-associated mature B-cell lymphomas increases yearly (2). The burden of HIV-associated lymphomas on the country's encumbered health are further compounded by the younger presentation age of HIV-infected patients, the challenging classification of these lymphomas, and an exceptionally aggressive clinical disease course that, although curable in a significant fraction of patients (3), often results in fatalities (4).

Previously being classified as diffuse large B-cell lymphoma (DLBCL), PBL was later recognized as a distinct entity (5) with well-defined histopathologic features including large plasmablastic or immunoblastic cell morphology, loss of B-cell lineage markers (CD20, PAX5), expression of plasmacytic differentiation markers (CD38, CD138, IRF4/MUM1, BLIMP1), a high proliferation index, and frequent infection by the Epstein-Barr virus (EBV; ref. 6). While EBV infection and activating MYC translocations have been reported as the major features of these tumors in a number of studies (7), the molecular pathology of PBL remains elusive in both HIV-positive and -negative individuals. Numerous case studies and some cohorts have been well described and reviewed (6–9). Only two studies have partially explored genetic aberrations in this disease: a comparative genomic hybridization study showed a pattern of segmental gains in PBL that more closely resembled DLBCL than plasma cell myeloma (10). The second study compared the transcriptional profiles of 15 PBL cases to DLBCL and extraosseous plasmacytoma (11), and showed that B-cell receptor signaling genes including CD79A/B, BLK, LYN, and SWAP70 among others, were significantly downregulated in PBL compared with DLBCL; in contrast, targets of MYB and the oncogene MYC, as well as genes reflecting the known plasmacytic immunophenotype of PBL, were overexpressed. More recently, MYC and PRDM1 were investigated for mutations, structural rearrangements, and copy number gains in 36 PBL cases (12). PRDM1 mutations were found in 8 of 16 cases, frequently in association with MYC overexpression, suggesting a coordinate role in the pathogenesis of the disease. While these studies have shed light on this rare disorder, a systematic characterization of protein-changing alterations in PBL has not been performed.

The elucidation of genes and pathways that drive the initiation and maintenance of PBL is essential to better understand the biology of this cancer and, critically, to implement improved biomarkers and more effective treatment options. Here, we combined whole-exome and transcriptome sequencing followed by targeted resequencing of 110 HIV-associated PBL cases to elucidate the mutational, transcriptional, and copy number landscape of this disease. We show that mutations in various components of the JAK–STAT and MAPK–ERK pathway pervade this lymphoma, revealing this signaling cascade as a central oncogenic driver of the disease and a candidate for targeted therapy.

The Landscape of Somatic Mutations in PBL

To determine the mutational landscape of PBL, we performed whole-exome sequencing (WES) in a discovery panel of paired tumor and normal DNA collected from 15 HIV-positive patients, followed by deep targeted sequencing of the top 34 candidate genes in 95 additional cases (Supplementary Table S1 and S2; Methods). Somatic mutations were identified from WES data using SAVI2 (13), an empirical Bayesian method. Overall, 2,149 nonsynonymous somatic variants were found in the 15 discovery cases, with a median of 45 per case and a total of 1,528 affected genes, of which 1,461 were mutated in the tumor-dominant clone (>15% cancer-specific allele frequency; Supplementary Table S3). Among the 15 WES cases, one was hypermutated (case PJ030), showing 845 sequence variants that were confirmed by RNA-sequencing (RNA-seq; 92% with a read depth ≥ 10; Supplementary Fig. S1A–S1C). Candidate genes were then selected for an extension screen based on the following criteria: (i) mutated in at least 2 discovery cases, (ii) expressed in normal and/or malignant B cells, (iii) known as a cancer driver gene, and/or (iv) with an established role in B-cell differentiation (Supplementary Tables S4 and S5; Methods). The 34 selected genes were all annotated in the Catalogue of Somatic Mutations in Cancer (COSMIC) database. The mean depth of coverage for WES was 54.0x. The mean depth of coverage for the targeted DNA sequencing was 121.7x and, on average, 99.7% of the target sequences were covered by at least 50 reads.

We found genes in the JAK–STAT, MAPK–ERK, and Notch signaling pathways were commonly mutated in our PBL cohort (Fig. 1AE). The most frequent genetic lesions affected the JAK–STAT signaling pathway with, in total, 62% of cases (68/110) harboring a mutation in at least one of five genes (STAT3, JAK1, SOCS1, JAK2, and PIM1, a direct transcriptional target of STAT3/5 that functions as part of a negative feedback loop; Fig. 1A). Among these, STAT3 was the predominant target of mutations, with 46 of 110 cases (42%) harboring clonal events (Supplementary Fig. S2A and S2B). With three exceptions, the identified variants were heterozygous missense mutations clustering in exons 19 to 22, encoding part of the SH2 domain that is required for STAT3 molecular activation via receptor association and tyrosine phosphodimer formation (14). In particular, the majority of the mutations resulted in the amino acid changes Y640F (n = 11), D661V (n = 9), S614R (n = 5), and E616G/K (n = 4;Fig. 1B), which have been categorized as gain-of-function or likely gain-of-function mutations based on experimental validation (https://oncokb.org/gene/STAT3), and overlap with the STAT3 mutation pattern described in other aggressive B-cell lymphomas and T-cell and natural killer (NK) cell malignancies (15, 16). Three additional cases showed a >75% variant allele frequency in the absence of copy number changes, consistent with a copy neutral loss of heterozygosis. Sanger-based resequencing of the involved STAT3 exons, performed in 23 cases, validated all computationally identified mutations on these samples (Fig. 1F; Supplementary Table S6).

Figure 1.

The landscape of putative driver gene mutations in PBL. A, Sample information, MYC translocation, and somatic mutation information are shown for 110 cases of PBL samples. The heatmap represents individual mutations in each sample, color-coded by type of mutation. B–E, Individual gene mutation maps for frequently mutated genes, showing mutation subtype, position, and evidence of mutational hotspots, based on COSMIC database information. Y-axis counts at the bottom of the maps reflect the number of identified mutations in the COSMIC database. F, Sanger validation of single nucleotide variants (SNV) in STAT3 mutants.

Figure 1.

The landscape of putative driver gene mutations in PBL. A, Sample information, MYC translocation, and somatic mutation information are shown for 110 cases of PBL samples. The heatmap represents individual mutations in each sample, color-coded by type of mutation. B–E, Individual gene mutation maps for frequently mutated genes, showing mutation subtype, position, and evidence of mutational hotspots, based on COSMIC database information. Y-axis counts at the bottom of the maps reflect the number of identified mutations in the COSMIC database. F, Sanger validation of single nucleotide variants (SNV) in STAT3 mutants.

Close modal

In addition to STAT3, 15/110 PBL cases harbored heterozygous missense mutations of JAK1, encoding for a tyrosine kinase that phosphorylates STAT proteins. These events did not involve the canonical hotspot seen in many other cancers (loss-of-function K860Nfs), but occurred at a highly conserved amino acid position in the JH1 kinase domain, G1097D/V (Fig. 1C). Mutations at the JAK1 G1097 codon were previously reported in intestinal T-cell lymphomas (17) and anaplastic large cell lymphoma (15), and the G1097D substitution has been documented as a gain-of-function event that triggers aberrant phosphorylation of STAT3, leading to constitutive activation of the JAK–STAT signaling pathway (15). Of note, missense mutations in STAT3 and JAK1 were found to be mutually exclusive (P = 0.04, Fisher exact test), suggesting a convergent role in activating this cascade. In contrast to STAT3 and JAK1, mutations in SOCS1 (n = 11/110 cases) were more widely distributed, consistent with its previously established role as a target of AID-mediated aberrant somatic hypermutation (Fig. 1D). Although the consequences of most SOCS1 amino acid changes will require functional validation, the presence of a start-loss mutation in one case and a nonsense mutation in a second sample is expected to result in loss-of-function and inactivation of this negative JAK/STAT regulator, consistent with a tumor suppressor role.

The second most commonly mutated program was the MAPK–ERK signaling pathway, affected in 28% of cases by mutations in the RAS gene family members NRAS (14%) and KRAS (9%), as well as in BRAF (5.5%) and MAP2K1 (3%). Nearly all (92.5%) RAS mutations were found at known functional hotspots that have been shown to affect the intrinsic RAS GTPase activity, namely G12, G13, and Q61 (ref. 18; Fig. 1E). BRAF, an integral component of the MAPK signaling cascade, was found mutated in 6 cases, including 3 harboring mutations at or around the well-characterized V600 residue (namely: V600E, K601N, and T599TT) and 3 showing mutations at other common hotspots within the protein kinase domain (G464E, V471F, and M689V). The Valine at position 600 normally stabilizes the interaction between the BRAF glycine-rich loop and the activation segments, and its glutamine substitution confers an over 500-fold increase in activity, leading to constitutive activation of the MEK/ERK signaling cascade in the absence of extracellular stimuli. In line with their predicted gain of function, the mutant alleles in both RAS family members and BRAF were observed in heterozygosis (Supplementary Tables S3 and S5) and were actively expressed (median RPKM value of KRAS: 13.253, NRAS: 35.601, and BRAF: 9.906).

In addition, 24% of PBL cases carried mutations in genes implicated in the Notch signaling pathway, including those encoding for NOTCH1, its negative regulator SPEN, and the Notch pathway corepressor NCOR2. In particular, one sample displayed a frameshift variant in the NOTCH1 gene that is predicted to generate a truncated protein lacking the C-terminal PEST domain, and thus endowed with increased protein half-life, while 6 additional cases showed amino acid changes within the EGF repeats, the juxtamembrane heterodimerization domain (N-terminal portion), and the C-terminal PEST domain. SPEN mutations include monoallelic missense substitutions that were distributed along the protein coding exons with no particular clustering, and their functional effect remains to be determined.

Missense mutations, frequently multiple within the same allele, were also found in the MYC gene (10/110, 9%), with 4 additional cases displaying silent mutations that possibly reflect the aberrant activity of the physiologic somatic hypermutation mechanism (19) as well as the recruitment of the AID mutator enzyme by the juxtaposed immunoglobulin enhancer in cases harboring chromosomal MYC translocations. MYC rearrangements with the immunoglobulin genes are the most common cytogenetic feature in PBL, present in about half of reported cases (7, 12, 20), and were identified in 36% (31/76) of our samples using FISH (Fig. 1A). The functional consequences of MYC mutations will have to be experimentally tested; however, transcriptomic analysis revealed significantly higher expression of the MYC RNA in cases positive for the MYC translocation, consistent with MYC oncogenic activation (Supplementary Fig. S3A and S3B). The high proportion of cases showing evidence of MYC dysregulation reinforces the notion that MYC is a critical contributor to this aggressive cancer.

Aside from the genes mentioned above, we observed recurrent mutations, including truncating loss-of-function events, in TET2 (10/110, 9%), TP53 (10/110, 9%), and NPHP4 (6/110, 5%). Finally, genes encoding epigenetic modifiers, transcription factors implicated in B-cell activation (FOXP1), positioning (KLF2), and terminal differentiation (PRDM1), and receptor molecules involved in tumor immune surveillance (e.g., B2M, TNFRSF14) were mutated to a lesser extent (range: 1%–5%; Fig. 1A).

Although the relatively small number of cases prevents robust statistical analyses, we did not find any significant difference in the rate of mutated genes between samples collected from the oral cavity and from other locations (Pearson correlation coefficient = 0.822, Supplementary Fig. S2B), suggesting a genetic homogeneity of the disease regardless of the tissue of origin.

Collectively, the data presented above uncover a pervasive role for mutations affecting the JAK–STAT and MAPK–ERK pathways in the genetic landscape of PBL, which may contribute to the pathogenesis of this lymphoma by enforcing constitutive signaling activation, in concert with dysregulated MYC activity.

Somatic Copy Number Changes

To identify recurrent copy number alterations (CNA) associated with PBL, we applied the SNP-FASST2 algorithm to WES data from the 15 discovery cases (Supplementary Fig. S4A), followed by validation in an independent cohort of 31 additional PBL samples that had been processed with Affymetrix OncoScan microarrays (ref. 21; Supplementary Table S7). Comparison of CNA calls obtained in parallel from the WES and OncoScan approach in a subset of 9 samples confirmed the data were highly consistent with each other (Supplementary Fig. S4B), supporting the robustness of the analysis.

Frequent copy number gains involved large chromosomal regions (>10 Mb) including chromosome 1q (20/46 cases, 43%) and the whole or most of chromosome 7 (13/46 cases, 28%). When applying the GISTIC 2.0 algorithm to the combined cohort of 46 cases, we uncovered regions of highly recurrent amplification including 6p22.2 (the most significant), 6p22.1 and 1q21.3, all of which encompass histone gene clusters (Fig. 2A). In particular, the chromosome 6p gain covered 36 genes that encode canonical histones and has been previously reported as a common alteration in a variety of cancers, where histone gains have been linked to genetic instability (22). The significant region on chromosome 1q21.3 also included the IL6R gene and the antiapoptotic MCL1 gene, which showed increased gene expression (Fig. 2C) analogous to what has been described in 26% of activated B-cell like (ABC) DLBCLs (23). Although further studies will be needed to determine the functional impact of these alterations in PBL, chromosome 1q gains have been associated with unfavorable prognosis in multiple myeloma, suggesting a role in the pathobiology of the disease.

Figure 2.

Recurrent copy number changes in PBL. A, GISTIC 2.0 results showing recurrent copy number changes in PBL samples. The green line indicates q-value = 1.0 × 10−6. B, A zoomed-in view of 11p13 on 17 cases of PBL, which shows consistent focal copy number gains of CD44. The figure was generated by the IGV browser using CNV segment files from SNP-FASST2 algorithm. C, Scatter plot representations of genes located in regions with recurrent copy number gains in PBL (q-value <1.0 × 10−6, GISTIC 2.0). The horizontal axis indicates the −log10(q-value) from the GISTIC report, and the vertical axis is the median gene expression level (normalized RPKM value) from PBL RNA-seq data (n = 20). D, IHC showing strong CD44 protein expression on tumor cells membrane in a representative sample (A10), which has a focal CD44 copy number gain.

Figure 2.

Recurrent copy number changes in PBL. A, GISTIC 2.0 results showing recurrent copy number changes in PBL samples. The green line indicates q-value = 1.0 × 10−6. B, A zoomed-in view of 11p13 on 17 cases of PBL, which shows consistent focal copy number gains of CD44. The figure was generated by the IGV browser using CNV segment files from SNP-FASST2 algorithm. C, Scatter plot representations of genes located in regions with recurrent copy number gains in PBL (q-value <1.0 × 10−6, GISTIC 2.0). The horizontal axis indicates the −log10(q-value) from the GISTIC report, and the vertical axis is the median gene expression level (normalized RPKM value) from PBL RNA-seq data (n = 20). D, IHC showing strong CD44 protein expression on tumor cells membrane in a representative sample (A10), which has a focal CD44 copy number gain.

Close modal

The second most significant focal CNA was a chromosome 11p13 regional gain targeting genes CD44 and PDHX, present in 17 of 46 cases (37%; Fig. 2AC). CD44 is a nonkinase transmembrane glycoprotein that is induced in B cells upon antigen-mediated activation and is critically involved in multiple lymphocyte functions, including migration, homing, and the transmission of signals that regulate apoptosis. This CD44 protein is also thought to increase cancer cells’ adaptive plasticity in response to the microenvironment, thus giving them a survival and growth advantage (24). Analysis of 20 samples with available RNA-seq data revealed that CD44 was consistently and highly expressed in cases harboring copy number gains (n = 2), whereas PDHX levels were undetectable, indicating that CD44 is the specific target of the 11p13 amplicon. However, all samples displayed high CD44 mRNA expression, independent of the presence of genetic aberrations (Fig. 2C); consistently, IHC analysis with a specific CD44 antibody showed very strong membranous staining in all cases tested (Fig. 2D) and confirmed this finding in an independent panel of 38 cases. Of note, although CD44 expression can be detected in normal plasma cells at both RNA and protein levels, the signal was markedly lower than in plasmablastic lymphoma cells, suggesting that elevated expression of CD44 does not simply reflect the cellular ontogeny of these tumors, and that alternative regulatory mechanisms may lead to CD44 upregulation in cases lacking CNAs (Supplementary Fig. S5A–S5C; Supplementary Table S8).

Plasmablastic Lymphoma Displays a Distinct Genetic and Transcriptional Program

To explore the genomic features of PBL in relation to other lymphoid neoplasms arising from the mature B-cell lineage, we performed unsupervised clustering analysis based on mutation frequency of the top mutated genes from three cancer types obtained from public repositories (Fig. 3A). The analysis included chronic lymphocytic leukemia (CLL; ref. 25), diffuse large B-cell lymphoma (DLBCL) transcriptionally defined as ABC and germinal center B-cell-like (GCB) subtypes (26), and multiple myeloma (27). As expected on the basis of their presumed derivation from B cells committed to plasma cell differentiation, the mutational landscape of PBL was overall closer to multiple myeloma than to other mature B-cell malignancies, with mutations in RAS family members being detected in as many as 20% of cases in both diseases, while rare in DLBCL (Fig. 3A). Conversely, the mutational landscape of PBL was highly distinct from that of both GCB- and ABC-DLBCL (Fig. 3A). In particular, mutations affecting the methyltransferase KMT2D and acetyltransferase CREBBP, two among the most commonly mutated genes in DLBCL (28), were absent in plasmablastic lymphoma. ABC-DLBCL–specific mutations such as CD79A/B and MYD88 were also lacking in PBL. Of note, STAT3 was the top mutated gene in our cohort at significantly different frequencies compared with other B-cell malignancies, making it a hallmark of PBL (Fig. 3A). The differential genetic landscape of PBL is consistent with its status as a distinct entity among mature B-cell neoplasms.

Figure 3.

Comparative analysis of PBL and other B-cell malignancies. A, Unsupervised clustering based on frequencies of the most recurrently mutated genes from PBL, multiple myeloma, CLL, and two main subtypes of DLBCL (Methods). B, Hierarchical clustering of mRNA expression profiles across plasmablastic lymphoma, multiple myeloma, CLL, DLBCL and normal B cells, including centroblast (CB), naïve B (NB), and memory B cells (MB). Note that the JAK–STAT signaling pathway does not appear in this figure because only the top 1,000 most aberrantly expressed genes were selected for this analysis. Functional enrichment analysis was performed using g:profiler.

Figure 3.

Comparative analysis of PBL and other B-cell malignancies. A, Unsupervised clustering based on frequencies of the most recurrently mutated genes from PBL, multiple myeloma, CLL, and two main subtypes of DLBCL (Methods). B, Hierarchical clustering of mRNA expression profiles across plasmablastic lymphoma, multiple myeloma, CLL, DLBCL and normal B cells, including centroblast (CB), naïve B (NB), and memory B cells (MB). Note that the JAK–STAT signaling pathway does not appear in this figure because only the top 1,000 most aberrantly expressed genes were selected for this analysis. Functional enrichment analysis was performed using g:profiler.

Close modal

To define the transcriptional profile of HIV-positive PBL and to identify unique signatures that may distinguish it from other lymphoma types, we performed RNA-seq analysis of 20 PBL samples (including 12 of the 15 discovery cases), and compared their transcriptome to that of normal B-cell subsets and other B-cell lymphoma types previously characterized in our laboratories and/or obtained from public repositories, including germinal center centroblasts (CB), naïve B cells (NB), and memory B cells (MB; ref. 29), as well as CLL (30), DLBCL (26), and multiple myeloma cell lines (31). As expected, hierarchical clustering of the top 1,000 most aberrantly expressed genes revealed that PBL and multiple myeloma were closer to each other compared with other B-cell malignancies and to normal B cells, reflecting their presumed cell of origin (Fig. 3B). Consistently, plasmablastic lymphoma and multiple myeloma lacked expression of common B-cell markers (CD19, CD20, CD40, and PAX5) and transcription factors involved in the germinal center reaction (BCL6, BCL7A, BCL11A, and SPIB), whereas expression of the master regulator of plasma cell differentiation PRDM1 and other plasma cell markers (CD138, XBP1, and IRF4) were increased (Supplementary Fig. S6). MYC expression was also higher in PBL and multiple myeloma. Other notable differences included the upregulation of IL6R, a known STAT3 target, and the downregulation of SWAP70, previously suggested as a potential biomarker of PBL (Supplementary Fig. S6; ref. 11). Finally, recent studies have suggested that EBV positive PBLs evade immune recognition by expressing the programmed cell death protein 1 (PD-1) and its PD-L1 ligand (32). We confirmed high expression of PD-L1 in our PBL cohort, although PD-1 did not follow this pattern (Supplementary Fig. S6).

We then performed functional enrichment analysis (g:profiler) to identify biological pathways that are preferentially enriched in PBL as compared with other B-cell malignancies. Whereas DLBCL and CLL were enriched for B-cell related biological programs (e.g., B-cell activation and proliferation, lymphocyte activation and differentiation, and adaptive immune response), multiple myeloma was enriched for mitotic cell-cycle processes, with highly expressed genes including MCM10, BIRC5, CENPE, BUB1, and AURKA (Fig. 3B). The most significantly enriched pathways in PBL were related to chromatin/nucleosome assembly (Fig. 3B); particularly, histone genes encoding basic nuclear proteins were highly expressed in PBL, possibly related to the recurrent copy number gains/amplification encompassing this gene cluster on chromosomes 6p22.2 and 1q21.3 (Supplementary Fig. S7A–S7D). However, the biological significance of this observation in relation to tumorigenesis remains unknown.

Activation of the JAK–STAT Pathway in Plasmablastic Lymphoma

Given the elevated frequency of mutations targeting the JAK–STAT pathway, we used computational and IHC approaches to assess the extent of activation of this signaling cascade in PBL (Methods). To this end, we first interrogated the genome-wide transcriptional profile of PBL for enrichment in the JAK–STAT signaling pathway using previously identified signatures available in the MSigDB database. This analysis revealed a significant positive enrichment for genes implicated in this pathway, consistent with constitutive activation (Fig. 4A). This signature was expressed at higher levels in PBL when compared with multiple myeloma and CLL (Fig. 4B), with the most significant difference being observed between PBL and multiple myeloma (t test, P = 4.7 × 10−11), which is congruent with the fact that the latter had few mutations in this pathway (Fig. 3A). We then performed IHC staining with anti-STAT3 and anti-phospho-STAT3 antibodies to assess the STAT3 subcellular localization and phosphorylation, used as a read-out for signaling activation. We observed strong positive nuclear signal in 61% of tested cases (19/31), which also stained positive for the phospho-STAT3 antibody. Of note, evidence of constitutively active STAT3 was detected even in the absence of genetic alterations affecting this pathway, suggesting alternative (epigenetic) mechanisms of activation and indicating a prominent role for this cascade in the pathogenesis of the disease (Fig. 4C and D; Supplementary Fig. S8; Supplementary Table S9).

Figure 4.

Activation of JAK–STAT pathway in PBL. A, Preranked gene set enrichment analysis indicating the significant positive enrichment of KEGG JAK–STAT signaling pathway in PBL. The preranked gene list was generated on the basis of the median expression level on PBL samples. B, Enrichment comparisons of JAK–STAT pathway between PBL and other B-cell malignancies by performing single-sample GSEA (ssGSEA). Pairwise P values were derived from t test. C and D, IHC plots showing pSTAT3 protein expression in >75% of tumor cells, confirming STAT3 activation in STAT3 mutated cases. Results using anti-pSTAT3 antibody are photographed at ×100 and ×400 magnification.

Figure 4.

Activation of JAK–STAT pathway in PBL. A, Preranked gene set enrichment analysis indicating the significant positive enrichment of KEGG JAK–STAT signaling pathway in PBL. The preranked gene list was generated on the basis of the median expression level on PBL samples. B, Enrichment comparisons of JAK–STAT pathway between PBL and other B-cell malignancies by performing single-sample GSEA (ssGSEA). Pairwise P values were derived from t test. C and D, IHC plots showing pSTAT3 protein expression in >75% of tumor cells, confirming STAT3 activation in STAT3 mutated cases. Results using anti-pSTAT3 antibody are photographed at ×100 and ×400 magnification.

Close modal

Virus Detection in Plasmablastic Lymphoma

To analyze the viral and bacterial make-up of PBL tumors, we used the Pandora pipeline, which extracts and aligns nonhost genetic material from tumor RNA-seq data (Methods). Potentially pathogenic species were then identified by applying the BLAST algorithm against the NCBI database of viruses and bacteria reference genomes. In our PBL cohort, 12 of 20 samples contained HIV-1 transcripts, with only three samples exceeding 100 mapped reads (Supplementary Fig. S9). In addition, EBV transcripts were detected in 18 of 20 samples, HCMV (human cytomegalovirus, or human betaherpesvirus 5) in 3 of 20 samples, and KSHV (Kaposi sarcoma-associated herpes virus, human herpes virus 8) in one sample (Supplementary Fig. S9). These three herpes viruses have a broad tropism, naturally infect B cells, and are known to be associated with many tumor types.

EBV reactivation is considered a major driver of PBL, predominantly with a latency I infection program although low levels of LMP1 gene expression can be detected (33). We thus performed a detailed RNA profiling of the EBV genome. Recapitulating previous reports, all PBL cases tested by in situ hybridization were positive for EBV-encoded RNA (EBER), while only 4 of 13 samples showed expression of the LMP1 protein on IHC (Supplementary Table S1). When using the levels of BLLF1 and EBNA2, two genes that are invariably not expressed in PBL, as threshold for positive calls, we noted that nearly the entire viral genome was transcribed at background levels in the majority of samples (Fig. 5). This is analogous to what has been observed in nonreplicating infected B cells, where a background of transcripts can be detected in the absence of protein expression, due to regulation at the ribosomal level (34). However, several genes in the BamHI-A region of the virus were abundantly transcribed in at least 50% of cases across the cohort, including those encoding for components of the viral replication machinery (namely, the DNA polymerase catalytic subunit BALF5, the single-stranded DNA-binding protein BALF2, and the lytic origin of DNA replication oriLyt), the BART encoded protein RPMS1, the viral envelope glycoprotein BALF4, and the G-protein–coupled receptor BILF1, which is expressed predominantly during the immediate early phases of infection in vitro (35, 36). LF1, LF2, and LF3 were also highly expressed in PBL, but their functional roles are less understood. Expression of the LMP-1 mRNA was detected in 17 of 20 samples, but at much lower levels (Fig. 5), consistent with the known EBV latency programs of PBL (33).

Figure 5.

EBV transcription programs in PBL. Heatmap illustrating the full genome expression of EBV in the RNA-seq data of PBL samples (virus gene counts are normalized per million host reads). 16/17 EBV-positive samples show significantly higher expression of lytic genes such as BALF4 (encoding the envelope glycoprotein B) and BALF5 (encoding the DNA polymerase catalytic subunit) compared with the expression of canonical latency programs.

Figure 5.

EBV transcription programs in PBL. Heatmap illustrating the full genome expression of EBV in the RNA-seq data of PBL samples (virus gene counts are normalized per million host reads). 16/17 EBV-positive samples show significantly higher expression of lytic genes such as BALF4 (encoding the envelope glycoprotein B) and BALF5 (encoding the DNA polymerase catalytic subunit) compared with the expression of canonical latency programs.

Close modal

Our study provides a comprehensive snapshot of the genetic landscape of HIV-associated PBL and reveals highly recurrent somatic mutations affecting the JAK–STAT3 and RAS–MAPK signaling pathway as a genetic hallmark of this disease, with STAT3 representing the most prominent target. These findings underscore a central role for this transcription factor in the pathogenesis of PBL and have implications for the diagnosis and treatment of these diseases.

The STAT3 protein is an important player in multiple immune cells where it modulates a variety of physiological processes. Within the B-cell lineage, a selective role has been recognized for this factor in the differentiation of B cells into plasma cells upon antigen stimulation, as documented by both in vitro and in vivo studies (14). Briefly, in response to CD40L- and IL21-mediated signaling by T follicular heper cells, STAT3 is phosphorylated by activated JAK kinases, translocates into the nucleus as homo- or hetero-dimers, and activates the transcription of multiple targets, including the plasma cell master regulator BLIMP1. In turn, BLIMP1 downregulates BCL6 expression, an absolute prerequisite to GC exit and plasma cell differentiation (14). The STAT3 amino acid changes identified in our study include experimentally documented gain-of-function events that are predicted to have oncogenic effects by enhancing its phosphorylation and transactivation potential (37). In addition, other well-documented genetic mechanisms were found that can activate the JAK–STAT signaling pathway, including mutually exclusive upstream mutations of JAK1 or JAK2 (18/110 cases) and the loss of the STAT3 negative regulator SOCS1 (mutated in 11/110 cases). Notably, a targeted sequencing study published while this manuscript was under revision identified recurrent STAT3 mutations in 5 of 42 cases, all of which were EBV positive (38). Thus, STAT3 dysregulation may contribute to the pathogenesis of PBL by rendering these cells signaling-independent while providing proliferation and survival signals.

Constitutive activation of the JAK–STAT signaling pathway has been reported in a number of solid and hematologic malignancies and plays a central role in two lymphomas that are immunophenotypically closely related to plasmablastic lymphoma: primary effusion lymphoma (PEL) and ALK-positive large B-cell lymphoma (LBCL). PEL is a rare and aggressive AIDS-defining disease, which is associated with infection by HHV8 and is clinically distinguishable from PBL by the presence of lymphomatous effusion in body cavities. In these cells, constitutive STAT3 activity is achieved via expression of the HHV8 viral protein IL6, which contributes to the disease in an autocrine fashion by promoting proliferation and survival (39). In ALK-positive LBCL, STAT3 activation is sustained by the ALK kinase mediated by chromosomal translocations with the CLTCL gene or the NPM1 gene (40, 41). However, direct genetic alterations of STAT3 are rare in mature B-cell lymphomas. In particular, while multiple genomic hits leading to potentiation of the JAK–STAT oncogenic pathway have been detected in 87% of Hodgkin lymphomas as well as in primary mediastinal B-cell lymphomas, the most commonly affected STAT member in these tumors is STAT6 (42–45). The high incidence of STAT3 mutational activation in HIV-associated PBL points to STAT or JAK inhibitors as promising treatment options in this lymphoma type. While anti-STAT3 therapeutic attempts are still in development, JAK inhibitor therapy, currently used in the clinical setting, was shown to be an effective antagonist to STAT3 activation, inducing apoptosis in both anaplastic large T-cell lymphomas and ovarian cancer (46).

The second important finding of this study is the identification of frequent hotspot mutations in RAS–MAPK family members. Functional RAS activation is a common molecular feature of multiple myeloma, particularly in the relapsed/refractory setting, while it is rarely observed in de novo DLBCL, suggesting a specific role in the pathogenesis of plasma cell dyscrasias. Interestingly, and different from multiple myeloma, NRAS, and KRAS mutations in PBL were never concurrently observed in the same case and were often well represented in the dominant tumor clone, consistent with early events. These data have direct implications for the clinical exploration of treatments inhibiting this pathway.

Our study also showed overexpression of the transmembrane glycoprotein CD44, frequently associated with copy number gains/amplifications at this locus. CD44 is an adhesion molecule that mediates cellular interaction with the microenvironment and participates in the trafficking of neoplastic cells in multiple myeloma, CLL, and ALL (47); moreover, CD44 was shown to increase cell resistance to apoptosis and to enhance cancer cell invasiveness. Whereas the role of CD44 in PBL requires functional dissection, its high expression is likely to compound the aggressiveness of the disease, as previously described in DLBCL of the ABC subtype (48). In light of this, successful anti-CD44–targeted therapy in a mice xenograft model of human multiple myeloma may in future represent an attractive therapeutic option for PBL (49).

The finding of increased histones mRNA abundance in PBL, together with recurrent copy number gains encompassing this gene cluster on chromosome 6, is of interest because it emerged as a distinctive feature of this disease compared with other lymphoid malignancies. Histones represent a basic component of the chromatin structure and their involvement may suggest a selective role for nucleosomal plasticity in the pathogenesis of PBL, which warrants further investigations.

Due to the small number of EBV-negative cases in our cohort (n = 2) and the overall rarity of this subset, we could not assess whether EBV status is associated with differing genetic features, as recently reported for Burkitt lymphoma (50) and suggested by the evidence of distinct transcriptomic profiles between EBV-positive and –negative PBL (32). Moreover, it remains to be determined whether regional differences exist with PBL occurring outside the context of HIV immunodeficiency, or within HIV-associated PBL, given the high homogeneity of our cohort from the Gauteng region in South Africa. Thus, additional work will be required to specifically address these questions.

We found that most of the EBV genome was transcribed at very low levels in the majority of PBL cases, while prominent expression was detected for several genes in the BamHI-A region of the virus, including some early lytic genes. However, transcription of the key early lytic gene BZLF1 was noticeably absent, suggesting an incomplete lytic program. Supporting this notion, a recent study showed that increased STAT3 expression, as observed in the majority of PBL cases, decreases the susceptibility of latently infected cells to EBV lytic activation signals via an RNA-binding protein PCBP2 (51). Thus, further protein expression studies, in particular for BRLF1 and for the BLLF1-encoded viral envelope glycoprotein gp350, are required to assess whether the observed transcriptome program translates in lytic infection. Indeed, recent translation ribosome profiling studies have clearly demonstrated a marked heterogeneity of lytic genes translation and complex levels of intracellular translation repression mechanisms at work in infected B cells (34).

In conclusion, the results of our study characterize HIV-associated PBL as a distinct subset of aggressive B-cell lymphoma and significantly contribute to our knowledge about the molecular pathogenesis of this disease through the identification of recurrently mutated genes, uncovering a major role for the dysregulation of JAK–STAT3 and RAS–MAPK signaling pathways. These results reveal new points of potential therapeutic intervention in these patients.

For complete experimental details and computational analyses, see also Supplementary Methods.

Patient Cohorts

Prospective samples (n = 15 with paired tumor-normal tissue; discovery cohort) were collected from patients with suspected PBL, upon informed written consent in line with the Declaration of Helsinki, and diagnosis was further confirmed by two independent pathologists. Matched normal DNA for these 15 patients was extracted from peripheral blood samples that were documented to be tumor-free. For targeted sequencing (n = 95 samples; extension cohort), formalin-fixed paraffin-embedded (FFPE) material was retrieved from the archives of the Department of Oral Pathology and Anatomical Pathology, National Health Laboratory Service and the University of the Witwatersrand. This panel included 36 nonoral PBL samples that were part of a published cohort (52). The study protocol was approved by the local Human Research Ethics Committee (IRB Reference M150390). A summary of demographics and phenotypic markers of the discovery cohort are displayed in Supplementary Table S1, whereas demographic information for the extension cohort is included in Supplementary Table S3. For downstream nucleic acid extraction, the tumor area (>70% tumor cells) was ringed for microdissection in all samples. A third cohort of 31 well-characterized PBL samples obtained as archived FFPE material (local Human Research Ethics Committee IRB reference M101171 and 96/2011; Supplementary Table S7) was used to validate and refine copy number aberration results observed in the WES data by using Microarray OncoScan (21).

WES and RNA-Seq

Both exome and RNA-seq library preparations and sequencing were outsourced to Centrillion Genomics Services and BGI (AmericasCorporation). Whole-exome libraries were prepared using the Agilent SureSelect Human All Exon V6 Kit (Agilent Technologies) and sequenced on HiSeq 2500 using TruSeq SBS v2 Reagent Kit (Illumina) at 2 × 100 bp paired-end reads with on-target coverage of 100X per sample. RNA-seq libraries were prepared using the Illumina TruSeq Stranded Total RNA Sample Prep Kit (Illumina). Flow cells with multiplexed samples were run on the HiSeq 2500 using an Illumina TruSeq SBS v2 Reagent Kit at 2 × 100 bp paired-end reads and a coverage of 50M reads per sample.

Mutation Calling

Fastq files were aligned to the human genome assembly (hg19) using the Burrows–Wheeler Aligner (version 0.6.2). Before further analysis, the initially aligned BAM files were subjected to preprocessing that sorted, indexed, and marked duplicated reads using SAMtools (version 1.7) and Picard (version 1). To identify somatic mutations from WES data for tumor samples with matched blood control, we applied the variant-calling software SAVI2, based on an empirical Bayesian method as published (13). Somatic mutations were identified on the basis of the final report of SAVI2, and following five additional criteria: (i) not annotated as a synonymous variant, intragenic variant, or intron variant; (ii) not annotated as a common SNP (dbSnp138); (iii) a variant allele frequency of >5% in the tumor sample; (iv) a variant allele read depth of <2 in the matched normal control; and (v) variant associated reads with overall mismatch rate ≤ 0.02 as estimated by bam-readcount (version 0.8.0, https://github.com/genome/bam-readcount). All mutations described throughout this manuscript refer to nonsynonymous mutations, unless otherwise specified.

Design of Targeted Sequencing Panel

The full coding exons of 34 genes found recurrently mutated in the discovery cohort and/or previously implicated in lymphoma were analyzed in an extension panel of 95 cases (Supplementary Table S3) by targeted capture and next generation sequencing. Genes were selected according to the following criteria: (i) allele frequency > 15%; (ii) mutated in at least 2 of 15 cases; (iii) expressed in normal or transformed B cells; and (iv) functionally annotated. In addition, we included 16 genes that were only mutated in one sample but have known roles in the pathogenesis of lymphoma and 4 genes that were not found mutated in the discovery cohort but have been previously implicated in PBL. Genes lacking clear functional annotation and/or known to represent common nonspecific mutational targets in sequencing studies (e.g., TTN, PCLO) were excluded. The complete gene list is reported in Supplementary Table S4.

Targeted Next-Generation Sequencing

The entire coding region of the 34 selected genes was isolated using the IDT xGen Predesigned Gene Capture Custom Target Enrichment Technology (Integrated DNA Technologies) and subjected to library preparation and next-generation sequencing on the Illumina HiSeq platform with 2 × 150 bp configuration. Targeted sequencing was performed at GENEWIZ. Read alignments and conventional preprocessing were conducted as described for the WES analysis. For samples lacking matched normal control, variants were filtered out if found in dbSNP database (dbSNP138) as well as in any normal sample of the WES cohort.

Copy Number Analysis

For copy number analysis from WES data (n = 15), the Biodiscovery Multiscale BAM Reference Builder (53) was used to construct a multiscale reference (MSR) file from 14 paired normal samples alignments (BAM). The MSR file was used as reference for copy-number variation (CNV) calling of all tumor alignments with the SNP-FASST2 algorithm (54), using Nexus Copy Number, v10.0 (BioDiscovery, Inc.; ref. 54). Gains and losses were defined as at least +0.3 and -0.3 log2 ratio changes, respectively, in the tumor alignment.

To validate CNV calling from WES data, 31 additional samples were processed on Oncoscan FFPE Express Arrays (Affymetrix, Thermo Fisher Scientific) (Supplementary Table S7) according to the manufacturer's instructions, followed by scanning on a GeneChip Scanner 3000 7G, with the Affymetrix GeneChip Command Console (AGCC). Paired A+T and G+C CEL files were combined and analyzed with Chromosome Analysis Suite v3.3.0.139 (ChAS, Applied Biosystems, Thermo Fisher Scientific), using the OncoScan CNV workflow for FFPE, without manual recentering. We confirmed that copy number calls from OncoScan and WES data were consistent with each other by performing OncoScan on 7 cases that had been sequenced by WES and comparing the calls obtained from the two methods. Probe genomic coordinates were aligned to hg19 and the resulting OSCHP files were analyzed by Nexus Copy Number v10.0 using the SNP-FASST2 algorithm (54) with default parameters.

To detect recurrent copy number aberrations, we applied the GISTIC 2.0 algorithm using GenePattern (https://www.genepattern.org/) to the copy number segmentations of the combined cohort of 46 patients (15 cases analyzed by WES, 31 cases analyzed by OncoScan). Recurrent regions of copy number aberrations with q-value < 1.0 × 10−6 were considered significant. Furthermore, we assessed the expression level of each gene within the significant GISTIC peaks using the median value of quantile normalized RPKM from RNA-seq data (n = 20).

Pandora: A High-Performing Pipeline for Quantifying the Bacterial and Viral Microenvironment of Bulk RNA-Seq Samples

Pandora is an open-source pipeline (https://github.com/RabadanLab/Pandora) that takes as input the total RNA sequence reads from a single sample and outputs the spectrum of detected microbial transcripts, focusing on bacteria and viruses. The Pandora workflow is divided into the following four main modules: (i) mapping to the human host genome using STAR and Bowtie2 to filter out the host reads from downstream analysis; (ii) de novo assembly of host-subtracted short reads using Trinity (55) to create contiguous full-length transcripts (contigs), which help increase the accuracy of alignment to the correct species of origin in the next step; (iii) identification of the most likely species of origin for each assembled contig with BLAST; and (iv) filtering and parsing of the BLAST results into a final report on the detected microbial abundances. We also performed gene expression profiling of all the reads that mapped to the EBV genome (56) using FeatureCounts (57) to fully characterize lytic and latent EBV programs.

Data Availability

The data that support the findings of this study are available upon request. The sequencing data have been deposited in NCBI Sequence Read Archive (SRA) under accession number PRJNA598849.

No potential conflicts of interest were disclosed.

Z. Liu: Conceptualization, investigation, visualization, methodology, writing-original draft, writing-review, and editing. I. Filip: Investigation, methodology, and writing-original draft. K. Gomez: Investigation, methodology, writing-review, and editing. D. Engelbrecht: Resources, validation, investigation, and visualization. S. Meer: Resources and investigation. P. Lalloo: Resources, validation, and investigation. P. Patel: Resources, validation and investigation.Y. Perner: Resources and investigation. J. Zhao: Investigation, visualization, and methodology. J. Wang: Investigation, writing-review, and editing. L. Pasqualucci: Conceptualization, resources, supervision, funding acquisition, investigation, methodology, writing-original draft, project administration, writing-review, and editing. R. Rabadan: Conceptualization, supervision, funding acquisition, investigation, methodology, project administration, writing-review, and editing. P. Willem: conceptualization, resources, supervision, funding acquisition, validation, investigation, visualization, writing-original draft, project administration, writing-review, and editing.

This work has been funded by NIH grants R21 CA192854 (to P. Willem, L. Pasqualucci, and R. Rabadan), R01GM117591 and U54 CA193313 (to R. Rabadan), and was initiated under the Columbia-South Africa Training Program for Research on HIV-associated Malignancies D43 CA153715, with the support of Judith Jacobson. We thank Stephen P. Goff and Henri-Jacques Delecluse for their helpful suggestions. We also thank Sonja Boy for providing additional samples for the independent copy number validation cohort, Nicole Crawford for assistance in samples collection and data curation, and Jacky Brown for help with Sanger sequencing. Whole exome capture and sequencing, and RNA sequencing were completed at Centrillion Biosciences, Inc and BGI Tech Solutions (Hong Kong). Targeted DNA sequencing was performed at Genewiz Inc.

The publication costs of this article were defrayed in part by the payment of publication fees. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.

Note: Supplementary data for this article are available at Blood Cancer Discovery Online (http://bloodcancerdiscov.aacrjournals.org/).

1.
Campo E
SH
,
Harris
NL
.
WHO classification of tumors of the hematopoietic and lymphoid tissues
.
In
:
Swerdlow
SH
,
Campo
E
,
Harris
NL
,
editors
.
Lyon, France
:
IARC Press
;
2017
.
p.
321
2
.
2.
Carbone
A
,
Vaccher
E
,
Gloghini
A
,
Pantanowitz
L
,
Abayomi
A
,
de Paoli
P
et al
.
Diagnosis and management of lymphomas and other cancers in HIV-infected patients
.
Nat Rev Clin Oncol
2014
;
11
:
223
38
.
3.
Noy
A
,
Lensing
SY
,
Moore
PC
,
Gupta
N
,
Aboulafia
D
,
Ambinder
R
et al
.
Plasmablastic lymphoma is treatable in the HAART era. A 10 year retrospective by the AIDS Malignancy Consortium
.
Leuk Lymphoma
2016
;
57
:
1731
4
.
4.
Pather
S
,
Mohamed
Z
,
McLeod
H
,
Pillay
K
.
Large cell lymphoma: correlation of hiv status and prognosis with differentiation profiles assessed by immunophenotyping
.
Pathol Oncol Res
2013
;
19
:
695
705
.
5.
Delecluse
HJ
,
Anagnostopoulos
I
,
Dallenbach
F
,
Hummel
M
,
Marafioti
T
,
Schneider
U
et al
.
Plasmablastic lymphomas of the oral cavity: a new entity associated with the human immunodeficiency virus infection
.
Blood
1997
;
89
:
1413
20
.
6.
Castillo
JJ
,
Bibas
M
,
Miranda
RN
.
The biology and treatment of plasmablastic lymphoma
.
Blood
2015
;
125
:
2323
30
.
7.
Valera
A
,
Balague
O
,
Colomo
L
,
Martinez
A
,
Delabie
J
,
Taddesse-Heath
L
et al
.
IG/MYC rearrangements are the main cytogenetic alteration in plasmablastic lymphomas
.
Am J Surg Pathol
2010
;
34
:
1686
94
.
8.
Tchernonog
E
,
Faurie
P
,
Coppo
P
,
Monjanel
H
,
Bonnet
A
,
Algarte Genin
M
et al
.
Clinical characteristics and prognostic factors of plasmablastic lymphoma patients: analysis of 135 patients from the LYSA group
.
Ann Oncol
2017
;
28
:
843
8
.
9.
Teruya-Feldstein
J
,
Chiao
E
,
Filippa
D
,
Lin
O
,
Comenzo
R
,
Coleman
M
et al
.
CD20-negative large-cell lymphoma with plasmablastic features: a clinically heterogenous spectrum in both HIV-positive and-negative patients
.
Ann Oncol
2004
;
15
:
1673
9
.
10.
Chang
CC
,
Zhou
X
,
Taylor
JJ
,
Huang
WT
,
Ren
X
,
Monzon
F
et al
.
Genomic profiling of plasmablastic lymphoma using array comparative genomic hybridization (aCGH): revealing significant overlapping genomic lesions with diffuse large B-cell lymphoma
.
J Hematol Oncol
2009
;
2
:
47
.
11.
Chapman
J
,
Gentles
AJ
,
Sujoy
V
,
Vega
F
,
Dumur
CI
,
Blevins
TL
et al
.
Gene expression analysis of plasmablastic lymphoma identifies downregulation of B-cell receptor signaling and additional unique transcriptional programs
.
Leukemia
2015
;
29
:
2270
3
.
12.
Montes-Moreno
S
,
Martinez-Magunacelaya
N
,
Zecchini-Barrese
T
,
Villambrosia
SG
,
Linares
E
,
Ranchal
T
et al
.
Plasmablastic lymphoma phenotype is determined by genetic alterations in MYC and PRDM1
.
Mod Pathol
2017
;
30
:
85
94
.
13.
Trifonov
V
,
Pasqualucci
L
,
Tiacci
E
,
Falini
B
,
Rabadan
R
.
SAVI: a statistical algorithm for variant frequency identification
.
BMC Syst Biol
2013
;
7
:
S2
.
14.
Hillmer
EJ
,
Zhang
H
,
Li
HS
,
Watowich
SS
.
STAT3 signaling in immunity
.
Cytokine Growth Factor Rev
2016
;
31
:
1
15
.
15.
Crescenzo
R
,
Abate
F
,
Lasorsa
E
,
Tabbo
F
,
Gaudiano
M
,
Chiesa
N
et al
.
Convergent mutations and kinase fusions lead to oncogenic STAT3 activation in anaplastic large cell lymphoma
.
Cancer Cell
2015
;
27
:
516
32
.
16.
Song
TL
,
Nairismagi
ML
,
Laurensia
Y
,
Lim
JQ
,
Tan
J
,
Li
ZM
et al
.
Oncogenic activation of the STAT3 pathway drives PD-L1 expression in natural killer/T-cell lymphoma
.
Blood
2018
;
132
:
1146
58
.
17.
Nicolae
A
,
Xi
L
,
Pham
TH
,
Pham
TA
,
Navarro
W
,
Meeker
HG
et al
.
Mutations in the JAK/STAT and RAS signaling pathways are common in intestinal T-cell lymphomas
.
Leukemia
2016
;
30
:
2245
.
18.
Prior
IA
,
Lewis
PD
,
Mattos
C
.
A comprehensive survey of Ras mutations in cancer
.
Cancer Res
2012
;
72
:
2457
67
.
19.
Pasqualucci
L
,
Neumeister
P
,
Goossens
T
,
Nanjangud
G
,
Chaganti
R
,
Küppers
R
et al
.
Hypermutation of multiple proto-oncogenes in B-cell diffuse large-cell lymphomas
.
Nature
2001
;
412
:
341
6
.
20.
Taddesse-Heath
L
,
Meloni-Ehrig
A
,
Scheerle
J
,
Kelly
JC
,
Jaffe
ES
.
Plasmablastic lymphoma with MYC translocation: evidence for a common pathway in the generation of plasmablastic features
.
Mod Pathol
2010
;
23
:
991
9
.
21.
Boy
SC
,
van Heerden
MB
,
Babb
C
,
van Heerden
WF
,
Willem
P
.
Dominant genetic aberrations and coexistent EBV infection in HIV-related oral plasmablastic lymphomas
.
Oral Oncol
2011
;
47
:
883
7
.
22.
Maya Miles
D
,
Penate
X
,
Sanmartin Olmo
T
,
Jourquin
F
,
Munoz Centeno
MC
,
Mendoza
M
et al
.
High levels of histones promote whole-genome-duplications and trigger a Swe1(WEE1)-dependent phosphorylation of Cdc28(CDK1)
.
Elife
2018
;
7
:
e35337
.
23.
Wenzel
S
,
Grau
M
,
Mavis
C
,
Hailfinger
S
,
Wolf
A
,
Madle
H
et al
.
MCL1 is deregulated in subgroups of diffuse large B-cell lymphoma
.
Leukemia
2013
;
27
:
1381
90
.
24.
Chen
C
,
Zhao
S
,
Karnad
A
,
Freeman
JW
.
The biology and role of CD44 in cancer progression: therapeutic implications
.
J Hematol Oncol
2018
;
11
:
64
.
25.
Landau
DA
,
Tausch
E
,
Taylor-Weiner
AN
,
Stewart
C
,
Reiter
JG
,
Bahlo
J
et al
.
Mutations driving CLL and their evolution in progression and relapse
.
Nature
2015
;
526
:
525
.
26.
Schmitz
R
,
Wright
GW
,
Huang
DW
,
Johnson
CA
,
Phelan
JD
,
Wang
JQ
et al
.
Genetics and pathogenesis of diffuse large B-cell lymphoma
.
N Engl J Med
2018
;
378
:
1396
407
.
27.
Lohr
JG
,
Stojanov
P
,
Carter
SL
,
Cruz-Gordillo
P
,
Lawrence
MS
,
Auclair
D
et al
.
Widespread genetic heterogeneity in multiple myeloma: implications for targeted therapy
.
Cancer Cell
2014
;
25
:
91
101
.
28.
Pasqualucci
L
,
Dalla-Favera
R.
.
Genetics of diffuse large B-cell lymphoma
.
Blood
2018
;
131
:
2307
19
.
29.
Brescia
P
,
Schneider
C
,
Holmes
AB
,
Shen
Q
,
Hussein
S
,
Pasqualucci
L
et al
.
MEF2B instructs germinal center development and acts as an oncogene in B cell lymphomagenesis
.
Cancer Cell
2018
;
34
:
453
65
.
30.
Ramsay
AJ
,
Martinez-Trillos
A
,
Jares
P
,
Rodriguez
D
,
Kwarciak
A
,
Quesada
V
.
Next-generation sequencing reveals the secrets of the chronic lymphocytic leukemia genome
.
Clin Transl Oncol
2013
;
15
:
3
8
.
31.
Chapman
MA
,
Lawrence
MS
,
Keats
JJ
,
Cibulskis
K
,
Sougnez
C
,
Schinzel
AC
et al
.
Initial genome sequencing and analysis of multiple myeloma
.
Nature
2011
;
471
:
467
72
.
32.
Gravelle
P
,
Pericart
S
,
Tosolini
M
,
Fabiani
B
,
Coppo
P
,
Amara
N
et al
.
EBV infection determines the immune hallmarks of plasmablastic lymphoma
.
Oncoimmunology
2018
;
7
:
e1486950
.
33.
Shannon-Lowe
C
,
Rickinson
A
.
The global landscape of EBV-associated tumours
.
Front Oncol
2019
;
9
:
713
.
34.
Bencun
M
,
Klinke
O
,
Hotz-Wagenblatt
A
,
Klaus
S
,
Tsai
MH
,
Poirey
R
et al
.
Translational profiling of B cells infected with the Epstein-Barr virus reveals 5′ leader ribosome recruitment through upstream open reading frames
.
Nucleic Acids Res
2018
;
46
:
2802
19
.
35.
Hammerschmidt
W
,
Sugden
B
.
Replication of Epstein–Barr viral DNA
.
Cold Spring Harb Perspect Biol
2013
;
5
:
a013029
.
36.
Sugimoto
A
,
Sato
Y
,
Kanda
T
,
Murata
T
,
Narita
Y
,
Kawashima
D
et al
.
Different distributions of Epstein-Barr virus early and late gene transcripts within viral replication compartments
.
2013
;
87
:
6693
9
.
37.
Yu
H
,
Lee
H
,
Herrmann
A
,
Buettner
R
,
Jove
R
.
Revisiting STAT3 signalling in cancer: new and unexpected biological functions
.
Nat Rev Cancer
2014
;
14
:
736
46
.
38.
Garcia-Reyero
J
,
Martinez Magunacelaya
N
,
Gonzalez de Villambrosia
S
,
Loghavi
S
,
Gomez Mediavilla
A
,
Tonda
R
et al
.
Genetic lesions in MYC and STAT3 drive oncogenic transcription factor overexpression in plasmablastic lymphoma
.
Haematologica
2020
Apr 9 [Epub ahead of print].
39.
Cousins
E
,
Gao
Y
,
Sandford
G
,
Nicholas
J
.
Human herpesvirus 8 viral interleukin-6 signaling through gp130 promotes virus replication in primary effusion lymphoma and endothelial cells
.
J Virol
2014
;
88
:
12167
72
.
40.
Chiarle
R
,
Simmons
WJ
,
Cai
H
,
Dhall
G
,
Zamo
A
,
Raz
R
et al
.
Stat3 is required for ALK-mediated lymphomagenesis and provides a possible therapeutic target
.
Nat Med
2005
;
11
:
623
9
.
41.
Valera
A
,
Colomo
L
,
Martinez
A
,
de Jong
D
,
Balague
O
,
Matheu
G
et al
.
ALK-positive large B-cell lymphomas express a terminal B-cell differentiation program and activated STAT3 but lack MYC rearrangements
.
Mod Pathol
2013
;
26
:
1329
37
.
42.
Tiacci
E
,
Ladewig
E
,
Schiavoni
G
,
Penson
A
,
Fortini
E
,
Pettirossi
V
et al
.
Pervasive mutations of JAK-STAT pathway genes in classical Hodgkin lymphoma
.
Blood
2018
;
131
:
2454
65
.
43.
Steidl
C
,
Gascoyne
RD
.
The molecular pathogenesis of primary mediastinal large B-cell lymphoma
.
Blood
2011
;
118
:
2659
69
.
44.
Joos
S
,
Kupper
M
,
Ohl
S
,
von Bonin
F
,
Mechtersheimer
G
,
Bentz
M
et al
.
Genomic imbalances including amplification of the tyrosine kinase gene JAK2 in CD30+ Hodgkin cells
.
Cancer Res
2000
;
60
:
549
52
.
45.
Ritz
O
,
Guiter
C
,
Castellano
F
,
Dorsch
K
,
Melzner
J
,
Jais
JP
et al
.
Recurrent mutations of the STAT6 DNA binding domain in primary mediastinal B-cell lymphoma
.
Blood
2009
;
114
:
1236
42
.
46.
Johnson
DE
,
O'Keefe
RA
,
Grandis
JR
.
Targeting the IL-6/JAK/STAT3 signalling axis in cancer
.
2018
;
15
:
234
.
47.
Redondo-Muñoz
J
,
García-Pardo
A
,
Teixidó
J
.
Molecular players in hematologic tumor cell trafficking
.
Front Immunol
2019
;
10
:
156
.
48.
Tzankov
A
,
Pehrs
AC
,
Zimpfer
A
,
Ascani
S
,
Lugli
A
,
Pileri
S
et al
.
Prognostic significance of CD44 expression in diffuse large B cell lymphoma of activated and germinal centre B cell-like types: a tissue microarray analysis of 90 cases
.
J Clin Pathol
2003
;
56
:
747
52
.
49.
Zhong
Y
,
Meng
F
,
Zhang
W
,
Li
B
,
van Hest
JC
,
Zhong
Z
.
CD44-targeted vesicles encapsulating granzyme B as artificial killer cells for potent inhibition of human multiple myeloma in mice
.
J Control Release
2020
;
320
:
421
30
.
50.
Grande
BM
,
Gerhard
DS
,
Jiang
A
,
Griner
NB
,
Abramson
JS
,
Alexander
TB
et al
.
Genome-wide discovery of somatic coding and noncoding mutations in pediatric endemic and sporadic Burkitt lymphoma
.
Blood
2019
;
133
:
1313
24
.
51.
Koganti
S
,
Clark
C
,
Zhi
J
,
Li
X
,
Chen
EI
,
Chakrabortty
S
et al
.
Cellular STAT3 functions via PCBP2 to restrain Epstein-Barr virus lytic activation in B lymphocytes
.
J Virol
2015
;
89
:
5002
11
.
52.
Meer S
PY
,
McAlpine
ED
,
Willem
P
.
Extraoral plasmablastic lymphomas in a high human immunodeficiency virus endemic area
.
Histopathology
2019
;
76
:
212
21
.
53.
Villela
D
,
Costa
SS
,
Vianna-Morgante
AM
,
Krepischi
AC
,
Rosenberg
C
.
Efficient detection of chromosome imbalances and single nucleotide variants using targeted sequencing in the clinical setting
.
Eur J Med Genet
2017
;
60
:
667
74
.
54.
Ostrup
O
,
Ahlborn
LB
,
Lassen
U
,
Mau-Sorensen
M
,
Nielsen
FC
.
Detection of copy number alterations in cell-free tumor DNA from plasma
.
BBA Clin
2017
;
7
:
120
6
.
55.
Grabherr
MG
,
Haas
BJ
,
Yassour
M
,
Levin
JZ
,
Thompson
DA
,
Amit
I
et al
.
Full-length transcriptome assembly from RNA-Seq data without a reference genome
.
Nat Biotechnol
2011
;
29
:
644
.
56.
Arvey
A
,
Tempera
I
,
Tsai
K
,
Chen
HS
,
Tikhmyanova
N
,
Klichinsky
M
et al
.
An atlas of the Epstein-Barr virus transcriptome and epigenome reveals host-virus regulatory interactions
.
Cell Host Microbe
2012
;
12
:
233
45
.
57.
Liao
Y
,
Smyth
GK
,
Shi
W
.
featureCounts: an efficient general purpose program for assigning sequence reads to genomic features
.
Bioinformatics
2013
;
30
:
923
30
.