Abstract
Background: A proportion of the genetic variants involved in susceptibility to Hodgkin lymphoma differ by the tumor's Epstein–Barr virus (EBV) status, particularly within the MHC region.
Methods: We have conducted an SNP imputation study of the MHC region, considering tumor EBV status in 1,200 classical Hodgkin lymphoma (cHL) cases and 5,726 control subjects of European origin. Notable findings were genotyped in an independent study population of 468 cHL cases and 551 controls.
Results: We identified and subsequently replicated a novel association between a common genetic variant rs6457715 and cHL. Although strongly associated with EBV-positive cHL [OR, 2.33; 95% confidence interval (CI), 1.83–2.97; P = 7 × 10–12], there was little evidence for association between rs6457715 and the EBV-negative subgroup of cHL (OR, 1.06; 95% CI, 0.92–1.21), indicating that this association was specific to the EBV-positive subgroup (Phet < P = 10−8). Furthermore, the association was limited to EBV-positive cHL subgroups within mixed cell (MCHL) and nodular sclerosis subtypes (NSHL), suggesting that the association is independent of histologic subtype of cHL.
Conclusions: rs6457715, located near the HLA-DPB1 gene, is associated with EBV-positive cHL and suggests this region as a novel susceptibility locus for cHL.
Impact: This expands the number of genetic variants that are associated with cHL and provides additional evidence for a critical and specific role of EBV in the etiology of this disease. Cancer Epidemiol Biomarkers Prev; 24(12); 1838–43. ©2015 AACR.
This article is featured in Highlights of This Issue, p. 1809
Introduction
Hodgkin lymphoma is a cancer of the lymphatic system characterized by the presence of B-cell derived Hodgkin Reed–Sternberg (HRS) tumor cells (1). Hodgkin lymphoma is relatively rare, but contributes substantially to worldwide disease burden, totaling 66,000 new cases in 2012 and 25,000 deaths (2). It affects mainly young adults ages 15 to 35 years and older adults ages 55 years and over. Classical Hodgkin lymphoma (cHL) is the major form and comprises four histologic subtypes of which the nodular sclerosis Hodgkin lymphoma (NSHL) subtype is most common followed by mixed cellularity Hodgkin lymphoma (MCHL).
In approximately one third of cHL cases in industrialized countries, the HRS cells are a clonally expanded population of Epstein–Barr virus (EBV)-infected cells. Young adult and NSHL cases are less likely to be EBV-positive than older adult and MCHL cases (3); EBV-positive cases are more likely to be male in contrast with more equal gender representation among EBV-negative cases. EBV-positive cases are associated with recent EBV infection, as evidenced by an association with infectious mononucleosis, and also with weakened immunity, for example, HIV infection or iatrogenic immunosuppression after organ transplantation (4–6). The human leukocyte antigen (HLA) genes, located within the MHC region, have been implicated in the etiology of cHL (7–11). Some associations show heterogeneity by tumor EBV status (9, 11–14) reinforcing the importance of acknowledging EBV status in examining the etiology of cHL.
Here, we expanded our investigation of the MHC region based on data from a previous cHL genome-wide association study (GWAS) and performed EBV status–specific analyses using imputation procedures for both SNPs and HLA alleles.
Materials and Methods
Imputation of 791,716 SNPs in human chromosome 6 was conducted using MACH v1.0 (15) and Minimac (version 2010.12.13; ref. 16). The August 2010 release of the 1000 Genomes Project European (CEU) dataset was used as the reference panel to impute genotypes for 1,200 cHL cases (of which 265 were EBV-positive) from the EPILYMPH study, the Scotland and Newcastle Lymphoma Group (SNLG) studies, the Young Adult Hodgkin's Disease Case–Control Study (YHHCCS), the Scandinavian Lymphoma Etiology Study, and the Northern Dutch HL study, and 5,726 controls (a large subset that were part of a GWAS of cHL reported previously; ref. 11; Supplementary Table S1). IRB: the study was approved by the IARC Ethics Committee (project no. 08-21).
Poorly imputed SNPs defined by an R2 < 0.80 and a quality < 0.90 were excluded from the analyses. As a technical validation, we compared imputation dosages and direct genotypes in a subset of 562 individuals. For this, we classified subjects based on imputation dosages as homozygous wild-type (less than 0.3), heterozygous (between 0.7 and 1.3), or homozygous variant (greater than 1.7).
Hodgkin lymphoma EBV status was ascertained through in situ hybridization for EBERs and/or through immunohistochemical staining for EBV LMP-1 protein on formalin-fixed paraffin-embedded tumor samples as described previously (11).
Test of association between imputed SNPs and cHL as well as by subtypes (i.e., NSHL, MCHL, EBV-positive cHL, and EBV-negative cHL) was performed using a probabilistic dosage model in ProbABEL v0.4.3 (17), adjusted for sex and the first informative eight eigenvectors from a principal components analysis (EIGENSOFT 3.0, Broad Institute) derived using a subseries of 11,029 SNPs across the genome to control for potential population stratification (18). In additionally, we adjusted for the effect of rs2734986 (HLA-A), rs6904029 (HCG9), rs2248462 (MICB), rs2395185 (HLA-DRA), and rs6903608 (HLA-DRA), five MHC region SNPs previously associated with cHL risk.
For the replication series, 468 cHL cases and 551 study-specific controls from EPILYMPH, SNLG, YHHCCS, and the Epidemiology and Genetics Lymphoma Case–Control Study had complete genotype data for the six MHC loci of interest (five known loci and rs6457715; Supplementary Tables S1 and S2). One hundred six of the EBV-typed cases were classified as EBV-positive cHL. Genotyping of rs6457715 was performed using TaqMan Pre-Designed SNP Genotyping Assays (Applied Biosystems). The genotype distribution was in accordance with that expected by Hardy–Weinberg equilibrium and the assay had duplicate genotyping concordance rate of >99%.
From the GWAS genotyped data, classical HLA loci A, C, B, DRB1, DRB3-5, DQA1, DQB1, and DPB1 were imputed using HLA*IMP:02 (19). We confirmed the robustness of the imputation process by comparing HLA loci imputation (except DRB3-5) with directly genotyped HLA data (locus-specific PCR followed by sequence-specific oligonucleotide hybridization; ref. 20) in 334 UK and 284 Dutch individuals. The average concordance rate between HLA imputed and directly genotyped data was 93.8% (ranging from 82.8% to 99.3%; Supplementary Table S3).
Lymphoblastoid cell lines (LCLs) were generated from blood samples from 95 healthy controls and 70 posttherapy cHL patients (from blood samples collected at least 1 year after completion of all therapies) by infection with EBV. Genotyping of the LCLs was carried out using a TaqMan SNP assay. Association between HLA–DPB1 gene expression levels and genotype was assessed by linear regression using R.
Results
We performed SNP imputation of 1,200 cHL cases and 5,726 control subjects of European origin to undertake a comprehensive evaluation of the MHC region in total cHL and by EBV tumor status of cHL, while controlling for the effects of previously described susceptibility variants (11). We did not identify any novel signal (P < 10−7) in total cHL or the EBV-negative subgroup, in addition to previously described associations in MHC class I and class II regions (Supplementary Fig. S1A–S1B). However, three imputed genetic variants (rs6457715, rs6457714, and rs6457711) were associated with the EBV-positive subgroup at genome-wide significance levels (P < 10−7; Fig. 1). The rs6457715, rs6457714, and rs6457711 variants showed evidence of linkage disequilibrium (LD) and conditioning on rs6457715 (A/G) was consistent with a single association signal (risk allele frequencies: EBV-positive cHL = 0.87; controls = 0.79). We directly genotyped rs6457715 within a subset (N = 562; see Materials and Methods) of the sample in which we undertook imputation, and the concordance rate for rs6457715 between categorized dosages and direct genotypes was 99.47%, confirming the high accuracy of the imputation process. Within this discovery set, the rs6457715 major allele (A) was strongly associated with an increased risk of EBV-positive cHL [OR, 2.39; 95% confidence interval (CI), 1.80–3.18; P = 1 × 10−9], with no evidence for association in EBV-negative cHL (OR, 1.06; 95% CI, 0.91–1.24; P = 0.45; Supplementary Table S2).
We subsequently directly genotyped rs6457715 in an independent series of 468 cHL cases and 551 controls of European origin and observed similar patterns of association as in the discovery series, including a statistically significant increased risk of EBV-positive cHL [OR, 2.17; 95% CI, 1.36–3.49; P = 0.0013), but not EBV-negative (P = 0.791) or overall cHL (P = 0.622; Supplementary Table S2). The association was more pronounced in the model adjusted for known MHC cHL susceptibility variants in both the validation and replication cohorts (Supplementary Table S2). Combining the imputation-based GWAS and replication results (Fig. 2) demonstrated a 2.3-fold increased risk of EBV-positive cHL associated with rs6457715 (allele A relative to allele G; P = 7.53 × 10−12), but no effect within the EBV-negative subgroup (P = 0.405) and marked evidence for heterogeneity between the EBV-positive and EBV-negative groups (Phet = 3 × 10−8). Although the predominant cHL histologic subtypes tend to be correlated with the tumor EBV status (NSHL and MCHL more likely to be EBV-negative and positive, respectively; ref. 3), the heterogeneity of this association remained when considering the tumor EBV status within the MCHL and NSHL histologic subgroups, implying that EBV status is the main source of heterogeneity (Fig. 2).
We estimated HLA allele genotypes using imputation techniques. Multivariate regression analysis of rs6457715 adjusting for imputed alleles at each HLA locus (i.e., A, C, B, DRB1, DRB3, DRB4, DRB5, DQA1, DQB1, and DPB1) resulted in little attenuation of effect for rs6457715, suggesting that the relationship is independent of these HLA alleles (Supplementary Table S4).
rs6457715 has been suggested to be an eQTL for HLA–DPB1 (21), however, we were unable to detect a clear association (P = 0.15) between HLA–DPB1 gene expression levels and rs6457715 within 165 individuals (70 cHL patients and 95 controls) where cells have been cultured and HLA–DPB1 gene expression levels subsequently assayed by TaqMan real-time PCR.
Modeling a genetic risk score of the three independent MHC alleles associated with EBV-positive cHL [rs2734986, rs6904029 (ref. 11), and rs6457715] suggested that there was a per-allele increased odds of 1.86 for EBV-positive cHL, with the highest quartile of allele carriers having an approximately 7-fold increased odds relative to the lowest quartile (Supplementary Table S5).
Discussion
We have identified a novel group of highly correlated genetic variants, located near the MHC class-II HLA–DPB1 gene, which are associated with genetic susceptibility to cHL. Statistically robust observations were made in both discovery and validation cohorts, and cross validation of the genotyping methods confirmed their technical fidelity.
The association was restricted to the subgroup of cHL in which EBV was detected within the tumor cells by EBV EBER in situ hybridization and/or LMP-1 immunohistochemistry. This implies that these alleles are only relevant to the subset of cHL that are “EBV-positive cHL,” that is, where the HRS cells appear to originate from a clonally expanded population of EBV-infected cell. Most MCHL cases are EBV-positive, however, the heterogeneity by tumor EBV status was also observed following stratification by NSHL and MCHL subtypes of cHL. This strong heterogeneity implies a biologic interaction between EBV and the genetic alleles leading to susceptibility to HL irrespective of the histologic subtype.
The association with the genetic variants was statistically independent of the previously described cHL susceptibility alleles found within the MHC region. A proportion of these other susceptibility alleles also demonstrate heterogeneity in their effects when considering the EBV status of the tumor (9, 11, 12). The class I genetic variants rs2734986 and rs6904029, which correlated with HLA-A gene alleles A*02 and A*01, respectively, are associated with EBV-positive cHL, whereas a genetic variant located in the class II region, rs6903608 has been associated uniquely with EBV-negative cHL (9, 11, 12). As noted above for rs6457715, heterogeneity in EBV-stratified cHL risk was observed even following stratification for the NSHL and MCHL subtypes, for the rs2734986 and rs6904029 and EBV-positive cHL and rs6903608 and EBV-negative cHL. All of these observations taken together imply a remarkable relationship between tumor EBV status and the risk conferred by genetic variants within the MHC region and reinforces the importance of acknowledging EBV status when examining the etiology of Hodgkin lymphoma.
We were unable to establish a link between rs6457715 and a particular HLA allele (Supplementary Table S4), albeit within the limitation inferring HLA genotypes by imputation. rs6457715 resides within an intronic region of the pseudogene HLA–DPB2, and is suggested to be an HLA–DPB1 expression quantitative trait locus (eQTL; ref. 21). However, multiple additional independent genetic variants at this locus, not associated with EBV-positive cHL, are also suggested to be HLA–DPB1 eQTLs (Supplementary Table S6), implying that differences in HLA–DPB1 expression levels are unlikely to clearly explain the association. Furthermore, we were unable to replicate this eQTL association with rs6457715 and HLA–DPB1. As with the rs6903608 effect on EBV-negative cHL (9), the functional mechanism through which the association with rs6457715 is mediated remains ambiguous.
rs6457715 represents the third independent genetic loci within the MHC region (represented by rs2734986, rs6904029, and rs6457715) exclusively associated with EBV-positive cHL. Each of these alleles shows a strong association with cHL, with the risk allele conferring about a 2-fold increased risk per allele of developing the disease. This is an unusually pronounced effect compared with the genetic risks usually conferred by common genetic variants. When considered as a genetic risk score, in a multivariate model, there was an approximately 7-fold difference between the top and bottom 25% of allele carriers. Such a model makes a number of important assumptions, for example, the absence of LD, which is a particularly complex in this region of the genome. Nevertheless, the magnitude of this association implies some potential utility for risk prediction of EBV-positive cHL, particularly in context of other described Hodgkin lymphoma EBV positive risk factors, such as infectious mononucleosis.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Authors' Contributions
Conception and design: M. Delahaye-Sourdeix, A. Nieters, H. Hjalgrim, J.D. McKay
Development of methodology: J.D. McKay
Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): Y. Benavente, A. Nieters, N. Becker, L. Foretova, M. Maynadié, A. Staines, K. Ekstrom Smedby, I. Glimelius, P. Cocco, P. Galan, L.J. Vatten, E.J. Duell, L. Kiemeney, E. Roman, S. de Sanjosé, M. Melbye, P. Brennan, A. Diepstra, A. van den Berg, H. Hjalgrim, R.F. Jarrett
Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): M. Delahaye-Sourdeix, K.Y. Urayama, V. Gaborieau, M. Foll, S. de Sanjosé, M. Lathrop, J.D. McKay
Writing, review, and/or revision of the manuscript: M. Delahaye-Sourdeix, K.Y. Urayama, M. Foll, A. Nieters, N. Becker, L. Foretova, A. Staines, K. Ekstrom Smedby, I. Glimelius, T. Lightfoot, P. Cocco, P. Galan, L.J. Vatten, E.J. Duell, L. Kiemeney, E. Roman, S. de Sanjosé, M. Melbye, P. Brennan, A. Diepstra, A. van den Berg, H. Hjalgrim, R.F. Jarrett, J.D. McKay
Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): M. Delahaye-Sourdeix, R. Veenstra, A. Chabrier, A. Staines, E.J. Duell, E. Roman
Study supervision: A. Nieters, K. Ekstrom Smedby, L.J. Vatten, L. Kiemeney, E. Roman, J.D. McKay
Acknowledgments
The authors thank G. Thomas and Synergy Lyon Cancer (Lyon, France) for high-performance computing support.
Grant Support
This work was supported by Institut National du Cancer, France (to P. Galan); Spanish Ministry of Health grant CIBERESP (PI11/0181008-1555 and 06/02/0073) and Agència de Gestió d'Ajuts Universitaris i de Recerca–Generalitat de Catalunya (Catalonian Government 2014SGR756; to Y. Benavente, E.J. Duell, and S. de Sanjosé); Ministry of Health of the Czech Republic (MZ0 MOU2005; to L. Foretova); German José Carreras Leukemia Foundation (DJCLS_R04/08; to N. Becker); Federal Office for Radiation Protection (StSch4261 and StSch4420; to A. Nieters); EC 5th Framework Program Quality of Life (QLK4-CT-2000-00422; to Y. Benavente, A. Nieters, and S. de Sanjosé); EC 6th Framework Program (FP6-2003-FOOD-2-B; to Y. Benavente, A. Nieters, and S. de Sanjosé); La Fondation de France (1999-008471; to M. Maynadié); Compagnia di San Paolo—Programma Oncologia; Health Research Board (to A. Staines); Leukaemia Lymphoma Research (00/73, 06001, 08031, and 05045; to R.F. Jarrett); Kay Kendall Leukaemia Fund (to R.F. Jarrett); U.S. NIH (R01 CA69269; to M. Melbye); Nordic Cancer Union (16-02-D; to K. Ekstrom Smedby, M. Melbye, and H. Hjalgrim); Plan Danmark (to M. Melbye and H. Hjalgrim); Danish Cancer Research Foundation (41-08; to K. Ekstrom Smedby, M. Melbye, and H. Hjalgrim); Lundbeck Foundation (R19-A2364); Danish Cancer Society (DP 08-155; to K. Ekstrom Smedby, M. Melbye, and H. Hjalgrim); Swedish Cancer Society (2009/1084; to K. Ekstrom Smedby and I. Glimelius); Dutch Cancer Society (KWF grants RUG 2000-2315, RUG 2010-4860, RUG 2014-6698; to R. Veenstra, A. Diepstra, and A. van den Berg); Dutch Organization of Scientific Research (NWO-MW grant 920-03-136; to R. Veenstra, A. Diepstra, and A. van den Berg).
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.