Abstract
Chromosomal translocations are the hallmark genetic aberration in non–Hodgkin lymphoma (NHL), with specific translocations often selectively associated with specific NHL subtypes. Because many NHL-associated translocations involve cell cycle, apoptosis, and lymphocyte development regulatory genes, we evaluated NHL risk associated with common genetic variation in 20 candidate genes in these pathways. Genotyping of 203 tag single nucleotide polymorphisms (SNP) was conducted in 1,946 NHL cases and 1,808 controls pooled from 3 independent population-based case-control studies. We used logistic regression to compute odds ratios (OR) and 95% confidence intervals (CI) for NHL and four major NHL subtypes in relation to tag SNP genotypes and haplotypes. We observed the most striking associations for tag SNPs in the proapoptotic gene BCL2L11 (BIM) and BCL7A, which is involved in a rare NHL-associated translocation. Variants in BCL2L11 were strongly related to follicular lymphoma only, particularly rs3789068 (ORAG, 1.41; 95% CI, 1.10-1.81; ORGG, 1.65; 95% CI, 1.25-2.19; Ptrend = 0.0004). Variants in BCL7A were strongly related to diffuse large B-cell lymphoma only, particularly rs1880030 (ORAG, 1.34; 95% CI, 1.08-1.68; ORAA, 1.60; 95% CI, 1.22-2.08; Ptrend = 0.0004). The associations for both variants were similar in all three studies and supported by haplotype analyses. We also observed notable associations for variants in BCL6, CCND1, and MYC. Our results support the role of common genetic variation in cell cycle, apoptosis, and lymphocyte development regulatory genes in lymphomagenesis, and suggest that effects may vary by NHL subtype. Replication of our findings and further study to identify functional SNPs are warranted. (Cancer Epidemiol Biomarkers Prev 2009;18(4):1259–70)
Introduction
Non–Hodgkin lymphomas (NHL) are closely related diseases, each involving the malignant transformation of lymphoid cells but with distinctive morphologic, immunophenotypic, genetic, and clinical features (1). The strongest known NHL risk factor is severe immunodeficiency, but the etiologies of most lymphomas remain unexplained (1, 2). Although no major susceptibility gene has been identified, several lines of evidence reveal the contributions of genetic predisposition to NHL etiology: NHL risk is elevated among individuals with a family history of hematopoietic malignancy, migrant studies show that migrants tend to retain the NHL incidence rates and patterns of their country of origin, and common genetic variations have recently been associated with NHL risk (3-6).
Chromosomal translocations are the hallmark genetic aberration in NHL, with specific translocations often selectively associated with particular NHL subtypes (7-10). Most translocations occur as a side effect of the single- and double-stranded DNA breaks induced during endogenous processes critical to normal lymphocyte development. Specifically, early in lymphocyte development, DNA in the variable (V), diversity (D), and joining (J) regions of the immunoglobulin heavy chain (IgH) and λ light chain (IgL) loci recombines to form a functioning B-cell receptor in a process known as V(D)J recombination. Mature, antigen-stimulated B-cells undergo two processes that entail DNA breaks. The first, class switch recombination, involves recombination of DNA in the IgH constant region to produce effector antibody classes. The second, somatic hypermutation, typically induces a high rate of point mutations in the Ig V regions to produce antibodies with improved antigen affinity.
NHL-associated translocations typically result in transcriptional deregulation of a proto-oncogene or oncogene by juxtaposing it with Ig regulatory sequences, although some non–Ig translocations can also occur (7-11). Many of the genes involved in NHL-associated translocations regulate the cell cycle, apoptosis, and lymphocyte development, such as MYC, BCL2, CCND1, and BCL6. Genes in these pathways (e.g., MYC, BCL6, and PIM1) also have been identified as targets of aberrant (non-Ig) somatic hypermutation (12).
The likely importance of cell cycle, apoptosis, and lymphocyte development regulatory genes in lymphomagenesis is evident from their participation in NHL-associated translocations and their identification as targets of aberrant somatic hypermutation, yet few studies have investigated the relationship between risk of developing lymphoma and common genetic variation in these genes. We therefore investigated risk of NHL and NHL subtypes associated with common genetic variation in 20 candidate genes involved in regulating the cell cycle, apoptosis, and lymphocyte development, 7 of them in or near breakpoints for lymphoma-associated chromosomal translocations (Table 1). Our study population included 1,946 patients with NHL and 1,808 controls derived from pooling 3 population-based case-control studies. Combining data from three studies enabled us to evaluate pooled risk estimates as well as risk estimates in three independent populations, and provided sufficient sample size to investigate risk of NHL overall and the four most common NHL subtypes.
Candidate gene* (N = 20) | Chromosomal location* | Gene name (aliases)* | Comments | Total SNPs (N = 203)† | Gene coverage‡
---|---|---|---|---|---
Genes involved in cell cycle control and apoptosis | ||||||||||
BCL10 | 1p22 | B-cell CLL/lymphoma 10 (CLAP; mE10; CIPER; c-E10; CARMEN) | Induces apoptosis, activates nuclear factor-κB. t(1;14)(p22;q32) [BCL10;IgH] is involved in < 5% marginal zone lymphomas. | 12 | 92% | |||||
TP53I3 | 2p23.3 | Tumor protein p53 inducible protein 3 (PIG3) | Involved in p53-mediated apoptosis. | 5 | 100% | |||||
BCL2L11 | 2q12-q13 | BCL2-like 11 (BAM, BIM, BIM-α6, BIM-β6, BIM-β7, BOD, BimEL, BimL) | Induces apoptosis. BCL2 family member. | 12 | 86% | |||||
RIPK1 | 6p25.2 | Receptor (TNFRSF)-interacting serine-threonine kinase 1 (FLJ39204, RIP, RIP1) | Induces apoptosis. | 9 | 90% | |||||
PIM1 | 6p21.2 | Pim-1 oncogene | Controls cell growth, differentiation and apoptosis, particularly for the hematopoietic system. Target of aberrant hypermutation. | 1 | 50% | |||||
RIPK2 | 8q21 | Receptor-interacting serine-threonine kinase 2 (WUGSC:H_RG437L15.1, CARD3, CARDIAK, CCK, GIG30, RICK, RIP2) | Induces apoptosis. | 6 | 86% | |||||
MYC | 8q24.21 | V-myc avian myelocytomatosis viral oncogene homologue (c-myc) | Promotes cell proliferation. Target of aberrant hypermutation. t(8;14)(q24;q32) [MYC;IgH] is involved in 95-100% Burkitt lymphomas and 10% DLBCLs. t(8;12;14)(q24;q24;q32) [MYC;BCL7A;IgH] is involved in <1% Burkitt lymphomas. | 12 | 100% | |||||
CCND1 | 11q13 | Cyclin D1 (BCL1, D11S287E, PRAD1, U21B31) | Regulates the cell cycle G1-S transition. t(11;14)(q13;q32) [CCND1;IgH] is involved in 95-100% mantle cell lymphomas and <5% multiple myelomas and CLLs. | 5 | 83% | |||||
BCL2L2 | 14q11.2-q12 | BCL2-like 2 (BCL-W, BCLW, KIAA0271) | Inhibits apoptosis. BCL2 family member. | 2 | 50% | |||||
BCL2L10 | 15q21 | BCL2-like 10 (BCL-B, Boo, Diva, MGC129810, MGC129811) | Inhibits apoptosis. BCL2 family member. | 5 | 83% | |||||
BCL2A1 | 15q24.3 | BCL2-related protein A1 (ACC-1, ACC-2, BCL2L5, BFL1, GRS, HBPA1) | Inhibits apoptosis. BCL2 family member. | 8 | 89% | |||||
TP53 | 17p13.1 | Tumor protein p53 (Li-Fraumeni syndrome; LFS1, TRP53, p53) | Induces cell cycle arrest or apoptosis in response to DNA damage. Mutated in DLBCL (25%) and Burkitt lymphoma (40%). | 2 | 22% | |||||
BCL2 | 18q21.3 | B-cell CLL/lymphoma 2 | Inhibits apoptosis. BCL2 family member. t(14;18)(q32;q21) [IgH;BCL2] is involved in 70-90% follicular lymphomas, 30% DLBCLs, and <5% other NHLs. | 62 | 91% | |||||
BAX | 19q13.3-q13.4 | BCL2-associated X protein | Induces apoptosis. BCL2 family member. | 10 | 77% | |||||
BCL2L1 | 20q11.21 | BCL2-like 1 (BCL-XL/S, BCL2L, BCLX, Bcl-X, DKFZp781P2092, bcl-xL, bcl-xS) | Inhibits apoptosis. BCL2 family member. | 4 | 80% | |||||
Genes involved in lymphocyte development | ||||||||||
LMO2 | 11p13 | LIM domain only 2 (RBTN2, RBTNL1, RHOM2, TTG2) | Highly expressed in germinal center lymphocytes. Near the 11p13 T-cell translocation cluster. | 22 | 88% | |||||
AICDA | 12p13 | Activation-induced cytidine deaminase (AID, ARP2, CDA2, HIGM2) | Initiates class switch recombination and somatic hypermutation in germinal center B-cells. | 7 | 70% | |||||
BCL6 | 3q27 | B-cell CLL/lymphoma 6 (BCL5, BCL6A, LAZ3, ZBTB27, ZNF51) | Controls germinal-center formation and T-cell–dependent immune responses. Target of aberrant hypermutation. t(3;14)(q27;q32) [BCL6;IgH] is involved in 10-35% DLBCLs and 5-10% follicular lymphomas. t(3;various)(q27) [BCL6;various] is involved in 5% DLBCLs. | 11 | 92% | |||||
Gene function unknown | ||||||||||
BCL7A | 12q24.13 | B-cell CLL/lymphoma 7A (BCL7) | Function unknown. t(12;14)(q24;q32) [BCL7A;IgH] and t(8;12;14)(q24;q24;q32) [MYC;BCL7A;IgH] are involved in <1% Burkitt lymphomas. | 6 | 86% | |||||
BCL7C | 16p11 | B-cell CLL/lymphoma 7C | Function unknown. | 2 | 100% |
*As defined by Entrez Gene (http://www.ncbi.nlm.nih.gov/sites/entrez?db=gene).
†Markers genotyped for each candidate gene based on selection by Tagzilla. For each gene, SNPs within the region spanning 20 kb 5′ of the start of transcription (exon 1) to 10 kb 3′ of the end of the last exon were grouped using a binning threshold of r2 > 0.8.
‡Estimated gene coverage is based on the number of SNPs genotyped/number of bins from the designable set of SNPs (r2 > 0.8, minor allele frequency >5%) genotyped in the HapMap Caucasian (CEU) samples, Build 20 (http://www.hapmap.org).
Materials and Methods
Study Population
Our study population was derived from pooling three independent population-based case-control studies, which have been described in detail previously: the National Cancer Institute-Surveillance Epidemiology and End Results (NCI-SEER) NHL Case-Control Study (13, 14), the Connecticut NHL Case-Control Study (15, 16), and the New South Wales (NSW) NHL Case-Control Study (17, 18). Selected characteristics for each study are presented in Table 2. All three studies included first primary NHL cases only, and population controls were frequency matched to cases (Table 2). The pooled study population had more women than men because the Connecticut study was limited to women, and the age distribution was somewhat younger than a typical series of NHL cases because the NCI-SEER and NSW studies were limited to adults younger than age 75 y. Like the underlying populations, the study population was predominantly Caucasian and non-Hispanic.
Characteristic | NCI-SEER | | Connecticut | | NSW | | Pooled | |
---|---|---|---|---|---|---|---|---|
Time period | 1998-2000 | | 1996-2000 | | 2000-2001 | | | |
Eligibility criteria | Included ages 20-74 y. Excluded known HIV-positive individuals. | | Included ages 21-84 y. Excluded males. | | Included ages 20-74 y. Excluded known HIV-positive individuals and organ transplant recipients. | | | |
Control selection | <65 y: random digit dialing, ≥65 y: Medicare files | | <65 y: random digit dialing, ≥65 y: Medicare files | | Electoral rolls | | | |
Matching variables | Age (5 y groups), sex, race, SEER area | | Age (5 y groups) | | Age (5 y groups), sex, state or territory | | | |
Risk factor data | Self-administered questionnaire, in-person interview | | Self-administered questionnaire, in-person interview | | Self-administered questionnaire, telephone interview | | | |
n (%)* | Controls | Cases | Controls | Cases | Controls | Cases | Controls | Cases
Study population | ||||||||||||||||
Risk factor data† | 1057 (NA) | 1321 (NA) | 717 (NA) | 601 (NA) | 694 (NA) | 694 (NA) | ||||||||||
Genotyped for this analysis‡ | 834 (NA) | 1001 (NA) | 517 (NA) | 436 (NA) | 474 (NA) | 524 (NA) | ||||||||||
Final analytic population§ | 828 (100) | 990 (100) | 515 (100) | 436 (100) | 465 (100) | 520 (100) | 1808 (100) | 1946 (100) | ||||||||
Study site | ||||||||||||||||
Detroit SEER registry | 139 (17) | 197 (20) | — | — | — | — | 139 (8) | 197 (10) | ||||||||
Iowa SEER registry | 246 (30) | 301 (30) | — | — | — | — | 246 (14) | 301 (16) | ||||||||
Los Angeles SEER registry | 199 (24) | 234 (24) | — | — | — | — | 199 (11) | 234 (12) | ||||||||
Seattle SEER registry | 244 (29) | 258 (26) | — | — | — | — | 244 (13) | 258 (13) | ||||||||
Connecticut SEER registry | — | — | 515 (100) | 436 (100) | — | — | 515 (28) | 436 (22) | ||||||||
NSW | — | — | — | — | 446 (96) | 496 (95) | 446 (25) | 496 (26) | ||||||||
Australian Capital Territory | — | — | — | — | 19 (4) | 24 (5) | 19 (1) | 24 (1) | ||||||||
Sex | ||||||||||||||||
Male | 443 (53) | 536 (54) | — | — | 268 (58) | 304 (58) | 711 (39) | 840 (43) | ||||||||
Female | 385 (47) | 454 (46) | 515 (100) | 436 (100) | 197 (42) | 216 (42) | 1097 (61) | 1106 (57) | ||||||||
Age (y) | ||||||||||||||||
<50 | 203 (25) | 277 (28) | 98 (19) | 86 (20) | 107 (23) | 121 (23) | 408 (23) | 484 (25) | ||||||||
50-59 | 177 (21) | 235 (24) | 97 (19) | 89 (20) | 135 (29) | 171 (33) | 409 (23) | 495 (25) | ||||||||
60-69 | 285 (34) | 311 (31) | 120 (23) | 110 (25) | 151 (32) | 154 (30) | 556 (31) | 575 (30) | ||||||||
70+ | 163 (20) | 167 (17) | 200 (39) | 151 (35) | 72 (16) | 74 (14) | 435 (24) | 392 (20) | ||||||||
Race/ethnicity | ||||||||||||||||
White, non-Hispanic | 646 (78) | 829 (84) | 473 (92) | 415 (95) | 459 (99) | 507 (98) | 1578 (87) | 1751 (90) | ||||||||
Black | 112 (13) | 64 (6) | 14 (3) | 13 (3) | — | — | 126 (7) | 77 (4) | ||||||||
Asian/other/unknown | 70 (8) | 97 (10) | 28 (5) | 8 (2) | 6 (1) | 13 (3) | 104 (6) | 118 (6) | ||||||||
NHL subtype | ||||||||||||||||
DLBCL | — | 294 (30) | — | 137 (31) | — | 169 (33) | — | 600 (31) | ||||||||
Follicular lymphoma | — | 246 (25) | — | 103 (24) | — | 191 (37) | — | 540 (28) | ||||||||
Marginal zone lymphoma | — | 82 (8) | — | 29 (7) | — | 49 (9) | — | 160 (8) | ||||||||
CLL/SLL | — | 101 (10) | — | 43 (10) | — | 17 (3) | — | 161 (8) | ||||||||
Mantle cell lymphoma | — | 40 (4) | — | 10 (2) | — | 19 (4) | — | 69 (4) | ||||||||
Lymphoplasmacytic lymphoma | — | 24 (2) | — | 9 (2) | — | 23 (4) | — | 56 (3) | ||||||||
Burkitt lymphoma | — | 11 (1) | — | 0 (0) | — | 3 (1) | — | 14 (1) | ||||||||
Mycosis fungoides/Sézary syndrome | — | 18 (2) | — | 10 (2) | — | 3 (1) | — | 31 (2) | ||||||||
Peripheral T-cell lymphoma | — | 41 (4) | — | 14 (3) | — | 7 (1) | — | 62 (3) | ||||||||
NHL, not otherwise specified | — | 133 (13) | — | 81 (19) | — | 39 (7) | — | 253 (13) | ||||||||
DNA source | ||||||||||||||||
Blood | 598 (72) | 688 (70) | 515 (100) | 436 (100) | 465 (100) | 520 (100) | 1578 (87) | 1644 (85) | ||||||||
Buccal | 230 (28) | 302 (30) | — | — | — | — | 230 (13) | 302 (15) |
Abbreviations: NA, not applicable.
*Percentage is based on the final analytic population for this pooled analysis.
†Participation (percentage interviewed among those approached) in NCI-SEER was 76% for cases and 52% for controls; in Connecticut was 72% for cases, 69% for controls <65 y, and 47% for controls ≥65 y; and in NSW was 85% for cases and 61% for controls.
‡Study participants who did not provide a biological specimen, did not have sufficient material for DNA extraction or sufficient DNA for genotyping, or whose genotyped sex was discordant from the questionnaire data were excluded from this analysis. The Connecticut study also restricted this analysis to participants who provided a blood sample. The NSW study also restricted this analysis to participants of European or Asian ethnicity (97% of participants).
§The final analytic population further excluded participants with a low sample completion rate (NCI-SEER: 11 cases, 6 controls; Connecticut: 2 controls; NSW: 4 cases, 9 controls).
The protocols for each study were approved by the Institutional Review Boards of the NCI and each SEER center for the NCI-SEER study; Yale University, the Connecticut Department of Public Health, and the NCI for the Connecticut study; and all participating institutions for the NSW study. All study participants provided informed consent.
NHL Pathology Classification
All cases were histologically confirmed by the local diagnosing pathologist in the NCI-SEER study and by central review of diagnostic slides by two independent expert hematopathologists in the Connecticut study. In the NSW study, all cases were histologically confirmed by the local diagnosing pathologist, and a confirmatory central pathology review was done for cases judged to be <90% certain to be NHL on review of the diagnostic pathology report by an expert hematopathologist. In the present analyses, we evaluated NHL overall and specific NHL subtypes, grouping cases according to the WHO classification (1) using the International Lymphoma Epidemiology Consortium guidelines (19). For analyses by NHL subtype, we evaluated only the 4 most common subtypes: diffuse large B-cell lymphoma (DLBCL; 31%), follicular lymphoma (28%), marginal zone lymphoma (8%), and chronic lymphocytic leukemia/small lymphocytic lymphoma (CLL/SLL; 8%; Table 2). Our studies primarily included SLL rather than CLL cases because these diseases were not considered the same entity until the WHO classification was introduced in 2001 (1).
Laboratory Methods
Biological Samples and DNA Extraction. Study participants who did not provide a biological specimen, did not have sufficient material for DNA extraction or sufficient DNA for genotyping, or whose genotyped sex was discordant from the questionnaire data were excluded from this analysis (Table 2). For the NCI-SEER study, DNA was extracted from blood clots or buffy coats (BBI Biotech) using Puregene Autopure DNA extraction kits (Gentra Systems), and from buccal cell samples by phenol-chloroform extraction methods (20). Genotype frequencies for individuals who provided blood compared with buccal cells were equivalent (21). For the Connecticut study, DNA was extracted from the blood samples using phenol-chloroform extraction methods (20). For the NSW study, DNA was extracted from buffy coats using Qiagen QIAamp DNA Blood Midi kits by laboratory staff at the Viral Epidemiology Section, Science Applications International Corporation-Frederick, NCI-Frederick.
Genotyping. Genotyping of tag single nucleotide polymorphisms (SNP) from 20 candidate genes involved in regulating the cell cycle, apoptosis, and lymphocyte development was conducted at the NCI Core Genotyping Facility (Advanced Technology Center; ref. 22) using a custom-designed GoldenGate assay (Illumina). The GoldenGate assay included a total of 1,536 tag SNPs; thus, this analysis was conducted as part of a panel that also included SNPs from candidate genes in other pathways. Tag SNPs were chosen from the designable set of common SNPs (minor allele frequency, >5%) genotyped in the Caucasian (CEU) population sample of the HapMap Project (Data Release 20/phase II, National Center for Biotechnology Information Build 35 assembly, dbSNPb125) using the software Tagzilla, which implements a tagging algorithm based on the pairwise binning method of Carlson et al. (23). For each gene, SNPs within the region spanning 20 kb 5′ of the start of transcription (exon 1) to 10 kb 3′ of the end of the last exon were grouped using a binning threshold of r2 > 0.8. When there were multiple transcripts available for genes, only the primary transcript was assessed.
Quality Control, Exclusions, and Final Analytic Study Population. We excluded tag SNPs (n = 3) that failed to cluster in the genotyping calling algorithm (separately analyzed for buccal cell and peripheral blood cell samples) or did not amplify during the amplification step of the genotyping assay. SNPs with low completion rate (<90% of samples) were excluded by study (NCI-SEER blood samples, n = 1; NCI-SEER buccal cell samples, n = 4). Quality control duplicates and replicates from each study were genotyped, blinded to laboratory personnel. SNPs with concordance of <95% in the study-specific quality control samples were excluded for that study (NCI-SEER buccal cell samples, n = 1). We also excluded samples with a low completion rate (<90% of the full panel of 1,536 tag SNPs; NCI-SEER, 11 cases, 6 controls; Connecticut, 2 controls; NSW, 4 cases, 9 controls).
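The completion-rate exclusions described above can be illustrated with a minimal Python sketch. It is not the pipeline used in the study. The data layout (a dictionary mapping each SNP identifier to a list of per-sample calls, with `None` for a missing call), the function names, and the toy values are all assumptions made for illustration. Only the 90% threshold comes from the text.

```python
from typing import Dict, List, Optional

# Hypothetical layout: SNP id -> one call per sample (0/1/2 variant alleles, None = no call).
Genotypes = Dict[str, List[Optional[int]]]


def completion_rate(calls: List[Optional[int]]) -> float:
    """Fraction of non-missing genotype calls."""
    return sum(c is not None for c in calls) / len(calls)


def filter_snps(genotypes: Genotypes, min_rate: float = 0.90) -> Genotypes:
    """Keep SNPs called in at least `min_rate` of samples."""
    return {snp: calls for snp, calls in genotypes.items()
            if completion_rate(calls) >= min_rate}


def filter_samples(genotypes: Genotypes, min_rate: float = 0.90) -> List[int]:
    """Return indices of samples called for at least `min_rate` of SNPs.

    Assumes a non-empty dictionary whose lists all have the same length.
    """
    n_samples = len(next(iter(genotypes.values())))
    keep = []
    for i in range(n_samples):
        sample_calls = [calls[i] for calls in genotypes.values()]
        if completion_rate(sample_calls) >= min_rate:
            keep.append(i)
    return keep


if __name__ == "__main__":
    toy = {
        "snpA": [0, 1, 2, 0, 1, 2, 0, 1, 2, None],     # 9 of 10 called: kept
        "snpB": [0, None, None, 0, 1, 2, 0, 1, 2, 0],  # 8 of 10 called: dropped
    }
    print(sorted(filter_snps(toy)))
    print(filter_samples(toy))
```

In the study the SNP filter was applied separately by study and DNA source, and the sample filter was applied across the full 1,536-SNP panel; the sketch applies each filter once to a single toy panel.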
We included in our analyses 5 candidate SNPs previously genotyped by Taqman assay in at least 2 of the 3 studies and located within 1 of the 20 candidate genes in this analysis, some of the results of which have been published previously (24, 25).
The final pooled analytic study population included 1,946 cases and 1,808 controls with data for 203 SNPs (198 tag SNPs, 5 previously genotyped Taqman SNPs) in or near the 20 candidate genes in this analysis (Supplementary Table S1; Table 1). Hardy-Weinberg equilibrium was evaluated among non-Hispanic Caucasian controls (n = 1,578; 87% of controls in the analytic population) for the pooled study population and by study (Supplementary Table S1). In the pooled study population, 3 SNPs showed evidence (P < 0.001) for deviation from Hardy-Weinberg proportions but were retained in the analysis because the quality control data did not suggest any obvious genotyping error (rs9392454, rs7941248, rs17757541).
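As an illustration of how deviation from Hardy-Weinberg proportions can be assessed, the following Python sketch applies a 1-degree-of-freedom chi-square goodness-of-fit test to genotype counts. The text does not say which test was used (chi-square or exact), so the choice of test is an assumption. The function name and the counts in the example are hypothetical.

```python
import math
from typing import Tuple


def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int) -> Tuple[float, float]:
    """Chi-square test (1 df) of Hardy-Weinberg proportions.

    Takes counts of the two homozygotes and the heterozygote and returns
    (chi-square statistic, P value). Assumes both alleles are observed;
    a monomorphic SNP would give zero expected counts.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p                       # frequency of allele B
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


if __name__ == "__main__":
    # Hypothetical counts, not taken from the study.
    print(hwe_chi_square(100, 200, 100))  # exactly in Hardy-Weinberg proportions
    print(hwe_chi_square(150, 100, 150))  # strong heterozygote deficit
```

A SNP would be flagged under the criterion in the text when the returned P value is below 0.001.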
Statistical Methods
SNP-Based Analyses. We calculated odds ratios (OR) and 95% confidence intervals (CI) estimating the relative risk of NHL and NHL subtypes in relation to SNP genotype using dichotomous and polytomous unconditional logistic regression models, respectively. The homozygote of the most common allele in the pooled study population was used as the reference group. Tests for trend under the codominant model used a three-level ordinal variable for each SNP (0, homozygote common; 1, heterozygote; 2, homozygote variant). All models were adjusted for age, race/ethnicity, sex, and study center (categories listed in Table 2). We conducted analyses restricted to non-Hispanic Caucasians and stratified by age (<50, ≥ 50 y) and sex to evaluate the consistency of our results by various demographic groups. To evaluate the consistency of our results by NHL subtype, we assessed heterogeneity among NHL subtypes in the polytomous multivariate unconditional logistic regression models using the Wald χ2 statistic (results presented in Supplementary Tables). Analyses were conducted using SAS version 9.1 (SAS Institute).
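To make the genotype coding and the OR scale concrete, here is a minimal Python sketch. It codes a genotype as 0, 1, or 2 variant alleles, as in the trend test described above, and computes an unadjusted OR with a Wald (Woolf) 95% CI from a 2 × 2 table. This is a simplification: the study's estimates came from logistic regression models adjusted for age, race/ethnicity, sex, and study center, which the sketch does not reproduce. The function names and all counts are hypothetical.

```python
import math
from typing import Tuple


def trend_code(genotype: str, common_allele: str) -> int:
    """Ordinal code: 0 = homozygote common, 1 = heterozygote, 2 = homozygote variant."""
    return sum(1 for allele in genotype if allele != common_allele)


def odds_ratio_ci(case_exp: int, ctrl_exp: int,
                  case_ref: int, ctrl_ref: int,
                  z: float = 1.96) -> Tuple[float, float, float]:
    """Unadjusted OR and Wald CI for one genotype versus the reference genotype.

    case_exp / ctrl_exp: cases and controls carrying the genotype of interest.
    case_ref / ctrl_ref: cases and controls homozygous for the common allele.
    Assumes all four counts are greater than zero.
    """
    odds_ratio = (case_exp * ctrl_ref) / (ctrl_exp * case_ref)
    se_log_or = math.sqrt(1 / case_exp + 1 / ctrl_exp + 1 / case_ref + 1 / ctrl_ref)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


if __name__ == "__main__":
    print([trend_code(g, "A") for g in ("AA", "AG", "GG")])
    # Hypothetical counts, not taken from the study.
    print(odds_ratio_ci(case_exp=20, ctrl_exp=10, case_ref=10, ctrl_ref=20))
```

An adjusted analysis would instead enter the genotype indicators or the 0/1/2 trend code into a logistic model together with the covariates.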
We obtained a gene-level summary of association by computing the minimum P value (“minP test”), which assesses the true statistical significance of the smallest Ptrend within each gene (determined by dichotomous logistic regression, comparing NHL or NHL subtypes to controls; SNPs listed in Supplementary Table S2) by permutation-based resampling methods (10,000 permutations) that automatically adjust for the number of tag SNPs tested within that gene and the underlying linkage disequilibrium pattern (26, 27). To account for multiple comparisons with 20 candidate genes in this analysis, we applied the false discovery rate (FDR) method of Benjamini and Hochberg (28) to the minP test separately for NHL and each subtype. We considered minP results with FDR values of <0.2 to be the least likely to reflect false-positive findings and therefore our most noteworthy results. Finally, we summarized the overall evidence of association of the 203 SNPs with NHL or an NHL subtype by using the “tail strength” statistic (29), a summary measure of the departure of the observed P value distribution from its expected distribution under the global null hypothesis of no association in the group of 20 candidate genes in this analysis. We assessed the significance of the tail strength statistics by generating their null distributions by permutation-based resampling of the data. Higher tail strength values (and corresponding lower P values) provide stronger evidence of association. Analyses were conducted using the MATLAB Statistics Toolbox 6.2 (The Mathworks, Inc.).
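The two summary procedures named above can be sketched in a few lines of Python. This is an illustration only: the study used MATLAB, and the permutation steps (the minP null distribution and the tail strength null distribution) are not reproduced here. The tail strength formula is written as we understand it from the cited method, TS = (1/m) Σ_k [1 − p_(k)(m + 1)/k] over the ordered P values, and should be checked against ref. 29. The function names and example P values are hypothetical.

```python
from typing import List


def benjamini_hochberg(p_values: List[float]) -> List[float]:
    """Benjamini-Hochberg FDR-adjusted P values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def tail_strength(p_values: List[float]) -> float:
    """Tail strength: mean of 1 - p_(k) * (m + 1) / k over the ordered P values.

    Values near 0 are expected under the global null hypothesis; larger
    positive values indicate an excess of small P values.
    """
    m = len(p_values)
    ordered = sorted(p_values)
    return sum(1.0 - p * (m + 1) / k for k, p in enumerate(ordered, start=1)) / m


if __name__ == "__main__":
    # Hypothetical gene-level P values, not taken from the study.
    gene_p = [0.01, 0.04, 0.03, 0.20]
    fdr = benjamini_hochberg(gene_p)
    print([round(v, 4) for v in fdr])
    print([v < 0.2 for v in fdr])  # the noteworthiness criterion used in the text
    print(tail_strength(gene_p))
```

Applied to the per-gene minP values for one outcome, the first function gives the FDR values that the text compares with 0.2. The second function gives only the point statistic; its P value would still require permutation.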
Multilocus Analyses. For those genes with at least one SNP with a Ptrend of <0.05 for NHL or an NHL subtype (n = 12 genes), we further conducted two multilocus tests. The purpose of these tests was to detect stronger associations that might have been missed by the single-SNP analyses based on linkage disequilibrium between the genotyped SNPs and a causally associated SNP. First, we conducted a likelihood ratio test, assessing the relative improvement in model fit from the inclusion of parameters for all independent SNPs (r2 < 0.8 among controls) in a particular gene, assuming a codominant model for each SNP, compared with a model with just age, sex, race, and study center. Second, we conducted haplotype analyses among non-Hispanic Caucasians. We evaluated risk of NHL and NHL subtypes associated with haplotypes defined by SNPs within a sliding window of three loci across a gene (Haplo Stats, version 1.2.1, haplo.score.slide).
A global score statistic was used to summarize the evidence of association of disease with the haplotypes for each window. In addition, we visualized haplotype structures using Haploview, version 3.11 (30), based on measures of pairwise linkage disequilibrium between SNPs. For blocks of linkage disequilibrium (Supplementary Table S3), we obtained ORs and 95% CIs for the underlying haplotypes under the assumption of an additive model (haplo.glm, minimum haplotype frequency 1%). Two SNPs (MYC rs3824120, BCL2 rs1982673) were excluded from haplotype analyses because they were genotyped in only two of three studies (Supplementary Table S1). All haplotype analyses were adjusted for age, sex, and study center.
Results
In this analysis of 203 SNPs from 20 candidate genes among 1,946 patients with NHL and 1,808 population controls, the overall statistical significance for NHL of the biological pathway(s) captured by all 20 genes was P = 0.0544 (tail strength statistic, 0.1546). We observed suggestive associations (Ptrend < 0.05) for 15 SNPs with risk of NHL overall, 17 SNPs with DLBCL, 12 SNPs with follicular lymphoma, 10 SNPs with marginal zone lymphoma, and 13 SNPs with CLL/SLL (Supplementary Table S4).
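For readers unfamiliar with the tail strength statistic, a minimal sketch follows. It assumes the usual definition, TS = (1/m) Σ_k [1 − p_(k)(m + 1)/k], where p_(1) ≤ ... ≤ p_(m) are the ordered P values; the function name and the toy inputs are hypothetical and are not study data. Under the global null hypothesis the k-th ordered P value has expectation k/(m + 1), so TS is centered on zero, and positive values indicate an excess of small P values.

```python
def tail_strength(pvalues):
    """Tail strength of a set of P values (assumed standard definition)."""
    m = len(pvalues)
    ordered = sorted(pvalues)
    total = sum(1.0 - p * (m + 1) / k for k, p in enumerate(ordered, start=1))
    return total / m


m = 203  # number of SNPs in this analysis

# Toy input 1: P values sitting exactly at their null expectations.
null_like = [k / (m + 1) for k in range(1, m + 1)]

# Toy input 2 (hypothetical): the same values shifted toward zero.
enriched = [p ** 1.5 for p in null_like]

ts_null = tail_strength(null_like)
ts_enriched = tail_strength(enriched)

print(f"TS at null expectation: {ts_null:.4f}")
print(f"TS with excess small P values: {ts_enriched:.4f}")
```

The sketch covers only the statistic itself; its significance in this analysis was assessed by permutation-based resampling, as described in the Methods.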
We observed the most striking associations for BCL2L11 (also known as BIM) and BCL7A (FDR value for minP test, <0.2). BCL2L11 was associated with follicular lymphoma (minP = 0.0068; Table 3). SNP-based analyses revealed suggestive associations (Ptrend < 0.05) for 4 SNPs with NHL overall, and 6 SNPs with follicular lymphoma but no significant associations with any other NHL subtype (Supplementary Table S4). Two variants in linkage disequilibrium in our control population (D' = 0.99; r2 = 0.75) were particularly strongly related to follicular lymphoma (rs7567444: ORCT, 0.87; 95% CI 0.70-1.08; ORTT, 0.60; 95% CI, 0.44-0.80; Ptrend = 0.0009; rs3789068: ORAG, 1.41; 95% CI, 1.10-1.81; ORGG, 1.65; 95% CI, 1.25-2.19; Ptrend = 0.0004), with very similar risk estimates in all 3 studies (Supplementary Table S5; Table 4; Fig. 1). The multilocus analyses supported the association of BCL2L11 with follicular lymphoma and did not show stronger evidence of association than the single SNP-based analyses (Supplementary Tables S6-7).
Table 3. Gene-level minP test P values for NHL and NHL subtypes

Candidate gene | NHL | DLBCL | Follicular lymphoma | Marginal zone lymphoma | CLL/SLL |
---|---|---|---|---|---|
BCL10 | 0.5447 | 0.8675 | 0.2101 | 0.4424 | 0.1830 |
TP53I3 | 0.6529 | 0.6996 | 0.4719 | 0.6475 | 0.2091 |
BCL2L11 | **0.0489** | 0.4653 | **0.0068** | 0.5141 | 0.8484 |
RIPK1 | 0.7164 | 0.5040 | 0.9507 | 0.9517 | 0.9248 |
PIM1 | **0.0423** | **0.0231** | 0.4853 | 0.6349 | 0.9617 |
RIPK2 | 0.4872 | 0.9026 | 0.4508 | 0.2340 | 0.2671 |
MYC | 0.7174 | 0.4457 | 0.5316 | 0.3774 | **0.0361** |
CCND1 | 0.0744 | 0.0629 | 0.5965 | 0.3924 | 0.3516 |
BCL2L2 | 0.6374 | 0.8009 | 0.1184 | 0.6136 | 0.7437 |
BCL2L10 | 0.3458 | 0.5721 | 0.5965 | 0.9450 | 0.1807 |
BCL2A1 | 0.6037 | 0.6955 | 0.5907 | 0.6038 | 0.7183 |
TP53 | 0.4872 | 0.9042 | 0.2973 | 0.7189 | 0.0849 |
BCL2 | 0.1772 | 0.7520 | 0.1785 | 0.0506 | 0.9962 |
BAX | 0.7085 | 0.9242 | 0.4991 | 0.2834 | 0.2803 |
BCL2L1 | 0.6575 | 0.7890 | 0.2660 | 0.9542 | 0.7690 |
LMO2 | 0.1591 | 0.3511 | 0.7938 | 0.5980 | 0.4292 |
AICDA | 0.4028 | 0.1571 | 0.8852 | 0.3242 | 0.1875 |
BCL6 | 0.0616 | 0.1641 | **0.0452** | **0.0237** | 0.0574 |
BCL7A | **0.0211** | **0.0025** | 0.1809 | 0.9871 | 0.6922 |
BCL7C | 0.8592 | 0.8298 | 0.9120 | 0.6331 | 0.5307 |
NOTE: Bold type indicates P value of <0.05. The minP test assesses the true statistical significance of the smallest Ptrend within each gene (determined by dichotomous logistic regression, comparing NHL or NHL subtypes to controls; SNPs listed in Supplementary Table S2) by permutation-based resampling methods (10,000 permutations) that automatically adjust for the number of tag SNPs tested within that gene and the underlying linkage disequilibrium pattern (26, 27).
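The logic of the minP test can be illustrated with a short sketch. This is a simplified stand-in rather than the authors' procedure: it uses an unadjusted trend statistic (sample size times the squared correlation between case status and the 0/1/2 genotype code) in place of the covariate-adjusted logistic regression Ptrend, the function names are hypothetical, and the genotype data are invented for illustration. Because every SNP in the sketch shares the same sample size and a 1-df statistic, the smallest P value corresponds to the largest statistic, so the code tracks the maximum statistic directly. Case labels are shuffled while each individual's genotypes stay together, which preserves the linkage disequilibrium among SNPs.

```python
import random


def trend_statistic(dose, status):
    """N * r^2 between genotype dose (0/1/2) and case status (0/1)."""
    n = len(dose)
    mean_d = sum(dose) / n
    mean_s = sum(status) / n
    cov = sum((d - mean_d) * (s - mean_s) for d, s in zip(dose, status))
    var_d = sum((d - mean_d) ** 2 for d in dose)
    var_s = sum((s - mean_s) ** 2 for s in status)
    if var_d == 0 or var_s == 0:
        return 0.0  # monomorphic SNP or single outcome class
    return n * cov * cov / (var_d * var_s)


def max_statistic(snps, status):
    """Largest trend statistic across all SNPs in one gene."""
    return max(trend_statistic(dose, status) for dose in snps)


def minp_permutation(snps, status, n_perm=10000, seed=0):
    """Permutation P value for the best SNP in a gene."""
    rng = random.Random(seed)
    observed = max_statistic(snps, status)
    shuffled = list(status)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break the genotype-outcome link only
        if max_statistic(snps, shuffled) >= observed:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)


# Hypothetical gene with two SNPs in 100 cases followed by 100 controls.
status = [1] * 100 + [0] * 100
snp_associated = [2] * 40 + [1] * 40 + [0] * 20 + [2] * 10 + [1] * 40 + [0] * 50
snp_null = [i % 3 for i in range(200)]

p_gene = minp_permutation([snp_associated, snp_null], status, n_perm=999, seed=1)
print(f"gene-level permutation P value: {p_gene:.4f}")
```

The smallest attainable P value is 1/(n_perm + 1), which is one reason a large number of permutations (10,000 in this study) is used.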
Table 4. SNP genotype ORs and 95% CIs for NHL and NHL subtypes

Candidate gene* | dbSNP ID, SNP500 alias* | Genotype | Controls, n | All NHL (n = 1946), n | All NHL, OR† (95% CI) | DLBCL (n = 600), n | DLBCL, OR† (95% CI) | Follicular lymphoma (n = 540), n | Follicular lymphoma, OR† (95% CI) | Marginal zone lymphoma (n = 160), n | Marginal zone lymphoma, OR† (95% CI) | CLL/SLL (n = 161), n | CLL/SLL, OR† (95% CI) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
BCL2L11 | rs7567444, ACOXL_01 | CT | 871 | 933 | 0.87 (0.75, 1.01) | 285 | 0.88 (0.71, 1.09) | 271 | 0.87 (0.70, 1.08) | 82 | 1.00 (0.69, 1.46) | 77 | 0.92 (0.63, 1.33) |
BCL2L11 | rs7567444, ACOXL_01 | TT | 386 | 368 | 0.77 (0.64, 0.93) | 119 | 0.83 (0.64, 1.09) | 82 | 0.60 (0.44, 0.80) | 29 | 0.80 (0.49, 1.30) | 32 | 0.87 (0.55, 1.39) |
BCL2L11 | rs7567444, ACOXL_01 | Ptrend | | | 0.0055 | | 0.1542 | | 0.0009 | | 0.4109 | | 0.5434 |
BCL2L11 | rs3789068, BCL2L11_14 | AG | 857 | 939 | 1.16 (0.99, 1.35) | 293 | 1.16 (0.92, 1.45) | 270 | 1.41 (1.10, 1.81) | 77 | 1.15 (0.77, 1.71) | 82 | 1.17 (0.79, 1.72) |
BCL2L11 | rs3789068, BCL2L11_14 | GG | 399 | 486 | 1.27 (1.06, 1.53) | 146 | 1.21 (0.93, 1.58) | 150 | 1.65 (1.25, 2.19) | 39 | 1.27 (0.80, 2.01) | 35 | 1.04 (0.65, 1.67) |
BCL2L11 | rs3789068, BCL2L11_14 | Ptrend | | | 0.0093 | | 0.1439 | | 0.0004 | | 0.3095 | | 0.8038 |
BCL7A | rs12827036, BCL7A_02 | GT | 853 | 900 | 0.83 (0.71, 0.97) | 286 | 0.87 (0.70, 1.08) | 245 | 0.79 (0.63, 0.99) | 70 | 0.91 (0.62, 1.34) | 76 | 0.88 (0.60, 1.28) |
BCL7A | rs12827036, BCL7A_02 | TT | 385 | 377 | 0.77 (0.64, 0.93) | 114 | 0.75 (0.57, 0.99) | 112 | 0.77 (0.58, 1.02) | 37 | 1.04 (0.66, 1.64) | 30 | 0.80 (0.50, 1.30) |
BCL7A | rs12827036, BCL7A_02 | Ptrend | | | 0.0044 | | 0.0415 | | 0.495 | | 0.9104 | | 0.3553 |
BCL7A | rs1880030, BCL7A_03 | AG | 858 | 954 | 1.09 (0.94, 1.27) | 302 | 1.34 (1.08, 1.68) | 253 | 0.88 (0.71, 1.09) | 84 | 1.18 (0.82, 1.70) | 69 | 0.90 (0.62, 1.31) |
BCL7A | rs1880030, BCL7A_03 | AA | 341 | 383 | 1.10 (0.91, 1.32) | 144 | 1.60 (1.22, 2.08) | 90 | 0.76 (0.57, 1.02) | 25 | 0.87 (0.53, 1.44) | 38 | 1.27 (0.82, 1.98) |
BCL7A | rs1880030, BCL7A_03 | Ptrend | | | 0.2681 | | 0.0004 | | 0.0585 | | 0.8114 | | 0.3843 |
BCL6 | rs3172469, BCL6_05 | GT | 757 | 828 | 1.08 (0.95, 1.24) | 269 | 1.18 (0.97, 1.43) | 214 | 0.98 (0.80, 1.21) | 67 | 1.02 (0.72, 1.43) | 70 | 1.20 (0.85, 1.70) |
BCL6 | rs3172469, BCL6_05 | GG | 120 | 152 | 1.34 (1.04, 1.74) | 43 | 1.27 (0.87, 1.85) | 45 | 1.39 (0.95, 2.03) | 10 | 0.98 (0.49, 1.96) | 20 | 2.29 (1.33, 3.93) |
BCL6 | rs3172469, BCL6_05 | Ptrend | | | 0.0303 | | 0.0708 | | 0.3063 | | 0.9723 | | 0.0094 |
BCL6 | rs1523475, BCL6_16 | CT | 581 | 658 | 1.14 (0.99, 1.31) | 218 | 1.27 (1.04, 1.55) | 185 | 1.17 (0.95, 1.44) | 46 | 0.85 (0.59, 1.22) | 50 | 1.03 (0.72, 1.48) |
BCL6 | rs1523475, BCL6_16 | TT | 64 | 86 | 1.50 (1.07, 2.11) | 23 | 1.30 (0.79, 2.14) | 22 | 1.32 (0.79, 2.20) | 2 | 0.34 (0.08, 1.42) | 11 | 2.00 (1.01, 3.96) |
BCL6 | rs1523475, BCL6_16 | Ptrend | | | 0.0079 | | 0.0188 | | 0.0884 | | 0.1255 | | 0.1864 |
MYC | rs3891248, MYC_02 | AT/AA‡ | 578 | 605 | 1.01 (0.87, 1.16) | 202 | 1.18 (0.96, 1.44) | 154 | 0.96 (0.77, 1.19) | 64 | 1.49 (1.06, 2.10) | 36 | 0.57 (0.38, 0.85) |
MYC | rs16902359, MYC_21 | CT/TT‡ | 482 | 462 | 0.92 (0.79, 1.08) | 153 | 1.06 (0.85, 1.32) | 118 | 0.90 (0.71, 1.14) | 42 | 1.03 (0.70, 1.52) | 27 | 0.52 (0.33, 0.82) |
CCND1 | rs603965, CCND1_02 | AG | 883 | 967 | 1.10 (0.94, 1.27) | 307 | 1.20 (0.97, 1.49) | 268 | 1.08 (0.86, 1.35) | 86 | 1.30 (0.88, 1.91) | 78 | 1.05 (0.72, 1.53) |
CCND1 | rs603965, CCND1_02 | AA | 321 | 403 | 1.25 (1.04, 1.52) | 126 | 1.35 (1.03, 1.77) | 108 | 1.16 (0.87, 1.54) | 31 | 1.26 (0.77, 2.05) | 35 | 1.38 (0.86, 2.19) |
CCND1 | rs603965, CCND1_02 | Ptrend | | | 0.0203 | | 0.0270 | | 0.3088 | | 0.2868 | | 0.2132 |
CCND1 | rs2450254, CCND1_15 | AT | 873 | 943 | 0.94 (0.82, 1.09) | 278 | 0.83 (0.68, 1.02) | 270 | 1.04 (0.83, 1.29) | 73 | 0.82 (0.58, 1.17) | 81 | 1.03 (0.71, 1.48) |
CCND1 | rs2450254, CCND1_15 | TT | 335 | 312 | 0.83 (0.68, 1.00) | 91 | 0.73 (0.55, 0.97) | 88 | 0.90 (0.67, 1.21) | 24 | 0.69 (0.42, 1.14) | 28 | 0.96 (0.59, 1.55) |
CCND1 | rs2450254, CCND1_15 | Ptrend | | | 0.0623 | | 0.0176 | | 0.6003 | | 0.1195 | | 0.9011 |
LMO2 | rs3824848, LMO2_32 | CT | 804 | 880 | 1.10 (0.96, 1.27) | 279 | 1.17 (0.96, 1.43) | 247 | 1.14 (0.93, 1.41) | 75 | 1.15 (0.81, 1.62) | 75 | 1.16 (0.82, 1.65) |
LMO2 | rs3824848, LMO2_32 | TT | 168 | 227 | 1.35 (1.08, 1.69) | 71 | 1.43 (1.04, 1.97) | 57 | 1.24 (0.88, 1.75) | 17 | 1.10 (0.62, 1.95) | 19 | 1.43 (0.83, 2.48) |
LMO2 | rs3824848, LMO2_32 | Ptrend | | | 0.0098 | | 0.0176 | | 0.1247 | | 0.5366 | | 0.1805 |
BCL2 | rs2849377, BCL2_22 | AT | 406 | 402 | 0.86 (0.74, 1.01) | 121 | 0.84 (0.67, 1.06) | 130 | 1.04 (0.83, 1.31) | 21 | 0.49 (0.31, 0.79) | 30 | 0.78 (0.51, 1.18) |
BCL2 | rs2849377, BCL2_22 | TT | 33 | 15 | 0.41 (0.22, 0.76) | 7 | 0.61 (0.27, 1.40) | 4 | 0.38 (0.13, 1.09) | 1 | 0.30 (0.04, 2.20) | 1 | 0.36 (0.05, 2.67) |
BCL2 | rs2849377, BCL2_22 | Ptrend | | | 0.0041 | | 0.0634 | | 0.5582 | | 0.0018 | | 0.1267 |
*Candidate gene: gene of interest. Gene: Gene in which SNP is located, as genotyping included SNPs spanning 20 kb 5′ of the start of transcription (exon 1) to 10 kb 3′ of the end of the last exon. SNP500 Alias: SNP500 (http://snp500cancer.nci.nih.gov).
†Common homozygote used as the reference group for each SNP.
‡The dominant model is presented because the homozygote was rare (3.0-4.8% among controls).
BCL7A was particularly associated with DLBCL (minP = 0.0025; Table 3). Three SNPs were significantly related to DLBCL only, with one variant particularly strongly associated with DLBCL (rs1880030: ORAG, 1.34; 95% CI, 1.08-1.68; ORAA, 1.60; 95% CI, 1.22-2.08; Ptrend = 0.0004), which was consistent in all 3 studies (Supplementary Table S5; Table 4; Fig. 1). The multilocus analyses supported the association of this variant with DLBCL and did not show stronger evidence of association than the single SNP-based analyses (Supplementary Tables S6-7). Another SNP was related to risk of NHL overall (rs12827036: ORGT, 0.83; 95% CI, 0.71-0.97; ORTT, 0.77; 95% CI, 0.64-0.93; Ptrend = 0.0044), with similar statistically significant risk estimates for both DLBCL and follicular lymphoma (Table 4), and consistent risk estimates in all 3 studies (Supplementary Table S8).
We also observed notable associations for BCL6, MYC, and CCND1 (FDR value for minP test, 0.2-0.5). BCL6 was marginally associated with NHL overall (minP = 0.0616) and most subtypes (Table 3). In SNP-based analyses, scattered suggestive associations (Ptrend < 0.05) were observed for 8 SNPs for NHL overall and/or at least one NHL subtype (Supplementary Table S4). The SNP most strongly associated with NHL overall was rs1523475 (ORCT, 1.14; 95% CI, 0.99-1.31; ORTT, 1.50; 95% CI, 1.07-2.11; Ptrend = 0.0079; Table 4). Consistent with our previous report on rs1056932 from the Connecticut study (25), the strongest SNP associations in the pooled data set were observed for CLL/SLL (5 SNPs with Ptrend < 0.05, including rs1056932). Compared with the Connecticut study, the associations for CLL/SLL in the NCI-SEER and NSW studies tended to be weaker but were in the same direction. In the pooled data set, CLL/SLL was particularly associated with rs3172469 (ORGT, 1.20; 95% CI, 0.85-1.70; ORGG, 2.29; 95% CI, 1.33-3.93; Ptrend = 0.0094), with similar risk estimates in all three studies (Table 4; Supplementary Table S5). In multilocus analyses, the likelihood ratio test showed a slightly stronger association than the minP test with NHL overall (likelihood ratio test, P = 0.0122; Supplementary Table S6), whereas the analyses of haplotypes defined by SNPs within a sliding window of three loci were similar to the SNP-based analyses and supported a stronger association for CLL/SLL than other subtypes (Supplementary Table S7).
MYC was associated with CLL/SLL (minP = 0.0361; Table 3). The two SNPs most strongly associated with CLL/SLL were in modest linkage disequilibrium in our control population (D' = 0.77; r2 = 0.45), and the homozygote was rare (3.0-4.8% among controls). Thus, we evaluated risk estimates under the dominant genetic model (rs3891248: ORAT/AA, 0.57; 95% CI, 0.38-0.85; P = 0.0060; rs16902359: ORCT/TT, 0.52; 95% CI, 0.33-0.82; P = 0.0049), which were similar in all three studies (Supplementary Table S5; Table 4). The multilocus analyses did not show stronger evidence of association than the single SNP-based analyses (Supplementary Tables S6-7).
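The two linkage disequilibrium measures quoted for SNP pairs in these results, D' and r2, can diverge, which the following sketch illustrates. It is a minimal example assuming known two-locus haplotype frequencies; the function name and the frequencies are hypothetical and are not study data. In practice haplotype frequencies must be estimated from unphased genotypes, a step omitted here.

```python
def ld_measures(p_a, p_b, p_ab):
    """Return (|D'|, r^2) for two biallelic loci.

    p_a, p_b: frequencies of allele A at locus 1 and allele B at locus 2.
    p_ab: frequency of the haplotype carrying both A and B.
    """
    d = p_ab - p_a * p_b
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if d_max == 0 or denom == 0:
        return 0.0, 0.0  # at least one locus is monomorphic
    return abs(d) / d_max, d * d / denom


# Hypothetical pair 1: allele B always travels with allele A, but the
# allele frequencies differ, so D' is complete while r^2 is not.
d_prime_1, r2_1 = ld_measures(p_a=0.5, p_b=0.4, p_ab=0.4)

# Hypothetical pair 2: weaker coupling lowers both measures.
d_prime_2, r2_2 = ld_measures(p_a=0.5, p_b=0.4, p_ab=0.35)

print(f"pair 1: D' = {d_prime_1:.2f}, r2 = {r2_1:.2f}")
print(f"pair 2: D' = {d_prime_2:.2f}, r2 = {r2_2:.2f}")
```

A high D' with a lower r2 therefore indicates SNPs that are tightly coupled historically but differ in allele frequency, so neither SNP is a perfect proxy for the other.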
CCND1 was weakly associated with NHL (minP = 0.0744; Table 3). Two SNPs in linkage disequilibrium in our control population (D' = 0.96; r2 = 0.53) were modestly related to NHL in the pooled study population (rs603965: ORGA, 1.10; 95% CI, 0.94-1.27; ORAA, 1.25; 95% CI, 1.04-1.52; Ptrend = 0.0203; rs2450254: ORAT, 0.94; 95% CI, 0.82-1.09; ORTT, 0.83; 95% CI, 0.68-1.00; Ptrend = 0.0623), with consistent risk estimates across all four subtypes (Table 4). The risk estimates were also generally similar across all three studies (Supplementary Table S8), although the risk estimates for the splice variant G870A (rs603965), which we previously reported for the NCI-SEER study (24), were attenuated and not significant in the Connecticut and NSW studies. The multilocus analyses did not show stronger evidence of association than the single SNP-based analyses (Supplementary Tables S6-7).
LMO2 and BCL2 were not statistically significantly associated with NHL or any NHL subtype (FDR value for minP test, >0.5; Table 3). However, in each gene, the association with NHL for at least one SNP could not be disregarded based on a Ptrend of <0.01 and consistency of risk estimates in all 3 studies and across all 4 NHL subtypes (LMO2 rs3824848: ORCT, 1.10; 95% CI, 0.96-1.27; ORTT, 1.35; 95% CI, 1.08-1.69; Ptrend = 0.0098; BCL2 rs2849377: ORAT, 0.86; 95% CI, 0.74-1.01; ORTT, 0.41; 95% CI, 0.22-0.76; Ptrend = 0.0041; Supplementary Tables S4, S5, and S8). It was also notable that of the 12 SNPs in BCL2 related to NHL or an NHL subtype, 8 were particularly related to marginal zone lymphoma, although no clear patterns emerged to implicate a particular variant (Supplementary Table S4). The multilocus analyses did not show stronger evidence of association than the single SNP-based analyses (Supplementary Tables S6-7).
Although we observed suggestive associations (Ptrend < 0.05) for the one SNP we genotyped in PIM1 and for one of the two SNPs we genotyped in TP53, we could not explore these findings further because we did not have data for additional SNPs within these genes (Supplementary Table S4). We also observed suggestive associations (Ptrend < 0.05) for individual SNPs in BCL10, AICDA, and BAX, but the minP test, study-specific SNP-based analyses, and multilocus analyses generally did not support an association with risk of NHL overall or any NHL subtype (Supplementary Tables S4-8; Table 3).
Risk estimates were similar when we conducted the SNP-based analyses restricted to non-Hispanic Caucasians and stratified by age (<50, ≥ 50 years) and sex (data not shown).
Discussion
In this pooled analysis, we showed consistent evidence from three population-based case-control studies that common genetic variation in cell cycle, apoptosis, and lymphocyte development regulatory genes may play a role in lymphomagenesis, and the effects may vary by NHL subtype. In particular, we found that two variants in linkage disequilibrium in the proapoptotic gene BCL2L11 (BIM) were significantly related to follicular lymphoma risk, and one variant in BCL7A, which is involved in a rare NHL-associated translocation, was significantly related to DLBCL risk. We also observed notable associations for variants in BCL6 and CCND1 with risk of NHL overall, and variants in MYC with risk of CLL/SLL. We observed suggestive associations for at least 1 variant in 7 of the remaining 15 genes we evaluated, but overall the findings for these genes were not compelling.
BCL2L11 (also known as BIM) is a key proapoptotic member of the BCL2 family that maintains hematopoietic cell homeostasis by initiating apoptosis in lymphocytes, regulating the negative selection of autoreactive lymphocytes, and balancing the proliferative and antiapoptotic effects of BCL2 (31-34). Several isoforms of BCL2L11 created by both transcriptional and posttranslational modification have been identified and shown to have varying proapoptotic activity (35, 36). Furthermore, diminished expression of BCL2L11 has been associated with melanoma progression (37), renal cell carcinoma (38), and glioblastoma (39). We present here the first report on common genetic variation in BCL2L11. The two variants in BCL2L11 for which we observed a particularly striking association with follicular lymphoma (rs7567444, rs3789068) were in linkage disequilibrium in our control population and tag variants spanning most of BCL2L11. If our findings are replicated, it will be necessary to conduct additional genotyping across the entire gene to determine which region contains the causal variant(s).
BCL7A was identified by its participation in a three-way chromosomal translocation with MYC and IgH in a Burkitt lymphoma cell line and has also been shown to be rearranged in a mediastinal B-cell lymphoma cell line (40). Although the function of BCL7A is unknown, the protein shows homology with the actin-binding protein, caldesmon, and is part of an evolutionarily conserved family that also includes BCL7B and BCL7C (41). We present the first report on common genetic variation in BCL7A, although diminished expression of BCL7A has been associated with mycosis fungoides (42), peripheral T-cell lymphoma (43), more aggressive clinical behavior of cutaneous T-cell lymphoma (44), and poorer prognosis for DLBCL (45). The variant in BCL7A for which we observed a particularly strong association with DLBCL (rs1880030) tags eight other loci located in or near exon 5. More research is needed to discover the function of BCL7A and replicate our findings, particularly focusing on the region of the gene surrounding exon 5.
We also observed notable associations for variants in BCL6 and CCND1 with risk of NHL overall, and variants in MYC with risk of CLL/SLL. All three of these genes play important roles in the cell cycle and/or lymphocyte development (46-48) and have been implicated in lymphomagenesis by several lines of evidence (7-12, 45, 49, 50). However, there is limited previous research associating lymphoma with common genetic variation in BCL6 and CCND1, and no previous research for MYC. The BCL6 findings from the pooled data set were consistent with our previous report from the Connecticut study only (25) but do not provide support for two other previous studies of follicular lymphoma in relation to SNPs in the regulatory first intronic region of BCL6 (51, 52). The CCND1 splice variant G870A (rs603965), which we previously reported for the NCI-SEER study (24), has also been associated with acute lymphoblastic leukemia (48). Although no previous research has associated lymphoma with common genetic variation in MYC, the two rare variants in MYC (rs3891248, rs16902359) associated with CLL/SLL in this pooled analysis are singletons located in the promoter and first intronic region of MYC. Chromosomal translocation breakpoints clustered in this region have been shown to have a greater effect on MYC overexpression in Burkitt lymphomas than breakpoints in other regions of MYC (53). Because of the importance of BCL6, CCND1, and MYC in the cell cycle and/or lymphocyte development as well as carcinogenesis, we believe further study of common genetic variation in these genes and lymphoma risk is warranted.
Of the remaining 15 candidate genes we evaluated in this pooled analysis, we observed suggestive associations for at least one variant in each of 7 genes, but overall, the findings for these genes were not compelling. For three of these genes (LMO2, BCL2, BCL10), we successfully genotyped ≥85% of the SNPs identified by our tagging algorithm from both HapMap Build 20 and the current version of HapMap (Build 22). However, for the remaining four genes (TP53, PIM1, BAX, AICDA), we successfully genotyped ≤70% of the SNPs identified by our tagging algorithm from both HapMap Build 20 and the current version of HapMap (Build 22). The publication of our complete results from all SNPs in all 20 of the candidate genes can be used to compare results of future research on these variants in relation to lymphomagenesis (Supplementary Table S4).
The main strength of this analysis was our ability to evaluate the associations in three independent study populations. Interpretation of our results should also take into account several limitations. We did not have data on a sufficient number of unlinked, unassociated SNPs to quantitatively assess population structure within our data. However, it is unlikely that our results were biased by population stratification because our results were similar in three independent study populations, and it is unlikely that the same substructure would be repeated in multiple studies. In addition, our risk estimates were similar when we restricted the analytic population to non-Hispanic Caucasians (data not shown). Participation (percentage interviewed among those approached) was low in the three studies, particularly for controls. However, it is unlikely that participation bias would completely explain our findings because it is unlikely that genotype frequencies vary by willingness to participate (21). Survival bias could have influenced our results for those genotypes also associated with prognosis because some patients with more aggressive disease were too ill to participate or died before study investigators could contact them, and common genetic variants associated with NHL etiology may also be associated with survival (54). Although all cases had histologically confirmed NHL, our results for NHL subtypes could have been biased by disease misclassification among the subtypes. However, diagnostic accuracy is estimated to be >80% for most NHL subtypes (55, 56), and any disease misclassification was likely to be non-differential, thus biasing our results toward the null hypothesis. We may have had some false negative results because of inadequate coverage of the SNPs identified in HapMap, or because the genetic variation identified by HapMap does not uniformly cover the genome. 
Finally, our results require replication in other study populations because some findings may be the result of false positive associations. However, by combining data from three studies, we were able to evaluate pooled risk estimates as well as risk estimates in three independent populations, minimizing the chance of false positive associations particularly for our strongest findings.
In summary, we found consistent evidence in three population-based case-control studies that common genetic variation in cell cycle, apoptosis, and lymphocyte development regulatory genes may play a role in lymphomagenesis, and the effects may vary by NHL subtype. Replication of our results, particularly in studies with sufficient power to evaluate NHL subtypes, and further study to identify functional SNPs are warranted.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Grant support: All genotyping and statistical analysis for this project was supported by the Intramural Research Program of the NIH (National Cancer Institute). The National Cancer Institute-Surveillance, Epidemiology, and End Results study was also supported by the Intramural Research Program of the NIH (National Cancer Institute) and by Public Health Service contracts N01-PC-65064, N01-PC-67008, N01-PC-67009, N01-PC-67010, N02-PC-71105. The Connecticut study was also supported by NIH grant CA62006 from the National Cancer Institute. The New South Wales study was also supported by the National Health and Medical Research Council of Australia [(Bruce Armstrong) Project Grant number 990920], The Cancer Council New South Wales, and The University of Sydney Medical Foundation.
Acknowledgments
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
We thank Mary McAdams, Peter Hui, Michael Stagner, and Zeynep Kalaylioglu of Information Management Services, Inc., for their programming support. For the NCI-SEER study, we acknowledge the contributions of the staff and scientists at the SEER centers of Iowa, Los Angeles, Detroit, and Seattle for the conduct of the study's field effort. The NSW study was made possible by access to new notifications to the NSW Central Cancer Registry, which is funded by the NSW Health Department. Ann-Maree Hughes oversaw conduct of the study and Melisa Litchfield, Maria Agaliotis, Chris Goumas, Jackie Turner, and staff of the Hunter Valley Research Foundation contributed to the data collection. Jenny Turner, study pathologist, reviewed all pathology reports and original slides as necessary.