Abstract
Background: Cigarette smoking is associated with increased head and neck cancer (HNC) risk. Tobacco-related carcinogens are known to cause bulky DNA adducts. Nucleotide excision repair (NER) genes encode enzymes that remove adducts and may be independently associated with HNC, as well as modifiers of the association between smoking and HNC.
Methods: Using population-based case–control data from the Carolina Head and Neck Cancer Epidemiology (CHANCE) Study (1,227 cases and 1,325 controls), race-stratified (White, African American), conventional, and hierarchical logistic regression models were used to estimate ORs with 95% intervals (I) for the independent and joint effects of cigarette smoking and 84 single-nucleotide polymorphisms (SNP) from 15 NER genes on HNC risk.
Results: The odds of HNC were elevated among ever cigarette smokers and increased with smoking duration and frequency. Among Whites, rs4150403 on ERCC3 was associated with increased HNC odds (AA+AG vs. GG; OR, 1.28; 95% CI, 1.01–1.61). Among African Americans, rs4253132 on ERCC6 was associated with decreased HNC odds (CC+CT vs. TT; OR, 0.62; 95% CI, 0.45–0.86). Interactions between ever cigarette smoking and three SNPs (rs4253132 on ERCC6, rs2291120 on DDB2, and rs744154 on ERCC4) suggested possible departures from additivity among Whites.
Conclusions: We did not find associations between some previously studied NER variants and HNC. We did identify new associations between two SNPs and HNC and three suggestive cigarette–SNP interactions to consider in future studies.
Impact: We conducted one of the most comprehensive evaluations of NER variants, identifying a few SNPs from biologically plausible candidate genes associated with HNC and possibly interacting with cigarette smoking. Cancer Epidemiol Biomarkers Prev; 22(8); 1428–45. ©2013 AACR.
Introduction
Head and neck cancer (HNC) includes tumors, principally squamous cell carcinomas, of the oral cavity, pharynx, and larynx (1). In the United States, an estimated 52,610 incident HNC cases and 11,500 associated deaths occurred in 2012 (2). Cigarette smoking is a major risk factor for HNC incidence with case–control studies consistently reporting elevated ORs for ever smoking, as well as dose–response gradients with duration and frequency (3). Among nonalcohol drinking HNC cases, 25% of cases are attributed to cigarette smoking (4).
Cigarette smoke contains numerous carcinogens, such as benzo-a-pyrene, that are known to cause DNA damage, including adducts (3, 5–7). Nucleotide excision repair (NER) enzymes are principally responsible for removing bulky DNA adducts, and are therefore hypothesized to be independent risk factors for HNC, as well as important modifiers of the association between smoking and HNC (5–7). Several previous studies have considered associations between variants in NER genes and HNC risk, but studies vary with regard to which specific single-nucleotide polymorphisms (SNP) were investigated and often present inconsistent evidence for analysis of the same SNP (8–49). In general, most previous studies have evaluated only a few SNPs on a single NER gene among a few hundred HNC cases (8–49). Few studies have examined the association of NER SNPs and HNC among African-Americans (16), a group shown to have a stronger association for smoking and HNC (50). Studies of cigarette–SNP joint effects have also been limited by sparse numbers of NER variants and small sample sizes and present varying results, though some studies indicate stronger associations among smokers with polymorphisms in NER genes (8, 10–12, 14, 16, 17, 23, 25, 27–29, 31, 32, 34, 36–40, 44).
To comprehensively assess associations between cigarette smoking, NER genes, and HNC risk, we used data from the Carolina Head and Neck Cancer Epidemiology (CHANCE) Study to estimate main and joint effects of cigarette smoking and 84 SNPs across 15 NER genes on HNC risk among a racially diverse population including Whites (922 cases and 1,074 controls) and African Americans (305 cases and 251 controls).
Materials and Methods
Study population
The CHANCE Study is a population-based case–control study of 1,389 cases and 1,396 controls from 46 of 100 counties in North Carolina (50–52). Eligible participants were 20 to 80 years of age (50–52). Cases were identified from the North Carolina Central Cancer Registry between January 1, 2002 and February 28, 2006 using rapid case ascertainment (50–52). Tumors were classified according to ICD-O-3 codes: squamous cell carcinomas of the oral cavity (C02.0-C02.3, C03.0-C03.1, C03.9-C04.1, C04.8-C05.0, C06.0-C06.2, and C06.8-C06.9), oropharynx (C01.9, C02.4, C05.1-C05.2, C09.0-C09.1, C09.8-C10.4, and C10.8–C10.9), hypopharynx (C12.9-C13.2 and C13.8-C13.9), larynx (C32.0-C32.3 and C32.8-C32.9), and oral cavity/pharynx not otherwise specified (C02.8-C02.9, C05.8–05.9, C14.0, C14.2, and C14.8) were included in the study, whereas tumors of the salivary glands, nasopharynx, nasal cavity, and nasal sinuses were excluded (50–53). Controls were randomly sampled from the North Carolina Department of Motor Vehicle records and frequency matched to cases within strata of age, race, and sex (50–52).
For this analysis, we excluded cases and controls who did not provide blood or buccal cell samples, whose samples were insufficient for genotyping, or whose samples did not otherwise meet quality control criteria [115 (8.3%) cases and 53 (3.8%) controls] (52). We further excluded individuals who self-reported race other than White or African American because of sparse data [26 (1.9%) cases and 18 (1.3%) controls] and cases with lip cancers because of etiologic differences [21 (1.5%) cases; ref. 52]. Our final sample included 1,227 HNC cases and 1,325 controls.
Cigarette smoking
Self-reported demographic and behavioral information was ascertained through nurse-administered questionnaires (50–52). Information on cigarette smoking included ever/never, current/former, frequency (cigarettes/day), and duration (years). Information on environmental tobacco smoke (ETS) included ever/never and duration (years) of exposure in the home and at work (50).
SNP selection and genotyping
Blood (∼90%) or buccal cell (∼10%) samples were collected from cases and controls at the time of interview for DNA extraction (52). An Illumina GoldenGate assay with Sentrix Array Matrix and 96-well standard microtiter platform was used to genotype 1,536 SNPs, including 129 SNPs in 15 NER genes (52, 54). Seventy-one tag SNPs in NER genes were selected on the basis of a case–control study of HNC at MD Anderson Cancer Center (Houston, TX), which queried National Institute of Environmental Health Sciences–Environmental Genome Project (NIEHS-EGP) and HapMap databases using selection criteria of r2 ≥ 0.80, a minor allele frequency (MAF) ≥ 0.05, 1 to 2 kb flanking regions, and the Utah Residents with Northern and Western European Ancestry (CEU) population (Supplementary Table S1; Q. Wei; personal communication; refs. 55, 56). Another 58 SNPs in NER genes were selected on the basis of several criteria including association in other cancer studies and/or potential function (Supplementary Table S1). We excluded 14 SNPs for which genotyping resulted in poor signal intensity or genotype clustering (52), as well as SNPs with a MAF less than 0.05 (31 SNPs among Whites and 36 SNPs among African Americans; Supplementary Table S1). Most excluded SNPs had been selected on the basis of previous literature and/or function (Supplementary Table S1). Among the remaining SNPs, genotype frequencies for 7 SNPs in Whites and 7 SNPs in African Americans were inconsistent with Hardy–Weinberg equilibrium (HWE; P < 0.05; Supplementary Table S1); however, because genotyping scatter plots showed reasonable genotype clustering, these SNPs were included in analyses but interpreted with caution (57). Our final analysis included 84 SNPs in 14 NER genes among Whites and 79 SNPs in 15 NER genes among African Americans.
Statistical analysis
Cigarette smoking–HNC associations.
Unconditional logistic regression models were used to estimate ORs with 95% intervals (I) for the main effects of cigarette smoking and ETS on HNC risk. Adjusted cigarette smoking and ETS models included matching factors (age, sex, and race), education, and lifetime consumption of alcohol (categorical milliliters of ethanol). ETS ORs were additionally adjusted for duration of cigarette smoking (continuous, years), as well as stratified by ever/never cigarette smoking. Information on human papillomavirus (HPV) infection is not currently available in CHANCE, and was therefore not considered in analyses. Cigarette smoking and ETS models were considered in the overall study population and stratified by race (White and African American).
SNPs–HNC associations.
For SNPs, race-stratified hierarchical unconditional logistic regression was used to estimate ORs and 95% CIs for the main effects of SNPs on HNC risk (as well as tumor site–specific risk) by including a SNP-gene matrix to account for clustering of SNP data by gene (58, 59). Because the conventional logistic regression approach of modeling one SNP at a time with P values corrected for multiple comparisons using the Bonferroni method is overly conservative as it assumes tests are independent, which is not the case with potentially correlated exposures, we selected a hierarchical approach (58, 59). Results from the conventional approach are provided in Supplementary Tables.
We used a two-stage hierarchical model:
where pi represents the probability of case status in the sample, Xij contains indicators of SNPs, and Wi represents important covariates or potential confounders (58, 59).
where βj represents the coefficients for the effects of the SNPs, Zj represents the matrix linking SNPs with their associated genes, and δj represents independent errors, which are normally distributed with a mean of zero and a variance of τ2 (58, 59). To avoid over-parameterization by modeling one large SNP-gene matrix (i.e., including all 84 SNPs across 15 genes) in a single model, 15 models, one for each gene, were used to shrink estimates for SNPs on the same gene toward a common gene effect (i.e., the Z matrix was a single column representing a single gene, with rows of 1′s for each SNP). Because SNPs on the same gene were included in the same model, we excluded some SNPs due to extreme colinearity (estimated correlation ρ > 0.98; 11 SNPs in Whites and 5 SNPs in African Americans). A semi-Bayes approach was used to set τ2 to 0.05, as this corresponded with a plausible range of expected ORs for the association between SNPs and HNC based on previous literature (i.e., 0.6 to 1.6; ref. 58). Sensitivity analyses with τ2 = 0.01, τ2 = 0.10, and τ2 = 1.0 evaluated robustness of this choice.
SNPs were defined using a dominant genetic model given the large portion of SNPs with few cases and controls homozygous for the variant allele (∼7% among Whites and ∼33% among African Americans). The referent allele for both Whites and African Americans was assigned to be the major allele based on controls from the overall study population (which was concurrent with the race-specific major allele for 98% of SNPs in Whites and 92% of SNPs in African Americans). Because genetic exposures were based on germline DNA, which would not reflect the influences of smoking, drinking, or HPV infection, SNP models were only adjusted for matching factors (sex and age) and ancestry (continuous proportion African ancestry), as informed by our directed acyclic graph (DAG) analysis (60). On the basis of previous studies of cancer among Whites and African Americans in North Carolina, 145 ancestral informative markers (AIMS) were selected on the basis of differences in allele frequencies between European and African HapMap populations and used to estimate the proportion of African ancestry in each participant based on Fisher's information criterion (FIC; refs. 52, 61–63).
Joint effects.
ORs and 95% CIs for the joint effects of cigarette smoking and SNPs in NER genes were estimated using conventional and hierarchical logistic regression. Joint effects were modeled using disjoint indicator variables for (i) individuals who smoked and had the referent genotype, (ii) individuals who did not smoke and had the variant genotype, and (iii) individuals who smoked and had the variant genotype (58). As described by Hung and colleagues, hierarchical models included a 3 × 2 gene–environment matrix to account for clustering of disjoint indicator variables by SNP and cigarette effects (i.e., the Z matrix had 2 columns, one representing SNP effects and one representing smoking effects, and 3 rows, each representing the disjoint indicator variables, with 1′s and 0′s entered according to concordance of rows and columns; ref. 58). A τ2 of 0.35 was used to correspond to expected ORs between 0.3 and 3.0 for each indicator variable (58). Sensitivity analyses with τ2 = 0.05 evaluated robustness of this choice. Joint effects models were stratified by self-reported race. Only joint effect estimates among Whites are presented because small cell counts among African Americans prohibited reliable estimation for most SNP–cigarette effects. Joint effects models were adjusted for matching factors (sex and age), education, alcohol drinking, and ancestry as both behavioral and genetic exposures were being modeled. Interactions between SNPs and cigarette smoking were assessed on the additive scale using the relative excess risk due to interaction (RERI), with 95% CIs calculated using the Hosmer and Lemeshow method (64). All statistical analyses were conducted using SAS 9.3 (Cary, NC) (65).
Results
Study population
The study population included 922 cases and 1,074 controls who self-reported race as White and 305 cases and 251 controls who self-reported African American (Table 1). The majority of cases (76.4%) and controls (69.7%) were male. Approximately, one-third of cases (33.6%) and controls (30.2%) were between the ages of 55 and 65 years. Controls were more highly educated than cases with 60.7% of controls attending college compared with 38.6% of cases.
Cigarette smoking–HNC associations
The adjusted OR for ever compared with never cigarette smoking was elevated in the overall (2.28; 95% CI, 1.81–2.88; Table 2) and race-stratified study populations (1.97; 95% CI, 1.54–2.53 among Whites and 7.75; 95% CI, 3.57–16.83 among African Americans). Furthermore, the risk of HNC increased with increasing frequency and duration of cigarette smoking (Ptrend < 0.0001). In contrast, we did not observe strong associations between ETS and HNC (Supplementary Table S2). Adjusted ORs for ever compared with never ETS exposure were not elevated when stratified by race (0.87; 95% CI, 0.63–1.19 among Whites and 0.91; 95% CI, 0.45–1.82 among African Americans) or by active cigarette smoking (0.84; 95% CI, 0.54–1.33 among never cigarette smokers and 0.92; 95% CI, 0.62–1.37 among ever cigarette smokers). Duration of ETS exposure at work or home was also not associated with HNC risk (Supplementary Table S2).
SNPs–HNC associations
Among Whites, most ORs were close to the null value for associations between SNPs and HNC (Table 3). The SNP rs4150403 on the excision repair cross-complementing 3 (ERCC3) gene, also known as xeroderma pigmentosum B (XPB), however, was statistically significantly associated with elevated HNC risk (AA+AG vs. GG; OR, 1.28; 95% CI, 1.01–1.61). In addition, another SNP on ERCC3 (XPB), rs4150496, suggested a possible reduced HNC risk among Whites (AA+AG vs. GG; OR, 0.80; 95% CI, 0.62–1.02). When we considered associations between these SNPs and each tumor site separately, associations between rs4150403 and oral cavity cancer resulted in the largest magnitude OR (1.32; 95% CI, 1.01–1.71; Supplementary Table S3). For rs4150496, associations with oral cavity and oropharyngeal cancers resulted in the smallest magnitude ORs (OR, 0.79; 95% CI, 0.60–1.04 and OR, 0.77; 95% CI, 0.56–1.06, respectively).
Among African Americans, one SNP on ERCC6 (also known as Cockayne Syndrome B, CSB), rs4253132, was significantly associated with reduced HNC risk (CC+CT vs. TT; OR, 0.62; 95% CI, 0.45–0.86; Table 4). Because of low cell counts, we were unable to assess the association between this SNP and all tumor sites among African Americans. We did find, however, that rs4253132 was significantly associated with reduced risk of laryngeal cancer (OR, 0.65; 95% CI, 0.44–0.97; Supplementary Table S4).
No other significant SNP–HNC associations were detected, including none of the extensively studied associations between SNPs in ERCC2 (also known as XPD), ERCC1, or ligase 1 (LIG1) and HNC risk. In particular, we did not find an association between rs13181 in ERCC2 (XPD) and HNC among Whites (GG+TG vs. TT; OR, 1.05; 95% CI, 0.76–1.45; Table 3) nor among African Americans (OR, 1.01; 95% CI, 0.75–1.37; Table 4). In sensitivity analyses, results from Tables 3 and 4 were robust to further adjustment for cigarette smoking and alcohol drinking and variation of τ2 (i.e., results were similar when adjusting for cigarette smoking and alcohol drinking or when τ2 = 0.01, 0.10, and 1.0 rather than 0.05, though the OR for rs4150403 among Whites was nonsignificantly elevated when adjusting for cigarette smoking and alcohol drinking or when τ2 = 0.01; data not shown). Compared with the hierarchical model, ORs (95% CIs) for the conventional model were similar though less stable, with a few additional SNP–HNC associations implicated at 0.05 α level but none at a Bonferroni-corrected significance level of 0.0006 (Supplementary Tables S5 and S6).
Joint effects
Using the conventional method (Table 5), interactions between ever cigarette smoking and 3 SNPs suggested possible departures from the null on the additive scale at an uncorrected 0.05 α level among Whites. Specifically, the interaction between cigarette smoking and rs4253132 on ERCC6 (CSB, RERI = 0.70; 95% CI, 0.14–1.26) and rs2291120 on DDB2 (XPE, RERI = 0.68; 95% CI, 0.11–1.26) seemed to be more than additive, whereas the interaction between cigarette smoking and rs744154 on ERCC4 (XPF, RERI = −1.02; 95% CI, −2.02 to −0.02) seemed to be less than additive. However, RERI estimates were generally imprecise and none were significant at a Bonferroni-corrected significance level (Table 5). Furthermore, genotype frequencies for rs4253132 on ERCC6 among Whites seemed inconsistent with HWE at a 0.05 α level, although the genotype-clustering plot seemed reasonable, and should therefore be cautiously interpreted. ORs (95% CIs) for joint effects from the hierarchical model (Table 6) were similar to estimates from the conventional method. RERI point estimates were also similar between the two methods, but we were unable to estimate 95% CIs for RERI estimates using hierarchical regression. Joint effects of SNPs and former/current cigarette smoking as well as SNPs and ETS among Whites are provided in Supplementary Tables S7 and S8, respectively, and highlight a few other potential gene–environment interactions. Among African Americans, no significant ever cigarette–SNP interactions were noted; however, estimates were unreliable due to relatively low cell counts and are therefore not presented.
Discussion
Consistent with extensive literature, we found a positive association between cigarette smoking and HNC risk (3). In particular, we found noticeably larger ORs among African Americans compared with Whites. A detailed analysis of smoking–HNC associations by race using CHANCE data has been previously published (50). Briefly, elevated HNC ORs among African American cigarette smokers were noted even when accounting for frequency and duration of smoking, mentholated versus nonmentholated cigarettes, and tumor site (50). Racial differences in carcinogen metabolism and smoking cessation patterns may be contributing factors (50).
Our study identified associations between 2 SNPs in the same NER gene and HNC among Whites. Specifically, we detected elevated HNC risk associated with rs4150403 and possibly reduced HNC risk with rs4150496. These SNPs are in intron 3 and 11, respectively, of ERCC3 (XPB), responsible for encoding a component of the transcription factor II H (TFIIH) subunit which unwinds the double helix surrounding a DNA adduct, and are not in linkage disequilibrium with each other, but are in linkage disequilibrium with untyped SNPs near or in introns or the 3′-untranslated region (UTR) of the gene (r2 > 0.80, CEU population; refs. 6, 66–69). Previous epidemiologic studies of HNC have not considered these SNPs. Only one previous study has examined the effects of any variant in ERCC3 (XPB), finding reduced HNC risk associated with rs4233583 (AA vs. CC; OR, 0.37; 95% CI, 0.15–0.90), a 3′-UTR SNP, which is correlated with rs4150496 (r2 = 0.96; CEU population; refs. 32, 67, 68).
An association between rs4253132 and reduced HNC risk was detected among African Americans in our study. This SNP occurs in intron 10 of ERCC6, which operates in transcription-coupled NER, and is in linkage disequilibrium with about a dozen other untyped intronic SNPs (r2 > 0.80; Yoruba in Ibadan, Nigeria (YRI) population; refs. 6, 66–69). Two previous studies have collectively reported on associations between 5 SNPs in ERCC6 and HNC risk; however, neither study evaluated rs4253132 nor considered an African American population. One study reported reduced HNC risk associated with rs4253211 (Arg/Pro+Pro/Pro vs. Arg/Arg; OR, 0.53; 95% CI, 0.34–0.85) and no association with rs2228527 (Arg/Gly+Gly/Gly vs. Arg/Arg; OR, 0.87; 95% CI, 0.61–1.20; ref. 8). Another study found elevated HNC risk associated with rs2228528 (GA+AA vs. GG; OR, 1.43; 95% CI, 1.02–2.01) and no association with rs2228526 (AG+GG vs. AA; OR, 0.82; 95% CI, 0.50–1.34) and rs2228529 (AG+GG vs. AA; OR, 0.79; 95% CI, 0.49–1.26; ref. 14). Our study also evaluated rs2228527, rs2228528, and rs2228529 finding near null associations among Whites and African Americans (ORs, ∼0.9). Although rs2228526, rs2228527, and rs2228529 are correlated (r2 = 1.0; CEU population), rs4253132, rs4253211, or rs2228528 are not (68).
Among all previous studies of NER variants and HNC, SNPs in ERCC2 (XPD) have been the most commonly investigated, particularly rs13181. ERCC2 (XPD) encodes a protein component of the TFIIH subunit, which denatures the double helix of DNA in preparation for excision of bulky DNA adducts (6, 66). More than 20 previous case–control studies have studied rs13181 and HNC risk, with the majority finding null associations (8–12, 15, 17, 18, 20–23, 25, 26, 30, 31, 33–35, 37, 38, 45, 48). The largest study, based on data from the International Head and Neck Cancer Epidemiology Consortium, found no association between rs13181 and HNC risk (Gln/Gln vs. Lys/Lys; OR, 1.03; 95% CI, 0.88–1.21; ref. 15). Likewise, we did not find strong evidence for an association between rs13181 and HNC risk among Whites or African Americans. Furthermore, several previous studies have found inconsistent associations for rs13181 within strata of cigarette smoking (10–12, 17, 23, 31, 34, 37, 38). In our study, we did not find an additive effect for smoking and rs13181.
Interactions between ever cigarette smoking and 3 SNPs, rs4253132 (intron 10 of ERCC6, in linkage disequilibrium with other untyped intronic SNPs; r2 > 0.80; CEU population), rs2291120 (intron 1 of DDB2, not in linkage disequilibrium with other SNPs), and rs744154 (intron 1 of ERCC4, in linkage disequilibrium with other untyped intronic SNPs and synonymous SNP rs1799801), were suggestive of possible super- or subadditive effects among Whites in our study (67–69). Using the conventional method, RERIs for these SNPs were significant at an uncorrected 0.05 α level, but not at a Bonferroni-corrected level. Using hierarchical regression, RERI point estimates were similar to those obtained from the conventional method. Although no previous studies considered interactions between cigarette smoking and rs4253132, rs2291120, or rs744154, four studies did investigate the effects of other SNPs, though not in linkage disequilibrium with implicated SNPs in our study, in ERCC6 and ERCC4 (XPF) within strata of cigarette smoking (8, 14, 27, 44, 68). Studies of rs4253211 in ERCC6 and rs1800067 and rs2276466 in ERCC4 reported similar SNP–HNC associations across strata of cigarette smokers (8, 27, 44), whereas other studies found rs2228528 on ERCC6 was associated with elevated HNC risk among ever smokers (GA+AA vs. GG; OR, 2.36; 95% CI, 1.36–4.10), but not among never smokers (OR, 0.99; 95% CI, 0.64–1.55; ref. 14) and rs3136038 on ERCC4 was associated with reduced HNC risk among nonsmokers (TT vs. CC+CT; OR, 0.55; 95% CI, 0.34–0.88), but not among smokers (OR, 0.96; 95% CI, 0.66–1.39; ref. 44).
Differences in joint effect results between the present and past studies may in part stem from differences in analytic approaches. Namely, most previous studies examined the effects of SNPs on HNC stratified by cigarette smoking but did not consider the ORs for singly and doubly exposed individuals (i.e., individuals who had the variant allele or smoked cigarettes or both), which would have allowed testing the interaction on the additive scale by calculating a RERI (8, 10–12, 14, 16, 17, 23, 25, 27–29, 31, 32, 34, 36–40, 44). Additional studies that assess interactions on the additive scale among large study populations are needed to follow-up our suggestive findings.
The present study builds upon the existing literature by (i) including one of the largest study populations to date, (ii) estimating race-stratified effects, and (iii) evaluating more NER genes, including more SNPs, than any single previous study. Besides two studies, which evaluated a limited number of SNPs in NER genes (13, 15), this is the largest candidate gene study to evaluate the independent and joint effects of cigarette smoking and SNPs in NER genes with respect to HNC. Most previous studies included a hundred to a thousand cases and controls (8–49). Furthermore, our study included more African Americans than any previous study. Only one previous study reported race-specific associations for a NER variant and upper aerodigestive cancers among 119 African Americans (16). Consideration of race-specific estimates is an important contribution of this study since HNC incidence, including patterns of risk factors such as cigarette smoking, varies by race, and SNP linkage disequilibrium patterns vary by ancestry (56, 70, 71). Despite our large overall and race-specific sample sizes, exploration of gene–environment interactions among African Americans was limited. HNC tumor site-specific estimates were also limited by sparse numbers.
In addition to including more individuals than previous studies, our analysis also examined more SNPs in NER genes than any previous study. Previous candidate gene studies have collectively examined approximately 60 SNPs in 10 NER genes and HNC risk (8–49). Our study alone included 84 SNPs across 15 NER genes. Although our study included the largest array of NER variants to date, it should be noted that selection of SNPs was based on a variety of approaches, which limited the variation captured across some genes, especially among African Americans. Specifically, tagging SNPs were not selected for all genes and SNPs were selected on the basis of only the CEU population. For this reason, we did not consider haplotypes. It is also important to note that SNPs found to be associated with HNC risk in this study occur in noncoding intronic regions and were not found to be in linkage disequilibrium with SNPs in coding regions (with the exception of rs744154, which is in linkage disequilibrium with rs1799801; refs. 67–69). Although intronic SNPs can be located within regulatory regions (e.g., splice sites; ref. 72), further research about the exact function of these SNPs will further elucidate potential associations with HNC.
Although we did not find associations between previously studied SNPs in NER genes and HNC risk, we identified two new associations. Among Whites, rs4150403 on ERCC3 (XPB) was associated with increased HNC risk. Among African Americans, rs4253132 on ERCC6 was associated with decreased HNC risk. Three suggestive ever cigarette smoking–SNP interactions were also identified. Although our study was one of the largest to date, studies with even larger sample sizes are needed to confirm these results, especially to estimate gene–environment interactions more precisely. Further studies focusing on African American and other diverse populations are recommended.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Authors' Contributions
Conception and design: A.B. Wyss, A.F. Olshan
Development of methodology: A.B. Wyss, A.H. Herring, C.L. Avery, J.T. Bensen, J.S. Barnholtz-Sloan, A.F. Olshan
Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): M.C. Weissler, W.K. Funkhouser, A.F. Olshan
Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): A.B. Wyss, A.H. Herring, C.L. Avery, M.C. Weissler, J.T. Bensen, J.S. Barnholtz-Sloan, A.F. Olshan
Writing, review, and/or revision of the manuscript: A.B. Wyss, A.H. Herring, C.L. Avery, M.C. Weissler, J.T. Bensen, J.S. Barnholtz-Sloan, W.K. Funkhouser, A.F. Olshan
Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): A.F. Olshan
Study supervision: M.C. Weissler
Acknowledgments
The authors thank Dr. Robert Millikan for his substantial contributions to the conceptual development and analyses of this article. The authors also thank Dr. Anne Hakenewerth for assistance in research development, Ms. Kathy Wisniewski for programming support, and Dr. Katie O'Brien and Mr. Nikhil Khankari for assistance on the hierarchical models.
Grant Support
This work has been supported by the U.S. National Institutes of Health (NIH), National Cancer Institute (NCI; R01CA90731-01 and 2-T32-CA09330-21-26) and National Institute of Environmental Health Sciences (NIEHS) (T32ES007018 and P30ES010126). C.L. Avery was supported in part by the National Heart, Lung, and Blood Institute (R00-HL-098458).
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.