Background: Genome-wide association studies (GWAS) have identified multiple single-nucleotide polymorphisms (SNP) associated with lung cancer. However, whether these SNPs are associated with genetic damage, a crucial event in cancer initiation and evolution, is still unknown. We aimed to establish associations between these SNPs and genetic damage caused by the ubiquitous carcinogens, polycyclic aromatic hydrocarbons (PAH).

Methods: We cross-sectionally investigated the associations between SNPs from published GWAS for lung cancer in Asians and PAH-induced genetic damage in 1,557 coke oven workers in China. Urinary PAH metabolites, plasma benzo[a]pyrene-r-7,t-8,c-10-tetrahydrotetrol-albumin (BPDE-Alb) adducts, urinary 8-hydroxydeoxyguanosine (8-OHdG), and micronuclei (MN) frequency were determined by gas chromatography-mass spectrometry, sandwich ELISA, high-performance liquid chromatography, and cytokinesis-block micronucleus assay, respectively.

Results: 13q12.12-rs753955C was suggestively associated with elevated 8-OHdG levels (P = 0.003). Higher 8-OHdG levels were observed in individuals with rare allele homozygotes (CC) than in TT homozygotes (β, 0.297; 95% confidence interval, 0.124–0.471; P = 0.001). 9p21-rs1333040C, 10p14-rs1663689G, and 15q25.1-rs3813572G were significantly associated with lower MN frequency (P values were 0.002, 0.001, and 0.005, respectively). 10p14-rs1663689G polymorphism downregulated the relationship of the total concentration of PAH metabolites to 8-OHdG levels (Pinteraction = 0.002). TERT-rs2736100G and VTI1A-rs7086803A aggravated the relationship of BPDE-Alb adducts to MN frequency, whereas BPTF-rs7216064G attenuated that correlation (all Pinteraction < 0.001).

Conclusions: Lung cancer risk–associated SNPs and their correlations with PAH exposure were associated with 8-OHdG levels and MN frequency.

Impact: Lung cancer risk–associated SNPs might influence one's susceptibility to genetic damage caused by PAHs. Cancer Epidemiol Biomarkers Prev; 23(6); 986–96. ©2014 AACR.

Lung cancer is the leading cause of cancer-related mortality worldwide, including China (1, 2), and is a complex disease caused both by genetic and environmental factors and by their interactions (3, 4). Genome-wide association studies (GWAS) have identified multiple single-nucleotide polymorphisms (SNP) that are associated with lung cancer risks, supporting the suggestion that inherited genetic factors play a significant role in lung cancer development. Environmental pollutants such as polycyclic aromatic hydrocarbons (PAH) also contribute to lung cancer risks (5–7). In China, the population attributable fraction for lung cancer caused by inhalation exposure to PAHs was >44% in isolated locations near small-scale coke oven operations, dramatically exceeding that in the overall population (1.6%; ref. 8). However, despite the fact that more than 80% of lung cancers are attributed to tobacco, only a fraction of smokers (∼10%) will ultimately develop lung cancer in their lifetimes (9), indicating that inherited genetic factors modify the effects of carcinogenic compounds in the tobacco smoke.

Genetic damage, including DNA and chromosomal damage, is a key initial event in the carcinogenic potential of PAHs (10, 11) and a crucial event in the initiation and evolution of cancer (12). Elevated genetic damage levels were observed both in individuals exposed to PAHs (13, 14) and in lung cancer patients (15) compared with controls. Of note, 8-hydroxydeoxyguanosine (8-OHdG) was a biomarker reflecting the levels of oxidative DNA damage and micronuclei (MN) frequency was widely used in the measurement of chromosomal damage (16, 17). Both were also reported to be associated with an increased lung cancer risk (18, 19). However, published evidence showed that individuals who lived in similar environments had different levels of 8-OHdG (20) and MN frequency (21), as well as different risks for lung cancer, suggesting that genetic factors were important in the chemical carcinogenesis of PAHs. However, shared genetic variants for both DNA and chromosomal damage and lung cancer risks still need to be investigated further, and it remains unknown whether lung cancer–related genetic variants contribute to disease susceptibility via the initiation of damage to DNA and chromosomes. Moreover, it is also yet to be determined whether or not these genetic variants can modify the carcinogenic effects of PAHs. GWAS has identified multiple SNPs associated with lung cancer risks, yet whether those SNPs are associated with levels of PAH-induced DNA and chromosomal damage is still unknown.

Thus, in the present study, we hypothesize that lung cancer risk–related SNPs identified from GWAS themselves may be associated with DNA and chromosomal damage and may also aggravate the magnitude of PAH-induced genetic damage. As coke oven workers had high levels of genetic damage and were at high risk for lung cancer due to their long-term occupational exposure to PAHs, we tested this hypothesis by cross-sectionally investigating the associations of SNPs identified from GWAS for lung cancer in Asians with the levels of 8-OHdG and MN frequency in coke oven workers. We then explored the relationships between these SNPs and PAH exposure, which were assessed by PAH metabolites and plasma benzo[a]pyrene-r-7,t-8,t-9,c-10-tetrahydotetrol-albumin (BPDE-Alb) adducts.

Study participants

A total of 1,715 workers were recruited from a coke oven plant in Wuhan (Hubei, China) in the autumn of 2010. All of them were classified into three groups according to their work places: Participants who worked at the top, side, and bottom of coke ovens were referred to as the high-exposure group. Individuals who worked in adjunct workplaces and in the logistics departments and offices were defined as the low-exposure and control groups, respectively. All subjects provided written informed consent and the Ethics and Human Subject Committee of Tongji Medical College approved this research project. Each participant was interviewed one-on-one to collect information on demographic characteristics, health status, body weight and height, smoking status, alcohol consumption, occupational location, and lengths of employment by means of a standardized occupational questionnaire. For each subject, a 6-mL venous blood sample was drawn and distributed into two coded vacuum tubes (1 mL for EDTA-K2 anticoagulant and 5 mL for heparinized anticoagulant) and 20 mL of morning urine samples were collected in sterile conical tubes. The main inclusion criteria for this study were that participants should have worked not less than 1 year, had no physician-diagnosed cancer, and that both blood and urine were available. Finally, 1,557 subjects (1,333 males and 224 females) who met these criteria were enrolled for further analysis.

SNP selection and genotyping

We included common SNPs (MAF > 0.05) identified from published GWAS for lung cancer risk in Asians (3, 22–29), with the exception of one research project published after our experiment was finished (30). Considering the ethnic differences, the SNPs restricted to non-Asians were ignored. Another four SNPs on 15q25 (rs2036534, rs667282, rs12910984, and rs6495309) identified in a Chinese population were also recruited (31). As rs401681, rs11080466, and rs6495309 were in high linkage disequilibrium (LD; r2 ≥ 0.8) with rs402710, rs11663246, and rs12910984, respectively, the latter ones were discarded. Finally, a total of 28 SNPs were included in this study. The iPLEX system (Sequenom) was used for genotyping. Due to the failure of primer design, we replaced rs17728461, rs2036534, rs2395185, and rs10433328 (called original SNPs) with rs16988393, rs3813572, rs9272346, and rs4677662, respectively, according to the following criteria: (i) the SNPs were in moderate LD (logarithm of odds of >3 and D' of >0.99) with original SNPs; (ii) the SNPs were located in the regions from 1,000 kb upstream to 1,000 kb downstream of the original SNPs; (iii) when multiple SNPs met the above two criteria, SNPs residing in functional regions, including exons, untranslated regions, and promoters regions, were prioritized. If none of these SNPs were located in functional regions, the strongest LD SNP was prioritized.

Determination of urinary PAH metabolites

We determined 12 urinary PAH metabolites, including 1-, 2-hydroxynaphthalene, 2-, 9-hydroxyfluorene, 1-hydroxypyrene, 1-, 2-, 3-, 4-, 9-hydroxyphenanthrene, 6-hydroxychrysene, and 3-hydroxybenzo[a]pyrene by gas chromatography-mass spectrometry, which was reported in detail previously (32) with minor modifications as described in our previous study (33). We failed to detect 6-hydroxychrysene and 3-hydroxybenzo[a]pyrene because their concentrations were less than the limits of detection (LOD). The LOD of the remained 10 PAH metabolites ranged from 0.1 to 1.4 μg/L. For measurements below LOD, we used half of LOD as default values. We calibrated the concentrations of urinary PAH metabolites by levels of urinary creatinine and expressed them as μg/mmol creatinine.

Determination of plasma BPDE-Alb adducts

The concentrations of BPDE-Alb adducts in heparin-anticoagulated plasma were detected by sandwich ELISA described in detail previously (33). Briefly, the 96-well plates were coated with rabbit anti-mouse immunoglobulin G-Fc antibody and then blocked with nonfat dry milk. After that, monoclonal antibody 8E11 and biotin-conjugated rabbit anti-albumin antibody were used as primary and secondary antibodies, respectively, according to standard ELISA procedures. Each standard or sample was assayed in duplicate. We calibrated concentrations of BPDE-Alb adducts by plasma albumin and expressed them as ng/mg albumin. The LOD of this method was approximately 1 ng BPDE-Alb adducts/mg albumin. Half of LOD were used as default values for measurements below LOD.

Determination of genetic damage

The oxidative DNA-damage levels were assessed by urinary 8-OHdG using high-performance liquid chromatography reported previously (34). The detailed experimental procedure was described in our previous study (33). The LOD of 8-OHdG was 7 nmol/L. For measurements below LOD, we used half of LOD as default values. The urinary concentrations of 8-OHdG were calibrated by levels of urinary creatinine and expressed as nmol/mmol creatinine.

The chromosomal damage was measured by cytokinesis-block micronucleus assay according to the standardized protocol reported previously (35). A total of 1,000 binucleated cells in each of two duplicative slides were examined microscopically for MN according to the scoring criteria outlined by the HUman MicroNucleus Project (17). The MN frequency of each subject was reported as the mean number of MN cells per 1,000 binucleated cells.

Bioinformatics

Because all the polymorphisms significantly associated with the genetic biomarkers were located in noncoding regions, their possible effects on transcription factor–binding sites were analyzed in silico, using the SNPInspector tool of the Genomatix software suite GEMS Launcher (Genomatix; ref. 36), which was designed to evaluate SNPs situated in noncoding regions.

Statistical analysis

One-way ANOVA, Kruskal–Wallis, and χ2 tests were used to access the differences among the control, low- and high-exposure groups with regard to continuous variables in normal distribution [age, years worked, and body mass index (BMI)], continuous variables in skewed distribution [packing year, plasma BPDE-Alb adducts, PAH metabolites and total concentrations of PAH metabolites (ΣOH-PAHs)], and categorical variables (gender, smoking, and drinking), respectively. A goodness-of-fit χ2 analysis was used to test the Hardy–Weinberg equilibrium. The LD was evaluated using Haploview 4.1. Association analysis was performed by general linear models or poisson log-linear models, in which genotypes with three levels can be modeled as nominal variables without assuming additive models. False discovery rate (FDR) adjustment was used for multiple comparisons and FDR < 0.05 was defined as the significance level.

The relationships of SNPs and internal exposure levels (ΣOH-PAHs and BPDE-Alb adducts) with genetic damage were tested by introducing the SNP × internal exposure into the models. All statistical analyses were performed using SPSS 12.0 software.

General characteristics of the subjects

General characteristics of the study population with levels of urinary PAH metabolites, plasma BPDE-Alb adducts, and genetic damage were summarized in Table 1. All subjects were classified into three groups according to their work places (for detailed information, see Materials and Methods). Higher proportions of males, drinkers and smokers, and pack-years were found in the exposure groups, compared with those in the control group (all P < 0.05, Table 1). In addition, exposure groups had higher levels of BPDE-Alb adducts and most PAH metabolites except 4-hydroxyphenanthrene. No differences were observed in the distribution of age, years worked, and BMI among these three groups. Multivariate linear regression analysis revealed that 8-OHdG and MN frequency were significantly associated with increased environmental PAH levels, adjusting for age, gender, BMI, smoking, and drinking (Ptrend < 0.001).

Table 1.

General characteristics, levels of urinary PAH metabolites, plasma BPDE-Alb adducts, and genetic damage of the workers in control and exposed groups

VariablesControl groupe (n = 497)Low-exposure groupe (n = 628)High-exposure groupe (n = 432)P
General characteristics 
 Age (y; mean ± SD) 42.48 ± 8.04 41.76 ± 8.95 42.22 ± 7.78 0.342a 
 Gender (% female) 22.7 13.2 6.5 <0.001b 
 Years worked (y; mean ± SD) 21.32 ± 9.19 20.26 ± 10.71 20.88 ± 9.22 0.197a 
 Smoking (smoking/quit/never), % 50/3.4/46.6 55.5/4.9/39.6 66.2/3.7/30.1 0.001b 
 Pack-years (mean ± SD) 10.83 ± 14.33 10.87 ± 15.01 13.40 ± 15.35 0.010a 
 Drinking (drinking/quit/never), % 29.4/3.2/67.4 33.0/1.8/65.2 38.3/1.4/60.3 0.020b 
 BMI (kg/m2, mean ± SD) 23.72 ± 3.14 23.81 ± 3.64 23.37 ± 4.39 0.160a 
PAH metabolites [μg/mmol creatinine, median (5%–95%)] 
 1-hydroxynaphthalene 1.25 (0.34–5.12) 1.61 (0.48–7.43) 2.27 (0.66–8.42) <0.001c 
 2-hydroxynaphthalene 1.26 (0.23–4.18) 1.49 (0.40–5.39) 2.29 (0.61–7.24) <0.001c 
 2-hydroxyfluorene 0.71 (0.27–2.06) 0.92 (0.33–3.92) 1.12 (0.31–4.28) <0.001c 
 9-hydroxyfluorene 0.48 (0.01–4.71) 0.58 (0.01–4.50) 0.63 (0.01–3.71) 0.045c 
 1-hydroxyphenanthrene 0.75 (0.07–4.23) 0.83 (0.10–3.34) 1.15 (0.24–4.67) <0.001c 
 2-hydroxyphenanthrene 0.25 (0.04–1.22) 0.31 (0.05–1.23) 0.38 (0.09–1.64) <0.001c 
 3-hydroxyphenanthrene 0.29 (0.03–1.15) 0.36 (0.04–1.56) 0.47 (0.04–2.11) <0.001c 
 4-hydroxyphenanthrene 0.37 (0.003–2.13) 0.32 (0.004–2.21) 0.30 (0.003–1.98) 0.332c 
 9-hydroxyphenanthrene 0.62 (0.14–3.71) 0.74 (0.17–4.20) 0.86 (0.21–3.88) <0.001c 
 1-hydroxypyrene 2.54 (0.65–13.14) 3.23 (0.91–13.11) 4.22 (1.22–17.86) <0.001c 
 ΣOH-PAHs 10.00 (3.62–36.08) 11.65 (4.74–40.03) 14.78 (5.58–50.35) <0.001c 
BPDE-Alb adducts [ng/mg albumin, median (5%–95%)] 3.93 (3.09–5.76) 4.30 (1.75–7.39) 4.73 (1.75–9.95) <0.001c 
Genetic damage biomarkers (mean ± SD)    Ptrend 
 8-OHdG (nmol/mmol creatinine) 4.47 ± 1.26 4.62 ± 1.13 4.83 ± 1.05 <0.001d 
 MN frequency (%) 3.69 ± 2.70 3.78 ± 3.02 4.02 ± 2.57 <0.001d 
VariablesControl groupe (n = 497)Low-exposure groupe (n = 628)High-exposure groupe (n = 432)P
General characteristics 
 Age (y; mean ± SD) 42.48 ± 8.04 41.76 ± 8.95 42.22 ± 7.78 0.342a 
 Gender (% female) 22.7 13.2 6.5 <0.001b 
 Years worked (y; mean ± SD) 21.32 ± 9.19 20.26 ± 10.71 20.88 ± 9.22 0.197a 
 Smoking (smoking/quit/never), % 50/3.4/46.6 55.5/4.9/39.6 66.2/3.7/30.1 0.001b 
 Pack-years (mean ± SD) 10.83 ± 14.33 10.87 ± 15.01 13.40 ± 15.35 0.010a 
 Drinking (drinking/quit/never), % 29.4/3.2/67.4 33.0/1.8/65.2 38.3/1.4/60.3 0.020b 
 BMI (kg/m2, mean ± SD) 23.72 ± 3.14 23.81 ± 3.64 23.37 ± 4.39 0.160a 
PAH metabolites [μg/mmol creatinine, median (5%–95%)] 
 1-hydroxynaphthalene 1.25 (0.34–5.12) 1.61 (0.48–7.43) 2.27 (0.66–8.42) <0.001c 
 2-hydroxynaphthalene 1.26 (0.23–4.18) 1.49 (0.40–5.39) 2.29 (0.61–7.24) <0.001c 
 2-hydroxyfluorene 0.71 (0.27–2.06) 0.92 (0.33–3.92) 1.12 (0.31–4.28) <0.001c 
 9-hydroxyfluorene 0.48 (0.01–4.71) 0.58 (0.01–4.50) 0.63 (0.01–3.71) 0.045c 
 1-hydroxyphenanthrene 0.75 (0.07–4.23) 0.83 (0.10–3.34) 1.15 (0.24–4.67) <0.001c 
 2-hydroxyphenanthrene 0.25 (0.04–1.22) 0.31 (0.05–1.23) 0.38 (0.09–1.64) <0.001c 
 3-hydroxyphenanthrene 0.29 (0.03–1.15) 0.36 (0.04–1.56) 0.47 (0.04–2.11) <0.001c 
 4-hydroxyphenanthrene 0.37 (0.003–2.13) 0.32 (0.004–2.21) 0.30 (0.003–1.98) 0.332c 
 9-hydroxyphenanthrene 0.62 (0.14–3.71) 0.74 (0.17–4.20) 0.86 (0.21–3.88) <0.001c 
 1-hydroxypyrene 2.54 (0.65–13.14) 3.23 (0.91–13.11) 4.22 (1.22–17.86) <0.001c 
 ΣOH-PAHs 10.00 (3.62–36.08) 11.65 (4.74–40.03) 14.78 (5.58–50.35) <0.001c 
BPDE-Alb adducts [ng/mg albumin, median (5%–95%)] 3.93 (3.09–5.76) 4.30 (1.75–7.39) 4.73 (1.75–9.95) <0.001c 
Genetic damage biomarkers (mean ± SD)    Ptrend 
 8-OHdG (nmol/mmol creatinine) 4.47 ± 1.26 4.62 ± 1.13 4.83 ± 1.05 <0.001d 
 MN frequency (%) 3.69 ± 2.70 3.78 ± 3.02 4.02 ± 2.57 <0.001d 

aOne-way ANOVA for differences among the different groups.

bχ2 tests for differences in the distribution frequencies among different groups.

cThe Kruskal–Wallis test for differences among different groups for nonnormal variables.

dMultivariate linear regression and the poisson log-linear model for the trend of 8-OHdG and MN frequency, respectively, with adjustment for age, gender, BMI, smoking, and drinking.

eControl group, workers worked in logistics departments and offices; low-exposure group, workers worked in adjunct workplaces; high-exposure group, workers worked at the top, side, and bottom of coke ovens.

In addition, we classified the population into two subgroups according to the median age (41 years) and further investigated the relationships of age, gender, and smoking status with the levels of PAH metabolites, BPDE-Alb adducts, and genetic damage. As Supplementary Table S1 showed, lower levels of 2-hydroxyfluorene, 3-hydroxyphenanthrene, and higher MN frequency were observed in the older age group (all P < 0.004); levels of 1-hydroxypyrene and genetic damage were significantly higher in females than in males (all P < 0.004). Smokers had significantly higher levels of 1- and 2-hydroxynaphthalene than ex- and nonsmokers (P < 0.001). Slight but not statistically significant lower levels of other PAH metabolites, BPDE-Alb adducts, and genetic damage were observed in smokers, compared with nonsmokers. The potential interpretations are as follows: (i) tobacco smoke contains much higher concentrations of naphthalene than that of other PAHs such as phenanthrene and benzo[a]pyrene (37), which significantly increases the levels of urinary 1- and 2-hydroxynaphthalene in smokers; (ii) cytochrome P450 enzymes might be downregulated by smoking (38). To some extent, the generation of BPDE-Alb adducts was, thus, reduced; (iii) the genetic damage of coke oven workers in this study might be determined by high occupational exposure to PAHs rather than tobacco smoke.

Relationships of lung cancer–related SNPs to genetic damage

Genotyping was successful for 27/28 SNPs. SNP rs9272346 and rs4677657 were not in the Hardy–Weinberg equilibrium. Thus, the remaining 25 SNPs with a genotyping rate of >95% were included in the following statistical analysis. Detailed information of these SNPs was summarized in Table 2.

Table 2.

Characteristics of the SNPs associated with lung cancer risk in Asians

SNPsRegionReported geneLocationA/aN (AA/Aa/aa)MAFCall ratePHWE
rs10937405 3q28 TP63 Intron C/T 768/622/127 0.289 0.974 0.925 
rs11080466 18p11.22 PIEZO2 Intron A/G 923/504/65 0.212 0.958 0.716 
rs1333040 9p21 CDKN2B-AS1 Intron T/C 738/631/148 0.306 0.974 0.407 
rs1663689 10p14 GATA3 Intergenic A/G 544/710/260 0.406 0.972 0.370 
rs2131877 3q29 XXYLT1 Intron T/C 458/754/306 0.450 0.975 0.984 
rs247008 5q31 IL3-CSF2-P4HA2 Intergenic C/T 397/750/370 0.491 0.974 0.804 
rs2736100 5p15.33 TERT Intron T/G 515/757/244 0.411 0.974 0.178 
rs2853677 5p15.33 TERT Intron T/C 569/741/207 0.381 0.974 0.105 
rs2895680 5q32 PPP2R2B-STK32A-DPYSL3 Intron T/C 743/621/153 0.306 0.974 0.285 
rs36600 22q12.2 MTMR3 Intron C/T 1246/256/16 0.095 0.975 0.492 
rs3813572 15q25.1 PSMA4 5′ near gene A/G 998/462/49 0.186 0.969 0.668 
rs3817963 6p21.3 BTNL2 Intron A/G 926/518/71 0.218 0.973 0.710 
rs401681 5p15.33 CLPTM1L Intron C/T 668/681/168 0.335 0.974 0.787 
rs4488809 3q28 TP63 Intron C/T 395/747/372 0.492 0.972 0.610 
rs465498 5p15.33 TERT-CLPTM1L Intron T/C 1044/424/51 0.173 0.976 0.313 
rs4677657 3q29 XXYLT1 Intron C/T 1101/206/210 0.206 0.974 <0.001 
rs4677662 3q29 XXYLT1 Intron T/C 757/633/126 0.292 0.974 0.886 
rs4809957 20q13 CYP24A1 3′UTR G/A 560/704/253 0.399 0.974 0.158 
rs6495309 15q25.1 CHRNA3 5′ near gene C/T 476/745/291 0.439 0.971 0.878 
rs667282 15q25.1 CHRNA5 Intron T/C 477/730/309 0.445 0.974 0.350 
rs7086803 10q25.2 VTI1A Intron G/A 762/641/113 0.286 0.974 0.178 
rs7216064 17q24.3 BPTF Intron A/G 633/682/200 0.358 0.974 0.368 
rs753955 13q12.12 TNFRSF19-MIPEP Intergenic T/C 755/623/140 0.297 0.975 0.404 
rs9272346 6p21.32 HLA-DQA1 5′ near gene G/A 893/112/333 0.709 0.859 <0.001 
rs9387478 6q22.2 ROS1-DCBLD1 Intergenic C/A 390/734/358 0.489 0.952 0.728 
rs9439519 1p36 AJAP1-NPHP4 Intergenic T/C 792/599/128 0.281 0.976 0.941 
rs16988393 22q12.2 MTMR3-HORMAD2-LIF Intergene T/C 1223/279/16 0.102 0.975 0.925 
rs952481 3q29 XXYLT1 Intron A/G a a a a 
SNPsRegionReported geneLocationA/aN (AA/Aa/aa)MAFCall ratePHWE
rs10937405 3q28 TP63 Intron C/T 768/622/127 0.289 0.974 0.925 
rs11080466 18p11.22 PIEZO2 Intron A/G 923/504/65 0.212 0.958 0.716 
rs1333040 9p21 CDKN2B-AS1 Intron T/C 738/631/148 0.306 0.974 0.407 
rs1663689 10p14 GATA3 Intergenic A/G 544/710/260 0.406 0.972 0.370 
rs2131877 3q29 XXYLT1 Intron T/C 458/754/306 0.450 0.975 0.984 
rs247008 5q31 IL3-CSF2-P4HA2 Intergenic C/T 397/750/370 0.491 0.974 0.804 
rs2736100 5p15.33 TERT Intron T/G 515/757/244 0.411 0.974 0.178 
rs2853677 5p15.33 TERT Intron T/C 569/741/207 0.381 0.974 0.105 
rs2895680 5q32 PPP2R2B-STK32A-DPYSL3 Intron T/C 743/621/153 0.306 0.974 0.285 
rs36600 22q12.2 MTMR3 Intron C/T 1246/256/16 0.095 0.975 0.492 
rs3813572 15q25.1 PSMA4 5′ near gene A/G 998/462/49 0.186 0.969 0.668 
rs3817963 6p21.3 BTNL2 Intron A/G 926/518/71 0.218 0.973 0.710 
rs401681 5p15.33 CLPTM1L Intron C/T 668/681/168 0.335 0.974 0.787 
rs4488809 3q28 TP63 Intron C/T 395/747/372 0.492 0.972 0.610 
rs465498 5p15.33 TERT-CLPTM1L Intron T/C 1044/424/51 0.173 0.976 0.313 
rs4677657 3q29 XXYLT1 Intron C/T 1101/206/210 0.206 0.974 <0.001 
rs4677662 3q29 XXYLT1 Intron T/C 757/633/126 0.292 0.974 0.886 
rs4809957 20q13 CYP24A1 3′UTR G/A 560/704/253 0.399 0.974 0.158 
rs6495309 15q25.1 CHRNA3 5′ near gene C/T 476/745/291 0.439 0.971 0.878 
rs667282 15q25.1 CHRNA5 Intron T/C 477/730/309 0.445 0.974 0.350 
rs7086803 10q25.2 VTI1A Intron G/A 762/641/113 0.286 0.974 0.178 
rs7216064 17q24.3 BPTF Intron A/G 633/682/200 0.358 0.974 0.368 
rs753955 13q12.12 TNFRSF19-MIPEP Intergenic T/C 755/623/140 0.297 0.975 0.404 
rs9272346 6p21.32 HLA-DQA1 5′ near gene G/A 893/112/333 0.709 0.859 <0.001 
rs9387478 6q22.2 ROS1-DCBLD1 Intergenic C/A 390/734/358 0.489 0.952 0.728 
rs9439519 1p36 AJAP1-NPHP4 Intergenic T/C 792/599/128 0.281 0.976 0.941 
rs16988393 22q12.2 MTMR3-HORMAD2-LIF Intergene T/C 1223/279/16 0.102 0.975 0.925 
rs952481 3q29 XXYLT1 Intron A/G a a a a 

Abbreviation: A/a, major/minor allele; UTR, untranslated region.

aData were missing due to genotyping failure.

The associations between lung cancer risk–related SNPs and genetic damage were shown in Tables 3 and 4. 13q12.12-rs753955 was suggestively associated with elevated 8-OHdG levels after FDR adjustment [P = 0.003 for the genotype term in the adjusted model; β, 0.297; 95% confidence interval (CI), 0.124–0.471; P = 0.001 for the comparison of CC vs. TT; Table 3]. Lower LSM ± SD of 8-OHdG levels were observed in subjects with TC or TT genotypes, compared with those carrying the rare CC genotype (TC, 4.87 ± 0.09; TT, 4.81 ± 0.09; CC, 5.11 ± 0.12; Supplementary Table S2). Bioinformatic analyses indicated that rs753955 T>C substitution led to the loss of one transcription factor–binding site (Avian C-type LTR TATA box).

Table 3.

Multivariate analyses for lung cancer risk–related SNPs on urinary 8-OHdG levels of coke oven workers

Aa vs. AAaa vs. AAAa+aa vs. AA
SNPs [major (A)/minor allele (a)]β (95% CI)aPaβ (95% CI)aPaβ (95% CI)aPaPbFDR
rs10937405 (C/T) −0.102 (−0.206 to 0.002) 0.053 −0.105 (−0.288 to 0.079) 0.264 −0.068 (−0.179 to 0.043) 0.231 0.125 0.511 
rs11080466 (A/G) −0.046 (−0.153 to 0.061) 0.397 −0.234 (−0.471 to 0.003) 0.053 −0.048 (−0.163 to 0.067) 0.415 0.132 0.511 
rs1333040 (T/C) 0.039 (−0.065 to 0.143) 0.461 0.152 (−0.019 to 0.322) 0.081 0.023 (−0.088 to 0.134) 0.685 0.211 0.511 
rs2131877 (T/C) −0.032 (−0.147 to 0.082) 0.581 −0.124 (−0.267 to 0.020) 0.091 −0.045 (−0.167 to 0.077) 0.466 0.225 0.511 
rs247008 (C/T) −0.135 (−0.254 to −0.016) 0.026 −0.112 (−0.251 to 0.028) 0.117 −0.146 (−0.274 to −0.019) 0.025 0.078 0.511 
rs2736100 (T/G) −0.134 (−0.244 to −0.025) 0.016 −0.080 (−0.231 to 0.072) 0.303 −0.100 (−0.217 to 0.018) 0.096 0.056 0.511 
rs2853677 (T/C) −0.091 (−0.198 to 0.016) 0.097 0.003 (−0.154 to 0.160) 0.967 −0.057 (−0.172 to 0.058) 0.331 0.191 0.511 
rs36600 (C/T) −0.126 (−0.256 to 0.003) 0.056 0.088 (−0.401 to 0.577) 0.725 −0.117 (−0.261 to 0.027) 0.110 0.146 0.511 
rs3813572 (A/G) −0.075 (−0.184 to 0.034) 0.180 −0.047 (−0.335 to 0.240) 0.746 −0.141 (−0.259 to −0.022) 0.020 0.401 0.689 
rs465498 (T/C) 0.001 (−0.111 to 0.113) 0.981 −0.093 (−0.374 to 0.188) 0.515 −0.081 (−0.201 to 0.040) 0.188 0.805 0.875 
rs4677662 (T/C) −0.005 (−0.109 to 0.099) 0.923 −0.132 (−0.314 to 0.051) 0.158 −0.039 (−0.151 to 0.072) 0.487 0.355 0.689 
rs6495309 (C/T) 0.060 (−0.053 to 0.173) 0.300 0.126 (−0.018 to 0.270) 0.085 0.085 (−0.035 to 0.206) 0.164 0.221 0.511 
rs667282 (T/C) 0.069 (−0.044 to 0.182) 0.231 0.131 (−0.009 to 0.272) 0.066 0.109 (−0.010 to 0.229) 0.073 0.174 0.511 
rs753955 (T/C) 0.029 (−0.076 to 0.133) 0.591 0.297 (0.124–0.471) 0.001 0.074 (−0.037 to 0.186) 0.192 0.003 0.075 
Aa vs. AAaa vs. AAAa+aa vs. AA
SNPs [major (A)/minor allele (a)]β (95% CI)aPaβ (95% CI)aPaβ (95% CI)aPaPbFDR
rs10937405 (C/T) −0.102 (−0.206 to 0.002) 0.053 −0.105 (−0.288 to 0.079) 0.264 −0.068 (−0.179 to 0.043) 0.231 0.125 0.511 
rs11080466 (A/G) −0.046 (−0.153 to 0.061) 0.397 −0.234 (−0.471 to 0.003) 0.053 −0.048 (−0.163 to 0.067) 0.415 0.132 0.511 
rs1333040 (T/C) 0.039 (−0.065 to 0.143) 0.461 0.152 (−0.019 to 0.322) 0.081 0.023 (−0.088 to 0.134) 0.685 0.211 0.511 
rs2131877 (T/C) −0.032 (−0.147 to 0.082) 0.581 −0.124 (−0.267 to 0.020) 0.091 −0.045 (−0.167 to 0.077) 0.466 0.225 0.511 
rs247008 (C/T) −0.135 (−0.254 to −0.016) 0.026 −0.112 (−0.251 to 0.028) 0.117 −0.146 (−0.274 to −0.019) 0.025 0.078 0.511 
rs2736100 (T/G) −0.134 (−0.244 to −0.025) 0.016 −0.080 (−0.231 to 0.072) 0.303 −0.100 (−0.217 to 0.018) 0.096 0.056 0.511 
rs2853677 (T/C) −0.091 (−0.198 to 0.016) 0.097 0.003 (−0.154 to 0.160) 0.967 −0.057 (−0.172 to 0.058) 0.331 0.191 0.511 
rs36600 (C/T) −0.126 (−0.256 to 0.003) 0.056 0.088 (−0.401 to 0.577) 0.725 −0.117 (−0.261 to 0.027) 0.110 0.146 0.511 
rs3813572 (A/G) −0.075 (−0.184 to 0.034) 0.180 −0.047 (−0.335 to 0.240) 0.746 −0.141 (−0.259 to −0.022) 0.020 0.401 0.689 
rs465498 (T/C) 0.001 (−0.111 to 0.113) 0.981 −0.093 (−0.374 to 0.188) 0.515 −0.081 (−0.201 to 0.040) 0.188 0.805 0.875 
rs4677662 (T/C) −0.005 (−0.109 to 0.099) 0.923 −0.132 (−0.314 to 0.051) 0.158 −0.039 (−0.151 to 0.072) 0.487 0.355 0.689 
rs6495309 (C/T) 0.060 (−0.053 to 0.173) 0.300 0.126 (−0.018 to 0.270) 0.085 0.085 (−0.035 to 0.206) 0.164 0.221 0.511 
rs667282 (T/C) 0.069 (−0.044 to 0.182) 0.231 0.131 (−0.009 to 0.272) 0.066 0.109 (−0.010 to 0.229) 0.073 0.174 0.511 
rs753955 (T/C) 0.029 (−0.076 to 0.133) 0.591 0.297 (0.124–0.471) 0.001 0.074 (−0.037 to 0.186) 0.192 0.003 0.075 

aβ (95% CI) and P values were derived from the comparisons with the reference genotype (AA) using a general linear model for ln-transformed 8-OHdG, adjusting for covariants (gender, years worked, smoking status, alcohol use, BMI, ΣOH–PAHs, and BPDE-albumin adducts).

bA P value is for the genotype term in the model, adjusting for all covariants mentioned above.

Table 4.

Multivariate analyses for lung cancer risk–related SNPs on MN frequency of coke oven workers

Aa vs. AAaa vs. AAAa+aa vs. AA
SNP [major (A)/minor allele (a)]β (95% CI)aPaβ (95% CI)aPaβ (95% CI)aPaPbFDR
rs10937405 (C/T) −0.009 (−0.063 to 0.046) 0.756 −0.102 (−0.205 to 0.001) 0.050 −0.022 (−0.075 to 0.030) 0.403 0.144 0.360 
rs11080466 (A/G) 0.037 (−0.019 to 0.094) 0.190 −0.020 (−0.151 to 0.111) 0.764 0.029 (−0.025 to 0.083) 0.293 0.373 0.547 
rs1333040 (T/C) −0.090 (−0.146 to −0.035) 0.001 0.014 (−0.076 to 0.103) 0.765 −0.066 (−0.118 to −0.014) 0.013 0.002 0.025 
rs1663689 (A/G) −0.036 (−0.094 to 0.021) 0.213 −0.145 (−0.224 to −0.065) <0.001 −0.067 (−0.121 to −0.013) 0.016 0.001 0.025 
rs2131877 (T/C) 0.044 (−0.017 to 0.105) 0.159 0.080 (0.005–0.156) 0.036 0.060 (0.003–0.118) 0.040 0.103 0.291 
rs247008 (C/T) 0.063 (−0.001 to 0.127) 0.052 0.016 (−0.059 to 0.091) 0.674 0.045 (−0.015 to 0.106) 0.145 0.105 0.291 
rs2895680 (T/C) −0.048 (−0.104 to 0.007) 0.089 0.007 (−0.083 to 0.097) 0.884 −0.033 (−0.085 to 0.020) 0.221 0.191 0.398 
rs36600 (C/T) 0.040 (−0.029 to 0.109) 0.256 0.102 (−0.145 to 0.350) 0.416 0.045 (−0.022 to 0.112) 0.191 0.394 0.547 
rs3813572 (A/G) −0.083 (−0.142 to −0.025) 0.005 −0.146 (−0.299 to 0.007) 0.062 −0.094 (−0.151 to −0.038) 0.001 0.005 0.042 
rs4488809 (C/T) 0.024 (−0.039 to 0.087) 0.456 −0.064 (−0.139 to 0.010) 0.091 −0.002 (−0.062 to 0.057) 0.937 0.029 0.181 
rs4677662 (T/C) 0.041 (−0.014 to 0.096) 0.140 0.074 (−0.022 to 0.170) 0.131 0.047 (−0.005 to 0.099) 0.077 0.170 0.386 
rs4809957 (G/A) −0.064 (−0.122 to −0.007) 0.029 −0.053 (−0.129 to 0.024) 0.177 −0.070 (−0.124 to −0.016) 0.011 0.081 0.289 
rs753955 (T/C) 0.020 (−0.035 to 0.075) 0.475 −0.106 (−0.203 to −0.009) 0.032 −0.002 (−0.055 to 0.050) 0.932 0.042 0.210 
rs16988393 (T/C) −0.007 (−0.075 to 0.061) 0.838 −0.387 (−0.710 to −0.063) 0.019 −0.019 (−0.086 to 0.048) 0.580 0.064 0.267 
Aa vs. AAaa vs. AAAa+aa vs. AA
SNP [major (A)/minor allele (a)]β (95% CI)aPaβ (95% CI)aPaβ (95% CI)aPaPbFDR
rs10937405 (C/T) −0.009 (−0.063 to 0.046) 0.756 −0.102 (−0.205 to 0.001) 0.050 −0.022 (−0.075 to 0.030) 0.403 0.144 0.360 
rs11080466 (A/G) 0.037 (−0.019 to 0.094) 0.190 −0.020 (−0.151 to 0.111) 0.764 0.029 (−0.025 to 0.083) 0.293 0.373 0.547 
rs1333040 (T/C) −0.090 (−0.146 to −0.035) 0.001 0.014 (−0.076 to 0.103) 0.765 −0.066 (−0.118 to −0.014) 0.013 0.002 0.025 
rs1663689 (A/G) −0.036 (−0.094 to 0.021) 0.213 −0.145 (−0.224 to −0.065) <0.001 −0.067 (−0.121 to −0.013) 0.016 0.001 0.025 
rs2131877 (T/C) 0.044 (−0.017 to 0.105) 0.159 0.080 (0.005–0.156) 0.036 0.060 (0.003–0.118) 0.040 0.103 0.291 
rs247008 (C/T) 0.063 (−0.001 to 0.127) 0.052 0.016 (−0.059 to 0.091) 0.674 0.045 (−0.015 to 0.106) 0.145 0.105 0.291 
rs2895680 (T/C) −0.048 (−0.104 to 0.007) 0.089 0.007 (−0.083 to 0.097) 0.884 −0.033 (−0.085 to 0.020) 0.221 0.191 0.398 
rs36600 (C/T) 0.040 (−0.029 to 0.109) 0.256 0.102 (−0.145 to 0.350) 0.416 0.045 (−0.022 to 0.112) 0.191 0.394 0.547 
rs3813572 (A/G) −0.083 (−0.142 to −0.025) 0.005 −0.146 (−0.299 to 0.007) 0.062 −0.094 (−0.151 to −0.038) 0.001 0.005 0.042 
rs4488809 (C/T) 0.024 (−0.039 to 0.087) 0.456 −0.064 (−0.139 to 0.010) 0.091 −0.002 (−0.062 to 0.057) 0.937 0.029 0.181 
rs4677662 (T/C) 0.041 (−0.014 to 0.096) 0.140 0.074 (−0.022 to 0.170) 0.131 0.047 (−0.005 to 0.099) 0.077 0.170 0.386 
rs4809957 (G/A) −0.064 (−0.122 to −0.007) 0.029 −0.053 (−0.129 to 0.024) 0.177 −0.070 (−0.124 to −0.016) 0.011 0.081 0.289 
rs753955 (T/C) 0.020 (−0.035 to 0.075) 0.475 −0.106 (−0.203 to −0.009) 0.032 −0.002 (−0.055 to 0.050) 0.932 0.042 0.210 
rs16988393 (T/C) −0.007 (−0.075 to 0.061) 0.838 −0.387 (−0.710 to −0.063) 0.019 −0.019 (−0.086 to 0.048) 0.580 0.064 0.267 

aβ (95% CI) and P values were derived from the comparisons with the reference genotype (AA) using a poisson log-linear model for MN frequency, adjusting for covariants (gender, years worked, smoking status, alcohol use, BMI, ΣOH–PAHs, and BPDE-albumin adducts).

bA P value is for the genotype term in the model, adjusting for all covariants mentioned above.

Three SNPs were significantly associated with decreased MN frequency (Table 4). Subjects carrying 9p21-rs1333040 TC genotype had lower MN frequency than those with wild TT genotype (β, −0.09; 95% CI, −0.146 to −0.035; P = 0.001 for TC vs. TT; Table 4]. When comparing TC+CC with TT; β, −0.066; 95% CI, −0.118 to −0.014; and P = 0.013. In silico analysis indicated that rs1333040 T>C substitution resulted in a gain of two transcription factor–binding sites (Drosophila motif 10 element and odd-skipped related 1) and a loss of another three sites (Doublesex and mab-3–related transcription factor 1, TEA domain-containing factors, and X-box–binding protein RFX1). 10p14-rs1663689 was significantly associated with decreased MN frequency (P = 0.001 for the genotype term in the adjusted model; β, −0.145; 95% CI, −0.224 to −0.065; P < 0.001 for GG vs. AA; Table 4). Persons with a rare GG genotype showed lower MN frequency than those with wild homozygous genotype AA (LSM ± SE: GG, 3.71 ± 0.27; AA, 4.23 ± 0.23; Supplementary Table S2). SNP-rs1663689 A>G substitution results in the loss of three transcription factor–binding sites: jumonji AT rich interactive domain 2, homeodomain transcription factor HOXC13, and serum response factor. People carrying at least one 15q25.1-rs3813572 G-allele showed lower observed MN frequency than those with AA genotype (P = 0.005 for the genotype term in the adjusted model; β, −0.094; 95% CI, −0.151 to −0.038; P = 0.001 for GG+AG vs. AA; Table 4). Bioinformatic analyses indicated that the A>G substitution resulted in the loss of five transcription factor–binding sites, including retinoic acid receptor, peroxisome proliferator–activated receptor gamma, SPI-1 proto-oncogene, myeloid zinc finger protein, and PTF1-binding sites and the gain of another three sites (two in C2H2 zinc finger transcription factors 7 and one in Myc-associated zinc finger protein). No significant associations of other SNPs with 8-OHdG levels or MN frequency were observed.

Relationships of SNPs and PAHs to genetic damage

To investigate the relationships of the lung cancer risk–associated SNPs and PAH exposure to genetic damage, we first analyzed the relationships between internal PAH exposure and levels of genetic damage. The correlation coefficient of ΣOH–PAHs for 8-OHdG was larger than that for MN frequency (r = 0.494 vs. r = 0.097; Supplementary Table S3). In addition, the MN frequency was more related with BPDE-Alb adducts than with ΣOH–PAHs (r = 0.297 vs. r = 0.154; Supplementary Table S3). Thus, we introduced SNP × ΣOH–PAHs and SNP × BPDE-Alb adducts into the statistical models to investigate the gene–environment relationship to 8-OHdG and MN frequency, respectively.

A significant relationship was observed between 10p14-rs1663689 and ΣOH–PAHs to 8-OHdG levels (Pinteraction = 0.002; Fig. 1 A; Supplementary Table S4). The ΣOH–PAHs had much less correlation with urinary 8-OHdG in individuals carrying at least one mutant allele G of rs1663689 than those with wild homozygous genotype AA (GG: β, 0.023; 95% CI, 0.015–0.030; AG: β, 0.024; 95% CI, 0.020–0.029; AA: β, 0.037; 95% CI, 0.031–0.044; Fig. 1A; Supplementary Table S4). The relationships between three SNPs (rs2736100 at TERT, rs7086803 at VTI1A, and rs7216064 at BPTF) and BPDE-Alb adducts to the MN frequency were observed (all Pinteraction < 0.001; Fig. 1B; Supplementary Table S4). Larger increased correlations of BPDE-Alb adducts to MN frequency were observed in individuals with rs2736100 GG (β, 0.045; 95% CI, 0.026–0.063; P < 0.001), rs7086803 GA+AA (β, 0.047; 95% CI, 0.034–0.060; P < 0.001), and rs7216064 AA (β, 0.051; 95% CI, 0.035–0.068; P < 0.001) than those with rs2736100 TT/TG (TT: β, 0.013; 95% CI, 0.004–0.021; P = 0.004; TG: β, 0.008; 95% CI, 0.003–0.013; P = 0.003), rs7086803 GG (β, 0.007; 95% CI, 0.002–0.011; P = 0.005), and rs7216064 AG+GG genotype (β, 0.006; 95% CI, 0.002–0.011; P = 0.006), respectively.

Figure 1.

Correlations of PAHs exposure and lung cancer risk–associated SNPs to genetic damage of coke oven workers. A, correlations of ΣOH–PAHs and SNPs to the levels of 8-OHdG, the levels of 8-OHdG were ln-transformed; B, correlations of BPDE-Alb adducts and SNPs with MN frequency.

Figure 1.

Correlations of PAHs exposure and lung cancer risk–associated SNPs to genetic damage of coke oven workers. A, correlations of ΣOH–PAHs and SNPs to the levels of 8-OHdG, the levels of 8-OHdG were ln-transformed; B, correlations of BPDE-Alb adducts and SNPs with MN frequency.

Close modal

Because there were correlations between age and years worked, and pack-years and smoking (r > 0.5; Supplementary Table S3), we did not include age and pack-years in the models mentioned above to avoid overadjustment. However, we conducted a sensitivity analysis and compared model estimates, including pack-years and age as covariations instead of years worked and pack-years. Results were generally consistent with those adjusted for years worked and smoking (data not shown).

In this study, we provided evidence that lung cancer risk–related SNPs influenced genetic damage in coke oven workers who were exposed to PAHs. 13q12.12-rs753955 was suggestively associated with elevated urinary 8-OHdG levels and three SNPs (9p21-rs1333040, 10p14-rs1663689, and 15q25.1-rs3813572) were associated with decreased MN frequency. Significant relationships between 10p14-rs1663689 and ΣOH-PAHs to 8-OHdG, and three SNPs (rs2736100 at TERT, rs7086803 at VTI1A, and rs7216064 at BPTF) and plasma BPDE-Alb adducts to MN frequency were observed, suggesting that lung cancer risk–associated SNPs modulate the associations between PAH exposure and genetic damage in coke oven workers. Bioinformatic analyses indicated that several lung cancer risk–related SNPs associated with genetic damage altered putative transcription factor–binding sites, suggesting that they might influence gene expression.

13q12.12-rs753955 is located between MIPEP and TNFRSF19. Risk-allele homozygotes (CC) at rs753955 were associated with increased 8-OHdG levels, consistent with its increased effect on lung cancer risks (26). The product of MIPEP is primarily involved in the maturation of oxidative phosphorylation (OXPHOS)–related proteins. The protein encoded by TNFRSF19 is a member of the TNF receptor superfamily, which was involved in activating the JNK (c-jun–NH2–kinase) signaling pathway (39). Although OXPHOS was involved in the formation of reactive oxygen species (40) and the JNK signaling pathway was a key modulator in the process of DNA oxidative damage (41), little is known about the potential effect of rs753955 on MIPEP and TNFRSF19 expression. Further studies are needed to explore the functional impact of this polymorphism.

9p21-rs1333040 located about 74 kb upstream of CDKN2B and within intron 12 of CDKN1B antisense RNA 1 or ANRIL. A 1.41-fold change in ANRIL mRNA expression was observed in minor allele homozygous (CC) for rs1333040 to major allele homozygous in peripheral blood (42). ANRIL was transcriptionally upregulated following DNA damage and elevated ANRIL suppressed the expression of cyclin-dependent kinase inhibitors (INK4a, INK4b, and ARF) at the late stage of DNA-damage response, allowing the cells to return to normal at the completion of the DNA repair (43). In this study, we observed decreased MN frequency in individuals with CC+TC and TC genotype, compared with those with wild TT genotype. In silico analysis showed that SNP-rs1333040 T>C substitution resulted in a gain of two transcription factor–binding sites, indicating that rs1333040 polymorphism might upregulate the expression of ANRIL, accelerating the process of the DNA repair and decreasing MN frequency. More functional studies are needed to investigate the possible role of rs1333040 in the process of genetic damage. The lack of statistical significance in MN frequency between persons with CC genotype and those with TT genotype in the present study might be partly due to a low number of individuals with CC genotype.

SNP rs1663689 is located 908 kb downstream of the transcription factor GATA3. Minor homozygotes (GG) of rs1663689 have lower MN frequency compared with major allele homozygotes (AA), consistent with its effect on lung cancer risks (3). Previous studies reported that GATA3 was the substrate of human checkpoint kinases 1 and 2, which were the kinases in the cellular DNA-damage response (44), suggesting that GATA3 might be a fundamental factor contributing to tumorigenesis by regulating DNA-damage response. Bioinformatic analyses indicated that SNP rs1663689 resulted in a loss of three transcription factor–binding sites. However, whether rs1663689 has effect on GATA3 expression requires further investigation.

SNP rs3813572, located in the 5′ end of PSMA4, is a moderate LD with rs2036534 on 15q25 (D′ = 1, logarithm of odds = 4.5), which was associated with lung cancer risk (31). In this study, SNP rs3813572 exhibited an association with decreased MN frequency. PSMA4 plays a role in promoting cancer cell proliferation and downregulation of PSMA4 expression induced apoptosis (45). Bioinformatic analyses showed that rs3813572 A>G substitution resulted in the loss of transcription factor–binding sites but its impact on PSMA4 expression remains to be elucidated.

We carried out a comprehensive evaluation of potential gene–environment correlations between 25 lung cancer risk–related SNPs and two genetic damage biomarkers. A total of four SNPs could modulate the associations between PAH exposure and genetic damage (one for 8-OHdG levels and three for MN frequency). PAH exposure has a higher correlation with genetic damage in individuals with lung cancer risk alleles (10p14-rs1663689 AA, 5p15.33-rs2736100 GG, 10q25.2-rs7086803 GA+AA, and 17q24.3-rs7216064 AA) than in those with protective alleles (rs1663689 AG/GG, rs2736100 TT/TG, rs7086803GG, and rs7216064 AG+GG), respectively. It was assumed that the genetic variants altered the duration of exposure to PAHs. Thus, the correlations of PAH exposure to genetic damage in this study might partly depend on the genotypes of these SNPs. Our hypothesis is that, given the same exposure, persons with risk alleles have higher damage levels and are more susceptible to the progress of lung cancer, compared with those carrying protective alleles.

In summary, our results suggested that lung cancer risk–related SNPs may influence one's susceptibility to PAH-induced genetic damage. These findings provided some potential mechanisms for genetic heterogeneity of PAH carcinogenesis and lung cancer development. People carrying risk alleles may suffer from more serious genetic damage and more susceptible to the initiation and evolvement of lung cancer even under similar environment. Thus, this research implied that major gains in the prevention of cancer will necessitate health and regulatory policies that protect more susceptible individuals from naturally occurring and man-made environmental carcinogens. Although the data about PAH exposure and oxidative DNA damage (8-OHdG) were reported in our previous study (33), the genotype data for SNPs associated with lung cancer risks were original and were first reported in this study, indicating that our job was not just a reanalysis of the existing data. To the best of our knowledge, this is the first study to investigate the relationships of lung cancer risk–related SNPs identified from GWAS to the genetic damage induced by PAHs. Future prospective studies with a larger sample size in populations of different ethnic backgrounds are warranted to validate our findings.

No potential conflicts of interest were disclosed.

Conception and design: X. Dai, S. Deng, W. Zhang, H. Guo, T. Wu

Development of methodology: X. Dai, S. Deng, Q. Deng, W. Zhang

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): X. Dai, S. Deng, G. Qiu, J. Ye, W. Zhang, H. Guo, T. Wu

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): X. Dai, S. Deng, J. Li, B. Yang, W. Feng, W. Zhang, H. Guo, T. Wu

Writing, review, and/or revision of the manuscript: X. Dai, S. Deng, W. Zhang, M. He, T. Wu

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): X. Dai, S. Deng, T. Wang, G. Qiu, W. Feng, Q. Deng, J. Ye, W. Zhang

Study supervision: X. Dai, S. Deng, X. He, W. Zhang, M. He, X. Zhang, H. Guo, T. Wu

The authors thank all the study subjects for participating in this study as well as all volunteers for assisting in collecting the samples and data.

This study is supported by the fund from the National Key Basic Research and Development Program (2011CB503800; to T. Wu).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Kamangar
F
,
Dores
GM
,
Anderson
WF
. 
Patterns of cancer incidence, mortality, and prevalence across five continents: defining priorities to reduce cancer disparities in different geographic regions of the world
.
J Clin Oncol
2006
;
24
:
2137
50
.
2.
She
J
,
Yang
P
,
Hong
Q
,
Bai
C
. 
Lung cancer in China: challenges and interventions
.
Chest
2013
;
143
:
1117
26
.
3.
Dong
J
,
Hu
Z
,
Wu
C
,
Guo
H
,
Zhou
B
,
Lv
J
, et al
Association analyses identify multiple new lung cancer susceptibility loci and their interactions with smoking in the Chinese population
.
Nat Genet
2012
;
44
:
895
9
.
4.
Pope
CA
,
Burnett
RT
,
Thun
MJ
,
Calle
EE
,
Krewski
D
,
Ito
K
, et al
Lung cancer, cardiopulmonary mortality, and long-term exposure to fine particulate air pollution
.
JAMA
2002
;
287
:
1132
41
.
5.
Mastrangelo
G
,
Fadda
E
,
Marzia
V
. 
Polycyclic aromatic hydrocarbons and cancer in man
.
Environ Health Perspect
1996
;
104
:
1166
70
.
6.
Armstrong
BG
,
Gibbs
G
. 
Exposure-response relationship between lung cancer and polycyclic aromatic hydrocarbons (PAHs)
.
Occup Environ Med
2009
;
66
:
740
6
.
7.
Wang
J
,
Chen
S
,
Tian
M
,
Zheng
X
,
Gonzales
L
,
Ohura
T
, et al
Inhalation cancer risk associated with exposure to complex polycyclic aromatic hydrocarbon mixtures in an electronic waste and urban area in South China
.
Environ Sci Technol
2012
;
46
:
9745
52
.
8.
Zhang
Y
,
Tao
S
,
Shen
H
,
Ma
J
. 
Inhalation exposure to ambient polycyclic aromatic hydrocarbons and lung cancer risk of Chinese population
.
Proc Natl Acad Sci U S A
2009
;
106
:
21063
7
.
9.
Minna
JD
,
Roth
JA
,
Gazdar
AF
. 
Focus on lung cancer
.
Cancer cell
2002
;
1
:
49
52
.
10.
Perera
FP
,
Hemminki
K
,
Gryzbowska
E
,
Motykiewicz
G
,
Michalska
J
,
Santella
RM
, et al
Molecular and genetic damage in humans from environmental pollution in Poland
.
Nature
1992
;
360
:
256
8
.
11.
Klaunig
JE
,
Wang
Z
,
Pu
X
,
Zhou
S
. 
Oxidative stress and oxidative damage in chemical carcinogenesis
.
Toxicol Appl Pharmacol
2011
;
254
:
86
99
.
12.
Fenech
M
. 
Chromosomal biomarkers of genomic instability relevant to cancer
.
Drug Discov Today
2002
;
7
:
1128
37
.
13.
Xiao
C
,
Chen
S
,
Li
J
,
Hai
T
,
Lu
Q
,
Sun
E
, et al
Association of HSP70 and genotoxic damage in lymphocytes of workers exposed to coke oven emission
.
Cell Stress Chaperon
2002
;
7
:
396
402
.
14.
Guo
H
,
Bai
Y
,
Xu
P
,
Hu
Z
,
Liu
L
,
Wang
F
, et al
Functional promoter -1271G>C variant of HSPB1 predicts lung cancer risk and survival
.
J Clin Oncol
2010
;
28
:
1928
35
.
15.
Orlow
I
,
Park
BJ
,
Mujumdar
U
,
Patel
H
,
Siu-Lau
P
,
Clas
BA
, et al
DNA damage and repair capacity in patients with lung cancer: prediction of multiple primary tumors
.
J Clin Oncol
2008
;
26
:
3560
6
.
16.
Chiou
CC
,
Chang
PY
,
Chan
EC
,
Wu
TL
,
Tsao
KC
,
Wu
JT
. 
Urinary 8-hydroxydeoxyguanosine and its analogs as DNA marker of oxidative stress: development of an ELISA and measurement in both bladder and prostate cancers
.
Clin Chim Acta
2003
;
334
:
87
94
.
17.
Fenech
M
,
Holland
N
,
Chang
WP
,
Zeiger
E
,
Bonassi
S
. 
The HUman MicroNucleus Project—an international collaborative study on the use of the micronucleus technique for measuring DNA damage in humans
.
Mutat Res
1999
;
428
:
271
83
.
18.
Yano
T
,
Shoji
F
,
Baba
H
,
Koga
T
,
Shiraishi
T
,
Orita
H
, et al
Significance of the urinary 8-OHdG level as an oxidative stress marker in lung cancer patients
.
Lung Cancer
2009
;
63
:
111
4
.
19.
El-Zein
RA
,
Schabath
MB
,
Etzel
CJ
,
Lopez
MS
,
Franklin
JD
,
Spitz
MR
. 
Cytokinesis-blocked micronucleus assay as a novel biomarker for lung cancer risk
.
Cancer Res
2006
;
66
:
6449
56
.
20.
Park
SY
,
Lee
KH
,
Kang
D
,
Lee
KH
,
Ha
EH
,
Hong
YC
. 
Effect of genetic polymorphisms of MnSOD and MPO on the relationship between PAH exposure and oxidative DNA damage
.
Mutat Res
2006
;
593
:
108
15
.
21.
Mielzynska-Svach
D
,
Blaszczyk
E
,
Butkiewicz
D
,
Durzynska
J
,
Rydzanicz
M
. 
Influence of genetic polymorphisms on biomarkers of exposure and effects in children living in upper Silesia
.
Mutagenesis
2013
;
28
:
591
9
.
22.
Miki
D
,
Kubo
M
,
Takahashi
A
,
Yoon
KA
,
Kim
J
,
Lee
GK
, et al
Variation in TP63 is associated with lung adenocarcinoma susceptibility in Japanese and Korean populations
.
Nat Genet
2010
;
42
:
893
6
.
23.
Timofeeva
MN
,
Hung
RJ
,
Rafnar
T
,
Christiani
DC
,
Field
JK
,
Bickeboller
H
, et al
Influence of common genetic variation on lung cancer risk: meta-analysis of 14 900 cases and 29 485 controls
.
Hum Mol Genet
2012
;
21
:
4980
95
.
24.
Yoon
KA
,
Park
JH
,
Han
J
,
Park
S
,
Lee
GK
,
Han
JY
, et al
A genome-wide association study reveals susceptibility variants for non-small cell lung cancer in the Korean population
.
Hum Mol Genet
2010
;
19
:
4948
54
.
25.
Shiraishi
K
,
Kunitoh
H
,
Daigo
Y
,
Takahashi
A
,
Goto
K
,
Sakamoto
H
, et al
A genome-wide association study identifies two new susceptibility loci for lung adenocarcinoma in the Japanese population
.
Nat Genet
2012
;
44
:
900
3
.
26.
Hu
Z
,
Wu
C
,
Shi
Y
,
Guo
H
,
Zhao
X
,
Yin
Z
, et al
A genome-wide association study identifies two new lung cancer susceptibility loci at 13q12.12 and 22q12.2 in Han Chinese
.
Nat Genet
2011
;
43
:
792
6
.
27.
Ahn
MJ
,
Won
HH
,
Lee
J
,
Lee
ST
,
Sun
JM
,
Park
YH
, et al
The 18p11.22 locus is associated with never smoker non–small cell lung cancer susceptibility in Korean populations
.
Hum Genet
2012
;
131
:
365
72
.
28.
Lan
Q
,
Hsiung
CA
,
Matsuo
K
,
Hong
YC
,
Seow
A
,
Wang
Z
, et al
Genome-wide association analysis identifies new lung cancer susceptibility loci in never-smoking women in Asia
.
Nat Genet
2012
;
44
:
1330
5
.
29.
Hsiung
CA
,
Lan
Q
,
Hong
YC
,
Chen
CJ
,
Hosgood
HD
,
Chang
IS
, et al
The 5p15.33 locus is associated with risk of lung adenocarcinoma in never-smoking females in Asia
.
PLoS Genet
2010
;
6
:
e1001051
.
30.
Dong
J
,
Jin
G
,
Wu
C
,
Guo
H
,
Zhou
B
,
Lv
J
, et al
Genome-wide association study identifies a novel susceptibility locus at 12q23.1 for lung squamous cell carcinoma in han chinese
.
PLoS Genet
2013
;
9
:
e1003190
.
31.
Wu
C
,
Hu
Z
,
Yu
D
,
Huang
L
,
Jin
G
,
Liang
J
, et al
Genetic variants on chromosome 15q25 associated with lung cancer risk in Chinese populations
.
Cancer Res
2009
;
69
:
5065
72
.
32.
Campo
L
,
Rossella
F
,
Fustinoni
S
. 
Development of a gas chromatography/mass spectrometry method to quantify several urinary monohydroxy metabolites of polycyclic aromatic hydrocarbons in occupationally exposed subjects
.
J Chromatogr B Analyt Technol Biomed Life Sci
2008
;
875
:
531
40
.
33.
Kuang
D
,
Zhang
W
,
Deng
Q
,
Zhang
X
,
Huang
K
,
Guan
L
, et al
Dose–response relationships of polycyclic aromatic hydrocarbons exposure and oxidative damage to DNA and lipid in coke oven workers
.
Environ Sci Technol
2013
;
47
:
7446
56
.
34.
Yuan
J
,
Chen
L
,
Chen
D
,
Guo
H
,
Bi
X
,
Ju
Y
, et al
Elevated serum polybrominated diphenyl ethers and thyroid-stimulating hormone associated with lymphocytic micronuclei in Chinese workers from an E-waste dismantling site
.
Environ Sci Technol
2008
;
42
:
2195
200
.
35.
Fenech
M
. 
Cytokinesis-block micronucleus cytome assay
.
Nat Protoc
2007
;
2
:
1084
104
.
36.
Genomatix software suite V3.1. Retrieved April 7, 2013. Available from
: http://www.genomatix.de
[Online]
.
37.
Ding
YS
,
Trommel
JS
,
Yan
XJ
,
Ashley
D
,
Watson
CH
. 
Determination of 14 polycyclic aromatic hydrocarbons in mainstream smoke from domestic cigarettes
.
Environ Sci Technol
2005
;
39
:
471
8
.
38.
Piipari
R
,
Savela
K
,
Nurminen
T
,
Hukkanen
J
,
Raunio
H
,
Hakkola
J
, et al
Expression of CYP1A1, CYP1B1, and CYP3A, and polycyclic aromatic hydrocarbon-DNA adduct formation in bronchoalveolar macrophages of smokers and nonsmokers
.
Int J Cancer
2000
;
86
:
610
6
.
39.
Eby
MT
,
Jasmin
A
,
Kumar
A
,
Sharma
K
,
Chaudhary
PM
. 
TAJ, a novel member of the tumor necrosis factor receptor family, activates the c-Jun N-terminal kinase pathway and mediates caspase-independent cell death
.
J Biol Chem
2000
;
275
:
15336
42
.
40.
Robertson
RP
. 
Chronic oxidative stress as a central mechanism for glucose toxicity in pancreatic islet beta cells in diabetes
.
J Biol Chem
2004
;
279
:
42351
4
.
41.
Martindale
JL
,
Holbrook
NJ
. 
Cellular response to oxidative stress: signaling for suicide and survival
.
J Cell Physiol
2002
;
192
:
1
15
.
42.
Cunnington
MS
,
Santibanez Koref
M
,
Mayosi
BM
,
Burn
J
,
Keavney
B
. 
Chromosome 9p21 SNPs associated with multiple disease phenotypes correlate with ANRIL expression
.
PLoS Genet
2010
;
6
:
e1000899
.
43.
Wan
G
,
Mathur
R
,
Hu
X
,
Liu
Y
,
Zhang
X
,
Peng
G
, et al
Long noncoding RNA ANRIL (CDKN2B-AS) is induced by the ATM-E2F1 signaling pathway
.
Cell Signal
2013
;
25
:
1086
95
.
44.
Kim
MA
,
Kim
HJ
,
Brown
AL
,
Lee
MY
,
Bae
YS
,
Park
JI
, et al
Identification of novel substrates for human checkpoint kinase Chk1 and Chk2 through genome-wide screening using a consensus Chk phosphorylation motif
.
Exp Mol Med
2007
;
39
:
205
12
.
45.
Liu
Y
,
Liu
P
,
Wen
W
,
James
MA
,
Wang
Y
,
Bailey-Wilson
JE
, et al
Haplotype and cell proliferation analyses of candidate lung cancer susceptibility genes on chromosome 15q24-25.1
.
Cancer Res
2009
;
69
:
7844
50
.