Exposure to tobacco smoke and to mutagenic xenobiotics can cause various types of DNA damage in lung cells, which, if not corrected by DNA repair systems, may lead to deregulation of the cell cycle and, ultimately, to cancer. Genetic variation could thus be an important factor in determining susceptibility to tobacco-induced lung cancer with genetic susceptibility playing a larger role in young-onset cases compared with that in the general population. We have therefore studied 102 single-nucleotide polymorphisms (SNP) in 34 key DNA repair and cell cycle control genes in 299 lung cancer cases diagnosed before the age of 50 years and 317 controls from six countries of Central and Eastern Europe. We have found no association of lung cancer risk with polymorphisms in genes related to cell cycle control, single-strand/double-strand break repair, or base excision repair. Significant associations (P < 0.05) were found with polymorphisms in genes involved in DNA damage sensing (ATM) and, interestingly, in four genes encoding proteins involved in mismatch repair (LIG1, LIG3, MLH1, and MSH6). The strongest associations were observed with heterozygote carriers of LIG1 −7C>T [odds ratio (OR), 1.73; 95% confidence interval (95% CI), 1.13-2.64] and homozygote carriers of LIG3 rs1052536 (OR, 2.05; 95% CI, 1.25-3.38). Consideration of the relatively large number of markers assessed diminishes the significance of these findings; thus, these SNPs should be considered promising candidates for further investigation in other independent populations. (Cancer Res 2006; 66(22): 11062-9)

Exposure to tobacco smoke and to mutagenic xenobiotics causes various types of DNA damage to lung cells (1). Such lesions and the mutations that arise from them can lead to alterations in the normal program of cell cycle and growth, favoring the development of cancer. Cells counteract the deleterious effects of damaged DNA through different repair pathways and strategies, including cell cycle control and apoptosis induction (2). The “base excision repair” (BER) pathway is the primary mechanism to remove incorrect and damaged bases, including those formed by reactive oxygen species, such as 8-hydroxyguanine, and methylating agents (e.g., 7-methylguanine and 3-methyladenine; ref. 3). The mismatch repair (MMR) pathway corrects errors of DNA replication that escape the proofreading functions of DNA polymerases as well as heterologies formed during recombination. It has also been linked to cell cycle checkpoint activation and cell death, and thus, alterations in this process can have wide-ranging biological consequences (4).

The “nucleotide excision repair” (NER; either “global genome” NER or “transcription-coupled” NER) acts on a variety of helix-distorting DNA lesions, such as the pyrimidine dimers caused by UV light, and chemical adducts, such as those caused by benzo(a)pyrene (5). Finally, direct reversal of DNA damage, an unusual form of DNA repair, is also present in human cells. The promutagenic lesion O6-methylguanine is repaired by such a process where a specific methyltransferase, encoded by the MGMT gene, transfers the methyl group from the DNA guanine residue to a cysteine residue located at its active site in an error-free process (6).

Cells must also cope with the presence of DNA single-strand and double-strand breaks (SSB and DSB). SSBs can arise directly from damage to the DNA or indirectly as intermediates in the BER pathway. Like many of the other DNA repair pathways in human cells, the processing of such breaks involves a series of coordinated, sequential reactions in which the damage is detected and processed and any gaps generated are filled and religated. Key proteins in this process are XRCC1, poly(ADP-ribose) polymerase 1 (PARP1), and the proliferating cell nuclear antigen (PCNA; ref. 7). The repair of DSBs, which are dangerously cytotoxic lesions formed after exposure to ionizing radiation, occurs through two main mechanisms: nonhomologous end joining (NHEJ) and the homologous recombination repair (HRR). These two processes have different fidelities: HR can be achieved with high fidelity, whereas NHEJ may result in loss of genetic information (8, 9).

A second arm in the maintenance of genome stability in the presence of DNA damage is achieved by the precise regulation of the cell cycle. DNA strand breaks and certain base adducts are very effective in inducing cell cycle blocks. Such damage is detected through different mechanisms involving the protein products of key genes, such as ataxia-telangiectasia mutated (ATM) and ATM and Rad3-related (ATR), which act upstream of p53 and other checkpoint proteins to induce cell cycle arrest (10). In addition to these mechanisms, apoptosis, the program of cell suicide, which is an important escape route to avoid damaged cells progressing to a malignant phenotype, can be activated in response to DNA damage. The ATM and p53 proteins are also involved in this process (11).

All these pathways contribute to the genome stability of the cell, and a deficiency in one or more of the relevant genes can lead to deregulated cell growth and, ultimately, to cancer development. The link between alterations in DNA repair genes and increased cancer risk is well documented in certain inherited disorders, such as xeroderma pigmentosum (deficiencies in NER), Bloom syndrome (deficiency in the DNA helicase RECQ), Fanconi anemia (FANCx genes), ataxia-telangiectasia (ATM), human nonpolyposis colorectal cancer (HNPCC, MMR genes), retinoblastoma (RB1), familial cutaneous melanoma (CDKN1A), Li-Fraumeni syndrome (TP53), familial breast/ovary cancers (BRCA1 or BRCA2), and Nijmegen breakage syndrome (NBS1). In addition, the importance of the integrity of DNA repair pathways to prevent lung carcinogenesis has been shown in several animal models where individual DNA repair genes are disrupted. For example, Xpc−/− mice show high susceptibility to lung cancer following exposure to acetylaminofluorene (12), Mgmt−/− and Parp1−/− mice to nitrosamine-induced tumors (13, 14), and Ogg1−/− mice to the spontaneous development of lung cancers (15).

Polymorphisms in DNA repair and cell cycle control genes, which affect the normal protein activity, may alter the efficiency of these processes and lead to genetic instability and increased cancer risk. Reduced DNA repair capacity and impaired control of the cell cycle are phenotypes found in lung cancer patients, and polymorphisms, which have been reported to be associated with the risk of developing lung cancer, include OGG1 S326C, XRCC1 R194W, MLH1 −93 G>A, PARP1 V762A, ERCC2 Gln751, RAD23B Val249, XPA 5′-untranslated region (UTR) G>A, CCND1 exon 4 A>G, and TP53 codon 72, intron 3, and intron 6 (1621).

These results support the hypothesis that genetic variations within DNA repair and cell cycle control pathways might be important in determining an individual risk of developing lung cancer. Thus, to investigate the role of gene variants in DNA repair and cell cycle control pathways, we have analyzed 102 single-nucleotide polymorphisms (SNPs) in 51 genes in a multicenter case-control study of early-onset lung cancer. Our rationale for restricting the analysis to subjects diagnosed before age 50 years was that the role of genetic susceptibility would be expected to be more important among subjects with a younger age of diagnosis. Indeed, previous epidemiologic studies have suggested that a large proportion of lung cancers occurring before age 50 years have a genetic component. Risks due to genetic factors are further amplified by cigarette smoking (22, 23).

Due to the quite low incidence of the disease occurring at younger ages, there are very few published studies on the genetic risk factors for early-onset lung cancer. The majority of these investigated the role of xenobiotic metabolism genes (2426), and none have explored, using a comprehensive panel of variants, the association of variants in DNA repair and cell cycle control genes and lung cancer risk.

Study population. This study was conducted in 15 centers in six Central and Eastern European countries: the Czech Republic (Prague, Olomouc, and Brno), Hungary (Borsod, Heves, Szabolcs, Szolnok, and Budapest), Poland (Warsaw and Lodz), Romania (Bucharest), Russia (Moscow), and Slovakia (Banska Bystrica, Bratislava, and Nitra). Each center followed an identical protocol and was responsible for recruiting a consecutive group of patients who were newly diagnosed with lung cancer and a comparable group of hospital-based control subjects without lung cancer from February 1998 to October 2002. All cancer diagnoses were confirmed histologically or cytologically. Eligible subjects (case patients and control subjects) must have resided in the study area for at least 1 year before recruitment. Lung cancer case patients were identified through an active search of the records of clinical and pathology departments at the participating hospitals. All centers attempted to recruit all eligible patients as soon as possible after the patient had received an initial diagnosis of lung cancer. The maximum time interval between diagnosis and recruitment was 3 months. All study subjects (case patients and control subjects) and their physicians provided written informed consent. This study was approved by the institutions at all study centers, and ethical approval was obtained from the IARC (Lyon, France), the coordinating center. At all centers, except the Warsaw center, control subjects were chosen from among inpatients and outpatients admitted to the same hospitals as the case patients or hospitals serving the same population; case patients and control subjects from each hospital were frequency matched by sex, age (±3 years), center, and referral (or residence) area. Control subjects were eligible for this study if they had been diagnosed with non-tobacco-related diseases or had undergone minor surgical procedures or had benign disorders, common infections, eye conditions (except cataract or diabetic retinopathy), or common orthopedic diseases (except osteoporosis). At the Warsaw center, control subjects were selected by random sampling of the general population using the Electronic List of Residents. Overall, the average participation rate was 91.0% among case patients and 91.2% among control subjects. Case patients and control subjects underwent an identical in-person interview during which they completed a detailed questionnaire and provided blood samples. The questionnaire collected information about demographic variables, such as sex, date of birth, and education level, medical history, family history of cancer, history of tobacco consumption, including frequency, intensity, duration, and status, history of alcohol consumption, diet history (using a general food frequency questionnaire), and occupational history. Blood samples were stored in liquid nitrogen. Of the 2,633 case patients and 2,884 control subjects who agreed to participate in this study, 2,188 (83%) case patients and 2,198 (76%) control subjects provided blood samples during the interview of whom 299 cases and 317 controls were ≤50 years of age at diagnosis and at recruitment, respectively, and form the study population for this analysis.

Selection of polymorphisms. The polymorphisms that were included in the panel to be studied were selected based on the following criteria: (a) having a biological significance shown through functional studies (e.g., OGG1 326Ser has a 7-fold higher activity for repairing 8-oxoG than the 326Cys variant), (b) associated with biological end points in epidemiologic studies [e.g., XRCC1 399Gln allele was associated with increased levels of aflatoxin B1-DNA adduct and increased bleomycin sensitivity (2729)], or (c) associated with the risk of cancer at any site in epidemiologic studies [e.g., the polymorphism at codon 194 within XRCC1 (30)]. However, for some DNA repair and cell cycle control genes, there were no variants that met these criteria. Therefore, for some genes, we added selected SNPs from dbSNP,14

which either (a) had a high frequency of the rare allele (to allow the highest statistical power to detect associations) or (b) coded for missense changes. Among the missense variants, we focused, in particular, on changes having the greatest potential for functional relevance, such as involving a proline residue (e.g., PARP P1328T) or the charge/polarity of the lateral group (e.g., CDKN2A A148T). Although the biological effect of many of the polymorphisms selected for study is not known, we expect that these criteria should have maximized the likelihood of having chosen SNPs with functional significance or associated with cancer.

Laboratory techniques. Genomic DNA was extracted from blood samples using Puregene chemistry (Gentra Systems, Minneapolis, MN). DNA concentrations were measured by using PicoGreen dsDNA quantification kits (Molecular Probes, Leiden, the Netherlands). All polymorphisms were analyzed together for a given sample by a microarray technique based on the arrayed primer extension (APEX) principle.

APEX consists of a sequencing reaction primed by an oligonucleotide anchored with its 5′-end to a glass slide and terminating just one nucleotide before the polymorphic site. A DNA polymerase extends the oligonucleotide by adding one fluorescently labeled dideoxynucleotide triphosphate (ddNTP) complementary to the variant base. Reading the incorporated fluorescence identifies the base in the target sequence. This method is suitable not only for SNPs but also for small insertion/deletion polymorphisms. Because both sense and antisense strands are probed, two oligonucleotides were designed for each polymorphism. In general, two 30-mers, one for each strand, complementary to each side of the polymorphism were designed both with their 3′-end pointing toward the polymorphism. The flanking sequences and their related APEX-oligonucleotides have been previously published (31). Five-prime (C-12) amino-linker oligonucleotides were synthesized by Sigma-Genosys (Cambridge, United Kingdom) and spotted onto silanized slides (32).

Genomic DNAs were amplified to enrich the fragments carrying the SNPs by using specific primer pairs. Then, PCR products were pooled, purified, concentrated using Millipore (Bedford, MA) Microcon MY30 columns, and fragmented. For single-base extension reaction, fragmented PCR products were incubated onto the slides together with the fluorescently labeled ddNTPs (4 × 50 pmol), 10× buffer, and 4 units ThermoSequenase (GE Healthcare, Little Chalfont, United Kingdom). All the details of the experimental protocol, including primer and probe sequences, were reported in previous articles (31, 33).

Slides were imaged by a Genorama-003 four-color detector equipped with Genorama image analysis software (Asper Biotech, Tartu, Estonia). Fluorescence intensities at each position were converted automatically into base calls by the software under the supervision of trained personnel. In case of more than one signal present on a given position, only the main signal was considered if the intensity of the weaker signal was <10% of the main signal. APEX gives a very high concordance compared with the standard genotyping or sequencing methods as assessed in previous studies (31, 33).

To ensure quality control, we followed several strategies: DNA samples from case patients and control subjects were randomly distributed, and all genotyping was conducted by personnel who were blinded to the case-control status of the subjects; each APEX oligonucleotide was spotted in replicate; each SNP was analyzed independently by genotyping both the sense and the antisense strands of the DNA (in case of disagreement, the base call was discarded); internal positive controls allowed to verify that the intensities of the four channels (A, C, T, and G) were equilibrated; base calls were scored by three independent trained operators and discordant results were rechecked and, in case of disagreement, discarded; DNA samples from individuals of known genotypes were added to check periodically the validity of the genotyping; we randomly selected 10% of the study subjects (i.e., both case patients and control subjects) and regenotyped their DNA samples for each polymorphism.

Statistical analysis. The frequency distributions of demographic variables and putative risk factors for lung cancer, including country of residence, age at recruitment (which for case patients was a proxy for age at diagnosis), sex, highest education level, and smoking status, were examined for case patients and control subjects. Former smokers were defined as smokers who stopped smoking at least 2 years before the interview. Tobacco consumption included smoking of cigarettes, pipes, and cigars. Cumulative tobacco consumption was calculated by multiplying smoking duration (in years) by smoking intensity (in the equivalent of cigarette packs) and expressed as pack-years. We categorized the subjects as light (≤14 pack-years), moderate (>14-38.26 pack-years), or heavy (>38.26 pack-years) smokers based on the tertiles of cumulative tobacco consumption among the control group. We tested the Hardy-Weinberg equilibrium of genotype distributions separately among case patients and control subjects. The minimum detectable odds ratio (OR) was calculated for each sequence variant based on its genotype frequency, our study sample size, and a statistical power of 80% as described previously (34). Our study had an 80% power to detect a minimum OR of 2.5 for relatively rare variants (5%) and a minimum OR of 1.6 for common variants (≥30%).

We used unconditional multivariate logistic regression analysis to examine associations between genetic polymorphisms and lung cancer risk by estimating ORs and 95% confidence intervals (95% CI). Genotypes were categorized into three groups (major allele homozygous, heterozygous, and homozygous variant) when the allele frequencies allowed or into two groups (major allele homozygous and minor allele carriers) for rare polymorphisms. Age, sex, country, and tobacco pack-year were included in all analyses as covariates.

We computed false-positive response probabilities (FPRP; ref. 35) for the nominally significant associations we have observed between SNPs and lung cancer risk. Prior probability is likely to be influenced by the biological knowledge of the gene, the functional significance of the variants, and the available epidemiologic evidence. It remains a subjective measure that may vary from one investigator to another based on the importance they assign to the different pieces of evidence. For this reason, we have calculated FPRP for a range of prior probabilities from 50% to 0.1%. Following Wacholder et al., we used a threshold of noteworthiness of FPRP ≤0.2. All statistical analyses were conducted using STATA software version 8.0 (Stata Corp. LP, College Station, TX). All statistical tests were two sided.

Table 1 shows the frequency distribution of demographic characteristics among the 299 cases and 317 controls. As expected, the proportion of current smokers was far higher among the cases than controls (86.6% versus 53.6%; P < 0.001). The sex and age distributions were similar among cases and controls, although there was a tendency for cases to have a lower education level than the controls (P = 0.01). With respect to the histologic types of the tumors in the cases, the numbers of adenocarcinoma, squamous, and small cell carcinomas were similar. We tested departure from Hardy-Weinberg equilibrium in the controls by a χ2 test using P = 0.01 as threshold. This threshold was chosen based on anticonservativeness of this test as noted by Wigginton et al. (36). All SNPs (except for MSH3 235G>A, MSH6 G39E, ERCC1 354T>C, ERCC2 D312N, and XRCC3 17893A>G) were in Hardy-Weinberg equilibrium in this population. The results of the initial and the repeat genotyping analyses were at least 99% concordant. Unfortunately, because of the nature of the method of genotyping where all the SNPs are analyzed on one chip, genotyping that failed for any individual SNP could not be repeated. Genotyping success rates for individual polymorphisms averaged 92%.

Table 1.

Demographic characteristics of lung cancer cases and controls

Cases, n (%)Controls, n (%)
Country   
    Romania 37 (12.4) 58 (18.3) 
    Hungary 67 (22.4) 51 (16.1) 
    Poland 101 (33.8) 108 (34.1) 
    Russia 42 (14.0) 32 (10.1) 
    Slovakia 37 (12.4) 25 (7.9) 
    Czech Republic 15 (5.0) 43 (13.6) 
Gender   
    Men 199 (66.6) 211 (66.6) 
    Women 100 (33.4) 106 (33.4) 
Age (completed), y   
    <30 0 (0.0) 4 (1.3) 
    30-34 7 (2.3) 15 (4.7) 
    35-39 17 (5.7) 26 (8.2) 
    40-44 82 (27.4) 79 (24.9) 
    45-49 193 (64.5) 193 (60.9) 
Education   
    Basic/elementary 21 (7.0) 6 (1.9) 
    Apprentice/vocational 108 (36.1) 105 (33.1) 
    Middle schools, ending by graduation 98 (32.8) 107 (33.8) 
    Postgradual, not university degree 52 (17.4) 60 (18.9) 
    University degree 20 (6.7) 35 (11.0) 
    Missing 0 (0.0) 4 (1.3) 
Smoking status   
    Never smokers 18 (6.0) 90 (28.4) 
    Former smokers 22 (7.4) 53 (16.7) 
    Current smokers 259 (86.6) 170 (53.6) 
    Missing 0 (0.0) 4 (1.3) 
Histology   
    Adenocarcinoma 86 (28.8)  
    Small cell carcinoma 61 (20.4)  
    Squamous cell 85 (28.4)  
    Other 67 (22.4)  
Total 299 317 
Cases, n (%)Controls, n (%)
Country   
    Romania 37 (12.4) 58 (18.3) 
    Hungary 67 (22.4) 51 (16.1) 
    Poland 101 (33.8) 108 (34.1) 
    Russia 42 (14.0) 32 (10.1) 
    Slovakia 37 (12.4) 25 (7.9) 
    Czech Republic 15 (5.0) 43 (13.6) 
Gender   
    Men 199 (66.6) 211 (66.6) 
    Women 100 (33.4) 106 (33.4) 
Age (completed), y   
    <30 0 (0.0) 4 (1.3) 
    30-34 7 (2.3) 15 (4.7) 
    35-39 17 (5.7) 26 (8.2) 
    40-44 82 (27.4) 79 (24.9) 
    45-49 193 (64.5) 193 (60.9) 
Education   
    Basic/elementary 21 (7.0) 6 (1.9) 
    Apprentice/vocational 108 (36.1) 105 (33.1) 
    Middle schools, ending by graduation 98 (32.8) 107 (33.8) 
    Postgradual, not university degree 52 (17.4) 60 (18.9) 
    University degree 20 (6.7) 35 (11.0) 
    Missing 0 (0.0) 4 (1.3) 
Smoking status   
    Never smokers 18 (6.0) 90 (28.4) 
    Former smokers 22 (7.4) 53 (16.7) 
    Current smokers 259 (86.6) 170 (53.6) 
    Missing 0 (0.0) 4 (1.3) 
Histology   
    Adenocarcinoma 86 (28.8)  
    Small cell carcinoma 61 (20.4)  
    Squamous cell 85 (28.4)  
    Other 67 (22.4)  
Total 299 317 

Table 2 shows ORs for polymorphisms in genes involved in detecting DNA breaks as well as in the negative modulation of cell cycle (including apoptosis). Only one SNP (IVS48+238 C>G within ATM) was found associated with a decreased risk of lung cancer (Ptrend = 0.03), with the other 34 SNPs (19 genes) showing no association. Table 3 shows similar results for SNPs within genes involved in SSB/DSB repair. These genes can be considered as downstream with respect to the first set of genes, being actually involved in the NHEJ, HRR, and SSB repair (SSBR). Among these 26 SNPs (12 genes), we found no overall association among the homozygote variants. Table 4 shows the results for SNPs within genes in the NER, BER, and MMR pathways and the MGMT gene. We analyzed 40 SNPs (20 genes) and found a borderline association for carriers of Gln751 in ERCC2 (OR, 0.7; 95% CI, 0.49-1.00), an increased risk for carriers of a SNP in the 3′-UTR of LIG3 (OR, 1.7; 95% CI, 1.13-2.56), a reduced risk for carriers of Val219 in MLH1 (OR, 0.69; 95% CI, 0.48-0.98), a borderline significant association for the homozygote carriers of the 540T allele in MSH6 (OR, 1.95; 95% CI, 1-3.78), and an increased risk for two SNPs in LIG1 (−7C>T and IVS9-21), with the strongest OR of 1.89 (95% CI, 1.24-2.87) for carriers of −7 T allele. Among these associations, it is interesting to note that LIG1, LIG3, MLH1, and MSH6 are all involved in the MMR. Calculation of FPRP showed that none of the above associations remained noteworthy (FPRP ≤ 0.2) when a prior probability of association of ≤1% was considered, and only the association of LIG1 −7C>T remained noteworthy assuming a prior probability of 10% (FPRP = 0.154).

Table 2.

Associations between lung cancer risk and SNPs of genes coding for DNA break sensors and negative regulators of the cell cycle (ATM-, ATR-, and p53-dependent cascades)

SNP namers no.Homozygotes common allele
Heterozygotes
Homozygotes rarer allele
Ptrend
CaCoCaCoOR (95% CI)CaCoOR (95% CI)
ATM: 5557 G>A -D1853N rs1801516 205 238 73 63 1.21 (0.80-1.85) 2.06 (0.50-8.49) 0.21 
ATM: IVS22-77T>C rs664677 108 85 134 170 0.68 (0.46-1.01) 47 54 0.77 (0.45-1.30) 0.18 
ATM: IVS48+238 C>G rs609429 92 70 101 113 0.7 (0.45-1.10) 35 52 0.55 (0.30-0.98) 0.03 
ATR: 632C>T -T211M rs2227928 99 111 138 138 1.21 (0.82-1.79) 49 48 1.07 (0.63-1.80) 0.63 
CCND1: 687bp 3 of STP G>C rs678653 105 112 105 114 0.84 (0.56-1.27) 34 31 1.04 (0.57-1.88) 0.82 
CCND1: 870G>A rs603965 79 81 139 156 0.81 (0.53-1.23) 71 76 0.94 (0.58-1.53) 0.79 
CDKN1B: −79C>T rs34330 168 178 111 115 1.09 (0.76-1.57) 17 17 0.96 (0.45-2.07) 0.80 
CDKN2A: 29bp 3 of STP C>G rs11515 219 238 69 70 1.09 (0.72-1.66) 0.47 (0.10-2.19) 0.92 
CDKN2A: 69bp 3 of STP C>T rs3088440 245 270 42 42 1.15 (0.69-1.89) — 0.45 
CDKN2A: A148T rs3731249 240 253 15 10 1.44 (0.60-3.49) — 0.16 
CDKN2B: C>A intron1 rs2069426 204 199 34 35 0.98 (0.56-1.71) 0.45 (0.04-4.57) 0.69 
CDKN2B: G>A intron1 rs974336 198 201 43 54 0.81 (0.50-1.32) 0.99 (0.15-6.31) 0.46 
GADD45A: 3812T>C rs532446 163 180 97 106 0.98 (0.67-1.43) 22 23 1.28 (0.65-2.50) 0.66 
MDM2: 309T>G rs2279744 100 115 147 143 1.27 (0.86-1.87) 35 44 0.88 (0.50-1.55) 0.91 
MDM2: E354E; -344 A>T rs769412 200 189 29 40 0.78 (0.44-1.36) 2.15 (0.37-12.70) 0.79 
p21/Cip1/CDKN1A: 20bp 3 of STP C>T rs1059234 251 272 43 36 1.23 (0.73-2.08) 0.91 (0.12-7.06) 0.52 
p21/Cip1/CDKN1A: S31R rs1801270 241 268 36 37 0.96 (0.56-1.65) 3.05 (0.30-31.35) 0.77 
PARP1/ADPRT1: IVS17-12 C>G rs4986819 238 249 48 55 0.92 (0.58-1.45) 1.49 (0.38-5.87) 1.00 
PARP1/ADPRT1: IVS4+12 G>A rs1805403 174 164 86 112 0.68 (0.47-1.01) 19 20 0.84 (0.41-1.72) 0.13 
PARP1/ADPRT1: P1328T rs1050112 98 113 100 115 1.16 (0.76-1.76) 32 36 0.98 (0.54-1.78) 0.84 
PARP1/ADPRT1: T802 A>T rs4986817 240 250 48 56 0.88 (0.55-1.39) 1.46 (0.37-5.75) 0.87 
PARP1/ADPRT1: V762A rs1136410 207 211 75 84 0.84 (0.57-1.25) 10 12 0.96 (0.38-2.46) 0.50 
RAD9A: 730bp 3 of STP G>A rs1064876 240 256 20 22 1.04 (0.53-2.06) — 0.59 
TP53: 14181 (C>T) rs12947788 211 211 21 31 0.6 (0.31-1.16) 0.41 (0.04-4.51) 0.09 
TP53: R72P -BstUI rs1042522 174 159 100 122 0.78 (0.53-1.13) 19 26 0.62 (0.32-1.23) 0.08 
TP53BP1: D353E rs560191 141 154 123 131 0.95 (0.66-1.37) 27 24 1.18 (0.61-2.29) 0.85 
TP53BP1: G412S rs689647 205 221 41 41 0.9 (0.54-1.50) 0.95 (0.13-7.21) 0.70 
TP53BP2: rs3738370 C>T 5′-UTR rs3738370 221 243 53 41 1.31 (0.81-2.12) 0.9 (0.21-3.95) 0.40 
TP53BP2: rs17739 G>A 5′-UTR rs17739 198 219 78 85 0.99 (0.67-1.47) 12 1.22 (0.46-3.19) 0.84 
XRCC5: 323bp 3 of STP T>C rs1051677 225 244 56 53 1.29 (0.82-2.03) 3.45 (0.34-34.69) 0.16 
CASP10: 1228G>A -V410I rs5837767 258 283 34 33 1.12 (0.64-1.94) — 0.31 
CASP3: IVS1-1555 A>C rs3087455 116 119 129 145 0.86 (0.59-1.25) 35 37 0.93 (0.52-1.66) 0.59 
CASP8: D302H rs1045485 233 255 60 56 1.15 (0.74-1.79) 0.47 (0.09-2.51) 0.88 
CASP9: Q221R rs1052576 100 99 128 148 0.82 (0.56-1.22) 60 66 0.94 (0.58-1.53) 0.70 
XRCC5: 841bp 3 of STP -74582 G>A rs2440 119 105 106 130 0.71 (0.48-1.05) 37 41 0.8 (0.46-1.40) 0.21 
SNP namers no.Homozygotes common allele
Heterozygotes
Homozygotes rarer allele
Ptrend
CaCoCaCoOR (95% CI)CaCoOR (95% CI)
ATM: 5557 G>A -D1853N rs1801516 205 238 73 63 1.21 (0.80-1.85) 2.06 (0.50-8.49) 0.21 
ATM: IVS22-77T>C rs664677 108 85 134 170 0.68 (0.46-1.01) 47 54 0.77 (0.45-1.30) 0.18 
ATM: IVS48+238 C>G rs609429 92 70 101 113 0.7 (0.45-1.10) 35 52 0.55 (0.30-0.98) 0.03 
ATR: 632C>T -T211M rs2227928 99 111 138 138 1.21 (0.82-1.79) 49 48 1.07 (0.63-1.80) 0.63 
CCND1: 687bp 3 of STP G>C rs678653 105 112 105 114 0.84 (0.56-1.27) 34 31 1.04 (0.57-1.88) 0.82 
CCND1: 870G>A rs603965 79 81 139 156 0.81 (0.53-1.23) 71 76 0.94 (0.58-1.53) 0.79 
CDKN1B: −79C>T rs34330 168 178 111 115 1.09 (0.76-1.57) 17 17 0.96 (0.45-2.07) 0.80 
CDKN2A: 29bp 3 of STP C>G rs11515 219 238 69 70 1.09 (0.72-1.66) 0.47 (0.10-2.19) 0.92 
CDKN2A: 69bp 3 of STP C>T rs3088440 245 270 42 42 1.15 (0.69-1.89) — 0.45 
CDKN2A: A148T rs3731249 240 253 15 10 1.44 (0.60-3.49) — 0.16 
CDKN2B: C>A intron1 rs2069426 204 199 34 35 0.98 (0.56-1.71) 0.45 (0.04-4.57) 0.69 
CDKN2B: G>A intron1 rs974336 198 201 43 54 0.81 (0.50-1.32) 0.99 (0.15-6.31) 0.46 
GADD45A: 3812T>C rs532446 163 180 97 106 0.98 (0.67-1.43) 22 23 1.28 (0.65-2.50) 0.66 
MDM2: 309T>G rs2279744 100 115 147 143 1.27 (0.86-1.87) 35 44 0.88 (0.50-1.55) 0.91 
MDM2: E354E; -344 A>T rs769412 200 189 29 40 0.78 (0.44-1.36) 2.15 (0.37-12.70) 0.79 
p21/Cip1/CDKN1A: 20bp 3 of STP C>T rs1059234 251 272 43 36 1.23 (0.73-2.08) 0.91 (0.12-7.06) 0.52 
p21/Cip1/CDKN1A: S31R rs1801270 241 268 36 37 0.96 (0.56-1.65) 3.05 (0.30-31.35) 0.77 
PARP1/ADPRT1: IVS17-12 C>G rs4986819 238 249 48 55 0.92 (0.58-1.45) 1.49 (0.38-5.87) 1.00 
PARP1/ADPRT1: IVS4+12 G>A rs1805403 174 164 86 112 0.68 (0.47-1.01) 19 20 0.84 (0.41-1.72) 0.13 
PARP1/ADPRT1: P1328T rs1050112 98 113 100 115 1.16 (0.76-1.76) 32 36 0.98 (0.54-1.78) 0.84 
PARP1/ADPRT1: T802 A>T rs4986817 240 250 48 56 0.88 (0.55-1.39) 1.46 (0.37-5.75) 0.87 
PARP1/ADPRT1: V762A rs1136410 207 211 75 84 0.84 (0.57-1.25) 10 12 0.96 (0.38-2.46) 0.50 
RAD9A: 730bp 3 of STP G>A rs1064876 240 256 20 22 1.04 (0.53-2.06) — 0.59 
TP53: 14181 (C>T) rs12947788 211 211 21 31 0.6 (0.31-1.16) 0.41 (0.04-4.51) 0.09 
TP53: R72P -BstUI rs1042522 174 159 100 122 0.78 (0.53-1.13) 19 26 0.62 (0.32-1.23) 0.08 
TP53BP1: D353E rs560191 141 154 123 131 0.95 (0.66-1.37) 27 24 1.18 (0.61-2.29) 0.85 
TP53BP1: G412S rs689647 205 221 41 41 0.9 (0.54-1.50) 0.95 (0.13-7.21) 0.70 
TP53BP2: rs3738370 C>T 5′-UTR rs3738370 221 243 53 41 1.31 (0.81-2.12) 0.9 (0.21-3.95) 0.40 
TP53BP2: rs17739 G>A 5′-UTR rs17739 198 219 78 85 0.99 (0.67-1.47) 12 1.22 (0.46-3.19) 0.84 
XRCC5: 323bp 3 of STP T>C rs1051677 225 244 56 53 1.29 (0.82-2.03) 3.45 (0.34-34.69) 0.16 
CASP10: 1228G>A -V410I rs5837767 258 283 34 33 1.12 (0.64-1.94) — 0.31 
CASP3: IVS1-1555 A>C rs3087455 116 119 129 145 0.86 (0.59-1.25) 35 37 0.93 (0.52-1.66) 0.59 
CASP8: D302H rs1045485 233 255 60 56 1.15 (0.74-1.79) 0.47 (0.09-2.51) 0.88 
CASP9: Q221R rs1052576 100 99 128 148 0.82 (0.56-1.22) 60 66 0.94 (0.58-1.53) 0.70 
XRCC5: 841bp 3 of STP -74582 G>A rs2440 119 105 106 130 0.71 (0.48-1.05) 37 41 0.8 (0.46-1.40) 0.21 

NOTE: Statistically significant results (P < 0.05) are reported in bold.

Abbreviations: Ca, cases; Co, controls; OR, odds ratios adjusted for age, sex, country, and tobacco smoking.

Table 3.

Associations between lung cancer risk and SNPs of genes involved in the DNA break repair pathways (HRR, NHEJ, and SSBR)

SNP namers no.Homozygotes common allele
Heterozygotes
Homozygotes rarer allele
Ptrend
CaCoCaCoOR (95% CI)CaCoOR (95% CI)
BARD1: 143C>T -P24S rs1048108 97 94 143 154 0.94 (0.63-1.39) 48 59 0.76 (0.45-1.27) 0.32 
BARD1: 1592G>A -M507V rs2070094 110 102 134 148 0.77 (0.53-1.14) 42 48 0.80 (0.46-1.37) 0.27 
BARD1: H506H C>T rs2070093 204 221 83 80 1.11 (0.75-1.64) 0.92 (0.32-2.66) 0.75 
BRCA1: D693N rs4986850 231 230 27 44 0.66 (0.38-1.15) 0.59 (0.05-6.82) 0.13 
BRCA1: E1038G rs16941 121 136 117 115 1.15 (0.78-1.69) 37 29 1.51 (0.84-2.71) 0.18 
BRCA1: P871L rs799917 116 137 137 143 1.26 (0.87-1.82) 41 32 1.59 (0.90-2.81) 0.08 
BRCA1: Q356R rs1799950 238 258 38 48 0.99 (0.60-1.62) 0.92 (0.07-12.13) 0.95 
BRCA2: −26G>A rs1799943 138 150 107 114 0.98 (0.67-1.44) 29 32 0.94 (0.51-1.72) 0.83 
BRCA2: N372H rs144848 159 188 96 98 1.13 (0.77-1.65) 28 18 1.78 (0.90-3.52) 0.13 
BRCA2: T1915M rs4987117 269 290 17 20 1.07 (0.51-2.21) — 0.82 
LIG4: -176C>T rs1805388 204 206 69 79 1.01 (0.67-1.52) 1.69 (0.36-8.00) 0.77 
LIG4: -194C>T rs1805389 243 263 32 44 0.89 (0.52-1.54) 0.99 (0.12-8.07) 0.71 
NBS1: L34 G>A rs1063045 123 134 117 125 0.90 (0.61-1.32) 32 42 0.82 (0.46-1.45) 0.45 
NBS1: Q185E rs1805794 134 140 121 134 0.84 (0.58-1.23) 31 36 0.97 (0.54-1.73) 0.62 
RAD51: 135 G>C rs1801320 222 242 65 68 1.01 (0.67-1.54) 1.33 (0.38-4.65) 0.78 
RAD51: 172 G>T rs1801321 76 97 79 80 1.42 (0.88-2.27) 11 17 1.02 (0.42-2.49) 0.37 
RAD52: C>T2259 -744bp 3 of STP rs11226 84 101 145 144 1.04 (0.69-1.55) 62 66 0.98 (0.60-1.59) 0.96 
RAD54B: N250 T>C rs2291439 116 124 122 138 1.00 (0.68-1.47) 34 39 0.86 (0.48-1.54) 0.70 
RECQL: 6bp 3 of STP A>C rs13035 86 99 126 127 1.27 (0.84-1.92) 58 60 1.09 (0.66-1.80) 0.60 
XRCC2: 41657C>T rs718282 259 262 37 48 0.70 (0.42-1.16) — 0.09 
XRCC2: 4234G>C rs3218384 160 161 84 106 0.94 (0.64-1.40) 18 17 1.17 (0.55-2.48) 0.92 
XRCC2: R188H rs3218536 217 217 22 27 0.93 (0.48-1.79) 0.24 (0.02-2.67) 0.44 
XRCC3: 17893A>G -IVS5-14 rs1799796 123 119 137 165 0.91 (0.63-1.32) 33 27 1.27 (0.69-2.35) 0.75 
XRCC3: 4541A>G rs1799794 176 205 79 74 1.16 (0.77-1.74) 13 17 0.64 (0.28-1.46) 0.81 
XRCC3: T241M rs861539 127 129 132 143 1.01 (0.70-1.46) 36 40 0.91 (0.52-1.60) 0.83 
XRCC4: N298S -IVS7-1 G>A rs1805377 173 215 48 41 1.47 (0.90-2.41) 2.98 (0.41-21.85) 0.07 
SNP namers no.Homozygotes common allele
Heterozygotes
Homozygotes rarer allele
Ptrend
CaCoCaCoOR (95% CI)CaCoOR (95% CI)
BARD1: 143C>T -P24S rs1048108 97 94 143 154 0.94 (0.63-1.39) 48 59 0.76 (0.45-1.27) 0.32 
BARD1: 1592G>A -M507V rs2070094 110 102 134 148 0.77 (0.53-1.14) 42 48 0.80 (0.46-1.37) 0.27 
BARD1: H506H C>T rs2070093 204 221 83 80 1.11 (0.75-1.64) 0.92 (0.32-2.66) 0.75 
BRCA1: D693N rs4986850 231 230 27 44 0.66 (0.38-1.15) 0.59 (0.05-6.82) 0.13 
BRCA1: E1038G rs16941 121 136 117 115 1.15 (0.78-1.69) 37 29 1.51 (0.84-2.71) 0.18 
BRCA1: P871L rs799917 116 137 137 143 1.26 (0.87-1.82) 41 32 1.59 (0.90-2.81) 0.08 
BRCA1: Q356R rs1799950 238 258 38 48 0.99 (0.60-1.62) 0.92 (0.07-12.13) 0.95 
BRCA2: −26G>A rs1799943 138 150 107 114 0.98 (0.67-1.44) 29 32 0.94 (0.51-1.72) 0.83 
BRCA2: N372H rs144848 159 188 96 98 1.13 (0.77-1.65) 28 18 1.78 (0.90-3.52) 0.13 
BRCA2: T1915M rs4987117 269 290 17 20 1.07 (0.51-2.21) — 0.82 
LIG4: -176C>T rs1805388 204 206 69 79 1.01 (0.67-1.52) 1.69 (0.36-8.00) 0.77 
LIG4: -194C>T rs1805389 243 263 32 44 0.89 (0.52-1.54) 0.99 (0.12-8.07) 0.71 
NBS1: L34 G>A rs1063045 123 134 117 125 0.90 (0.61-1.32) 32 42 0.82 (0.46-1.45) 0.45 
NBS1: Q185E rs1805794 134 140 121 134 0.84 (0.58-1.23) 31 36 0.97 (0.54-1.73) 0.62 
RAD51: 135 G>C rs1801320 222 242 65 68 1.01 (0.67-1.54) 1.33 (0.38-4.65) 0.78 
RAD51: 172 G>T rs1801321 76 97 79 80 1.42 (0.88-2.27) 11 17 1.02 (0.42-2.49) 0.37 
RAD52: C>T2259 -744bp 3 of STP rs11226 84 101 145 144 1.04 (0.69-1.55) 62 66 0.98 (0.60-1.59) 0.96 
RAD54B: N250 T>C rs2291439 116 124 122 138 1.00 (0.68-1.47) 34 39 0.86 (0.48-1.54) 0.70 
RECQL: 6bp 3 of STP A>C rs13035 86 99 126 127 1.27 (0.84-1.92) 58 60 1.09 (0.66-1.80) 0.60 
XRCC2: 41657C>T rs718282 259 262 37 48 0.70 (0.42-1.16) — 0.09 
XRCC2: 4234G>C rs3218384 160 161 84 106 0.94 (0.64-1.40) 18 17 1.17 (0.55-2.48) 0.92 
XRCC2: R188H rs3218536 217 217 22 27 0.93 (0.48-1.79) 0.24 (0.02-2.67) 0.44 
XRCC3: 17893A>G -IVS5-14 rs1799796 123 119 137 165 0.91 (0.63-1.32) 33 27 1.27 (0.69-2.35) 0.75 
XRCC3: 4541A>G rs1799794 176 205 79 74 1.16 (0.77-1.74) 13 17 0.64 (0.28-1.46) 0.81 
XRCC3: T241M rs861539 127 129 132 143 1.01 (0.70-1.46) 36 40 0.91 (0.52-1.60) 0.83 
XRCC4: N298S -IVS7-1 G>A rs1805377 173 215 48 41 1.47 (0.90-2.41) 2.98 (0.41-21.85) 0.07 
Table 4.

Associations between lung cancer risk and SNPs of genes mainly involved in NER, BER (GG-NER, TC-NER; single-nucleotide BER, multiple-nucleotide BER), MMR, and MGMT

SNP namers no.Homozygotes common allele
Heterozygotes
Homozygotes rarer allele
Ptrend
CaCoCaCoOR (95% CI)CaCoOR (95% CI)
ERCC1: 15310G>C rs16979802 207 215 36 40 0.92 (0.54-1.56) 5.35 (0.53-54.31) 0.69 
ERCC1: 17677C>A rs3212961 217 240 61 66 1.14 (0.74-1.75) 1.87 (0.60-5.79) 0.27 
ERCC1: 19716G>C rs3212948 132 131 120 134 0.95 (0.65-1.38) 36 43 0.77 (0.45-1.33) 0.39 
ERCC1: 354T>C -19007T>C −N118N rs11615 124 119 102 118 0.89 (0.60-1.33) 44 48 0.83 (0.49-1.40) 0.44 
ERCC1: 8092C>A rs3212986 162 154 97 101 0.80 (0.54-1.19) 17 18 0.79 (0.37-1.70) 0.28 
ERCC2-XPD: D312N rs1799793 118 113 111 126 0.91 (0.61-1.35) 48 66 0.65 (0.40-1.08) 0.12 
ERCC2-XPD: L751Q rs13181 132 108 109 135 0.70 (0.47-1.03) 49 58 0.70 (0.42-1.14) 0.08 
ERCC2-XPD: R156R C>A rs238406 92 108 137 149 1.05 (0.71-1.55) 65 49 1.42 (0.86-2.35) 0.21 
ERCC4-XPF: R415Q rs1800067 258 277 40 34 1.17 (0.68-2.01) — 0.95 
ERCC5-XPG: 335C>T -H46H rs1047768 92 89 128 140 0.92 (0.61-1.38) 53 60 0.97 (0.58-1.63) 0.86 
ERCC5-XPG: 3508G>C −H1104D rs17655 167 156 65 83 0.78 (0.51-1.19) 14 17 0.51 (0.22-1.15) 0.07 
RAD23B: A249V rs1805329 177 169 89 104 0.86 (0.59-1.26) 13 23 0.49 (0.23-1.04) 0.08 
XPC: R939Q rs2228001 93 97 104 122 0.88 (0.57-1.34) 37 43 0.70 (0.39-1.24) 0.23 
APEX1/APE1: D148E rs3136820 84 80 140 147 0.91 (0.60-1.38) 69 84 0.90 (0.55-1.46) 0.65 
APEX 1/APE1: Q51H rs1048945 276 294 14 19 0.70 (0.32-1.51) — 0.36 
LIG3: 1508bp 3 of STP C>T rs1052536 62 92 146 153 1.54 (1.00-2.38) 85 68 2.05 (1.25-3.38) <0.01 
MLH1: 676A>G -I219V rs1799977 145 129 123 151 0.73 (0.50-1.05) 23 29 0.51 (0.26-0.98) 0.02 
MSH2: G322D rs4987188 276 286 15 12 1.22 (0.54-2.76) — 0.98 
MSH3: 235G>A -V79I -V70I rs1650697 151 155 12 22 0.57 (0.26-1.28) 1.56 (0.41-5.92) 0.72 
MSH3: 2835G>A -R940Q rs184967 212 215 71 83 0.84 (0.56-1.25) 0.97 (0.34-2.76) 0.48 
MSH3: 3124A>G -T1036A −T1045A rs26279 153 154 116 123 0.92 (0.64-1.33) 27 35 0.78 (0.43-1.42) 0.41 
MSH6: D180 -540T>C rs1800935 158 155 102 133 0.77 (0.53-1.11) 33 19 1.95 (1.00-3.78) 0.55 
MSH6: G39E rs1042821 171 162 59 71 0.93 (0.59-1.45) 11 17 0.43 (0.18-1.05) 0.14 
MUTYH: Q324H rs3219489 205 201 83 100 0.80 (0.55-1.18) 11 0.84 (0.31-2.30) 0.29 
MUTYH: V22M rs3219484 253 271 39 44 0.98 (0.59-1.62) 0.78 (0.05-13.04) 0.89 
OGG1: S326C -m6 -Fnu4HI rs1052133 203 209 82 94 1.01 (0.69-1.48) 12 1.68 (0.64-4.39) 0.51 
PMS2: M622I rs1805324 153 151 10 1.47 (0.51-4.26) — 0.48 
POLB: P242R rs3136797 280 301 16 14 1.52 (0.70-3.33) 2.32 (0.11-47.78) 0.24 
XRCC1: R194W rs1799782 263 262 32 53 0.61 (0.36-1.02) 2.73 (0.13-58.07) 0.10 
XRCC1: R280H rs25489 260 290 32 25 1.45 (0.80-2.65) — 0.18 
XRCC1: R399Q rs25487 118 123 143 149 1.07 (0.74-1.56) 34 42 0.85 (0.48-1.49) 0.77 
LIG1: −7C>T rs20579 206 245 73 61 1.73 (1.13-2.64) — <0.01 
LIG1: IVS2+12 C>T rs3730849 114 111 134 142 0.91 (0.62-1.33) 40 56 0.70 (0.42-1.18) 0.20 
LIG1: IVS9-21 A>G rs3730931 220 255 64 52 1.73 (1.11-2.72) 3.19 (0.52-19.42) 0.01 
PCNA: 1876A>G rs25405 207 217 52 54 0.94 (0.59-1.49) 6.85 (0.64-73.88) 0.60 
PCNA: 2232C>T rs25406 97 107 135 136 1.19 (0.80-1.77) 44 44 1.29 (0.74-2.23) 0.32 
MGMT/AGT: 171C>T -L53L rs1803965 218 223 72 83 0.97 (0.65-1.44) 1.08 (0.25-4.69) 0.91 
MGMT/AGT: 262C>T -L84F rs12917 212 219 76 83 1.04 (0.70-1.55) 1.01 (0.25-4.12) 0.86 
MGMT/AGT: 427A>G -I143V rs2308321 238 241 54 71 0.73 (0.48-1.13) 0.75 (0.13-4.50) 0.17 
MGMT/AGT: 533A>G -K178R rs2308327 236 236 51 70 0.71 (0.46-1.10) 0.73 (0.12-4.40) 0.13 
SNP namers no.Homozygotes common allele
Heterozygotes
Homozygotes rarer allele
Ptrend
CaCoCaCoOR (95% CI)CaCoOR (95% CI)
ERCC1: 15310G>C rs16979802 207 215 36 40 0.92 (0.54-1.56) 5.35 (0.53-54.31) 0.69 
ERCC1: 17677C>A rs3212961 217 240 61 66 1.14 (0.74-1.75) 1.87 (0.60-5.79) 0.27 
ERCC1: 19716G>C rs3212948 132 131 120 134 0.95 (0.65-1.38) 36 43 0.77 (0.45-1.33) 0.39 
ERCC1: 354T>C -19007T>C −N118N rs11615 124 119 102 118 0.89 (0.60-1.33) 44 48 0.83 (0.49-1.40) 0.44 
ERCC1: 8092C>A rs3212986 162 154 97 101 0.80 (0.54-1.19) 17 18 0.79 (0.37-1.70) 0.28 
ERCC2-XPD: D312N rs1799793 118 113 111 126 0.91 (0.61-1.35) 48 66 0.65 (0.40-1.08) 0.12 
ERCC2-XPD: L751Q rs13181 132 108 109 135 0.70 (0.47-1.03) 49 58 0.70 (0.42-1.14) 0.08 
ERCC2-XPD: R156R C>A rs238406 92 108 137 149 1.05 (0.71-1.55) 65 49 1.42 (0.86-2.35) 0.21 
ERCC4-XPF: R415Q rs1800067 258 277 40 34 1.17 (0.68-2.01) — 0.95 
ERCC5-XPG: 335C>T -H46H rs1047768 92 89 128 140 0.92 (0.61-1.38) 53 60 0.97 (0.58-1.63) 0.86 
ERCC5-XPG: 3508G>C −H1104D rs17655 167 156 65 83 0.78 (0.51-1.19) 14 17 0.51 (0.22-1.15) 0.07 
RAD23B: A249V rs1805329 177 169 89 104 0.86 (0.59-1.26) 13 23 0.49 (0.23-1.04) 0.08 
XPC: R939Q rs2228001 93 97 104 122 0.88 (0.57-1.34) 37 43 0.70 (0.39-1.24) 0.23 
APEX1/APE1: D148E rs3136820 84 80 140 147 0.91 (0.60-1.38) 69 84 0.90 (0.55-1.46) 0.65 
APEX 1/APE1: Q51H rs1048945 276 294 14 19 0.70 (0.32-1.51) — 0.36 
LIG3: 1508bp 3 of STP C>T rs1052536 62 92 146 153 1.54 (1.00-2.38) 85 68 2.05 (1.25-3.38) <0.01 
MLH1: 676A>G -I219V rs1799977 145 129 123 151 0.73 (0.50-1.05) 23 29 0.51 (0.26-0.98) 0.02 
MSH2: G322D rs4987188 276 286 15 12 1.22 (0.54-2.76) — 0.98 
MSH3: 235G>A -V79I -V70I rs1650697 151 155 12 22 0.57 (0.26-1.28) 1.56 (0.41-5.92) 0.72 
MSH3: 2835G>A -R940Q rs184967 212 215 71 83 0.84 (0.56-1.25) 0.97 (0.34-2.76) 0.48 
MSH3: 3124A>G -T1036A −T1045A rs26279 153 154 116 123 0.92 (0.64-1.33) 27 35 0.78 (0.43-1.42) 0.41 
MSH6: D180 -540T>C rs1800935 158 155 102 133 0.77 (0.53-1.11) 33 19 1.95 (1.00-3.78) 0.55 
MSH6: G39E rs1042821 171 162 59 71 0.93 (0.59-1.45) 11 17 0.43 (0.18-1.05) 0.14 
MUTYH: Q324H rs3219489 205 201 83 100 0.80 (0.55-1.18) 11 0.84 (0.31-2.30) 0.29 
MUTYH: V22M rs3219484 253 271 39 44 0.98 (0.59-1.62) 0.78 (0.05-13.04) 0.89 
OGG1: S326C -m6 -Fnu4HI rs1052133 203 209 82 94 1.01 (0.69-1.48) 12 1.68 (0.64-4.39) 0.51 
PMS2: M622I rs1805324 153 151 10 1.47 (0.51-4.26) — 0.48 
POLB: P242R rs3136797 280 301 16 14 1.52 (0.70-3.33) 2.32 (0.11-47.78) 0.24 
XRCC1: R194W rs1799782 263 262 32 53 0.61 (0.36-1.02) 2.73 (0.13-58.07) 0.10 
XRCC1: R280H rs25489 260 290 32 25 1.45 (0.80-2.65) — 0.18 
XRCC1: R399Q rs25487 118 123 143 149 1.07 (0.74-1.56) 34 42 0.85 (0.48-1.49) 0.77 
LIG1: −7C>T rs20579 206 245 73 61 1.73 (1.13-2.64) — <0.01 
LIG1: IVS2+12 C>T rs3730849 114 111 134 142 0.91 (0.62-1.33) 40 56 0.70 (0.42-1.18) 0.20 
LIG1: IVS9-21 A>G rs3730931 220 255 64 52 1.73 (1.11-2.72) 3.19 (0.52-19.42) 0.01 
PCNA: 1876A>G rs25405 207 217 52 54 0.94 (0.59-1.49) 6.85 (0.64-73.88) 0.60 
PCNA: 2232C>T rs25406 97 107 135 136 1.19 (0.80-1.77) 44 44 1.29 (0.74-2.23) 0.32 
MGMT/AGT: 171C>T -L53L rs1803965 218 223 72 83 0.97 (0.65-1.44) 1.08 (0.25-4.69) 0.91 
MGMT/AGT: 262C>T -L84F rs12917 212 219 76 83 1.04 (0.70-1.55) 1.01 (0.25-4.12) 0.86 
MGMT/AGT: 427A>G -I143V rs2308321 238 241 54 71 0.73 (0.48-1.13) 0.75 (0.13-4.50) 0.17 
MGMT/AGT: 533A>G -K178R rs2308327 236 236 51 70 0.71 (0.46-1.10) 0.73 (0.12-4.40) 0.13 

NOTE: Statistically significant results (P < 0.05) are reported in bold.

We have undertaken an exploratory investigation of genes involved in cell cycle and DNA repair among a population of lung cancer cases and controls on the basis that they are likely to be enriched for risk-associated genetic variants. The validity of this assumption is supported by the increased report of family history of lung and other tobacco-related cancers among the young-onset cases as opposed to those who developed lung cancer after the age of 50 years. Among the young-onset cases, there was over a 2-fold risk of lung cancer in first-degree family members (OR, 2.06; 95% CI, 1.16-3.65) and over a 4-fold risk of other tobacco-related cancers (OR, 4.94; 95% CI, 1.47-16.62). Conversely, among those ages >50 years, the risk of lung cancer among family members was much reduced (OR, 1.45; 95% CI, 1.15-1.81) and the risk of other tobacco-related cancers was not apparent (OR, 0.86; 95% CI, 0.58-1.29). Although we found several interesting statistically significant associations with modest increases in risk, none were significant when a prior probability of association of 1% was applied. When interpreting such results, it should be taken into account that, among the polymorphisms investigated in the present study, information on their effect on expression and protein function is available for only few of the SNPs studied. Although haplotype-tagging SNPs are available for all of the gene studies, this information was not available when SNPs were being selected for the current study. We therefore focused on variants with relatively high prevalence and those coding for missense changes, as these were more likely to affect the function of the encoded protein.

It is interesting to note that most of the genes with a significant association (i.e., LIG1, LIG3, MLH1, and MSH6) belong to the MMR and, to a minor extent, BER pathways. Although lung cancer is not considered part of the HNPCC spectrum (37), a significant excess of lung, liver, and brain tumors has been detected within HNPCC families (38). Moreover, an altered expression, including a complete lack of protein, of MLH1, in lung cancer, in conjunction with the presence of microsatellite instability in tumor tissue or allelic imbalance of MSH2, has been reported (39, 40). MLH1 was found inactivated in lung cancers following promoter methylation, indicating a role of this gene in lung cancer (27). It has also been suggested that the MLH1 and MSH3 genes could be involved in lung tumorigenesis through a gene dosage effect in those tumors that are hemizygotes and retaining the wild-type copies of MLH1 and MSH3 and not showing microsatellite instability (41). In a previous study from Korea, the variation −93 G>A within MLH1 was found associated with increased risk of squamous cell lung carcinoma (42). Moreover, the exposure to chromate seems to promote lung cancer also through the inhibition of hMLH1 (43). Finally, engineered mice deficient in the MMR gene Mutyh are prone to develop lung cancer (44). Our findings are consistent with the importance of MMR in protecting against the development of lung cancer and deserve further investigation.

One of the variants that was found to be associated with a protective effect was the IVS48+238 C>G variant in the ATM gene. This variant, when present in the homozygous state, has been previously found to be associated with an increased risk of breast cancer and, when present in the heterozygous state, with a decreased risk of therapeutic radiation sensitivity in breast cancer patients (45). Exposure to X-rays through occupational examinations was associated with an increased risk of lung cancer in a larger study population from which the cases and controls investigated here were selected (46). Clearly, further investigation of the role of the variants in the ATM gene and lung cancer risk is warranted.

The present study has some potential limitations. Potential selection bias can occur when subjects who agree to participate in the study have characteristics that differ from those of subjects who are eligible for the study. The major reason for nonparticipation in this study was the refusal of some eligible subjects, which might represent a form of self-selection. Bias from self-selection may affect estimates of exposure to environmental factors; however, it is unlikely that self-selection would be related to a subject's genotype. Results of a simulation study (47) suggested that selection factors that are related only to environmental factors might still lead to a biased estimate of genetic main effects if the environmental factors modify the genetic effects. Nevertheless, bias in the estimate of the genetic main effect due to the selection factor of environmental exposures can be adjusted for in the analyses by treating the environmental factors as potential confounders as we have done (47). Another limitation is the missing data generated in the process of genotyping. These missing data are likely to be independent from demographic variables and smoking status, and their consequence would be to reduce the statistical power of our study. However, the missing data were associated with country as the quality of the DNA samples varied from one country to another. Nevertheless, the missing data were not likely to bias the estimates to any meaningful extent because the call rate was high (with an average of 92%) and independent from the case-control status. In addition, any variation in the call rate by country was controlled for in the multivariable models.

Another potential limitation is related to the number of cases and controls, which limits the power of the study to detect significant associations particularly for rare alleles responsible for weak associations. For example, in our study, the XRCC1 polymorphism R194W is associated with a reduced risk of cancer, in agreement with the results reported in literature for this polymorphism and a recent meta-analysis (30), but the association was not strong enough to reach statistical significance. We also did not observe the association with the OGG1 S326C SNP that we found when we analyzed the complete study population, regardless of age (48). We think that the reason for this is that the OGG1 SNP confers risk only to subjects who are homozygous for the minor allele, a condition too rare to be associated with a detectable risk with the sample size of the present study. Despite these limitations, the present study is one of the largest investigations, on the role of DNA repair and cell cycle control genes in relation to the risk of lung cancer in people with a young age. The findings of this study shed a new light on early-onset lung cancer; however, confirmation in a second independent study is required. Nonetheless, further studies on the role of ligase LIG1 and LIG3 seem promising.

Note: S. Landi and F. Gemignani contributed equally to this work.

Grant support: European Commission (DG-XII; contract IC15-CT96-0313), National Cancer Institute R01 grant (contract CA 092039-01A2), Association for International Cancer Research grant (contract 03-281), “Marie-Curie Reintegration Grant” (contract MERG-CT-2004-506373), and Associazione Italiana per la Ricerca sul Cancro principal investigator grant 2005. The Warsaw part of the study was supported by a local grant from The Polish State Committee for Scientific Research (grant SPUB-M-COPERNICUS/P-05/DZ-30/99/2000). F. Gemignani is a recipient of a fellowship from the International Association for the Study on Lung Cancer.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1
IARC. IARC monographs on the evaluation of carcinogenic risks to humans, vol. 83. Tobacco smoke and involuntary smoking. Lyon (France): IARC; 2004. p. 1–1382.
2
Friedberg EC. DNA damage and repair.
Nature
2003
;
421
:
436
–40.
3
Krokan HE, Nilsen H, Skorpen F, Otterlei M, Slupphaug G. Base excision repair of DNA in mammalian cells.
FEBS Lett
2000
;
476
:
73
–7.
4
Kunkel TA, Erie DA. DNA mismatch repair.
Annu Rev Biochem
2005
;
74
:
681
–710.
5
Friedberg EC. How nucleotide excision repair protects against cancer.
Nat Rev Cancer
2001
;
1
:
22
–33.
6
Sekiguchi M, Nakabeppu Y, Sakumi K, Tuzuki T. DNA-repair methyltransferase as a molecular device for preventing mutation and cancer.
J Cancer Res Clin Oncol
1996
;
122
:
199
–206.
7
Malanga M, Althaus FR. The role of poly(ADP-ribose) in the DNA damage signaling network.
Biochem Cell Biol
2005
;
83
:
354
–64.
8
Weterings E, van Gent DC. The mechanism of non-homologous end-joining: a synopsis of synapsis.
DNA Repair (Amst)
2004
;
3
:
1425
–35.
9
Johnson RD, Jasin M. Double-strand-break-induced homologous recombination in mammalian cells.
Biochem Soc Trans
2001
;
29
:
196
–201.
10
Abraham RT. Cell cycle checkpoint signaling through the ATM and ATR kinases.
Genes Dev
2001
;
15
:
2177
–96.
11
Norbury CJ, Zhivotovsky B. DNA damage-induced apoptosis.
Oncogene
2004
;
23
:
2797
–808.
12
Friedberg EC, Bond JP, Burns DK, et al. Defective nucleotide excision repair in xpc mutant mice and its association with cancer predisposition.
Mutat Res
2000
;
459
:
99
–108.
13
Tsutsumi M, Masutani M, Nozaki T, et al. Increased susceptibility of poly(ADP-ribose) polymerase-1 knockout mice to nitrosamine carcinogenicity.
Carcinogenesis
2001
;
22
:
1
–3.
14
Iwakuma T, Sakumi K, Nakatsuru Y, et al. High incidence of nitrosamine-induced tumorigenesis in mice lacking DNA repair methyltransferase.
Carcinogenesis
1997
;
18
:
1631
–5.
15
Sakumi K, Tominaga Y, Furuichi M, et al. Ogg1 knockout-associated lung tumorigenesis and its suppression by Mth1 gene disruption.
Cancer Res
2003
;
63
:
902
–5.
16
Zhang X, Miao X, Liang G, et al. Polymorphisms in DNA base excision repair genes ADPRT and XRCC1 and risk of lung cancer.
Cancer Res
2005
;
65
:
722
–6.
17
Wu X, Zhao H, Amos CI, et al. p53 genotypes and haplotypes associated with lung cancer susceptibility and ethnicity.
J Natl Cancer Inst
2002
;
94
:
681
–90.
18
Sugimura H, Kohno T, Wakai K, et al. hOGG1 Ser326Cys polymorphism and lung cancer susceptibility.
Cancer Epidemiol Biomarkers Prev
1999
;
8
:
669
–74.
19
Spitz MR, Wei Q, Dong Q, Amos CI, Wu X. Genetic susceptibility to lung cancer: the role of DNA damage and repair.
Cancer Epidemiol Biomarkers Prev
2003
;
12
:
689
–98.
20
Le Marchand L, Donlon T, Lum-Jones A, Seifried A, Wilkens LR. Association of the hOGG1 Ser326Cys polymorphism with lung cancer risk.
Cancer Epidemiol Biomarkers Prev
2002
;
11
:
409
–12.
21
Shen M, Berndt SI, Rothman N, et al. Polymorphisms in the DNA nucleotide excision repair genes and lung cancer risk in Xuan Wei, China.
Int J Cancer
2005
;
116
:
768
–73.
22
Cote ML, Kardia SL, Wenzlaff AS, Ruckdeschel JC, Schwartz AG. Risk of lung cancer among white and black relatives of individuals with early-onset lung cancer.
JAMA
2005
;
293
:
3036
–42.
23
Li X, Hemminki K. Inherited predisposition to early onset lung cancer according to histological type.
Int J Cancer
2004
;
112
:
451
–7.
24
Skuladottir H, Autrup H, Autrup J, et al. Polymorphisms in genes involved in xenobiotic metabolism and lung cancer risk under the age of 60 years. A pooled study of lung cancer patients in Denmark and Norway.
Lung Cancer
2005
;
48
:
187
–99.
25
Cote ML, Kardia SL, Wenzlaff AS, Land SJ, Schwartz AG. Combinations of glutathione S-transferase genotypes and risk of early-onset lung cancer in Caucasians and African Americans: a population-based study.
Carcinogenesis
2005
;
26
:
811
–9.
26
Yang P, Bamlet WR, Ebbert JO, Taylor WR, de Andrade M. Glutathione pathway genes and lung cancer risk in young and old populations.
Carcinogenesis
2004
;
25
:
1935
–44.
27
Wang YC, Lu YP, Tseng RC, et al. Inactivation of hMLH1 and hMSH2 by promoter methylation in primary non-small cell lung tumors and matched sputum samples.
J Clin Invest
2003
;
111
:
887
–95.
28
Matullo G, Palli D, Peluso M, et al. XRCC1, XRCC3, XPD gene polymorphisms, smoking, and (32)P-DNA adducts in a sample of healthy subjects.
Carcinogenesis
2001
;
22
:
1437
–45.
29
Lunn RM, Langlois RG, Hsieh LL, Thompson CL, Bell DA. XRCC1 polymorphisms: effects on aflatoxin B1-DNA adducts and glycophorin A variant frequency.
Cancer Res
1999
;
59
:
2557
–61.
30
Hu Z, Ma H, Chen F, Wei Q, Shen H. XRCC1 polymorphisms and cancer risk: a meta-analysis of 38 case-control studies.
Cancer Epidemiol Biomarkers Prev
2005
;
14
:
1810
–8.
31
Gemignani F, Landi S, Vivant F, Zienolddiny S, Brennan P, Canzian F. A catalogue of polymorphisms related to xenobiotic metabolism and cancer susceptibility.
Pharmacogenetics
2002
;
12
:
459
–63.
32
Guo Z, Guilfoyle RA, Thiel AJ, Wang R, Smith LM. Direct fluorescence analysis of genetic polymorphisms by hybridization with oligonucleotide arrays on glass supports.
Nucleic Acids Res
1994
;
22
:
5456
–65.
33
Landi S, Gemignani F, Gioia-Patricola L, Chabrier A, Canzian F. Evaluation of a microarray for genotyping polymorphisms related to xenobiotic metabolism and DNA repair.
Biotechniques
2003
;
35
:
816
–27.
34
Dos Santos Silva I. Size of a study. In: Cancer epidemiology: principles and methods. 2nd ed. Barcelona (Spain): International Agency for the Research on Cancer; 1999. p. 333–53.
35
Wacholder S, Chanock S, Garcia-Closas M, El Ghormli L, Rothman N. Assessing the probability that a positive report is false: an approach for molecular epidemiology studies.
J Natl Cancer Inst
2004
;
96
:
434
–42.
36
Wigginton JE, Cutler DJ, Abecasis GR. A note on exact tests of Hardy-Weinberg equilibrium.
Am J Hum Genet
2005
;
76
:
887
–93.
37
Aarnio M, Sankila R, Pukkala E, et al. Cancer risk in mutation carriers of DNA-mismatch-repair genes.
Int J Cancer
1999
;
81
:
214
–8.
38
Ponz de Leon M, Benatti P, Pedroni M, Sassatelli R, Roncucci L. Risk of cancer revealed by follow-up of families with hereditary non-polyposis colorectal cancer: a population-based study.
Int J Cancer
1993
;
55
:
202
–7.
39
Chang JW, Chen YC, Chen CY, Chen JT, Chen SK, Wang YC. Correlation of genetic instability with mismatch repair protein expression and p53 mutations in non-small cell lung cancer.
Clin Cancer Res
2000
;
6
:
1639
–46.
40
Xinarianos G, Liloglou T, Prime W, et al. hMLH1 and hMSH2 expression correlates with allelic imbalance on chromosome 3p in non-small cell lung carcinomas.
Cancer Res
2000
;
60
:
4216
–21.
41
Benachenhou N, Guiral S, Gorska-Flipot I, Labuda D, Sinnett D. High resolution deletion mapping reveals frequent allelic losses at the DNA mismatch repair loci hMLH1 and hMSH3 in non-small cell lung cancer.
Int J Cancer
1998
;
77
:
173
–80.
42
Park SH, Lee GY, Jeon HS, et al. −93G→A polymorphism of hMLH1 and risk of primary lung cancer.
Int J Cancer
2004
;
112
:
678
–82.
43
Takahashi Y, Kondo K, Hirose T, et al. Microsatellite instability and protein expression of the DNA mismatch repair gene, hMLH1, of lung cancer in chromate-exposed workers.
Mol Carcinog
2005
;
42
:
150
–8.
44
Xie Y, Yang H, Cunanan C, et al. Deficiencies in mouse Myh and Ogg1 result in tumor predisposition and G to T mutations in codon 12 of the K-ras oncogene in lung tumors.
Cancer Res
2004
;
64
:
3096
–102.
45
Angele S, Romestaing P, Moullan N, et al. ATM haplotypes and cellular response to DNA damage: association with breast cancer risk and clinical radiosensitivity.
Cancer Res
2003
;
63
:
8717
–25.
46
Boffetta P, Mannetje A, Zaridze D, et al. Occupational X-ray examinations and lung cancer risk.
Int J Cancer
2005
;
115
:
263
–7.
47
Morimoto LM, White E, Newcomb PA. Selection bias in the assessment of gene-environment interaction in case-control studies.
Am J Epidemiol
2003
;
158
:
259
–63.
48
Hung RJ, Brennan P, Canzian F, et al. Large-scale investigation of base excision repair genetic polymorphisms and lung cancer risk in a multicenter study.
J Natl Cancer Inst
2005
;
97
:
567
–76.