Abstract
Objectives: Nucleotide excision repair enzymes remove bulky damage caused by environmental agents, including carcinogenic polycyclic aromatic hydrocarbons found in cigarette smoke, a risk factor for colorectal adenoma. Among participants randomized to the screening arm of the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial, we studied the risk of advanced colorectal adenoma in relation to cigarette smoking and selected single nucleotide polymorphisms (SNP) in the nucleotide excision repair pathway.
Methods: Cases (n = 772) were subjects with left-sided advanced adenoma (>1 cm in size, high-grade dysplasia, or villous characteristics). Controls (n = 777) were screen-negative for left-sided polyps by sigmoidoscopy. DNA was extracted from blood samples and 15 common nonsynonymous SNPs in seven-nucleotide excision repair genes [XPC, RAD23B (hHR23B), CSB (ERCC6), XPD (ERCC2), CCNH, XPF (ERCC4), and XPG (ERCC5)] were genotyped.
Results: None of the studied SNPs were independently associated with advanced adenoma risk. Smoking was related to adenoma risk and XPC polymorphisms (R492H, A499V, K939Q) modified these effects (Pinteraction from 0.03-0.003). Although the three XPC variants were in linkage disequilibrium, a multivariate logistic regression tended to show independent protective effects for XPC 499V (Ptrend = 0.06), a finding supported by haplotype analysis (covariate-adjusted global permutation P = 0.03).
Conclusions: Examining a spectrum of polymorphic variants in nucleotide excision repair genes, we found evidence that smoking-associated risks for advanced colorectal adenoma are modified by polymorphisms in XPC, particularly haplotypes containing XPC 499V. (Cancer Epidemiol Biomarkers Prev 2006;15(2):306–11)
Introduction
The recent discovery of a novel autosomal recessive form of familial adenomatous polyposis caused by mutations in the base excision repair gene MYH (1) and the well-documented involvement of mismatch repair in hereditary nonpolyposis colorectal cancer suggest the importance of DNA repair mechanisms in colorectal carcinogenesis (2). Based on epidemiologic, molecular, and clinical evidence, colorectal adenomas, particularly those large in size or histologically advanced, are recognized precursor lesions for colorectal cancer (3).
Cigarette smoking is a risk factor for colorectal adenoma and likely also for colon cancer (4). Tobacco products, such as polycyclic aromatic hydrocarbons, heterocyclic amines, nitrosamines, and aromatic amines, reaching the colorectal mucosa, through direct ingestion or via the circulatory system, result in DNA adducts, leading to mutations and potentially to cancer development (5). There is considerable interindividual variation in the response to genotoxic exposures, such as polycyclic aromatic hydrocarbons, and genetic polymorphisms in the DNA repair genes could influence DNA repair capacity and, in turn, cancer susceptibility (6).
The nucleotide excision repair pathway may have particular importance for smoking-related adenoma risk because this system removes complex bulky adducts, including those caused by polycyclic aromatic hydrocarbons in tobacco smoke (7). Excision repair involves global genome repair and transcription coupled repair, involving DNA damage recognition, DNA unwinding, excision of the damage, synthesis of new DNA, gap repair, and ligation (Fig. 1; ref. 8). Damage recognition is carried out either by the XPC-hHR23B (encoded by RAD23B) protein complex in the global genome repair pathway or by RNA polymerase II, which recruits CSA and CSB (ERCC6) in transcription coupled repair. In the second step, DNA is unwound by the action of TFIIH, a nine-subunit protein complex composed of DNA helicases, XPB and XPD, and a kinase subunit cyclin H (CCNH). Replication protein A (RPA) stabilizes the unwound DNA, and XPA facilitates the assembly of DNA repair factors. The damaged DNA is then removed by XPG (ERCC5) and the XPF (ERCC4)-ERCC1 complex, which make 3′ and 5′ incisions, respectively, on either side of the damage. Finally, with the help of several replication accessory factors, the DNA gap is filled with new bases synthesized by DNA polymerases δ and ε and a DNA ligase seals the nick.
Several polymorphisms in nucleotide excision repair genes were found to alter DNA repair, including variants in XPC (9-11) and XPD (ERCC2; refs. 10, 12), although not all studies have been consistent (13-15). Despite the potential role of nucleotide excision repair in colorectal carcinogenesis, few epidemiologic studies of colorectal tumors have evaluated the associations with genetic polymorphisms in this pathway. Studies of colorectal cancer in Taiwan (16) and the United Kingdom (17) found no overall effects for selected polymorphisms in XPD, XPF (ERCC4), XPG (ERCC5), and ERCC1. Genotype effects among smoking subgroups, however, have not been evaluated, and no data were available for colorectal adenoma.
We studied the risk for advanced colorectal adenoma in relation to 15 common single nucleotide polymorphisms (SNPs) in seven genes in the nucleotide excision repair pathway—XPC, RAD23B (hHR23B), CSB (ERCC6), XPD, CCNH, XPF, and XPG—among participants in a large colorectal cancer screening trial in the United States. All SNPs are nonsynonymous, leading to amino acid substitution in the protein product.
Materials and Methods
The Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial
The National Cancer Institute Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial, conducted at 10 U.S. screening centers (Birmingham, Alabama; Denver, Colorado; Detroit, Michigan; Honolulu, Hawaii; Marshfield, Wisconsin; Minneapolis, Minnesota; Pittsburgh, Pennsylvania; Salt Lake City, Utah; St. Louis, Missouri; and Washington District of Columbia), is designed to determine whether screening with flexible sigmoidoscopy (60 cm sigmoidoscope), chest X-ray, digital rectal exam plus serum prostate-specific antigen, and CA125 plus transvaginal ultrasound can reduce mortality from cancers of the prostate, lung, colorectum, and ovary, respectively (18, 19). Primary recruitment methods involved mailing informational brochures and letters of invitation to age-eligible persons identified on public, commercial, or screening center-specific mailing lists. To be eligible for the trial, subjects must have been 55 and 74 years of age; not undergoing treatment for cancer (excluding nonmelanoma skin) and with no history of cancer at any of the sites involved in the trial; no history of surgical removal of the prostate, colon, or one lung; no current participation in another cancer screening or primary prevention trial; no use of Finasteride in the previous 6 months; no prostate-specific antigen test in the previous 3 years; no colonoscopy, sigmoidoscopy, or barium enema in the previous 3 years; and the ability to sign the consent form. A total of 77,483 participants (50% men; 87% White) were randomized to the screening arm and a similar number of participants were randomized to the nonscreened arm from 1993 to 2001.
At the baseline, subjects in the screening arm received a flexible sigmoidoscopic visualization of the distal colon (60 cm). If the sigmoidoscopic examination was suspicious for neoplasia (polyp or mass), participants were referred to their primary physicians for subsequent diagnostic follow-up, which generally resulted in a full examination of the colon by colonoscopy and surgery, if needed. Pathology reports regarding the pathologic size and location of each lesion found from colonoscopy or repeat sigmoidoscopy were obtained and coded by trained medical abstractors. Participants provided written informed consent. The study was approved by the institutional review boards of the National Cancer Institute and the 10 screening centers.
Study Population
As described previously (20, 21), a nested case-control study was conducted among participants randomized to the screening arm of the PLCO Trial, who had a successful sigmoidoscopy (insertion to at least 50 cm with >90% of mucosa visible or a suspect lesion identified), completed a risk factor questionnaire, provided a blood sample, and consented to participate in etiologic studies (n = 42,037) between September 1993 and September 1999. Of these participants, we excluded 4,834 with a self-reported history of ulcerative colitis, Crohn's disease, familial polyposis, colorectal polyps, Gardner's syndrome, or cancer (except basal cell skin cancer). We randomly selected 772 of 1,234 cases with at least one advanced colorectal adenoma (adenoma ≥1 cm or containing high-grade dysplasia or villous characteristics) in the distal colon (descending colon and sigmoid or rectum) and 777 controls without evidence of a polyp or other suspicious lesion in the distal colon (i.e., a negative sigmoidoscopy screening) at baseline, frequency-matched to the cases by gender and ethnicity.
Questionnaire Data
Participants completed a baseline risk factor questionnaire in which they reported their body size and the use of tobacco, alcohol, selected drugs, and hormones, and a 137-item food frequency questionnaire in which they reported their usual dietary intake over the 12 months before enrollment. Detailed information on smoking history was collected, including ages started and stopped, total years of use, amount usually used, and type of tobacco used (cigarettes, pipes, and cigars). Subjects who did not smoke or only smoked cigarettes for <6 months or only smoked pipes or cigars for <1 year were considered to be nonsmokers. For evaluation of risks in relation to time period of cigarette use, cigarette users were classified as long-term quitters (quit ≥10 years before enrollment) and current or recent smokers (quit <10 year before enrollment).
Genotype Assay
DNA samples were obtained from stored blood samples using Qiagen standard protocols (QIAamp DNA Blood Midi or Maxi kit: (http://www1.qiagen.com/default.aspx). Fifteen SNPs in seven DNA repair genes, XPC (R492H, A499V, K939Q), RAD23B (hHR23B, A249V), CSB (ERCC6, M1097V, R1230P, Q1413R), XPD (ERCC2, D312N, K751Q), CCNH (V270A), XPF (ERCC4, R379S, R415Q), and XPG (ERCC5, M254V, C529S, D1104H) were genotyped. All assays were validated and optimized at the National Cancer Institute Core Genotyping Facility as described on the website (http://snp500cancer.nci.nih.gov; ref. 22). Internal laboratory quality controls consisted of Coriell DNA samples containing the homozygous major allele, heterozygous, and homozygous minor allele genotypes for each polymorphism under study with four of each control type and four no template controls in every 384 samples. For external blinded quality controls, we interspersed ∼10% repeated quality control samples from 40 individuals. The blinded samples showed >99% interassay concordance for all assays. Approximately 8% of individuals were excluded because of insufficient DNA (7%) or fingerprint profile review showing subject-specific ambiguities (1%). Of those remaining, genotype data were successfully obtained for an average of 94% (90-98%) of study subjects.
Statistical Analysis
Departures from Hardy-Weinberg equilibrium among controls for each self-described ethnic group were assessed by comparing the observed genotype frequencies to the expected frequencies using a χ2 test or exact test if cell counts were small. Odds ratios (OR) and 95% confidence intervals (95% CI) for the genotype associations were obtained using unconditional logistic regression, adjusting for gender, race (White, Black, and others), and age (55-59, 60-64, 65-69, and 70-74 years). Trend tests were done using a score of 0, 1, or 2 for each genotype in a logistic regression model. Statistical significance of the multiplicative interaction between DNA repair genotype and smoking was assessed by comparing nested models with and without cross-product terms (assuming linearity for smoking-related risk) using a likelihood ratio statistic. Adenoma risks associated with haplotypes defined by the SNPs within the same gene were assessed using HaploStats (http://www.mayo.edu/hsr/people/schaid.html), which uses the expectation-maximization algorithm to estimate haplotypes. A global score test was used to evaluate overall differences in haplotype frequencies between cases and controls adjusted for covariates (23), and the effect of individual haplotypes on adenoma risk was assessed using the generalized linear model implemented in HaploStats (23, 24). Multiplicative interaction between XPC haplotypes and smoking was assessed by comparing nested models, with and without cross-product terms (assuming linearity for smoking-related risk), using a likelihood ratio statistic.
Results
Cases and controls were similar with regard to the matching factors, gender and race (Table 1). Cases tended to have a higher body mass index (P = 0.3), were older (P < 0.0001), more likely to have reported a first-degree family history of colorectal cancer (P = 0.02), and had less education (P = 0.001). Consistent with results from the entire cohort (25), cigarette smoking was associated with increased risk for advanced adenoma (ever versus never, OR, 1.4; 95% CI, 1.2-1.8) with greater risks among current or recent cigarette smokers (OR, 2.4; 95% CI, 1.8-3.2) than long-term quitters (OR, 1.1; 95% CI, 0.9-1.4 for quit ≥10 years). Risk relationships for level (OR, 1.7; 95% CI, 1.3-2.2 for >20 cigarettes/d) and pack-years of use (OR, 1.8; 95% CI, 1.3-2.3 for ≥30 pack-years) were weaker, showing the importance of recent exposure in relation to adenoma risk.
. | Case (n* = 772), n (%) . | Control (n* = 777), n (%) . | P, χ2 . | |||
---|---|---|---|---|---|---|
Gender | ||||||
Male | 535 (69.3) | 536 (69.0) | 0.9 | |||
Female | 237 (30.7) | 241 (31.0) | ||||
Race | ||||||
White | 725 (93.9) | 729 (93.8) | 1.0 | |||
Black | 22 (2.9) | 23 (3.0) | ||||
Other† | 25 (3.2) | 25 (3.2) | ||||
Age (y) | ||||||
55-59 | 257 (33.3) | 363 (46.7) | <0.0001 | |||
60-64 | 244 (31.6) | 200 (25.7) | ||||
65-69 | 172 (22.3) | 140 (18.0) | ||||
70-74 | 99 (12.8) | 74 (9.5) | ||||
First-degree family history of colorectal cancer | ||||||
Yes | 97 (12.6) | 70 (9.0) | 0.02 | |||
No | 675 (87.4) | 707 (91.0) | ||||
Education | ||||||
<12 y | 72 (9.3) | 50 (6.4) | 0.001 | |||
12 y/high school equivalent | 191 (24.7) | 176 (22.7) | ||||
Some college | 276 (35.8) | 247 (31.8) | ||||
College and above | 232 (30.1) | 303 (39.0) | ||||
Body mass index at interview | ||||||
<18.5 | 5 (0.7) | 2 (0.3) | 0.3 | |||
≥18.5-<25 | 200 (25.9) | 219 (28.2) | ||||
≥25-<30 | 349 (45.2) | 357 (46.0) | ||||
≥30 | 215 (27.9) | 188 (24.2) | ||||
Pathologic characteristics of adenoma | ||||||
Size (cm) | ||||||
<1 | 133 (17.2) | |||||
≥1 | 571 (74.0) | |||||
Unknown | 68 (8.8) | |||||
Multiplicity | ||||||
Single | 527 (68.3) | |||||
Multiple | 245 (31.7) | |||||
Advanced histology‡ | ||||||
No | 283 (36.7) | |||||
Yes | 489 (63.3) |
. | Case (n* = 772), n (%) . | Control (n* = 777), n (%) . | P, χ2 . | |||
---|---|---|---|---|---|---|
Gender | ||||||
Male | 535 (69.3) | 536 (69.0) | 0.9 | |||
Female | 237 (30.7) | 241 (31.0) | ||||
Race | ||||||
White | 725 (93.9) | 729 (93.8) | 1.0 | |||
Black | 22 (2.9) | 23 (3.0) | ||||
Other† | 25 (3.2) | 25 (3.2) | ||||
Age (y) | ||||||
55-59 | 257 (33.3) | 363 (46.7) | <0.0001 | |||
60-64 | 244 (31.6) | 200 (25.7) | ||||
65-69 | 172 (22.3) | 140 (18.0) | ||||
70-74 | 99 (12.8) | 74 (9.5) | ||||
First-degree family history of colorectal cancer | ||||||
Yes | 97 (12.6) | 70 (9.0) | 0.02 | |||
No | 675 (87.4) | 707 (91.0) | ||||
Education | ||||||
<12 y | 72 (9.3) | 50 (6.4) | 0.001 | |||
12 y/high school equivalent | 191 (24.7) | 176 (22.7) | ||||
Some college | 276 (35.8) | 247 (31.8) | ||||
College and above | 232 (30.1) | 303 (39.0) | ||||
Body mass index at interview | ||||||
<18.5 | 5 (0.7) | 2 (0.3) | 0.3 | |||
≥18.5-<25 | 200 (25.9) | 219 (28.2) | ||||
≥25-<30 | 349 (45.2) | 357 (46.0) | ||||
≥30 | 215 (27.9) | 188 (24.2) | ||||
Pathologic characteristics of adenoma | ||||||
Size (cm) | ||||||
<1 | 133 (17.2) | |||||
≥1 | 571 (74.0) | |||||
Unknown | 68 (8.8) | |||||
Multiplicity | ||||||
Single | 527 (68.3) | |||||
Multiple | 245 (31.7) | |||||
Advanced histology‡ | ||||||
No | 283 (36.7) | |||||
Yes | 489 (63.3) |
Numbers do not add up to the total because of missing values.
Hispanic (0.9%), Asian (1.7%), Pacific islander (0.4%), and American Indian native (0.3%).
High-grade dysplasia or villous elements.
Genotype distributions among controls were consistent with Hardy-Weinberg equilibrium (P > 0.05 for all SNPs). None of the studied polymorphisms were independently associated with risk of advanced adenoma (Table 2). After considering smoking status (in terms of nonsmoker, long-term quitter, and current or recent smoker), all three XPC SNPs were shown to modify smoking-related risks (Pinteraction ranging from 0.003 to 0.04; Table 3).
Genotype* . | Case (n = 772) . | Control (n = 777) . | OR (95% CI) . | |||
---|---|---|---|---|---|---|
XPC | ||||||
R492H (Nucleotide G to A) | ||||||
RR | 623 | 624 | 1.0 | |||
RH | 66 | 79 | 0.8 (0.6-1.2) | |||
HH | 1 | 2 | 0.5 (0.01-9.6)† | |||
Ptrend | 0.3 | |||||
A499V (Nucleotide C to T) | ||||||
AA | 397 | 403 | 1.0 | |||
AV | 261 | 259 | 1.0 (0.8-1.3) | |||
VV | 31 | 41 | 0.8 (0.5-1.2) | |||
Ptrend | 0.6 | |||||
K939Q (Nucleotide A to C) | ||||||
KK | 253 | 241 | 1.0 | |||
KQ | 300 | 312 | 0.9 (0.7-1.2) | |||
112 | 114 | 0.9 (0.7-1.2) | ||||
Ptrend | 0.5 | |||||
RAD23B (hHR23B) | ||||||
A249V (Nucleotide C to T) | ||||||
AA | 438 | 453 | 1.0 | |||
AV | 223 | 218 | 1.0 (0.8-1.3) | |||
VV | 24 | 33 | 0.8 (0.5-1.4) | |||
Ptrend | 0.8 | |||||
CSB (ERCC6) | ||||||
M1097V (Nucleotide A to G) | ||||||
MM | 405 | 410 | 1.0 | |||
MV | 211 | 224 | 0.9 (0.7-1.2) | |||
VV | 30 | 23 | 1.2 (0.7-2.2) | |||
Ptrend | 0.9 | |||||
R1230P (Nucleotide G to C) | ||||||
RR | 524 | 533 | 1.0 | |||
RP | 119 | 126 | 1.0 (0.8-1.3) | |||
PP | 10 | 4 | 2.5 (0.7-11.2)† | |||
Ptrend | 0.4 | |||||
Q1413R (Nucleotide A to G) | ||||||
422 | 423 | 1.0 | ||||
QR | 177 | 199 | 0.9 (0.7-1.1) | |||
RR | 30 | 24 | 1.2 (0.7-2.1) | |||
Ptrend | 0.6 | |||||
XPD (ERCC2) | ||||||
D312N (Nucleotide G to A) | ||||||
DD | 301 | 301 | 1.0 | |||
DN | 300 | 304 | 1.0 (0.8-1.3) | |||
NN | 82 | 93 | 0.9 (0.6-1.2) | |||
Ptrend | 0.5 | |||||
K751Q (Nucleotide A to C) | ||||||
KK | 300 | 315 | 1.0 | |||
KQ | 348 | 332 | 1.1 (0.9-1.4) | |||
95 | 112 | 0.9 (0.7-1.2) | ||||
Ptrend | 0.9 | |||||
CCNH | ||||||
V270A (Nucleotide T to C) | ||||||
VV | 416 | 441 | 1.0 | |||
VA | 241 | 222 | 1.2 (0.9-1.5) | |||
AA | 32 | 28 | 1.3 (0.8-2.2) | |||
Ptrend | 0.1 | |||||
XPF (ERCC4) | ||||||
P379S (Nucleotide C to T) | ||||||
PP | 686 | 696 | 1.0 | |||
PS | 5 | 6 | 0.8 (0.2-3.3)† | |||
SS | — | — | — | |||
P | 0.8 | |||||
R415Q (Nucleotide G to A) | ||||||
RR | 624 | 623 | 1.0 | |||
RQ | 78 | 86 | 0.9 (0.6-1.2) | |||
1 | 7 | 0.1 (0.003-1.1)† | ||||
Ptrend | 0.1 | |||||
XPG (ERCC5) | ||||||
M254V (Nucleotide A to G) | ||||||
MM | 633 | 650 | 1.0 | |||
MV | 36 | 36 | 1.0 (0.6-1.6) | |||
VV | 1 | 1 | 1.0 (0.01-80.7)† | |||
Ptrend | 1.0 | |||||
C529S (Nucleotide G to C) | ||||||
CC | 598 | 601 | 1.0 | |||
CS | 52 | 60 | 0.9 (0.6-1.4) | |||
SS | 1 | 1 | 1.0 (0.01-79.0)† | |||
Ptrend | 0.7 | |||||
D1104H (Nucleotide G to C) | ||||||
DD | 407 | 403 | 1.0 | |||
DH | 243 | 265 | 0.9 (0.7-1.1) | |||
HH | 29 | 29 | 1.0 (0.6-1.7) | |||
Ptrend | 0.5 |
Genotype* . | Case (n = 772) . | Control (n = 777) . | OR (95% CI) . | |||
---|---|---|---|---|---|---|
XPC | ||||||
R492H (Nucleotide G to A) | ||||||
RR | 623 | 624 | 1.0 | |||
RH | 66 | 79 | 0.8 (0.6-1.2) | |||
HH | 1 | 2 | 0.5 (0.01-9.6)† | |||
Ptrend | 0.3 | |||||
A499V (Nucleotide C to T) | ||||||
AA | 397 | 403 | 1.0 | |||
AV | 261 | 259 | 1.0 (0.8-1.3) | |||
VV | 31 | 41 | 0.8 (0.5-1.2) | |||
Ptrend | 0.6 | |||||
K939Q (Nucleotide A to C) | ||||||
KK | 253 | 241 | 1.0 | |||
KQ | 300 | 312 | 0.9 (0.7-1.2) | |||
112 | 114 | 0.9 (0.7-1.2) | ||||
Ptrend | 0.5 | |||||
RAD23B (hHR23B) | ||||||
A249V (Nucleotide C to T) | ||||||
AA | 438 | 453 | 1.0 | |||
AV | 223 | 218 | 1.0 (0.8-1.3) | |||
VV | 24 | 33 | 0.8 (0.5-1.4) | |||
Ptrend | 0.8 | |||||
CSB (ERCC6) | ||||||
M1097V (Nucleotide A to G) | ||||||
MM | 405 | 410 | 1.0 | |||
MV | 211 | 224 | 0.9 (0.7-1.2) | |||
VV | 30 | 23 | 1.2 (0.7-2.2) | |||
Ptrend | 0.9 | |||||
R1230P (Nucleotide G to C) | ||||||
RR | 524 | 533 | 1.0 | |||
RP | 119 | 126 | 1.0 (0.8-1.3) | |||
PP | 10 | 4 | 2.5 (0.7-11.2)† | |||
Ptrend | 0.4 | |||||
Q1413R (Nucleotide A to G) | ||||||
422 | 423 | 1.0 | ||||
QR | 177 | 199 | 0.9 (0.7-1.1) | |||
RR | 30 | 24 | 1.2 (0.7-2.1) | |||
Ptrend | 0.6 | |||||
XPD (ERCC2) | ||||||
D312N (Nucleotide G to A) | ||||||
DD | 301 | 301 | 1.0 | |||
DN | 300 | 304 | 1.0 (0.8-1.3) | |||
NN | 82 | 93 | 0.9 (0.6-1.2) | |||
Ptrend | 0.5 | |||||
K751Q (Nucleotide A to C) | ||||||
KK | 300 | 315 | 1.0 | |||
KQ | 348 | 332 | 1.1 (0.9-1.4) | |||
95 | 112 | 0.9 (0.7-1.2) | ||||
Ptrend | 0.9 | |||||
CCNH | ||||||
V270A (Nucleotide T to C) | ||||||
VV | 416 | 441 | 1.0 | |||
VA | 241 | 222 | 1.2 (0.9-1.5) | |||
AA | 32 | 28 | 1.3 (0.8-2.2) | |||
Ptrend | 0.1 | |||||
XPF (ERCC4) | ||||||
P379S (Nucleotide C to T) | ||||||
PP | 686 | 696 | 1.0 | |||
PS | 5 | 6 | 0.8 (0.2-3.3)† | |||
SS | — | — | — | |||
P | 0.8 | |||||
R415Q (Nucleotide G to A) | ||||||
RR | 624 | 623 | 1.0 | |||
RQ | 78 | 86 | 0.9 (0.6-1.2) | |||
1 | 7 | 0.1 (0.003-1.1)† | ||||
Ptrend | 0.1 | |||||
XPG (ERCC5) | ||||||
M254V (Nucleotide A to G) | ||||||
MM | 633 | 650 | 1.0 | |||
MV | 36 | 36 | 1.0 (0.6-1.6) | |||
VV | 1 | 1 | 1.0 (0.01-80.7)† | |||
Ptrend | 1.0 | |||||
C529S (Nucleotide G to C) | ||||||
CC | 598 | 601 | 1.0 | |||
CS | 52 | 60 | 0.9 (0.6-1.4) | |||
SS | 1 | 1 | 1.0 (0.01-79.0)† | |||
Ptrend | 0.7 | |||||
D1104H (Nucleotide G to C) | ||||||
DD | 407 | 403 | 1.0 | |||
DH | 243 | 265 | 0.9 (0.7-1.1) | |||
HH | 29 | 29 | 1.0 (0.6-1.7) | |||
Ptrend | 0.5 |
NOTE: OR and 95% CI values were calculated by unconditional logistic regression adjusted for gender, race, and age.
Allele change indicated by amino acid change. Nucleotide change in parentheses.
Exact estimate and 95% CI.
XPC genotype . | Never smokers . | Former smokers (quit ≥ 10 y) . | Current or recent smokers (quit < 10 y) . | Stratum-specific Ptrend . | ||||
---|---|---|---|---|---|---|---|---|
. | OR (95% CI) . | OR (95% CI) . | OR (95% CI) . | . | ||||
. | n: case, control . | n: case, control . | n: case, control . | . | ||||
R492H | ||||||||
RR | 1.0 | 1.1 (0.9-1.4) | 2.6 (1.9-3.6) | <0.0001 | ||||
197,254 | 218,246 | 172,89 | ||||||
RH + HH | 1.6 (0.9-2.8) | 0.8 (0.5-1.4) | 1.0 (0.4-2.3) | 0.2 | ||||
29,24 | 25,38 | 11,14 | ||||||
Stratum-specific P | 0.1 | 0.3 | 0.02 | Pinteraction = 0.003 | ||||
A499V | ||||||||
AA | 1.0 | 1.1 (0.8-1.5) | 3.0 (2.0-4.5) | <0.0001 | ||||
122,165 | 140,164 | 118,55 | ||||||
AV + VV | 1.1 (0.8-1.6) | 1.1 (0.8-1.6) | 1.9 (1.3-3.0) | 0.07 | ||||
103,116 | 103,119 | 65,46 | ||||||
Stratum-specific P | 0.3 | 0.9 | 0.06 | Pinteraction = 0.04 | ||||
K939Q | ||||||||
KK | 1.0 | 0.8 (0.5-1.2) | 1.5 (0.9-2.5) | 0.4 | ||||
90,85 | 84,102 | 56,38 | ||||||
KQ + QQ | 0.7 (0.5-1.0) | 0.9 (0.6-1.2) | 2.0 (1.3-3.0) | <0.0001 | ||||
131,183 | 148,161 | 120,59 | ||||||
Stratum-specific P | 0.05 | 0.7 | 0.3 | Pinteraction = 0.03 |
XPC genotype . | Never smokers . | Former smokers (quit ≥ 10 y) . | Current or recent smokers (quit < 10 y) . | Stratum-specific Ptrend . | ||||
---|---|---|---|---|---|---|---|---|
. | OR (95% CI) . | OR (95% CI) . | OR (95% CI) . | . | ||||
. | n: case, control . | n: case, control . | n: case, control . | . | ||||
R492H | ||||||||
RR | 1.0 | 1.1 (0.9-1.4) | 2.6 (1.9-3.6) | <0.0001 | ||||
197,254 | 218,246 | 172,89 | ||||||
RH + HH | 1.6 (0.9-2.8) | 0.8 (0.5-1.4) | 1.0 (0.4-2.3) | 0.2 | ||||
29,24 | 25,38 | 11,14 | ||||||
Stratum-specific P | 0.1 | 0.3 | 0.02 | Pinteraction = 0.003 | ||||
A499V | ||||||||
AA | 1.0 | 1.1 (0.8-1.5) | 3.0 (2.0-4.5) | <0.0001 | ||||
122,165 | 140,164 | 118,55 | ||||||
AV + VV | 1.1 (0.8-1.6) | 1.1 (0.8-1.6) | 1.9 (1.3-3.0) | 0.07 | ||||
103,116 | 103,119 | 65,46 | ||||||
Stratum-specific P | 0.3 | 0.9 | 0.06 | Pinteraction = 0.04 | ||||
K939Q | ||||||||
KK | 1.0 | 0.8 (0.5-1.2) | 1.5 (0.9-2.5) | 0.4 | ||||
90,85 | 84,102 | 56,38 | ||||||
KQ + QQ | 0.7 (0.5-1.0) | 0.9 (0.6-1.2) | 2.0 (1.3-3.0) | <0.0001 | ||||
131,183 | 148,161 | 120,59 | ||||||
Stratum-specific P | 0.05 | 0.7 | 0.3 | Pinteraction = 0.03 |
NOTE: OR and 95% CI values were calculated by unconditional logistic regression adjusted for gender, race, and age. Subjects who never used cigarettes but used pipes or cigars were excluded.
Trends for increased risks associated with current or recent smoking were statistically significant only for carriers of XPC 492RR, XPC 499AA, and XPC 939KQ+QQ (all Ptrend < 0.0001) with risks increasing to 2.6 (95% CI, 1.9-3.6), 3.0 (95% CI, 2.0-4.5), and 2.0 (95% CI, 1.3-3.0), respectively. In stratified analyses among current and recent smokers, we observed decreased risks of 0.4 (95% CI, 0.2-0.9) comparing carriers of the XPC 492H allele to those homozygous for the XPC 492R allele, and 0.6 (95% CI, 0.4-1.0) comparing carriers of the XPC 499V allele to those homozygous for the XPC 499A allele).
The three XPC SNPs were in strong linkage disequilibrium (D′ = 1.0 for all pairwise comparisons; specific r2 in Fig. 2), yet, including all three SNPs in a logistic regression analysis in current and recent smokers, tended to show statistically independent risk relationships only for the XPC 499 polymorphism (Ptrend = 0.06). Haplotype analysis inferred by the expectation-maximization algorithm showed that the three SNPs distributed in four common haplotypes (A, 40%; B, 36%; C, 18%; D, 5%; specifics by smoking subgroups in Table 4). As with the genotypes, haplotype analysis revealed no overall association with adenoma risk (Pglobal permutation = 0.6); however, again, risk differentials were noted among current and recent smokers (Pglobal permutation = 0.03) due largely to lower risks associated with haplotypes C and D, possibly driven by the above described genotype relationship with XPC 499V.
XPC haplotype . | No tobacco . | Long-term quitters (quit ≥ 10 y) . | Current or recent (quit < 10 y) smokers . |
---|---|---|---|
. | OR (95% CI) . | OR (95% CI) . | OR (95% CI) . |
. | Column %: case, control . | Column %: case, control . | Column %: case, control . |
A, 492R-499A-939Q | 1.0 | 1.0 | 1.0 |
39%,42% | 40%,40% | 40%,37% | |
B, 492R-499A-939K | 1.1 (0.8-1.5) | 1.0 (0.8-1.4) | 0.9 (0.6-1.5) |
35%,35% | 36%,35% | 42%,36% | |
C, 492R-499V-939K | 1.3 (0.9-1.8) | 1.1 (0.8-1.6) | 0.6 (0.3-1.0) |
20%,18% | 18%,18% | 15%,20% | |
D, 492H-499V-939K | 1.5 (0.8-2.8) | 0.8 (0.5-1.4) | 0.4 (0.2-1.1) |
6%,4% | 5%,7% | 3%,7% | |
Global permutation P | 0.4 | 0.7 | 0.03 |
XPC haplotype . | No tobacco . | Long-term quitters (quit ≥ 10 y) . | Current or recent (quit < 10 y) smokers . |
---|---|---|---|
. | OR (95% CI) . | OR (95% CI) . | OR (95% CI) . |
. | Column %: case, control . | Column %: case, control . | Column %: case, control . |
A, 492R-499A-939Q | 1.0 | 1.0 | 1.0 |
39%,42% | 40%,40% | 40%,37% | |
B, 492R-499A-939K | 1.1 (0.8-1.5) | 1.0 (0.8-1.4) | 0.9 (0.6-1.5) |
35%,35% | 36%,35% | 42%,36% | |
C, 492R-499V-939K | 1.3 (0.9-1.8) | 1.1 (0.8-1.6) | 0.6 (0.3-1.0) |
20%,18% | 18%,18% | 15%,20% | |
D, 492H-499V-939K | 1.5 (0.8-2.8) | 0.8 (0.5-1.4) | 0.4 (0.2-1.1) |
6%,4% | 5%,7% | 3%,7% | |
Global permutation P | 0.4 | 0.7 | 0.03 |
NOTE: OR and 95% CI values were calculated by unconditional logistic regression adjusted for gender, race, and age. Subjects who never used cigarettes but used pipes or cigars were excluded.
No statistically significant differences were observed for the XPC genotype effects by age, gender, race, adenoma size, or histologic type (i.e., presence of high-grade dysplasia or villous characteristics). When analyses were restricted to Whites only (94%), all results were essentially the same.
Discussion
We found no evidence that polymorphisms in seven-nucleotide excision repair genes had an overall effect on colorectal adenoma risk; however, smoking-related risk differentials for this disease varied in relation to carriage of three linked nonsynonymous polymorphisms in XPC (R492H, A499V, and K939Q), with protective effects most clearly associated with XPC 499V.
XPC, located on chromosome 3p25, is a 33-kb-long gene containing 16 exons, encoding a 940-amino-acid protein involved in DNA damage recognition (15). The XPC SNPs evaluated in this study are located in exons 8 and 15 (9, 26, 27). Although the coding variants lead to amino acid substitution, their functional significance is uncertain; the 939K allele was associated with reduced repair of single-strand breaks measured by alkaline Comet assay (11), although XPC A499V and XPC K939Q did not show functional correlations in the post-UV host cell reactivation-based complementation assay.
Two other common polymorphisms in XPC are of interest (Fig. 2). A common biallelic poly(AT) insertion (+) polymorphism in intron 9 has been associated with an increased risk of head and neck cancer (28) and is related to differential removal of UV photoproducts in the host-cell reactivation assay (10). A common C > A SNP 3′ of the XPC intron 11 splice acceptor site was related to increased formation of the XPC mRNA non-exon 12 isoform, resulting in diminished DNA repair (9).
These polymorphisms are in strong linkage disequilibrium with the K939Q SNP in exon 15 [939Q variant was correlated with the poly(AT) insertion and −5 A splice acceptor site with D′ = 1.0 and r2 = 0.9 for both; Fig. 2] in Caucasians (9), and, by the linkage disequilibrium relationships shown here, must also be partially linked to the XPC R492H and XPC A499V variants (939Q variant was correlated in our study with the 492R and 499A alleles, r2 = 0.04 and 0.2, respectively; refs. 13, 15).
The functional data on the XPC poly(AT) insertion (+) and −5 A acceptor site variants (9-11) are of further interest in relation to differential smoking-related risks of colorectal adenoma found in our study, as similar predictive effects were noted for smoking-related head and neck cancer (28) and lung cancer (29), and the tightly linked XPC 939Q variant was associated with increased bladder cancer risk in one study (30), although not in another study (31).
Subjects in this study came from 10 different screening centers representing a broad population distribution in the United States. The cases and controls were comparable, randomly selected from the same source population, and screened by a standardized procedure (i.e., cases were not screened based on symptoms). The large study population allowed us to confine the analysis to cases with advanced adenoma, a particularly meaningful intermediate outcome with higher potential for malignant transformation. Although this fairly large sample size allowed us to explore genetic interrelationships with smoking-related adenoma risk, we had limited statistical power to make any confident conclusion. Reliable genotype data were ensured by high success and reproducibility rates. Our subjects, however, were largely Caucasians and better educated. As another consideration, in studying prevalent cases identified from the initial baseline screening, we may be oversampling slow-growing adenoma (the so-called length-sampling bias). As the PLCO Trial continues to identify additional prevalent adenomas as well as incident adenomas and colorectal cancers, we will have opportunities to address some of the limitations.
In summary, we found that three linked XPC alleles, 492R, 499A, and 939Q, were associated with increased risk of colorectal adenoma in smokers, potentially related to functional changes due to amino acid substitution or to linkage with other functional polymorphisms in the gene. Our finding on adenoma and supporting data from some functional (9-11) and epidemiologic studies of smoking-related cancers (28-30) suggest a critical role of XPC in repairing DNA damage (e.g., bulky adducts) caused by tobacco carcinogens (e.g., polycyclic aromatic hydrocarbons, heterocyclic amines, nitrosamines, and aromatic amines). These results for XPC variants, tobacco use, and adenoma, along with the smoking-related adenoma risk modifications by variants of polycyclic aromatic hydrocarbon metabolism-related genes (e.g., EPHX1, NQO1, CYP1A1, and GSTT1) that we have previously observed in this population (20, 21, 32), support the importance of genetic susceptibility to tobacco-related colorectal carcinogenesis. Although we found no relation between other nucleotide excision repair genes and smoking-related adenoma risk, potential effects by unidentified genetic variants cannot be ruled out. Confirmation in other studies of colorectal tumors and investigations of other genes and variants in this pathway and other major DNA repair pathways are needed to clarify the underlying mechanisms of DNA repair in the development of colorectal cancer.
Grant support: Intramural Research Program of the NIH, National Cancer Institute, Division of Cancer Epidemiology and Genetics.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Acknowledgments
We thank Drs. Christine Berg and Philip Prorok (Division of Cancer Prevention, National Cancer Institute); the Screening Center investigators and staff or the PLCO Cancer Screening Trial; Tom Riley and staff (Information Management Services, Inc.); Barbara O'Brien and staff (Westat, Inc.); and Drs. Bill Kopp, Wen Shao, and staff (Science Applications International Corporation-Frederick) for their contributions to making this study possible.