Interindividual differences in DNA repair capacity (DRC) may play a critical role in breast cancer risk. Previously, we determined that DRC measured via removal of in vitro–induced benzo[a]pyrene diolepoxide-DNA adducts in lymphoblastoid cell lines was lower in cases compared with controls among sisters discordant for breast cancer from the Metropolitan New York Registry of Breast Cancer Families. We have now determined genotypes for seven single nucleotide polymorphisms in five nucleotide excision repair genes, including Xeroderma pigmentosum complementation group A (XPA +62T>C), group C (XPC Lys939Gln and Ala499Val), group D (XPD Asp312Asn and Lys751Gln), and group G (XPG His1104Asp) and ERCC1 (8092 C>A) in a total of 160 sister pairs for whom DRC phenotype data were available. Overall, there were no statistically significant differences in average DRC for most of the genotypes. A final multivariate conditional logistic model, including three single nucleotide polymorphisms (XPA +62T>C, XPC Ala499Val, and XPG His1104Asp) and smoking status, only modestly predicted DRC after adjusting for case-control status and age of blood donation. The overall predictive accuracy was 61% in the model with a sensitivity of 78% and specificity of 39%. These findings suggest that those polymorphisms we have investigated to date in nucleotide excision repair pathway genes explain only a small amount of the variability in DRC. (Cancer Epidemiol Biomarkers Prev 2006;15(9):1614–20)

Elevated levels of DNA damage caused by excessive exposure to carcinogens or inherited deficiencies in repair capacity may increase an individual's susceptibility to many kinds of human cancers (1, 2). Using different in vitro phenotype assays, several recent studies have shown that DNA repair capacity (DRC) has considerable interindividual variation and that individuals with reduced DRC are at increased risk of developing breast cancer (3-6). DRC measured by the host-cell reactivation assay using both luciferase (luc) and chloramphenicol acetyltransferase (cat) as reporter genes, indicated that women with breast carcinoma had a 22% to 36% reductions in DRC compared with controls (4-6). Those with DRC lower than the median level of controls had a 3-fold increased risk for breast cancer compared with those with higher DRC (5). Our own previous finding also indicated not only that do breast cancer patients have a 24.5% decrease in average DRC compared with their sisters without breast cancer but also that there is an increase in risk with decreasing DRC (odds ratios ranged from 1.2 to 2.4 to 3.0, by quartiles; Ptrend < 0.01) using an assay measuring the removal of in vitro–induced benzo[a]pyrene diolepoxide (BPDE)-DNA adducts in lymphoblastoid cells (3). For several other types of human cancers (lung, prostate, skin, head, and neck cancers), increased risk is also associated with deficient DRC (7). These phenotype studies are usually good at predicting cancer risk but have relatively large variations in risk estimates because of the large assay variation in addition to the relatively small numbers of subjects compared with genotype association studies (2). Because the assays are very time consuming and it is not easy to develop large-scale, high-throughput methods for molecular epidemiologic studies, few subjects are assayed, a major limitation of phenotype assays (3, 5, 8-12).

It is likely that single nucleotide polymorphisms (SNP) in coding and regulatory sequences of genes in the nucleotide excision repair (NER) pathway may result in subtle structural alterations in DNA repair enzymes and modulate cancer susceptibility by affecting individual DRC (13-19). Previous studies showed that DRC for removal of BPDE-induced DNA damage measured by the host-cell reactivation assay was modulated by two XPD/ERCC2 polymorphisms (Asp312Asn and Lys751Gln; ref. 20). In other studies, DRC was consistently lower in subjects homozygous for XPC intron 9 poly(AT) or XPD 156Arg, 312Asn, and 751Gln than in subjects with other genotypes, suggesting these polymorphic alleles might have a recessive effect on the DRC phenotype (9, 21). But results for XPD Asp312Asn are inconsistent (22). It may be difficult to detect subtle differences in DRC by studying only one or two SNPs in a very complex pathway. Genotype studies might be improved by using a robust study design and including multiple polymorphisms in the same pathway to estimate risks associated with individual DRC.

In the present study, we used sisters discordant for breast cancer from the Metropolitan New York Registry, one of six international collaborating sites (http://www.metronyregistry.org/) of the Breast Cooperative Family Registry (23) to predict the influence of seven SNPs in the NER pathway genes (XPA 5′ untranslated region +62T>C, XPC Ala499Val and Lys939Gln, XPD Asp312Asn and Lys751Gln, XPG His1104Asp, and ERCC1 8092 C>A) on DRC phenotype. Our aim was to evaluate whether these genetic polymorphisms in NER pathway genes could partly explain the variation of individual DRC phenotype. If so, then sets of polymorphisms in the DNA repair pathway, which are easier to measure in large-scale epidemiologic studies, may be a proxy for DNA repair phenotype, which is less easily measured in epidemiologic settings.

Study Design and Characteristics of Study Population

The Breast Cooperative Family Registry is an international consortium established in 1995 as a resource for research on the epidemiologic, clinical, and genetic aspects of breast cancer (http://epi.grants.cancer.gov/CFR/; ref. 23). The description of the sources of study participants and recruitment and data collection methods as part of the parent Metropolitan New York Registry project have been described in detail elsewhere (3, 24). Briefly, the Metropolitan New York Registry has been recruiting high-risk breast and/or ovarian cancer families from clinical and community settings within the metropolitan New York area since 1995 who met one of the following criteria: (a) one or more members with breast cancer or ovarian cancer diagnosed at <45 years of age; (b) one or more members with both breast and ovarian cancer; (c) two or more first-degree relatives with breast or ovarian cancer diagnosed at age ≥45 years; (d) any male with breast cancer; and (e) a known BRCA mutation. The participants are interviewed either in person or by phone using epidemiology and family history questionnaires that collect information on demographics, ethnicity, history of cancers, smoking, and alcohol consumption, reproductive history, hormone use, weight, height, and physical activity. A self-administered dietary questionnaire is also provided with return by mail. In addition, a sample of blood or buccal cells is collected from participants. The present study includes 160 sister pairs with available lymphoblastoid cell lines for phenotype.

Laboratory Methods

Genomic DNA was extracted from WBC by salting out and SNPs were determined by one of two different methods: fluorescence polarization and Taqman assay. A commercial AcycloPrime-FP SNP Detection kit obtained from Perkin-Elmer Life Sciences (Boston, MA) was used for detection of five SNPs detection. The Taqman system using a 7500 Real-time PCR system (ABI Applied Biosystems, Foster City, CA) with allelic discrimination software supplied by the manufacturer was used for the remaining two SNPs. Table 1 displays the genes and polymorphisms included in this study, with their National Center for Biotechnology Information reference sequence, chromosome location, and polymorphic site as well as the PCR and single nucleotide extension primers for the fluorescence polarization assay and the Taqman probes. Genotyping data were complete for 99.5% of breast cancer cases and 98.9% of controls. Ten percent of samples were reassayed after relabeling to keep laboratory staff blinded. Concordance for reassayed samples was 99.6% (279/280).

Table 1.

Studied genes, polymorphisms, and core features in the NER pathway

GenesNCBI reference sequence no.Chromosome locationPolymorphic nucleotide/amino acid changedbSNP IDPrimers and probes
ERCC1 NM_012099 19q13.3 8092C>A rs3212986 5′-TAGTTCCTCAGT TTCCCG-3′ (sense) 
     5′-TGAGCCAATTCAGCCACT-3′ (antisense) 
     5′-CTACACAGGCTGCTGCTGCTGCT-3′ (forward probe) 
XPA NM_000380 9q22.3 Ex1+62T>C, 5′ UTR rs1800975 5′-AAGCCCCGTCGGCCGCCGCCATCTC[T/C]GGCCCACTCCGAGGACCTAGCTCCC-3′ (reverse probes) 
     Applied Biosystems assay no. C_482935_1 
XPC NM_004628 3p25 Ex9−377C>T/Ala499Val rs2228000 5′-GCCTCTGATCCCTCTGATGA-3′ (sense) 
     5′-CATCGCTGCACATTTTCTTG-3′ (antisense) 
     5′-GTAAGGACCCAAGCTTGCCAG-3′ (forward probe) 
   Ex16+211A>C/Lys939Gln rs2228001 5′-GCCTCAAAACCGAGAAGATG-3′ (sense) 
     5′-CTGCCTCAGTTT GCCTTCTC-3′ (antisense) 
     5′-GGGCGCTCAGCTCACAGCT-3′ (reverse probe) 
XPD (ERCC2) NM_000400 19q13.3 Ex10−16G>A/Asp312Asn rs1799793 5′-CGGGGCTCACCCTGCAGCACTTCGT[C/T]GGGCAGCACGGGGTTGGCCAGGTGG-3′ (reverse probes) 
     Applied Biosystems assay no. C_3145050_10 
   Ex23+61A>C/lys751Gln rs13181 5′-CCCTCTCCCTTTCCTCTGTT-3′ (sense) 
     5′-GGCAAGACTCAGGAGTCACC-3′ (antisense) 
     5′-CTGAGCAATCTGCTCTATCCTCT -3′ (reverse probe) 
XPG (ERCC) NM_000123 13q33 Ex15−344G>C/Asp1104His rs17655 5′-ACGAAAGAATACATGCGGTGGA-3′ (sense) 
     5′-ATCTGGCGGTCACGAGGAC-3′ (antisense) 
     5′-CCTCTCAGAATCTGATGGATCTTCAAGTGAA-3′ (forward probe) 
GenesNCBI reference sequence no.Chromosome locationPolymorphic nucleotide/amino acid changedbSNP IDPrimers and probes
ERCC1 NM_012099 19q13.3 8092C>A rs3212986 5′-TAGTTCCTCAGT TTCCCG-3′ (sense) 
     5′-TGAGCCAATTCAGCCACT-3′ (antisense) 
     5′-CTACACAGGCTGCTGCTGCTGCT-3′ (forward probe) 
XPA NM_000380 9q22.3 Ex1+62T>C, 5′ UTR rs1800975 5′-AAGCCCCGTCGGCCGCCGCCATCTC[T/C]GGCCCACTCCGAGGACCTAGCTCCC-3′ (reverse probes) 
     Applied Biosystems assay no. C_482935_1 
XPC NM_004628 3p25 Ex9−377C>T/Ala499Val rs2228000 5′-GCCTCTGATCCCTCTGATGA-3′ (sense) 
     5′-CATCGCTGCACATTTTCTTG-3′ (antisense) 
     5′-GTAAGGACCCAAGCTTGCCAG-3′ (forward probe) 
   Ex16+211A>C/Lys939Gln rs2228001 5′-GCCTCAAAACCGAGAAGATG-3′ (sense) 
     5′-CTGCCTCAGTTT GCCTTCTC-3′ (antisense) 
     5′-GGGCGCTCAGCTCACAGCT-3′ (reverse probe) 
XPD (ERCC2) NM_000400 19q13.3 Ex10−16G>A/Asp312Asn rs1799793 5′-CGGGGCTCACCCTGCAGCACTTCGT[C/T]GGGCAGCACGGGGTTGGCCAGGTGG-3′ (reverse probes) 
     Applied Biosystems assay no. C_3145050_10 
   Ex23+61A>C/lys751Gln rs13181 5′-CCCTCTCCCTTTCCTCTGTT-3′ (sense) 
     5′-GGCAAGACTCAGGAGTCACC-3′ (antisense) 
     5′-CTGAGCAATCTGCTCTATCCTCT -3′ (reverse probe) 
XPG (ERCC) NM_000123 13q33 Ex15−344G>C/Asp1104His rs17655 5′-ACGAAAGAATACATGCGGTGGA-3′ (sense) 
     5′-ATCTGGCGGTCACGAGGAC-3′ (antisense) 
     5′-CCTCTCAGAATCTGATGGATCTTCAAGTGAA-3′ (forward probe) 

Abbreviations: NCBI, National Center for Biotechnology Information; UTR, untranslated region.

DRC in the NER pathway was determined using a phenotype assay that measured the removal of in vitro–induced BPDE-DNA adducts in lymphoblastoid cells essentially as described previously (3). Data were available for 156 cases and 153 controls who were 97.5% and 95.6% of the eligible cases (160) and controls (160) with lymphoblastoid cell lines. All assays were done with the laboratory blinded to the subject's case control status.

Statistical Methods

Hardy-Weinberg equilibrium was tested to compare the observed and expected genotype frequencies among cases and controls. DRC was calculated from the formula % DRC = 100 × [(I0IC) − (I4IC)] / (I0IC), where I0 is the intensity of treated cells immediately after treatment, I4 after 4 hours of repair, and IC is the value for control, untreated cells (3). Paired t test was used for comparisons of continuous variables and McNemar's test for comparison of categorical variables between cases and sister controls. One-way ANOVA was used to compare the means of DRC across the various genotypes for each gene.

A conditional logistic regression model was used to predict risk for low DRC defined as having a DRC ≤35.1% (the median DRC value of the controls; ref. 25). The likelihood ratio test was used to assess the effect of variables of interest, such as case-control status (yes and no), genotype status, age at time of blood donation (<50, 50-59, and ≥60 years old), smoking status (yes and no), and body mass index (BMI) level (<25, 25-29, and ≥30). Because age at time of blood donation and BMI, as continuous variables, were not linearly related to DRC, they were modeled as categorical variables for the analysis. The final model was developed in the following way. First, we ran all models separately among cases and controls and we also pooled the data modeling multiplicative interaction terms between the case-control status and the predictors described above. If the interaction term was not significant at the 0.20 level, we reassessed the effect of the variable on low DRC without adjusting for its interaction term. If the interaction term was significant, we included it in the final model when we assessed the model fit. Second, predictors were considered as potential predictors in the final model if they were significant at the 0.20 level. Age group and case-control status, however, were considered to be important risk factors of low DRC a priori and therefore adjusted for in all models. Those variables that were no longer significant at the 0.20 levels were removed one at a time based on their corresponding likelihood ratio test statistic. Next, variables that had been initially omitted in step 2 are included one at a time to reassess their importance in the presence of the remaining variables. The final model includes those predictors significant at the 0.20 level along with age and case-control status. The predictability of the model was assessed using receiver operator characteristic (ROC) curves, and sensitivity and specificity were calculated (26). All of the statistical analyses were carried out using Statistical Analysis System version 9.0 (SAS Institute, Cary, NC).

Selected characteristics of the sisters discordant for breast cancer are presented in Table 2. Overall, there were no statistically significant differences between breast cancer cases and sister controls in the distribution of age at blood donation, smoking status, number of cigarettes daily, and BMI. Similar to our previous results, in this slightly larger sample, DRC was statistically significantly lower in the cases than in the controls (26.5% versus 35.4%; P < 0.01), representing an average 33.5% reduction in DRC among the cases. Low DRC, defined as having a value below the median of the controls, was found to be statistically significantly associated with elevated breast cancer risk (adjusted odds ratio, 2.3; 95% confidence interval, 1.4-3.8). This is consistent with our previous observation (3).

Table 2.

Comparison of breast cancer cases and unaffected sister controls by selected variables

Selected variablesCases (n = 160)Controls (n = 160)P*
Age at blood donation, mean (SD) 49.4 (10.0) 49.8 (10.8) 0.38 
Smoking status, n (%)    
    Nonsmoker 94 (58.8) 86 (53.8) 0.37 
    Ever smoker 66 (41.2) 74 (46.2)  
No. cigarette daily, mean (SD) 15.2 (12.1) 14.8 (10.7) 0.94 
BMI, mean (SD) 25.7 (5.8) 25.4 (4.5) 0.50 
DRC, mean (SD) 26.5 (22.5) 35.4 (24.3) <0.01 
DRC    
    >35.1% (higher) 51 (32.7) 78 (51.0) <0.01 
    ≤35.1% (low) 105 (67.3) 75 (49.0)  
Selected variablesCases (n = 160)Controls (n = 160)P*
Age at blood donation, mean (SD) 49.4 (10.0) 49.8 (10.8) 0.38 
Smoking status, n (%)    
    Nonsmoker 94 (58.8) 86 (53.8) 0.37 
    Ever smoker 66 (41.2) 74 (46.2)  
No. cigarette daily, mean (SD) 15.2 (12.1) 14.8 (10.7) 0.94 
BMI, mean (SD) 25.7 (5.8) 25.4 (4.5) 0.50 
DRC, mean (SD) 26.5 (22.5) 35.4 (24.3) <0.01 
DRC    
    >35.1% (higher) 51 (32.7) 78 (51.0) <0.01 
    ≤35.1% (low) 105 (67.3) 75 (49.0)  
*

Ps were respectively derived from paired t test for comparisons of continuous variables and McNemar's test for comparison of categorical variables between cases and sister controls.

Median value.

OR, 2.3; 95% CI, 1.4-3.8, adjusted for age at blood donation, BMI, and smoking status.

We compared the average DRC among different genotypes in the NER pathway genes separately by case-control status (Table 3). Overall, there were no statistically significant differences in mean DRC by allele for most of the SNPs. A significantly higher DRC was found among controls with one or two copies of the XPC codon 499 variant T allele compared with those carrying the wild-type CC genotype (P = 0.04). Among breast cancer cases, subjects with the XPD codon 312 heterozygous GA genotype had significantly higher DRC (31.1%) compared with those with the wild-type GG genotype (21.4%). But there was no significant difference in DRC between those with the codon 312 homozygous AA genotype (22.5%) and with GG genotype among breast cancer cases.

Table 3.

Genotypes in the NER pathway and mean of DRC among breast cancer cases and unaffected sister controls

GenesGenotypesCases
Controls
No.Mean (SD)Ps*No.Mean (SD)Ps*
ERCC1 CC 84 25.2 (21.4) 0.62 86 35.3 (23.0) 0.96 
 CA 62 27.4 (22.7)  55 36.2 (25.4)  
 AA 10 31.9 (30.6)  11 34.2 (30.1)  
XPA TT 60 28.5 (23.6) 0.73 66 36.4 (25.6) 0.31 
 TC 73 26.2 (21.5)  59 37.2 (22.9)  
 CC 21 24.3 (23.0)  27 28.8 (24.5)  
XPC codon 499 CC 96 25.9 (20.6) 0.98 91 31.3 (22.4) 0.04 
 CT 50 26.7 (24.8)  55 41.2 (25.3)  
 TT 26.2 (25.5)  43.5 (40.9)  
XPC codon 939 AA 53 23.9 (22.8) 0.59 53 36.1 (25.1) 0.68 
 AC 85 28.0 (23.0)  70 36.5 (23.9)  
 CC 17 26.2 (19.8)  27 31.8 (25.0)  
XPD codon 312 GG 60 21.4 (20.5) 0.03 59 33.1 (24.4) 0.55 
 GA 80 31.1 (23.7)  64 35.8 (23.3)  
 AA 16 22.5 (20.0)  30 39.0 (26.7)  
XPD codon 751 AA 63 22.1 (19.0) 0.10 74 36.9 (24.6) 0.74 
 AC 66 28.2 (24.1)  57 34.2 (23.6)  
 CC 25 32.5 (25.6)  22 33.2 (26.0)  
XPG codon 1104 GG 83 29.4 (24.5) 0.15 82 37.6 (22.3) 0.35 
 GC 63 24.0 (19.8)  62 32.2 (26.6)  
 CC 16.5 (17.4)  41.2 (27.8)  
GenesGenotypesCases
Controls
No.Mean (SD)Ps*No.Mean (SD)Ps*
ERCC1 CC 84 25.2 (21.4) 0.62 86 35.3 (23.0) 0.96 
 CA 62 27.4 (22.7)  55 36.2 (25.4)  
 AA 10 31.9 (30.6)  11 34.2 (30.1)  
XPA TT 60 28.5 (23.6) 0.73 66 36.4 (25.6) 0.31 
 TC 73 26.2 (21.5)  59 37.2 (22.9)  
 CC 21 24.3 (23.0)  27 28.8 (24.5)  
XPC codon 499 CC 96 25.9 (20.6) 0.98 91 31.3 (22.4) 0.04 
 CT 50 26.7 (24.8)  55 41.2 (25.3)  
 TT 26.2 (25.5)  43.5 (40.9)  
XPC codon 939 AA 53 23.9 (22.8) 0.59 53 36.1 (25.1) 0.68 
 AC 85 28.0 (23.0)  70 36.5 (23.9)  
 CC 17 26.2 (19.8)  27 31.8 (25.0)  
XPD codon 312 GG 60 21.4 (20.5) 0.03 59 33.1 (24.4) 0.55 
 GA 80 31.1 (23.7)  64 35.8 (23.3)  
 AA 16 22.5 (20.0)  30 39.0 (26.7)  
XPD codon 751 AA 63 22.1 (19.0) 0.10 74 36.9 (24.6) 0.74 
 AC 66 28.2 (24.1)  57 34.2 (23.6)  
 CC 25 32.5 (25.6)  22 33.2 (26.0)  
XPG codon 1104 GG 83 29.4 (24.5) 0.15 82 37.6 (22.3) 0.35 
 GC 63 24.0 (19.8)  62 32.2 (26.6)  
 CC 16.5 (17.4)  41.2 (27.8)  
*

One-way ANOVA test adjusted for age group (<50, 50-59, and ≥60 years).

We conducted an analysis to determine whether the seven SNPs in the NER pathway genes and factors (age group of blood donation, cigarette smoking status, and BMI levels) predicted low DRC phenotype in a multivariate conditional logistic model (Table 4). The final model included three genes (XPA +62T>C, XPC Ala499Val, and XPG His1104Asp) and ever smoking after adjusting for case-control status and age group of blood donation.

Table 4.

Ps and odds ratios in conditional logistic regression model for prediction of DRC

VariablesNo.
Final model
High DRC*Low DRC*OR (95% CI)Ps
ERCC1 CA 59 70   
 AA 11 12   
XPA TC 61 83 1.2 (0.4-3.8) 0.17 
 CC 16 32 5.9 (0.9-38.8)  
XPC codon 499 CT 48 60 0.3 (0.1-0.9) 0.06 
 TT 0.2 (0.0-2.3)  
XPC codon 939 AC 68 98   
 CC 23 28   
XPD codon 312 GA 70 82   
 AA 24 27   
XPD codon 751 AC 57 76   
 CC 20 31   
XPG codon 1104 GC 58 82 3.5 (0.9-14.2) 0.20 
 CC 2.6 (0.1-70.3)  
Case control status Cases 57 110 13.4 (3.6-49.4) <0.01 
Smoking status Ever smoking 66 81 2.2 (0.6-7.9) 0.01 
Ever smoking × case control status Cases and ever smoking 27 41 0.1 (0.0-0.5)  
BMI 25-30 44 61   
 >30 22 30   
Age group of blood donation (y) 50-59 47 57 1.2 (0.4-3.9) 0.35 
 ≥60 21 30 7.2 (0.5-105.3)  
VariablesNo.
Final model
High DRC*Low DRC*OR (95% CI)Ps
ERCC1 CA 59 70   
 AA 11 12   
XPA TC 61 83 1.2 (0.4-3.8) 0.17 
 CC 16 32 5.9 (0.9-38.8)  
XPC codon 499 CT 48 60 0.3 (0.1-0.9) 0.06 
 TT 0.2 (0.0-2.3)  
XPC codon 939 AC 68 98   
 CC 23 28   
XPD codon 312 GA 70 82   
 AA 24 27   
XPD codon 751 AC 57 76   
 CC 20 31   
XPG codon 1104 GC 58 82 3.5 (0.9-14.2) 0.20 
 CC 2.6 (0.1-70.3)  
Case control status Cases 57 110 13.4 (3.6-49.4) <0.01 
Smoking status Ever smoking 66 81 2.2 (0.6-7.9) 0.01 
Ever smoking × case control status Cases and ever smoking 27 41 0.1 (0.0-0.5)  
BMI 25-30 44 61   
 >30 22 30   
Age group of blood donation (y) 50-59 47 57 1.2 (0.4-3.9) 0.35 
 ≥60 21 30 7.2 (0.5-105.3)  

Abbreviation: OR (95% CI), odds ratio (95% confidence interval).

*

DRC is categorized based on the median of controls, 0: >35.1 (high DRC); 1: ≤35.1 (low DRC).

Ps from likelihood ratio test.

ROC curve analysis was used to identify the predictive accuracies of the forecasted model for low DRC. ROC curves were separately obtained from the full model (including all genetic and environmental variables available) and the final model. The overall predictive accuracy (defined as the percentage of patients correctly classified as case and controls correctly distinguished as noncase using the model) was 61% for the probability cut point of 0.50 in the final forecasted model with a sensitivity of 78% and a specificity of 39%. This is similar to the predictive accuracy (63%) of the full model that has a sensitivity of 74% and a specificity of 48% (Fig. 1). In the full model of environmental variables only, the overall predictive accuracy was 58% for the probability cut point of 0.50 with a sensitivity 73% and a specificity 38% (data not shown), which are slightly lower than that in the final model.

Figure 1.

ROC curves of sensitivity verse 1-specificity. A. ROC for full model, the overall predictive accuracy is 63% for the probability cut point of 0.50. Sensitivity, 74%; specificity, 48%. B. ROC for final model, the overall predictive accuracy is 61% for the probability cut point of 0.50. Sensitivity, 78%; specificity, 39%.

Figure 1.

ROC curves of sensitivity verse 1-specificity. A. ROC for full model, the overall predictive accuracy is 63% for the probability cut point of 0.50. Sensitivity, 74%; specificity, 48%. B. ROC for final model, the overall predictive accuracy is 61% for the probability cut point of 0.50. Sensitivity, 78%; specificity, 39%.

Close modal

This molecular epidemiologic study used seven genetic polymorphisms in the NER pathway genes to predict individual DRC phenotype. SNP profiles in the NER pathway and ever smoking status only modestly predict variation of individual DRC after adjusting for case-control status and age of blood donation (Table 4). The SNPs selected in the final predictive model included XPA +62T>C, XPC Ala499Val, and XPG His1104Asp. The overall predictive accuracy of the final model was 61% with a moderate sensitivity (78%), but the specificity was relative low (39%). Our failure to predict individual DRC phenotype with these variables suggests that there are other unknown factors (including genetic, epigenetic, and nongenetic factors) that contribute to individual differences in DRC.

Until now, few studies have examined the effects of genetic polymorphisms in the NER pathway genes on individual DRC phenotype. A small number of SNPs are usually analyzed and the results have not always been concordant. Wu et al. (12) observed that the XPA-A23G variant was associated with more efficient DRC in control subjects compared with those with the homozygous A allele but not in cases. Qiao et al. (9) found that DRC was lower in subject homozygous for the XPC intron 9 poly(AT) variant, and Spitz et al. (20, 21) reported that two XPD polymorphisms (Asp312Asn and Lys751Gln) were associated with reduced NER capacity and increased risk of lung cancer. But, other studies did not show the same associations with cancer risk (22, 27, 28). Genotyping in cancer association studies has also provided some evidence that these polymorphisms might be involved in cancer risk (12, 29, 30). Our ability to predict only a small part of individual DRC phenotype was not surprising because we included only ∼3.5% of the total known SNPs in the NER pathway. There are >60 other SNPs in 30 NER genes unanalyzed in the present study that might potentially influence function (7, 31). The NER is a pathway composed of four major consecutive steps: damage recognition, unwinding around the damaged sites, excision of the oligonucleotide containing the site of damage, and gap filling by DNA repair synthesis (31, 32). The DRC phenotype assay used in the current study measures the ability to carry out all four steps in NER, whereas the seven genotypes studied here are primarily involved in the first three steps.

In addition to genetic polymorphisms, previous study suggested that cancer status, age, and cigarette smoking might affect DRC phenotype. Reports using direct or indirect measurement of DNA damage repair found a decline in DRC with donor age in human dermal fibroblasts, lymphocytes, and transformed lymphoblastoid cells (33-36). A study in Japan found that cells from elderly subjects recovered slower than cells from young subjects after UV stimulation (31). The variable treatment history for cancer patients, either radiation or chemotherapy, may also lead to decreased DRC (37-39). To control for the potential effect of these factors on DRC, we adjusted for breast cancer status and age group of blood donation in the process of model selection. In the final model, consistent with previous observations from cancer patients (37-39), breast cancer status was significantly associated with low DRC phenotype. We reported previously (3) that DRC decreased with age among the unaffected sisters as expected but had an opposite pattern in the sisters with cancer. In these analyses, we did not observe a linear association with age; women >60 years were more likely to have poorer DNA repair, although the point estimate was not statistically significant. We observed a positive association between smoking and DRC phenotype (Table 4) in agreement with another study finding smoking associated with DRC (40).

Compared with previous studies, our study has several advantages. One strength was the family-based design using sisters from the same families as controls, which can potentially reduce confounding from population admixture as well as lifestyle factors that cluster within families (41). The study included nuclear families that were not selected based on a population-based scheme and may limit the generalizability of the findings (24). The study sample is of moderate size limiting our statistical power. Given our sample size and the range of frequencies for our genotypes, we were only able to detect associations 2-fold or higher between genotype and phenotype. Other factors possibly predicting individual DRC phenotype, such as dietary antioxidants, were not examined in the current study (42). These results should therefore be viewed as preliminary until confirmed in larger studies.

In conclusion, specific SNPs in the NER pathway genes (XPA +62T>C, XPC Ala499Val, and XPG His1104Asp) and ever smoking only modestly predict individual variation in DRC; overall predictive accuracy (61%) was relatively low (sensitivity, 78%; specificity, 39%). These NER pathway SNPs that we selected are commonly assessed in epidemiologic studies and their poor prediction for DNA repair phenotype may offer some evidence of why studies only using SNPs have been inconsistent. It may still be possible that an extensive panel of DNA repair SNPs may predict phenotype, but our findings suggest that the small subsets of SNPs are likely a poor proxy for phenotype. The results obtained from discordant sister pairs in a breast cancer family registry should serve as a stimulus for further studies on the complex molecular events that affect individual DRC phenotype.

Grant support: NIH grants U01 CA69398, P30 CA13696, and P30 ES09089, the Breast Cancer Research Foundation, and Cancer Prevention and Research Foundation Postdoctoral Research Fellowship (D.O. Kennedy).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

We thank Irina Gurvich for the processing, storage, and inventory of biospecimens for the study.

1
Berwick M, Vineis P. Markers of DNA repair and susceptibility to cancer in humans: an epidemiologic review.
J Natl Cancer Inst
2000
;
92
:
874
–97.
2
Neumann AS, Sturgis EM, Wei Q. Nucleotide excision repair as a marker for susceptibility to tobacco-related cancers: a review of molecular epidemiological studies.
Mol Carcinog
2005
;
42
:
65
–92.
3
Kennedy DO, Agarwal M, Shen J, et al. DNA repair capacity via removal of BPDE-DNA adducts in lymphoblastoid cell lines from sisters discordant for breast cancer.
J Natl Cancer Inst
2005
;
97
:
127
–32.
4
Ramos JM, Ruiz A, Colen R, et al. DNA repair and breast carcinoma susceptibility in women.
Cancer
2004
;
100
:
1352
–7.
5
Shi Q, Wang LE, Bondy ML, et al. Reduced DNA repair of benzo[a]pyrene diol epoxide-induced adducts and common XPD polymorphisms in breast cancer patients.
Carcinogenesis
2004
;
25
:
1695
–700.
6
Grossman L, Matanoski G, Farmer E, et al. DNA repair as a susceptibility factor in chronic diseases in human populations. In: Dizdaroglu M, Karakaya E, editors. Advances in DNA damage and repair. New York: Kluwer Academic/Plenum Publishers; 1999. p. 149–67.
7
Lockett KL, Snowhite IV, Hu JJ. Nucleotide-excision repair and prostate cancer risk.
Cancer Lett
2005
;
220
:
125
–35.
8
Mayer C, Popanda O, Zelezny O, et al. DNA repair capacity after γ-irradiation and expression profiles of DNA repair genes in resting and proliferating human peripheral blood lymphocytes.
DNA Repair (Amst)
2002
;
1
:
237
–50.
9
Qiao Y, Spitz MR, Shen H, et al. Modulation of repair of ultraviolet damage in the host-cell reactivation assay by polymorphic XPC and XPD/ERCC2 genotypes.
Carcinogenesis
2002
;
23
:
295
–9.
10
Vodicka P, Kumar R, Stetina R, et al. Genetic polymorphisms in DNA repair genes and possible links with DNA repair rates, chromosomal aberrations, and single-strand breaks in DNA.
Carcinogenesis
2004
;
25
:
757
–63.
11
Wang Y, Spitz MR, Zhu Y, et al. From genotype to phenotype: correlating XRCC1 polymorphisms with mutagen sensitivity.
DNA Repair (Amst)
2003
;
2
:
901
–8.
12
Wu X, Zhao H, Wei Q, et al. XPA polymorphism associated with reduced lung cancer risk and a modulating effect on nucleotide excision repair capacity.
Carcinogenesis
2003
;
24
:
505
–9.
13
Chen S, Tang D, Xue K, et al. DNA repair gene XRCC1 and XPD polymorphisms and risk of lung cancer in a Chinese population.
Carcinogenesis
2002
;
23
:
1321
–5.
14
Hou SM, Falt S, Angelini S, et al. The XPD variant alleles are associated with increased aromatic DNA adduct level and lung cancer risk.
Carcinogenesis
2002
;
23
:
599
–603.
15
Sturgis EM, Castillo EJ, Li L, et al. Polymorphisms of DNA repair gene XRCC1 in squamous cell carcinoma of the head and neck.
Carcinogenesis
1999
;
20
:
2125
–9.
16
Winsey SL, Haldar NA, Marsh HP, et al. A variant within the DNA repair gene XRCC3 is associated with the development of melanoma skin cancer.
Cancer Res
2000
;
60
:
5612
–6.
17
Divine KK, Gilliland FD, Crowell RE, et al. The XRCC1 399 glutamine allele is a risk factor for adenocarcinoma of the lung.
Mutat Res
2001
;
461
:
273
–8.
18
Kumar R, Hoglund L, Zhao C, et al. Single nucleotide polymorphisms in the XPG gene: determination of role in DNA repair and breast cancer risk.
Int J Cancer
2003
;
103
:
671
–5.
19
Collins A, Harrington V. Repair of oxidative DNA damage: assessing its contribution to cancer prevention.
Mutagenesis
2002
;
17
:
489
–93.
20
Spitz MR, Wu X, Wang Y, et al. Modulation of nucleotide excision repair capacity by XPD polymorphisms in lung cancer patients.
Cancer Res
2001
;
61
:
1354
–7.
21
Qiao Y, Spitz MR, Guo Z, et al. Rapid assessment of repair of ultraviolet DNA damage with a modified host-cell reactivation assay using a luciferase reporter gene and correlation with polymorphisms of DNA repair genes in normal human lymphocytes.
Mutat Res
2002
;
509
:
165
–74.
22
Lunn RM, Helzlsouer KJ, Parshad R, et al. XPD polymorphisms: effect on DNA repair proficiency.
Carcinogenesis
2000
;
21
:
551
–5.
23
John EM, Hopper JL, Beck JC, et al. The Breast Cancer Family Registry: an infrastructure for cooperative multinational, interdisciplinary, and translational studies of the genetic epidemiology of breast cancer.
Breast Cancer Res
2004
;
6
:
R375
–89.
24
Ahsan H, Chen Y, Whittemore AS, et al. A family-based genetic association study of variants in estrogen-metabolism genes COMT and CYP1B1 and breast cancer risk.
Breast Cancer Res Treat
2004
;
85
:
121
–31.
25
Hosmer DW, Lemenshow S. Applied logistic regression. New York: John Wiley and Sons; 1989.
26
Zweig MH, Campbell G. Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine.
Clin Chem
1993
;
39
:
561
–77.
27
Leng S, Cheng J, Pan Z, et al. Associations between XRCC1 and ERCC2 polymorphisms and DNA damage in peripheral blood lymphocyte among coke oven workers.
Biomarkers
2004
;
9
:
395
–406.
28
Lee GY, Jang JS, Lee SY, et al. XPC polymorphisms and lung cancer risk.
Int J Cancer
2005
;
115
:
807
–13.
29
Blankenburg S, Konig IR, Moessner R, et al. Assessment of 3 xeroderma pigmentosum group C gene polymorphisms and risk of cutaneous melanoma: a case-control study.
Carcinogenesis
2005
;
26
:
1085
–90.
30
Terry MB, Gammon MD, Zhang FF, et al. Polymorphism in the DNA repair gene XPD, polycyclic aromatic hydrocarbon-DNA adducts, cigarette smoking, and breast cancer risk.
Cancer Epidemiol Biomarkers Prev
2004
;
13
:
2053
–8.
31
Takahashi Y, Moriwaki S, Sugiyama Y, et al. Decreased gene expression responsible for post-ultraviolet DNA repair synthesis in aging: a possible mechanism of age-related reduction in DNA repair capacity.
J Invest Dermatol
2005
;
124
:
435
–42.
32
de Boer J, Hoeijmakeers JHJ. Nucleotide excision repair and human syndromes.
Carcinogenesis
2000
;
21
:
453
–60.
33
Goukassian D, Gad F, Yaar M, et al. Mechanisms and implications of the age-associated decrease in DNA repair capacity.
FASEB J
2000
;
14
:
1325
–34.
34
Grossman L. Epidemiology of ultraviolet-DNA repair capacity and human cancer.
Environ Health Perspect
1997
;
105
Suppl 4:
927
–30.
35
Moriwaki S, Ray S, Tarone RE, et al. The effect of donor age on the processing of UV-damaged DNA by cultured human cells: reduced DNA repair capacity and increased DNA mutability.
Mutat Res
1996
;
364
:
117
–23.
36
Wei Q, Matanoski GM, Farmer ER, et al. DNA repair capacity for ultraviolet light-induced damage is reduced in peripheral lymphocytes from patients with basal cell carcinoma.
J Invest Dermatol
1995
;
104
:
933
–6.
37
Alapetite C, Thirion P, de la RA, et al. Analysis by alkaline comet assay of cancer patients with severe reactions to radiotherapy: defective rejoining of radioinduced DNA strand breaks in lymphocytes of breast cancer patients.
Int J Cancer
1999
;
83
:
83
–90.
38
Plappert UG, Stocker B, Fender H, et al. Changes in the repair capacity of blood cells as a biomarker for chronic low-dose exposure to ionizing radiation.
Environ Mol Mutagen
1997
;
30
:
153
–60.
39
Roth M, Rogers S, Roberts SS, et al. Stimulation of UV-induced DNA excision repair by chemotherapeutic drugs in cancer patients.
Anticancer Res
1994
;
14
:
809
–15.
40
D'Errico M, Calcagnile A, Iavarone I, et al. Factors that influence the DNA repair capacity of normal and skin cancer-affected individuals.
Cancer Epidemiol Biomarkers Prev
1999
;
8
:
553
–9.
41
Cardon LR, Palmer LJ. Population stratification and spurious allelic association.
Lancet
2003
;
361
:
598
–604.
42
Moller P, Loft S. Interventions with antioxidants and nutrients in relation to oxidative DNA damage and repair.
Mutat Res
2004
;
551
:
79
–89.