Introduction
The insulin-like growth factor-I (IGF-I) signaling pathway regulates both cellular proliferation and apoptosis, thereby making it a compelling candidate gene for cancer pathogenesis. Epidemiologic studies indicate that high levels of circulating IGF-I are associated with elevated risk of breast cancer in premenopausal women (1, 2). Few studies have investigated the role of genetic variation in IGF-I in relation to breast cancer and have focused solely on the polymorphic dinucleotide repeat (CA) in the promoter region (3-7). Evidence of an effect of this polymorphism on breast cancer risk remains inconsistent (8). To date, no studies have thoroughly characterized common variation in IGF-I and examined this in relation to breast cancer risk. We characterized genetic variation across the IGF-I locus using a combination of direct (resequencing) and indirect (haplotype-based approach) methods and then did a large case-control analysis to assess association between this inherited variation and sporadic breast cancer risk in a multiethnic cohort.
Materials and Methods
Study Population
Detailed information about the Multiethnic Cohort–nested breast cancer case-control study has been reported previously (9). A total of 1,615 incident breast cancer cases (21% African American, 7% Native Hawaiian, 26% Japanese, 21% Latina, and 25% White) and 1,962 controls (22% African American, 15% Native Hawaiian, 21% Japanese, 20% Latina, and 22% White) were included in the present study. Controls were women without breast cancer before the cohort entry and without a diagnosis up to April 2002. Controls were frequency matched to cases by age and ethnicity.
IGF-I Characterization
The details of exon resequencing, linkage disequilibrium characterization and tagging single nucleotide polymorphism (tSNP) selection have been described (10). Briefly, IGF-I exons were sequenced in 95 advanced cases of breast cancer (n = 19 per racial-ethnic group). No missense SNP was identified during this sequencing effort. To characterize the linkage disequilibrium patterns, 64 SNPs (1 SNP/2.4 kb) spanning 156 kb were genotyped in a multiethnic panel of 349 unrelated women with no history of cancer. Haplotype blocks (regions of strong linkage disequilibrium) were defined using the methods of Gabriel et al. (11). Twenty-nine tSNPs were selected to predict the common haplotypes with high probability (average Rh2 = 0.90) and 35 unmeasured SNPs (i.e., SNPs not genotyped in case-control samples). The method of predicting unmeasured SNPs has been described elsewhere (10).
Case-Control Genotyping
All SNPs were genotyped by Taqman assay (Applied Biosystems, Foster City, CA). Taqman primers, probes, and conditions for genotyping assays are available upon request. The average genotyping success rate was 97%. All genotyping was done with laboratory personnel blinded to case-control status of the samples, which included quality control samples for validation. Concordance for quality control samples was 99.7%.
Statistical Analysis
We used the χ2 test to assess departures of the genotype distribution from Hardy-Weinberg equilibrium among controls in each ethnic group. All tSNPs conformed to Hardy-Weinberg equilibrium at P < 0.01 level. Methods for haplotype estimation and case-control analysis were previously described (9). A likelihood ratio test was done to globally test for haplotype effect in each block. Unconditional logistic regression models were used to estimate odds ratios (OR) and 95% confidence intervals for haplotype-specific and genotype-specific risks, adjusting for age and ethnicity. We also did multivariate analyses with established breast cancer risk factors included in the models (9). All Ps are two sided. SAS version 8.2 (SAS Institute, Cary, NC) was used for all analyses.
Results
We tested both haplotypes and single tSNPs to maximize the likelihood that we have captured all of the unmeasured variation at the locus. Four regions of strong linkage disequilibrium (haplotype blocks), ranging from 11 to 60 kb, were identified, and 5 to 11 common haplotypes (i.e., frequency ≥ 5%) were observed within each block. The haplotypes for each ethnic group accounted for 77% to 100% of the chromosomes in the population (10).
There were no significant associations between inherited variation at IGF-I and breast cancer risk. The haplotype frequencies between cases and controls were similar in each block (Ps of the global test for association with breast cancer > 0.46; Table 1). This observation provides little support for the existence of a common disease allele at the IGF-I locus strongly contributing to breast cancer risk. Within each block, no individual haplotype was observed to be associated with breast cancer risk. We also assessed association between single tSNPs (and combinations of tSNPs) and breast cancer risk. Using the 29 tSNPs and genotype data obtained from the multiethnic panel, we estimated for each individual the alleles of the 35 unmeasured SNPs (10). We were able to predict 33 of 35 SNPs (average Rs2 = 0.91); SNP 18 (hCV2801091) and SNP 63 (rs2971578) could not be predicted with Rs2 > 0.7. SNP 18 was then genotyped in case-control samples, whereas SNP 63 failed assay design. Figure 1 shows the association between 63 SNPs and breast cancer risk, and no convincing association was observed. Two nominally significant negative associations were observed: SNP 1 (rs855228; Ptrend = 0.03) and SNP 4 (rs7965399; Ptrend = 0.04). We performed permutation test to evaluate how often these nominally significant associations would be expected by chance (10). The permutation P was 0.64 for a nominal P = 0.03, which indicates a similar result would be seen by chance 64% of the time. Analysis of cases with advanced disease only (n = 421) or adjustment for established breast cancer risk factors and menopausal status did not alter the results.
. | Haplotype . | Haplotype frequency (%)* . | . | . | . | . | Odds ratio† (95% confidence interval), all group combined . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
. | . | AA . | NH . | JA . | LA . | WH . | . | |||||||
. | . | Case/control . | Case/control . | Case/control . | Case/control . | Case/control . | . | |||||||
Block 1 | ||||||||||||||
1a | TGGA | 47.3/42.0 | 72.3/74.3 | 65.5/64.4 | 74.7/74.5 | 85.0/82.2 | Reference | |||||||
1b | TTAA | 22.8/24.6 | 5.7/6.8 | 6.0/4.5 | 9.9/8.8 | 10.4/10.9 | 0.93 (0.79-1.08) | |||||||
1c | CGGA | 14.6/16.0 | 0.84 (0.64-1.09) | |||||||||||
1d | CGAA | 11.6/12.9 | 0.86 (0.65-1.12) | |||||||||||
1e | CGAT | 17.8/15.0 | 24.3/25.7 | 9.0/11.3 | 0.90 (0.76-1.06) | |||||||||
Global test: χ2 = 4.39 with 5 df, P = 0.494 | ||||||||||||||
Block 2 | ||||||||||||||
2a | GCGCA | 55.9/60.2 | 51.7/53.7 | 51.8/55.0 | 51.7/48.1 | 52.0/51.2 | Reference | |||||||
2b | GCGGA | 5.4/6.7 | 0.85 (0.58-1.25) | |||||||||||
2c | GCGGG | 29.6/26.1 | 23.7/26.7 | 17.2/17.3 | 12.5/12.5 | 21.8/23.4 | 1.01 (0.89-1.15) | |||||||
2d | GCCGG | 14.7/10.5 | 24.5/22.3 | 9.9/11.5 | 1.06 (0.89-1.26) | |||||||||
2e | ACGCA | 5.9/6.1 | 6.2/4.8 | 6.7/7.9 | 10.9/8.1 | 1.17 (0.95-1.43) | ||||||||
2f | AAGCA | 5.8/3.7 | 18.2/19.6 | 13.8/14.2 | 1.03 (0.86-1.24) | |||||||||
Global test: χ2 = 5.38 with 6 df, P = 0.496 | ||||||||||||||
Block 3 | ||||||||||||||
3a | TAGACTCGCA | 41.9/43.7 | 49.2/50.2 | 51.8/52.6 | 47.1/42.9 | 45.3/45.1 | Reference | |||||||
3b | TAGACTCTCA | 5.4/3.0 | 5.2/4.1 | 17.5/18.5 | 12.3/12.1 | 1.06 (0.88-1.28) | ||||||||
3c | TAGACACGCA | 12.4/14.8 | 0.78 (0.58-1.05) | |||||||||||
3d | TAGACACGGA | 6.0/6.9 | 0.78 (0.52-1.17) | |||||||||||
3e | TAGAGTCGCA | 3.3/5.2 | 7.6/8.8 | 12.7/11.2 | 1.02 (0.83-1.26) | |||||||||
3f | TAGGCACGGG | 13.5/9.7 | 22.4/21.8 | 1.09 (0.89-1.34) | ||||||||||
3g | TAGGCAAGGG | 6.4/8.3 | 0.83 (0.59-1.18) | |||||||||||
3h | TGGACACGCA | 7.2/7.6 | 0.90 (0.68-1.19) | |||||||||||
3i | TGGACACGGA | 11.0/9.4 | 20.6/20.7 | 20.6/20.9 | 5.6/6.2 | 12.0/11.4 | 1.01 (0.87-1.17) | |||||||
3j | TGAACACGGA | 5.7/4.0 | 5.3/4.8 | 6.6/5.9 | 1.16 (0.90-1.50) | |||||||||
3k | CAGGCACGGA | 5.1/6.9 | 0.74 (0.48-1.13) | |||||||||||
Global test: χ2 = 10.71 with 11 df, P = 0.468 | ||||||||||||||
Block 4 | ||||||||||||||
4a | GGCCCTGACA | 26.7/26.3 | 37.4/33.6 | 25.6/25.1 | 47.4/47.5 | 40.0/40.0 | Reference | |||||||
4b | GGTCCGGGCA | 14.2/14.5 | 13.5/15.4 | 18.3/19.1 | 10.2/9.4 | 13.6/14.5 | 0.97 (0.83-1.13) | |||||||
4c | GGTCCGGGCG | 7.5/7.2 | 0.91 (0.67-1.23) | |||||||||||
4d | GGTCCTGACA | 5.6/6.3 | 0.94 (0.64-1.36) | |||||||||||
4e | GATCCGCGCA | 21.5/23.9 | 4.7/8.1 | 10.0/9.7 | 11.9/10.9 | 0.93 (0.79-1.11) | ||||||||
4f | GATCCGCGCG | 4.0/5.6 | 9.3/9.3 | 0.85 (0.65-1.12) | ||||||||||
4g | GATCTGCGCG | 6.5/5.5 | 17.1/17.9 | 19.4/20.5 | 1.00 (0.83-1.21) | |||||||||
4h | GACCCTGACA | 6.5/5.7 | 0.90 (0.66-1.23) | |||||||||||
4i | TATTCGCGCA | 12.6/9.7 | 21.9/19.1 | 11.4/11.4 | 1.10 (0.91-1.33) | |||||||||
4j | GGTCCGGGTA | 7.5/7.8 | 0.99 (0.67-1.45) | |||||||||||
Global test: χ2 = 4.85 with 10 df, P = 0.901 |
. | Haplotype . | Haplotype frequency (%)* . | . | . | . | . | Odds ratio† (95% confidence interval), all group combined . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
. | . | AA . | NH . | JA . | LA . | WH . | . | |||||||
. | . | Case/control . | Case/control . | Case/control . | Case/control . | Case/control . | . | |||||||
Block 1 | ||||||||||||||
1a | TGGA | 47.3/42.0 | 72.3/74.3 | 65.5/64.4 | 74.7/74.5 | 85.0/82.2 | Reference | |||||||
1b | TTAA | 22.8/24.6 | 5.7/6.8 | 6.0/4.5 | 9.9/8.8 | 10.4/10.9 | 0.93 (0.79-1.08) | |||||||
1c | CGGA | 14.6/16.0 | 0.84 (0.64-1.09) | |||||||||||
1d | CGAA | 11.6/12.9 | 0.86 (0.65-1.12) | |||||||||||
1e | CGAT | 17.8/15.0 | 24.3/25.7 | 9.0/11.3 | 0.90 (0.76-1.06) | |||||||||
Global test: χ2 = 4.39 with 5 df, P = 0.494 | ||||||||||||||
Block 2 | ||||||||||||||
2a | GCGCA | 55.9/60.2 | 51.7/53.7 | 51.8/55.0 | 51.7/48.1 | 52.0/51.2 | Reference | |||||||
2b | GCGGA | 5.4/6.7 | 0.85 (0.58-1.25) | |||||||||||
2c | GCGGG | 29.6/26.1 | 23.7/26.7 | 17.2/17.3 | 12.5/12.5 | 21.8/23.4 | 1.01 (0.89-1.15) | |||||||
2d | GCCGG | 14.7/10.5 | 24.5/22.3 | 9.9/11.5 | 1.06 (0.89-1.26) | |||||||||
2e | ACGCA | 5.9/6.1 | 6.2/4.8 | 6.7/7.9 | 10.9/8.1 | 1.17 (0.95-1.43) | ||||||||
2f | AAGCA | 5.8/3.7 | 18.2/19.6 | 13.8/14.2 | 1.03 (0.86-1.24) | |||||||||
Global test: χ2 = 5.38 with 6 df, P = 0.496 | ||||||||||||||
Block 3 | ||||||||||||||
3a | TAGACTCGCA | 41.9/43.7 | 49.2/50.2 | 51.8/52.6 | 47.1/42.9 | 45.3/45.1 | Reference | |||||||
3b | TAGACTCTCA | 5.4/3.0 | 5.2/4.1 | 17.5/18.5 | 12.3/12.1 | 1.06 (0.88-1.28) | ||||||||
3c | TAGACACGCA | 12.4/14.8 | 0.78 (0.58-1.05) | |||||||||||
3d | TAGACACGGA | 6.0/6.9 | 0.78 (0.52-1.17) | |||||||||||
3e | TAGAGTCGCA | 3.3/5.2 | 7.6/8.8 | 12.7/11.2 | 1.02 (0.83-1.26) | |||||||||
3f | TAGGCACGGG | 13.5/9.7 | 22.4/21.8 | 1.09 (0.89-1.34) | ||||||||||
3g | TAGGCAAGGG | 6.4/8.3 | 0.83 (0.59-1.18) | |||||||||||
3h | TGGACACGCA | 7.2/7.6 | 0.90 (0.68-1.19) | |||||||||||
3i | TGGACACGGA | 11.0/9.4 | 20.6/20.7 | 20.6/20.9 | 5.6/6.2 | 12.0/11.4 | 1.01 (0.87-1.17) | |||||||
3j | TGAACACGGA | 5.7/4.0 | 5.3/4.8 | 6.6/5.9 | 1.16 (0.90-1.50) | |||||||||
3k | CAGGCACGGA | 5.1/6.9 | 0.74 (0.48-1.13) | |||||||||||
Global test: χ2 = 10.71 with 11 df, P = 0.468 | ||||||||||||||
Block 4 | ||||||||||||||
4a | GGCCCTGACA | 26.7/26.3 | 37.4/33.6 | 25.6/25.1 | 47.4/47.5 | 40.0/40.0 | Reference | |||||||
4b | GGTCCGGGCA | 14.2/14.5 | 13.5/15.4 | 18.3/19.1 | 10.2/9.4 | 13.6/14.5 | 0.97 (0.83-1.13) | |||||||
4c | GGTCCGGGCG | 7.5/7.2 | 0.91 (0.67-1.23) | |||||||||||
4d | GGTCCTGACA | 5.6/6.3 | 0.94 (0.64-1.36) | |||||||||||
4e | GATCCGCGCA | 21.5/23.9 | 4.7/8.1 | 10.0/9.7 | 11.9/10.9 | 0.93 (0.79-1.11) | ||||||||
4f | GATCCGCGCG | 4.0/5.6 | 9.3/9.3 | 0.85 (0.65-1.12) | ||||||||||
4g | GATCTGCGCG | 6.5/5.5 | 17.1/17.9 | 19.4/20.5 | 1.00 (0.83-1.21) | |||||||||
4h | GACCCTGACA | 6.5/5.7 | 0.90 (0.66-1.23) | |||||||||||
4i | TATTCGCGCA | 12.6/9.7 | 21.9/19.1 | 11.4/11.4 | 1.10 (0.91-1.33) | |||||||||
4j | GGTCCGGGTA | 7.5/7.8 | 0.99 (0.67-1.45) | |||||||||||
Global test: χ2 = 4.85 with 10 df, P = 0.901 |
Abbreviations: AA, African American; NH, native Hawaiian; JA, Japanese; LA, Latina; WH, White; df, degree of freedom.
Only haplotypes ≥ 5% among cases or controls are shown.
Adjusted for age and ethnicity.
Discussion
To date, our study is the largest study to evaluate the genetic contribution of IGF-I to breast cancer risk. The main strength of this study is the thorough and systematic approach for studying common variation in IGF-I: direct sequencing of exons for putative functional SNPs and genotyping a dense set of variants for the noncoding regions. We found no evidence of an association between inherited variation at IGF-I and breast cancer risk. Our power calculation indicates that our sample of 3,577 individuals provides 80% power (assuming a log additive model, a two-sided test and using α = 0.001 to account for multiple testing) to detect an odds ratios of 1.5 for SNP with 5% allele frequency or for haplotypes predicted with Rh2 ≥ 0.9 and frequency as low as 5%.
This is the first study that used SNPs in assessing association between IGF-I genetic variation and breast cancer risk. Five published studies of IGF-I and breast cancer have focused only on the (CA)n repeat polymorphism (3-7). The (CA)19 allele, the most common allele in various populations, has been associated with increased risk in two studies (3, 6), whereas one study reported an inverse association (7), and two other studies (including ours) observed no association (4, 5). In our previous study, we found that the (CA)19 allele was not associated with breast cancer among postmenopausal women (odds ratios, 1.2; 95% confidence interval, 0.8-1.8; ref. 5). This study was based on 400 cases and 400 controls from four racial-ethnic groups: African American, Japanese, Latino, and White from the Multiethnic Cohort.
Our results suggest that common germ line variation in IGF-I is not a major contributor to breast cancer risk in this multiethnic cohort. Whether other genes in IGF pathway (e.g., IGF-2 and IGFBPs) have a more important role or whether the combined effects of several modest associations within this pathway have greater cumulative effect on breast cancer susceptibility remains to be determined. Future work will include evaluation of whether IGF-2, IGF2R, IGFBP-1, and IGFBP-3 genetic variations are associated with breast cancer risk.
Grant support: National Cancer Institute grants CA54281 and CA63464.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Acknowledgments
We thank Noel Burtt, Loreall Pooler, David Wong, and Stephanie Riley for their laboratory assistance; and Elena Giorgi and Hank Huang for their technical support; and the participants of the Multiethnic Cohort Study for their participation and commitment to the study.