Background: Recently, 41 new genetic susceptibility loci for breast cancer risk were identified in a genome-wide association study (GWAS) conducted in European descendants. Most of these risk variants have not been directly replicated in Asian populations.

Methods: We evaluated nine of those nonreplication loci in East Asians to identify new risk variants for breast cancer in these regions. First, we analyzed single-nucleotide polymorphisms (SNP) in these regions using data from two GWAS conducted among Chinese and Korean women, including 5,083 cases and 4,376 controls (stage 1). In each region, we selected an SNP showing the strongest association with breast cancer risk for replication in an independent set of 7,294 cases and 9,404 controls of East Asian descents (stage 2). Logistic regression models were used to calculate adjusted ORs and 95% confidence intervals (CI) as a measure of the association of breast cancer risk and genetic variants.

Results: Two SNPs were replicated in stage 2 at P < 0.05: rs1419026 at 6q14 [per allele OR, 1.07; 95% confidence interval (CI), 1.03–1.12; P = 3.0 × 10−4] and rs941827 at 10q25 (OR, 0.92, 95% CI, 0.89–0.96; P = 5.3 × 10−5). The association with rs941827 remained highly statistically significant after adjusting for the risk variant identified initially in women of European ancestry (OR, 0.88; 95% CI, 0.82–0.97; P = 5.3 × 10−5).

Conclusion: We identified a new breast cancer risk variant at 10q25 in East Asian women.

Impact: Results from this study improve the understanding of the genetic basis for breast cancer. Cancer Epidemiol Biomarkers Prev; 22(7); 1297–303. ©2013 AACR.

Genetic factors play a significant role in the etiology of breast cancer (1–4). To date, genome-wide association studies (GWAS) have identified approximately 67 genetic susceptibility risk loci for breast cancer (5–19). With a few exceptions (9, 13, 14, 16, 20), most susceptibility loci were initially identified in GWAS conducted in European-ancestry populations. Most, if not all, of the initially reported risk variants, in the form of single-nucleotide polymorphisms (SNPs; termed as index SNPs in subsequent text), are tagging SNPs. These SNPs were identified likely through their linkage disequilibrium (LD) with disease variants. Because LD patterns differ across populations of different ancestries, some findings from GWAS conducted in European descendants cannot be directly extrapolated to other populations (20–27). We recently evaluated all breast cancer risk variants identified to date and found that approximately half of the risk variants identified initially in European descendants cannot be directly replicated in East Asians (28). In the present study, we investigated 9 regions where the index SNP has not been replicated in Asian samples in an attempt to identify other breast cancer risk variants for East Asian women.

Study populations

This study was conducted as part of the Asia Breast Cancer Consortium (ABCC), which has been described elsewhere (9, 13, 14, 18, 22, 28). Briefly, samples analyzed in this study were from 8 epidemiologic studies in the ABCC (Table 1). Samples were from 13,642 Chinese women, 11,713 Korean women, and 802 Japanese women. Chinese participants were selected from 4 studies: the Shanghai Breast Cancer Study (SBCS), the Shanghai Breast Cancer Survival Study (SBCSS), the Shanghai Endometrial Cancer Study (SECS, controls only), and the Shanghai Women's Health Study (SWHS; refs. 9, 29, 30). Korean participants came from 4 studies: the Seoul Breast Cancer Study (SeBCS; ref. 20), Korea National Cancer Center (Korea-NCC), Korea Genome Epidemiology Study (KoGES; ref. 31), and Korean Hereditary Breast Cancer (KOHBRA; ref. 32). Japanese samples were from the Japan Nagano Breast Cancer Study (33). In total, 12,377 cases and 13,780 controls were analyzed in the present study.

Table 1.

Summary of selected characteristics of participants by studies

StudyNo. of casesNo. of controlsPopulation ethnicityStudy designaMean age,b yMenopause,c %ER+,d %
SBCGS1e (GWAS) 2,918 2,324 Chinese Population-based 52/50f 43/42 65 
SBCGS2e (stage 2) 1,613 1,800 Chinese Population-based 53/53 50/55f 62 
SBCGS3e (stage 3) 2,601 2,386 Chinese Population-based 54/55f 50/53 65 
SeBCS1 (GWAS) 2,165 2,052 Korean Hospital-based 48/51f 36/55f 63 
SeBCS2 (stage 2) 777 1,104 Korean Hospital-based 48/48 36/37 63 
KOHBRA/KoGES 1,397 3,209 Korean Hospital-based with community controls 40/50f 23/NA 63 
Korea-NCC 505 504 Korean Hospital-based 49/49 50/45 65 
Nagano 401 401 Japanese Hospital-based 54/54 55/65f 75 
Total 12,377 13,780      
StudyNo. of casesNo. of controlsPopulation ethnicityStudy designaMean age,b yMenopause,c %ER+,d %
SBCGS1e (GWAS) 2,918 2,324 Chinese Population-based 52/50f 43/42 65 
SBCGS2e (stage 2) 1,613 1,800 Chinese Population-based 53/53 50/55f 62 
SBCGS3e (stage 3) 2,601 2,386 Chinese Population-based 54/55f 50/53 65 
SeBCS1 (GWAS) 2,165 2,052 Korean Hospital-based 48/51f 36/55f 63 
SeBCS2 (stage 2) 777 1,104 Korean Hospital-based 48/48 36/37 63 
KOHBRA/KoGES 1,397 3,209 Korean Hospital-based with community controls 40/50f 23/NA 63 
Korea-NCC 505 504 Korean Hospital-based 49/49 50/45 65 
Nagano 401 401 Japanese Hospital-based 54/54 55/65f 75 
Total 12,377 13,780      

aCase–control study.

bMean age of cases–controls with available data.

cProportion of postmenopausal status of cases–controls with available data.

dAmong cases with ER data.

eIncluding cases and controls from four studies conducted in Shanghai.

fSignificant at α = 0.01 level.

Abbreviations: NA, not available; Nagano, Japan Nagano Breast Cancer Study; SBCGS, Shanghai Breast Cancer Genetic Study.

Genotyping and quality control

Stage 1 testing was conducted using existing data from 2 GWAS, wherein 5,285 Chinese women and 4,777 Korean women were genotyped primarily using the Affymetrix Genome-wide Human SNP Array 6.0. Genotyping protocols have been described elsewhere (9, 18, 20). From the Chinese GWAS, we included 1 negative control and at least 3 positive quality control (QC) samples from the Coriell Cell Repositories in each of the 96-well plates for genotyping with Affymetrix SNP Array 6.0 chips. A total of 273 positive QC samples were successfully genotyped; the average concordance rate was 99.9% with a median value of 100%. Genetically identical and unexpected duplicate samples were excluded, as they were close relatives with a pair-wise proportion of identify-by-descent (IBD) estimate greater than 0.25. All samples with a call rate less than 95% were excluded. SNPs were excluded if: (i) minor allele frequency (MAF) < 1%; (ii) call rate < 95%; or (iii) genotyping concordance rate < 95% in QC samples. The final dataset included 2,918 cases and 2,324 controls for 690,947 markers. For the Korean GWAS, the Affymetrix SNP Array 6.0 was used (20). A total of 30 QC samples were successfully genotyped; the average concordance rate was 99.8%. SNPs were excluded if: (i) genotype call rate < 95%; (ii) MAF < 1% in either cases or controls; (iii) evidence for deviation from Hardy–Weinberg equilibrium (HWE) at P < 10−6; or (iv) poor genotyping cluster plot in either cases or controls. After QC filtering, the final dataset included 2,165 cases and 2,052 controls for 555,525 markers. All samples from both studies were genetically confirmed to be females.

We used the program MACH 1.0 (34) to impute genotypes for autosomal SNPs in HapMap Phase II release 22 for samples from the Chinese and Korean GWAS. Only SNPs with imputation quality score RSQR ≥ 0.3 were included in subsequent analyses. Dosage data for imputed SNPs in samples from each of the stage 1 studies were analyzed using the program mach2dat (34).

We genotyped 9 selected SNPs in stage 2 using the iPLEX MassARRAY platform (Sequenom). PCR primers and allelic-specific extension primers were designed with the MassARRAY Assay Design 4.0 software, and alleles of each SNP were detected through matrix-assisted laser desorption/ionization–time-of-flight mass spectrometry on the MassARRAY Analyzer 4 System (Sequenom). In each 96-well plate, 5 QC samples were used in the Sequenom assay, including 1 negative control (water), 2 blinded duplicates, and 2 samples from the HapMap project. Mean concordance rate was 99.7% for the blind duplicates and 98.9% for the HapMap samples.

Statistical analyses

PLINK version 1.06 was used to analyze the genome-wide data obtained in stage 1 (35). Population structure was investigated by analyzing 690,947 SNPs that passed QC in Chinese women (2,918 cases and 2,324 controls) and 555,525 SNPs that passed QC in Korean women (2,165 cases and 2,052 controls), showing an estimated inflation factor λ of 1.042 for Chinese (18) and 1.043 for Koreans (20). Therefore, population substructure, if present, should not substantially affect the results of this study. Logistic regression was used to estimate breast cancer risk under a log-additive model and adjusted for age and study site, when appropriate ORs associated with each SNP and 95% confidence intervals (CI) were estimated for cancer risk. Conditional analyses were conducted by adjusting the index SNP in each locus to evaluate possible independent association of breast cancer with the SNP under study. All analyses were conducted using SAS version 9.3 (SAS Institute Inc.) unless noted otherwise.

Table 2 presents the associations of 9 index SNPs with breast cancer risk from the Chinese and Korean GWAS (stage 1). Results from the original GWAS of European-ancestry populations are shown, also. None of these index SNPs were associated with breast cancer risk at P < 0.05 in our study. To search for other possible variants in these regions which could be associated with breast cancer risk, we selected an SNP from each of these regions for further evaluation. In each region, we selected an SNP that showed the most significant association with breast cancer risk and is located within ±500kb of the index SNP of that region (Table 3).

Table 2.

Associations of index SNPs in 9 recently reported breast cancer susceptibility loci

Asian GWAS (5,083 cases and 4,376 controls)European
SNPaChr./genebPosition,c bpAllelesdEAFeOR (95% CI)PtrendEAFfOR (95% CI)fPtrendf
rs2016394 2q31/DLX2 172681217 G/A 0.19 0.98 (0.90–1.08) 0.727 0.48 0.95 (0.93–0.97) 1.2 × 10−8 
rs204247 6p23/RANBP9 13830502 G/A 0.39 0.97 (0.92–1.03) 0.373 0.43 1.05 (1.03–1.07) 8.3 × 10−9 
rs17529111 6q14/FAM46A 82185105 C/T 0.20 1.07 (0.99–1.15) 0.094 0.22 1.06 (1.04–1.09) 4.3 × 10−9 
rs720475 7q35/ARHGEF5 143705862 A/G 0.02 1.07 (0.89–1.29) 0.455 0.25 0.94 (0.92–0.96) 7.0 × 10−11 
rs11780156 8q24/MIR1208 129263823 T/C 0.22 1.00 (0.93–1.07) 0.908 0.16 1.07 (1.04–1.10) 3.4 × 10−11 
rs11814448 10p12/DNAJC1 22355849 C/A 0.01 1.31 (0.84–2.02) 0.230 0.02 1.26 (1.18–1.35) 9.3 × 10−16 
rs7904519 10q25/TCF7L2 114763917 G/A 0.04 1.01 (0.88–1.17) 0.844 0.46 1.06 (1.04–1.08) 3.1 × 10−8 
rs12575663 11q13/OVOL1 65331111 A/G 0.17 1.01 (0.93–1.09) 0.865 0.47 0.95 (0.93–0.96) 8.6 × 10−12 
rs6001930 22q13/MKL1 39206180 C/T 0.28 1.05 (0.99–1.12) 0.124 0.11 1.12(1.09–1.16) 8.8 × 10−19 
Asian GWAS (5,083 cases and 4,376 controls)European
SNPaChr./genebPosition,c bpAllelesdEAFeOR (95% CI)PtrendEAFfOR (95% CI)fPtrendf
rs2016394 2q31/DLX2 172681217 G/A 0.19 0.98 (0.90–1.08) 0.727 0.48 0.95 (0.93–0.97) 1.2 × 10−8 
rs204247 6p23/RANBP9 13830502 G/A 0.39 0.97 (0.92–1.03) 0.373 0.43 1.05 (1.03–1.07) 8.3 × 10−9 
rs17529111 6q14/FAM46A 82185105 C/T 0.20 1.07 (0.99–1.15) 0.094 0.22 1.06 (1.04–1.09) 4.3 × 10−9 
rs720475 7q35/ARHGEF5 143705862 A/G 0.02 1.07 (0.89–1.29) 0.455 0.25 0.94 (0.92–0.96) 7.0 × 10−11 
rs11780156 8q24/MIR1208 129263823 T/C 0.22 1.00 (0.93–1.07) 0.908 0.16 1.07 (1.04–1.10) 3.4 × 10−11 
rs11814448 10p12/DNAJC1 22355849 C/A 0.01 1.31 (0.84–2.02) 0.230 0.02 1.26 (1.18–1.35) 9.3 × 10−16 
rs7904519 10q25/TCF7L2 114763917 G/A 0.04 1.01 (0.88–1.17) 0.844 0.46 1.06 (1.04–1.08) 3.1 × 10−8 
rs12575663 11q13/OVOL1 65331111 A/G 0.17 1.01 (0.93–1.09) 0.865 0.47 0.95 (0.93–0.96) 8.6 × 10−12 
rs6001930 22q13/MKL1 39206180 C/T 0.28 1.05 (0.99–1.12) 0.124 0.11 1.12(1.09–1.16) 8.8 × 10−19 

ars12575663 in complete LD with index SNP rs3903072 in CEU, CHB, and JPN samples (r2 = 1.0, based on LD data from HapMap release27).

bThe closest gene.

cLocation based on NCBI Human Genome Build 36.3.

dEffect/reference alleles based on NCBI Human Genome Build 36.3, dbSNP b126 forward strand.

eEAF in controls of Asian samples.

fSummary results from the original studies in European descendants (EAF, OR, and 95% CI from iCOGS; P from combined GWAS + iCOGS).

Abbreviations: Chr., chromosome; EAF, effect allele frequency.

Table 3.

Associations of breast cancer risk with SNPs selected for this study

Stage 1 (5,083/4,376)Stage 2 (7,294/9,404)Combined (12,377/13,780)
Tested SNPaChr./genebPosition,c bpAllelesdEAFeIndex SNPCEU (r2)fCHB (r2)fImputedOR (95% CI)gPtrendgOR (95% CI)gPtrendgOR (95% CI)gPtrendg
rs788166 2q31/DLX2 172623735 C/G 0.62 rs2016394 0.01 0.06 Yes 1.09 (1.02–1.15) 0.006 1.01 (0.97–1.06) 0.605 1.04 (1.00–1.08) 0.037 
rs821287 6p23/RANBP9 13464145 G/A 0.68 rs204247 0.01 0.05 Yes 0.86 (0.77–0.95) 0.003 1.07 (1.02–1.12) 0.005 1.03 (0.99–1.08) 0.164 
rs1419026 6q14/FAM46A 82145898 T/C 0.29 rs17529111 0.29 0.56 Yes 1.07 (1.00–1.13) 0.048 1.08 (1.02–1.13) 0.003 1.07 (1.03–1.12) 3.0 × 10−4 
rs10278902 7q35/ARHGEF5 143320582 G/A 0.59 rs720475 Yes 0.94 (0.88–1.00) 0.034 1.06 (1.01–1.11) 0.014 1.01 (0.98–1.05) 0.434 
rs2608036 8q24/MYC 129194161 G/A 0.82 rs11780156 0.07 0.05 No 1.10 (1.02–1.19) 0.014 1.04 (0.98–1.10) 0.192 1.06 (1.01–1.11) 0.011 
rs16921849 10p12/DNAJC1 21873162 G/A 0.34 rs11814448 0.01 0.01 Yes 1.07 (1.00–1.14) 0.036 0.98 (0.93–1.02) 0.342 1.01 (0.97–1.05) 0.621 
rs941827 10q25/TCF7L2 114548877 C/T 0.29 rs7904519 0.01 0.02 No 0.88 (0.82–0.93) 5.0 × 10−5 0.95 (0.90–1.00) 0.049 0.92 (0.89–0.96) 5.3 × 10−5 
rs500161 11q13/OVOL1 65452014 T/C 0.51 rs12575663 0.56 0.11 Yes 1.05 (0.99–1.12) 0.084 0.99 (0.95–1.04) 0.761 1.01 (0.98–1.05) 0.425 
rs138019 22q13/MKL1 38941574 G/A 0.92 rs6001930 0.02 0.03 Yes 0.86 (0.77–0.97) 0.009 1.04 (0.96–1.14) 0.334 0.98 (0.91–1.04) 0.464 
Stage 1 (5,083/4,376)Stage 2 (7,294/9,404)Combined (12,377/13,780)
Tested SNPaChr./genebPosition,c bpAllelesdEAFeIndex SNPCEU (r2)fCHB (r2)fImputedOR (95% CI)gPtrendgOR (95% CI)gPtrendgOR (95% CI)gPtrendg
rs788166 2q31/DLX2 172623735 C/G 0.62 rs2016394 0.01 0.06 Yes 1.09 (1.02–1.15) 0.006 1.01 (0.97–1.06) 0.605 1.04 (1.00–1.08) 0.037 
rs821287 6p23/RANBP9 13464145 G/A 0.68 rs204247 0.01 0.05 Yes 0.86 (0.77–0.95) 0.003 1.07 (1.02–1.12) 0.005 1.03 (0.99–1.08) 0.164 
rs1419026 6q14/FAM46A 82145898 T/C 0.29 rs17529111 0.29 0.56 Yes 1.07 (1.00–1.13) 0.048 1.08 (1.02–1.13) 0.003 1.07 (1.03–1.12) 3.0 × 10−4 
rs10278902 7q35/ARHGEF5 143320582 G/A 0.59 rs720475 Yes 0.94 (0.88–1.00) 0.034 1.06 (1.01–1.11) 0.014 1.01 (0.98–1.05) 0.434 
rs2608036 8q24/MYC 129194161 G/A 0.82 rs11780156 0.07 0.05 No 1.10 (1.02–1.19) 0.014 1.04 (0.98–1.10) 0.192 1.06 (1.01–1.11) 0.011 
rs16921849 10p12/DNAJC1 21873162 G/A 0.34 rs11814448 0.01 0.01 Yes 1.07 (1.00–1.14) 0.036 0.98 (0.93–1.02) 0.342 1.01 (0.97–1.05) 0.621 
rs941827 10q25/TCF7L2 114548877 C/T 0.29 rs7904519 0.01 0.02 No 0.88 (0.82–0.93) 5.0 × 10−5 0.95 (0.90–1.00) 0.049 0.92 (0.89–0.96) 5.3 × 10−5 
rs500161 11q13/OVOL1 65452014 T/C 0.51 rs12575663 0.56 0.11 Yes 1.05 (0.99–1.12) 0.084 0.99 (0.95–1.04) 0.761 1.01 (0.98–1.05) 0.425 
rs138019 22q13/MKL1 38941574 G/A 0.92 rs6001930 0.02 0.03 Yes 0.86 (0.77–0.97) 0.009 1.04 (0.96–1.14) 0.334 0.98 (0.91–1.04) 0.464 

aOne SNP was selected in each locus for the study.

bThe closest gene to corresponding index SNP in each region.

cLocation based on NCBI Human Genome Build 36.3.

dEffect/reference alleles were defined on the basis of NCBI Human Genome Build 36.3, dbSNP 126 forward strand.

eEffect allele frequency in controls of stage 2 studies.

fOn the basis of HapMap genotype data release 27, dbSNP building 126.

gAdjusted for age and study site.

Abbreviations: Chr., chromosome; EAF, effect allele frequency.

Stage 1 and 2 results for the 9 selected SNPs are presented in Table 3. SNPs rs1419026 at 6q14 and rs941827 at 10q25 showed an association with breast cancer risk at P < 0.05 in the same direction in both stages. In the combined analysis of 12,377 breast cancer cases and 13,780 control women, per allele ORs were 1.07 (95% CI, 1.03–1.12; P = 3.0 × 10−4) and 0.92 (95% CI, 0.89–0.96; P = 5.3 × 10−5) for rs1419026 and rs941827, respectively. The association of rs1419026 with breast cancer risk was, in general, consistent across participating studies and heterogeneity test was not statistically significant (P = 0.675). The association of breast cancer risk with rs941827 was substantially stronger in the Nagano study (OR, 0.72; 95% CI, 0.58–0.89; P = 0.002) than other 7 studies combined (OR, 0.93; 95% CI, 0.89–0.97; P = 3.8 × 10−4; Pheterogeneity = 0.027). After excluding the Nagano study, the heterogeneity test was no longer statistically significant (P = 0.103).

Conditional analyses for rs1419026 and rs941827 were conducted by adjusting the index SNP in each of these loci. These analyses were conducted using stage 1 samples with data available for both new and index SNPs. The association with rs941827 at 10q25 remained statistically significant after adjusting for index SNP rs7904519 (OR, 0.88; 95% CI, 0.82–0.93; P = 5.3 × 10−5; data now shown in tables). However, the significant association with rs1419026 disappeared after adjusting for its index SNP rs17529111 (OR, 1.05; 95% CI, 0.96–1.15; P = 0.293). SNPs rs1419026 and rs941827 were associated with both estrogen receptor (ER)+ and ER− cancer (Table 4).

Table 4.

Associations of rs1419026 and rs941827 with breast cancer risk, stratified by ER status

ER-positiveER-negative
SNP (locus)Cases/ControlsOR (95% CI)aPtrendaCases/ControlsOR (95% CI)aPtrenda
rs1419026 (6q14) 6,864/13,570 1.06 (1.01–1.11) 0.018 3,801/13,570 1.10 (1.04–1.17) 5.2 × 10−4 
rs941827 (10q25) 6,851/13,561 0.92 (0.87–0.96) 4.2 × 10−4 3,790/13,561 0.91 (0.86–0.97) 0.002 
ER-positiveER-negative
SNP (locus)Cases/ControlsOR (95% CI)aPtrendaCases/ControlsOR (95% CI)aPtrenda
rs1419026 (6q14) 6,864/13,570 1.06 (1.01–1.11) 0.018 3,801/13,570 1.10 (1.04–1.17) 5.2 × 10−4 
rs941827 (10q25) 6,851/13,561 0.92 (0.87–0.96) 4.2 × 10−4 3,790/13,561 0.91 (0.86–0.97) 0.002 

aAdjusted for age and study site.

Nominally significant associations were also observed for 2 other SNPs (rs821287 and rs10278902) in stage 2. However, the direction of the association for these 2 SNPs was inconsistent in stages 1 and 2, and thus these 2 SNPs were considered not being replicated in this study.

In this large study conducted in East Asian women, we identified a new genetic risk variant for breast cancer at 10q25, a breast cancer susceptibility locus identified recently in a GWAS of European descendants (19). We also found an SNP (rs1419026) at 6q14 that showed a stronger association with breast cancer risk in East Asians than the index SNP (rs17529111) initially identified in this locus in a GWAS conducted among European descendants. Our study has expanded the list of breast cancer risk variants identified for East Asian women and provides data that might be useful in fine-mapping GWAS-identified regions to identify causal variants for this common malignancy.

The index SNP rs7904519 at 10q25 was not replicated in our study. The risk allele frequency is very low in East Asian women (0.045) compared with European descendants (0.405). In the present study, we found a significant association of breast cancer risk with SNP rs941827, with an effective allele frequency of 0.26 in East Asian women and 0.29 in Europeans. These 2 SNPs are not correlated (r2 < 0.03 in either CEU or CHB + JPT samples). The index SNP (rs7904519) is located in intron 3 of the TCF7L2 gene (NM_030756). The SNP identified in our study (rs941827) is located in intron 7 of the vesicle transport through interaction with t-SNAREs homolog 1A (yeast) (VTI1A) gene (NM_145206), approximately 215 kb upstream of rs7904519. The VT11A gene encodes a soluble N-ethylmaleimide–sensitive fusion protein attachment protein receptor (SNARE) that mediates the transport of vesicles between the Golgi apparatus and the plasma membrane (36, 37). The potential role of the VTI1A protein in breast carcinogenesis remains unknown. Intriguingly, SNPs rs941827 and rs7904519 are included in a 420- to 540-kb region that has been found to be deleted in some breast and colorectal cancer samples (38, 39). This deletion causes a VTI1A-TCF7L2 fusion, which may affect the regulatory function of the TCF/β-catenin complex on the Wnt signaling (40). Further studies are needed to clarify the mechanism of the association of VTI1A variants and breast cancer risk identified in this study.

In our study, some imputed SNPs could not be investigated properly because of their low imputation quality in stage 1. Some of the index SNPs evaluated in stage 1 showed an association in the same direction as reported previously in the European-ancestry study, although the association was not statistically significant perhaps due to a small sample size. In addition, we selected only one SNP per locus for stage 2 replication because of budget constraints. It is possible that additional risk variants may be located in some of these regions and can be further investigated in the future. Nevertheless, using data from East Asian women, we identified one new genetic risk variant at 10q25 for breast cancer. We also identified a risk variant at 6q14 that showed a stronger association with breast cancer in East Asians than the index SNP initially discovered in this region in a GWAS conducted in European descendants. These results are new and should be helpful for future studies to understand the genetic basis for breast cancer.

No potential conflicts of interest were disclosed.

The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding agencies.

Conception and design: J. Shi, W. Lu, J.-Y. Choi, B.-T. Ji, Y. Zheng, D.-Y. Noh, C.-Y. Shen, D. Kang, W. Zheng

Development of methodology: B.-T. Ji, W. Zheng

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): J. Shi, H. Sung, J.-Y. Choi, Y.-B. Xiang, M. Iwasaki, B.-T. Ji, S.K. Park, Y. Zheng, S. Tsugane, K.-Y. Yoo, W. Wang, D.-Y. Noh, W. Han, S.-W. Kim, C.-Y. Shen, K. Matsuo, S.H. Ahn, Y.-T. Gao, X.-O. Shu, Q. Cai, D. Kang, W. Zheng

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): J. Shi, B. Zhang, M.Y. Kim, M. Iwasaki, J. Long, B.-T. Ji, S.K. Park, C.-Y. Shen, W. Zheng

Writing, review, and/or revision of the manuscript: J. Shi, H. Sung, B. Zhang, Y.-B. Xiang, M. Iwasaki, B.-T. Ji, S. Tsugane, C.-Y. Shen, Y.-T. Gao, X.-O. Shu, Q. Cai, D. Kang, W. Zheng

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): J. Shi, B. Zhang, Y.-B. Xiang, B.-T. Ji, Y. Zheng, D.-Y. Noh, S.-W. Kim, M.H. Lee, J.W. Lee, J.-Y. Lee, K. Matsuo, Y.-T. Gao, X.-O. Shu, W. Zheng

Study supervision: W. Lu, B.-T. Ji, K.-Y. Yoo, D.-Y. Noh, S.-W. Kim, C.-Y. Shen, X.-O. Shu, D. Kang, W. Zheng

The authors thank participants and research staff for this study; and Regina Courtney, Jie Wu, Jing He, Mary Jo Daly, and Bethanie Rammer for their help with sample preparation and technical support for the project at Vanderbilt.

The work for this project at Vanderbilt was supported primarily by U.S. NIH grants R01CA124558, R01CA148667, R01CA64277, R37CA070867, and U19 CA148065, as well as Ingram Professorship and Research Reward funds from Vanderbilt University. Sample preparation and genotyping assays at Vanderbilt were conducted at the Survey and Biospecimen Shared Resources and Vanderbilt Microarray Shared Resource, which are supported in part by Vanderbilt-Ingram Cancer Center (P30 CA68485). Funding for the constituent studies was provided by U.S. NIH grants (R01CA124558, R01CA148667, R01CA64277, R37CA070867, R01CA118229, R01CA092585, R01CA122756, R01 CA137013, R01CA63464, R01CA54281, and CA132839); U.S. Department of Defense Breast Cancer Research Program (BC011118 and BC050791); National Research Foundation, Ministry of Education, Science and Technology, National Biobank of Korea; and National R&D Program of the Republic of Korea (2011-0001564, 1020350, 0620410-1, 2012-0000347, and KOBB-2011-03); and Japanese Ministry of Health, Labor and Welfare and Ministry of Education (17015049 and 221S0001).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Lichtenstein
P
,
Holm
NV
,
Verkasalo
PK
,
Iliadou
A
,
Kaprio
J
,
Koskenvuo
M
, et al
Environmental and heritable factors in the causation of cancer–analyses of cohorts of twins from Sweden, Denmark, and Finland
.
N Engl J Med
2000
;
343
:
78
85
.
2.
Peto
J
,
Mack
TM
. 
High constant incidence in twins and other relatives of women with breast cancer
.
Nat Genet
2000
;
26
:
411
4
.
3.
Nathanson
KL
,
Wooster
R
,
Weber
BL
. 
Breast cancer genetics: what we know and what we need
.
Nat Med
2001
;
7
:
552
6
.
4.
Zhang
B
,
Beeghly-Fadiel
A
,
Long
J
,
Zheng
W
. 
Genetic variants associated with breast-cancer risk: comprehensive research synopsis, meta-analysis, and epidemiological evidence
.
Lancet Oncol
2011
;
12
:
477
88
.
5.
Easton
DF
,
Pooley
KA
,
Dunning
AM
,
Pharoah
PD
,
Thompson
D
,
Ballinger
DG
, et al
Genome-wide association study identifies novel breast cancer susceptibility loci
.
Nature
2007
;
447
:
1087
93
.
6.
Hunter
DJ
,
Kraft
P
,
Jacobs
KB
,
Cox
DG
,
Yeager
M
,
Hankinson
SE
, et al
A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer
.
Nat Genet
2007
;
39
:
870
4
.
7.
Stacey
SN
,
Manolescu
A
,
Sulem
P
,
Rafnar
T
,
Gudmundsson
J
,
Gudjonsson
SA
, et al
Common variants on chromosomes 2q35 and 16q12 confer susceptibility to estrogen receptor-positive breast cancer
.
Nat Genet
2007
;
39
:
865
9
.
8.
Stacey
SN
,
Manolescu
A
,
Sulem
P
,
Thorlacius
S
,
Gudjonsson
SA
,
Jonsson
GF
, et al
Common variants on chromosome 5p12 confer susceptibility to estrogen receptor-positive breast cancer
.
Nat Genet
2008
;
40
:
703
6
.
9.
Zheng
W
,
Long
J
,
Gao
YT
,
Li
C
,
Zheng
Y
,
Xiang
YB
, et al
Genome-wide association study identifies a new breast cancer susceptibility locus at 6q25.1
.
Nat Genet
2009
;
41
:
324
8
.
10.
Ahmed
S
,
Thomas
G
,
Ghoussaini
M
,
Healey
CS
,
Humphreys
MK
,
Platte
R
, et al
Newly discovered breast cancer susceptibility loci on 3p24 and 17q23.2
.
Nat Genet
2009
;
41
:
585
90
.
11.
Thomas
G
,
Jacobs
KB
,
Kraft
P
,
Yeager
M
,
Wacholder
S
,
Cox
DG
, et al
A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1)
.
Nat Genet
2009
;
41
:
579
84
.
12.
Turnbull
C
,
Ahmed
S
,
Morrison
J
,
Pernet
D
,
Renwick
A
,
Maranian
M
, et al
Genome-wide association study identifies five new breast cancer susceptibility loci
.
Nat Genet
2010
;
42
:
504
7
.
13.
Long
J
,
Cai
Q
,
Shu
XO
,
Qu
S
,
Li
C
,
Zheng
Y
, et al
Identification of a functional genetic variant at 16q12.1 for breast cancer risk: results from the Asia Breast Cancer Consortium
.
PLoS Genet
2010
;
6
:
e1001002
.
14.
Cai
Q
,
Long
J
,
Lu
W
,
Qu
S
,
Wen
W
,
Kang
D
, et al
Genome-wide association study identifies breast cancer risk variant at 10q21.2: results from the Asia Breast Cancer Consortium
.
Hum Mol Genet
2011
;
20
:
4991
9
.
15.
Fletcher
O
,
Johnson
N
,
Orr
N
,
Hosking
FJ
,
Gibson
LJ
,
Walker
K
, et al
Novel breast cancer susceptibility locus at 9q31.2: results of a genome-wide association study
.
J Natl Cancer Inst
2011
;
103
:
425
35
.
16.
Haiman
CA
,
Chen
GK
,
Vachon
CM
,
Canzian
F
,
Dunning
A
,
Millikan
RC
, et al
A common variant at the TERT-CLPTM1L locus is associated with estrogen receptor-negative breast cancer
.
Nat Genet
2011
;
43
:
1210
4
.
17.
Ghoussaini
M
,
Fletcher
O
,
Michailidou
K
,
Turnbull
C
,
Schmidt
MK
,
Dicks
E
, et al
Genome-wide association analysis identifies three new breast cancer susceptibility loci
.
Nat Genet
2012
;
44
:
312
8
.
18.
Long
J
,
Cai
Q
,
Sung
H
,
Shi
J
,
Zhang
B
,
Choi
JY
, et al
Genome-wide association study in east Asians identifies novel susceptibility loci for breast cancer
.
PLoS Genet
2012
;
8
:
e1002532
.
19.
Michailidou
K
,
Hall
P
,
Gonzalez-Neira
A
,
Ghoussaini
M
,
Dennis
J
,
Milne
RL
, et al
Large-scale genotyping identifies 41 new loci associated with breast cancer risk
.
Nat Genet
2013
;
45
:
353
61
.
20.
Kim
HC
,
Lee
JY
,
Sung
H
,
Choi
JY
,
Park
SK
,
Lee
KM
, et al
A genome-wide association study identifies a breast cancer risk variant in ERBB4 at 2q34: results from the Seoul Breast Cancer Study
.
Breast Cancer Res
2012
;
14
:
R56
.
21.
Zheng
W
,
Cai
Q
,
Signorello
LB
,
Long
J
,
Hargreaves
MK
,
Deming
SL
, et al
Evaluation of 11 breast cancer susceptibility loci in African-American women
.
Cancer Epidemiol Biomarkers Prev
2009
;
18
:
2761
4
.
22.
Cai
Q
,
Wen
W
,
Qu
S
,
Li
G
,
Egan
KM
,
Chen
K
, et al
Replication and functional genomic analyses of the breast cancer susceptibility locus at 6q25.1 generalize its importance in women of Chinese, Japanese, and European ancestry
.
Cancer Res
2011
;
71
:
1344
55
.
23.
Chen
F
,
Chen
GK
,
Millikan
RC
,
John
EM
,
Ambrosone
CB
,
Bernstein
L
, et al
Fine-mapping of breast cancer susceptibility loci characterizes genetic risk in African Americans
.
Hum Mol Genet
2011
;
20
:
4491
503
.
24.
Hutter
CM
,
Young
AM
,
Ochs-Balcom
HM
,
Carty
CL
,
Wang
T
,
Chen
CT
, et al
Replication of breast cancer GWAS susceptibility loci in the Women's Health Initiative African American SHARe Study
.
Cancer Epidemiol Biomarkers Prev
2011
;
20
:
1950
9
.
25.
Chen
J
,
Jiang
Y
,
Liu
X
,
Qin
Z
,
Dai
J
,
Jin
G
, et al
Genetic variants at chromosome 9p21, 10p15 and 10q22 and breast cancer susceptibility in a Chinese population
.
Breast Cancer Res Treat
2012
;
132
:
741
6
.
26.
Huo
D
,
Zheng
Y
,
Ogundiran
TO
,
Adebamowo
C
,
Nathanson
KL
,
Domchek
SM
, et al
Evaluation of 19 susceptibility loci of breast cancer in women of African ancestry
.
Carcinogenesis
2012
;
33
:
835
40
.
27.
Sueta
A
,
Ito
H
,
Kawase
T
,
Hirose
K
,
Hosono
S
,
Yatabe
Y
, et al
A genetic risk predictor for breast cancer using a combination of low-penetrance polymorphisms in a Japanese population
.
Breast Cancer Res Treat
2012
;
132
:
711
21
.
28.
Zheng
W
,
Zhang
B
,
Cai
Q
,
Sung
H
,
Michailidou
K
,
Shi
J
, et al
Common genetic determinants of breast cancer risk in East Asian women: a collaborative study of 23,637 breast cancer cases and 25,280 controls
.
Hum Mol Genet
2013
;
22
:
2539
50
.
29.
Zheng
W
,
Chow
WH
,
Yang
G
,
Jin
F
,
Rothman
N
,
Blair
A
, et al
The Shanghai Women's Health Study: rationale, study design, and baseline characteristics
.
Am J Epidemiol
2005
;
162
:
1123
31
.
30.
Shu
XO
,
Zheng
Y
,
Cai
H
,
Gu
K
,
Chen
Z
,
Zheng
W
, et al
Soy food intake and breast cancer survival
.
JAMA
2009
;
302
:
2437
43
.
31.
Cho
YS
,
Go
MJ
,
Kim
YJ
,
Heo
JY
,
Oh
JH
,
Ban
HJ
, et al
A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits
.
Nat Genet
2009
;
41
:
527
34
.
32.
Han
SA
,
Park
SK
,
Ahn
SH
,
Lee
MH
,
Noh
DY
,
Kim
LS
, et al
The Korean Hereditary Breast Cancer (KOHBRA) study: protocols and interim report
.
Clin Oncol (R Coll Radiol)
2011
;
23
:
434
41
.
33.
Itoh
H
,
Iwasaki
M
,
Hanaoka
T
,
Kasuga
Y
,
Yokoyama
S
,
Onuma
H
, et al
Serum organochlorines and breast cancer risk in Japanese women: a case-control study
.
Cancer Causes Control
2009
;
20
:
567
80
.
34.
Li
Y
,
Willer
CJ
,
Ding
J
,
Scheet
P
,
Abecasis
GR
. 
MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes
.
Genet Epidemiol
2010
;
34
:
816
34
.
35.
Purcell
S
,
Neale
B
,
Todd-Brown
K
,
Thomas
L
,
Ferreira
MA
,
Bender
D
, et al
PLINK: a tool set for whole-genome association and population-based linkage analyses
.
Am J Hum Genet
2007
;
81
:
559
75
.
36.
Kreykenbohm
V
,
Wenzel
D
,
Antonin
W
,
Atlachkine
V
,
von Mollard
GF
. 
The SNAREs vti1a and vti1b have distinct localization and SNARE complex partners
.
Eur J Cell Biol
2002
;
81
:
273
80
.
37.
Flowerdew
SE
,
Burgoyne
RD
. 
A VAMP7/Vti1a SNARE complex distinguishes a non-conventional traffic route to the cell surface used by KChIP1 and Kv4 potassium channels
.
Biochem J
2009
;
418
:
529
40
.
38.
Stephens
PJ
,
McBride
DJ
,
Lin
ML
,
Varela
I
,
Pleasance
ED
,
Simpson
JT
, et al
Complex landscapes of somatic rearrangement in human breast cancer genomes
.
Nature
2009
;
462
:
1005
10
.
39.
Bass
AJ
,
Lawrence
MS
,
Brace
LE
,
Ramos
AH
,
Drier
Y
,
Cibulskis
K
, et al
Genomic sequencing of colorectal adenocarcinomas identifies a recurrent VTI1A-TCF7L2 fusion
.
Nat Genet
2011
;
43
:
964
8
.
40.
Clevers
H
,
Nusse
R
. 
Wnt/beta-catenin signaling and disease
.
Cell
2012
;
149
:
1192
205
.