Background: Although the role of miRNA in cancer development and progression has been well established, the association between genetic variants in miRNA biogenesis pathway genes and breast cancer risk has been yet unclear.

Methods: We analyzed data from two genome-wide association studies conducted in East Asian women including 5,066 cases and 4,337 controls. Among the single-nucleotide polymorphisms (SNP), which were directly genotyped or imputed, we selected 237 SNPs in 32 genes involved in miRNA biogenesis pathway and its regulation.

Results: Although eight SNPs were nominally associated with breast cancer risk in combined samples (P < 0.05), none of them were significant after adjustment for multiple comparisons.

Conclusions: The common genetic variants in miRNA biogenesis pathway genes may not be associated with breast cancer risk.

Impact: This study suggests no association between the polymorphisms in miRNA biogenesis pathway genes and breast cancer risk. Studies with large sample size and more genetic variants should be warranted to adequately evaluate the potential association. Cancer Epidemiol Biomarkers Prev; 21(8); 1385–7. ©2012 AACR.

MiRNAs are a major class of noncoding RNAs that posttranscriptionally modulate gene expression in a sequence specific manner. The role of miRNAs in human cancer pathogenesis has been well established by the identification of aberrant expression of miRNAs in many types of cancer (1). There is increasing evidence that the genetic variants in miRNA genes, in their biogenesis pathway genes and binding sites of target mRNA are associated with cancer risk and survival (2). However, the association between the polymorphisms of miRNA biogenesis pathway genes and breast cancer risk is uncertain. The previous study focusing on this pathway was conducted in small sample size and the coverage of genes of interests was limited (3). More recently, the results on the effects of genetic variants in miRNA biogenesis pathway genes were inconsistent even in the same cancer type (4). We comprehensively evaluated the common variants in miRNA biogenesis pathway genes by analyzing data from the 2 previous genome-wide association studies (GWAS) conducted in women of Asian ancestry consisted of 5,066 breast cancer cases and 4,337 controls.

Detailed methods for the Seoul Breast Cancer Study and the Shanghai Breast Cancer Study have been published elsewhere (5, 6) and described in brief in Supplementary Table S1 (3). This study included 5,066 histologically confirmed breast cancer cases and 4,337 controls [(2,190/2,052) Koreans and (2,876/2,285) Chinese]. The protocol was approved by the Institutional Review Board, and all participants provided written informed consent.

The 35 genes (ADAR, ADARB1, DDX5, DDX17, DDX20, DGCR8, DICER1, DROSHA, EIF2C1, EIF2C2, GEMIN4, HNRNPA1, ILF2, ILF3, KHSRP, LIN28A, NANOG, PAPD4, PIWIL1, RAN, SMAD1, SMAD2, SMAD3, SMAD4, SMAD6, SMAD7, SNIP1, SRRT, SRSF1, TARBP, TRIM32, TRIM71, XPO5, XRN2, and ZCCHC11) were selected on the basis of their biologic role in miRNA biogenesis as determined by the combination of literature review (7) and the bioinformatics tool the Gene Ontology (Supplementary Table S2). Using the data from the HapMap Project and web-based single-nucleotide polymorphism (SNP) selection tool (8), we selected haplotype tagging SNPs defined by the linkage disequilibrium with the pairwise r2 ≥ 0.60, in the region from 10 kb upstream and 5 kb downstream of the largest cDNA of each gene with minor allele frequency (MAF) of ≥0.05 in Asian populations. For 3 genes (TARBP, DDX5, and SRSF1) among the 35 genes, no SNP was included because of low MAF or low imputation quality (RSQR < 0.3, DDX5 rs1991401 and SRSF1 rs2233911). A total of 247 SNPs (26 genotyped and 221 imputed) located in 32 genes were included. For individual study, OR and 95% confidence intervals (CI) were estimated using logistic regression implemented in PLINK (genotyped SNPs) and MACH2DAT (imputed SNPs) with adjustment for age. We carried out meta-analysis using a weighted z-score method under the fixed effect model implemented in METAL (9). Cochran Q statistics were used to assess the heterogeneity across the studies.

Of the 247 SNPs, 8 SNPs (DDX20 rs3754025, EIF2C2 rs7834784 and rs6578126, PIWIL1 rs10848087, DICER1rs12432281, SMAD6 rs17264185 and rs7170982, and XRN2 rs10485627) were associated with breast cancer risk (Pmeta < 0.050, Table 1). The most significant association was observed in intronic SNP rs7834784 in EIF2C2 with ORper-allele as 1.15 (95% CI, 1.02–1.29; Pmeta = 0.021). Study specific estimates of both studies were generally similar. However, these nominally significant associations were not significant when accounting for multiple comparisons. The results of all the SNPs evaluated were presented in Supplementary Table S3.

Table 1.

Association between the SNPs with a Pmeta < 0.050 in the genes of interests and breast cancer risk

Seoul Breast Cancer Study (2,190 cases and 2,052 controls)Shanghai Breast Cancer Study (2,876 cases and 2,285 controls)Combined (5,066 cases and 4,337 controls)
SNPGenomic locationa/positiona, bpGene/predicted functionAllelesbEAFcPer-allele OR (95% CI)dRSQRePtrendEAFcPer-allele OR (95% CI)dRSQRePtrendPer-allele OR (95% CI)PmetaPhetf
rs3754025 1p13.2/112104760 DDX20/intronic A/T 0.10 0.94 (0.81–1.08) 1.00 0.37 0.07 0.86 (0.74–1.00) 1.00 0.05 0.90 (0.81–1.00) 0.05 0.45 
rs7834784 8q24.3/141693661 EIF2C2/intronic C/T 0.08 1.29 (1.08–1.54) 0.80 0.01 0.09 1.06 (0.91–1.23) 0.80 0.48 1.15 (1.02–1.29) 0.02 0.10 
rs6578126 8q24.3/141704788 EIF2C2/intronic A/G 0.06 0.77 (0.64–0.93) 0.99 0.01 0.05 0.98 (0.82–1.16) 0.99 0.81 0.79 (0.77–1.00) 0.04 0.07 
rs10848087 12q24.33/129405118 PIWIL1/coding A/G 0.09 0.82 (0.70–0.96) 0.91 0.01 0.10 0.96 (0.84–1.10) 0.90 0.56 0.90 (0.81–0.99) 0.04 0.13 
rs12432281 14q32.13/94674881 DICER1/5 UTR A/G 0.30 0.94 (0.83–1.06) 0.58 0.30 0.34 0.89 (0.80–0.99) 0.58 0.03 0.91 (0.84–0.98) 0.02 0.51 
rs17264185 15q22.31/64784141 SMAD6/intronic A/G 0.83 1.12 (0.99–1.26) 0.89 0.08 0.85 1.09 (0.97–1.21) 1.00 0.14 1.10 (1.01–1.19) 0.02 0.74 
rs7170982 15q22.31/64843093 SMAD6/intronic G/T 0.35 1.09 (1.00–1.20) 0.96 0.06 0.32 1.05 (0.96–1.14) 0.98 0.29 1.07 (1.00–1.14) 0.04 0.52 
rs10485627 20p11.22/21266977 XRN2/intronic A/C 0.82 0.88 (0.79–0.99) 1.00 0.03 0.79 0.93 (0.85–1.03) 0.98 0.17 1.10 (0.85–0.98) 0.01 0.45 
Seoul Breast Cancer Study (2,190 cases and 2,052 controls)Shanghai Breast Cancer Study (2,876 cases and 2,285 controls)Combined (5,066 cases and 4,337 controls)
SNPGenomic locationa/positiona, bpGene/predicted functionAllelesbEAFcPer-allele OR (95% CI)dRSQRePtrendEAFcPer-allele OR (95% CI)dRSQRePtrendPer-allele OR (95% CI)PmetaPhetf
rs3754025 1p13.2/112104760 DDX20/intronic A/T 0.10 0.94 (0.81–1.08) 1.00 0.37 0.07 0.86 (0.74–1.00) 1.00 0.05 0.90 (0.81–1.00) 0.05 0.45 
rs7834784 8q24.3/141693661 EIF2C2/intronic C/T 0.08 1.29 (1.08–1.54) 0.80 0.01 0.09 1.06 (0.91–1.23) 0.80 0.48 1.15 (1.02–1.29) 0.02 0.10 
rs6578126 8q24.3/141704788 EIF2C2/intronic A/G 0.06 0.77 (0.64–0.93) 0.99 0.01 0.05 0.98 (0.82–1.16) 0.99 0.81 0.79 (0.77–1.00) 0.04 0.07 
rs10848087 12q24.33/129405118 PIWIL1/coding A/G 0.09 0.82 (0.70–0.96) 0.91 0.01 0.10 0.96 (0.84–1.10) 0.90 0.56 0.90 (0.81–0.99) 0.04 0.13 
rs12432281 14q32.13/94674881 DICER1/5 UTR A/G 0.30 0.94 (0.83–1.06) 0.58 0.30 0.34 0.89 (0.80–0.99) 0.58 0.03 0.91 (0.84–0.98) 0.02 0.51 
rs17264185 15q22.31/64784141 SMAD6/intronic A/G 0.83 1.12 (0.99–1.26) 0.89 0.08 0.85 1.09 (0.97–1.21) 1.00 0.14 1.10 (1.01–1.19) 0.02 0.74 
rs7170982 15q22.31/64843093 SMAD6/intronic G/T 0.35 1.09 (1.00–1.20) 0.96 0.06 0.32 1.05 (0.96–1.14) 0.98 0.29 1.07 (1.00–1.14) 0.04 0.52 
rs10485627 20p11.22/21266977 XRN2/intronic A/C 0.82 0.88 (0.79–0.99) 1.00 0.03 0.79 0.93 (0.85–1.03) 0.98 0.17 1.10 (0.85–0.98) 0.01 0.45 

Abbreviation: EAF, effect allele frequency.

aLocation based on NCBI Human Genome Build 36.3. Hg18.

bEffect/reference alleles.

cEffect allele frequency in controls.

dAdjusted for age.

eImputation R-square.

fP for heterogeneity between both studies was calculated using the Cochran Q test.

In this study, SNPs in miRNA biogenesis pathway genes were not associated with breast cancer risk. We could not exclude the possibility that the lack of association could come from the differences in subject characteristics, the interactions of environmental modifiers, and other uncontrolled bias between both studies. We also could not replicate the SNPs shown in previous report (3) although highly correlated proxies were included aside from SNPs in DDX20. Considering the sample size of the previous report, it could be a chance finding and at least need to be further investigated in other populations. The strengths of this study are its large sample size and comprehensive search of genes of interests and the high coverage of known SNPs. With the current sample size, for a SNP with MAF as 0.26 (average value of all tested SNPs), the minimum log-additive OR that we can detect at 99% power is 1.22, considering the Bonferroni-corrected significance level (P = 1E-04). Future genomic scan including several kinds of structural variations, copy number variants, new variants identified by high-resolution sequencing with larger sample size, as well as subtype specific analysis, could elucidate further information on the potential association.

No potential conflicts of interest were disclosed.

Conception and design: H. Sung, S.K. Park, D. Kang

Development of methodology: H. Sung, D. Kang

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): H. Sung, K.-Y. Yoo, D.-Y. Noh, W. Zheng

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): H. Sung, B. Zhang, Y.-J. Choi, J. Long, S.K. Park, S.-H. Ahn, D. Kang

Writing, review, and/or revision of the manuscript: H. Sung, B. Zhang, Y.-J. Choi, S.K. Park, D.-Y. Noh, D. Kang

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): D.-Y. Noh, W. Zheng

Study supervision: D. Kang, W. Zheng

This research was supported by BRL (Basic Research Laboratory) program through the National Research Foundation of Korea funded by the Ministry of Education, Science and Technology (2012-0000347).

1.
Spizzo
R
,
Nicoloso
MS
,
Croce
CM
,
Calin
GA
. 
SnapShot: microRNAs in cancer
.
Cell
2009
;
137
:
586
586.e1
.
2.
Ryan
BM
,
Robles
AI
,
Harris
CC
. 
Genetic variation in microRNA networks: the implications for cancer research
.
Nat Rev Cancer
2010
;
10
:
389
402
.
3.
Sung
H
,
Lee
KM
,
Choi
JY
,
Han
S
,
Lee
JY
,
Li
L
, et al
Common genetic polymorphisms of microRNA biogenesis pathway genes and risk of breast cancer: a case-control study in Korea
.
Breast Cancer Res Treat
2011
;
130
:
939
51
.
4.
Permuth-Wey
J
,
Chen
Z
,
Tsai
YY
,
Lin
HY
,
Chen
YA
,
Barnholtz-Sloan
J
, et al
MicroRNA processing and binding site polymorphisms are not replicated in the Ovarian Cancer Association Consortium
.
Cancer Epidemiol Biomarkers Prev
2011
;
20
:
1793
7
.
5.
Kim
HC
,
Lee
JY
,
Sung
H
,
Choi
JY
,
Park
SK
,
Lee
KM
, et al
A genome-wide association study identifies a breast cancer risk variant in ERBB4 at 2q34: results from the Seoul Breast Cancer Study
.
Breast Cancer Res
2012
;
14
:
R56
.
6.
Zheng
W
,
Long
J
,
Gao
YT
,
Li
C
,
Zheng
Y
,
Xiang
YB
, et al
Genome-wide association study identifies a new breast cancer susceptibility locus at 6q25.1
.
Nat Genet
2009
;
41
:
324
8
.
7.
Krol
J
,
Loedige
I
,
Filipowicz
W
. 
The widespread regulation of microRNA biogenesis, function and decay
.
Nat Rev Genet
2010
;
11
:
597
610
.
8.
Xu
Z
,
Taylor
JA
. 
SNPinfo: integrating GWAS and candidate gene information into functional SNP selection for genetic association studies
.
Nucleic Acids Res
2009
;
37
:
W600
5
.
9.
Willer
CJ
,
Li
Y
,
Abecasis
GR
. 
METAL: fast and efficient meta-analysis of genomewide association scans
.
Bioinformatics
2010
;
26
:
2190
1
.