Background:

It is unclear if established breast cancer risk factors exert similar causal effects across hormone receptor breast cancer subtypes. We estimated and compared causal estimates of height, body mass index (BMI), type 2 diabetes, age at menarche, age at menopause, breast density, alcohol consumption, regular smoking, and physical activity across these subtypes.

Methods:

We used a two-sample Mendelian randomization approach and selected genetic instrumental variables from large-scale genome-wide association studies. Publicly available summary-level Breast Cancer Association Consortium data (n = 247,173; 133,384 cases, 113,789 controls) for the following subtypes were included: luminal A–like (45,253 cases), luminal B–/HER2-negative–like (6,350 cases), luminal B–like (6,427 cases), HER2-enriched (2,884 cases), and triple-negative (8,602 cases). We employed multiple Mendelian randomization methods to evaluate the strength of causal evidence for each risk factor–subtype association.

Results:

Collectively, our analyses indicated that increased height and decreased BMI are probable causal risk factors for all five subtypes. For the other risk factors, the strength of evidence for causal effects differed across subtypes. Heterogeneity in the magnitude of causal effect estimates for age at menopause and breast density was explained by null findings for triple-negative tumors. Regular smoking was the sole risk factor for which there was no evidence of a causal effect on any subtype.

Conclusions:

This study suggests that established breast cancer risk factors differ across hormone receptor subtypes.

Impact:

Our results are valuable for the development of primary prevention strategies, improvement of breast cancer risk stratification in the general population, and identification of novel breast cancer risk factors.

Previous case–control and cohort studies provide some evidence for breast cancer subtype–specific risk factors, including reproductive factors (13), body mass index (BMI; refs. 2, 4, 5), and alcohol consumption (2, 6, 7), although reported associations are inconsistent across studies (see Supplementary Table S1 for a detailed overview). A likely explanation for this inconsistency is the relatively small number of cases for rarer subtypes, such as HER2-enriched and triple-negative breast cancer. In general, breast cancer studies that collected both risk factor data and detailed pathology data for a large number of women are limited. A second challenge is that results from observational studies are often subject to bias due to (residual) confounding, measurement error, and reverse causation (8). As a result, it remains unclear whether previous results reflect causal associations with breast cancer subtypes.

Mendelian randomization (MR) is a specific type of instrumental variable (IV) analysis that minimizes the risk of these biases through the use of germline genetic variants, provided that certain assumptions are valid (8). Previous MR studies on breast cancer risk supported causal associations for height (9), BMI (10), age at menarche (11), age at menopause (12), breast density (13), and physical activity (14) but not for type 2 diabetes (T2D; ref. 15), alcohol consumption, and ever having smoked regularly (16, 17). Because of the heterogeneity of breast cancer, also within estrogen receptor (ER)–defined subtypes, it is essential to assess whether these associations with hormone receptor subtypes are causal. Thus far, only a handful of MR analyses have included data on these biologically more homogeneous subtypes (1823). However, the validity of their findings is currently unclear, as most previous MR studies did not perform a full assessment of the different MR assumptions (see Supplementary Table S2 for a detailed overview). Consequently, a comprehensive evaluation of causal evidence for breast cancer subtype–specific risk factors is still lacking.

Therefore, the aim of this study was to estimate and compare the causal effects of nine established risk factors, including anthropometric, reproductive, and behavioral exposures, across five hormone receptor breast cancer subtypes using a two-sample MR design.

Study design: Two-sample MR

We used a two-sample MR (RRID:SCR_019010) study design and summary-level data for both the risk factors and outcomes of interest. Specifically, we extracted summary statistics for genetic variant–risk factor and genetic variant–breast cancer subtype associations from different genome-wide association studies (GWAS), which are described in more detail below. All included GWAS conducted comprehensive quality control of the genetic data. To yield valid causal estimates, selected genetic IVs should meet the following MR assumptions: (i) IVs are robustly associated with the risk factors of interest (relevance assumption), (ii) IVs are not associated with confounders of the studied associations (independence assumption), and (iii) IVs do not affect the risk of breast cancer subtypes through mechanisms that do not involve the risk factors of interest (exclusion restriction assumption; ref. 8). An important additional assumption of two-sample MR is that the study participants included in both samples are from similar underlying populations (homogeneity assumption; ref. 24). We performed extensive secondary analyses to assess whether these assumptions were reasonable.

Data sources for genetic variant–risk factor associations

In 2020, Cancer Research UK published a list of established breast cancer risk factors based on scientific evidence up to that moment (25). From this list, along with a meta-analysis of prospective cohort studies (9), we selected risk factors for which GWAS data were available. As a result, we included the following nine breast cancer risk factors: height, BMI, T2D, age at menarche, age at menopause, percent breast density, alcohol consumption, regular smoking, and overall physical activity. We extracted data for these risk factors from the largest published GWAS, including (mostly) participants of European ancestry (12, 13, 2633) that were published before September 2021. Details for each included data source, including the percentage of female participants and age, are presented in Table 1, and details about the association models specified by each risk factor GWAS are included in Supplementary Table S3. Due to risk factor transformations that included GWAS performed, estimated MR ORs correspond to a 1 SD increase in risk factors, except for age at menarche and age at menopause, for which ORs correspond to a 1-year increase, and for T2D and smoking, for which ORs correspond to a unit increase in the log odds.

Table 1.

Details about the setting and participants of included risk factor and breast cancer data sources.

SettingParticipantsEstimated maximum overlap with outcome data source (%)
Ref.NMeta-analysisQuantification traitPopulation-based studies (%)Recruitment periodAncestryFemale participants (%)Age, years (mean ± SD)
Risk factors (unit) 
 Height (m) (27693,529 Yes (GIANT & UKB) NR NR, but 456,426 UKB participants were included: ∼66% NR European GIANT: 43.83% (28GIANT: Ranged from 29.3 ± 4.2 to 75.7 ± 3.4 across individual studies (280% 
 BMI (kg/m2) (27681,275 Yes (GIANT & UKB) NR NR, but 456,426 UKB participants were included: ∼67% NR European GIANT: 56.59% (29GIANT: Ranged from 18.9 ± 0.6 to 75.7 ± 3.4 across individual studies (291.5% 
 T2D (yes/no) (301,407,282: 228,499 cases and 1,178,783 controls Yes (MVP, DIAMANTE Consortium, Penn Medicine Biobank, Pakistan Genomic Resource, Biobank Japan, Malmö Diet and Cancer Study, Medstar, and PennCath) Based on ≥2 T2D-related ICD-9-CM diagnosis codes in electronic health care records, antidiabetic treatment, HbA1c or glucose levels, self-report MVPa: 100% MVPa: January 2011–October 2016 
  • Multiancestry

  • 79.2% European; n = 1,114,458 (148,726 cases and 965,732 controls)

 
MVPa: 7.2% MVPa: 68.2 ± 13.8 0.8% 
 Age at menarche (years) (31329,345 Yes (ReproGen Consortium, 23andMe, and UKB) Self-report NR NR European 100% Ranged from 15.8 ± 0.6 to 76.3 ± 5.5 across individual studies 16.3% 
 Age at menopause (years) (12496,151b Yes (ReproGen Consortium, BCAC, UKB, and 23andMe) Self-report; defined as the age at last naturally occurring menstrual period followed by at least 12 consecutive months of amenorrhea NR NR European 100% Ranged from 55.1 ± 5.4 to 76.6 ± 5.6 across individual studiesc 9.8% 
 Breast density (%) (1324,192 Yes (Hologic and GE studies) Using a computer-assisted method to measure mammographic density [software: Cumulus 6 (26)] 100% 2004–2013 European (non-Hispanic White) 100% 
  • Hologic: 61.9 ± 8.6

  • GE: 59.2 ± 8.9

 
0% 
 Alcohol consumption (drinks/week) (32941,280 Yes (24 studies) Self-report; defined as the average number of drinks a participant reported drinking each week, aggregated across all types of alcohol NR NR European ∼50% NR 1.0% 
 Smoking behavior (ever regularly smoked; yes/no) (321,232,091 Yes (25 studies) Self-report NR NR European ∼50% NR 0.8% 
 Physical activity (Metabolic Equivalent of Task score) (3391,105 No (only UKB) Wrist-worn accelerometer-measured physical activity during a 7-day period 100% 2013–2015 (collection of accelerometer data) European 56.1% NR for study population, but manuscript reports age range for UKB: 45–80 years 0% 
Outcomes 
 Breast cancer and five hormone receptor subtypesd (34
  • In total: 247,173

  • 133,384 cases

  • 113,789 controls

  • Subtype-specific analyses: 197,755

  • 45,253 luminal A–like cases, 6,350 luminal B–/HER2-negative–like cases, 6,427 luminal B–like cases, 2,884 HER2-enriched cases, 8,602 triple-negative cases, 91,477 controls

 
Yes (82 studies; 81 for the subtype-specific analyses) 
  • Mostly based on health care records and cancer registries; a few studies used self-reported data

  • Hormone receptor subtypes were defined based on ER, PR, HER2, and grade of the tumor. Methods to assess these tumor markers differed across individual studies and included clinical/pathology records and IHC of whole tumor sections of tissue microarrays

 
  • 57% for complete sample

  • Cases: 52%

  • Controls: 63%

 
  • BCAC formed in 2005

  • The recruitment period of individual BCAC studies ranging from 1947 to 2017

 
European 100% 
  • 56.5 ± 12.2 for complete sample

  • Cases: 56.6 ± 12.2

  • Controls: 56.4 ± 12.2

 
N/A 
SettingParticipantsEstimated maximum overlap with outcome data source (%)
Ref.NMeta-analysisQuantification traitPopulation-based studies (%)Recruitment periodAncestryFemale participants (%)Age, years (mean ± SD)
Risk factors (unit) 
 Height (m) (27693,529 Yes (GIANT & UKB) NR NR, but 456,426 UKB participants were included: ∼66% NR European GIANT: 43.83% (28GIANT: Ranged from 29.3 ± 4.2 to 75.7 ± 3.4 across individual studies (280% 
 BMI (kg/m2) (27681,275 Yes (GIANT & UKB) NR NR, but 456,426 UKB participants were included: ∼67% NR European GIANT: 56.59% (29GIANT: Ranged from 18.9 ± 0.6 to 75.7 ± 3.4 across individual studies (291.5% 
 T2D (yes/no) (301,407,282: 228,499 cases and 1,178,783 controls Yes (MVP, DIAMANTE Consortium, Penn Medicine Biobank, Pakistan Genomic Resource, Biobank Japan, Malmö Diet and Cancer Study, Medstar, and PennCath) Based on ≥2 T2D-related ICD-9-CM diagnosis codes in electronic health care records, antidiabetic treatment, HbA1c or glucose levels, self-report MVPa: 100% MVPa: January 2011–October 2016 
  • Multiancestry

  • 79.2% European; n = 1,114,458 (148,726 cases and 965,732 controls)

 
MVPa: 7.2% MVPa: 68.2 ± 13.8 0.8% 
 Age at menarche (years) (31329,345 Yes (ReproGen Consortium, 23andMe, and UKB) Self-report NR NR European 100% Ranged from 15.8 ± 0.6 to 76.3 ± 5.5 across individual studies 16.3% 
 Age at menopause (years) (12496,151b Yes (ReproGen Consortium, BCAC, UKB, and 23andMe) Self-report; defined as the age at last naturally occurring menstrual period followed by at least 12 consecutive months of amenorrhea NR NR European 100% Ranged from 55.1 ± 5.4 to 76.6 ± 5.6 across individual studiesc 9.8% 
 Breast density (%) (1324,192 Yes (Hologic and GE studies) Using a computer-assisted method to measure mammographic density [software: Cumulus 6 (26)] 100% 2004–2013 European (non-Hispanic White) 100% 
  • Hologic: 61.9 ± 8.6

  • GE: 59.2 ± 8.9

 
0% 
 Alcohol consumption (drinks/week) (32941,280 Yes (24 studies) Self-report; defined as the average number of drinks a participant reported drinking each week, aggregated across all types of alcohol NR NR European ∼50% NR 1.0% 
 Smoking behavior (ever regularly smoked; yes/no) (321,232,091 Yes (25 studies) Self-report NR NR European ∼50% NR 0.8% 
 Physical activity (Metabolic Equivalent of Task score) (3391,105 No (only UKB) Wrist-worn accelerometer-measured physical activity during a 7-day period 100% 2013–2015 (collection of accelerometer data) European 56.1% NR for study population, but manuscript reports age range for UKB: 45–80 years 0% 
Outcomes 
 Breast cancer and five hormone receptor subtypesd (34
  • In total: 247,173

  • 133,384 cases

  • 113,789 controls

  • Subtype-specific analyses: 197,755

  • 45,253 luminal A–like cases, 6,350 luminal B–/HER2-negative–like cases, 6,427 luminal B–like cases, 2,884 HER2-enriched cases, 8,602 triple-negative cases, 91,477 controls

 
Yes (82 studies; 81 for the subtype-specific analyses) 
  • Mostly based on health care records and cancer registries; a few studies used self-reported data

  • Hormone receptor subtypes were defined based on ER, PR, HER2, and grade of the tumor. Methods to assess these tumor markers differed across individual studies and included clinical/pathology records and IHC of whole tumor sections of tissue microarrays

 
  • 57% for complete sample

  • Cases: 52%

  • Controls: 63%

 
  • BCAC formed in 2005

  • The recruitment period of individual BCAC studies ranging from 1947 to 2017

 
European 100% 
  • 56.5 ± 12.2 for complete sample

  • Cases: 56.6 ± 12.2

  • Controls: 56.4 ± 12.2

 
N/A 

Abbreviations: ICD-9-CM, International Classification of Diseases, Ninth Revision, Clinical Modification; MVP, Million Veteran Program; N/A, not applicable; NR, not reported; UKB, UK Biobank.

a

Information was not reported for other studies included in the GWAS meta-analysis.

b

For our MR analyses, we included betas from the combined analysis (meta-analysis and 23andMe).

c

Age details for the 23andMe sample were not reported.

d

Details that were not included in the BCAC GWAS article were kindly provided by Zhang (first author; ref. 34).

Data source for genetic variant–breast cancer subtype associations

We extracted publicly available summary-level data from the largest Breast Cancer Association Consortium (BCAC) GWAS to date for total breast cancer (n = 247,173; 133,384 cases, 113,789 controls) and the five hormone receptor breast cancer subtypes: luminal A–like (45,253 cases), luminal B–/HER2-negative–like (6,350 cases), luminal B–like (6,427 cases), HER2-enriched (2,884 cases), and triple-negative breast cancer (8,602 cases; ref. 34). These subtypes were classified based on tumor grade, ER, progesterone receptor (PR), and HER2 status as follows: luminal A–like (ER+ and/or PR+ and HER2 and grade 1 or 2), luminal B–/HER2-negative–like (ER+ and/or PR+ and HER2 and grade 3), luminal B–like (ER+ and/or PR+ and HER2+), HER2-enriched (ER, PR, HER2+), or triple-negative (ER, PR, HER2). The same set of controls (91,477 controls) was used for all subtype-specific GWAS analyses. Participants included in this BCAC GWAS were all female and of European ancestry. All GWAS that generated the used summary-level BCAC data received ethical approval from qualified institutional boards, and all study participants provided informed consent, mostly in accordance with the Declaration of Helsinki and, in a few studies, in accordance with the U.S. Common Rule. Additional details about the study population are included in Table 1. Genetic variant–total breast cancer and genetic variant–subtype associations were estimated using standard logistic regression models and two-stage polytomous logistic models, respectively. Accordingly, the number of cases per hormone receptor subtype represents the effective number of cases per subtype; see the Supplementary Note of Zhang and colleagues (34) for details. We estimated the maximum percentage of overlap with the selected risk factor GWAS based on the number of individuals from studies that were included in both risk factor and BCAC GWAS analyses. For this calculation, we used data provided in the supplemental information of risk factor GWAS and similar details about our BCAC summary statistics (kindly provided by Dr. Haoyu Zhang). We calculated the maximum proportion of overlap by dividing the number of study participants in overlapping studies by the total number of risk factor GWAS participants. For example, between the smoking behavior GWAS and the BCAC GWAS, the only overlapping study was the Women’s Health Initiative. The smoking behavior GWAS included 17,868 Women’s Health Initiative study participants, while the BCAC GWAS included 9,553 participants. Consequently, the maximum percentage overlap of study participants would be (9,553/total smoking behavior GWAS sample size) × 100%.

Selection of risk factor–specific genetic IVs

From each risk factor GWAS, we selected genome-wide significant genetic variants (P < 5 × 10−8). For height and BMI, an even more stringent P value threshold (P < 1 × 10−8) was used by the GWAS authors (27). The T2D GWAS (30) was the only data source that included a trans-ethnic population. To avoid confounding due to population stratification (8), we only included genetic variants that reached genome-wide significance in European ancestry–specific T2D analyses. For genetic variants that were not available in the BCAC GWAS summary statistics, we searched for proxy genetic variants [linkage disequilibrium (LD) r2 ≥ 0.8] using the NIH LDlink API implemented in the LDlinkR R package (35). To maximize statistical power to detect causal effects, we did not exclude genetic variants based on pairwise correlation (i.e., LD) for our primary analyses but instead accounted for variant correlation in the analysis. Prior to conducting MR analyses, we performed harmonization of alleles and effect estimates between the risk factor and BCAC GWAS using the TwoSampleMR R package (36). At this step, we excluded palindromic genetic variants with intermediate allele frequencies (i.e., adenine/thymine or cytosine/guanine) genetic variants with an effect allele frequency ranging from 0.40 to 0.60) because the harmonization of these specific variants between different data sources can be error-prone. In addition, genetic variants that were not available in the 1000G phase III reference panel were excluded during harmonization. Supplementary Figure S1 presents an overview of this selection process.

Statistical analyses

All analyses were performed using R version 4.0.5 (https://www.R-project.org/). Prior to performing our primary MR analyses, we set out to calculate LD matrices for all genetic variants for the specific risk factors using the ld_matrix function implemented in the TwoSampleMR R package. However, due to the substantial proportion of highly correlated genetic variants for height and BMI, the correlation matrices that we calculated for these risk factors were near-singular. Therefore, we used a previously published method (37) that performs unscaled principal component analysis on a weighted version of the genetic correlation matrix instead. This method results in transformed values for the genetic variants–risk factor and genetic variants–outcome associations and a transformed correlation matrix. We included these transformed objects in our primary inverse-variance weighted (IVW) analyses.

Primary analysis

We employed the IVW method using a multiplicative random-effects model to calculate primary MR estimates for all nine risk factors in relation to each breast cancer subtype. For these analyses, we included genetic variant–risk factor estimates from the largest GWAS sample available. Consequently, we used estimates from sex-combined analyses for the risk factors height, BMI, T2D, smoking behavior, alcohol consumption, and physical activity. If analyses from conditional and joint GWAS analyses (i.e., analyses that identify index and independent secondary genetic variants) were available, we used these estimates to weigh genetic variant–risk factor associations (Supplementary Table S3). We performed post hoc power calculations for subtype-specific associations using a publicly available web application (https://sb452.shinyapps.io/power/; ref. 38). Specifically, we estimated statistical power within each subtype to detect a causal effect estimate equal to the magnitude of the causal effect estimate that we observed for overall breast cancer, as this estimate would be the most accurate under the null hypothesis that there is no heterogeneity across subtypes (see Supplementary Table S4 for used parameters). In addition to estimating causal effects of each risk factor on total breast cancer and breast cancer subtypes, we calculated the I2 index to quantify heterogeneity (%) in primary MR estimates across subtypes. We calculated I2 estimates through meta-analysis of the five subtype-specific effect estimates per risk factor using random-effects models implemented in the metafor R package (39). MR estimates for the different subtypes were not formally independent, as they were calculated using the same set of controls. However, because the case populations were different, the correlation resulting from this overlap is expected to be minimal. As our results suggested consistently different effect estimates for triple-negative tumors, we also calculated heterogeneity in subtype-specific estimates after exclusion of this subtype as a post hoc analysis.

Secondary analyses

Uncorrelated genetic variants as genetic instruments

We additionally performed IVW analyses restricted to uncorrelated genetic variants (LD r2 ≤ 0.001). Our rationale for this was twofold: direct comparison with previously published MR estimates for breast cancer and, direct comparison with robust MR analyses, which are not all extended for the inclusion of correlated IVs.

Robust methods to assess MR assumptions

To yield valid causal estimates, genetic IVs have to meet the relevance, independence, and exclusion restriction assumptions. The selection of genetic IVs based on the genome-wide significance threshold is an accepted approach to ensure that genetic variants meet the relevance assumption (40). To quantify the strength of included genetic IVs, we calculated F-statistics based on r2 (i.e., the variance explained in the respective risk factor) estimated in independent study populations, if available (Supplementary Table S5). We performed the following robust MR methods to check how consistent our findings were under less stringent assumptions about the pleiotropic effects of the included genetic IVs: MR-Egger regression (41), weighted median (42), mode-based estimator (43), and MR-PRESSO (44). Altogether, the results of these different methods give some insight into the plausibility of the exclusion restriction assumption. A comprehensive overview of each method’s assumptions, strengths, weaknesses, and statistical power was previously published (45). Based on previous MR findings about breast cancer (11), we also performed multivariable MR analyses for BMI and age at menarche. We performed two separate multivariable MR analyses: The first included summary-level data for BMI from the GWAS by Yengo and colleagues (27), and the second included data from the largest BMI GWAS in the UK Biobank by Elsworth (https://gwas.mrcieu.ac.uk/datasets/ukb-b-19953/). The reason for this was that ∼25% of the genetic IVs for age at menarche were missing in the most recent BMI GWAS (27), whereas all variants were available in the UK Biobank GWAS summary statistics (https://gwas.mrcieu.ac.uk/datasets/ukb-b-19953/).

Female-specific genetic variant weights to meet the homogeneity assumption

In addition to these three fundamental assumptions, the homogeneity assumption should be valid in two-sample MR studies (24). However, the BCAC GWAS only included women, whereas six of the nine selected risk factor GWAS included both females and males (Table 1). Accordingly, the homogeneity assumption is not met by design. To assess potential bias because of this violation, we conducted secondary analyses in which we replaced the betas of the genetic variants that reached genome-wide significance in sex-combined analyses with female-specific betas. For height and BMI, we extracted female-specific betas from a previous GIANT GWAS (46, 47). Of the 3,146 genetic variants for height, only 3,002 genetic variants were available in this female-specific data source. For the remaining 144 genetic variants, we included weights from sex-combined GWAS analyses. For physical activity, we received female-specific betas from the GWAS authors (33). For T2D, alcohol consumption, and smoking behavior, female-specific estimates were not publicly available.

Evaluation of strength of evidence for causal effects

We ultimately combined results from our primary and secondary MR analyses to evaluate the strength of evidence for causal effects of each risk factor on each breast cancer subtype. For this evaluation, we used the following recently proposed definition (48): Evidence was considered to be “consistent” if all performed MR methods presented a P value < 0.05; evidence was considered to be “concordant” if at least one method (primary or secondary analysis) had a P value < 0.05 and the direction of the effect estimate was concordant across all methods; evidence was considered to be “inconsistent” if at least one method had a P value < 0.05, but the direction of the effect estimates differed across methods; and evidence was considered to be “inadequate” if all MR methods had a P value ≥ 0.05. We used the same classification as the mentioned reference, but different labels for the categories that we believe are more reasonable.

Data availability

All R codes, including details on package versions, that were used to generate our results are available at https://github.com/SchmidtGroupNKI/MR_BCsubtypes.

Descriptive statistics data sources

Details about the setting and participants included in each risk factor GWAS and the breast cancer subtype GWAS are presented in Table 1. The total sample size of the included GWAS ranged from 24,192 to 1,232,091 individuals for breast density and smoking behavior, respectively. Except for the T2D GWAS, all data sources were restricted to European study populations (T2D GWAS: 79.2% European). GWAS for height, BMI, T2D, alcohol consumption, regular smoking, and physical activity included both females and males. For the latter three risk factors, ∼50% of the study subjects were female; for the anthropometric-related risk factors, details on biological sex were insufficiently reported. The age distribution of study participants included in the breast cancer subtype GWAS and risk factor GWAS was similar with a reported mean age of more than 55 years. We estimated the maximum overlap in study participants between the risk factor and BCAC GWAS to range from 0% (height, breast density, and physical activity) to 16.3% (age at menarche).

Causal effects of established breast cancer risk factors across breast cancer subtypes

Causal effect estimates (OR per 1 SD increase for all risk factors but per 1 year increase for age at menarche and age at menopause or per unit increase in the log odds for T2D and smoking) for each of the nine breast cancer risk factors across the five hormone receptor breast cancer subtypes are presented in Fig. 1 and Table 2. Statistical power estimates corresponding to our primary MR estimates are presented in Supplementary Table S4. In general, we observed that IVW estimates for luminal A–like and luminal B–/HER2-negative–like subtypes were very similar to IVW estimates for overall breast cancer. Heterogeneity across subtype-specific causal effects was not due to opposite causal effect estimates but due to stronger estimates or the absence of an effect for some subtypes.

Figure 1.

Causal breast cancer subtype–specific effect estimates per unit increase for nine established breast cancer risk factors. Presented ORs and 95% CIs were calculated using the IVW method, including correlated variants, which was taken into account through the inclusion of a transformed LD matrix. ORs correspond to a 1 SD increase for all risk factors. However, ORs for age at menarche and age at menopause correspond to a 1-year increase, and ORs for T2D and smoking correspond to a unit increase in the log odds. The gray vertical dotted line indicates an OR of 1.00 (i.e., absence of a causal association).

Figure 1.

Causal breast cancer subtype–specific effect estimates per unit increase for nine established breast cancer risk factors. Presented ORs and 95% CIs were calculated using the IVW method, including correlated variants, which was taken into account through the inclusion of a transformed LD matrix. ORs correspond to a 1 SD increase for all risk factors. However, ORs for age at menarche and age at menopause correspond to a 1-year increase, and ORs for T2D and smoking correspond to a unit increase in the log odds. The gray vertical dotted line indicates an OR of 1.00 (i.e., absence of a causal association).

Close modal
Table 2.

Subtype-specific causal effect estimates per unita increase for all nine breast cancer risk factors across primary and secondary MR analyses.

Risk factorOutcomePrimary IVW analysisSecondary IVW analysisMR-EggerWeighted medianWeighted modeMR-PRESSOb
OR95% CIOR95% CIOR95% CIP value interceptcOR95% CIOR95% CIOR95% CI
Height All BCAC breast cancer cases 1.03 (1.00–1.05) 1.05 (1.02–1.07) 1.04 (1.00–1.08) 0.69 1.02 (0.99–1.05) 1.01 (0.98–1.04) 1.02 (1.00–1.04) 
 Luminal A–like 1.04 (1.01–1.07) 1.07 (1.03–1.10) 1.06 (1.02–1.11) 0.87 1.04 (1.00–1.08) 1.04 (1.00–1.08) 1.05 (1.02–1.08) 
 Luminal B–/HER2-negative–like 1.02 (0.98–1.06) 1.06 (1.02–1.10) 1.06 (1.00–1.12) 0.94 1.05 (0.99–1.12) 1.04 (0.98–1.11) 1.05 (1.01–1.09) 
 Luminal B–like 1.07 (1.01–1.12) 1.06 (1.02–1.11) 1.03 (0.97–1.11) 0.27 1.04 (0.97–1.13) 1.01 (0.93–1.10) 1.07 (1.02–1.12) 
 HER2-enriched 1.01 (0.94–1.08) 1.02 (0.96–1.09) 1.05 (0.96–1.15) 0.37 0.98 (0.88–1.09) 0.99 (0.89–1.10) NA NA 
 Triple-negative 1.02 (0.97–1.06) 1.03 (0.99–1.07) 1.02 (0.97–1.08) 0.64 1.02 (0.96–1.09) 1.00 (0.94–1.07) 1.03 (0.99–1.07) 
BMI (kg/m2All BCAC breast cancer cases 0.96 (0.91–1.00) 0.95 (0.91–1.00) 0.97 (0.90–1.05) 0.54 0.96 (0.91–1.02) 0.99 (0.94–1.05) 0.97 (0.93–1.00) 
 Luminal A–like 0.96 (0.91–1.01) 0.98 (0.93–1.04) 1.00 (0.91–1.10) 0.60 1.00 (0.93–1.07) 1.04 (0.97–1.11) 0.98 (0.94–1.02) 
 Luminal B–/HER2-negative–like 0.98 (0.91–1.06) 0.95 (0.89–1.03) 0.98 (0.87–1.11) 0.61 0.99 (0.88–1.11) 0.98 (0.88–1.09) 0.95 (0.88–1.02) 
 Luminal B–like 0.91 (0.84–0.99) 0.98 (0.89–1.07) 1.04 (0.89–1.2) 0.32 1.04 (0.91–1.19) 1.03 (0.91–1.17) 0.97 (0.89–1.05) 
 HER2-enriched 0.85 (0.76–0.95) 0.92 (0.82–1.04) 0.99 (0.81–1.21) 0.37 1.03 (0.84–1.25) 1.05 (0.87–1.26) 0.92 (0.81–1.03) 
 Triple-negative 1.00 (0.92–1.08) 0.98 (0.91–1.06) 1.03 (0.91–1.18) 0.32 1.03 (0.91–1.17) 0.98 (0.87–1.11) 0.98 (0.91–1.06) 
T2D All BCAC breast cancer cases 0.96 (0.93–1.00) 1.00 (0.97–1.03) 0.96 (0.90–1.02) 0.14 1.02 (0.99–1.05) 1.11 (1.06–1.17) 0.99 (0.97–1.01) 
 Luminal A–like 0.96 (0.92–1.00) 1.00 (0.96–1.03) 0.95 (0.88–1.02) 0.13 1.00 (0.97–1.04) 1.13 (1.05–1.22) 0.98 (0.95–1.01) 
 Luminal B–/HER2-negative–like 0.96 (0.91–1.02) 1.02 (0.97–1.07) 0.96 (0.87–1.05) 0.13 1.05 (0.98–1.12) 1.04 (0.95–1.14) 1.02 (0.98–1.07) 
 Luminal B–like 0.94 (0.88–1.01) 0.97 (0.92–1.03) 0.93 (0.83–1.03) 0.31 0.99 (0.90–1.09) 0.99 (0.90–1.09) 0.99 (0.94–1.04) 
 HER2-enriched 0.92 (0.83–1.02) 0.92 (0.86–0.99) 0.92 (0.79–1.07) 0.94 0.88 (0.77–1.01) 0.90 (0.76–1.05) NA NA 
 Triple-negative 0.97 (0.91–1.03) 1.01 (0.96–1.05) 1.03 (0.93–1.13) 0.65 1.07 (1.00–1.14) 1.23 (1.11–1.36) 0.99 (0.95–1.04) 
Age at menarche (years) All BCAC breast cancer cases 0.99 (0.94–1.04) 0.98 (0.94–1.02) 0.98 (0.88–1.09) 0.92 0.98 (0.94–1.02) 0.96 (0.90–1.03) 0.97 (0.94–1.00) 
 Luminal A–like 0.99 (0.94–1.05) 0.99 (0.95–1.04) 0.98 (0.86–1.11) 0.79 0.97 (0.91–1.03) 0.91 (0.82–1.01) 0.98 (0.94–1.02) 
 Luminal B–/HER2-negative–like 0.95 (0.88–1.02) 0.96 (0.90–1.03) 1.02 (0.86–1.22) 0.47 0.94 (0.85–1.03) 0.95 (0.79–1.14) 0.95 (0.90–1.02) 
 Luminal B–like 0.98 (0.90–1.07) 1.02 (0.94–1.10) 0.94 (0.77–1.15) 0.40 1.00 (0.90–1.12) 1.01 (0.83–1.22) 1.00 (0.93–1.07) 
 HER2-enriched 0.98 (0.86–1.12) 0.97 (0.87–1.07) 0.98 (0.74–1.30) 0.90 0.97 (0.83–1.14) 0.89 (0.68–1.17) NA NA 
 Triple-negative 1.00 (0.93–1.08) 0.98 (0.92–1.04) 1.02 (0.86–1.21) 0.58 1.00 (0.90–1.10) 1.02 (0.88–1.19) 0.96 (0.91–1.02) 
Age at menopause (years) All BCAC breast cancer cases 1.04 (1.02–1.06) 1.04 (1.02–1.05) 1.04 (1.00–1.07) 0.91 1.05 (1.03–1.06) 1.05 (1.03–1.07) 1.05 (1.04–1.06) 
 Luminal A–like 1.05 (1.03–1.07) 1.04 (1.03–1.06) 1.04 (1.00–1.08) 0.82 1.06 (1.04–1.08) 1.06 (1.03–1.08) 1.06 (1.04–1.07) 
 Luminal B–/HER2-negative–like 1.05 (1.02–1.08) 1.05 (1.02–1.08) 1.03 (0.97–1.08) 0.33 1.06 (1.02–1.10) 1.06 (1.01–1.10) 1.06 (1.03–1.08) 
 Luminal B–like 1.03 (0.99–1.07) 1.03 (1.00–1.06) 1.02 (0.96–1.08) 0.68 1.04 (0.99–1.08) 1.04 (0.98–1.09) 1.04 (1.01–1.07) 
 HER2-enriched 1.06 (1.01–1.11) 1.04 (1.00–1.07) 1.05 (0.98–1.14) 0.65 1.05 (0.99–1.11) 1.08 (0.98–1.18) NA NA 
 Triple-negative 1.01 (0.98–1.04) 1.00 (0.98–1.03) 1.04 (0.99–1.10) 0.09 1.03 (0.99–1.07) 1.04 (0.99–1.10) NA NA 
Breast density (%) All BCAC breast cancer cases 1.32 (0.89–1.94) 1.03 (0.68–1.57) 0.43 (0.06–3.17) 0.38 1.18 (1.00–1.39) 1.39 (1.16–1.66) 1.26 (1.04–1.51) 
 Luminal A–like 1.44 (0.95–2.20) 1.09 (0.68–1.75) 0.38 (0.04–3.49) 0.34 1.30 (1.06–1.61) 1.39 (1.06–1.82) 1.19 (0.89–1.57) 
 Luminal B–/HER2-negative–like 1.54 (1.05–2.25) 1.05 (0.71–1.54) 0.39 (0.06–2.35) 0.27 1.12 (0.80–1.56) 1.40 (0.65–3.00) 1.16 (0.83–1.63) 
 Luminal B–like 1.38 (0.87–2.17) 1.07 (0.68–1.70) 0.58 (0.06–5.56) 0.59 1.32 (0.89–1.95) 1.93 (0.94–3.93) 1.35 (0.86–2.10) 
 HER2-enriched 1.14 (0.80–1.62) 0.91 (0.57–1.47) 0.47 (0.05–4.81) 0.56 0.93 (0.56–1.53) 0.84 (0.22–3.17) 1.16 (0.73–1.82) 
 Triple-negative 0.99 (0.73–1.34) 1.01 (0.63–1.63) 0.21 (0.02–1.72) 0.13 1.18 (0.86–1.63) 1.39 (0.87–2.21) 1.19 (0.88–1.60) 
Alcohol consumption (drinks/week) All BCAC breast cancer cases 1.03 (0.84–1.26) 0.92 (0.77–1.11) 0.92 (0.66–1.28) 0.99 0.86 (0.70–1.05) 0.88 (0.73–1.05) 0.94 (0.80–1.10) 
 Luminal A–like 1.08 (0.85–1.36) 0.91 (0.73–1.14) 0.97 (0.66–1.44) 0.67 0.93 (0.72–1.21) 0.92 (0.73–1.18) 0.92 (0.75–1.13) 
 Luminal B–/HER2-negative–like 1.00 (0.73–1.37) 0.90 (0.62–1.32) 0.94 (0.5–1.79) 0.88 0.88 (0.55–1.40) 0.86 (0.56–1.33) 0.95 (0.67–1.35) 
 Luminal B–like 0.83 (0.50–1.39) 0.78 (0.54–1.13) 0.57 (0.31–1.06) 0.22 0.51 (0.30–0.86) 0.52 (0.32–0.85) 0.75 (0.54–1.04) 
 HER2-enriched 0.95 (0.56–1.62) 1.35 (0.84–2.16) 1.68 (0.76–3.72) 0.51 1.69 (0.76–3.74) 1.64 (0.77–3.49) NA NA 
 Triple-negative 0.93 (0.67–1.29) 0.91 (0.66–1.25) 0.85 (0.50–1.45) 0.76 0.81 (0.51–1.28) 0.82 (0.54–1.24) NA NA 
Smoking (ever smoked regularly; yes/no) All BCAC breast cancer cases 1.03 (0.96–1.11) 1.08 (1.01–1.16) 1.04 (0.79–1.36) 0.74 1.05 (0.98–1.13) 1.00 (0.83–1.20) 1.09 (1.03–1.15) 
 Luminal A–like 1.04 (0.95–1.13) 1.09 (1.00–1.18) 1.22 (0.87–1.71) 0.48 1.03 (0.94–1.12) 0.96 (0.75–1.22) 1.05 (0.98–1.13) 
 Luminal B–/HER2-negative–like 1.06 (0.93–1.20) 1.03 (0.91–1.16) 0.82 (0.51–1.33) 0.35 0.97 (0.83–1.15) 0.86 (0.54–1.37) NA NA 
 Luminal B–like 0.97 (0.85–1.10) 1.10 (0.97–1.25) 1.07 (0.64–1.81) 0.93 1.05 (0.87–1.27) 0.94 (0.58–1.52) NA NA 
 HER2-enriched 0.91 (0.73–1.14) 1.13 (0.93–1.37) 0.94 (0.43–2.05) 0.64 1.08 (0.82–1.42) 1.03 (0.48–2.19) NA NA 
 Triple-negative 0.96 (0.86–1.09) 1.02 (0.90–1.15) 0.81 (0.50–1.33) 0.36 0.98 (0.83–1.15) 1.05 (0.63–1.77) 1.01 (0.89–1.13) 
Physical activity All BCAC breast cancer cases 0.49 (0.23–1.06) 0.53 (0.28–1.00) 0.03 (1.23 × 10−4–5.86) 0.28 0.66 (0.46–0.93) 0.92 (0.66–1.29) 0.85 (0.34–2.05) 
 Luminal A–like 0.40 (0.13–1.24) 0.44 (0.17–1.13) 0.04 (5.13 × 10−6–321.75) 0.60 0.53 (0.33–0.85) 0.82 (0.49–1.37) 0.42 (0.00–29.22) 
 Luminal B–/HER2-negative-like 0.51 (0.26–0.99) 0.47 (0.24–0.92) 3.65 (0.01–1734.32) 0.51 0.66 (0.34–1.30) 0.69 (0.33–1.44) NA NA 
 Luminal B–like 0.47 (0.14–1.52) 0.49 (0.19–1.31) 4.06 × 10−3 (1.34 × 10−6–12.28) 0.24 0.66 (0.28–1.59) 1.21 (0.28–5.14) NA NA 
 HER2-enriched 0.59 (0.15–2.39) 0.61 (0.20–1.89) 1.79 × 10−3 (2.07 × 10−7–15.43) 0.20 0.65 (0.19–2.24) 1.26 (0.18–9.01) NA NA 
 Triple-negative 0.98 (0.26–3.74) 0.96 (0.31–3.02) 0.49 (6.44 × 10−6–36,805.73) 0.91 0.77 (0.37–1.61) 0.80 (0.36–1.78) 0.82 (0.45–1.49) 
Risk factorOutcomePrimary IVW analysisSecondary IVW analysisMR-EggerWeighted medianWeighted modeMR-PRESSOb
OR95% CIOR95% CIOR95% CIP value interceptcOR95% CIOR95% CIOR95% CI
Height All BCAC breast cancer cases 1.03 (1.00–1.05) 1.05 (1.02–1.07) 1.04 (1.00–1.08) 0.69 1.02 (0.99–1.05) 1.01 (0.98–1.04) 1.02 (1.00–1.04) 
 Luminal A–like 1.04 (1.01–1.07) 1.07 (1.03–1.10) 1.06 (1.02–1.11) 0.87 1.04 (1.00–1.08) 1.04 (1.00–1.08) 1.05 (1.02–1.08) 
 Luminal B–/HER2-negative–like 1.02 (0.98–1.06) 1.06 (1.02–1.10) 1.06 (1.00–1.12) 0.94 1.05 (0.99–1.12) 1.04 (0.98–1.11) 1.05 (1.01–1.09) 
 Luminal B–like 1.07 (1.01–1.12) 1.06 (1.02–1.11) 1.03 (0.97–1.11) 0.27 1.04 (0.97–1.13) 1.01 (0.93–1.10) 1.07 (1.02–1.12) 
 HER2-enriched 1.01 (0.94–1.08) 1.02 (0.96–1.09) 1.05 (0.96–1.15) 0.37 0.98 (0.88–1.09) 0.99 (0.89–1.10) NA NA 
 Triple-negative 1.02 (0.97–1.06) 1.03 (0.99–1.07) 1.02 (0.97–1.08) 0.64 1.02 (0.96–1.09) 1.00 (0.94–1.07) 1.03 (0.99–1.07) 
BMI (kg/m2All BCAC breast cancer cases 0.96 (0.91–1.00) 0.95 (0.91–1.00) 0.97 (0.90–1.05) 0.54 0.96 (0.91–1.02) 0.99 (0.94–1.05) 0.97 (0.93–1.00) 
 Luminal A–like 0.96 (0.91–1.01) 0.98 (0.93–1.04) 1.00 (0.91–1.10) 0.60 1.00 (0.93–1.07) 1.04 (0.97–1.11) 0.98 (0.94–1.02) 
 Luminal B–/HER2-negative–like 0.98 (0.91–1.06) 0.95 (0.89–1.03) 0.98 (0.87–1.11) 0.61 0.99 (0.88–1.11) 0.98 (0.88–1.09) 0.95 (0.88–1.02) 
 Luminal B–like 0.91 (0.84–0.99) 0.98 (0.89–1.07) 1.04 (0.89–1.2) 0.32 1.04 (0.91–1.19) 1.03 (0.91–1.17) 0.97 (0.89–1.05) 
 HER2-enriched 0.85 (0.76–0.95) 0.92 (0.82–1.04) 0.99 (0.81–1.21) 0.37 1.03 (0.84–1.25) 1.05 (0.87–1.26) 0.92 (0.81–1.03) 
 Triple-negative 1.00 (0.92–1.08) 0.98 (0.91–1.06) 1.03 (0.91–1.18) 0.32 1.03 (0.91–1.17) 0.98 (0.87–1.11) 0.98 (0.91–1.06) 
T2D All BCAC breast cancer cases 0.96 (0.93–1.00) 1.00 (0.97–1.03) 0.96 (0.90–1.02) 0.14 1.02 (0.99–1.05) 1.11 (1.06–1.17) 0.99 (0.97–1.01) 
 Luminal A–like 0.96 (0.92–1.00) 1.00 (0.96–1.03) 0.95 (0.88–1.02) 0.13 1.00 (0.97–1.04) 1.13 (1.05–1.22) 0.98 (0.95–1.01) 
 Luminal B–/HER2-negative–like 0.96 (0.91–1.02) 1.02 (0.97–1.07) 0.96 (0.87–1.05) 0.13 1.05 (0.98–1.12) 1.04 (0.95–1.14) 1.02 (0.98–1.07) 
 Luminal B–like 0.94 (0.88–1.01) 0.97 (0.92–1.03) 0.93 (0.83–1.03) 0.31 0.99 (0.90–1.09) 0.99 (0.90–1.09) 0.99 (0.94–1.04) 
 HER2-enriched 0.92 (0.83–1.02) 0.92 (0.86–0.99) 0.92 (0.79–1.07) 0.94 0.88 (0.77–1.01) 0.90 (0.76–1.05) NA NA 
 Triple-negative 0.97 (0.91–1.03) 1.01 (0.96–1.05) 1.03 (0.93–1.13) 0.65 1.07 (1.00–1.14) 1.23 (1.11–1.36) 0.99 (0.95–1.04) 
Age at menarche (years) All BCAC breast cancer cases 0.99 (0.94–1.04) 0.98 (0.94–1.02) 0.98 (0.88–1.09) 0.92 0.98 (0.94–1.02) 0.96 (0.90–1.03) 0.97 (0.94–1.00) 
 Luminal A–like 0.99 (0.94–1.05) 0.99 (0.95–1.04) 0.98 (0.86–1.11) 0.79 0.97 (0.91–1.03) 0.91 (0.82–1.01) 0.98 (0.94–1.02) 
 Luminal B–/HER2-negative–like 0.95 (0.88–1.02) 0.96 (0.90–1.03) 1.02 (0.86–1.22) 0.47 0.94 (0.85–1.03) 0.95 (0.79–1.14) 0.95 (0.90–1.02) 
 Luminal B–like 0.98 (0.90–1.07) 1.02 (0.94–1.10) 0.94 (0.77–1.15) 0.40 1.00 (0.90–1.12) 1.01 (0.83–1.22) 1.00 (0.93–1.07) 
 HER2-enriched 0.98 (0.86–1.12) 0.97 (0.87–1.07) 0.98 (0.74–1.30) 0.90 0.97 (0.83–1.14) 0.89 (0.68–1.17) NA NA 
 Triple-negative 1.00 (0.93–1.08) 0.98 (0.92–1.04) 1.02 (0.86–1.21) 0.58 1.00 (0.90–1.10) 1.02 (0.88–1.19) 0.96 (0.91–1.02) 
Age at menopause (years) All BCAC breast cancer cases 1.04 (1.02–1.06) 1.04 (1.02–1.05) 1.04 (1.00–1.07) 0.91 1.05 (1.03–1.06) 1.05 (1.03–1.07) 1.05 (1.04–1.06) 
 Luminal A–like 1.05 (1.03–1.07) 1.04 (1.03–1.06) 1.04 (1.00–1.08) 0.82 1.06 (1.04–1.08) 1.06 (1.03–1.08) 1.06 (1.04–1.07) 
 Luminal B–/HER2-negative–like 1.05 (1.02–1.08) 1.05 (1.02–1.08) 1.03 (0.97–1.08) 0.33 1.06 (1.02–1.10) 1.06 (1.01–1.10) 1.06 (1.03–1.08) 
 Luminal B–like 1.03 (0.99–1.07) 1.03 (1.00–1.06) 1.02 (0.96–1.08) 0.68 1.04 (0.99–1.08) 1.04 (0.98–1.09) 1.04 (1.01–1.07) 
 HER2-enriched 1.06 (1.01–1.11) 1.04 (1.00–1.07) 1.05 (0.98–1.14) 0.65 1.05 (0.99–1.11) 1.08 (0.98–1.18) NA NA 
 Triple-negative 1.01 (0.98–1.04) 1.00 (0.98–1.03) 1.04 (0.99–1.10) 0.09 1.03 (0.99–1.07) 1.04 (0.99–1.10) NA NA 
Breast density (%) All BCAC breast cancer cases 1.32 (0.89–1.94) 1.03 (0.68–1.57) 0.43 (0.06–3.17) 0.38 1.18 (1.00–1.39) 1.39 (1.16–1.66) 1.26 (1.04–1.51) 
 Luminal A–like 1.44 (0.95–2.20) 1.09 (0.68–1.75) 0.38 (0.04–3.49) 0.34 1.30 (1.06–1.61) 1.39 (1.06–1.82) 1.19 (0.89–1.57) 
 Luminal B–/HER2-negative–like 1.54 (1.05–2.25) 1.05 (0.71–1.54) 0.39 (0.06–2.35) 0.27 1.12 (0.80–1.56) 1.40 (0.65–3.00) 1.16 (0.83–1.63) 
 Luminal B–like 1.38 (0.87–2.17) 1.07 (0.68–1.70) 0.58 (0.06–5.56) 0.59 1.32 (0.89–1.95) 1.93 (0.94–3.93) 1.35 (0.86–2.10) 
 HER2-enriched 1.14 (0.80–1.62) 0.91 (0.57–1.47) 0.47 (0.05–4.81) 0.56 0.93 (0.56–1.53) 0.84 (0.22–3.17) 1.16 (0.73–1.82) 
 Triple-negative 0.99 (0.73–1.34) 1.01 (0.63–1.63) 0.21 (0.02–1.72) 0.13 1.18 (0.86–1.63) 1.39 (0.87–2.21) 1.19 (0.88–1.60) 
Alcohol consumption (drinks/week) All BCAC breast cancer cases 1.03 (0.84–1.26) 0.92 (0.77–1.11) 0.92 (0.66–1.28) 0.99 0.86 (0.70–1.05) 0.88 (0.73–1.05) 0.94 (0.80–1.10) 
 Luminal A–like 1.08 (0.85–1.36) 0.91 (0.73–1.14) 0.97 (0.66–1.44) 0.67 0.93 (0.72–1.21) 0.92 (0.73–1.18) 0.92 (0.75–1.13) 
 Luminal B–/HER2-negative–like 1.00 (0.73–1.37) 0.90 (0.62–1.32) 0.94 (0.5–1.79) 0.88 0.88 (0.55–1.40) 0.86 (0.56–1.33) 0.95 (0.67–1.35) 
 Luminal B–like 0.83 (0.50–1.39) 0.78 (0.54–1.13) 0.57 (0.31–1.06) 0.22 0.51 (0.30–0.86) 0.52 (0.32–0.85) 0.75 (0.54–1.04) 
 HER2-enriched 0.95 (0.56–1.62) 1.35 (0.84–2.16) 1.68 (0.76–3.72) 0.51 1.69 (0.76–3.74) 1.64 (0.77–3.49) NA NA 
 Triple-negative 0.93 (0.67–1.29) 0.91 (0.66–1.25) 0.85 (0.50–1.45) 0.76 0.81 (0.51–1.28) 0.82 (0.54–1.24) NA NA 
Smoking (ever smoked regularly; yes/no) All BCAC breast cancer cases 1.03 (0.96–1.11) 1.08 (1.01–1.16) 1.04 (0.79–1.36) 0.74 1.05 (0.98–1.13) 1.00 (0.83–1.20) 1.09 (1.03–1.15) 
 Luminal A–like 1.04 (0.95–1.13) 1.09 (1.00–1.18) 1.22 (0.87–1.71) 0.48 1.03 (0.94–1.12) 0.96 (0.75–1.22) 1.05 (0.98–1.13) 
 Luminal B–/HER2-negative–like 1.06 (0.93–1.20) 1.03 (0.91–1.16) 0.82 (0.51–1.33) 0.35 0.97 (0.83–1.15) 0.86 (0.54–1.37) NA NA 
 Luminal B–like 0.97 (0.85–1.10) 1.10 (0.97–1.25) 1.07 (0.64–1.81) 0.93 1.05 (0.87–1.27) 0.94 (0.58–1.52) NA NA 
 HER2-enriched 0.91 (0.73–1.14) 1.13 (0.93–1.37) 0.94 (0.43–2.05) 0.64 1.08 (0.82–1.42) 1.03 (0.48–2.19) NA NA 
 Triple-negative 0.96 (0.86–1.09) 1.02 (0.90–1.15) 0.81 (0.50–1.33) 0.36 0.98 (0.83–1.15) 1.05 (0.63–1.77) 1.01 (0.89–1.13) 
Physical activity All BCAC breast cancer cases 0.49 (0.23–1.06) 0.53 (0.28–1.00) 0.03 (1.23 × 10−4–5.86) 0.28 0.66 (0.46–0.93) 0.92 (0.66–1.29) 0.85 (0.34–2.05) 
 Luminal A–like 0.40 (0.13–1.24) 0.44 (0.17–1.13) 0.04 (5.13 × 10−6–321.75) 0.60 0.53 (0.33–0.85) 0.82 (0.49–1.37) 0.42 (0.00–29.22) 
 Luminal B–/HER2-negative-like 0.51 (0.26–0.99) 0.47 (0.24–0.92) 3.65 (0.01–1734.32) 0.51 0.66 (0.34–1.30) 0.69 (0.33–1.44) NA NA 
 Luminal B–like 0.47 (0.14–1.52) 0.49 (0.19–1.31) 4.06 × 10−3 (1.34 × 10−6–12.28) 0.24 0.66 (0.28–1.59) 1.21 (0.28–5.14) NA NA 
 HER2-enriched 0.59 (0.15–2.39) 0.61 (0.20–1.89) 1.79 × 10−3 (2.07 × 10−7–15.43) 0.20 0.65 (0.19–2.24) 1.26 (0.18–9.01) NA NA 
 Triple-negative 0.98 (0.26–3.74) 0.96 (0.31–3.02) 0.49 (6.44 × 10−6–36,805.73) 0.91 0.77 (0.37–1.61) 0.80 (0.36–1.78) 0.82 (0.45–1.49) 

Abbreviation: NA, not applicable.

Bold numbers indicate causal estimates for which P < 0.05.

a

ORs correspond to a 1 SD increase for all risk factors. However, ORs for age at menarche and age at menopause correspond to a 1-year increase, and ORs for T2D and smoking correspond to a unit increase in the log odds.

b

ORs for the MR-PRESSO method are only presented if the outlier test (P < 0.05) indicated that one or multiple outlying genetic variants were distorting the IVW estimate.

cA P value < 0.05 for the MR-Egger intercept indicates that unbalanced horizontal pleiotropy is present.

Anthropometric risk factors

For height, primary IVW estimates were similar across breast cancer subtypes (I2 = 0%) but suggested only a causal risk–increasing effect of increasing height on luminal A– and luminal B–like tumors. Estimated heterogeneity across hormone receptor subtypes remained 0% after exclusion of the estimate for triple-negative tumors. The combination of our primary and secondary MR analyses provided consistent evidence for a causal risk–increasing effect of increased height on the risk of luminal A–like tumors and concordant evidence for causal associations with luminal B–/HER2-negative–like and luminal B–like tumors (Table 2; Supplementary Fig. S2). Evidence for causal associations with HER2-enriched and triple-negative tumors was inadequate. However, MR analyses including female-specific genetic IVs provided consistent evidence for a causal risk–increasing effect of increased height on luminal B–like tumors and concordant evidence for the other four subtypes (Supplementary Table S6; Fig. 2).

Figure 2.

Overview of evidence for (subtype-specific) causal effects per increasing unit of the risk factor. For this figure, we counted the number of performed MR methods (both primary and secondary analyses) that provided statistical evidence for causal effects (threshold P < 0.05) and assessed the direction of causal effect estimates. In cases where the MR-PRESSO analysis indicated that there were no outliers and, thus, the secondary IVW estimate was valid, we included the IVW estimate from our secondary analyses twice. Evidence was considered consistent if all performed MR methods for the specific association reached P < 0.05. Evidence was considered concordant if at least one MR method (main or sensitivity) for the specific association reached P < 0.05 and the direction of the effect estimate was concordant for all methods. Evidence was considered inconsistent if at least one MR method (main or sensitivity) for the specific association reached P < 0.05, but the direction of the effect estimate differed between methods. Evidence was considered to be inadequate if none of the MR methods reached P < 0.05.* MR-Egger, weighted median, weighted mode, and MR-PRESSO analyses are less suitable when only a few genetic IVs are available; the results for physical activity should, therefore, be interpreted with appropriate caution.

Figure 2.

Overview of evidence for (subtype-specific) causal effects per increasing unit of the risk factor. For this figure, we counted the number of performed MR methods (both primary and secondary analyses) that provided statistical evidence for causal effects (threshold P < 0.05) and assessed the direction of causal effect estimates. In cases where the MR-PRESSO analysis indicated that there were no outliers and, thus, the secondary IVW estimate was valid, we included the IVW estimate from our secondary analyses twice. Evidence was considered consistent if all performed MR methods for the specific association reached P < 0.05. Evidence was considered concordant if at least one MR method (main or sensitivity) for the specific association reached P < 0.05 and the direction of the effect estimate was concordant for all methods. Evidence was considered inconsistent if at least one MR method (main or sensitivity) for the specific association reached P < 0.05, but the direction of the effect estimate differed between methods. Evidence was considered to be inadequate if none of the MR methods reached P < 0.05.* MR-Egger, weighted median, weighted mode, and MR-PRESSO analyses are less suitable when only a few genetic IVs are available; the results for physical activity should, therefore, be interpreted with appropriate caution.

Close modal

For increasing BMI, primary IVW estimates provided evidence for a causal risk–decreasing effect on luminal B–like and HER2-enriched tumors [ORlumB = 0.91; 95% confidence interval (CI), 0.84–0.99 and ORHER2+ = 0.85; 95% CI, 0.76–0.95]. Accordingly, I2 estimates indicated moderate heterogeneity across subtype-specific estimates (I2 = 31.1% across all subtypes; I2 = 40.1% after exclusion of triple-negative tumors). However, based on the combination of primary and secondary MR analyses, evidence for a causal effect of BMI was merely inconsistent for luminal B–like and HER2-enriched tumors and inadequate for the other subtypes (Table 2; Supplementary Fig. S3). In contrast, analyses including female-specific genetic IVs provided concordant evidence for a causal risk–decreasing effect of increasing BMI on all hormone receptor subtypes (Supplemental Table S6; Fig. 2).

For increasing risk of T2D, our primary analyses only suggested a causal risk–decreasing effect on luminal A–like tumors although causal effect estimates for the other subtypes were very similar (I2 = 0% for analyses including and excluding triple-negative tumors). Altogether, our primary and secondary MR analyses provided concordant evidence for a causal risk–decreasing effect on the risk of HER2-enriched tumors. Evidence for causal associations with luminal A–like and triple-negative tumors was inconsistent but inadequate for luminal B–like and luminal B–/HER2-negative–like tumors (Table 2; Fig. 2; Supplementary Fig. S4).

Reproductive risk factors

We observed no evidence for a causal effect of higher age at menarche on any of the hormone receptor subtypes in primary MR analyses (I2 = 0%). Secondary univariable MR analyses supported these findings (Table 2; Supplementary Fig. S5), but multivariable MR analyses for higher age at menarche and increasing BMI provided evidence for a direct causal association between higher age at menarche and a decreased risk of luminal A–like, luminal B–/HER2-negative–like, and triple-negative breast tumors (Supplementary Fig. S6). However, corresponding heterogeneity estimates suggested similar effects across subtypes [I2 = 15.1% based on data from Yengo and colleagues (27); 0% based on UK Biobank data (https://gwas.mrcieu.ac.uk/datasets/ukb-b-19953/)]. Based on the combination of our primary and secondary analyses, including multivariable MR analyses, evidence for a causal effect of higher age at menarche was only concordant for luminal A–like tumors and inconsistent for luminal B–/HER2-negative–like and triple-negative tumors (Fig. 2).

For higher age at menopause, primary IVW estimates suggested causal risk–increasing effects on luminal A–like, luminal B–/HER2-negative–like, and HER2-enriched tumors but not on luminal B–like and triple-negative tumors (I2 = 42.1%). Estimated heterogeneity across hormone receptor subtypes, excluding triple-negative breast cancer, was 0%, which indicates that the absence of a causal effect on this specific subtype [OR = 1.01 (95% CI, 0.98–1.04)] explains the observed heterogeneity across all five subtypes. Collectively, our MR analyses provided consistent or concordant evidence for a causal effect of age at menopause on all subtypes except triple-negative breast cancer (Table 2; Fig. 2; Supplementary Fig. S7).

In primary IVW analyses, genetically predicted higher breast density was only significantly associated with an increased risk of luminal B–/HER2-negative–like breast cancer (I2 = 15.6% across all subtypes; I2 = 0% after exclusion of triple-negative tumors). Based on the combination of all used MR methods, evidence for a causal effect was inconsistent for luminal A–like and luminal B–/HER2-negative–like tumors and inadequate for the other subtypes (Table 2; Fig. 2; Supplementary Fig. S8).

Lifestyle factors

Primary analyses did not provide evidence for causal effects of alcohol consumption and regular smoking on the risk of any of the hormone receptor subtypes (I2alcohol = 0%; I2smoking = 0%). Two out of the five secondary MR analyses suggested a causal risk–decreasing effect of higher alcohol consumption on the risk of luminal B–like breast tumors but not on the other subtypes (Table 2; Supplementary Fig. S9). Consequently, our results only provide concordant evidence for a causal effect of alcohol consumption on this specific subtype (Fig. 2). Secondary analyses for regular smoking supported our primary findings (Table 2; Supplementary Fig. S10), and thus, evidence for a causal association between smoking and the risk of any of the hormone receptor subtypes is inadequate. For overall higher physical activity, our primary results only provided evidence for a causal risk–decreasing effect on the risk of luminal B–/HER2-negative–like breast tumors (I2 = 0%; Table 2; Supplementary Fig. S11). However, statistical power was generally low, except for the luminal A–like subtype (Supplementary Table S4). Moreover, CIs around primary causal effect estimates were very wide, indicating a high degree of uncertainty. Based on the combination of our primary and secondary MR analyses, we found concordant evidence for a causal effect of physical activity on luminal A–like breast cancer and inconsistent evidence for luminal B–/HER2-negative–like tumors. For the other subtypes, evidence for a causal effect was inadequate. MR analyses including female-specific IVs provided concordant evidence for a causal risk–decreasing effect on luminal A–like, luminal B–/HER2-negative–like, and HER2-enriched breast tumors (Supplementary Table S6). In these analyses, evidence for a causal association with luminal B–like tumors shifted to inconsistent, whereas the evidence for a causal association with triple-negative tumors remained inadequate (Fig. 2). Yet the results of secondary MR analyses for physical activity should be interpreted with caution due to the very low number of genetic IVs available for this risk factor.

This MR study indicates that the causal effects of several established breast cancer risk factors differ across hormone receptor breast cancer subtypes. Specifically, we observed moderate heterogeneity in subtype-specific causal effects for age at menopause and breast density. Although this heterogeneity was explained by null findings for triple-negative tumors, statistical evidence was also inadequate for causal associations of breast density with luminal B–like and HER2-enriched tumors. For height, BMI, risk of T2D, age at menarche, alcohol consumption, regular smoking, and physical activity, causal effect estimates were similar across breast cancer subtypes. However, for height and BMI, the evidence for causal effects was concordant, or stronger, for all five subtypes. In contrast, for regular smoking, statistical evidence for a causal effect was inadequate for all subtypes. For the remaining six risk factors, the strength of causal evidence ranged from concordant to inadequate across subtypes. There was no evidence of opposing effects of any risk factor across hormone receptor subtypes. Altogether, our findings suggest that it is more likely that there is heterogeneity in the presence or absence of causal associations between risk factors and breast cancer subtypes than in the magnitude and direction of causal effects.

Previous MR studies supported height, BMI, age at menarche, age at menopause, breast density, and physical activity as breast cancer risk factors but could not confirm the risk of T2D, alcohol consumption, and smoking behavior as causal risk factors for overall breast cancer. The majority of patients with breast cancer are diagnosed with luminal tumors (49). As a result, established risk factor–breast cancer associations will, in general, represent associations with luminal subtypes and potentially less with HER2-enriched and triple-negative tumors. Our results indeed indicate that for triple-negative breast cancer, there is concordant evidence of causality for only two out of the nine risk factors, whereas for the other subtypes, there is concordant evidence of causality for four to five risk factors. For HER2-enriched tumors, differences compared with luminal tumors were less clear, possibly due to the considerably lower statistical power for the HER2-enriched subtype. For triple-negative tumors, we consider it likely that null findings for age at menopause, breast density, and physical activity are not caused by insufficient statistical power because the sample size for this subtype was similar to that for luminal B–/HER2-negative–like and luminal B–like tumors, and causal ORs were consistently 1.

Considering results from observational studies (i.e., not MR), recent large-scale analyses for height and age at menopause in relation to hormone receptor breast cancer subtypes are in line with our findings. Specifically, increasing height was associated with a higher risk of ER+PR+, ER+PR, and ERPR postmenopausal breast cancer (50). In a recent BCAC analysis of self-reported data, age at menopause was also not associated with the risk of triple-negative tumors (1). Results from the same two studies for BMI and age at menarche were, however, not in line with our findings. A higher adult BMI was associated with a lower risk of ER+PR+ premenopausal breast cancer and a higher risk of ER+PR+ and ERPR postmenopausal breast cancer (50). Evidence for associations between BMI and other subtypes was less clear although an association between higher BMI and a lower risk of breast cancer was suggested for ER+PR postmenopausal tumors. The results from our analyses, including female-specific weights, suggest that a higher genetically determined BMI is associated with a lower risk of all hormone receptor subtypes. Our finding is in line with an earlier MR study that found that a higher BMI was associated with a lower risk of both pre- and postmenopausal breast cancer (51), which contradicts observational evidence showing that a higher BMI is associated with a higher risk of postmenopausal breast cancer (e.g.; ref. 52). Although summary-level BCAC data stratified by age are currently not available for the five hormone receptor subtypes that we studied, new summary-level data have been calculated by Dr. Kyriaki Michailidou for ER+ and ER breast cancer across three age categories (<40, 40–55, and >55 years). We used these data to further investigate potential differences in the association between BMI and pre- and postmenopausal breast cancer, but these analyses did not support such a difference (Supplementary Table S7). Based on observational BCAC data, a younger age at menarche was associated with all hormone receptor subtypes (1), whereas our MR results only provided concordant causal evidence for an association with luminal A–like tumors. Yet estimated heterogeneity across subtypes in multivariable MR estimates for age at menarche was negligible, and statistical power to detect a causal effect was limited. Two recent observational studies suggested that breast density was also associated with all subtypes (4, 53), but based on our results, evidence for a causal association is weak. Large-scale analyses that studied associations with hormone receptor breast cancer subtypes are currently lacking for the risk of T2D, alcohol consumption, smoking behavior, and physical activity, which hampers a meaningful comparison with our findings for these risk factors. However, systematic reviews of observational studies for several of these lifestyle-related traits indicate that their results for overall breast cancer are likely to be biased by unmeasured confounding (e.g.; ref. 54). This observation highlights the importance of approaches that are more robust to residual confounding and measurement error, such as MR, to understand the etiology of breast cancer subtypes.

Until now, only one other MR analysis has set out to investigate multiple known risk factors in relation to hormone receptor breast cancer subtypes (18). This previous study reported similar results for age at menopause, which was associated with all subtypes but triple-negative tumors. Furthermore, a subtype-specific causal effect for alcohol consumption was reported, which was suggested to be causally associated only with the risk of HER2-enriched breast cancer. However, their MR-PRESSO and MR-Egger estimates for alcohol consumption were in line with our evidence for a causal risk–decreasing effect on the risk of luminal B–like tumors. This contradiction illustrates the added value of our approach to evaluate causal evidence based on the combination of six different MR methods (seven for age at menarche). Consequently, our conclusions are less likely to reflect invalid causal inferences due to unbalanced horizontal pleiotropy or due to chance findings because of multiple testing. We also assessed the homogeneity assumption through the inclusion of female-specific genetic IVs for height, BMI, and physical activity and showed that these instruments were consistently associated with stronger causal evidence across all breast cancer subtypes compared with combined-sex genetic IVs. In line with this observation, our findings for the risk of T2D, alcohol consumption, and regular smoking should be considered preliminary until female-specific IVs for these risk factors are used in future MR analyses. Altogether, our results underline the importance of an extensive investigation of the MR assumptions.

Another frequently unassessed assumption for two-sample MR analyses is that the risk factor and outcome GWAS samples should be independent, that is, there should be no overlap in study participants (24). As the BCAC includes several studies that also participate in other consortia, this assumption was not completely met in the current analysis. Based on reported details by the included GWAS, we estimated a relatively small overlap in participants, ranging from 0% to 16.3%. Bias due to sample overlap in two-sample MR studies arises in analyses that include weak genetic instruments. In the case of minimal sample overlap, this bias will be toward the null and thus rather increase type 2 error rates than type 1 error rates (55). For each risk factor, we estimated maximum and minimum F-statistics corresponding to primary IVW analyses for luminal A–like and HER2-enriched tumors, respectively. We based these estimations on the variance explained by the included genetic variants in an independent study population, if such independent r2 estimates were available. This approach minimized the overestimation of F-statistics due to winner’s curse bias in the discovery GWAS. Minimum F-statistics for height, BMI, alcohol, and smoking were below the arbitrary threshold of 10, which indicates that weak instrument bias may have biased causal effect estimates for these risk factors toward the null. A last important assumption for two-sample MR studies is that regression models employed for the risk factor GWAS and the outcome GWAS should be adjusted for the same covariates (24). Specifically, adjustment for potential confounders in the risk factor GWAS can induce collider bias in two-sample MR analyses. In the current analysis, such bias may have affected our results for breast density because its GWAS was adjusted for BMI. A higher BMI is associated with decreased breast density (56), but without data on the causal association and confounding structure among the genetic IVs, breast density, BMI, and breast cancer subtypes, it is difficult to evaluate the potential impact on our results (24, 57).

An additional strength of our study compared with previous MR studies that set out to investigate associations between breast cancer risk factors and the hormone receptor subtypes is that we maximized the statistical power of our primary analyses through the inclusion of as many genetic IVs as possible in combination with LD matrices. Although these correlation matrices were estimated based on genetic data of only ∼400 participants, causal estimates from our primary analyses were very similar to estimates from our secondary, more conservative analyses. Despite our efforts, our post hoc power analyses indicated that the statistical power to detect causal risk factors across all subtypes was still suboptimal. Future MR studies including stronger genetic IVs, that is, genetic IVs explaining more variance in the risk factor of interest, could further increase statistical power. However, the identification of additional loci requires even larger GWAS study populations, and this has proved to be challenging for lifestyle-related risk factors such as alcohol consumption, smoking, and physical activity. Another possibility to increase statistical power is the inclusion of larger numbers of breast cancer cases for hormone receptor subtypes, which will be possible through initiatives like the Confluence project (58). Further improvement in the quality of MR studies can be achieved if future risk factor GWAS stratify analyses by biological sex and report full details about the included GWAS settings, participants, and methods.

In conclusion, our results suggest that of the established breast cancer risk factors, height and BMI are likely to exert similar causal effects across all breast cancer subtypes. Our MR analyses also suggest that the majority of established breast cancer risk factors are not causally associated with the risk of triple-negative tumors. These insights are valuable for the development of primary prevention strategies and the improvement of breast cancer risk stratification in the general population. Our findings also emphasize the importance of taking breast cancer subtype into account for the identification of novel breast cancer risk factors.

M. Shokouhi reports grants from the Antoni van Leeuwenhoek Foundation during the conduct of the study. M.K. Schmidt reports grants from the European Union and the Antoni van Leeuwenhoek Foundation during the conduct of the study. No disclosures were reported by the other authors.

R.M.G. Verdiesen: Formal analysis, validation, investigation, visualization, methodology, writing–original draft, writing–review and editing. M. Shokouhi: Formal analysis, validation, investigation, visualization, methodology, writing–original draft, writing–review and editing. S. Burgess: Conceptualization, supervision, validation, writing–review and editing. S. Canisius: Conceptualization, supervision, validation, writing–review and editing. J. Chang-Claude: Conceptualization, supervision, validation, writing–review and editing. S.E. Bojesen: Conceptualization, supervision, validation, writing–review and editing. M.K. Schmidt: Conceptualization, resources, supervision, funding acquisition, validation, project administration, writing–review and editing.

We thank Dr. Haoyu Zhang for providing additional details about the BCAC GWAS summary statistics and Professor Aiden Doherty for providing female-specific summary statistics for overall physical activity. We thank Renee Menezes for help with MR-PRESSO. We acknowledge the BCAC for providing summary data. We gratefully acknowledge the Genetic Epidemiology Group at Cambridge University for their help in generating summary estimates, especially Dr. Kyriaki Michailidou. R.M.G. Verdiesen was funded by the European Union’s Horizon 2020 Research and Innovation Programme (grant number 633784 for the B-CAST project) granted to M.K. Schmidt. M. Shokouhi is funded by the Antoni van Leeuwenhoek Foundation. All research at the Netherlands Cancer Institute is supported by institutional grants from the Dutch Cancer Society and the Dutch Ministry of Health, Welfare and Sport. S. Burgess is supported by the Wellcome Trust (225790/Z/22/Z) and the United Kingdom Research and Innovation Medical Research Council (MC_UU_00002/7, MC_UU_00040/01). The breast cancer genome-wide association analyses for BCAC were supported by Cancer Research UK (PPRPGM-Nov20\100002, C1287/A10118, C1287/A16563, C1287/A10710, C12292/A20861, C12292/A11174, C1281/A12014, C5047/A8384, C5047/A15007, C5047/A10692, C8197/A16565); the Gray Foundation; the NIH (CA128978, X01HG007492: the DRIVE Consortium); the PERSPECTIVE project supported by the Government of Canada through Genome Canada and the Canadian Institutes of Health Research (grant GPH-129344); the Ministère de l’Économie, Science et Innovation du Québec through Genome Québec and the PSRSIIRI-701 grant; the Quebec Breast Cancer Foundation; the European Community’s Seventh Framework Programme under grant agreement number 223175 (HEALTH-F2-2009-223175; COGS); the European Union’s Horizon 2020 Research and Innovation Programme (634935 and 633784); the Post-Cancer GWAS initiative (U19 CA148537, CA148065, and CA148112: the GAME-ON initiative); the Department of Defence (W81XWH-10-1-0341); the Canadian Institutes of Health Research (CIHR) for the CIHR Team in Familial Risks of Breast Cancer (CRN-87521); the Komen Foundation for the Cure; the Breast Cancer Research Foundation; and the Ovarian Cancer Research Fund. All studies and funders are listed in Zhang and colleagues (34).

Note: Supplementary data for this article are available at Cancer Epidemiology, Biomarkers & Prevention Online (http://cebp.aacrjournals.org/).

1.
Jung
AY
,
Ahearn
TU
,
Behrens
S
,
Middha
P
,
Bolla
MK
,
Wang
Q
, et al
.
Distinct reproductive risk profiles for intrinsic-like breast cancer subtypes: pooled analysis of population-based studies
.
J Natl Cancer Inst
2022
;
114
:
1706
19
.
2.
Gaudet
MM
,
Gierach
GL
,
Carter
BD
,
Luo
J
,
Milne
RL
,
Weiderpass
E
, et al
.
Pooled analysis of nine cohorts reveals breast cancer risk factors by tumor molecular subtype
.
Cancer Res
2018
;
78
:
6011
21
.
3.
Anderson
KN
,
Schwab
RB
,
Martinez
ME
.
Reproductive risk factors and breast cancer subtypes: a review of the literature
.
Breast Cancer Res Treat
2014
;
144
:
1
10
.
4.
McCarthy
AM
,
Friebel-Klingner
T
,
Ehsan
S
,
He
W
,
Welch
M
,
Chen
J
, et al
.
Relationship of established risk factors with breast cancer subtypes
.
Cancer Med
2021
;
10
:
6456
67
.
5.
Schoemaker
MJ
,
Nichols
HB
,
Wright
LB
,
Brook
MN
,
Jones
ME
,
O’Brien
KM
, et al;
Premenopausal Breast Cancer Collaborative Group
;
Association of body mass index and age with subsequent breast cancer risk in premenopausal women
.
JAMA Oncol
2018
;
4
:
e181771
.
6.
Papadimitriou
N
,
Markozannes
G
,
Kanellopoulou
A
,
Critselis
E
,
Alhardan
S
,
Karafousia
V
, et al
.
An umbrella review of the evidence associating diet and cancer risk at 11 anatomical sites
.
Nat Commun
2021
;
12
:
4579
.
7.
Gago-Dominguez
M
,
Castelao
JE
,
Gude
F
,
Fernandez
MP
,
Aguado-Barrera
ME
,
Ponte
SM
, et al
.
Alcohol and breast cancer tumor subtypes in a Spanish cohort
.
Springerplus
2016
;
5
:
39
.
8.
Burgess
S
,
Thompson
SG
.
Mendelian randomization: methods for causal inference using genetic variants
.
New York (NY)
:
CRC Press
;
2021
. p.
13
50
.
9.
Zhang
B
,
Shu
XO
,
Delahanty
RJ
,
Zeng
C
,
Michailidou
K
,
Bolla
MK
, et al
.
Height and breast cancer risk: evidence from prospective studies and Mendelian randomization
.
J Natl Cancer Inst
2015
;
107
:
djv219
.
10.
Vithayathil
M
,
Carter
P
,
Kar
S
,
Mason
AM
,
Burgess
S
,
Larsson
SC
.
Body size and composition and risk of site-specific cancers in the UK Biobank and large international consortia: a Mendelian randomisation study
.
PLoS Med
2021
;
18
:
e1003706
.
11.
Burgess
S
,
Thompson
DJ
,
Rees
JMB
,
Day
FR
,
Perry
JR
,
Ong
KK
.
Dissecting causal pathways using mendelian randomization with summarized genetic data: application to age at menarche and risk of breast cancer
.
Genetics
2017
;
207
:
481
7
.
12.
Ruth
KS
,
Day
FR
,
Hussain
J
,
Martínez-Marchal
A
,
Aiken
CE
,
Azad
A
, et al
.
Genetic insights into biological mechanisms governing human ovarian ageing
.
Nature
2021
;
596
:
393
7
.
13.
Sieh
W
,
Rothstein
JH
,
Klein
RJ
,
Alexeeff
SE
,
Sakoda
LC
,
Jorgenson
E
, et al
.
Identification of 31 loci for mammographic density phenotypes and their associations with breast cancer risk
.
Nat Commun
2020
;
11
:
5116
.
14.
Papadimitriou
N
,
Dimou
N
,
Tsilidis
KK
,
Banbury
B
,
Martin
RM
,
Lewis
SJ
, et al
.
Physical activity and risks of breast and colorectal cancer: a Mendelian randomisation analysis
.
Nat Commun
2020
;
11
:
597
.
15.
Yuan
S
,
Kar
S
,
Carter
P
,
Vithayathil
M
,
Mason
AM
,
Burgess
S
, et al
.
Is type 2 diabetes causally associated with cancer risk? Evidence from a two-sample Mendelian randomization study
.
Diabetes
2020
;
69
:
1588
96
.
16.
Larsson
SC
,
Carter
P
,
Kar
S
,
Vithayathil
M
,
Mason
AM
,
Michaëlsson
K
, et al
.
Smoking, alcohol consumption, and cancer: a Mendelian randomisation study in UK Biobank and international genetic consortia participants
.
PLoS Med
2020
;
17
:
e1003178
.
17.
Dimou
N
,
Yarmolinsky
J
,
Bouras
E
,
Tsilidis
KK
,
Martin
RM
,
Lewis
SJ
, et al
.
Causal effects of lifetime smoking on breast and colorectal cancer risk: Mendelian randomization study
.
Cancer Epidemiol Biomarkers Prev
2021
;
30
:
953
64
.
18.
Chen
F
,
Wen
W
,
Long
J
,
Shu
X
,
Yang
Y
,
Shu
XO
, et al
.
Mendelian randomization analyses of 23 known and suspected risk factors and biomarkers for breast cancer overall and by molecular subtypes
.
Int J Cancer
2022
;
151
:
372
80
.
19.
Hayes
BL
,
Robinson
T
,
Kar
S
,
Ruth
KS
,
Tsilidis
KK
,
Frayling
T
, et al
.
Do sex hormones confound or mediate the effect of chronotype on breast and prostate cancer? A Mendelian randomization study
.
PLoS Genet
2022
;
18
:
e1009887
.
20.
Shu
X
,
Zhou
Q
,
Sun
X
,
Flesaker
M
,
Guo
X
,
Long
J
, et al
.
Associations between circulating proteins and risk of breast cancer by intrinsic subtypes: a Mendelian randomisation analysis
.
Br J Cancer
2022
;
127
:
1507
14
.
21.
Nounu
A
,
Kar
SP
,
Relton
CL
,
Richmond
RC
.
Sex steroid hormones and risk of breast cancer: a two-sample Mendelian randomization study
.
Breast Cancer Res
2022
;
24
:
66
.
22.
Bouras
E
,
Karhunen
V
,
Gill
D
,
Huang
J
,
Haycock
PC
,
Gunter
MJ
, et al
.
Circulating inflammatory cytokines and risk of five cancers: a Mendelian randomization analysis
.
BMC Med
2022
;
20
:
3
.
23.
Fu
Y
,
Xu
F
,
Jiang
L
,
Miao
Z
,
Liang
X
,
Yang
J
, et al
.
Circulating vitamin C concentration and risk of cancers: a Mendelian randomization study
.
BMC Med
2021
;
19
:
171
.
24.
Lawlor
DA
.
Commentary: two-sample Mendelian randomization: opportunities and challenges
.
Int J Epidemiol
2016
;
45
:
908
15
.
25.
UK CR
.
Risk factors
.
[cited 2020 Jun 30]. Available from:
https://www.cancerresearchuk.org/about-cancer/breast-cancer/risks-causes/risk-factors.
26.
Byng
JW
,
Boyd
NF
,
Fishell
E
,
Jong
RA
,
Yaffe
MJ
.
The quantitative analysis of mammographic densities
.
Phys Med Biol
1994
;
39
:
1629
38
.
27.
Yengo
L
,
Sidorenko
J
,
Kemper
KE
,
Zheng
Z
,
Wood
AR
,
Weedon
MN
, et al
.
Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry
.
Hum Mol Genet
2018
;
27
:
3641
9
.
28.
Wood
AR
,
Esko
T
,
Yang
J
,
Vedantam
S
,
Pers
TH
,
Gustafsson
S
, et al
.
Defining the role of common variation in the genomic and biological architecture of adult human height
.
Nat Genet
2014
;
46
:
1173
86
.
29.
Locke
AE
,
Kahali
B
,
Berndt
SI
,
Justice
AE
,
Pers
TH
,
Day
FR
, et al
.
Genetic studies of body mass index yield new insights for obesity biology
.
Nature
2015
;
518
:
197
206
.
30.
Vujkovic
M
,
Keaton
JM
,
Lynch
JA
,
Miller
DR
,
Zhou
J
,
Tcheandjieu
C
, et al
.
Discovery of 318 new risk loci for type 2 diabetes and related vascular outcomes among 1.4 million participants in a multi-ancestry meta-analysis
.
Nat Genet
2020
;
52
:
680
91
.
31.
Day
FR
,
Thompson
DJ
,
Helgason
H
,
Chasman
DI
,
Finucane
H
,
Sulem
P
, et al
.
Genomic analyses identify hundreds of variants associated with age at menarche and support a role for puberty timing in cancer risk
.
Nat Genet
2017
;
49
:
834
41
.
32.
Liu
M
,
Jiang
Y
,
Wedow
R
,
Li
Y
,
Brazel
DM
,
Chen
F
, et al
.
Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use
.
Nat Genet
2019
;
51
:
237
44
.
33.
Doherty
A
,
Smith-Byrne
K
,
Ferreira
T
,
Holmes
MV
,
Holmes
C
,
Pulit
SL
, et al
.
GWAS identifies 14 loci for device-measured physical activity and sleep duration
.
Nat Commun
2018
;
9
:
5257
.
34.
Zhang
H
,
Ahearn
TU
,
Lecarpentier
J
,
Barnes
D
,
Beesley
J
,
Qi
G
, et al
.
Genome-wide association study identifies 32 novel breast cancer susceptibility loci from overall and subtype-specific analyses
.
Nat Genet
2020
;
52
:
572
81
.
35.
Myers
TA
,
Chanock
SJ
,
Machiela
MJ
.
LDlinkR: an R package for rapidly calculating linkage disequilibrium statistics in diverse populations
.
Front Genet
2020
;
11
:
157
.
36.
Hemani
G
,
Zheng
J
,
Elsworth
B
,
Wade
KH
,
Haberland
V
,
Baird
D
, et al
.
The MR-Base platform supports systematic causal inference across the human phenome
.
Elife
2018
;
7
:
e34408
.
37.
Burgess
S
,
Zuber
V
,
Valdes-Marquez
E
,
Sun
BB
,
Hopewell
JC
.
Mendelian randomization with fine-mapped genetic data: choosing from large numbers of correlated instrumental variables
.
Genet Epidemiol
2017
;
41
:
714
25
.
38.
Burgess
S
.
Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome
.
Int J Epidemiol
2014
;
43
:
922
9
.
39.
Viechtbauer
W
.
Conducting meta-analyses in R with the metafor package
.
J Stat Softw
2010
;
36
:
1
48
.
40.
Burgess
S
,
Davey Smith
G
,
Davies
NM
,
Dudbridge
F
,
Gill
D
,
Glymour
MM
, et al
.
Guidelines for performing Mendelian randomization investigations: update for summer 2023
.
Wellcome Open Res
2019
;
4
:
186
.
41.
Bowden
J
,
Davey Smith
G
,
Burgess
S
.
Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression
.
Int J Epidemiol
2015
;
44
:
512
25
.
42.
Bowden
J
,
Davey Smith
G
,
Haycock
PC
,
Burgess
S
.
Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator
.
Genet Epidemiol
2016
;
40
:
304
14
.
43.
Hartwig
FP
,
Davey Smith
G
,
Bowden
J
.
Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption
.
Int J Epidemiol
2017
;
46
:
1985
98
.
44.
Verbanck
M
,
Chen
CY
,
Neale
B
,
Do
R
.
Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases
.
Nat Genet
2018
;
50
:
693
8
.
45.
Slob
EAW
,
Burgess
S
.
A comparison of robust Mendelian randomization methods using summary data
.
Genet Epidemiol
2020
;
44
:
313
29
.
46.
Randall
JC
,
Winkler
TW
,
Kutalik
Z
,
Berndt
SI
,
Jackson
AU
,
Monda
KL
, et al
.
Sex-stratified genome-wide association studies including 270,000 individuals show sexual dimorphism in genetic loci for anthropometric traits
.
PLoS Genet
2013
;
9
:
e1003500
.
47.
Pulit
SL
,
Stoneman
C
,
Morris
AP
,
Wood
AR
,
Glastonbury
CA
,
Tyrrell
J
, et al
.
Meta-analysis of genome-wide association studies for body fat distribution in 694 649 individuals of European ancestry
.
Hum Mol Genet
2019
;
28
:
166
74
.
48.
Markozannes
G
,
Kanellopoulou
A
,
Dimopoulou
O
,
Kosmidis
D
,
Zhang
X
,
Wang
L
, et al
.
Systematic review of Mendelian randomization studies on risk of cancer
.
BMC Med
2022
;
20
:
41
.
49.
Anderson
WF
,
Rosenberg
PS
,
Prat
A
,
Perou
CM
,
Sherman
ME
.
Tracking and evaluating molecular tumor markers with cancer registry data: HER2 and breast cancer
.
J Natl Cancer Inst
2014
;
106
:
dju165
.
50.
van den Brandt
PA
,
Ziegler
RG
,
Wang
M
,
Hou
T
,
Li
R
,
Adami
HO
, et al
.
Body size and weight change over adulthood and risk of breast cancer by menopausal and hormone receptor status: a pooled analysis of 20 prospective cohort studies
.
Eur J Epidemiol
2021
;
36
:
37
55
.
51.
Guo
Y
,
Warren Andersen
S
,
Shu
XO
,
Michailidou
K
,
Bolla
MK
,
Wang
Q
, et al
.
Genetically predicted body mass index and breast cancer risk: Mendelian randomization analyses of data from 145,000 women of European descent
.
PLoS Med
2016
;
13
:
e1002105
.
52.
Lahmann
PH
,
Hoffmann
K
,
Allen
N
,
van Gils
CH
,
Khaw
KT
,
Tehard
B
, et al
.
Body size and breast cancer risk: findings from the European Prospective Investigation into Cancer and Nutrition (EPIC)
.
Int J Cancer
2004
;
111
:
762
71
.
53.
Kleinstern
G
,
Scott
CG
,
Tamimi
RM
,
Jensen
MR
,
Pankratz
VS
,
Bertrand
KA
, et al
.
Association of mammographic density measures and breast cancer “intrinsic” molecular subtypes
.
Breast Cancer Res Treat
2021
;
187
:
215
24
.
54.
Ling
S
,
Brown
K
,
Miksza
JK
,
Howells
L
,
Morrison
A
,
Issa
E
, et al
.
Association of type 2 diabetes with cancer: a meta-analysis with bias analysis for unmeasured confounding in 151 cohorts comprising 32 million people
.
Diabetes Care
2020
;
43
:
2313
22
.
55.
Burgess
S
,
Davies
NM
,
Thompson
SG
.
Bias due to participant overlap in two-sample Mendelian randomization
.
Genet Epidemiol
2016
;
40
:
597
608
.
56.
Boyd
NF
,
Lockwood
GA
,
Byng
JW
,
Little
LE
,
Yaffe
MJ
,
Tritchler
DL
.
The relationship of anthropometric measures to radiological features of the breast in premenopausal women
.
Br J Cancer
1998
;
78
:
1233
8
.
57.
Hartwig
FP
,
Tilling
K
,
Davey Smith
G
,
Lawlor
DA
,
Borges
MC
.
Bias in two-sample Mendelian randomization when using heritable covariable-adjusted summary associations
.
Int J Epidemiol
2021
;
50
:
1639
50
.
58.
Genetics NNCI-DoCE. Confluence Project
[cited 2022 Feb 5]. Available from:
https://dceg.cancer.gov/research/cancer-types/breast-cancer/confluence-project.
This open access article is distributed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.