Purpose: Circulating hormone concentrations are associated with breast cancer risk, with well-established associations for postmenopausal women. Biomarkers may represent minimally invasive measures to improve risk prediction models.

Experimental Design: We evaluated improvements in discrimination gained by adding serum biomarker concentrations to risk estimates derived from risk prediction models developed by Gail and colleagues and Pfeiffer and colleagues using a nested case–control study within the EPIC cohort, including 1,217 breast cancer cases and 1,976 matched controls. Participants were pre- or postmenopausal at blood collection. Circulating sex steroids, prolactin, insulin-like growth factor (IGF) I, IGF-binding protein 3, and sex hormone–binding globulin (SHBG) were evaluated using backward elimination separately in women pre- and postmenopausal at blood collection. Improvement in discrimination was evaluated as the change in concordance statistic (C-statistic) from a modified Gail or Pfeiffer risk score alone versus models, including the biomarkers and risk score. Internal validation with bootstrapping (1,000-fold) was used to adjust for overfitting.

Results: Among women postmenopausal at blood collection, estradiol, testosterone, and SHBG were selected into the prediction models. For breast cancer overall, model discrimination after including biomarkers was 5.3 percentage points higher than the modified Gail model alone, and 3.4 percentage points higher than the Pfeiffer model alone, after accounting for overfitting. Discrimination was more markedly improved for estrogen receptor–positive disease (percentage point change in C-statistic: 7.2, Gail; 4.8, Pfeiffer). We observed no improvement in discrimination among women premenopausal at blood collection.

Conclusions: Integration of hormone measurements in clinical risk prediction models may represent a strategy to improve breast cancer risk stratification. Clin Cancer Res; 23(15); 4181–9. ©2017 AACR.

Translational Relevance

Breast cancer risk prediction models are increasingly being considered in the context of mammography screening, given the need to balance potential benefits of screening against possible negative side effects, such as overdiagnosis or false-positive findings, and financial costs. Our model demonstrates opportunities for higher discrimination in models incorporating blood-based biomarkers, including estradiol and testosterone, hormones with established cross-laboratory standardization protocols.

Risk prediction models aim to identify women at increased risk of breast cancer who may benefit from targeted screening or prophylactic chemoprevention. For women in the general population who have no indication of carrying a highly predisposing genetic mutation (e.g., BRCA1/2), current prediction models include menstrual and reproductive history, body mass index (BMI), past and current use of oral contraceptives (OC) or postmenopausal hormones (PMH), and basic information on family history and/or previous diagnosis of benign breast disease as predictors (1, 2). In addition, some first models have been developed integrating polygenetic risk scores based on common polymorphisms, and, for women participating in breast cancer screening, models may also include mammographic density (3–8).

Serum hormone concentrations represent potential additional predictors for these models. Endogenous hormones, including androgens, estrogens, insulin-like growth factor I (IGF-I), and prolactin, have been associated with risk in both pre- and postmenopausal breast cancer (9–14), suggesting that selected hormone measurements could be used for improving risk models for the identification of higher risk women. However, only one prior investigation, limited to postmenopausal women, has evaluated whether the addition of circulating hormones to risk prediction models improves discrimination (15).

Here, we present a study from the European Prospective Investigation into Cancer and Nutrition (EPIC) cohort, in which we assessed whether the inclusion of dehydroepiandrosterone sulfate (DHEAS), testosterone, estradiol, estrone, sex hormone–binding globulin (SHBG), IGF-I, IGF-binding protein 3 (IGFBP3), or prolactin could improve risk prediction of invasive breast cancer for pre- and postmenopausal women, as compared with established and validated risk scores from the Breast Cancer Risk Assessment Tool (BCRAT, based on the Gail model; ref. 1) and, on a more restricted subset of data and as a secondary analysis, from the more recently developed model by Pfeiffer and colleagues (2).

Study population

The EPIC cohort has been described in detail previously (16, 17). Briefly, more than 500,000 study participants (367,903 women) were recruited from 10 European countries (Denmark, France, Germany, Greece, Italy, the Netherlands, Norway, Spain, Sweden, and the United Kingdom) between 1992 and 2002. Participants from Sweden and Norway did not contribute data to this analysis. In addition to questionnaire-based data and anthropometric measures, serum samples were collected at baseline using a standardized protocol and were stored at ≤−150°C.

Incident cancer cases are identified via record linkage with regional cancer registries (Denmark, Italy, the Netherlands, Spain, and the United Kingdom), health insurance records, cancer and pathology registries, and active follow-up of study subjects (France, Germany, Greece, and Naples, Italy). Data on vital status are obtained from mortality registries, in combination with data collected by active follow-up.

Case and control selection

Between 1993 and 2010, a total of 10,713 incident cases of invasive breast cancer were identified. Of these, up to 1,590 cases were included in a series of nested case–control studies on the relationship of breast cancer risk with endogenous hormone levels, as reported in detail previously (9, 10, 18–21).

Women with known menopausal status, not using exogenous hormones (i.e., OC or PMH), and with no reported history of cancer (except nonmelanoma skin cancer), were eligible for this study. Women who were 42 years or younger or reported having had regular menses in the last 12 months were classified as premenopausal. Women were classified as postmenopausal when they reported not having any menses over the past 12 months, when older than 55 years of age, or when reporting bilateral oophorectomy. Women older than 42 years and with incomplete information on menopausal status were classified as perimenopausal/unknown menopausal status and were generally excluded from EPIC studies on endogenous hormones. Cases were selected in two study phases. In phase I (through 2004), all eligible cases were included; in phase II, all estrogen receptor–negative (ER) cases, plus an equal number of ER+ cases were randomly selected among cases matching each ER case for recruitment center (after 2004).

Cases were restricted to incident invasive breast cancers diagnosed during follow-up; cases of ductal carcinoma in situ were excluded. Up to two control subjects were matched to each case, using incidence density sampling. Matching factors were study recruitment center, menopausal status (premenopausal, postmenopausal), age at enrolment (±6 months), time of day of blood collection (±1 hour), fasting status (<3 hours; 3–6 hours, >6 hours), and, for premenopausal women, menstrual cycle phase. We restricted the current study to women with available data on at least estradiol, testosterone, SHBG, and IGF-I, resulting in a total study population of 1,217 cases and 1,976 matched controls.

Laboratory analyses

This study includes concentrations of circulating (serum) testosterone, DHEAS, estrone, estradiol, SHBG, androstenedione, progesterone, IGF-I, IGFBP3, and prolactin. Measurement of the biomarkers has been described in detail previously (9, 10, 18–21). For all hormone measurements, blood samples from cases and matched controls were analyzed within the same analytic batch, and laboratory technicians were blinded to the case–control status of the study subjects. The assays were performed at the International Agency for Research into Cancer (IARC; Lyon, France) and at the German Cancer Research Center (DKFZ; Heidelberg, Germany), using commercially available immunoassays. Serum progesterone concentrations were measured only in premenopausal women, as ovarian progesterone synthesis ceases after menopause.

Statistical analyses

For all cases and controls, we calculated 5-year and 10-year risk scores for breast cancer, using established models by Gail (1) and by Pfeiffer (2), and using available data on menopausal status, ages at menarche and at menopause, duration of PMH use, parity, number of children and age at first full-term pregnancy, family history of breast cancer, alcohol consumption at recruitment, and BMI (kg/m2). Information on personal history of breast biopsies and benign breast disease (i.e., hyperplasia) was not available and was set to missing in the estimation of Gail model risk estimates. Given missing data on these two risk factors, we henceforth refer to our Gail model risk estimates as results from a modified Gail model. When information on family history was missing (n = 1,693, 53%), it was set to zero. The Pfeiffer model is designed for women older than 50 years of age; thus, estimates using this model are available for only 60% of our study sample. Given the small number of premenopausal women older than age 50 in our study (n = 45 cases), improvement to this model was evaluated only among women postmenopausal at blood collection. Furthermore, sporadic missing values in the risk factor variables in the Pfeiffer risk model ranging from 0% (BMI) to 6% (age at menopause) led to the exclusion of 92 case sets from the analyses using the Pfeiffer risk score.

All biomarker measurements were log2 transformed. Estrogens and androgens were measured in two study phases (9, 18, 20, 21). To harmonize the measurements from the two study phases, the log2-transformed values from phase II were standardized to the distribution in phase I using mean and SD, by menopausal status. Among women premenopausal at blood collection, we used menstrual cycle phase-specific residuals for estrone, estradiol, and progesterone, calculated as a woman's individual hormone value minus menstrual cycle phase-specific mean value from a local linear regression model. This accounts for within-person variability in these hormones across the menstrual cycle. RR estimates were derived using conditional logistic regression, which was calibrated toward the absolute risk estimates from the prespecified modified Gail and Pfeiffer epidemiologic risk models as an offset variable, thus including the predefined absolute risk score for each participant in the model with a fixed beta of 1. In a single-step backwards elimination process, biomarkers with a P value below 0.157, indicating improvement in the Akaike information criterion (22), were retained in the model. Biomarker concentration availability did not overlap among all cases and controls (i.e., different subsets of biomarkers were available for subsets of the study population). Therefore, we evaluated one biomarker from each pathway in these analyses (i.e., androgens: testosterone; estrogens: estradiol; growth factors: IGF-I; SHBG), to achieve the maximum sample size with overlapping biomarker values. Testosterone and estradiol are both part of the United States Centers for Disease Control and Prevention (CDC) Hormone Standardization Program, designed to ensure comparability of results across laboratories (www.cdc.gov/labstandards/hs.html). Thus, these markers in particular are attractive candidates as circulating markers to be included in clinical risk prediction models.

Improvement in risk estimation was assessed with concordance statistic (C-statistic; equivalent to the area under the receiver operating curve) for the effect of the biomarkers alone, and in terms of change in C-statistic from addition of biomarkers to the calculated absolute risk estimates. We assessed the increase in average risk difference between cases and noncases with the integrated discrimination improvement (IDI) and evaluated the frequency of improvement of prediction with the net reclassification improvement (NRI; continuous).

Internal validation with bootstrapping (1,000-fold) was applied to adjust for overfitting from model development and estimation. The mean “optimism” estimate for the C-statistics, IDI, and NRI was subtracted from the observed estimates, and 2.5 and 97.5 percentiles of the bootstrap estimates are presented as 95% confidence limits. Analyses were performed separately for women pre- and postmenopausal at blood collection. In addition, we also conducted analyses considering only ER+ breast cancer cases. All analyses were conducted in SAS v9.3.

This study included a total of 1,217 breast cancer cases (430 pre- and 787 postmenopausal at the time of blood donation) with calculated 5-year (and 10-year) risk scores from the modified Gail model and measurements of testosterone, estradiol, SHBG, and IGF-I. A total of 661 cases contributed to analyses using the Pfeiffer model (restricted to postmenopausal women >50 years of age at blood collection).

Women were median age 56 years (range, 26–77) at blood collection, and the majority had at least one full-term pregnancy (cases, 83%; controls, 86%), with median two full-term pregnancies (range, 1–14) among parous women (Table 1). The median-predicted 5- and 10-year risks as calculated by the modified Gail and Pfeiffer models were only slightly higher among the breast cancer cases as compared with the controls, and the modified Gail and Pfeiffer risk scores provided only weak discrimination between cases and controls (C-statistics, postmenopausal women, modified Gail, 52.7%; Pfeiffer, 53.8%). Median concentrations for each of the evaluated biomarkers are presented in Table 2.

Table 1.

Description of epidemiologic risk factors and risk score estimates in cases and matched controls, presented as n (%) or median (min–max): EPIC breast cancer nested case–control study

Cases (n = 1,217)Controls (n = 1,976)
Age at recruitment, years 56.2 (26.6–76.4) 56.3 (26.4–76.8) 
BMI, kg/m2 25.3 (16.6–46.1) 25.2 (16.0–45.3) 
Age at menarche, years 13 (≤8; ≥20) 13 (≤8;≥20) 
Ever full-term pregnancy 1,016 (83%) 1,699 (86%) 
Age at first full-term pregnancy, yearsa 25.0 (16.0–44.0) 25.0 (16.0–43.0) 
Number of full-term pregnanciesa 2.0 (1.0–9.0) 2.0 (1.0–14.0) 
Menopausal status at blood collection 
 Premenopausal 430 (35%) 684 (35%) 
 Postmenopausal 755 (62%) 1,233 (62%) 
 Surgical postmenopausal 32 (3%) 59 (3%) 
Age at menopause (among postmenopausal), years 50 (15–59) 50 (24–62) 
Family history of breast cancer 
 Yes 87 (7%) 87 (4%) 
 No 466 (38%) 860 (44%) 
 Unknown 664 (55%) 1,029 (52%) 
Alcohol consumption, baseline 
 Nondrinker 246 (20%) 443 (22%) 
 0–6 g/d 489 (40%) 831 (42%) 
 >6 g/d 482 (40%) 701 (35%) 
5-year breast cancer risk estimates 
 Modified Gail model 1.68% (0.07–4.59) 1.64% (0.07–4.54) 
 Pfeiffer modelb 1.53% (0.72–3.55) 1.47% (0.66–3.86) 
10-year breast cancer risk estimates 
 Modified Gail model 3.52% (0.26–8.43) 3.46% (0.25–8.43) 
 Pfeiffer model 3.13% (1.59–6.96) 3.02% (1.46–7.10) 
Tumor ER statusc 
 Positive 560 (46%)  
 Negative 242 (20%)  
Cases (n = 1,217)Controls (n = 1,976)
Age at recruitment, years 56.2 (26.6–76.4) 56.3 (26.4–76.8) 
BMI, kg/m2 25.3 (16.6–46.1) 25.2 (16.0–45.3) 
Age at menarche, years 13 (≤8; ≥20) 13 (≤8;≥20) 
Ever full-term pregnancy 1,016 (83%) 1,699 (86%) 
Age at first full-term pregnancy, yearsa 25.0 (16.0–44.0) 25.0 (16.0–43.0) 
Number of full-term pregnanciesa 2.0 (1.0–9.0) 2.0 (1.0–14.0) 
Menopausal status at blood collection 
 Premenopausal 430 (35%) 684 (35%) 
 Postmenopausal 755 (62%) 1,233 (62%) 
 Surgical postmenopausal 32 (3%) 59 (3%) 
Age at menopause (among postmenopausal), years 50 (15–59) 50 (24–62) 
Family history of breast cancer 
 Yes 87 (7%) 87 (4%) 
 No 466 (38%) 860 (44%) 
 Unknown 664 (55%) 1,029 (52%) 
Alcohol consumption, baseline 
 Nondrinker 246 (20%) 443 (22%) 
 0–6 g/d 489 (40%) 831 (42%) 
 >6 g/d 482 (40%) 701 (35%) 
5-year breast cancer risk estimates 
 Modified Gail model 1.68% (0.07–4.59) 1.64% (0.07–4.54) 
 Pfeiffer modelb 1.53% (0.72–3.55) 1.47% (0.66–3.86) 
10-year breast cancer risk estimates 
 Modified Gail model 3.52% (0.26–8.43) 3.46% (0.25–8.43) 
 Pfeiffer model 3.13% (1.59–6.96) 3.02% (1.46–7.10) 
Tumor ER statusc 
 Positive 560 (46%)  
 Negative 242 (20%)  

aAmong parous women.

bData available for 1,041 controls (53%) and 661 cases (54%).

cMissing for 34% of cases; percentages reflect distribution among all cases. Through 2004, all cases included; after 2004, all ER cases plus an equal number of ER+ cases.

Table 2.

Distribution of circulating biomarker concentrations in cases and controls, by menopausal status at blood collection: EPIC breast cancer nested case–control study

Premenopausal at blood collectionaPostmenopausal at blood collection
CasesControlsCasesControls
Hormone n Median (5th–95th percentile) n Median (5th–95th percentile) N Median (5th–95th percentile) n Median (5th–95th percentile) 
 Testosterone (ng/mL)b 430 0.46 (0.16–1.11) 684 0.45 (0.16–0.96) 787 0.39 (0.11–1.01) 1,292 0.35 (0.09–0.93) 
 Androstenedione (ng/mL) 270 1.52 (0.55–4.08) 516 1.44 (0.45–3.38) 561 0.96 (0.29–2.84) 1,057 0.87 (0.25–2.53) 
 DHEAS (μg/dL) 270 127.0 (37.1–352.8) 516 123.8 (32.4–298) 559 84.5 (19.0–281.8) 1,061 73.3 (17.19–243.6) 
 SHBG (nmol/L)b 430 48.01 (15.33–124.32) 684 45.82 (14.9 – 115) 787 32.6 (10.31–94.2) 1,292 34.6 (9.9–95.6) 
 Estrone (E1; pg/mL) 269 96.34 (28.77–297.53) 508 98.22 (28.6–294.9) 535 42.4 (16.4–108.1) 1,036 39.7 (15.1–83.2) 
 Estradiol (E2; pg/mL)b 430 102.9 (27.1–404.0) 678 97.89 (17.5–390.5) 787 25.5 (12.42 – 84.2) 1,292 23.3 (10.7–65.2) 
 Progesterone (ng/mL)b 405 5.08 (0.3–37.8) 635 5.38 (0.28–49.7)     
 IGF-I (ng/mL)b 430 273.5 (142.6–430.1) 684 272.2 (140–425) 787 218.0 (102.7–395.7) 1,292 211.9 (107.4–375.1) 
 IGF-BP3 (ng/mL) 258 3,229 (1,814–6,462) 490 3,227, (1,738–6,281) 550 3,323 (1,532–6,692) 1,024 3,245 (1,690–6,525) 
 Prolactin (ng/mL) 268 8.74 (3.77–27.3) 257 9.35 (3.92–25.0) 500 5.88 (2.87–18.88) 483 5.69 (2.93–14.49) 
Premenopausal at blood collectionaPostmenopausal at blood collection
CasesControlsCasesControls
Hormone n Median (5th–95th percentile) n Median (5th–95th percentile) N Median (5th–95th percentile) n Median (5th–95th percentile) 
 Testosterone (ng/mL)b 430 0.46 (0.16–1.11) 684 0.45 (0.16–0.96) 787 0.39 (0.11–1.01) 1,292 0.35 (0.09–0.93) 
 Androstenedione (ng/mL) 270 1.52 (0.55–4.08) 516 1.44 (0.45–3.38) 561 0.96 (0.29–2.84) 1,057 0.87 (0.25–2.53) 
 DHEAS (μg/dL) 270 127.0 (37.1–352.8) 516 123.8 (32.4–298) 559 84.5 (19.0–281.8) 1,061 73.3 (17.19–243.6) 
 SHBG (nmol/L)b 430 48.01 (15.33–124.32) 684 45.82 (14.9 – 115) 787 32.6 (10.31–94.2) 1,292 34.6 (9.9–95.6) 
 Estrone (E1; pg/mL) 269 96.34 (28.77–297.53) 508 98.22 (28.6–294.9) 535 42.4 (16.4–108.1) 1,036 39.7 (15.1–83.2) 
 Estradiol (E2; pg/mL)b 430 102.9 (27.1–404.0) 678 97.89 (17.5–390.5) 787 25.5 (12.42 – 84.2) 1,292 23.3 (10.7–65.2) 
 Progesterone (ng/mL)b 405 5.08 (0.3–37.8) 635 5.38 (0.28–49.7)     
 IGF-I (ng/mL)b 430 273.5 (142.6–430.1) 684 272.2 (140–425) 787 218.0 (102.7–395.7) 1,292 211.9 (107.4–375.1) 
 IGF-BP3 (ng/mL) 258 3,229 (1,814–6,462) 490 3,227, (1,738–6,281) 550 3,323 (1,532–6,692) 1,024 3,245 (1,690–6,525) 
 Prolactin (ng/mL) 268 8.74 (3.77–27.3) 257 9.35 (3.92–25.0) 500 5.88 (2.87–18.88) 483 5.69 (2.93–14.49) 

NOTE: Values in bold signify markers used in multibiomarker risk prediction model.

aFor premenopausal women, estradiol, estrone, and progesterone calibrated to day 21 of menstrual cycle phase, study phase I.

bHormones measured in two study phases. Phase II values calibrated to mean and SD observed in measurements from phase I.

Testosterone, androstenedione, and DHEAS were significantly associated with risk among premenopausal women in models adding the serum hormones individually to the modified Gail risk score as offset. Among this subgroup, significant ORs for a 1-unit change in log2-transformed hormone concentration ranged from 1.24 [95% confidence interval (CI), 1.01–1.53] for DHEAS to 1.33 (1.08–1.64) for testosterone (Table 3). In premenopausal women, the addition of single hormone concentrations to the calculated modified Gail model epidemiologic risk score improved discrimination (quantified by change in C-statistic) up to 2.3 percentage points (androstenedione). Among postmenopausal women, testosterone, androstenedione, DHEAS, SHBG, estrone, estradiol, and IGF-I were significantly associated with breast cancer risk. Statistically significant OR estimates ranged from 1.24 for both androstenedione (95% CI, 1.09–1.42) and DHEAS (95% CI, 1.11–1.40) to 1.56 (95% CI, 1.32–1.84) for estradiol and 1.66 (95% CI, 1.33–2.06) for estrone. SHBG was significantly inversely associated with postmenopausal breast cancer risk (OR, 0.86; 95% CI, 0.77–0.96). The associations for ER+ disease were somewhat stronger than those observed for total breast cancer. In this sample of women not using exogenous hormones at the time of blood donation, prolactin was not associated with breast cancer risk. In postmenopausal women, adding single serum hormone measurements improved discrimination up to 4.5 percentage points for breast cancer overall (estradiol). Results were similar in models using the Pfeiffer risk score as offset (restricted to postmenopausal women; Supplementary Table S1).

Table 3.

Associations between candidate biomarkers and breast cancer risk by menopausal status at blood collection, using 5-year risk from modified Gail model as a regression offset: EPIC breast cancer nested case–control study

Premenopausal at blood collectionPostmenopausal at blood collection
Case setsPORa (95% CI)ΔCbCase setsPORa (95% CI)ΔCb
All breast cancer cases 
 Testosterone 430 0.0073 1.33 (1.08–1.64) 1.0 787 1.88E−07 1.40 (1.24–1.59) 3.7 
 Androstenedione 270 0.0209 1.31 (1.04–1.65) 2.3 561 0.0014 1.24 (1.09–1.42) 1.9 
 DHEAS 270 0.0370 1.24 (1.01–1.53) 1.3 559 0.0003 1.24 (1.11–1.40) 2.8 
 SHBG 430 0.7617 0.97 (0.81–1.17) 0 787 0.0074 0.86 (0.77–0.96) 1.3 
 Estrone (E1) 269 0.4308 1.09 (0.89–1.33) 535 4.80E−06 1.66 (1.33–2.06) 3.1 
 Estradiol (E2) 430 0.2473 1.08 (0.95–1.22) 0.1 787 2.32E−07 1.56 (1.32–1.84) 4.5 
 Progesterone 405 0.3740 0.97 (0.89–1.04)     
 IGF-I 430 0.8359 1.04 (0.74–1.46) 0.1 787 0.0454 1.25 (1.00–1.55) 0.9 
 IGF-BP3 258 0.4826 1.24 (0.68–2.27) 0.7 550 0.2517 1.19 (0.88–1.60) 0.5 
 Prolactin 268 0.9497 1.01 (0.76–1.34) 0 500 0.2327 1.13 (0.92–1.40) 0.2 
ER+ breast cancer cases 
 Testosterone 197 0.0583 1.35 (0.99–1.85) 1.7 363 <0.0001 1.49 (1.23–1.80) 5.0 
 Androstenedione 113 0.0651 1.39 (0.98–1.96) 4.1 251 0.0022 1.37 (1.12–1.68) 4.2 
 DHEAS 113 0.0378 1.41 (1.02–1.95) 4.3 251 0.0014 1.36 (1.13–1.64) 5.3 
 SHBG 197 0.5623 0.92 (0.70–1.22) 0.2 363 0.1154 0.87 (0.74–1.03) 1.4 
 Estrone (E1) 112 0.2411 1.21 (0.88–1.65) −0.0 239 0.0065 1.58 (1.14–2.20) 2.4 
 Estradiol (E2) 197 0.0719 1.19 (0.98–1.44) 1.7 363 <0.0001 1.71 (1.31–2.22) 6.3 
 Progesterone 186 0.2943 0.94 (0.84–1.06) 0.6     
 IGF-I 197 0.2674 1.35 (0.79–2.29) 1.8 363 0.0906 1.31 (0.96–1.80) 1.8 
 IGF-BP3 110 0.2230 1.77 (0.71–4.46) 3.8 244 0.3986 1.22 (0.77–1.94) 0.8 
 Prolactin 191 0.7907 0.96 (0.68–1.34) 0.4 335 0.1010 1.25 (0.96–1.63) 1.0 
Premenopausal at blood collectionPostmenopausal at blood collection
Case setsPORa (95% CI)ΔCbCase setsPORa (95% CI)ΔCb
All breast cancer cases 
 Testosterone 430 0.0073 1.33 (1.08–1.64) 1.0 787 1.88E−07 1.40 (1.24–1.59) 3.7 
 Androstenedione 270 0.0209 1.31 (1.04–1.65) 2.3 561 0.0014 1.24 (1.09–1.42) 1.9 
 DHEAS 270 0.0370 1.24 (1.01–1.53) 1.3 559 0.0003 1.24 (1.11–1.40) 2.8 
 SHBG 430 0.7617 0.97 (0.81–1.17) 0 787 0.0074 0.86 (0.77–0.96) 1.3 
 Estrone (E1) 269 0.4308 1.09 (0.89–1.33) 535 4.80E−06 1.66 (1.33–2.06) 3.1 
 Estradiol (E2) 430 0.2473 1.08 (0.95–1.22) 0.1 787 2.32E−07 1.56 (1.32–1.84) 4.5 
 Progesterone 405 0.3740 0.97 (0.89–1.04)     
 IGF-I 430 0.8359 1.04 (0.74–1.46) 0.1 787 0.0454 1.25 (1.00–1.55) 0.9 
 IGF-BP3 258 0.4826 1.24 (0.68–2.27) 0.7 550 0.2517 1.19 (0.88–1.60) 0.5 
 Prolactin 268 0.9497 1.01 (0.76–1.34) 0 500 0.2327 1.13 (0.92–1.40) 0.2 
ER+ breast cancer cases 
 Testosterone 197 0.0583 1.35 (0.99–1.85) 1.7 363 <0.0001 1.49 (1.23–1.80) 5.0 
 Androstenedione 113 0.0651 1.39 (0.98–1.96) 4.1 251 0.0022 1.37 (1.12–1.68) 4.2 
 DHEAS 113 0.0378 1.41 (1.02–1.95) 4.3 251 0.0014 1.36 (1.13–1.64) 5.3 
 SHBG 197 0.5623 0.92 (0.70–1.22) 0.2 363 0.1154 0.87 (0.74–1.03) 1.4 
 Estrone (E1) 112 0.2411 1.21 (0.88–1.65) −0.0 239 0.0065 1.58 (1.14–2.20) 2.4 
 Estradiol (E2) 197 0.0719 1.19 (0.98–1.44) 1.7 363 <0.0001 1.71 (1.31–2.22) 6.3 
 Progesterone 186 0.2943 0.94 (0.84–1.06) 0.6     
 IGF-I 197 0.2674 1.35 (0.79–2.29) 1.8 363 0.0906 1.31 (0.96–1.80) 1.8 
 IGF-BP3 110 0.2230 1.77 (0.71–4.46) 3.8 244 0.3986 1.22 (0.77–1.94) 0.8 
 Prolactin 191 0.7907 0.96 (0.68–1.34) 0.4 335 0.1010 1.25 (0.96–1.63) 1.0 

NOTE: Values in bold signify markers used in multibiomarker risk prediction model.

aEffects presented as per doubling of concentration (from log2-transformed measurements).

bΔC represents change in C-statistic comparing model with offset for breast cancer risk score alone to one with risk score plus individual biomarker; expressed here as percentage point difference.

We next carried out a backward elimination step with testosterone, estradiol, IGF-I, and SHBG as candidates for a multibiomarker risk prediction model. Testosterone was selected by backward elimination for the prediction of premenopausal risk; estradiol, testosterone, and SHBG were selected for postmenopausal women in models using the modified Gail score (Table 4) and Pfeiffer score (Supplementary Table S2) as offset.

Table 4.

Estimated biomarker effects on risk of breast cancer, per doubling of hormone concentration, by menopausal status at blood collection: EPIC breast cancer nested case–control study

Premenopausal at blood collectionPostmenopausal at blood collection
Full modelSelectedFull modelSelected
OR (95% CI)OR (95% CI)OR (95% CI)OR (95% CI)
Testosterone 1.31 (1.06–1.62) 1.33 (1.08–1.64) 1.25 (1.08–1.45) 1.26 (1.09–1.45) 
Estradiol (E2) 1.06 (0.91–1.23)  1.31 (1.08–1.58) 1.31 (1.08–1.58) 
SHBG 0.98 (0.81–1.19)  0.91 (0.81–1.01) 0.90 (0.81–1.00) 
IGF-I 0.99 (0.70–1.41)  1.12 (0.89–1.40)  
Premenopausal at blood collectionPostmenopausal at blood collection
Full modelSelectedFull modelSelected
OR (95% CI)OR (95% CI)OR (95% CI)OR (95% CI)
Testosterone 1.31 (1.06–1.62) 1.33 (1.08–1.64) 1.25 (1.08–1.45) 1.26 (1.09–1.45) 
Estradiol (E2) 1.06 (0.91–1.23)  1.31 (1.08–1.58) 1.31 (1.08–1.58) 
SHBG 0.98 (0.81–1.19)  0.91 (0.81–1.01) 0.90 (0.81–1.00) 
IGF-I 0.99 (0.70–1.41)  1.12 (0.89–1.40)  

NOTE: Estimates adjusted for modified Gail risk score from common model (i.e. mutually adjusted) and in selected model.

Among women premenopausal at blood collection, we observed no improvement in discrimination, relative to the modified Gail model alone, in the full multibiomarker model or the selected model, including testosterone, for overall breast cancer or ER+ disease (Table 5). Discrimination for ER+ disease in premenopausal women was not improved when the model was refit among cases with ER+ tumors and their matched controls (four candidate hormones, change in discrimination: 1.7 percentage points, IDI: 0.0%, NRI: 11%). In contrast, among postmenopausal women and for overall breast cancer, adding these four hormone measurements to prediction models improved the C-statistic by 5.6 percentage points relative to the modified Gail score (correcting for optimism due to overfitting). The IDI indicates an average increase in risk difference of 0.18%, and the NRI shows improved prediction in 16% of the women. Results were similar in the model including the three selected biomarkers (relative to modified Gail score: 5.3 percentage point improvement in discrimination, IDI of 0.17% and NRI of 16%). Applying the model to ER+ disease, discrimination was improved by 7.5 percentage points in the full model, and 7.2 percentage points in the selected model, compared with the modified Gail score alone. When the model was refit among cases with ER+ tumors and their matched controls, testosterone and estradiol were selected into the prediction model with OR effects of 1.20 and 1.17, respectively, in postmenopausal women. Model discrimination improved by 7.0 percentage points (0.18% IDI, 18% NRI) for the four hormones together, and by 6.7 percentage points (0.17% IDI, 19% NRI) for the selected model, after adjustment for overoptimism.

Table 5.

Performance of full and selected models by menopausal status at blood collection in terms of C-statistic, IDI, and NRI (continuous), each with 95% CI and optimism correction from 1,000 bootstrap samples: EPIC breast cancer nested case–control study

%C (95% CI)IDI (95% CI)NRI (95% CI)
Premenopausal at blood collection
Modified Gail model 52.4 (48.9–55.8)   
Hormones only 52.6 (49.1–56.1)   
Full model, all cases 53.4 (49.9–56.9)   
 Improvement 1.0 0.0002 (−0.0001–0.0004) 0.05 (−0.07–0.17) 
 Optimisma 1.1 (−0.6–3.8) 0.0002 (−0.0001–0.0008) 0.04 (−0.06–0.16) 
 Corrected improvementa,b −0.1 (−0.3–2.0) 0.0000 (0.0000–0.0006) 0.01 (−0.04–0.16) 
Full model, ER+ cases 53.3 (48.1–58.8)   
 Improvement 2.0 0.0002 (−0.0002–0.0006) 0.03 (−0.15–0.21) 
 Optimisma 1.5 (−1.5–5.6) 0.0003 (−0.0002–0.0011) 0.07 (−0.09–0.22) 
 Corrected improvementa,b 0.5 (−0.8–3.7) 0.0000 (−0.0001–0.0007) −0.04 (−0.13–0.18) 
Selected model, all cases 53.3 (49.9–56.8)   
 Improvement 1.0 0.0002 (−0.0001–0.0004) 0.04 (−0.08–0.16) 
 Optimisma 0.9 (−0.8–3.7) 0.0002 (−0.0001–0.0008) 0.04 (−0.08–0.15) 
 Corrected improvementa,b 0.0 (−0.6–1.9) 0.0000 (0.0000–0.0006) 0.00 (−0.03–0.14) 
Selected model, ER+ cases 53.0 (47.8–58.1)   
 Improvement 1.6 0.0002 (−0.0002–0.0006) 0.06 (−0.11–0.24) 
 Optimisma 1.3 (−1.8–5.4) 0.0002 (−0.0002–0.0010) 0.06 (−0.11–0.22) 
 Corrected improvementa,b 0.3 (−0.7–3.7) 0.0000 (−0.0001–0.0006) 0.00 (−0.10–0.13) 
 Postmenopausal at blood collection 
Modified Gail model 52.7 (50.1–55.2)   
Hormones only 57.7 (55.2–60.2)   
Full model, all cases 58.8 (56.3–61.3)   
 Improvement 6.1 0.0021 (0.0013–0.0028) 0.17 (0.09–0.26) 
 Optimisma 0.5 (−1.6–2.9) 0.0003 (−0.0005–0.0017) 0.01 (−0.06–0.09) 
 Corrected improvementa,b 5.6 (4.7–6.2) 0.0018 (0.0017–0.0039) 0.16 (0.13–0.21) 
Full model, ER+ cases 60.1 (56.4–63.7)   
 Improvement 7.7 0.0020 (0.0010–0.0031) 0.19 (0.06–0.33) 
 Optimisma 0.2 (−2.9–3.7) 0.0002 (−0.0010–0.0019) 0.00 (−0.12–0.12) 
 Corrected improvementa,b 7.5 (6.3–8.1) 0.0019 (0.0017–0.0037) 0.19 (0.13–0.26) 
Selected model, all cases 58.5 (56.0–61.0)   
 Improvement 5.8 0.0021 (0.0013–0.0028) 0.17 (0.08–0.26) 
 Optimisma 0.5 (−1.5–2.9) 0.0003 (−0.0005–0.0017) 0.02 (−0.06–0.09) 
 Corrected improvementa,b 5.3 (4.2–6.2) 0.0017 (0.0017–0.0038) 0.16 (0.12–0.21) 
Selected model, ER+ cases 59.7 (56.1–63.4)   
 Improvement 7.4 0.0020 (0.0010–0.0030) 0.21 (0.08–0.34) 
 Optimisma 0.3 (−3.0–3.8) 0.0002 (−0.0010–0.0020) 0.00 (−0.12–0.13) 
 Corrected improvementa,b 7.2 (5.3–8.1) 0.0018 (0.0016–0.0037) 0.21 (0.13–0.26) 
%C (95% CI)IDI (95% CI)NRI (95% CI)
Premenopausal at blood collection
Modified Gail model 52.4 (48.9–55.8)   
Hormones only 52.6 (49.1–56.1)   
Full model, all cases 53.4 (49.9–56.9)   
 Improvement 1.0 0.0002 (−0.0001–0.0004) 0.05 (−0.07–0.17) 
 Optimisma 1.1 (−0.6–3.8) 0.0002 (−0.0001–0.0008) 0.04 (−0.06–0.16) 
 Corrected improvementa,b −0.1 (−0.3–2.0) 0.0000 (0.0000–0.0006) 0.01 (−0.04–0.16) 
Full model, ER+ cases 53.3 (48.1–58.8)   
 Improvement 2.0 0.0002 (−0.0002–0.0006) 0.03 (−0.15–0.21) 
 Optimisma 1.5 (−1.5–5.6) 0.0003 (−0.0002–0.0011) 0.07 (−0.09–0.22) 
 Corrected improvementa,b 0.5 (−0.8–3.7) 0.0000 (−0.0001–0.0007) −0.04 (−0.13–0.18) 
Selected model, all cases 53.3 (49.9–56.8)   
 Improvement 1.0 0.0002 (−0.0001–0.0004) 0.04 (−0.08–0.16) 
 Optimisma 0.9 (−0.8–3.7) 0.0002 (−0.0001–0.0008) 0.04 (−0.08–0.15) 
 Corrected improvementa,b 0.0 (−0.6–1.9) 0.0000 (0.0000–0.0006) 0.00 (−0.03–0.14) 
Selected model, ER+ cases 53.0 (47.8–58.1)   
 Improvement 1.6 0.0002 (−0.0002–0.0006) 0.06 (−0.11–0.24) 
 Optimisma 1.3 (−1.8–5.4) 0.0002 (−0.0002–0.0010) 0.06 (−0.11–0.22) 
 Corrected improvementa,b 0.3 (−0.7–3.7) 0.0000 (−0.0001–0.0006) 0.00 (−0.10–0.13) 
 Postmenopausal at blood collection 
Modified Gail model 52.7 (50.1–55.2)   
Hormones only 57.7 (55.2–60.2)   
Full model, all cases 58.8 (56.3–61.3)   
 Improvement 6.1 0.0021 (0.0013–0.0028) 0.17 (0.09–0.26) 
 Optimisma 0.5 (−1.6–2.9) 0.0003 (−0.0005–0.0017) 0.01 (−0.06–0.09) 
 Corrected improvementa,b 5.6 (4.7–6.2) 0.0018 (0.0017–0.0039) 0.16 (0.13–0.21) 
Full model, ER+ cases 60.1 (56.4–63.7)   
 Improvement 7.7 0.0020 (0.0010–0.0031) 0.19 (0.06–0.33) 
 Optimisma 0.2 (−2.9–3.7) 0.0002 (−0.0010–0.0019) 0.00 (−0.12–0.12) 
 Corrected improvementa,b 7.5 (6.3–8.1) 0.0019 (0.0017–0.0037) 0.19 (0.13–0.26) 
Selected model, all cases 58.5 (56.0–61.0)   
 Improvement 5.8 0.0021 (0.0013–0.0028) 0.17 (0.08–0.26) 
 Optimisma 0.5 (−1.5–2.9) 0.0003 (−0.0005–0.0017) 0.02 (−0.06–0.09) 
 Corrected improvementa,b 5.3 (4.2–6.2) 0.0017 (0.0017–0.0038) 0.16 (0.12–0.21) 
Selected model, ER+ cases 59.7 (56.1–63.4)   
 Improvement 7.4 0.0020 (0.0010–0.0030) 0.21 (0.08–0.34) 
 Optimisma 0.3 (−3.0–3.8) 0.0002 (−0.0010–0.0020) 0.00 (−0.12–0.13) 
 Corrected improvementa,b 7.2 (5.3–8.1) 0.0018 (0.0016–0.0037) 0.21 (0.13–0.26) 

a95% CIs from bootstrapping.

bStatistics from improvement minus optimism.

Improvement in discrimination was similar in models using the Pfeiffer score as offset, although somewhat weaker (all cases: improvement in discrimination, percentage points, all markers: 3.9; selected markers: 3.4; ER+ cases, all markers: 4.9; selected markers: 4.8; Supplementary Table S3).

For postmenopausal women, the inclusion of BMI (continuous) as an additional predictor to the modified Gail model did not improve model prediction and was not estimated as a significant effect (C, 58.9; P value for BMI estimate, 0.19); however, when BMI was added as a predictor, SHBG was no longer selected into the model (P = 0.26).

We included prolactin concentrations on 465 postmenopausal and 234 premenopausal cases and their matched controls in a secondary analysis. Among postmenopausal women, results from the risk prediction models were similar in models, including and excluding prolactin as a candidate biomarker. However, among premenopausal women, no biomarker was selected into the risk prediction model when prolactin was included as a candidate biomarker (data not shown). We evaluated improvements in discrimination based on 10-year risk estimates; results were similar to those presented for 5-year risk estimates.

Measurements of endogenous hormones significantly improved risk discrimination among postmenopausal women not using PMH. Relative to the modified Gail model alone, the selected model including estradiol, testosterone, and SHBG concentrations improved discrimination in terms of C-statistic by 5.3 percentage points for breast cancer overall and 7.2 percentage points for ER+ disease, after correcting for overfitting. Improvement in discrimination was similar when hormone concentrations were added to the Pfeiffer model (improvement in discrimination, percentage points, all cases, 3.4; ER+ cases, 4.8). The improvement in discrimination observed among postmenopausal women in the current study is similar in magnitude to that reported for polygenic risk scores, or mammographic density patterns (3–7). We observed no improvement in discrimination for women premenopausal at blood collection.

We included three measures of prediction model performance in this study: (i) the C-statistic, which provides a summary measure of the discriminatory capacity of a risk prediction model; (ii) the IDI, describing the change in the difference in absolute risk estimates between subsequent cases and controls from the modified Gail (or Pfeiffer) model alone and absolute risk estimates from models additionally including hormones; and (iii) the continuous NRI, showing the proportion of the population in which the model more accurately predicts absolute risk estimates in subsequent cases and controls, before as compared with after including hormones. Although all three measures are clearly related, they each provide a different perspective on the extent to which the models including hormones improve discrimination in the study population, relative to the risk factor models alone. Comparing absolute risk estimates from the modified Gail model alone with those from the model including hormones, the IDI indicates the mean difference in absolute risk estimates of women subsequently diagnosed as cases versus controls significantly increased by 0.17–0.19 percentage points in postmenopausal women (e.g., modified Gail model alone, 0.04 percentage point difference; additionally including hormones, 0.04 + 0.18 improvement = 0.22 percentage point difference). According to the NRI, after inclusion of hormones in the model, improved prediction discrimination was observed in 16% to 20% of the study population, with correctly increased risk estimates for future cases and decreased estimates for controls (i.e., improvement over and above potential changes in the wrong direction, regardless of magnitude of change).

Our findings confirm results from a recent study by Tworoger and colleagues in the Nurses' Health Study (15). Tworoger and colleagues observed a 5.9 percentage point improvement in discrimination for invasive breast cancer, relative to the modified Gail model, in the selected model with estrone sulfate, testosterone, and prolactin. Furthermore, as was observed in our study, Tworoger and colleagues observed somewhat stronger discrimination for ER+ disease (change in AUC 8.8 percentage points, relative to modified Gail model) in the selected models. In contrast to the previous study, prolactin was not selected into our model. However, these data were only available on a subset of our study population. Furthermore, circulating estrone sulfate concentrations were not available for cases and controls in our study. However, estrogens estrone and estradiol were considered.

Addition of hormones resulted in stronger improvement of risk prediction estimates from the modified Gail model versus the Pfeiffer model. At baseline, the differences in 5-year absolute risk estimates between subsequent cases and controls were larger in the Pfeiffer model (0.07 percentage points) than in the modified Gail model (0.04 percentage points). This is due, in part, to the standard inclusion of BMI in the Pfeiffer model. BMI is known to be a significant determinant of serum SHBG and sex hormones (particularly estrogens), in postmenopausal women (23–27). However, consistent with the previous study by Tworoger and colleagues (15), improvements in risk prediction using the modified Gail model were robust to adjustment for BMI.

Present recommendations for identifying women at sufficiently high risk to benefit from chemoprevention (28) include reference to the BCRAT originally developed by Gail and colleagues (1), with the aim to reduce costs not only in terms of financial expense, but also to optimize expected medical benefits against possible negative side effects (e.g., increased risk of endometrial cancer). Breast cancer risk prediction models are increasingly being considered in the context of mammography screening (29), given recent data suggesting screening may have limited benefit for some women (30), and the need to balance potential benefits of screening against possible negative side effects, such as overdiagnosis or false-positive findings and financial costs. Our model demonstrates opportunities for higher discrimination in models incorporating blood-based biomarkers, including estradiol and testosterone, hormones with established cross-laboratory standardization protocols (31). However, a cost–benefit analysis is required to determine the potential net value of implementing blood-based biomarkers in the screening context.

The median-predicted 5- and 10-year risks as calculated by the modified Gail and Pfeiffer models were only slightly higher among the breast cancer cases as compared with the controls, and the modified Gail and Pfeiffer risk scores provided only weak discrimination between cases and controls. This weak discrimination, as compared with reports from full cohort analyses, can be explained by the matching of breast cancer case and control subjects by a number of key predictors in the risk scores, including age and menopausal status, and by the exclusive focus on women not using exogenous hormones at the time of blood donation, as PMH use is another factor adding to risk stratification. A limitation of this study, and to some extent the prior investigation by Tworoger and colleagues (15), is that we used a modified Gail model because of a lack of data on selected risk factors. Data for family history of breast cancer and history of benign breast disease are lacking in the EPIC cohort. Furthermore, we used data from a single biomarker measurement in these analyses; blood samples were collected up to 14 years prior to diagnosis (median 4 years). However, the within-person stability of these markers over time has been demonstrated previously (32–34). Furthermore, the investigation by Tworoger and colleagues included the average of two hormone measurements from samples taken 10 years apart for select hormones, and for women diagnosed after the second blood draw. We observed similar improvements in discrimination in our study using one blood sample, suggesting that one measurement may be sufficient to predict longer term risk. Future studies should evaluate whether discrimination may be further improved by adding mammographic density, among women already part of screening programs, as well as genetic markers. These data were not available in sufficient numbers for the cases and controls included in this investigation. Tumor ER status was missing for 34% of cases. However, cases with and without ER status data were similar with respect to baseline absolute risk estimates, and therefore, we do not expect this to have introduced bias into our results.

In summary, we confirm the improvement in the discriminatory capacity of risk prediction models with the addition of hormone concentrations, among postmenopausal women not using PMH at blood collection. Integration of hormone measurements in clinical risk prediction models may represent a strategy to improve risk stratification for breast cancer.

No potential conflicts of interest were disclosed.

Conception and design: K. Overvad, A. Tjønneland, G. Severi, R. Tumino, H.B. Bueno de Mesquita, P.H. Peeters, M.-J. Sánchez, A. Barricarte, I. Tzoulaki, R. Kaaks

Development of methodology: A. Tjønneland, M.-D. Chirlaque, P. Ferrari

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): T. Kühn, K. Overvad, M.-C. Boutron-Ruault, G. Severi, H. Boeing, A. Trichopoulou, V. Benetou, G. Masala, V. Pala, R. Tumino, S. Panico, H.B. Bueno de Mesquita, P.H. Peeters, J. Ramón Quirós, A. Agudo, M.-J. Sánchez, M.-D. Chirlaque, A. Barricarte, P. Amiano, K.-T. Khaw, R.C. Travis, I. Tzoulaki, R. Kaaks

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): A. Hüsing, R.T. Fortner, H. Boeing, V. Benetou, C.H. van Gills, A. Agudo, M.-J. Sánchez, P. Ferrari, I. Tzoulaki, R. Kaaks

Writing, review, and/or revision of the manuscript: A. Hüsing, R.T. Fortner, T. Kühn, K. Overvad, A. Tjonneland, A. Olsen, M.-C. Boutron-Ruault, G. Severi, A. Fournier, H. Boeing, A. Trichopoulou, V. Benetou, P. Orfanos, G. Masala, V. Pala, R. Tumino, F. Fasanelli, S. Panico, H.B. Bueno de Mesquita, P.H. Peeters, C.H. van Gills, J. Ramón Quirós, A. Agudo, M.-J. Sánchez, M.-D. Chirlaque, A. Barricarte, P. Amiano, K.-T. Khaw, R.C. Travis, L. Dossus, K. Li, M.A. Merritt, I. Tzoulaki, E. Riboli, R. Kaaks

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): K. Overvad, A. Fournier, R. Tumino, J. Ramón Quirós, K.-T. Khaw

Study supervision: R. Tumino, H.B. Bueno de Mesquita, P.H. Peeters, A. Barricarte

For information on how to submit an application for gaining access to EPIC data and/or biospecimens, please follow the instructions at http://epic.iarc.fr/access/index.php.

R.T. Fortner was supported by a Marie Curie International Incoming Fellowship of the European Commission's Seventh Framework Programme (MC-IIF-623984).

The coordination of EPIC is financially supported by the European Commission (DG-SANCO) and the International Agency for Research on Cancer. The national cohorts are supported by Danish Cancer Society (Denmark); Ligue Contre le Cancer, Institut Gustave Roussy, Mutuelle Générale de l'Education Nationale, Institut National de la Santé et de la Recherche Médicale (INSERM; France), German Cancer Aid, German Cancer Research Center (DKFZ), Federal Ministry of Education and Research (BMBF), Deutsche Krebshilfe, Deutsches Krebsforschungszentrum and Federal Ministry of Education and Research (Germany), the Hellenic Health Foundation (Greece), Associazione Italiana per la Ricerca sul Cancro-AIRC-Italy and National Research Council (Italy), Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF), Statistics Netherlands (The Netherlands), ERC-2009-AdG 232997 and Nordforsk, Nordic Centre of Excellence programme on Food, Nutrition and Health (Norway), Health Research Fund (FIS; PI13/00061 to Granada; PI13/01162 to EPIC-Murcia), Regional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, ISCIII RETIC (RD06/0020; Spain); Swedish Cancer Society, Swedish Research Council and County Councils of Skåne and Västerbotten (Sweden), Cancer Research UK (14136 to EPIC-Norfolk; C570/A16491 and C8221/A19170 to EPIC-Oxford), Medical Research Council (1000143 to EPIC-Norfolk, MR/M012190/1 to EPIC-Oxford; United Kingdom).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Gail
MH
,
Brinton
LA
,
Byar
DP
,
Corle
DK
,
Green
SB
,
Schairer
C
, et al
Projecting individualized probabilities of developing breast cancer for white females who are being examined annually
.
J Natl Cancer Inst
1989
;
81
:
1879
86
.
2.
Pfeiffer
RM
,
Park
Y
,
Kreimer
AR
,
Lacey
JV
 Jr.
,
Pee
D
,
Greenlee
RT
, et al
Risk prediction for breast, endometrial, and ovarian cancer in white women aged 50 y or older: derivation and validation from population-based cohort studies
.
PLoS Med
2013
;
10
:
e1001492
.
3.
Husing
A
,
Canzian
F
,
Beckmann
L
,
Garcia-Closas
M
,
Diver
WR
,
Thun
MJ
, et al
Prediction of breast cancer risk by genetic risk factors, overall and by hormone receptor status
.
J Med Genet
2012
;
49
:
601
8
.
4.
Evans
DG
,
Astley
S
,
Stavrinos
P
,
Harkness
E
,
Donnelly
LS
,
Dawe
S
, et al
Improvement in risk prediction, early detection and prevention of breast cancer in the NHS Breast Screening Programme and family history clinics: a dual cohort study
.
Southampton, UK
:
NIHR Journals Library
; 
2016
.
5.
Lee
CP
,
Choi
H
,
Soo
KC
,
Tan
MH
,
Chay
WY
,
Chia
KS
, et al
Mammographic breast density and common genetic variants in breast cancer risk prediction
.
PLoS One
2015
;
10
:
e0136650
.
6.
Vachon
CM
,
Pankratz
VS
,
Scott
CG
,
Haeberle
L
,
Ziv
E
,
Jensen
MR
, et al
The contributions of breast density and common genetic variation to breast cancer risk
.
J Natl Cancer Inst
2015
;
107
:
pii:dju397
7.
Shieh
Y
,
Hu
D
,
Ma
L
,
Huntsman
S
,
Gard
CC
,
Leung
JW
, et al
Breast cancer risk prediction using a clinical risk model and polygenic risk score
.
Breast Cancer Res Treat
2016
;
159
:
513
25
.
8.
Tice
JA
,
Cummings
SR
,
Smith-Bindman
R
,
Ichikawa
L
,
Barlow
WE
,
Kerlikowske
K
. 
Using clinical factors and mammographic breast density to estimate breast cancer risk: development and validation of a new predictive model
.
Ann Intern Med
2008
;
148
:
337
47
.
9.
Kaaks
R
,
Tikk
K
,
Sookthai
D
,
Schock
H
,
Johnson
T
,
Tjonneland
A
, et al
Premenopausal serum sex hormone levels in relation to breast cancer risk, overall and by hormone receptor status - results from the EPIC cohort
.
Int J Cancer
2014
;
134
:
1947
57
.
10.
Tikk
K
,
Sookthai
D
,
Johnson
T
,
Rinaldi
S
,
Romieu
I
,
Tjonneland
A
, et al
Circulating prolactin and breast cancer risk among pre- and postmenopausal women in the EPIC cohort
.
Ann Oncol
2014
;
25
:
1422
8
.
11.
Endogenous Hormones and Breast Cancer Collaborative Group
,
Key
TJ
,
Appleby
PN
,
Reeves
GK
,
Travis
RC
, et al
Sex hormones and risk of breast cancer in premenopausal women: a collaborative reanalysis of individual participant data from seven prospective studies
.
Lancet Oncol
2013
;
14
:
1009
19
.
12.
Tworoger
SS
,
Eliassen
AH
,
Zhang
X
,
Qian
J
,
Sluss
PM
,
Rosner
BA
, et al
A 20-year prospective study of plasma prolactin as a risk marker of breast cancer development
.
Cancer Res
2013
;
73
:
4810
9
.
13.
Endogenous Hormones and Breast Cancer Collaborative Group
,
Key
TJ
,
Appleby
PN
,
Reeves
GK
,
Roddam
AW
. 
Insulin-like growth factor 1 (IGF1), IGF binding protein 3 (IGFBP3), and breast cancer risk: pooled individual data analysis of 17 prospective studies
.
Lancet Oncol
2010
;
11
:
530
42
.
14.
Zhang
X
,
Tworoger
SS
,
Eliassen
AH
,
Hankinson
SE
. 
Postmenopausal plasma sex hormone levels and breast cancer risk over 20 years of follow-up
.
Breast Cancer Res Treat
2013
;
137
:
883
92
.
15.
Tworoger
SS
,
Zhang
X
,
Eliassen
AH
,
Qian
J
,
Colditz
GA
,
Willett
WC
, et al
Inclusion of endogenous hormone levels in risk prediction models of postmenopausal breast cancer
.
J Clin Oncol
2014
;
32
:
3111
7
.
16.
Riboli
E
. 
Nutrition and cancer: background and rationale of the European prospective investigation into cancer and nutrition (EPIC)
.
Ann Oncol
1992
;
3
:
783
91
.
17.
Riboli
E
,
Hunt
KJ
,
Slimani
N
,
Ferrari
P
,
Norat
T
,
Fahey
M
, et al
European Prospective Investigation into Cancer and Nutrition (EPIC): study populations and data collection
.
Public Health Nutr
2002
;
5
:
1113
24
.
18.
James
RE
,
Lukanova
A
,
Dossus
L
,
Becker
S
,
Rinaldi
S
,
Tjonneland
A
, et al
Postmenopausal serum sex steroids and risk of hormone receptor-positive and -negative breast cancer: a nested case-control study
.
Cancer Prev Res
2011
;
4
:
1626
35
.
19.
Kaaks
R
,
Johnson
T
,
Tikk
K
,
Sookthai
D
,
Tjonneland
A
,
Roswall
N
, et al
Insulin-like growth factor I and risk of breast cancer by age and hormone receptor status-A prospective study within the EPIC cohort
.
Int J Cancer
2014
;
134
:
2683
90
.
20.
Kaaks
R
,
Berrino
F
,
Key
T
,
Rinaldi
S
,
Dossus
L
,
Biessy
C
, et al
Serum sex steroids in premenopausal women and breast cancer risk within the European Prospective Investigation into Cancer and Nutrition (EPIC)
.
J Natl Cancer Inst
2005
;
97
:
755
65
.
21.
Kaaks
R
,
Rinaldi
S
,
Key
TJ
,
Berrino
F
,
Peeters
PH
,
Biessy
C
, et al
Postmenopausal serum androgens, oestrogens and breast cancer risk: the European prospective investigation into cancer and nutrition
.
Endocr Relat Cancer
2005
;
12
:
1071
82
.
22.
Steyerberg
EW
.
Clinical prediction models: a practical approach to development, validation, and updating
.
New York, NY
:
Springer
; 
2009
23.
Rinaldi
S
,
Key
TJ
,
Peeters
PH
,
Lahmann
PH
,
Lukanova
A
,
Dossus
L
, et al
Anthropometric measures, endogenous sex steroids and breast cancer risk in postmenopausal women: a study within the EPIC cohort
.
Int J Cancer
2006
;
118
:
2832
9
.
24.
Vaidya
D
,
Dobs
A
,
Gapstur
SM
,
Golden
SH
,
Cushman
M
,
Liu
K
, et al
Association of baseline sex hormone levels with baseline and longitudinal changes in waist-to-hip ratio: multi-ethnic study of atherosclerosis
.
Int J Obesity
2012
;
36
:
1578
84
.
25.
Baglietto
L
,
English
DR
,
Hopper
JL
,
MacInnis
RJ
,
Morris
HA
,
Tilley
WD
, et al
Circulating steroid hormone concentrations in postmenopausal women in relation to body size and composition
.
Breast Cancer Res Treat
2009
;
115
:
171
9
.
26.
Danforth
KN
,
Eliassen
AH
,
Tworoger
SS
,
Missmer
SA
,
Barbieri
RL
,
Rosner
BA
, et al
The association of plasma androgen levels with breast, ovarian and endometrial cancer risk factors among postmenopausal women
.
Int J Cancer
2010
;
126
:
199
207
.
27.
Key
TJ
,
Appleby
PN
,
Reeves
GK
,
Roddam
A
,
Dorgan
JF
,
Longcope
C
, et al
Body mass index, serum sex hormones, and breast cancer risk in postmenopausal women
.
J Natl Cancer Inst
2003
;
95
:
1218
26
.
28.
Moyer
VA
;
U.S. Preventive Services Task Force
. 
Medications to decrease the risk for breast cancer in women: recommendations from the U.S. Preventive Services Task Force recommendation statement
.
Ann Intern Med
2013
;
159
:
698
708
.
29.
Joy
JE
,
Penhoet
EE
,
Petitti
DB
,
editors
.
Saving Women's Lives: strategies for improving breast cancer detection and diagnosis
.
Washington, DC
:
National Academies Press
; 
2005
.
30.
Miller
AB
,
Wall
C
,
Baines
CJ
,
Sun
P
,
To
T
,
Narod
SA
. 
Twenty five year follow-up for breast cancer incidence and mortality of the Canadian national breast screening study: randomised screening trial
.
BMJ
2014
;
348
:
g366
.
31.
Centers for Disease Control and Prevention
. 
United States Centers for Disease Control and Prevention Hormone Standardization Program
.
Atlanta, GA
:
Centers for Disease Control and Prevention
; 
2016
[cited 2016 Sep 29]
. Available from: https://www.cdc.gov/labstandards/hs.html.
32.
Hankinson
SE
,
Manson
JE
,
Spiegelman
D
,
Willett
WC
,
Longcope
C
,
Speizer
FE
. 
Reproducibility of plasma hormone levels in postmenopausal women over a 2–3-year period
.
Cancer Epidemiol Biomarkers Prev
1995
;
4
:
649
54
.
33.
Missmer
SA
,
Spiegelman
D
,
Bertone-Johnson
ER
,
Barbieri
RL
,
Pollak
MN
,
Hankinson
SE
. 
Reproducibility of plasma steroid hormones, prolactin, and insulin-like growth factor levels among premenopausal women over a 2- to 3-year period
.
Cancer Epidemiol Biomarkers Prev
2006
;
15
:
972
8
.
34.
Lukanova
A
,
Zeleniuch-Jacquotte
A
,
Lundin
E
,
Micheli
A
,
Arslan
AA
,
Rinaldi
S
, et al
Prediagnostic levels of C-peptide, IGF-I, IGFBP-1, -2 and -3 and risk of endometrial cancer
.
Int J Cancer
2004
;
108
:
262
8
.