Background: The largest molecular subtype of primary prostate cancer is defined by the TMPRSS2:ERG gene fusion. Few studies, however, have investigated etiologic differences by TMPRSS2:ERG status. Because the fusion is hormone-regulated and a man's hormonal milieu varies by height and obesity status, we hypothesized that both may be differentially associated with risk of TMPRSS2:ERG-defined disease.

Methods: Our study included 49,372 men from the prospective Health Professionals Follow-up Study. Participants reported height and weight at baseline in 1986 and updated weight biennially thereafter through 2009. Tumor ERG protein expression (a TMPRSS2:ERG marker) was immunohistochemically assessed. We used multivariable competing risks models to calculate HRs and 95% confidence intervals (CIs) for the risk of ERG-positive and ERG-negative prostate cancer.

Results: During 23 years of follow-up, we identified 5,847 incident prostate cancers, among which 913 were ERG-assayed. Taller height was associated with an increased risk of ERG-positive disease only [per 5 inches HR 1.24; 95% confidence interval (CI), 1.03–1.50; Pheterogeneity = 0.07]. Higher body mass index (BMI) at baseline (per 5 kg/m2 HR 0.75; 95% CI, 0.61–0.91; Pheterogeneity = 0.02) and updated BMI over time (per 5 kg/m2 HR 0.86; 95% CI, 0.74–1.00; Pheterogeneity = 0.07) were associated with a reduced risk of ERG-positive disease only.

Conclusions: Our results indicate that anthropometrics may be uniquely associated with TMPRSS2:ERG-positive prostate cancer; taller height may be associated with greater risk, whereas obesity may be associated with lower risk.

Impact: Our study provides strong rationale for further investigations of other prostate cancer risk factors that may be distinctly associated with subtypes. Cancer Epidemiol Biomarkers Prev; 27(2); 193–200. ©2017 AACR.

The largest molecular subtype of primary prostate cancer is defined by the presence of gene fusions involving the transcription factor ERG (1). When the oncogene ERG binds to its most common fusion partner TMPRSS2, a gene regulated by androgens, the oncogene becomes androgen regulated as well (2). The fusion is unlikely prognostic by itself (3), but compelling evidence suggests it may play a role in prostate cancer progression, modifying the effect of obesity (4) and signaling pathways (5). There also exists preliminary evidence of risk factors associated with TMPRSS2:ERG-defined disease (6–10), although few etiologic studies have been conducted. Given that the search for prostate cancer risk factors has yielded few consistent associations and that the disease is heterogeneous, it follows that the search for risk factors for molecular subtypes could prove more fruitful. The identification of such risk factors could be critical toward developing personalized prevention strategies.

Anthropometrics are among the risk factors that could be differentially associated with TMPRSS2:ERG-defined disease. Obesity is characterized by reduced testosterone signaling, and tall height has been associated with increased androgen levels (11, 12). Cell line experiments suggest that androgen exposure results in localization of TMPRSS2 and ERG, thereby favoring fusion formation (13–17). Insulin-like growth factor (IGF) signaling influences height (18), appears enhanced in fusion-positive disease, and may promote fusion formation (4, 19). One case-control study evaluated obesity with respect to TMPRSS2:ERG-defined disease (9). The investigators found that obesity is associated with a lower risk of fusion-positive prostate cancer, and they did not examine height (9).

We assessed the role of anthropometrics in the development of TMPRSS2:ERG-defined prostate cancer in the Health Professionals Follow-up Study (HPFS). We leveraged long-term anthropometric data to evaluate the hypotheses that height is positively associated with the risk of TMPRSS2:ERG-positive prostate cancer, and that body mass index (BMI) and waist circumference are inversely associated with risk.

Study population

The HPFS is an ongoing cohort of 51,529 male health professionals age 40 to 75 at enrollment in 1986. Participants responded to a baseline questionnaire concerning lifestyle and medical history, and have since completed follow-up questionnaires biennially. Follow-up for the surveys has been >90%. For this study, we excluded men with cancers other than nonmelanoma skin cancer at baseline (n = 2,088) and who were missing baseline values for height or weight (n = 35) or date of birth (n = 34). The remaining 49,372 men free from prostate cancer at study initiation comprised our study population.

The Institutional Review Board at the Harvard T.H. Chan School of Public Health approved this study. Written informed consent was obtained from each subject.

Assessment of anthropometric measures

Participants reported height, weight at age 21, and weight at baseline, and weight biennially thereafter. Waist circumference was assessed in 1987. Self-reported weight and waist circumference were previously validated against technician-measured values with Pearson correlations of 0.97 and 0.95, respectively (20). Self-reported weight at age 21 has not been validated in the HPFS, but recalled weight during early adulthood in men has been shown to be accurate in other studies (21–23). Baseline BMI was calculated as reported weight in 1986 divided by the square of height reported in 1986 (kg/m2). Updated BMI was calculated as weight reported on each follow-up questionnaire after 1986, divided by the square of height reported in 1986 (kg/m2; i.e., it was updated every two years in analyses). If data were missing after baseline for analyses of updated BMI over the course of follow-up, then data from previous questionnaire cycles were used. All three BMI exposures were significantly correlated (P < 0.001).

Ascertainment of prostate cancer clinical data

Prostate cancer cases were initially identified by self-report or participants' next-of-kin, and confirmed by medical record and pathology report. Given the accuracy of reporting among men with available medical records, these analyses included the 10% of cases indicated only by self-report or death certificates. Deaths were ascertained via reports from family members and inspection of the National Death Index. Follow-up for mortality was over 98% complete.

Study investigators reviewed records to abstract information about clinical stage, Gleason score, and PSA levels at diagnosis. To reduce detection bias, we censored men diagnosed with stage T1a cancers (n = 268; ref. 24). In total, 5,847 prostate cancer cases were diagnosed during the study period.

Tumor tissue cohort and immunohistochemistry

We retrieved archival formalin-fixed paraffin-embedded prostate tumor tissue from men who underwent radical prostatectomy (RP; 95%) or transurethral resection of the prostate (TURP; 5%). Tissue was unavailable from some hospitals and for men who were not treated with surgery. For this study, ERG data were available for 913 of the RP cases (and none of the TURP cases). Hematoxylin and eosin slides were reviewed by study pathologists to confirm cancer and identify tumor areas for tissue microarray (TMA) construction. We constructed TMAs by sampling at least three 0.6 mm cores of tumor per case from the dominant nodule or nodule with highest Gleason (25). We used immunohistochemistry of ERG protein expression to characterize TMPRSS2:ERG status (Supplementary Fig. S1), which has high concordance with alternative methods (26, 27). Details of the assessment have been described previously (3). Briefly, ERG antisera (1:100, Clone ID: EPR3864; Epitomics, Inc.) were applied to 0.5-μm TMA sections and visualization of ERG was accomplished using the DAB Substrate Kit (Vector Laboratories Inc.). A case was scored ERG-positive if at least one TMA core had positive ERG staining within prostate cancer epithelial cells. Of cases positive for ERG in at least one core, 85% stained positive for ERG in all cores.

Statistical analysis

We considered height, age 21 BMI, baseline BMI, updated BMI over the course of follow-up, and waist circumference in 1987 as both continuous (in 5-unit increments) and categorical exposures. We reduced outlier influence on continuous exposures by employing a 99% Winsorization technique (28).

Person-time was calculated from return of the baseline questionnaire until prostate cancer diagnosis, death from any cause, or end of follow-up. Tumor tissue was characterized for ERG in cases diagnosed through February 2009; we thus ended follow-up at that time. For models investigating waist circumference, person-time accrual began in 1988. We used Cox proportional hazards models adjusted for age and calendar time to assess associations between anthropometric exposures and the risk of prostate cancer diagnosis. Because height and age 21 BMI cannot be affected by factors measured later in adulthood, their multivariable models only adjusted for age, calendar time, race, and family history of prostate cancer in a father or brother. Models for height were also adjusted for age 21 BMI and vice versa. Multivariable models of the remaining exposures were adjusted for the covariates in the table footnotes. For models of updated BMI, covariates besides height, age 21 BMI, race, and family history of prostate cancer were updated in each questionnaire cycle. For analyses of categorical exposures, we conducted linear trend tests across categories by modeling their median values continuously.

Next, we implemented an extension of Cox modeling as described by Lunn and McNeil that allows for exposure associations to vary by disease subtype (29, 30). The details of this competing risks method have been described (7). These models allowed for estimating hazard ratios separately for the risk of diagnosis with ERG-positive cancer and ERG-negative cancer versus no cancer. We tested heterogeneity across hazard ratios using likelihood ratio tests (31).

For overall and subtype-specific models of prostate cancer, we performed secondary analyses that applied inverse probability weights accounting for the unique clinical characteristics of cases assayed for ERG status. The methods to create these weights have been described (7). We explored confounding by PSA screening by stratifying by time period, examining associations separately for the pre-PSA (1986–1993) and PSA era (1994–2009).

Analyses were conducted using SAS version 9.4 (SAS Institute, Inc.). Tests were two-sided with P < 0.05 considered statistically significant.

Table 1 displays age-standardized characteristics of the study population by extreme categories of exposures at baseline (or in 1987 for waist circumference). Younger men tended to be taller, to have higher age 21 BMI, and a smaller waist circumference. Men in the lowest categories of BMI and waist circumference were most likely to be physically active, never have smoked, and use multivitamins.

Table 1.

Age-adjusted characteristics of the HPFS at baseline in 1986 (unless otherwise noted) according to extreme categories of anthropometric exposures

HeightBMI at age 21BMIWaist circumference, 1987a
Characteristics≤68 in>72 in<21 kg/m2≥25 kg/m2<23 kg/m2≥27.5 kg/m2≤35 inches≥40 inches
Number 13,196 8,449 10,804 10,703 8,752 10,190 8,895 7,424 
Mean height, in (SD) 66.9 (1.4) 74.0 (1.1) 70.3 (2.5) 70.1 (2.7) 70.2 (2.6) 70.1 (2.8) 69.2 (2.5) 71.0 (2.6) 
Mean BMI at age 21, kg/m2 (SD) 23.1 (2.8) 23.0 (2.9) 19.6 (1.0) 26.9 (2.0) 21.0 (2.0) 25.3 (3.1) 21.9 (2.2) 24.3 (3.2) 
Mean BMI, kg/m2 (SD) 25.7 (3.3) 25.5 (3.2) 23.6 (2.5) 28.2 (3.5) 21.7 (1.0) 30.1 (2.6) 22.9 (1.8) 28.9 (3.1) 
Mean waist circumference, 1987, in (SD) 36.3 (3.4) 38.8 (3.6) 36.1 (3.2) 39.6 (3.9) 34.1 (2.2) 41.8 (3.3) 33.5 (1.3) 42.4 (2.4) 
Mean age, years (SD)b 56.5 (10.2) 52.2 (9.0) 56.7 (10.0) 52.9 (9.2) 54.0 (10.3) 54.7 (9.2) 54.0 (9.7) 58.4 (9.5) 
Caucasian 11,509 (91.6%) 7,800 (97.0%) 9,699 (93.9%) 9,798 (96.6%) 7,825 (94.1%) 9,292 (96.0%) 7,976 (93.9%) 6,877 (97.1%) 
Family history of prostate cancer 1,508 (11.4%) 1,042 (12.2%) 1,289 (12.0%) 1,193 (11.1%) 1,034 (11.9%) 1,169 (11.4%) 1,183 (13.4%) 1,004 (13.4%) 
Diabetes 509 (3.4%) 240 (3.3%) 370 (2.9%) 431 (4.4%) 237 (2.7%) 475 (4.6%) 209 (2.8%) 453 (5.5%) 
Top quintile of physical activity (≥28.5 METS/week) 1,853 (15.1%) 1,363 (15.0%) 1,213 (12.2%) 1,719 (15.2%) 1,878 (21.3%) 815 (8.1%) 2,277 (24.1%) 604 (8.6%) 
Smoking status 
 Never 5,972 (48.3%) 3,807 (45.2%) 4,622 (46.0%) 4,697 (45.1%) 4,407 (51.7%) 4,146 (42.7%) 4,631 (52.1%) 2,879 (41.5%) 
 Past, quit >10 years before baseline 3,995 (29.9%) 2,309 (30.7%) 3,440 (31.0%) 2,991 (30.4%) 2,317 (28.0%) 3,077 (31.2%) 2,394 (29.9%) 2,382 (31.3%) 
 Past, quit ≤10 years before baseline 1,515 (12.2%) 1,156 (13.9%) 1,322 (12.8%) 1,488 (14.1%) 790 (9.3%) 1,574 (16.0%) 846 (9.6%) 1,218 (17.3%) 
 Current 1,188 (9.6%) 839 (10.2%) 1,049 (10.2%) 1,083 (10.5%) 915 (11.0%) 989 (10.1%) 723 (8.5%) 685 (9.9%) 
Multivitamin use 5,521 (42.2%) 3,508 (43.0%) 4,629 (42.8%) 4,263 (40.9%) 4,071 (47.6%) 4,688 (36.8%) 4,093 (47.3%) 2,933 (39.6%) 
Had screening PSA test, 1994c 4,604 (36.7%) 2,864 (38.5%) 3,795 (36.8%) 3,589 (37.5%) 3,031 (38.9%) 3,343 (35.1%) 3,264 (41.9%) 2,897 (40.8%) 
Had screening PSA test, 2004c 5,402 (58.3%) 4,070 (62.3%) 4,406 (59.2%) 4,896 (60.9%) 4,039 (62.0%) 4,181 (57.8%) 4,803 (67.6%) 3,269 (66.1%) 
Mean nutrient and food intakes 
 Total calories, kcal/day (SD) 1,922 (603) 2,061 (630) 1,999 (610) 1,964 (622) 2,005 (600) 1,997 (637) 2,003 (597) 2,022 (631) 
 Lycopene, μg/day (SD) 7,505 (4,956) 7,068 (4,610) 7,068 (4,767) 7,617 (4,899) 6,939 (4,659) 7,563 (4,936) 6,973 (4,602) 7,307 (4,826) 
 Calcium, mg/day (SD) 875 (398) 918 (413) 861 (391) 907 (409) 911 (413) 883 (398) 925 (411) 893 (406) 
 Alpha-linolenic acid, g/day (SD) 1.1 (0.3) 1.1 (0.3) 1.1 (0.3) 1.1 (0.3) 1.0 (0.3) 1.1 (0.3) 1.0 (0.3) 1.1 (0.3) 
 Supplemental vitamin E, mg/day (SD) 37.4 (83.9) 39.0 (84.6) 35.6 (80.4) 38.4 (86.4) 44.1 (89.3) 33.6 (81.9) 43.4 (88.0) 33.6 (80.6) 
 Alcohol, g/day (SD) 10.1 (14.1) 12.0 (15.4) 11.4 (15.2) 10.5 (14.4) 10.6 (14.1) 10.8 (15.2) 10.8 (14.0) 11.6 (15.9) 
 Coffee, cups/day (SD) 1.9 (1.7) 1.9 (1.8) 1.7 (1.7) 2.1 (1.8) 1.7 (1.7) 2.1 (1.8) 1.7 (1.7) 2.1 (1.8) 
HeightBMI at age 21BMIWaist circumference, 1987a
Characteristics≤68 in>72 in<21 kg/m2≥25 kg/m2<23 kg/m2≥27.5 kg/m2≤35 inches≥40 inches
Number 13,196 8,449 10,804 10,703 8,752 10,190 8,895 7,424 
Mean height, in (SD) 66.9 (1.4) 74.0 (1.1) 70.3 (2.5) 70.1 (2.7) 70.2 (2.6) 70.1 (2.8) 69.2 (2.5) 71.0 (2.6) 
Mean BMI at age 21, kg/m2 (SD) 23.1 (2.8) 23.0 (2.9) 19.6 (1.0) 26.9 (2.0) 21.0 (2.0) 25.3 (3.1) 21.9 (2.2) 24.3 (3.2) 
Mean BMI, kg/m2 (SD) 25.7 (3.3) 25.5 (3.2) 23.6 (2.5) 28.2 (3.5) 21.7 (1.0) 30.1 (2.6) 22.9 (1.8) 28.9 (3.1) 
Mean waist circumference, 1987, in (SD) 36.3 (3.4) 38.8 (3.6) 36.1 (3.2) 39.6 (3.9) 34.1 (2.2) 41.8 (3.3) 33.5 (1.3) 42.4 (2.4) 
Mean age, years (SD)b 56.5 (10.2) 52.2 (9.0) 56.7 (10.0) 52.9 (9.2) 54.0 (10.3) 54.7 (9.2) 54.0 (9.7) 58.4 (9.5) 
Caucasian 11,509 (91.6%) 7,800 (97.0%) 9,699 (93.9%) 9,798 (96.6%) 7,825 (94.1%) 9,292 (96.0%) 7,976 (93.9%) 6,877 (97.1%) 
Family history of prostate cancer 1,508 (11.4%) 1,042 (12.2%) 1,289 (12.0%) 1,193 (11.1%) 1,034 (11.9%) 1,169 (11.4%) 1,183 (13.4%) 1,004 (13.4%) 
Diabetes 509 (3.4%) 240 (3.3%) 370 (2.9%) 431 (4.4%) 237 (2.7%) 475 (4.6%) 209 (2.8%) 453 (5.5%) 
Top quintile of physical activity (≥28.5 METS/week) 1,853 (15.1%) 1,363 (15.0%) 1,213 (12.2%) 1,719 (15.2%) 1,878 (21.3%) 815 (8.1%) 2,277 (24.1%) 604 (8.6%) 
Smoking status 
 Never 5,972 (48.3%) 3,807 (45.2%) 4,622 (46.0%) 4,697 (45.1%) 4,407 (51.7%) 4,146 (42.7%) 4,631 (52.1%) 2,879 (41.5%) 
 Past, quit >10 years before baseline 3,995 (29.9%) 2,309 (30.7%) 3,440 (31.0%) 2,991 (30.4%) 2,317 (28.0%) 3,077 (31.2%) 2,394 (29.9%) 2,382 (31.3%) 
 Past, quit ≤10 years before baseline 1,515 (12.2%) 1,156 (13.9%) 1,322 (12.8%) 1,488 (14.1%) 790 (9.3%) 1,574 (16.0%) 846 (9.6%) 1,218 (17.3%) 
 Current 1,188 (9.6%) 839 (10.2%) 1,049 (10.2%) 1,083 (10.5%) 915 (11.0%) 989 (10.1%) 723 (8.5%) 685 (9.9%) 
Multivitamin use 5,521 (42.2%) 3,508 (43.0%) 4,629 (42.8%) 4,263 (40.9%) 4,071 (47.6%) 4,688 (36.8%) 4,093 (47.3%) 2,933 (39.6%) 
Had screening PSA test, 1994c 4,604 (36.7%) 2,864 (38.5%) 3,795 (36.8%) 3,589 (37.5%) 3,031 (38.9%) 3,343 (35.1%) 3,264 (41.9%) 2,897 (40.8%) 
Had screening PSA test, 2004c 5,402 (58.3%) 4,070 (62.3%) 4,406 (59.2%) 4,896 (60.9%) 4,039 (62.0%) 4,181 (57.8%) 4,803 (67.6%) 3,269 (66.1%) 
Mean nutrient and food intakes 
 Total calories, kcal/day (SD) 1,922 (603) 2,061 (630) 1,999 (610) 1,964 (622) 2,005 (600) 1,997 (637) 2,003 (597) 2,022 (631) 
 Lycopene, μg/day (SD) 7,505 (4,956) 7,068 (4,610) 7,068 (4,767) 7,617 (4,899) 6,939 (4,659) 7,563 (4,936) 6,973 (4,602) 7,307 (4,826) 
 Calcium, mg/day (SD) 875 (398) 918 (413) 861 (391) 907 (409) 911 (413) 883 (398) 925 (411) 893 (406) 
 Alpha-linolenic acid, g/day (SD) 1.1 (0.3) 1.1 (0.3) 1.1 (0.3) 1.1 (0.3) 1.0 (0.3) 1.1 (0.3) 1.0 (0.3) 1.1 (0.3) 
 Supplemental vitamin E, mg/day (SD) 37.4 (83.9) 39.0 (84.6) 35.6 (80.4) 38.4 (86.4) 44.1 (89.3) 33.6 (81.9) 43.4 (88.0) 33.6 (80.6) 
 Alcohol, g/day (SD) 10.1 (14.1) 12.0 (15.4) 11.4 (15.2) 10.5 (14.4) 10.6 (14.1) 10.8 (15.2) 10.8 (14.0) 11.6 (15.9) 
 Coffee, cups/day (SD) 1.9 (1.7) 1.9 (1.8) 1.7 (1.7) 2.1 (1.8) 1.7 (1.7) 2.1 (1.8) 1.7 (1.7) 2.1 (1.8) 

NOTE: Numbers may not add up to total sample sizes for characteristics with missing data; percentages may not add up as expected due to rounding.

Abbreviations: in, inches; METS, metabolic equivalents.

aWaist circumference reported in 1987; all other characteristics (with the exception of PSA screening history) reported for 1988 rather than 1986.

bNot adjusted for age.

cReported having a PSA test in the two years prior to the questionnaire date.

During 23 years of follow-up, we identified 5,847 prostate cancer cases out of our cohort of 49,372 men (Table 2). Among the 2,402 men who were treated with RP, 913 (15.6% of all cases) were ERG-assayed and 439 (48.1% of assayed cases) were ERG-positive. Men treated with RP without tissue available were more likely diagnosed in later years. Lifestyle and demographic factors were otherwise similar for surgically treated men with and without available tissue. Relative to surgically treated men, cases treated nonsurgically were more likely older at diagnosis and to be diagnosed with higher grade, stage, and PSA levels. They also had a higher prevalence of diabetes and were less likely to have data regarding their clinical characteristics.

Table 2.

Characteristics of participants with prostate cancer in the HPFS at the time of diagnosis (unless otherwise noted), by treatment and ERG status

Participants treated with RPParticipants treated otherwise
CharacteristicERG-positiveERG-negativeERG status unavailableERG status unavailable
Number 439 474 1,489 3,445 
Year of diagnosis 
 1986–1990 35 (8.0%) 28 (5.9%) 122 (8.2%) 295 (8.6%) 
 1991–1995 157 (35.8%) 151 (31.9%) 396 (26.6%) 781 (22.7%) 
 1996–2000 134 (30.5%) 151 (31.9%) 356 (23.9%) 896 (26.0%) 
 2000–2005 74 (16.9%) 99 (20.9%) 342 (23.0%) 932 (27.1%) 
 2006–2009 39 (8.9%) 45 (9.5%) 273 (18.3%) 541 (15.7%) 
Mean age, years (SD) 65.1 (6.0) 65.8 (5.7) 65.9 (5.9) 72.9 (6.9) 
Mean PSA level, ng/mL (SD)a 9.7 (11.5) 10.2 (12.4) 11.5 (109) 20.7 (151) 
 % Missing 32 (7.3%) 41 (8.6%) 141 (9.5%) 913 (26.5%) 
Biopsy Gleason score 
 2–6a 265 (66.9%) 265 (63.1%) 821 (66.3%) 1,482 (56.9%) 
 7a 107 (27.0%) 112 (26.7%) 338 (27.3%) 731 (28.1%) 
 8–10a 24 (6.1%) 43 (10.2%) 80 (6.5%) 392 (15.1%) 
 % Missing 43 (9.8%) 54 (11.4%) 250 (16.8%) 840 (24.4%) 
Clinical stage 
 T1/T2a 416 (94.8%) 453 (95.6%) 1,392 (95.5%) 2,473 (87.5%) 
 T3a 19 (4.3%) 14 (3.0%) 41 (2.8%) 112 (4.0%) 
 T4/N1/M1a 4 (0.9%) 7 (1.5%) 24 (1.7%) 243 (8.6%) 
 % Missing 0 (0.0%) 0 (0.0%) 32 (2.2%) 617 (17.9%) 
Mean height, inches (SD) 70.5 (2.6) 70.2 (2.7) 70.2 (2.5) 69.9 (2.6) 
Mean BMI at Age 21, kg/m2 in (SD) 22.8 (2.6) 23.0 (2.6) 22.8 (2.6) 22.7 (2.7) 
Mean BMI, kg/m2 (SD) 25.7 (3.1) 26.1 (3.2) 25.7 (3.1) 25.9 (3.5) 
Mean waist circumference in 1987, inches (SD) 36.9 (3.1) 37.5 (3.3) 36.9 (3.4) 37.7 (3.4) 
Caucasian 409 (97.6%) 433 (96.2%) 1,381 (97.1%) 3,137 (95.4%) 
Family history of prostate cancer 94 (21.4%) 116 (24.5%) 346 (23.2%) 625 (18.1%) 
Diabetes 21 (4.8%) 21 (4.4%) 70 (4.7%) 315 (9.1%) 
Top quintile of physical activity (≥28.5 METS/week) 59 (13.4%) 68 (14.4%) 260 (17.5%) 440 (12.8%) 
Smoking statusa 
 Never 215 (50.8%) 231 (50.9%) 687 (47.7%) 1,474 (44.1%) 
 Past, quit >10 years before diagnosis 135 (31.9%) 154 (33.9%) 531 (36.9%) 1,296 (38.8%) 
 Past, quit ≤10 years before diagnosis 48 (11.4%) 41 (9.0%) 159 (11.0%) 395 (11.8%) 
 Current 25 (5.9%) 28 (6.2%) 63 (4.4%) 175 (5.2%) 
Multivitamin use 227 (51.7%) 253 (53.6%) 829 (55.9%) 2,003 (58.4%) 
Had screening PSA test 203 (46.2%) 231 (48.7%) 751 (50.4%) 1,742 (50.6%) 
Mean nutrient and food intakes 
 Total calories, kcal/day (SD) 1,966 (528) 1,976 (522) 1,930 (535) 1,965 (545) 
 Lycopene intake, μg/day (SD) 6,898 (3,937) 7,091 (3,774) 7,328 (4,075) 7,116 (4,206) 
 Calcium, mg/day (SD) 945 (370) 922 (348) 945 (366) 967 (374) 
 Alpha-linolenic acid, g/day (SD) 1.1 (0.3) 1.1 (0.3) 1.1 (0.3) 1.1 (0.3) 
 Supplemental vitamin E, mg/day (SD) 51.1 (79.2) 55.4 (80.1) 59.7 (85.1) 64.4 (86.6) 
 Alcohol, g/day (SD) 12.0 (13.3) 12.3 (14.6) 10.5 (12.0) 12.0 (14.5) 
 Coffee, cups/day (SD) 1.9 (1.6) 1.9 (1.6) 1.9 (1.5) 1.8 (1.5) 
Participants treated with RPParticipants treated otherwise
CharacteristicERG-positiveERG-negativeERG status unavailableERG status unavailable
Number 439 474 1,489 3,445 
Year of diagnosis 
 1986–1990 35 (8.0%) 28 (5.9%) 122 (8.2%) 295 (8.6%) 
 1991–1995 157 (35.8%) 151 (31.9%) 396 (26.6%) 781 (22.7%) 
 1996–2000 134 (30.5%) 151 (31.9%) 356 (23.9%) 896 (26.0%) 
 2000–2005 74 (16.9%) 99 (20.9%) 342 (23.0%) 932 (27.1%) 
 2006–2009 39 (8.9%) 45 (9.5%) 273 (18.3%) 541 (15.7%) 
Mean age, years (SD) 65.1 (6.0) 65.8 (5.7) 65.9 (5.9) 72.9 (6.9) 
Mean PSA level, ng/mL (SD)a 9.7 (11.5) 10.2 (12.4) 11.5 (109) 20.7 (151) 
 % Missing 32 (7.3%) 41 (8.6%) 141 (9.5%) 913 (26.5%) 
Biopsy Gleason score 
 2–6a 265 (66.9%) 265 (63.1%) 821 (66.3%) 1,482 (56.9%) 
 7a 107 (27.0%) 112 (26.7%) 338 (27.3%) 731 (28.1%) 
 8–10a 24 (6.1%) 43 (10.2%) 80 (6.5%) 392 (15.1%) 
 % Missing 43 (9.8%) 54 (11.4%) 250 (16.8%) 840 (24.4%) 
Clinical stage 
 T1/T2a 416 (94.8%) 453 (95.6%) 1,392 (95.5%) 2,473 (87.5%) 
 T3a 19 (4.3%) 14 (3.0%) 41 (2.8%) 112 (4.0%) 
 T4/N1/M1a 4 (0.9%) 7 (1.5%) 24 (1.7%) 243 (8.6%) 
 % Missing 0 (0.0%) 0 (0.0%) 32 (2.2%) 617 (17.9%) 
Mean height, inches (SD) 70.5 (2.6) 70.2 (2.7) 70.2 (2.5) 69.9 (2.6) 
Mean BMI at Age 21, kg/m2 in (SD) 22.8 (2.6) 23.0 (2.6) 22.8 (2.6) 22.7 (2.7) 
Mean BMI, kg/m2 (SD) 25.7 (3.1) 26.1 (3.2) 25.7 (3.1) 25.9 (3.5) 
Mean waist circumference in 1987, inches (SD) 36.9 (3.1) 37.5 (3.3) 36.9 (3.4) 37.7 (3.4) 
Caucasian 409 (97.6%) 433 (96.2%) 1,381 (97.1%) 3,137 (95.4%) 
Family history of prostate cancer 94 (21.4%) 116 (24.5%) 346 (23.2%) 625 (18.1%) 
Diabetes 21 (4.8%) 21 (4.4%) 70 (4.7%) 315 (9.1%) 
Top quintile of physical activity (≥28.5 METS/week) 59 (13.4%) 68 (14.4%) 260 (17.5%) 440 (12.8%) 
Smoking statusa 
 Never 215 (50.8%) 231 (50.9%) 687 (47.7%) 1,474 (44.1%) 
 Past, quit >10 years before diagnosis 135 (31.9%) 154 (33.9%) 531 (36.9%) 1,296 (38.8%) 
 Past, quit ≤10 years before diagnosis 48 (11.4%) 41 (9.0%) 159 (11.0%) 395 (11.8%) 
 Current 25 (5.9%) 28 (6.2%) 63 (4.4%) 175 (5.2%) 
Multivitamin use 227 (51.7%) 253 (53.6%) 829 (55.9%) 2,003 (58.4%) 
Had screening PSA test 203 (46.2%) 231 (48.7%) 751 (50.4%) 1,742 (50.6%) 
Mean nutrient and food intakes 
 Total calories, kcal/day (SD) 1,966 (528) 1,976 (522) 1,930 (535) 1,965 (545) 
 Lycopene intake, μg/day (SD) 6,898 (3,937) 7,091 (3,774) 7,328 (4,075) 7,116 (4,206) 
 Calcium, mg/day (SD) 945 (370) 922 (348) 945 (366) 967 (374) 
 Alpha-linolenic acid, g/day (SD) 1.1 (0.3) 1.1 (0.3) 1.1 (0.3) 1.1 (0.3) 
 Supplemental vitamin E, mg/day (SD) 51.1 (79.2) 55.4 (80.1) 59.7 (85.1) 64.4 (86.6) 
 Alcohol, g/day (SD) 12.0 (13.3) 12.3 (14.6) 10.5 (12.0) 12.0 (14.5) 
 Coffee, cups/day (SD) 1.9 (1.6) 1.9 (1.6) 1.9 (1.5) 1.8 (1.5) 

NOTE: Numbers may not add up to total sample sizes for characteristics with missing data; percentages may not add up as expected due to rounding.

Abbreviation: METS, metabolic equivalents.

aAmong those with data available.

Table 3 presents multivariable results for anthropometric measures and risk of prostate cancer diagnosis overall and by ERG status; results from age- and calendar time-adjusted models were largely comparable (Supplementary Table S1). Increasing age 21 BMI was associated with a reduced risk of prostate cancer overall in both continuous (per 5 kg/m2 HR 0.92; 95% CI, 0.87–0.96) and categorical (≥25 kg/m2 vs. <21 kg/m2 HR 0.89; 95% CI, 0.82–0.97; Ptrend = 0.001) models. Baseline BMI also showed suggestive evidence of an inverse association with prostate cancer risk in continuous models only (per 5 kg/m2 HR 0.94; 95% CI, 0.89–1.00). Findings were largely similar when restricted to ERG-assayed cases. No other exposure was associated with prostate cancer overall.

Table 3.

Multivariable HR and 95% CI for association between anthropometric exposures and risk of prostate cancer, both overall and by ERG status, HPFS, 1986–2009

All casesRP cases assayed for ERG status only
#CasesOverall HR (95% CI)#ERG+ cases#ERG- casesOverall HR (95% CI)ERG-positive HR (95% CI)ERG-negative HR (95% CI)
Heighta        
Continuous, per 5 inches 5,847 1.03 (0.98–1.08) 439 474 1.10 (0.97–1.26) 1.24 (1.03–1.50) 0.98 (0.82–1.18) 
Pheterogeneity      0.07 
Categorical, inches 
 ≤68 1,594 1.00 (ref) 88 133 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 >68–70 1,780 1.06 (0.99–1.13) 138 128 1.05 (0.87–1.25) 1.39 (1.06–1.82) 0.82 (0.64–1.05) 
 >70–72 1,531 1.00 (0.93–1.08) 127 119 1.01 (0.84–1.21) 1.32 (1.00–1.75) 0.80 (0.62–1.02) 
 >72 942 1.05 (0.96–1.14) 86 94 1.19 (0.97–1.45) 1.42 (1.05–1.93) 1.03 (0.79–1.35) 
Ptrend  0.51   0.21 0.02 0.71 
Pheterogeneity      0.02 
BMI at age 21b        
Continuous, per 5 kg/m2 5,591 0.92 (0.87–0.96) 419 461 0.91 (0.81–1.03) 0.87 (0.73–1.03) 0.96 (0.81–1.12) 
Pheterogeneity      0.43 
Categorical, kg/m2 
 <21 1,427 1.00 (ref) 109 105 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 21–<23 1,663 0.99 (0.93–1.07) 125 118 0.88 (0.73–1.06) 0.89 (0.69–1.15) 0.87 (0.67–1.13) 
 23–<25 1,400 0.93 (0.86–1.00) 103 143 0.95 (0.79–1.15) 0.80 (0.61–1.05) 1.11 (0.86–1.43) 
 ≥25 1,101 0.89 (0.82–0.97) 82 95 0.80 (0.65–0.98) 0.73 (0.55–0.98) 0.87 (0.66–1.15) 
Ptrend  0.001   0.07 0.03 0.69 
Pheterogeneity      0.20 
BMI at baseline in 1986c        
Continuous, per 5 kg/m2 5,715 0.94 (0.89–1.00) 428 466 0.87 (0.76–1.00) 0.75 (0.61–0.91) 0.99 (0.84–1.18) 
Pheterogeneity      0.02 
Categorical, kg/m2 
 <23 1,038 1.00 (ref) 94 72 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 23–<25 1,698 1.04 (0.96–1.14) 140 133 1.04 (0.84–1.28) 0.96 (0.73–1.28) 1.13 (0.83–1.54) 
 25–<27.5 1,860 0.97 (0.89–1.05) 122 167 0.93 (0.75–1.16) 0.72 (0.53–0.97) 1.20 (0.89–1.62) 
 ≥27.5 1,119 0.97 (0.88–1.08) 72 94 0.88 (0.67–1.15) 0.71 (0.50-1.02) 1.09 (0.76-1.55) 
Ptrend  0.29   0.22 0.02 0.72 
Pheterogeneity      0.05 
Updated BMIc        
Continuous, per 5 kg/m2 5,838 0.99 (0.95–1.03) 439 474 0.95 (0.85–1.06) 0.86 (0.74–1.00) 1.03 (0.90–1.18) 
Pheterogeneity      0.07 
Categorical, kg/m2 
 <23 975 1.00 (ref) 70 57 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 23–<25 1,527 1.10 (1.01–1.20) 126 133 1.30 (1.04–1.63) 1.16 (0.86–1.57) 1.48 (1.07–2.04) 
 25–<27.5 1,799 1.04 (0.96–1.13) 136 144 1.10 (0.88–1.38) 1.02 (0.75–1.38) 1.20 (0.88–1.65) 
 ≥27.5 1,537 1.06 (0.97–1.16) 107 140 1.10 (0.87–1.41) 0.87 (0.63–1.21) 1.38 (0.99–1.92) 
Ptrend  0.68   0.79 0.14 0.31 
Pheterogeneity      0.19 
Waist circumference in 1987c,d        
Continuous, per 5 inches 4,058 0.98 (0.93–1.03) 333 347 0.96 (0.84–1.08) 0.85 (0.72–1.02) 1.06 (0.90–1.25) 
Pheterogeneity      0.07 
Categorical, inches 
 ≤35 1,079 1.00 (ref) 102 83 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 >35–37 1,080 1.02 (0.93–1.12) 96 92 1.13 (0.91–1.42) 1.12 (0.83–1.53) 1.15 (0.84–1.58) 
 >37–<40 982 0.94 (0.85–1.04) 77 92 1.12 (0.89–1.41) 1.00 (0.72–1.38) 1.25 (0.91–1.72) 
 ≥40 917 0.95 (0.85–1.05) 58 80 0.94 (0.73–1.22) 0.79 (0.55–1.13) 1.11 (0.78–1.57) 
Ptrend  0.13   0.52 0.12 0.59 
Pheterogeneity      0.44 
All casesRP cases assayed for ERG status only
#CasesOverall HR (95% CI)#ERG+ cases#ERG- casesOverall HR (95% CI)ERG-positive HR (95% CI)ERG-negative HR (95% CI)
Heighta        
Continuous, per 5 inches 5,847 1.03 (0.98–1.08) 439 474 1.10 (0.97–1.26) 1.24 (1.03–1.50) 0.98 (0.82–1.18) 
Pheterogeneity      0.07 
Categorical, inches 
 ≤68 1,594 1.00 (ref) 88 133 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 >68–70 1,780 1.06 (0.99–1.13) 138 128 1.05 (0.87–1.25) 1.39 (1.06–1.82) 0.82 (0.64–1.05) 
 >70–72 1,531 1.00 (0.93–1.08) 127 119 1.01 (0.84–1.21) 1.32 (1.00–1.75) 0.80 (0.62–1.02) 
 >72 942 1.05 (0.96–1.14) 86 94 1.19 (0.97–1.45) 1.42 (1.05–1.93) 1.03 (0.79–1.35) 
Ptrend  0.51   0.21 0.02 0.71 
Pheterogeneity      0.02 
BMI at age 21b        
Continuous, per 5 kg/m2 5,591 0.92 (0.87–0.96) 419 461 0.91 (0.81–1.03) 0.87 (0.73–1.03) 0.96 (0.81–1.12) 
Pheterogeneity      0.43 
Categorical, kg/m2 
 <21 1,427 1.00 (ref) 109 105 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 21–<23 1,663 0.99 (0.93–1.07) 125 118 0.88 (0.73–1.06) 0.89 (0.69–1.15) 0.87 (0.67–1.13) 
 23–<25 1,400 0.93 (0.86–1.00) 103 143 0.95 (0.79–1.15) 0.80 (0.61–1.05) 1.11 (0.86–1.43) 
 ≥25 1,101 0.89 (0.82–0.97) 82 95 0.80 (0.65–0.98) 0.73 (0.55–0.98) 0.87 (0.66–1.15) 
Ptrend  0.001   0.07 0.03 0.69 
Pheterogeneity      0.20 
BMI at baseline in 1986c        
Continuous, per 5 kg/m2 5,715 0.94 (0.89–1.00) 428 466 0.87 (0.76–1.00) 0.75 (0.61–0.91) 0.99 (0.84–1.18) 
Pheterogeneity      0.02 
Categorical, kg/m2 
 <23 1,038 1.00 (ref) 94 72 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 23–<25 1,698 1.04 (0.96–1.14) 140 133 1.04 (0.84–1.28) 0.96 (0.73–1.28) 1.13 (0.83–1.54) 
 25–<27.5 1,860 0.97 (0.89–1.05) 122 167 0.93 (0.75–1.16) 0.72 (0.53–0.97) 1.20 (0.89–1.62) 
 ≥27.5 1,119 0.97 (0.88–1.08) 72 94 0.88 (0.67–1.15) 0.71 (0.50-1.02) 1.09 (0.76-1.55) 
Ptrend  0.29   0.22 0.02 0.72 
Pheterogeneity      0.05 
Updated BMIc        
Continuous, per 5 kg/m2 5,838 0.99 (0.95–1.03) 439 474 0.95 (0.85–1.06) 0.86 (0.74–1.00) 1.03 (0.90–1.18) 
Pheterogeneity      0.07 
Categorical, kg/m2 
 <23 975 1.00 (ref) 70 57 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 23–<25 1,527 1.10 (1.01–1.20) 126 133 1.30 (1.04–1.63) 1.16 (0.86–1.57) 1.48 (1.07–2.04) 
 25–<27.5 1,799 1.04 (0.96–1.13) 136 144 1.10 (0.88–1.38) 1.02 (0.75–1.38) 1.20 (0.88–1.65) 
 ≥27.5 1,537 1.06 (0.97–1.16) 107 140 1.10 (0.87–1.41) 0.87 (0.63–1.21) 1.38 (0.99–1.92) 
Ptrend  0.68   0.79 0.14 0.31 
Pheterogeneity      0.19 
Waist circumference in 1987c,d        
Continuous, per 5 inches 4,058 0.98 (0.93–1.03) 333 347 0.96 (0.84–1.08) 0.85 (0.72–1.02) 1.06 (0.90–1.25) 
Pheterogeneity      0.07 
Categorical, inches 
 ≤35 1,079 1.00 (ref) 102 83 1.00 (ref) 1.00 (ref) 1.00 (ref) 
 >35–37 1,080 1.02 (0.93–1.12) 96 92 1.13 (0.91–1.42) 1.12 (0.83–1.53) 1.15 (0.84–1.58) 
 >37–<40 982 0.94 (0.85–1.04) 77 92 1.12 (0.89–1.41) 1.00 (0.72–1.38) 1.25 (0.91–1.72) 
 ≥40 917 0.95 (0.85–1.05) 58 80 0.94 (0.73–1.22) 0.79 (0.55–1.13) 1.11 (0.78–1.57) 
Ptrend  0.13   0.52 0.12 0.59 
Pheterogeneity      0.44 

aModels adjusted for age, calendar time, race (Caucasian, African American, Asian American, other), family history of prostate cancer in father or brother (yes, no), and BMI at age 21 (<21, 21–<23, 23–<25, 25+ kg/m2).

bModels adjusted for age, calendar time, race (Caucasian, African American, Asian American, other), family history of prostate cancer in father or brother (yes, no), and height (≤68, >68–70, >70–72, >72 inches).

cModels adjusted for age, calendar time, race (Caucasian, African American, Asian American, other), family history of prostate cancer in father or brother (yes, no), height (≤68, >68–70, >70–72, >72 inches), BMI at age 21 (<21, 21–<23, 23–<25, 25+ kg/m2), vigorous physical activity (quintiles of metabolic equivalents-hours/week), smoking (never, former/quit >10 years ago, former/quit ≤10 years ago, current), diabetes (yes, no), prostate-specific antigen testing (yes, no lagged by one period to avoid counting diagnostic PSA tests as screening; models of updated BMI only), use of multivitamins (yes, no), total calories (continuous), and intakes of calcium, alpha-linolenic acid, lycopene, supplemental vitamin E, alcohol (quintiles) and coffee (none, <1, 1–<2, 2–<3, 3+ cups/day.

dFollow-up starting in 1988.

Taller height was associated with an increased risk of diagnosis with ERG-positive prostate cancer (per 5 inches HR: 1.24; 95% CI, 1.03–1.50; Table 3), but not ERG-negative disease (Pheterogeneity = 0.07). Men with higher BMI at baseline (per 5 kg/m2 HR 0.75; 95% CI, 0.61–0.91; Pheterogeneity = 0.02) and over time (per 5 kg/m2 HR 0.86; 95% CI, 0.74–1.00; Pheterogeneity = 0.07) had a reduced risk of ERG-positive disease only. Higher age 21 BMI (per 5 inches HR 0.87; 95% CI, 0.73–1.03) and waist circumference (per 5 kg/m2 HR 0.85; 95% CI, 0.72–1.02) showed suggestive associations with a reduced risk of ERG-positive disease only, but heterogeneity tests across subtypes were nonsignificant.

Applying inverse probability weights did not materially change the results (Supplementary Table S2). Analyses stratified by PSA era were not well-powered to detect associations or heterogeneity, but results were qualitatively similar (Supplementary Table S3).

As the molecular taxonomy of prostate cancer has become clearer (1) it has become imperative to investigate disease etiology by prostate cancer subtype. Using an integrative molecular epidemiology approach, we found that taller height was associated with a higher risk of ERG-positive prostate cancer, and that baseline and updated obesity were associated with a lower risk. Evidence was only suggestive that age 21 BMI and waist circumference were associated with ERG-positive prostate cancer only; neither test of heterogeneity was significant. In sum, our results indicate that height and obesity are uniquely associated with the risk of ERG-positive prostate cancer. They also provide strong rationale for further investigations of other prostate cancer risk factors that may be distinctly associated with subtypes.

Androgen signaling likely promotes TMPRSS2:ERG (6, 13–17, 32, 33). Exposing prostate cancer cells (13–15, 33) and immortalized nonmalignant cells (16) to dihydrotestosterone appears to promote the fusion. In the HPFS and Physicians' Health Study, we found that increasing circulating free testosterone may be associated with a higher risk of ERG-positive disease only (6). Some (16, 32), but not all (34, 35) investigations, have reported that shorter polymorphic CAG repeat sequence in exon 1 of androgen receptor (AR) is associated with risk of ERG-positive disease only. The polymorphic CAG repeat is inversely associated with transcriptional activity of AR (36). As testosterone levels tend toward inverse correlations with obesity and potentially positive correlations with height (11, 12), these variations in testosterone signaling offer a plausible explanation for our findings.

This is the first investigation of the association between height and ERG-defined prostate cancer. Height has been shown to be modestly associated with an increased risk of total prostate cancer (37). We found no association, as is consistent with previous findings in the HPFS of positive associations with advanced and lethal prostate cancer only (38). Taller men may experience greater testosterone exposure over the life-course (12, 39, 40), and a pooled analysis found a weak positive association between serum testosterone and height (12). It follows that testosterone signaling may link height and TMPRSS2:ERG.

The IGF signaling pathway may also operate in the association between height and fusion-positive prostate cancer. IGF signaling is important in determining height (18), and evidence suggests that the IGF pathway is upregulated in fusion-positive cancers. Our group found higher IGF1 receptor (IGF1R) expression in ERG-positive tumors than in ERG-negative tumors (4). Another study found IGF1R was more highly expressed in the ERG-positive VCaP prostate cancer cell line than in cell lines lacking ERG, and transfecting ERG into other malignant cell lines increased IGF1R expression (19). Knockdown of ERG in VCaP cells also reduced IGF1R expression (19). Collectively, these data suggest greater IGF signaling in fusion-positive cancer. To the best of our knowledge, however, the role of IGF signaling in promoting TMPRSS2:ERG is untested.

Obesity has been consistently inversely associated with the risk of less aggressive prostate cancer (41). Regarding TMPRSS2:ERG-defined prostate cancer, our findings align with the only previous study to investigate obesity as a risk factor (9). It too found that obesity was distinctly associated with fusion-positive disease, reporting a 14% reduction in the odds of TMPRSS2:ERG-positive prostate cancer for every 5 kg/m2 increase in BMI (9). Obesity is characterized by lower circulating testosterone levels (11). Consistent with experimental research, our findings may be explained by lower circulating testosterone levels in obese men that reduce the risk for developing TMPRSS2:ERG. That the rate of obesity is higher (42, 43) and the rate of the fusion is lower (44–46) in populations of African ancestry relative to Caucasians lends additional support to our findings. It should be noted, however, that populations of East Asian ancestry also have lower rates of the fusion (3, 45–47), but tend toward lower rates of obesity (42, 43). Although detection bias due to hemodilution and greater biopsy sampling error in obese men may also partly explain an inverse association between obesity and prostate cancer (48), it is unlikely these issues would differ according to TMPRSS2:ERG status. We previously found obesity to be associated with a greater risk of lethal prostate cancer among men with fusion-positive tumors (4). It could be that the low testosterone environment of obesity is associated with reduced development of fusion-positive tumors, but following development of fusion-positive disease, increased growth stimulating hormone levels promote lethal progression. Further study is required to clarify these mechanisms and address the role of obesity in the development of TMPRSS2:ERG-positive prostate cancer in individuals of non-Caucasian ancestry.

Although age 21 BMI and waist circumference were suggestively associated with a reduced risk of ERG-positive prostate cancer, tests of heterogeneity across subtypes were nonsignificant. The only other study to evaluate recalled BMI in early adulthood and TMPRSS2:ERG-defined disease found similarly nonsignificant but suggestive results (9). The distribution of age 21 BMI in our cohort could have been too limited to observe a clear difference in associations with ERG-defined disease. Regarding waist circumference, no prior study has investigated associations with ERG-defined disease. Our results suggest that waist circumference may not be differentially associated with ERG-defined disease and/or that we were limited in our ability to evaluate the exposure; we had only one measurement during mid- to late-adulthood.

Strengths of our study include leveraging a large well-annotated prospectively monitored cohort. We collected exposure data prior to diagnosis and adjusted for potentially important demographic and lifestyle confounders. However, our cohort was comprised primarily of white men, limiting generalizability to other racial/ethnic groups. We did not directly measure TMPRSS2:ERG, though IHC for ERG has excellent agreement with FISH (26) and PCR (27). Finally, the cases for which we were able to assay ERG status, which were largely treated with RP, may not be representative of all men diagnosed with prostate cancer. However, their characteristics were not substantially different from other prostate cancer cases, and sensitivity analyses applying inverse probability weights did not appreciably change results.

In summary, we found that height was positively associated with risk of ERG-positive prostate cancer and that obesity, particularly later in life, was inversely associated with ERG-positive disease. Our results may reflect differences in exposure to androgens and/or IGF1 by height and obesity status. These findings suggest that prostate cancer etiology differs according to TMPRSS2:ERG status, and support conducting similar investigations beyond anthropometrics. Moreover, further studies are needed to validate our findings and clarify the mechanisms by which obesity and height may be linked with the development of TMPRSS2:ERG-positive prostate cancer.

No potential conflicts of interest were disclosed.

The authors assume full responsibility for analyses and interpretation of these data.

Conception and design: R.E. Graff, T.U. Ahearn, A. Pettersson, E.L. Giovannucci, M. Loda, L.A. Mucci

Development of methodology: R.E. Graff, M. Loda

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): R.T. Lis, M. Fiorentino, E.L. Giovannucci, M. Loda, L.A. Mucci

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): R.E. Graff, T.U. Ahearn, A. Pettersson, E.M. Ebot, T. Gerke, M. Song, R.T. Lis, M. Loda, L.A. Mucci

Writing, review, and/or revision of the manuscript: R.E. Graff, T.U. Ahearn, A. Pettersson, E.M. Ebot, T. Gerke, K.L. Penney, K.M. Wilson, S.C. Markt, C.H. Pernar, M. Song, R.T. Lis, D.R. Schmidt, M.G. Vander Heiden, E.L. Giovannucci, M. Loda, L.A. Mucci

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): T. Gerke, K.M. Wilson, A.G. Gonzalez-Feliciano

Study supervision: M.G. Vander Heiden, M. Loda, L.A. Mucci

We would like to thank the participants and staff of the HPFS for their valuable contributions as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, WY. The TMAs were constructed by the Tissue Microarray Core Facility at the Dana Farber/Harvard Cancer Center. This work was supported by the NIH (R01CA136578, to L.A. Mucci; U01 CA167552, to L.A. Mucci and W.C. Willett; P50 CA090381, to L.A. Mucci, M.G. Vander Heiden, and D.R. Schmidt; R25 CA112355, to R.E. Graff; T32 CA09001, to T.U. Ahearn, E.M. Ebot, S.C. Markt, and C.H. Pernar); the Prostate Cancer Foundation Young Investigators Awards (to L.A. Mucci, K.M. Wilson, and K.L. Penney); the American Cancer Society – Ellison Foundation Postdoctoral Fellowship (PF-14-140-01-CCE, to T.U. Ahearn); the Office of the Assistant Secretary of Defense for Health Affairs (under W81XWH-14-1-0250, to E.M. Ebot); SU2C (IRG0916, to M.G. Vander Heiden); the Ludwig Center at MIT (to M.G. Vander Heiden); and an HHMI faculty scholar award (to M.G. Vander Heiden).

Research supported by a Stand Up To Cancer Innovative Research Grant, Grant Number SU2C-AACR-IRG 09-16. Stand Up To Cancer is a program of the Entertainment Industry Foundation. Research grants are administered by the American Association for Cancer Research, the scientific partner of SU2C.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Abeshouse
A
,
Ahn
J
,
Akbani
R
,
Ally
A
,
Amin
S
,
Andry Christopher
D
, et al
The molecular taxonomy of primary prostate cancer
.
Cell
2015
;
163
:
1011
25
.
2.
Tomlins
SA
,
Laxman
B
,
Varambally
S
,
Cao
X
,
Yu
J
,
Helgeson
BE
, et al
Role of the TMPRSS2-ERG gene fusion in prostate cancer
.
Neoplasia
2008
;
10
:
177
88
.
3.
Pettersson
A
,
Graff
RE
,
Bauer
SR
,
Pitt
MJ
,
Lis
RT
,
Stack
EC
, et al
The TMPRSS2:ERG rearrangement, ERG expression, and prostate cancer outcomes: a cohort study and meta-analysis
.
Cancer Epidemiol Biomarkers Prev
2012
;
21
:
1497
509
.
4.
Pettersson
A
,
Lis
RT
,
Meisner
A
,
Flavin
R
,
Stack
EC
,
Fiorentino
M
, et al
Modification of the association between obesity and lethal prostate cancer by TMPRSS2:ERG
.
J Natl Cancer Inst
2013
;
105
:
1881
90
.
5.
Ahearn
TU
,
Pettersson
A
,
Ebot
EM
,
Gerke
T
,
Graff
RE
,
Morais
CL
, et al
A prospective investigation of PTEN loss and ERG expression in lethal prostate cancer
.
J Natl Cancer Inst
2016
;
108
:pii:
djv346
.
6.
Graff
RE
,
Meisner
A
,
Ahearn
TU
,
Fiorentino
M
,
Loda
M
,
Giovannucci
EL
, et al
Pre-diagnostic circulating sex hormone levels and risk of prostate cancer by ERG tumour protein expression
.
Br J Cancer
2016
;
114
:
939
44
.
7.
Graff
RE
,
Pettersson
A
,
Lis
RT
,
Ahearn
TU
,
Markt
SC
,
Wilson
KM
, et al
Dietary lycopene intake and risk of prostate cancer defined by ERG protein expression
.
Am J Clin Nutr
2016
;
103
:
851
60
.
8.
Wright
JL
,
Chery
L
,
Holt
S
,
Lin
DW
,
Luedeke
M
,
Rinckleb
AE
, et al
Aspirin and NSAID use in association with molecular subtypes of prostate cancer defined by TMPRSS2:ERG fusion status
.
Prostate Cancer Prostatic Dis
2016
;
19
:
53
6
.
9.
Egbers
L
,
Luedeke
M
,
Rinckleb
A
,
Kolb
S
,
Wright
JL
,
Maier
C
, et al
Obesity and prostate cancer risk according to tumor TMPRSS2:ERG gene fusion status
.
Am J Epidemiol
2015
;
181
:
706
13
.
10.
Penney
KL
,
Pettersson
A
,
Shui
IM
,
Graff
RE
,
Kraft
P
,
Lis
RT
, et al
Association of prostate cancer risk variants with TMPRSS2:ERG status: evidence for distinct molecular subtypes
.
Cancer Epidemiol Biomarkers Prev
2016
;
25
:
745
9
.
11.
Allan
CA
,
McLachlan
RI
. 
Androgens and obesity
.
Curr Opin Endocrinol Diabetes Obes
2010
;
17
:
224
32
.
12.
Handelsman
DJ
,
Yeap
B
,
Flicker
L
,
Martin
S
,
Wittert
GA
,
Ly
LP
. 
Age-specific population centiles for androgen status in men
.
Eur J Endocrinol
2015
;
173
:
809
17
.
13.
Mani
RS
,
Tomlins
SA
,
Callahan
K
,
Ghosh
A
,
Nyati
MK
,
Varambally
S
, et al
Induced chromosomal proximity and gene fusions in prostate cancer
.
Science
2009
;
326
:
1230
.
14.
Haffner
MC
,
Aryee
MJ
,
Toubaji
A
,
Esopi
DM
,
Albadine
R
,
Gurel
B
, et al
Androgen-induced TOP2B-mediated double-strand breaks and prostate cancer gene rearrangements
.
Nat Genet
2010
;
42
:
668
75
.
15.
Lin
C
,
Yang
L
,
Tanasa
B
,
Hutt
K
,
Ju
BG
,
Ohgi
K
, et al
Nuclear receptor-induced chromosomal proximity and DNA breaks underlie specific translocations in cancer
.
Cell
2009
;
139
:
1069
83
.
16.
Bastus
NC
,
Boyd
LK
,
Mao
X
,
Stankiewicz
E
,
Kudahetti
SC
,
Oliver
RT
, et al
Androgen-induced TMPRSS2:ERG fusion in nonmalignant prostate epithelial cells
.
Cancer Res
2010
;
70
:
9544
8
.
17.
Mani
RS
,
Amin
MA
,
Li
X
,
Kalyana-Sundaram
S
,
Veeneman
BA
,
Wang
L
, et al
Inflammation-induced oxidative stress mediates gene fusion formation in prostate cancer
.
Cell Rep
2016
;
17
:
2620
31
.
18.
Baron
J
,
Savendahl
L
,
De Luca
F
,
Dauber
A
,
Phillip
M
,
Wit
JM
, et al
Short and tall stature: a new paradigm emerges
.
Nat Rev Endocrinol
2015
;
11
:
735
46
.
19.
Mancarella
C
,
Casanova-Salas
I
,
Calatrava
A
,
Ventura
S
,
Garofalo
C
,
Rubio-Briones
J
, et al
ERG deregulation induces IGF-1R expression in prostate cancer cells and affects sensitivity to anti-IGF-1R agents
.
Oncotarget
2015
;
6
:
16611
22
.
20.
Rimm
EB
,
Stampfer
MJ
,
Colditz
GA
,
Chute
CG
,
Litin
LB
,
Willett
WC
. 
Validity of self-reported waist and hip circumferences in men and women
.
Epidemiology
1990
;
1
:
466
73
.
21.
Norgan
NG
,
Cameron
N
. 
The accuracy of body weight and height recall in middle-aged men
.
Int J Obes Relat Metab Disord
2000
;
24
:
1695
8
.
22.
Rhoads
GG
,
Kagan
A
. 
The relation of coronary disease, stroke, and mortality to weight in youth and in middle age
.
Lancet
1983
;
1
:
492
5
.
23.
Tamakoshi
K
,
Yatsuya
H
,
Kondo
T
,
Hirano
T
,
Hori
Y
,
Yoshida
T
, et al
The accuracy of long-term recall of past body weight in Japanese adult men
.
Int J Obes Relat Metab Disord
2003
;
27
:
247
52
.
24.
Catalona
WJ
,
Avioli
LV
. 
Diagnosis, staging, and surgical treatment of prostatic carcinoma
.
Arch Intern Med
1987
;
147
:
361
3
.
25.
Hendrickson
WK
,
Flavin
R
,
Kasperzyk
JL
,
Fiorentino
M
,
Fang
F
,
Lis
R
, et al
Vitamin D receptor protein expression in tumor tissue and prostate cancer progression
.
J Clin Oncol
2011
;
29
:
2378
85
.
26.
Chaux
A
,
Albadine
R
,
Toubaji
A
,
Hicks
J
,
Meeker
A
,
Platz
EA
, et al
Immunohistochemistry for ERG expression as a surrogate for TMPRSS2-ERG fusion detection in prostatic adenocarcinomas
.
Am J Surg Pathol
2011
;
35
:
1014
20
.
27.
van Leenders
GJ
,
Boormans
JL
,
Vissers
CJ
,
Hoogland
AM
,
Bressers
AA
,
Furusato
B
, et al
Antibody EPR3864 is specific for ERG genomic fusions in prostate cancer: implications for pathological practice
.
Mod Pathol
2011
;
24
:
1128
38
.
28.
Hastings
C
,
Mosteller
F
,
Tukey
JW
,
Winsor
CP
. 
Low moments for small samples: a comparative study of order statistics
.
Ann Math Stat
1947
;
18
:
413
26
.
29.
Lunn
M
,
McNeil
D
. 
Applying Cox regression to competing risks
.
Biometrics
1995
;
51
:
524
32
.
30.
Rosner
B
,
Glynn
RJ
,
Tamimi
RM
,
Chen
WY
,
Colditz
GA
,
Willett
WC
, et al
Breast cancer risk prediction with heterogeneous risk profiles according to breast cancer tumor markers
.
Am J Epidemiol
2013
;
178
:
296
308
.
31.
Glynn
RJ
,
Rosner
B
. 
Comparison of risk factors for the competing risks of coronary heart disease, stroke, and venous thromboembolism
.
Am J Epidemiol
2005
;
162
:
975
82
.
32.
Yoo
S
,
Pettersson
A
,
Jordahl
KM
,
Lis
RT
,
Lindstrom
S
,
Meisner
A
, et al
Androgen receptor CAG repeat polymorphism and risk of TMPRSS2:ERG-positive prostate cancer
.
Cancer Epidemiol Biomarkers Prev
2014
;
23
:
2027
31
.
33.
Coll-Bastus
N
,
Mao
X
,
Young
BD
,
Sheer
D
,
Lu
YJ
. 
DNA replication-dependent induction of gene proximity by androgen
.
Hum Mol Genet
2015
;
24
:
963
71
.
34.
Figg
WD
,
Chau
CH
,
Price
DK
,
Till
C
,
Goodman
PJ
,
Cho
Y
, et al
Androgen receptor CAG repeat length and TMPRSS2:ETS prostate cancer risk: results from the Prostate Cancer Prevention Trial
.
Urology
2014
;
84
:
127
31
.
35.
Mao
X
,
Li
J
,
Xu
X
,
Boyd
LK
,
He
W
,
Stankiewicz
E
, et al
Involvement of different mechanisms for the association of CAG repeat length polymorphism in androgen receptor gene with prostate cancer
.
Am J Cancer Res
2014
;
4
:
886
96
.
36.
Chamberlain
NL
,
Driver
ED
,
Miesfeld
RL
. 
The length and location of CAG trinucleotide repeats in the androgen receptor N-terminal domain affect transactivation function
.
Nucleic Acids Res
1994
;
22
:
3181
6
.
37.
Zuccolo
L
,
Harris
R
,
Gunnell
D
,
Oliver
S
,
Lane
JA
,
Davis
M
, et al
Height and prostate cancer risk: a large nested case-control study (ProtecT) and meta-analysis
.
Cancer Epidemiol Biomarkers Prev
2008
;
17
:
2325
36
.
38.
Moller
E
,
Wilson
KM
,
Batista
JL
,
Mucci
LA
,
Balter
K
,
Giovannucci
E
. 
Body size across the life course and prostate cancer in the Health Professionals Follow-up Study
.
Int J Cancer
2016
;
138
:
853
65
.
39.
Albin
AK
,
Norjavaara
E
. 
Pubertal growth and serum testosterone and estradiol levels in boys
.
Horm Res Paediatr
2013
;
80
:
100
10
.
40.
Lukas
WD
,
Campbell
BC
,
Ellison
PT
. 
Testosterone, aging, and body composition in men from Harare, Zimbabwe
.
Am J Hum Biol
2004
;
16
:
704
12
.
41.
Discacciati
A
,
Orsini
N
,
Wolk
A
. 
Body mass index and incidence of localized and advanced prostate cancer–a dose-response meta-analysis of prospective studies
.
Ann Oncol
2012
;
23
:
1665
71
.
42.
El-Sayed
AM
,
Scarborough
P
,
Galea
S
. 
Ethnic inequalities in obesity among children and adults in the UK: a systematic review of the literature
.
Obes Rev
2011
;
12
:
e516
34
.
43.
Flegal
KM
,
Kruszon-Moran
D
,
Carroll
MD
,
Fryar
CD
,
Ogden
CL
. 
Trends in obesity among adults in the United States, 2005 to 2014
.
JAMA
2016
;
315
:
2284
91
.
44.
Rosen
P
,
Pfister
D
,
Young
D
,
Petrovics
G
,
Chen
Y
,
Cullen
J
, et al
Differences in frequency of ERG oncoprotein expression between index tumors of Caucasian and African American patients with prostate cancer
.
Urology
2012
;
80
:
749
53
.
45.
Magi-Galluzzi
C
,
Tsusuki
T
,
Elson
P
,
Simmerman
K
,
LaFargue
C
,
Esgueva
R
, et al
TMPRSS2-ERG gene fusion prevalence and class are significantly different in prostate cancer of Caucasian, African-American and Japanese patients
.
Prostate
2011
;
71
:
489
97
.
46.
Ke Zhou
C
,
Young
D
,
Yeboah
ED
,
Coburn
SB
,
Tettey
Y
,
Biritwum
RB
, et al
TMPRSS2-ERG gene fusions in prostate cancer of West African men and a meta-analysis of racial differences
.
Am J Epidemiol
. 
2017 Jun 12
. [Epub ahead of print].
47.
Mao
X
,
Yu
Y
,
Boyd
LK
,
Ren
G
,
Lin
D
,
Chaplin
T
, et al
Distinct genomic alterations in prostate cancers in Chinese and Western populations suggest alternative pathways of prostate carcinogenesis
.
Cancer Res
2010
;
70
:
5207
12
.
48.
Wallner
LP
,
Morgenstern
H
,
McGree
ME
,
Jacobson
DJ
,
St Sauver
JL
,
Jacobsen
SJ
, et al
The effects of body mass index on changes in prostate-specific antigen levels and prostate volume over 15 years of follow-up: implications for prostate cancer detection
.
Cancer Epidemiol Biomarkers Prev
2011
;
20
:
501
8
.