Prostate cancer risk may be influenced by single genetic variants in the hormone-regulating genes androgen receptor (AR), cytochrome P450 (CYP17), and steroid-5-α-reductase type 2 (SRD5A2). In this study, we comprehensively investigated polymorphisms in these three loci and their joint effect in a large population-based study. We selected 23 haplotype-tagging single-nucleotide polymorphisms (htSNP) that could uniquely describe >95% of the haplotypes (6 in AR, 6 in CYP17, and 11 in SRD5A2). These htSNPs were then genotyped in the Cancer Prostate in Sweden population (2,826 case subjects and 1,705 controls). We observed significant association for several SNPs in the AR gene (P = 0.004-0.02) and CYP17 (P = 0.009-0.05) and one SNP in SRD5A2 (P = 0.02). Carriers of the most common AR haplotype had a significant excess risk to develop prostate cancer [odds ratio (OR), 1.25; 95% confidence interval (95% CI), 1.1-1.5; P = 0.002], yielding an estimated population attributable risk of 16% (95% CI, 0.06-0.25). Combining risk alleles from these genes yielded a 12% risk increase for each additional high-risk allele carried (95% CI, 1.1-1.2; P for trend = 9.2 × 10−5), with an overall OR of 1.87 (95% CI, 1.0-3.4) for carriers of all five included risk alleles, an OR of 2.13 (P for trend = 8 × 10−4) for advanced disease, and an OR of 4.35 (P for trend = 7 × 10−5) for disease onset before age 65 years. Genetic variation in key genes in the androgen pathway is important for development of prostate cancer and may account for a considerable proportion of all prostate cancers. Carriers of five high-risk alleles in the AR, CYP17, and SRD5A2 genes are at ∼2-fold excess risk to develop prostate cancer. (Cancer Res 2006; 66(22): 11077-83)

Prostate cancer constitutes a major health problem being the most common malignancy among men in developed countries. Accumulating evidence supports an important role of genetics in prostate cancer etiology, yet the responsible genes remain largely unidentified (1).

The prostate gland depends on androgens for its growth and maintenance, and they partly control cell proliferation and differentiation (2). Androgens are suggested to be involved also in prostate cancer development (3). Early-stage prostate cancer is androgen dependent, and androgen ablation is widely used to treat advanced disease (4). Accordingly, genetic variation within androgen-related genes has been proposed to influence prostate cancer risk. Specifically, three genes have been the target for intense investigation: the androgen receptor (AR; OMIM 313700), which among other things mediates androgen action, binds the male sex hormones testosterone and dihydrotestosterone (5), and activates expression of several other androgen-responsive genes (6); the cytochrome P450 (CYP17; OMIM 609300), which functions as a catalyst in key reactions involved in the last step of the testosterone biosynthesis (3); and the steroid-5-α-reductase type 2 (SRD5A2; OMIM 607306) responsible for converting testosterone to its biologically more active form dihydrotestosterone in the prostate (3).

Recently, we studied genetic variants reported to be associated with prostate cancer risk in a large Swedish prostate cancer case-control study (7). In total, we assessed 46 genetic variants in 27 genes and we were able to replicate 6 genetic variants in 5 different genes. Four of these replicated variants were located in the AR gene (CAG repeat), the CYP17 gene (rs743572), and the SRD5A2 gene (rs676033, rs523349). To this end, we now comprehensively investigate the effect from common germ-line genetic variants in the AR, CYP17, and SRD5A2 genes in a larger study. We used the approach of haplotype-tagging single-nucleotide polymorphisms (htSNP) to capture the majority of genetic variation and genotyped in total 23 SNPs in 4,531 men from a large Swedish population-based prostate cancer case-control study [Cancer Prostate in Sweden (CAPS)].

Case-control study. CAPS is a population-based case-control study described in detail elsewhere (8). Recruitment of participants lasted between March 2001 and October 31, 2003. We included all men living in the northern and central parts of Sweden, under the age of 80 years, and all men living in the Stockholm region and southeastern part of Sweden, under the age of 65 years.

All patients with a newly diagnosed carcinoma of the prostate were eligible to participate in the study. It is compulsory to report newly diagnosed cancer cases to regional cancer registries in Sweden, and by using these registries, we achieved complete case ascertainment. We obtained detailed clinical information, such as Gleason score, prostate-specific antigen (PSA) level at time of diagnosis, and tumor-node-metastasis stage from the Swedish National Prostate Cancer Registry.4

All cancers were cytologically (<5%) or pathologically verified. In total, 3,648 prostate cancer patients were identified and invited to the study. Of them, 3,039 (83%) agreed to participate, and for those, a blood sample and a questionnaire about risk factors and family history were collected. The clinical characteristics of the study subjects are presented in Table 1. Case subjects were classified as advanced cases if they met at least one of the following criteria: T3/T4, N+, M+, Gleason score of 8 to 10, or PSA level ≥50 ng/mL.

Table 1.

Characteristics of subjects in the CAPS study

CharacteristicsCases (N = 2,826)
Controls (N = 1,705)
n%n%
Age (y)     
    ≤59 640 22.6 275 16.1 
    60-69 1,321 46.7 739 43.3 
    ≥70 865 30.6 691 40.5 
PSA levels* (ng/mL)     
    <4 142 5.0 1,419 83.3 
    4-9.99 968 34.3 231 13.5 
    10-19.99 645 22.8 38 2.2 
    20-49.99 453 16.0 13 0.8 
    50-99.99 219 7.7 0.1 
    ≥100 306 10.8 0.1 
T stage     
    T0/TX 122 4.4   
    T1 1,025 36.3   
    T2 878 31.1   
    T3 697 24.7   
    T4 104 3.7   
N stage     
    N0/NX 2,733 96.7   
    N1-N3 93 3.3   
M stage     
    M0/MX 2,562 90.7   
    M1 264 9.3   
Gleason score     
    ≤4 101 4.0   
    5 284 11.1   
    6 957 37.4   
    7 773 30.2   
    8 246 9.6   
    9 177 6.9   
    10 24 0.9   
Differential grade     
    G1/GX 1,927 68.2   
    G2 583 20.6   
    G3 316 11.2   
Prostate cancer stage     
    Localized 1,575 56.8   
    Advanced 1,198 43.2   
CharacteristicsCases (N = 2,826)
Controls (N = 1,705)
n%n%
Age (y)     
    ≤59 640 22.6 275 16.1 
    60-69 1,321 46.7 739 43.3 
    ≥70 865 30.6 691 40.5 
PSA levels* (ng/mL)     
    <4 142 5.0 1,419 83.3 
    4-9.99 968 34.3 231 13.5 
    10-19.99 645 22.8 38 2.2 
    20-49.99 453 16.0 13 0.8 
    50-99.99 219 7.7 0.1 
    ≥100 306 10.8 0.1 
T stage     
    T0/TX 122 4.4   
    T1 1,025 36.3   
    T2 878 31.1   
    T3 697 24.7   
    T4 104 3.7   
N stage     
    N0/NX 2,733 96.7   
    N1-N3 93 3.3   
M stage     
    M0/MX 2,562 90.7   
    M1 264 9.3   
Gleason score     
    ≤4 101 4.0   
    5 284 11.1   
    6 957 37.4   
    7 773 30.2   
    8 246 9.6   
    9 177 6.9   
    10 24 0.9   
Differential grade     
    G1/GX 1,927 68.2   
    G2 583 20.6   
    G3 316 11.2   
Prostate cancer stage     
    Localized 1,575 56.8   
    Advanced 1,198 43.2   
*

PSA was not available for all subjects.

Case subjects were classified as advanced cases if they met at least one of the following criteria: T3/T4, N+, M+, Gleason score of 8 to 10, or PSA level ≥50 ng/mL. All cases were not classified.

Control subjects were randomly selected from the Swedish Population Registry, frequency matched according to the expected age distribution of cases (groups of 5-year interval) and geographic region (two regions, representing north and south of Sweden, including Stockholm). Control subjects were recruited concurrent with case subjects. A total of 3,153 controls was invited to the study and 1,895 (60%) agreed to participate. At time for this study, DNA was available for 2,826 cases and 1,705 controls. Written informed consent was obtained from each participant. The ethics committee at the Karolinska Institutet (Stockholm, Sweden) and Umeå University (Umeå, Sweden) approved the study.

SNP selection. The target region for selection of SNPs in the AR gene included 3 kb of the promoter region, all exons, introns, and 8 kb of the 3′-untranslated region (UTR). Using the public database SNPper,5

we identified altogether 116 SNPs and selected a subset of 52 SNPs with the criterion of 1 SNP per 4 kb. We prioritized validated SNPs and missense mutations.

For CYP17 and SRD5A2, SNP selection was based on phase II data from the International HapMap Project.6

We only included SNPs with a minor allele frequency >5%. By including complete haplotype blocks as defined by Gabriel et al. (9), our target region reached both upstream and downstream of the gene as long as linkage disequilibrium was maintained. For CYP17, the region examined reached from 14 kb upstream of the transcription start site to 9 kb downstream of the 3′-UTR. For SRD5A2, we examined a region extending from 54 kb upstream of the transcription start site to 2 kb downstream of the 3′-UTR.

Tagging methodology. We genotyped the selected 52 SNPs in AR in a randomly selected subset of 94 controls from the CAPS study. Consequently, four SNPs were eliminated because of failed assay, three due to low success rate, and 30 due to monomorphic results. From the remaining 15 SNPs, haplotypes were constructed. By haplotype deviation analysis using the htSNP2 software,7

we selected four SNPs that explained >95% of the haplotype diversity as htSNPs. At this time, the HapMap Project had released data for the AR gene. By comparing our haplotype distribution with theirs, we decided to complement our data with two additional SNPs and extended our target region to include 10 kb of the promoter and 11 kb of the 3′-UTR. In total, six SNPs were genotyped in the whole study population.

For CYP17 and SRD5A2, we selected htSNPs with the tagSNPSv2 software (10). The target region of CYP17 comprised one haplotype block. In total, seven SNPs were selected for genotyping in the whole study population. The target region of SRD5A2 comprised two haplotype blocks, which were tagged individually. htSNPs were selected to capture at least 95% of the haplotype diversity. In addition, we filled the gap between the blocks by tagging the whole region with already selected htSNPs and chose possible additional SNPs needed to ensure a total coverage of at least 95% of the genetic variation. In total, 11 SNPs were selected and genotyped in the whole study population.

Genotyping. The DNA samples were genotyped using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (Sequenom, Inc., San Diego, CA; ref. 11). PCR assays and associated extension reactions were designed using the SpectroDESIGNER software (Sequenom), and primer sequences are available on request (Metabion GmbH, Planegg-Martinsried, Germany). All amplification reactions were run in the same conditions in a total volume of 5 mL with 2.5 ng of genomic DNA, 1 pmol of each amplification primer, 0.2 mmol/L of each deoxynucleotide triphosphate, 2.5 mmol/L MgCl2, and 0.2 units of HotStarTaq DNA Polymerase (Qiagen, Inc., Valencia, CA). Reactions were heated at 95°C for 15 minutes and subjected to 45 cycles of amplification (20 seconds at 94°C, 30 seconds at 60°C, 30 seconds at 72°C) before a final extension of 7 minutes at 72°C. Extension reactions were conducted in a total volume of 9 mL using 5 pmol of allele-specific extension primer and the Mass EXTEND Reagents kit before being cleaned using SpectroCLEANER (Sequenom) on a MULTIMEK 96 automated 96-channels robot (Beckman Coulter, Fullerton, CA). Clean primer extension products were loaded onto a 384-element chip with a nanoliter pipetting system (Sequenom) and analyzed by a MassARRAY mass spectrometer (Bruker Daltonik GmbH, Bremen, Germany). The resulting mass spectra were analyzed for peak identification using the SpectroTYPER RT 2.0 software (Sequenom). For each SNP, two independent scorers confirmed all genotypes.

Statistical methods. Hardy-Weinberg equilibrium test for each autosomal SNP was done using a replication method as implemented in the GENETICS package8

for the R programming language. For each test, 10,000 permutations were run.

We tested for association between prostate cancer risk and each SNP using a likelihood ratio test of a covariate equal to number of rare alleles (0, 1, or 2) based on an unconditional logistic regression model as implemented in the STATA software. We adjusted for age and geographic region using indicator variables representing each combination of age of onset (groups of 5-year interval) and geographic region (two regions, representing north and south of Sweden, including Stockholm).

We used the HAPLO.STATS package (12) in the software language R for haplotype analysis. This method provides both global and haplotype-specific tests. We adjusted all computations for age and geographic region as described above. Haplotypes with a frequency <0.01 were pooled together. We calculated empirical Ps by randomly permuting the trait and covariates (13). Precision criterion for the Ps was set to a sample SE of one fourth of the estimated P, but at least 1,000 permutations were run for each simulation. We estimated haplotype-specific odds ratios (OR) for haplotype A in AR by calculating the “dosage” using the tagSNPSv2 software (10) and use it as a covariate in a logistic regression. Because men carry only one copy of AR, the only phase ambiguity in AR haplotypes is due to missing genotypes.

We tested for interaction between the genes by selecting the most significant SNP in each gene and included them and their two-order product terms as covariates in a logistic regression. Interaction was then assessed with a likelihood ratio test. In addition, we estimated the joint effect by calculating number of risk alleles carried (ranging from 0 to 5) for each subject and test for association with prostate cancer risk. The population attributable risk (PAR) was estimated by maximum likelihood estimation (14). All reported Ps are based on two-sided tests.

Genotyping failed for one selected htSNP in CYP17, rs17724534. Average genotyping success for the other SNPs was 97.7% (range, 90.6-99.0%). The concordance rate between duplicated samples (n = 330) was 99.96%. All autosomal SNPs were in Hardy-Weinberg equilibrium among the controls (P > 0.05).

Association analyses. Four AR SNPs (rs17302090, rs6152, rs7061037, and rs5964607) were inversely associated with prostate cancer risk (Table 2). We observed further risk reduction among cases with an advanced disease (ORs between 0.72 and 0.75). Analysis restricted to subjects <65 years revealed association for rs1337080 [OR, 0.69; 95% confidence interval (95% CI), 0.49-0.98; P = 0.04] and yet lower risk estimates for rs6152, rs7061037, and rs5964607 (ORs between 0.69 and 0.70).

Table 2.

Association between prostate cancer and SNPs in AR, CYP17, and SRD5A2 among Swedish men

GeneSNPRelative positionRoleGenotypeCases, n (%)Controls, n (%)P*OR (95% CI)
AR rs17302090 −9,586 Promoter 2,594 (94) 1,540 (92)  1.00 
    163 (6) 129 (8) 0.02 0.75 (0.6-1.0) 
 rs6152 +639 Exon 2,433 (87) 1,411 (84)   
    370 (13) 279 (16) 0.004 0.77 (0.6-0.9) 
 rs7061037 +57,175 Intron 2,424 (87) 1,404 (84)   
    369 (13) 278 (16) 0.004 0.77 (0.7-0.9) 
 rs1337080 +113,931 Intron 2,429 (93) 1,460 (92)   
    180 (7) 134 (8) 0.11 0.82 (0.7-1.0) 
 rs5031002 +177,637 Intron/exon boundary 2,692 (97) 1,630 (96)   
    91 (3) 63 (4) 0.42 0.87 (0.6-1.2) 
 rs5964607 +191,443 3′-UTR 2,260 (82) 1,301 (78)   
    501 (18) 367 (22) 0.004 0.79 (0.7-0.9) 
CYP17 rs2486758 −362 Promoter CC 1,646 (59) 1,034 (61)   
    CT 1,004 (36) 564 (34)  1.15 (1.0-1-3) 
    TT 142 (5) 84 (5) 0.05 1.13 (0.8-1.5) 
 rs17115100 +5,726 Intron GG 2,225 (80) 1,354 (80)   
    GT 541 (19) 324 (19)  1.01 (0.9-1.2) 
    TT 31 (1) 14 (1) 0.70 1.26 (0.7-2.4) 
 rs10883783 +5,967 Intron TT 1,457 (53) 862 (52)   
    AT 1,131 (41) 658 (40)  0.99 (0.9-1.1) 
    AA 172 (6) 143 (9) 0.04 0.69 (0.5-0.9) 
 rs4919683 +11,994 3′-UTR CC 1,013 (37) 581 (35)   
    AC 1,322 (48) 802 (48)  0.92 (0.8-1.1) 
    AA 426 (15) 282 (17) 0.04 0.83 (0.7-1.0) 
 rs10883782 +13,871 3′-UTR AA 2,054 (76) 1,265 (77)   
    AG 618 (23) 343 (21)  1.11 (1.0-1.3) 
    GG 46 (2) 33 (2) 0.48 0.84 (0.5-1.3) 
 rs619824 15,831 3′-UTR GG 952 (35) 548 (33)   
    GT 1,353 (49) 802 (48)  0.95 (0.8-1.1) 
    TT 450 (16) 321 (19) 0.009 0.77 (0.6-0.9) 
SRD5A2 rs481344 −53,949 Promoter GG 1,291 (46) 830 (49)   
    AG 1,227 (44) 698 (41)  1.12 (1.0-1.3) 
    AA 282 (10) 163 (10) 0.18 1.08 (0.9-1.3) 
 rs508562 −31,398 Promoter AA 867 (32) 553 (34)   
    AG 1,345 (49) 805 (49)  1.04 (0.9-1.2) 
    GG 525 (19) 290 (18) 0.26 1.11 (0.9-1.3) 
 rs623419 −28,108 Promoter GG 1,142 (42) 751 (46)   
    AG 1,207 (45) 707 (43)  1.11 (1.0-1.3) 
    AA 357 (13) 181 (11) 0.02 1.26 (1.0-1.5) 
 rs4952224 −14,286 Promoter AA 2,562 (91) 1,533 (91)   
    AG 240 (9) 149 (9)  0.95 (0.8-1.2) 
    GG 0 (0) 3 (0.2) 0.40 NA (NA) 
 rs585890 −13,460 Promoter AA 782 (28) 454 (27)   
    AT 1,387 (50) 806 (48)  1.01 (0.9-1.2) 
    TT 613 (22) 417 (25) 0.16 0.88 (0.7-1.0) 
 rs676033 −3,001 Promoter CC 1,241 (44) 805 (48)   
    CT 1,243 (44) 714 (42)  1.11 (1.0-1.3) 
    TT 315 (11) 174 (10) 0.09 1.14 (0.9-1.4) 
 rs523349 +264 Exon CC 1,262 (46) 815 (48)   
    CG 1,223 (44) 707 (42)  1.10 (1.0-1.3) 
    GG 286 (10) 164 (10) 0.22 1.08 (0.9-1.5) 
 rs2268796 +23,690 Intron GG 920 (33) 591 (35)   
    AG 1,360 (49) 804 (48)  1.06 (0.9-1.2) 
    AA 521 (19) 296 (18) 0.27 1.10 (0.9-1.3) 
 rs4952220 +40,414 Intron AA 893 (32) 576 (35)   
    AC 1,354 (49) 796 (48)  1.08 (0.9-1.2) 
    CC 537 (19) 286 (17) 0.07 1.18 (1.0-1.4) 
 rs12470143 +42,412 Intron CC 787 (28) 457 (27)   
    CT 1,383 (50) 797 (48)  1.01 (0.9-1.2) 
    TT 614 (22) 421 (25) 0.14 0.87 (0.7-1.0) 
 rs11889731 +53,113 Intron GG 2,253 (80) 1,352 (80)   
    GT 553 (19) 316 (19)  1.02 (0.9-1.2) 
    TT 32 (1) 27 (2) 0.70 0.69 (0.4-1.2) 
GeneSNPRelative positionRoleGenotypeCases, n (%)Controls, n (%)P*OR (95% CI)
AR rs17302090 −9,586 Promoter 2,594 (94) 1,540 (92)  1.00 
    163 (6) 129 (8) 0.02 0.75 (0.6-1.0) 
 rs6152 +639 Exon 2,433 (87) 1,411 (84)   
    370 (13) 279 (16) 0.004 0.77 (0.6-0.9) 
 rs7061037 +57,175 Intron 2,424 (87) 1,404 (84)   
    369 (13) 278 (16) 0.004 0.77 (0.7-0.9) 
 rs1337080 +113,931 Intron 2,429 (93) 1,460 (92)   
    180 (7) 134 (8) 0.11 0.82 (0.7-1.0) 
 rs5031002 +177,637 Intron/exon boundary 2,692 (97) 1,630 (96)   
    91 (3) 63 (4) 0.42 0.87 (0.6-1.2) 
 rs5964607 +191,443 3′-UTR 2,260 (82) 1,301 (78)   
    501 (18) 367 (22) 0.004 0.79 (0.7-0.9) 
CYP17 rs2486758 −362 Promoter CC 1,646 (59) 1,034 (61)   
    CT 1,004 (36) 564 (34)  1.15 (1.0-1-3) 
    TT 142 (5) 84 (5) 0.05 1.13 (0.8-1.5) 
 rs17115100 +5,726 Intron GG 2,225 (80) 1,354 (80)   
    GT 541 (19) 324 (19)  1.01 (0.9-1.2) 
    TT 31 (1) 14 (1) 0.70 1.26 (0.7-2.4) 
 rs10883783 +5,967 Intron TT 1,457 (53) 862 (52)   
    AT 1,131 (41) 658 (40)  0.99 (0.9-1.1) 
    AA 172 (6) 143 (9) 0.04 0.69 (0.5-0.9) 
 rs4919683 +11,994 3′-UTR CC 1,013 (37) 581 (35)   
    AC 1,322 (48) 802 (48)  0.92 (0.8-1.1) 
    AA 426 (15) 282 (17) 0.04 0.83 (0.7-1.0) 
 rs10883782 +13,871 3′-UTR AA 2,054 (76) 1,265 (77)   
    AG 618 (23) 343 (21)  1.11 (1.0-1.3) 
    GG 46 (2) 33 (2) 0.48 0.84 (0.5-1.3) 
 rs619824 15,831 3′-UTR GG 952 (35) 548 (33)   
    GT 1,353 (49) 802 (48)  0.95 (0.8-1.1) 
    TT 450 (16) 321 (19) 0.009 0.77 (0.6-0.9) 
SRD5A2 rs481344 −53,949 Promoter GG 1,291 (46) 830 (49)   
    AG 1,227 (44) 698 (41)  1.12 (1.0-1.3) 
    AA 282 (10) 163 (10) 0.18 1.08 (0.9-1.3) 
 rs508562 −31,398 Promoter AA 867 (32) 553 (34)   
    AG 1,345 (49) 805 (49)  1.04 (0.9-1.2) 
    GG 525 (19) 290 (18) 0.26 1.11 (0.9-1.3) 
 rs623419 −28,108 Promoter GG 1,142 (42) 751 (46)   
    AG 1,207 (45) 707 (43)  1.11 (1.0-1.3) 
    AA 357 (13) 181 (11) 0.02 1.26 (1.0-1.5) 
 rs4952224 −14,286 Promoter AA 2,562 (91) 1,533 (91)   
    AG 240 (9) 149 (9)  0.95 (0.8-1.2) 
    GG 0 (0) 3 (0.2) 0.40 NA (NA) 
 rs585890 −13,460 Promoter AA 782 (28) 454 (27)   
    AT 1,387 (50) 806 (48)  1.01 (0.9-1.2) 
    TT 613 (22) 417 (25) 0.16 0.88 (0.7-1.0) 
 rs676033 −3,001 Promoter CC 1,241 (44) 805 (48)   
    CT 1,243 (44) 714 (42)  1.11 (1.0-1.3) 
    TT 315 (11) 174 (10) 0.09 1.14 (0.9-1.4) 
 rs523349 +264 Exon CC 1,262 (46) 815 (48)   
    CG 1,223 (44) 707 (42)  1.10 (1.0-1.3) 
    GG 286 (10) 164 (10) 0.22 1.08 (0.9-1.5) 
 rs2268796 +23,690 Intron GG 920 (33) 591 (35)   
    AG 1,360 (49) 804 (48)  1.06 (0.9-1.2) 
    AA 521 (19) 296 (18) 0.27 1.10 (0.9-1.3) 
 rs4952220 +40,414 Intron AA 893 (32) 576 (35)   
    AC 1,354 (49) 796 (48)  1.08 (0.9-1.2) 
    CC 537 (19) 286 (17) 0.07 1.18 (1.0-1.4) 
 rs12470143 +42,412 Intron CC 787 (28) 457 (27)   
    CT 1,383 (50) 797 (48)  1.01 (0.9-1.2) 
    TT 614 (22) 421 (25) 0.14 0.87 (0.7-1.0) 
 rs11889731 +53,113 Intron GG 2,253 (80) 1,352 (80)   
    GT 553 (19) 316 (19)  1.02 (0.9-1.2) 
    TT 32 (1) 27 (2) 0.70 0.69 (0.4-1.2) 

Abbreviation: NA, not available.

*

Adjusted for age and geographic region.

Carriers of one allele of the rs2486758 SNP located in the promoter region of CYP17 were at 15% higher risk to develop prostate cancer than noncarriers. Moreover, three SNPs located at the end of the gene were associated with a decreased risk of prostate cancer (Table 2). For all CYP17 SNPs, the association became more pronounced in cases with early-onset disease. For example, carrying two copies of the rs619824 ‘T’ allele decreased the risk for a diagnosis at an early age by 36% (OR, 0.64; 95% CI, 0.49-0.83; P = 0.0009).

One SNP located 28 kb upstream of SRD5A2 was significantly associated with prostate cancer risk (OR, 1.26; 95% CI, 1.02-1.54) comparing A/A versus G/G. No other associations were observed. However, rs4952220, rs676033, rs623419, and rs508562 gave a marginally higher risk for advanced prostate cancer (P = 0.02-0.05).

Haplotype analyses. We observed an overall difference in AR haplotype frequencies between cases and controls (global P = 0.04; Table 3). The most common AR haplotype GGAAGC was more prevalent in cases (78%) than in controls (74%), yielding a 25% excess risk of developing prostate cancer (95% CI, 1.09-1.45; P = 0.002). The common nature of this haplotype causes a PAR of 16% (95% CI, 0.06-0.25). We observed an enrichment of GGAAGC haplotype carriers among advanced cases (carrier frequency 80%; OR, 1.39; 95% CI, 1.16-1.66; P = 0.0004) corresponding to a PAR of 22% (95% CI, 0.11-0.33).

Table 3.

Haplotype frequencies for prostate cancer cases and controls in AR, CYP17, and SRD5A2

GeneBlockHaplotype IDHaplotypeFrequencies
P*
CasesControls
AR            
          0.04 
   rs17302090 rs6152 rs7061037 rs1337080 rs5031002 rs5964607    
  0.78 0.74 0.002 
  0.07 0.09 0.07 
  0.06 0.07 0.04 
  0.05 0.05 0.86 
  0.04 0.03 0.36 
CYP17            
          0.03 
   rs619824 rs10883782 rs4919683 rs10883783 rs17115100 rs2486758    
  0.26 0.27 0.04 
  0.24 0.23 0.38 
  0.21 0.20 0.07 
  0.12 0.11 0.58 
  0.11 0.10 0.53 
  0.02 0.03 0.01 
  0.02 0.02 0.27 
  0.01 0.01 0.37 
SRD5A2            
          0.31 
   rs11889731  rs12470143  rs4952220     
     0.46 0.48 0.09 
     0.33 0.31 0.13 
     0.10 0.09 0.62 
     0.10 0.09 0.60 
          0.29 
   rs676033 rs585890 rs4952224 rs623419 rs508562 rs481344    
  0.38 0.40 0.15 
  0.30 0.28 0.23 
  0.10 0.10 0.81 
  0.10 0.09 0.91 
  0.03 0.04 0.20 
GeneBlockHaplotype IDHaplotypeFrequencies
P*
CasesControls
AR            
          0.04 
   rs17302090 rs6152 rs7061037 rs1337080 rs5031002 rs5964607    
  0.78 0.74 0.002 
  0.07 0.09 0.07 
  0.06 0.07 0.04 
  0.05 0.05 0.86 
  0.04 0.03 0.36 
CYP17            
          0.03 
   rs619824 rs10883782 rs4919683 rs10883783 rs17115100 rs2486758    
  0.26 0.27 0.04 
  0.24 0.23 0.38 
  0.21 0.20 0.07 
  0.12 0.11 0.58 
  0.11 0.10 0.53 
  0.02 0.03 0.01 
  0.02 0.02 0.27 
  0.01 0.01 0.37 
SRD5A2            
          0.31 
   rs11889731  rs12470143  rs4952220     
     0.46 0.48 0.09 
     0.33 0.31 0.13 
     0.10 0.09 0.62 
     0.10 0.09 0.60 
          0.29 
   rs676033 rs585890 rs4952224 rs623419 rs508562 rs481344    
  0.38 0.40 0.15 
  0.30 0.28 0.23 
  0.10 0.10 0.81 
  0.10 0.09 0.91 
  0.03 0.04 0.20 
*

Adjusted for age and geographic region.

An overall difference in CYP17 haplotype frequencies between cases and controls was observed (global P = 0.03; Table 3). Specifically, the TAAAGC haplotype was more common among controls (27%) than cases (26%; P = 0.04; Table 3). These observations were more pronounced in patients with an early-onset cancer (global P = 0.02; P = 0.005 for the TAAAGC haplotype).

We did not observe any differences in haplotype frequencies between cases and controls at the SRD5A2 locus. However, we observed that carriers of the GTA haplotype in block 1 had a decreased risk to develop advanced prostate cancer (P = 0.04).

Combination of SNPs. We estimated the joint effect of multiple SNPs in the same pathway by calculating the combined risk estimate for the rs6152 SNP in AR, rs619824 in CYP17, and rs623419 in SRD5A2. We did not observe any statistical interaction between these SNPs (P = 0.61). Because both rs6152 and rs619824 were associated with decreased risk of prostate cancer, we regarded the common allele as the risk allele. The number of risk alleles ranged between 0 and 5. Each additional high-risk allele was associated with a 12% risk increase (95% CI, 1.06-1.19; P = 0.00009; Table 4). This risk increment was similar in advanced cases (OR, 1.13; 95% CI, 1.05-1.21; P = 0.0008; Table 4) but more pronounced in cases with an early-onset disease (OR, 1.20; 95% CI, 1.09-1.30; P = 0.00007; Table 4). Compared with the reference group, individuals with five risk alleles exhibited significant higher risk of prostate cancer (OR, 1.87; 95% CI, 1.02-3.42; Table 4). The risk of early-onset prostate cancer was 4.35 times higher (95% CI, 1.67-11.3) for carriers of five risk alleles compared with those carrying none.

Table 4.

Combined effects of risk alleles for AR SNP rs6152, CYP17 SNP rs619824, and SRD5A2 SNP rs623419

PopulationNo. risk allelesCases, n (%)Controls, n (%)OR* (95% CI)P* (trend)
All cases     9 × 10−5 
 34 (1.2) 29 (1.7) 1.00 (Reference)  
 318 (11.3) 238 (14.0) 1.18 (0.7-2.0)  
 835 (29.6) 550 (32.3) 1.33 (0.8-2.2)  
 1,041 (36.8) 564 (33.1) 1.64 (1.0-2.7)  
 491 (17.4) 272 (16.0) 1.57 (0.9-2.7)  
 107 (3.8) 52 (3.1) 1.87 (1.0-3.4)  
Advanced cases     8 × 10−4 
 15 (1.2) 29 (1.7) 1.00 (Reference)  
 145 (11.6) 238 (14.0) 1.20 (0.6-2.3)  
 364 (29.1) 550 (32.3) 1.26 (0.7-2.4)  
 454 (36.3) 564 (33.1) 1.55 (0.8-2.9)  
 218 (17.4) 272 (16.0) 1.53 (0.8-3.0)  
 55 (4.4) 52 (3.1) 2.13 (1.0-4.4)  
Young cases (<65 years)     7 × 10−5 
 17 (1.2) 15 (2.2) 1.00 (Reference)  
 177 (12.4) 105 (15.2) 1.57 (0.8-3.3)  
 407 (28.6) 234 (33.9) 1.61 (0.8-3.3)  
 522 (36.7) 220 (31.8) 2.20 (1.1-4.5)  
 248 (17.4) 106 (15.3) 2.12 (1.0-4.4)  
 53 (3.7) 11 (1.6) 4.35 (1.7-11.3)  
PopulationNo. risk allelesCases, n (%)Controls, n (%)OR* (95% CI)P* (trend)
All cases     9 × 10−5 
 34 (1.2) 29 (1.7) 1.00 (Reference)  
 318 (11.3) 238 (14.0) 1.18 (0.7-2.0)  
 835 (29.6) 550 (32.3) 1.33 (0.8-2.2)  
 1,041 (36.8) 564 (33.1) 1.64 (1.0-2.7)  
 491 (17.4) 272 (16.0) 1.57 (0.9-2.7)  
 107 (3.8) 52 (3.1) 1.87 (1.0-3.4)  
Advanced cases     8 × 10−4 
 15 (1.2) 29 (1.7) 1.00 (Reference)  
 145 (11.6) 238 (14.0) 1.20 (0.6-2.3)  
 364 (29.1) 550 (32.3) 1.26 (0.7-2.4)  
 454 (36.3) 564 (33.1) 1.55 (0.8-2.9)  
 218 (17.4) 272 (16.0) 1.53 (0.8-3.0)  
 55 (4.4) 52 (3.1) 2.13 (1.0-4.4)  
Young cases (<65 years)     7 × 10−5 
 17 (1.2) 15 (2.2) 1.00 (Reference)  
 177 (12.4) 105 (15.2) 1.57 (0.8-3.3)  
 407 (28.6) 234 (33.9) 1.61 (0.8-3.3)  
 522 (36.7) 220 (31.8) 2.20 (1.1-4.5)  
 248 (17.4) 106 (15.3) 2.12 (1.0-4.4)  
 53 (3.7) 11 (1.6) 4.35 (1.7-11.3)  
*

Adjusted for age and geographic region.

Cases meeting any of the following criteria: T stage ≥3, N+, M+, Gleason scale ≥8, or PSA ≥50 ng/mL.

The CAG repeat in the androgen receptor. Earlier, we have reported that carriers of >22 AR CAG repeats exhibit a significantly higher risk of developing prostate cancer (7). This study was based on the first recruitment phase of CAPS (1,461 case subjects and 796 controls). In this study, we investigated the relationship between AR SNPs and the number of CAG repeats (long/short, where long is defined as >22 repeats) and found a strong genetic correlation between these (D′ ranging from 0.53 to 1.00).

We noticed that carrying a long allele reduces haplotype variability to include only the GGAAGC haplotype or the GGAAGT haplotype, whereas all haplotypes were represented among individuals with short repeats.

In this study, we examined if germ-line genetic variation in the androgen metabolic pathway alters risk for prostate cancer development. In total, 23 SNPs located in three hormone-related genes (AR, CYP17, and SRD5A2) were genotyped in 2,826 prostate cancer cases and 1,705 controls. We found that both individual SNPs and haplotypes were associated with prostate cancer risk. Specifically, the most common AR haplotype was carried by 78% of affected men compared with 74% of population controls, yielding a PAR of 16%. This haplotype associated more strongly with aggressive forms of prostate cancer (PAR, 22%) and young age at onset (PAR, 22%). In addition, several AR SNPs were significantly correlated with prostate cancer.

In agreement with our results, a smaller study from Australia (15) recently identified an association between rs6152 and reduced risk of metastatic prostate cancer. A U.S. study investigated AR polymorphisms and prostate cancer risk (16) by genotyping 32 SNPs in 882 cases and 874 controls with a multiethnic background. Although no overall association was found, 11 of 32 SNPs were associated with a reduced risk for an aggressive disease in concordance with our results. However, because their population included five different ethnic backgrounds, the sample size within each ethnicity was limited.

Individuals having a brother with prostate cancer are at higher risk than those with an affected father (17), suggesting an X-linked genetic model. The AR gene is located on Xq11.2-q12 and constitutes a compelling candidate gene for prostate cancer. Accordingly, the most investigated polymorphism in prostate cancer genetics is the CAG repeat located in exon 1 in the AR gene, 401 bp upstream from rs6152. An inverse relationship between CAG repeats and AR transcriptional activity has been found (18). Results from case-control studies have, however, been conflicting and inconclusive, although the majority suggests an increased risk for carriers of few repeats. Recently, we found that carriers of >22 repeats exhibit a higher risk of developing prostate cancer (7). When we included the CAG repeat (long/short) in our haplotype analysis, we noticed that the GGAAGC haplotype was divided into two haplotypes. Interestingly, none of these haplotypes were associated with risk as would have been expected if the CAG repeat marker was casual. Our result therefore suggests that the CAG repeat is not a causal variant for prostate cancer development. More likely, this repeat is in strong linkage disequilibrium with a yet unidentified variant located nearby.

Multiple CYP17 SNPs and haplotypes were associated with prostate cancer risk in this study, especially for patients with an early-onset disease. Interestingly, association was more pronounced in the 3′-UTR region (rs619824; Table 2). A recent study among 715 men from 266 prostate cancer families found a protective effect for this SNP in concordance with us (19). They also found that this SNP is located within a region showing sequence homology to a CCAAT/enhancer-binding protein, known to be a strong regulator of transcription. Thus, it is possible that this SNP may have direct functional relevance. The lack of functional studies on these genes makes it difficult to elaborate with the consequences on gene function these polymorphisms convey. However, a low testosterone level has been shown to correlate with poorly differentiated cancer and poor prognosis repeated times (2026).

We observed an increasing risk effect with number of risk alleles carried in this pathway; however, we could not detect any interaction effect between these genes. This suggests that these genes individually affect initiation of prostate cancer, and the sum of their main effects produces a substantial risk increase particularly for advanced and early-onset cancers. Patients carrying all five risk alleles have two times higher risk to develop advanced prostate cancer, and they are four times more likely to receive a diagnosis at a young age (Table 4) than noncarriers. Our results suggest that combined analysis of multiple SNPs in the same pathway may strengthen otherwise nominally associations. This is in line with the hypothesis of prostate cancer being a multigenic disease. Indeed, we do not expect that one single genetic variant will have a useful predictive value in risk assessment, and this study shows a possible way to identify subgroups with noteworthy risk increase.

To account for multiple testing, we did a data simulation by randomly permuting case-control status and then reevaluated association for each SNP. Adjusted Ps were then computed based on the empirical distribution of the maximum of the 23 test statistics. In addition, we estimated the probability of observing at least 9 significant associations (at the 5% level) of 23 tests under the null hypothesis of no associations. Based on 10,000 permutations, the only SNPs that remained significant were rs6152, rs7061037, and rs5964607 in the AR; however, the probability for observing at least nine significant associations was estimated to only 0.8%.

This study is to our knowledge the largest (>4,500 subjects) homogenous case-control study of prostate cancer. The CAPS study is unique in the context of a well-characterized phenotype and a population-based ascertainment from a homogenous population. Because prostate cancer diagnosis was little influenced by PSA screening at the time for this study (27), our findings apply predominantly to a clinically significant disease. Indeed, 43% of included cases are classified as having an aggressive cancer with high risk of disease progression. When generalizing our results, it is important to consider the differences in clinical characteristics between North American prostate cancer populations and the CAPS population. Citizens in North America have experienced a dramatic increase in widespread testing of PSA, and a considerable fraction of their diagnosed prostate cancer cases are clinically nonsignificant.

One possible limitation with CAPS is the low participation rate among the controls (60%) because it could introduce selection bias if the willingness to join the study is associated with the assessed genotypes. To address this issue, we compared baseline characteristics (e.g., age, body mass index, smoking status, and level of education) of participants who completed the questionnaire and donated blood with those of participants who only answered the questionnaire. Among both cases and controls, characteristics did not differ significantly between those who did and did not donate a blood specimen. Therefore, we find it unlikely that selection bias is an issue in our study.

In conclusion, this study provides strong support that genetic variation in hormone-related genes is important for prostate cancer development. Specifically, we identified a common haplotype at the AR locus, which seems to have a noteworthy effect on prostate cancer risk at a population level. Moreover, the joint effect of risk alleles in the AR, CYP17, and SRD5A2 genes can identify men with an increased risk of 1.5 to 2.0 to develop prostate cancer, a finding that warrants confirmation in other large prostate cancer studies.

Grant support: Swedish Cancer Society, Umeå University Hospital (Umeå, Sweden) Spear grant, and National Cancer Institute grant CA 105055 (Jianfeng Xu).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

We thank all study participants in the CAPS study; Ulrika Undén for skillfully coordinating the study center at Karolinska Institute; all urologists, including their patients, in the CAPS study and all urologists providing clinical data to the national registry of prostate cancer; Karin Andersson, Susan Lindh, Gabriella Thorén, and Margareta Åswärd (Regional Cancer Registries); CAPS steering committee, including Pär Stattin, Jan-Erik Johansson, and Eberhart Varenhorst; Sören Holmgren and the personnel at the Medical Biobank in Umeå for skillfully handling the blood samples; the personnel at Mutation Analysis Facility at Karolinska Institute; Cecilia Lindgren; and Prof. Jianfeng Xu and Dr. Siqun Zheng (Wake Forest University School of Medicine, Winston-Salem, NC) for their contribution in the second part of CAPS.

1
Edwards SM, Eeles RA. Unravelling the genetics of prostate cancer.
Am J Med Genet C Semin Med Genet
2004
;
129
:
65
–73.
2
Hsing AW, Reichardt JK, Stanczyk FZ. Hormones and prostate cancer: current perspectives and future directions.
Prostate
2002
;
52
:
213
–35.
3
Ntais C, Polycarpou A, Tsatsoulis A. Molecular epidemiology of prostate cancer: androgens and polymorphisms in androgen-related genes.
Eur J Endocrinol
2003
;
149
:
469
–77.
4
Linja MJ, Visakorpi T. Alterations of androgen receptor in prostate cancer.
J Steroid Biochem Mol Biol
2004
;
92
:
255
–64.
5
Lange EM, Chen H, Brierley K, et al. The polymorphic exon 1 androgen receptor CAG repeat in men with a potential inherited predisposition to prostate cancer.
Cancer Epidemiol Biomarkers Prev
2000
;
9
:
439
–42.
6
Stanford JL, Just JJ, Gibbs M, et al. Polymorphic repeats in the androgen receptor gene: molecular markers of prostate cancer risk.
Cancer Res
1997
;
57
:
1194
–8.
7
Lindström S, Zheng SL, Wiklund F, et al. Systematic replication study of reported genetic associations in prostate cancer: strong support for genetic variation in the androgen pathway. Prostate 2006 Sep 22 (Epub ahead of print).
8
Lindmark F, Zheng SL, Wiklund F, et al. H6D polymorphism in macrophage-inhibitory cytokine-1 gene associated with prostate cancer.
J Natl Cancer Inst
2004
;
96
:
1248
–54.
9
Gabriel SB, Schaffner SF, Nguyen H, et al. The structure of haplotype blocks in the human genome.
Science
2002
;
296
:
2225
–9.
10
Stram DO, Haiman CA, Hirschhorn JN, et al. Choosing haplotype-tagging SNPS based on unphased genotype data using a preliminary sample of unrelated subjects with an example from the Multiethnic Cohort Study.
Hum Hered
2003
;
55
:
27
–36.
11
Jurinke C, van den Boom D, Cantor CR, Köster H. Automated genotyping using the DNA MassArray technology.
Methods Mol Biol
2002
;
187
:
179
–92.
12
Schaid DJ, Rowland CM, Tines DE, Jacobson RM, Poland GA. Score tests for association between traits and haplotypes when linkage phase is ambiguous.
Am J Hum Genet
2002
;
70
:
425
–34.
13
Besag J, Clifford P. Sequential Monte Carlo p-values.
Biometrika
1991
;
78
:
301
–4.
14
Greenland S, Drescher K. Maximum likelihood estimation of the attributable fraction from logistic models.
Biometrics
1993
;
49
:
865
–72.
15
Hayes VM, Severi G, Eggleton SA, et al. The E211 G>A androgen receptor polymorphism is associated with a decreased risk of metastatic prostate cancer and androgenetic alopecia.
Cancer Epidemiol Biomarkers Prev
2005
;
14
:
993
–6.
16
Freedman ML, Pearce CL, Penney KL, et al. Systematic evaluation of genetic variation at the androgen receptor locus and risk of prostate cancer in a multiethnic cohort study.
Am J Hum Genet
2005
;
76
:
82
–90.
17
Zeegers MP, Jellema A, Ostrer H. Empiric risk of prostate carcinoma for relatives of patients with prostate carcinoma: a meta-analysis.
Cancer
2003
;
97
:
1894
–903.
18
Chamberlain NL, Driver ED, Miesfeld RL. The length and location of CAG trinucleotide repeats in the androgen receptor N-terminal domain affect transactivation function.
Nucleic Acids Res
1994
;
22
:
3181
–6.
19
Douglas JA, Zuhlke KA, Beebe-Dimmer J, et al. Identifying susceptibility genes for prostate cancer—a family-based association study of polymorphisms in CYP17, CYP19, CYP11A1, and LH-β.
Cancer Epidemiol Biomarkers Prev
2005
;
14
:
2035
–9.
20
Morgentaler A, Bruning CO, DeWolf WC. Occult prostate cancer in men with low serum testosterone levels.
JAMA
1996
;
276
:
1904
–6.
21
Hoffman MA, DeWolf WC, Morgentaler A. Is low serum free testosterone a marker for high grade prostate cancer?
J Urol
2000
;
163
:
824
–7.
22
Schatzl G, Madersbacher S, Haitel A, et al. Associations of serum testosterone with microvessel density, androgen receptor density, and androgen receptor gene polymorphism in prostate cancer.
J Urol
2003
;
169
:
1312
–5.
23
Ribeiro M, Ruff P, Falkson G. Low serum testosterone and a younger age predict for a poor outcome in metastatic prostate cancer.
Am J Clin Oncol
1997
;
20
:
605
–8.
24
Massengill JC, Sun L, Moul JW, et al. Pretreatment total testosterone level predicts pathological stage in patients with localized prostate cancer treated with radical prostatectomy.
J Urol
2003
;
169
:
1670
–5.
25
Schatzl G, Madersbacher S, Thurridl T, et al. High-grade prostate cancer is associated with low serum testosterone levels.
Prostate
2001
;
47
:
52
–8.
26
San Francisco IF, Regan MM, Dewolf WC, Olumi AF. Low age adjusted free testosterone levels correlate with poorly differentiated prostate cancer.
J Urol
2006
;
175
:
1341
–5 DISCUSSION 1345–6.
27
Stattin P, Johansson R, Damber JE, et al. Non-systematic screening for prostate cancer in Sweden—survey from the National Prostate Cancer Registry.
Scand J Urol Nephrol
2003
;
37
:
461
–5.