Abstract
Gleason grade is universally used for pathologic scoring of the differentiation of prostate cancer. However, it is unknown whether prostate tumors arise well differentiated and then progress to less differentiated forms or if Gleason grade is an early and largely unchanging feature. Prostate-specific antigen (PSA) screening has reduced the proportion of tumors diagnosed at advanced stage, which allows assessment of this question on a population level. If Gleason grade progresses as stage does, one would expect a similar reduction in high-grade tumors. We studied 1,207 Physicians' Health Study and Health Professionals Follow-up Study participants diagnosed with prostate cancer from 1982 to 2004 and treated with prostatectomy. We compared the distribution of grade and clinical stage across the pre-PSA and PSA screening eras. We re-reviewed grade using the ISUP 2005 revised criteria. The proportion of advanced stage tumors dropped more than six-fold, from the earliest period (12/1982–1/1993), 19.9% stage ≥T3, to the latest (5/2000–12/2004), 3% stage T3, none T4. The proportion of Gleason score ≥8 decreased substantially less, from 25.3% to 17.6%. A significant interaction between stage and diagnosis date predicting grade (P = 0.04) suggests that the relationship between grade and stage varies by time period. As the dramatic shift in stage since the introduction of PSA screening was accompanied by a more modest shift in Gleason grade, these findings suggest that grade may be established early in tumor pathogenesis. This has implications for the understanding of tumor progression and prognosis, and may help patients diagnosed with lower grade disease feel more comfortable choosing active surveillance. Cancer Res; 73(16); 5163–8. ©2013 AACR.
Introduction
For more than four decades, the Gleason score has been the most widely used grading system for prostate tumor differentiation and represents the best-established prognostic indicator for prostate cancer progression besides stage itself (1). Indeed, even within Gleason score 7, the predominance of pattern 4 (Gleason 4 + 3) carries more than a three-fold higher risk of prostate cancer mortality than Gleason pattern 3 + 4 (2). Pathologists and epidemiologists have long speculated on whether prostate tumors initially arise as well-differentiated tumors and then progress to higher grades over time, or whether grade is an early and largely unchanging feature of the tumor. This issue has important implications for understanding the basic biology of prostate cancer, and has clinical implications as well. If Gleason grade progresses little or not at all, patients who are diagnosed with lower grade disease may be more comfortable choosing active surveillance.
Several studies have attempted to answer this question by examining repeated biopsy specimens in individual patients (3–7). However, this question cannot readily be addressed through repeated biopsy because of the inherent heterogeneity of tumor nodules within the prostate. This heterogeneity is apparent given different Gleason grade findings from simultaneously collected needle cores within a single gland, and means that a later biopsy might well be targeting a different nodule or different part of the original nodule. Many of the biopsy studies are of low-grade patients (usually Gleason 6) undergoing active surveillance such that if cancer continues to be detected on subsequent biopsies it will continue to be either Gleason 6 or be upgraded, because cancers are rarely graded lower than Gleason 6. Once a higher grade tumor is detected, patients generally receive treatment and are therefore no longer followed up with additional biopsies. Thus, such studies are inherently more likely to show apparent Gleason progression due to these biases.
Although we cannot appropriately distinguish the 2 hypotheses regarding Gleason grade progression within individuals, the advent of PSA screening affords the opportunity to assess this issue on a population level. PSA screening provides a lead time of approximately 10 to 12 years (8–10). Thus, on the average, in repeatedly screened populations, tumors are diagnosed 10 to 12 years earlier in their natural history than in unscreened men; indeed, because of early detection, widespread screening has dramatically reduced the number and proportion of tumors with advanced stage at diagnosis and time of surgery (refs. 11–14, reviewed in ref. 15). If Gleason grade progresses, one would expect a similar reduction in high-grade tumors considering that tumors are now detected so much earlier in their natural history than before screening. One difficulty in such a comparison is that Gleason grading standards have shifted over time, so that scores below 6 are now rarely assigned, and assigning a higher grade has become more common, a trend described by Albertsen (16). Thus, standardized re-review of Gleason grade is essential for a valid assessment of possible change over calendar time. We therefore evaluated the change in distribution of re-reviewed Gleason scores in 2 prospective cohorts, with cases diagnosed 1982 to 2004, before and after introduction of widespread PSA screening, and compared this to the change in distribution of stage during that same interval.
Materials and Methods
Study population
Physicians' Health Study.
The Physicians' Health Study (PHS) began as a randomized, double-blind trial of aspirin and β-carotene in the prevention of cardiovascular disease and cancer among 22,071 healthy U.S. physicians; written consent was obtained from each participant at the time of initial enrollment and the investigation was approved by the Human Subjects Committee at Brigham and Women's Hospital. Men were excluded if they had any serious medical conditions at baseline including all cancers (except nonmelanoma skin cancer; ref. 17).
Participants are followed through annual questionnaires to collect data on diet, health, and lifestyle behaviors, and medical history, and biannually through postcards to ascertain health endpoints, including prostate cancer. All self-reported prostate cancer cases are verified through medical record and pathology review. There is a high follow-up rate for cancer incidence (96%). Through this systematic medical record review we also abstract data on clinical information, including clinical stage.
Health Professionals Follow-up Study.
The Health Professionals Follow-up Study (HPFS), an ongoing prospective cohort study on the causes of cancer and heart disease in men, consists of 51,529 U.S. health professionals who were aged 40 to 75 years in 1986 (2). At baseline and biennially thereafter, participants responded to a mailed questionnaire that included questions on demographics, lifestyle, and medical history. When a participant reports a prostate cancer diagnosis medical and pathology records are obtained. Study investigators review these records to confirm the diagnosis and to abstract clinical information, including stage at diagnosis. Follow-up rates for cancer incidence exceed 94%. The Human Subjects Committee at the Harvard School of Public Health approved this study.
Tumor repository and Gleason re-review
We collected archival formalin-fixed paraffin embedded radical prostatectomy and trans-urethral resection of the prostate (TURP) specimens from men with incident prostate cancer in the PHS and HPFS (4.47% TURP, 95.53% radical prostatectomy). Pathology departments were instructed to send either the entire TURP or radical prostatectomy material collected or a representative sample. When only some blocks were provided, our pathologist (M. Fiorentino) confirmed that at least the main cancer nodule was present. Using the ISUP 2005 revised criteria (18), a single pathologist (M. Fiorentino), blinded to clinical data (including outcome and date of diagnosis), re-reviewed all hematoxylin and eosin slides to assign major and minor Gleason grades, thereby avoiding the problem of the shift in Gleason scoring over the past several decades (16). Results describing the changes between original report and the re-review were previously reported, and we found the re-reviewed Gleason scores to significantly predict lethal outcome (2).
Statistical analysis
Analyses were conducted with SAS version 9.1 statistical software; all P values are two-sided. All individuals with a date of cancer diagnosis, clinical stage, and a re-reviewed Gleason score were included in this analysis. We combined data from PHS and HPFS and defined 4 time periods based on the quartiles of date at diagnosis (12/1982–1/1993, 1/1993–12/1996, 12/1996–5/2000, 5/2000–12/2004). These time periods span the pre-PSA and PSA eras. The proportions of 4 categories of Gleason score (≤6, 3 + 4, 4 + 3, ≥8) and 3 categories of clinical stage (T1/T2, T3, T4/N1/M1) were calculated for each of the 4 time periods. In a secondary analysis, we restricted to men diagnosed with clinical stage T1/T2 tumors and again examined the proportions of Gleason score across the same 4 time periods. A generalized linear model (GLM) was used to determine if the relationship between Gleason and stage depends on time period; re-reviewed Gleason grade was the outcome and clinical stage (3 categories), time periods, and an interaction variable between stage and time period were included in the model. The type III F-test was used to determine significance of this interaction. A GLM was conducted to determine the association of age at diagnosis with both Gleason score and time period separately; the type III F-test for trend was used to determine significance of the associations.
Because the proportion of Gleason score ≥8 tumors could be altered both by changes in the occurrence of those tumors or by changes in occurrence of lower grade tumors, as our main analysis, we estimated the incidence rate of re-reviewed tumors across the 4 time periods among the entire PHS and HPFS cohorts combined. This was conducted by counting the number of re-reviewed Gleason ≥8, Gleason 4 + 3, Gleason 3 + 4, and Gleason ≤6 tumors diagnosed, and calculating the total person-years (p-y) contributed by all participants in each time period. Person-time was counted from the start date of the time period to diagnosis of prostate cancer, death from any cause, or the end of the time period.
Results
Clinical characteristics of the PHS (N = 420) and HPFS (N = 787) participants are described in Table 1. Although screening information is not available from the PHS trial, participants in the overall HPFS were asked about their PSA testing during the previous 2 years beginning in 1994. In 1994, only 42% of participants had had a PSA test in the previous 2 years, but this percentage steadily increased over time to 81% in 2000. From 2002 to 2008, this percentage decreased slightly to 74% to 76% in each 2-year period.
. | PHS (N = 420) . | HPFS (N = 787) . |
---|---|---|
Years of diagnosis, range | 1982–2004 | 1986–2004 |
Age at diagnosis, mean (SD) | 66.1 (6.2) | 65.6 (6.2) |
Re-reviewed Gleason score (%) | ||
2–6 | 125 (29.8) | 142 (18.0) |
7 (3 + 4) | 151 (36.0) | 295 (37.5) |
7 (4 + 3) | 77 (18.3) | 189 (24.0) |
8–10 | 67 (16.0) | 161 (20.4) |
Clinical stage (%) | ||
T1/T2 | 392 (93.3) | 713 (90.6) |
T3 | 25 (6.0) | 48 (6.1) |
T4/N1/M1 | 3 (0.1) | 26 (3.3) |
. | PHS (N = 420) . | HPFS (N = 787) . |
---|---|---|
Years of diagnosis, range | 1982–2004 | 1986–2004 |
Age at diagnosis, mean (SD) | 66.1 (6.2) | 65.6 (6.2) |
Re-reviewed Gleason score (%) | ||
2–6 | 125 (29.8) | 142 (18.0) |
7 (3 + 4) | 151 (36.0) | 295 (37.5) |
7 (4 + 3) | 77 (18.3) | 189 (24.0) |
8–10 | 67 (16.0) | 161 (20.4) |
Clinical stage (%) | ||
T1/T2 | 392 (93.3) | 713 (90.6) |
T3 | 25 (6.0) | 48 (6.1) |
T4/N1/M1 | 3 (0.1) | 26 (3.3) |
The distribution of clinical stage and Gleason score across time periods are presented in Fig. 1. In the earliest period (12/1982–1/1993, pre-PSA), 19.9% were stage T3 or higher. This proportion decreased steadily until the last period (5/2000–12/2004), with only 3% of tumors stage T3, and none stage T4 or M1. In contrast to the dramatic 6-fold shift in stage at diagnosis, the proportion of Gleason score ≥8 decreased only moderately, from 25.3% in the first period to 17.6% to the most recent period, a shift of only 30%. When restricting to men with stage T1/T2, the proportion of Gleason score 8 to 10 decreased even less across time periods, from 19.5% to 16%.
Using a GLM, we observed a significant interaction between stage and ordinal categories of date at diagnosis to predict Gleason score (P = 0.04), showing that the relationship between Gleason score and stage is not uniform across time periods. Age at diagnosis is associated with Gleason score, with older men tending to have higher Gleason score (P < 0.0001); however, when examined by category of date of diagnosis, the association is only significant in the 3 PSA era time periods, becoming more significant over time (Table 2). Age at diagnosis is additionally not associated with categories of date of diagnosis (P = 0.10). This suggests that the change over time we observe for Gleason score is not due to a change in age at diagnosis and may represent an increase in screening of younger men detecting more indolent, lower grade tumors.
. | Mean age at diagnosis . | . | |||
---|---|---|---|---|---|
Time period . | Gleason 2–6 . | Gleason 3 + 4 . | Gleason 4 + 3 . | Gleason 8–10 . | Ptrend . |
12/82-1/93 | 66.8 | 66.5 | 66.8 | 67 | 0.70 |
1/93-12/96 | 64.6 | 64.4 | 66.7 | 66.3 | 0.02 |
12/96-5/00 | 63.4 | 65.2 | 65.5 | 67.3 | 0.003 |
5/00-12/04 | 64.7 | 65.4 | 66.7 | 68.1 | 0.0003 |
. | Mean age at diagnosis . | . | |||
---|---|---|---|---|---|
Time period . | Gleason 2–6 . | Gleason 3 + 4 . | Gleason 4 + 3 . | Gleason 8–10 . | Ptrend . |
12/82-1/93 | 66.8 | 66.5 | 66.8 | 67 | 0.70 |
1/93-12/96 | 64.6 | 64.4 | 66.7 | 66.3 | 0.02 |
12/96-5/00 | 63.4 | 65.2 | 65.5 | 67.3 | 0.003 |
5/00-12/04 | 64.7 | 65.4 | 66.7 | 68.1 | 0.0003 |
Widespread PSA screening not only advanced the time of diagnosis of prostate cancer, but also has increased the prevalence of detected indolent tumors that would otherwise never have been diagnosed. Thus, prostate cancers diagnosed in the PSA era are enriched with early stage and low grade. For this reason, the change in incidence of Gleason ≥8 tumors provides a stronger test of the hypothesis for Gleason progression over time than the proportion of ≥8 tumors among all tumors, which is also affected by the incidence of low-grade cases diagnosed. Although the incidence of re-reviewed Gleason ≤6 prostate cancer increased from 0.9 per 10,000 p-y in the first time period to 3.2 per 10,000 p-y in the last time period, and for Gleason 3 + 4 from 1.6 per 10,000 p-y to 3.9 per 10,000 p-y, the incidence rates of re-reviewed Gleason 4 + 3 and ≥8 cases showed no parallel change. Gleason 4 + 3 increased in the middle time periods, the rate came back down to baseline in the last time period (rates were 1.4, 2.3, 3.3, 1.8 per 10,000 p-y across the 4 time periods). For Gleason ≥8, rates were 1.3, 2.0, 2.0, and 1.9 per 10,000 p-y across the 4 time periods, respectively (Fig. 2). In contrast, the incidence rate of late-stage disease decreased markedly over time, with no similar decrease in early-stage disease; this result was similar for all prostate cancer cases in PHS/HPFS and in the subset with re-reviewed Gleason score (Supplementary Fig. S1).
Discussion
Our data confirm the dramatic shift toward lower stage disease in prostate cancer cases diagnosed in the pre-PSA era to more recent years. In contrast, the decrease in the proportion of high grade (Gleason 8–10) cancers across this time period is considerably less dramatic than the shift in clinical stage, and we observed no significant reduction in the incidence of high-grade cancers over time. The moderate decrease in the proportion of high-grade cases may be due to the increased diagnosis of low-volume, nonprogressing disease—cases that would otherwise never have been detected if not for PSA screening. These additional cases are more likely to be indolent and therefore low-grade disease. Thus, the analysis of absolute incidence of Gleason 8 to 10 over time provides a better assessment of change, and showed little evidence for a decrease. These results are supported by an analysis utilizing Surveillance, Epidemiology and End Results Program data. These data show that with the introduction of PSA screening and throughout the 1990s, the incidence rate of well and poorly differentiated cancers increased slightly, but most of the increase in the rate was due to moderately differentiated tumors (19).
Although we cannot rule out the possibility that Gleason grade progresses within an individual, we conclude that it is not a major feature of prostate cancer. Our results are additionally supported by Falzarano and colleagues (15) who found that upon Gleason score re-review, there was no difference in Gleason distribution between the early PSA and late PSA eras (15). This topic was explored in a slightly different way by Pashayan and colleagues (20), comparing prostate cancer detected in a PSA screened population to cancers diagnosed clinically during the same time period (2002–2005). The authors observed the proportion of advanced stage decreased with PSA screening, but this seemed to be mainly in cases with Gleason score <7, again confirming the strong association between grade and stage as well as the potential for overdiagnosis. Studies have also attempted to estimate the proportion of tumors where Gleason might progress using statistical modeling (21, 22), finding that dedifferentiation is a likely and common event. However, numerous assumptions are required in these statistical models, and the observed data are also compatible with little or no progression in Gleason grade. Sowalsky and colleagues (23) and Kovtun and colleagues (24) report molecular data suggesting the potential for Gleason progression, but because of the cross-sectional nature of the studies, such conclusions are not definitive.
If the Gleason grade of a tumor typically does not progress, this information can affect clinical management of prostate cancer diagnosed with a low Gleason score. These findings do not answer the question of whether other tumor foci with higher Gleason scores were missed by the random biopsy sampling. Nonetheless, the knowledge that Gleason score largely does not progress may make the choice of active surveillance more appealing for patients with low-grade disease. If we suppose that a Gleason score 3 + 3 will remain 3 + 3 for the entire course of the disease, active surveillance could be considered a definitive treatment for selected patients with ≤10 years life expectancy and could significantly delay (potentially forever) the treatment of selected patients with >10 years life expectancy. This option would prevent side effects from radiation or surgery for patients who do not need these more aggressive treatments. This will become more important with the development of improved imaging and molecular tools to assess the entire prostate at diagnosis.
Our findings have implications for understanding the biology of prostate cancer as well. Gleason score is a strong predictor of prostate cancer-specific death, and seems set early in the disease process; this suggests that later influences, such as diet, lifestyle, or environmental factors, might be important to trigger disease progression among men with low grade disease. Alternatively, earlier influences, such as genetics, may drive the development of a subtype of cancer that is more aggressive in a way that is not related to differentiation status. Lavery and Droller (25) posit that Gleason patterns 3 and 4 represent separate cancer diatheses, and the faster proliferation of pattern 4 compared to 3 may explain the apparent evolution of pattern 3 to 4. Different risk factors (genetics or environmental) may lead separately to the development of Gleason grade 3 or grade 4 disease.
There are limitations of this study to consider. We restricted the analysis to patients with radical prostatectomy so that we were able to acquire the tissue to re-review the Gleason score. This is a crucial aspect of the study because we avoid the issue with changes in grading criteria over time; however, this does assume that the indications for prostatectomy are constant over time, which may be questioned. Clinicians and surgeons may have changed their practices regarding which patients based on Gleason score are eligible for radical prostatectomy. In addition, because we do not deal with biopsy tissue sampling variability, the clinical implications are currently restricted. However, if a similar trend is observed with biopsy specimens this should confirm our results.
A major strength of this study is its large size and our ability to compare time periods spanning the pre-PSA and PSA era. This allows for the observation at the population level that the shift in Gleason scores over time is much less dramatic than the shift in stage. Although we cannot exclude the potential for progression of Gleason score in some individual cases, we therefore conclude that such progression is not a major feature of prostate cancer.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Authors' Contributions
Conception and design: K.L. Penney, M.J. Stampfer, J.A. Sinnott, E. Giovannucci, M. Fiorentino
Development of methodology: K.L. Penney, M.J. Stampfer
Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): S. Finn, E. Giovannucci, H.D. Sesso, L.A. Mucci
Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): K.L. Penney, H.D. Sesso, L.A. Mucci, M. Fiorentino
Writing, review, and/or revision of the manuscript: K.L. Penney, M.J. Stampfer, J.L. Jahn, J.A. Sinnott, J.R. Rider, S. Finn, E. Giovannucci, H.D. Sesso, M. Loda, L.A. Mucci, M. Fiorentino
Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): J.L. Jahn
Study supervision: K.L. Penney
Acknowledgments
The authors are grateful to the participants and staff of the Health Professionals Follow-up Study for their valuable contributions. In addition the authors like to thank the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, WY.
Grant Support
This work was supported by the Dana-Farber/Harvard Cancer Center Prostate SPORE (5P50CA090381-08) and the National Cancer Institute (5R01CA141298). The PHS was supported by grants CA34944, CA40360, and CA097193 from the National Cancer Institute and HL26490 and HL34595 from the National Heart, Lung, and Blood Institute. The HPFS was supported by CA55075. K.L. Penney was supported by a National Research Service Award (T32 CA009001-32); J.A. Sinnott is supported by T32 CA009001-32. K.L. Penney and L.A. Mucci are supported by the Prostate Cancer Foundation.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.