Abstract
A multimodal artificial intelligence (MMAI) biomarker was developed using clinical trial data from North American men with localized prostate cancer treated with definitive radiation, using biopsy digital pathology images and key clinical information (age, PSA, and T-stage) to generate prognostic scores. This study externally validates the biomarker in a prospective, real-world dataset of men who underwent radical prostatectomy (RP) for localized prostate cancer at a tertiary referral center in Sweden.
Association between the MMAI scores (continuous and categorical) and endpoints of interest was assessed with Fine–Gray and cumulative incidence analyses for biochemical recurrence (BCR) and logistic regression for adverse pathology (AP) at RP.
The analysis included 143 patients with evaluable biopsy pathology images and complete clinical data to generate MMAI scores. The median follow-up was 8.8 years. At diagnosis, the median PSA was 7.5 ng/mL, the median age was 64 years, 29% had a Gleason grade group ≥3, and 88 men were evaluable for AP at RP. MMAI was significantly associated with BCR [subdistribution HR, 2.45; 95% confidence interval (CI), 1.77–3.38; P < 0.001] and AP at RP (OR, 4.85; 95% CI, 2.54–10.78; P < 0.001). Estimated 5-year BCR rates for MMAI intermediate to high versus low were 25% (95% CI, 16%–36%) versus 4% (95% CI, 1%–11%), respectively.
The MMAI biomarker, previously shown to be prognostic for distant metastasis and prostate cancer–specific mortality in men receiving definitive radiation, was prognostic for post-RP endpoints: BCR and AP. This biomarker validation study further supports the use of MMAI biomarkers in men with prostate cancer outside North America and those treated with RP.
Translational Relevance
In this study, we have demonstrated that the multimodal artificial intelligence biomarker, previously shown to be prognostic for distant metastasis and prostate cancer–specific mortality in men receiving definitive radiation, is also prognostic for post–radical prostatectomy endpoints—biochemical recurrence and adverse pathology at radical prostatectomy—when utilizing prostate biopsies. The multimodal artificial intelligence biomarker test can be easily applied to digitized images of prostate biopsies, generating important prognostic value as a supporting tool for clinicians to personalize treatment decisions for patients eligible for treatment with curative intent.
Introduction
Prostate cancer is the most common solid malignancy in men and a leading cause of cancer mortality globally (1). A challenge in the management of localized prostate cancer is its heterogeneity both in outcomes and treatments; localized prostate cancer can range in aggressiveness from a low-risk, indolent disease which can be managed with surveillance, to a higher-risk disease which may be managed with more aggressive definitive therapies such as radical prostatectomy (RP) or radiotherapy with androgen deprivation (2). Currently, treatment decisions are based on several factors including age, performance status, previous abdominal surgery, inflammatory bowel disease, and patient preferences, but there is a need to better predict the risk of recurrent disease to improve selection of patients for surgery or radiotherapy. To minimize the risk of over- and undertreatment of prostate cancer, accurate prognostic tools are needed to risk-stratify patients across treatment options.
Advances in artificial intelligence and digital pathology have enabled the development of novel prognostic tools in prostate cancer that can more accurately risk-stratify men with prostate cancer compared with standard clinical variable–based risk models (3, 4). For example, the ArteraAI Prostate Test is a multimodal artificial intelligence (MMAI) deep learning model that incorporates digital histopathology and clinical data and accurately prognosticates across endpoints such as distant metastases and prostate cancer–specific mortality (PCSM). It is recommended in the National Comprehensive Cancer Network (NCCN) Guidelines as an advanced risk stratification tool for localized prostate cancer using prostate biopsy specimens (2) and can be used in addition to NCCN risk, unique patient characteristics, and patient preference to further inform clinical decision-making.
The MMAI model was initially developed and validated using NRG Oncology/Radiation Therapy Oncology Group (RTOG) phase III randomized controlled trial datasets, consisting primarily of North American men treated with definitive radiotherapy regimens. To further characterize the generalizability of this novel technology, this study aimed to evaluate the prognostic validity of the test for the first time on prostate biopsy samples from a large, real-world, prospectively collected dataset of non–North American men with localized prostate cancer who underwent definitive treatment with RP and had RP-specific endpoints such as biochemical recurrence (BCR) and adverse pathology (AP) at RP.
Materials and Methods
Patient cohort
This study involved retrospective analysis of prospectively collected data from men enrolled in Urology Prostate Cancer, a large, community-based biomarker study of men who were referred for biopsy and diagnosed with prostate cancer between 2004 and 2010 at Skåne University Hospital in Malmö, Sweden (5). Patients gave written informed consent and were consecutively enrolled in the cohort. The study was conducted in accordance with recognized ethical guidelines and Declaration of Helsinki and approved by the Lund University Ethical Committee (2016/1030 and 2018/937). Representative hematoxylin and eosin–stained diagnostic biopsy slides from these men were scanned on an Aperio CS2 slide scanner at 40× magnification in 2021. Men who subsequently underwent RP were included in this analysis. Digitized biopsy images were combined with clinical information to generate MMAI scores from the locked MMAI model.
MMAI model
This analysis used the ArteraAI MMAI Prostate Test (version 1.2; ref. 6), which includes a pre-established, locked algorithm developed using data from North American patients with nonmetastatic prostate cancer and has demonstrated prognostic risk stratification for distant metastases and PCSM. The algorithm utilizes digital histopathology images as well as complete clinical data (age, PSA, and tumor stage) to generate a continuous risk score ranging from 0 to 1. Scores were categorized into low-risk, intermediate-risk, and high-risk groups based on pre-established cut points correlating with a 3% and 10% estimated 10-year risk of distant metastases (7).
Objective and endpoints
The primary objective of this study was to validate the previously locked MMAI model developed on biopsy specimens in an external prospectively collected dataset of European men with prostate cancer who subsequently received RP. The primary endpoint was time to BCR, defined as two successive PSA measurements ≥0.2 ng/mL after RP. The secondary objective was to evaluate the biopsy-based model on the surgical endpoint of AP at RP, defined as Gleason grade group (GG) 3 or higher, pT3b or higher, and/or node-positive disease. Men eligible for upstaging (i.e., diagnosed as Gleason GG 2 or lower, cT3 or lower, and node-negative disease) were evaluable for the AP at the RP endpoint. Death without events was considered as a competing risk for the BCR endpoint.
Statistical analyses
Descriptive statistics of the validation cohort are reported by medians and IQR or frequencies and proportions, as appropriate. Univariable and multivariable Fine–Gray models and logistic regression models were constructed to evaluate the association of the MMAI score with BCR and AP at RP, respectively. Multivariable analyses (MVA) were adjusted for clinical risk nomograms. The MMAI score was evaluated both continuously and categorically; subdistribution HRs (sHR) and ORs and 95% confidence intervals (CI) were reported per SD increase in score or for intermediate to high versus low risk, as appropriate. Estimated event rates at 5 years were estimated by the cumulative incidence function and compared using the Gray test for the BCR endpoint. The distribution of MMAI risk across NCCN (2) and Cancer of the Prostate Risk Assessment (CAPRA; ref. 8) risk groups were compared using the Fisher exact test. Subgroup analyses were performed in which analyses were repeated within NCCN risk groups (NCCN low to favorable intermediate versus NCCN unfavorable intermediate to high).
Statistical analyses were performed using R version 4.2+ (R Foundation for Statistical Computing). All statistical tests were two-sided and considered statistically significant at the 0.05 level. No adjustment for multiple testing was applied. Patients with missing clinical or outcome information were omitted from any relevant analyses, assuming the data were missing completely at random.
Data availability
Restrictions apply to the availability of these data, which were used under license for this study. Data are available from the authors upon reasonable request with the permission of the regional health care provider, Region Skåne.
Results
Of the patients enrolled in the Urology Prostate Cancer cohort, 293 were diagnosed with prostate cancer on biopsy, of which 235 subsequently underwent RP. Clinical data were available for 230 patients (Supplementary Table S1), but only 154 had digital histopathology images from the diagnostic prostate biopsy available, of which 143 had evaluable images and complete clinical data to generate MMAI scores (Fig. 1).
Table 1 summarizes the demographic and clinical characteristics of the analysis cohort. The median follow-up of censored patients was 8.8 years (IQR, 6.8–10.2). The median age at biopsy was 64 years (IQR, 59–67), and the median PSA was 7.5 ng/mL (IQR, 5.7–12.9) at diagnosis. The Gleason GG at biopsy ranged from 1 to 5 with the majority of patients having 1 (39.9%) or 2 (30.8%). There was cT3 disease in 7.7% of patients and 28.6% were NCCN high risk. On final pathology, 11.2% had Gleason GG 4 to 5, 46.1% had pT3a-b, 35.7% had a positive surgical margin, 40.8% had extraprostatic extension, 11.9% had seminal vesicle invasion, and 9.6% had lymph node invasion. A subset of 88 patients was eligible for upstaging and evaluable for AP at RP.
Demographic and clinical characteristics of the analysis cohort.
Variable . | Overall (biopsy images available) . | Analysis cohort . | Excluded . |
---|---|---|---|
N = 154a . | N = 143a . | N = 11a . | |
Age at biopsy, years | 64 (60, 67) | 64 (59, 67) | 62 (60, 68) |
PSA at diagnosis, ng/mL | 7.4 (5.6, 12.7) | 7.5 (5.7, 13.0) | 5.5 (5.2, 5.6) |
Missing | 8 | 0 | 8 |
Percent positive biopsy cores | 0.38 (0.25, 0.50) | 0.38 (0.25, 0.50) | 0.50 (0.25, 0.70) |
Clinical Gleason score | |||
6 | 59 (38.3%) | 57 (39.9%) | 2 (18.2%) |
7 (3 + 4) | 50 (32.5%) | 44 (30.8%) | 6 (54.5%) |
7 (4 + 3) | 25 (16.2%) | 23 (16.1%) | 2 (18.2%) |
8 | 13 (8.4%) | 13 (9.1%) | 0 (0.0%) |
9 | 7 (4.5%) | 6 (4.2%) | 1 (9.1%) |
Clinical T stage | |||
T1c | 77 (50.0%) | 74 (51.7%) | 3 (27.3%) |
T2 | 62 (40.3%) | 57 (39.9%) | 5 (45.5%) |
T2c | 1 (0.6%) | 1 (0.7%) | 0 (0.0%) |
T3 | 13 (8.4%) | 11 (7.7%) | 2 (18.2%) |
Tx | 1 (0.6%) | 0 (0.0%) | 1 (9.1%) |
NCCN | |||
Low | 25 (18.1%) | 25 (18.8%) | 0 (0.0%) |
Favorable intermediate | 29 (21.0%) | 29 (21.8%) | 0 (00.0%) |
Unfavorable intermediate | 43 (31.2%) | 41 (30.8%) | 2 (40.0%) |
High | 41 (29.7%) | 38 (28.6%) | 3 (60.0%) |
Missing | 16 | 10 | 6 |
CAPRA | 4 (2, 5) | 4 (2, 5) | 4 (3, 5) |
Low | 40 (27.4%) | 40 (28.0%) | 0 (0.0%) |
Intermediate | 75 (51.4%) | 72 (50.3%) | 3 (100.0%) |
High | 31 (21.2%) | 31 (21.7%) | 0 (0.0%) |
Missing | 8 | 0 | 8 |
Pathologic Gleason score | |||
≤6 | 29 (18.8%) | 29 (20.3%) | 0 (0.0%) |
7 (3 + 4) | 64 (41.6%) | 59 (41.3%) | 5 (45.5%) |
7 (4 + 3) | 44 (28.6%) | 39 (27.3%) | 5 (45.5%) |
8 | 3 (1.9%) | 3 (2.1%) | 0 (0.0%) |
9 | 14 (9.1%) | 13 (9.1%) | 1 (9.1%) |
Pathological T stage | |||
T2a | 22 (14.5%) | 22 (15.6%) | 0 (0.0%) |
T2b | 3 (2.0%) | 3 (2.1%) | 0 (0.0%) |
T2c | 55 (36.2%) | 51 (36.2%) | 4 (36.4%) |
T3a | 52 (34.2%) | 48 (34.0%) | 4 (36.4%) |
T3b | 20 (13.2%) | 17 (12.1%) | 3 (27.3%) |
Missing | 2 | 2 | 0 |
Surgical margins | 57 (37.0%) | 51 (35.7%) | 6 (54.5%) |
Extraprostatic extension | 64 (41.8%) | 58 (40.8%) | 6 (54.5%) |
Missing | 1 | 1 | 0 |
Seminal vesicle invasion | 20 (13.0%) | 17 (11.9%) | 3 (27.3%) |
Lymph node invasion | 8 (10.0%) | 7 (9.6%) | 1 (14.3%) |
Missing | 74 | 70 | 4 |
CAPRA-S | 3 (2, 5) | 3 (2, 5) | 2 (1, 4) |
Low | 58 (39.7%) | 56 (39.2%) | 2 (66.7%) |
Intermediate | 54 (37.0%) | 53 (37.1%) | 1 (33.3%) |
High | 34 (23.3%) | 34 (23.8%) | 0 (0.0%) |
Missing | 8 | 0 | 8 |
Follow-up (censored patients), years | 9.1 (6.9, 10.2) | 8.8 (6.8, 10.2) | 10.1 (9.5, 10.5) |
Variable . | Overall (biopsy images available) . | Analysis cohort . | Excluded . |
---|---|---|---|
N = 154a . | N = 143a . | N = 11a . | |
Age at biopsy, years | 64 (60, 67) | 64 (59, 67) | 62 (60, 68) |
PSA at diagnosis, ng/mL | 7.4 (5.6, 12.7) | 7.5 (5.7, 13.0) | 5.5 (5.2, 5.6) |
Missing | 8 | 0 | 8 |
Percent positive biopsy cores | 0.38 (0.25, 0.50) | 0.38 (0.25, 0.50) | 0.50 (0.25, 0.70) |
Clinical Gleason score | |||
6 | 59 (38.3%) | 57 (39.9%) | 2 (18.2%) |
7 (3 + 4) | 50 (32.5%) | 44 (30.8%) | 6 (54.5%) |
7 (4 + 3) | 25 (16.2%) | 23 (16.1%) | 2 (18.2%) |
8 | 13 (8.4%) | 13 (9.1%) | 0 (0.0%) |
9 | 7 (4.5%) | 6 (4.2%) | 1 (9.1%) |
Clinical T stage | |||
T1c | 77 (50.0%) | 74 (51.7%) | 3 (27.3%) |
T2 | 62 (40.3%) | 57 (39.9%) | 5 (45.5%) |
T2c | 1 (0.6%) | 1 (0.7%) | 0 (0.0%) |
T3 | 13 (8.4%) | 11 (7.7%) | 2 (18.2%) |
Tx | 1 (0.6%) | 0 (0.0%) | 1 (9.1%) |
NCCN | |||
Low | 25 (18.1%) | 25 (18.8%) | 0 (0.0%) |
Favorable intermediate | 29 (21.0%) | 29 (21.8%) | 0 (00.0%) |
Unfavorable intermediate | 43 (31.2%) | 41 (30.8%) | 2 (40.0%) |
High | 41 (29.7%) | 38 (28.6%) | 3 (60.0%) |
Missing | 16 | 10 | 6 |
CAPRA | 4 (2, 5) | 4 (2, 5) | 4 (3, 5) |
Low | 40 (27.4%) | 40 (28.0%) | 0 (0.0%) |
Intermediate | 75 (51.4%) | 72 (50.3%) | 3 (100.0%) |
High | 31 (21.2%) | 31 (21.7%) | 0 (0.0%) |
Missing | 8 | 0 | 8 |
Pathologic Gleason score | |||
≤6 | 29 (18.8%) | 29 (20.3%) | 0 (0.0%) |
7 (3 + 4) | 64 (41.6%) | 59 (41.3%) | 5 (45.5%) |
7 (4 + 3) | 44 (28.6%) | 39 (27.3%) | 5 (45.5%) |
8 | 3 (1.9%) | 3 (2.1%) | 0 (0.0%) |
9 | 14 (9.1%) | 13 (9.1%) | 1 (9.1%) |
Pathological T stage | |||
T2a | 22 (14.5%) | 22 (15.6%) | 0 (0.0%) |
T2b | 3 (2.0%) | 3 (2.1%) | 0 (0.0%) |
T2c | 55 (36.2%) | 51 (36.2%) | 4 (36.4%) |
T3a | 52 (34.2%) | 48 (34.0%) | 4 (36.4%) |
T3b | 20 (13.2%) | 17 (12.1%) | 3 (27.3%) |
Missing | 2 | 2 | 0 |
Surgical margins | 57 (37.0%) | 51 (35.7%) | 6 (54.5%) |
Extraprostatic extension | 64 (41.8%) | 58 (40.8%) | 6 (54.5%) |
Missing | 1 | 1 | 0 |
Seminal vesicle invasion | 20 (13.0%) | 17 (11.9%) | 3 (27.3%) |
Lymph node invasion | 8 (10.0%) | 7 (9.6%) | 1 (14.3%) |
Missing | 74 | 70 | 4 |
CAPRA-S | 3 (2, 5) | 3 (2, 5) | 2 (1, 4) |
Low | 58 (39.7%) | 56 (39.2%) | 2 (66.7%) |
Intermediate | 54 (37.0%) | 53 (37.1%) | 1 (33.3%) |
High | 34 (23.3%) | 34 (23.8%) | 0 (0.0%) |
Missing | 8 | 0 | 8 |
Follow-up (censored patients), years | 9.1 (6.9, 10.2) | 8.8 (6.8, 10.2) | 10.1 (9.5, 10.5) |
Abbreviations: CAPRA-S, Cancer of the Prostate Risk Assessment (Surgical).
Median (IQR) or frequency (%). Proportions may not add up to 100% because of rounding.
On univariable analysis (UVA), the continuous MMAI score was significantly associated with BCR (sHR, 2.45; 95% CI, 1.77–3.38; P < 0.001; Table 2). MMAI categorical risk groups were also significantly associated with BCR (sHR, 5.39; 95% CI, 1.91–15.23; P = 0.002; Table 2). On MVA, the continuous MMAI score remained independently prognostic when adjusted for NCCN (sHR, 2.13; 95% CI, 1.44–3.14; P < 0.001) and CAPRA (sHR, 1.99; 95% CI, 1.29–3.07; P = 0.002; Table 3). Consistent results were observed for categorical MMAI risk groups (sHR, 3.45; 95% CI, 1.20–9.86; P = 0.02 for NCCN and sHR, 3.48; 95% CI, 1.24–9.75; P = 0.02 for CAPRA; Table 3). Estimated 5-year BCR rates for MMAI intermediate to high versus low were 25.2% (95% CI, 16.0%–35.5%) versus 3.6% (95% CI, 0.6%–11%), respectively (Fig. 2; P < 0.001).
UVA of the MMAI score and standard clinical variables for BCR and AP at RP.
Endpoint . | Variable . | Level . | Effect sizea (95% CI) . | P value . | Events/Nb . |
---|---|---|---|---|---|
BCR | MMAI | Continuousc | 2.45 (1.77–3.38) | <0.001 | 28/140 |
Intermediate to high vs. lowd | 5.39 (1.91–15.23) | 0.002 | 28 (24)/140 (81) | ||
Age | Continuous | 0.97 (0.92–1.02) | 0.29 | 28/140 | |
Log2 PSA | Continuous | 1.98 (1.44–2.70) | <0.001 | 28/140 | |
Grade group | GG 4 to 5 vs. GG 1 to 2 | 2.55 (1.08–6.01) | 0.03 | 28 (7)/140 (19) | |
cT stage | T2 to T2c vs. T1c | 3.73 (1.36–10.23) | 0.01 | 28 (14)/140 (58) | |
T3 vs. T1c | 21.36 (7.41–61.55) | <0.001 | 28 (9)/140 (11) | ||
NCCN | FI vs. low | 2.56 (0.29–22.90) | 0.40 | 28 (3)/130 (29) | |
UFI vs. low | 6.44 (0.90–46.20) | 0.06 | 28 (9)/130 (41) | ||
High vs. low | 13.47 (1.93–93.71) | 0.009 | 28 (15)/130 (38) | ||
CAPRA | Continuous | 1.52 (1.30–1.79) | <0.001 | 28/140 | |
Intermediate vs. low | 11.15 (1.58–78.41) | 0.01 | 28 (17)/140 (72) | ||
High vs. low | 17.47 (2.37–128.57) | 0.005 | 28 (10)/140 (31) | ||
AP | MMAI | Continuousb | 4.85 (2.54–10.78) | <0.001 | 21/88 |
Intermediate to high vs. lowd | 25.86 (6.64–172.95) | <0.001 | 21 (19)/88 (37) | ||
Age | Continuous | 1.05 (0.96–1.15) | 0.28 | 21/88 | |
Log2 PSA | Continuous | 2.59 (1.40–5.17) | 0.004 | 21/88 | |
Grade group | GG 2 vs. GG 1 | 1.36 (0.51–3.68) | 0.54 | 21 (11)/88 (41) | |
cT stage | T2 to T2c vs. T1c | 6.32 (2.22–20.04) | <0.001 | 21 (15)/88 (34) | |
NCCN | FI vs. low | 3.87 (0.82–28.17) | 0.12 | 20 (7)/78 (26) | |
UFI vs. low | 3.71 (0.74–27.58) | 0.14 | 20 (6)/78 (23) | ||
High vs. low | 52.50 (5.41–1358.84) | 0.003 | 20 (5)/78 (6) | ||
CAPRA | Continuous | 1.66 (1.12–2.58) | 0.02 | 21/88 | |
Intermediate vs. low | 2.93 (1.02–9.78) | 0.06 | 21 (16)/88 (51) |
Endpoint . | Variable . | Level . | Effect sizea (95% CI) . | P value . | Events/Nb . |
---|---|---|---|---|---|
BCR | MMAI | Continuousc | 2.45 (1.77–3.38) | <0.001 | 28/140 |
Intermediate to high vs. lowd | 5.39 (1.91–15.23) | 0.002 | 28 (24)/140 (81) | ||
Age | Continuous | 0.97 (0.92–1.02) | 0.29 | 28/140 | |
Log2 PSA | Continuous | 1.98 (1.44–2.70) | <0.001 | 28/140 | |
Grade group | GG 4 to 5 vs. GG 1 to 2 | 2.55 (1.08–6.01) | 0.03 | 28 (7)/140 (19) | |
cT stage | T2 to T2c vs. T1c | 3.73 (1.36–10.23) | 0.01 | 28 (14)/140 (58) | |
T3 vs. T1c | 21.36 (7.41–61.55) | <0.001 | 28 (9)/140 (11) | ||
NCCN | FI vs. low | 2.56 (0.29–22.90) | 0.40 | 28 (3)/130 (29) | |
UFI vs. low | 6.44 (0.90–46.20) | 0.06 | 28 (9)/130 (41) | ||
High vs. low | 13.47 (1.93–93.71) | 0.009 | 28 (15)/130 (38) | ||
CAPRA | Continuous | 1.52 (1.30–1.79) | <0.001 | 28/140 | |
Intermediate vs. low | 11.15 (1.58–78.41) | 0.01 | 28 (17)/140 (72) | ||
High vs. low | 17.47 (2.37–128.57) | 0.005 | 28 (10)/140 (31) | ||
AP | MMAI | Continuousb | 4.85 (2.54–10.78) | <0.001 | 21/88 |
Intermediate to high vs. lowd | 25.86 (6.64–172.95) | <0.001 | 21 (19)/88 (37) | ||
Age | Continuous | 1.05 (0.96–1.15) | 0.28 | 21/88 | |
Log2 PSA | Continuous | 2.59 (1.40–5.17) | 0.004 | 21/88 | |
Grade group | GG 2 vs. GG 1 | 1.36 (0.51–3.68) | 0.54 | 21 (11)/88 (41) | |
cT stage | T2 to T2c vs. T1c | 6.32 (2.22–20.04) | <0.001 | 21 (15)/88 (34) | |
NCCN | FI vs. low | 3.87 (0.82–28.17) | 0.12 | 20 (7)/78 (26) | |
UFI vs. low | 3.71 (0.74–27.58) | 0.14 | 20 (6)/78 (23) | ||
High vs. low | 52.50 (5.41–1358.84) | 0.003 | 20 (5)/78 (6) | ||
CAPRA | Continuous | 1.66 (1.12–2.58) | 0.02 | 21/88 | |
Intermediate vs. low | 2.93 (1.02–9.78) | 0.06 | 21 (16)/88 (51) |
Note: Boldface indicates P ≤ 0.05.
Abbreviations: FI, favorable intermediate; UFI, unfavorable intermediate.
Effect size refers to sHR and OR for BCR and AP endpoints, respectively.
Patients in parentheses represent events and number of patients belonging to the intermediate to high group.
MMAI score per 1 SD increase.
Pre-established MMAI risk groups.
MVA of the MMAI score adjusted for percent biopsy-positive cores and standard clinical risk nomograms for BCR and AP at RP.
Endpoint . | Variable . | Level . | Effect sizea (95% CI) . | P value . | Events/Nb . |
---|---|---|---|---|---|
BCR | MMAI | Continuousc | 2.10 (1.44–3.06) | <0.001 | 28/140 |
% Positive cores | Continuous | 1.41 (1.02–1.97) | 0.05 | ||
MMAI | Continuousc | 2.13 (1.44–3.14) | <0.001 | 28/130 | |
NCCN | FI vs. low | 1.30 (0.14–11.70) | 0.82 | ||
UFI vs. low | 3.45 (0.45–26.21) | 0.23 | |||
High vs. low | 3.05 (0.36–25.84) | 0.31 | |||
MMAI | Intermediate to high vs. lowd | 3.45 (1.20–9.86) | 0.02 | 28/130 | |
NCCN | FI vs. low | 1.30 (0.15–11.43) | 0.81 | ||
UFI vs. low | 3.54 (0.46–27.13) | 0.22 | |||
High vs low | 5.40 (0.73–39.93) | 0.10 | |||
MMAI | Continuousc | 1.99 (1.29–3.07) | 0.002 | 28/140 | |
CAPRA | Continuous | 1.18 (0.96–1.47) | 0.12 | ||
MMAI | Intermediate to high vs. lowd | 3.48 (1.24–9.75) | 0.02 | 28/140 | |
CAPRA | Intermediate vs. low | 8.00 (1.17–54.65) | 0.03 | ||
High vs. low | 9.08 (1.30–63.33) | 0.03 | |||
AP | MMAI | Continuousc | 4.89 (2.55–10.92) | <0.001 | 21/88 |
% Positive cores | Continuous | 0.86 (0.45–1.60) | 0.62 | ||
MMAI | Continuousb | 5.07 (2.26–13.68) | <0.001 | 20/78 | |
NCCN | FI vs. low | 0.56 (0.06–5.48) | 0.60 | ||
UFI vs. low | 0.77 (0.09–7.36) | 0.81 | |||
High vs. low | 2.16 (0.10–77.56) | 0.63 | |||
MMAI | Intermediate to high vs. lowd | 41.71 (6.47–850.96) | 0.001 | 20/78 | |
NCCN | FI vs. low | 0.32 (0.01–4.08) | 0.40 | ||
UFI vs. low | 0.61 (0.03–7.24) | 0.70 | |||
High vs. low | 1.94 (0.05–75.05) | 0.70 | |||
MMAI | Continuousc | 4.59 (2.32–10.51) | <0.001 | 21/88 | |
CAPRA | Continuous | 1.13 (0.67–1.89) | 0.64 | ||
MMAI | Intermediate to high vs. lowd | 23.67 (5.87–162.16) | <0.001 | 21/88 | |
CAPRA | Intermediate vs. low | 1.39 (0.36–5.52) | 0.63 |
Endpoint . | Variable . | Level . | Effect sizea (95% CI) . | P value . | Events/Nb . |
---|---|---|---|---|---|
BCR | MMAI | Continuousc | 2.10 (1.44–3.06) | <0.001 | 28/140 |
% Positive cores | Continuous | 1.41 (1.02–1.97) | 0.05 | ||
MMAI | Continuousc | 2.13 (1.44–3.14) | <0.001 | 28/130 | |
NCCN | FI vs. low | 1.30 (0.14–11.70) | 0.82 | ||
UFI vs. low | 3.45 (0.45–26.21) | 0.23 | |||
High vs. low | 3.05 (0.36–25.84) | 0.31 | |||
MMAI | Intermediate to high vs. lowd | 3.45 (1.20–9.86) | 0.02 | 28/130 | |
NCCN | FI vs. low | 1.30 (0.15–11.43) | 0.81 | ||
UFI vs. low | 3.54 (0.46–27.13) | 0.22 | |||
High vs low | 5.40 (0.73–39.93) | 0.10 | |||
MMAI | Continuousc | 1.99 (1.29–3.07) | 0.002 | 28/140 | |
CAPRA | Continuous | 1.18 (0.96–1.47) | 0.12 | ||
MMAI | Intermediate to high vs. lowd | 3.48 (1.24–9.75) | 0.02 | 28/140 | |
CAPRA | Intermediate vs. low | 8.00 (1.17–54.65) | 0.03 | ||
High vs. low | 9.08 (1.30–63.33) | 0.03 | |||
AP | MMAI | Continuousc | 4.89 (2.55–10.92) | <0.001 | 21/88 |
% Positive cores | Continuous | 0.86 (0.45–1.60) | 0.62 | ||
MMAI | Continuousb | 5.07 (2.26–13.68) | <0.001 | 20/78 | |
NCCN | FI vs. low | 0.56 (0.06–5.48) | 0.60 | ||
UFI vs. low | 0.77 (0.09–7.36) | 0.81 | |||
High vs. low | 2.16 (0.10–77.56) | 0.63 | |||
MMAI | Intermediate to high vs. lowd | 41.71 (6.47–850.96) | 0.001 | 20/78 | |
NCCN | FI vs. low | 0.32 (0.01–4.08) | 0.40 | ||
UFI vs. low | 0.61 (0.03–7.24) | 0.70 | |||
High vs. low | 1.94 (0.05–75.05) | 0.70 | |||
MMAI | Continuousc | 4.59 (2.32–10.51) | <0.001 | 21/88 | |
CAPRA | Continuous | 1.13 (0.67–1.89) | 0.64 | ||
MMAI | Intermediate to high vs. lowd | 23.67 (5.87–162.16) | <0.001 | 21/88 | |
CAPRA | Intermediate vs. low | 1.39 (0.36–5.52) | 0.63 |
Note: Boldface indicates P ≤ 0.05.
Abbreviations: FI, favorable intermediate; UFI, unfavorable intermediate.
Effect size refers to sHR and OR for BCR and AP endpoints, respectively.
Patients in parentheses represent events and number of patients belonging to the intermediate to high group.
MMAI score per 1 SD increase.
Pre-established MMAI risk groups.
Cumulative incidence plot of categorical MMAI risk groups for BCR. *, Statistically significant.
Cumulative incidence plot of categorical MMAI risk groups for BCR. *, Statistically significant.
Among the subset evaluable for AP at RP, the continuous MMAI score was significantly associated with AP (OR, 4.85; 95% CI, 2.54–10.78; P < 0.001; Table 2) on UVA. MMAI categorical risk groups were also significantly associated with AP (OR, 25.86; 95% CI, 6.64–172.95; P < 0.001; Table 2). On MVA, the continuous MMAI score remained independently prognostic when adjusted for NCCN (OR, 5.07; 95% CI, 2.26–13.68; P < 0.001; Table 3) and CAPRA (OR, 4.59; 95% CI, 2.32–10.51; P < 0.001; Table 3). Consistent results were observed for categorical MMAI risk groups (OR, 41.71; 95% CI, 6.47–850.96; P = 0.001 for NCCN and OR, 23.67; 95% CI, 5.87–162.16; P < 0.001 for CAPRA; Table 3).
Similar results were observed within NCCN subgroup analyses (Supplementary Table S2). The continuous MMAI score remained significantly associated with BCR within NCCN low to favorable intermediate patients (sHR, 2.78; 95% CI, 1.11–7.01; P = 0.03) and within NCCN unfavorable intermediate to high patients (sHR, 1.95; 95% CI, 1.39–2.74; P < 0.001) on UVA. With very few events, the categorical MMAI risk group was underpowered for the NCCN low to favorable intermediate patients but remained significantly associated with BCR within the NCCN unfavorable intermediate to high patients (sHR, 3.55; 95% CI, 1.08–11.70; P = 0.04). Among the subset evaluable for AP at RP, the continuous MMAI score remained significantly associated with AP within NCCN low to favorable intermediate patients (OR, 3.92; 95% CI, 1.76–10.87; P = 0.002) and within NCCN unfavorable intermediate to high patients (OR, 7.03; 95% CI, 2.28–36.29; P = 0.004) on UVA. The categorical MMAI risk group remained significantly associated with AP within the NCCN low to favorable intermediate patients (OR, 12.06; 95% CI, 2.43–91.32; P = 0.005) but did not converge within the NCCN unfavorable intermediate to high patients, as there were no AP events in the MMAI low group.
The distribution of the MMAI risk group within NCCN and CAPRA risk groups is shown in Fig. 3 (see Supplementary Fig. S1). Of the 38 (29%) men with NCCN high-risk disease, 4 (10.5%) and 26 (68.4%) were found to be MMAI low and intermediate risk, respectively. Similarly, 2 (6.5%) and 23 (74.2%) of the 31 (21.7%) men with high CAPRA risk were found to be MMAI low and intermediate risk, respectively.
MMAI model reclassification of prognostic risk within (A) NCCN and (B) CAPRA risk groups.
MMAI model reclassification of prognostic risk within (A) NCCN and (B) CAPRA risk groups.
Discussion
In this study, we found a previously developed digital histopathology–based MMAI biomarker to be prognostic for the endpoint of BCR in European men with localized prostate cancer treated with RP. Cumulative incidence plots showed that the biomarker performed well in stratifying the cohort among the previously defined three-tier MMAI risk groups. In addition, we found the biomarker to be prognostic for AP at RP among the subgroup that had its full RP pathology details available for analysis and was eligible for upstaging.
The clinical significance of these findings is that they represent the first external validation of the MMAI biomarker run on prostate biopsy samples from a cohort of men who received definitive surgical treatment with RP. This is compared with the initial development and validation efforts for the novel MMAI model which were done on the prostate biopsy samples of over 5,600 men enrolled in the NRG Oncology/RTOG phase III randomized controlled trial datasets treated with definitive radiotherapy regimens. Although the outcomes of men with localized prostate cancer treated with definitive radiotherapy regimens versus surgery are thought to be no different (9), this analysis is an important step in establishing generalizability to a population initially managed with surgery alone. In addition, it is reassuring that the model performed well in (i) an independent prospectively collected population, as opposed to the more curated cohorts typical of randomized controlled trials (10) and (ii) a non–North American population, compared with the NRG/RTOG cohorts which are enrolled from sites in the United States and Canada. These observations further support the external generalizability of the MMAI model.
The MMAI model demonstrated prognostic performance for the RP-specific endpoints of BCR and AP at RP It also risk stratified patients within NCCN and CAPRA risk groups, providing an additional layer of personalized information that has the potential to inform treatment escalation or de-escalation strategies in surgically managed men, such as the timing of the use of neoadjuvant or adjuvant therapies in addition to RP. For example, the phase III CALGB 90203 (Alliance) trial evaluated neoadjuvant chemohormonal therapy for high-risk localized prostate cancer before RP (11). Although chemohormonal therapy did not demonstrate an improvement in the primary endpoint of 3-year biochemical progression–free survival over RP alone, there were distinct genomic and transcriptomic features observed that demonstrated useful prognostic features (12). It is plausible that a prognostic biomarker driven by histopathology features, such as this MMAI model, could identify subgroups more likely to benefit from neoadjuvant or adjuvant therapies. Additionally, given its ability to predict BCR, which is valuable in the management of patients after RP, MMAI risk also has the potential to inform PSA monitoring, with more frequent follow-up in patients who have increased MMAI risk. These are important avenues for additional research.
Of note, these findings are consistent with prior preliminary validation efforts of the MMAI biomarker, such as its association with PCSM and overall survival on RP specimens (as opposed to prostate biopsy specimens examined herein; ref. 13) as well as its association with PCSM on prostate biopsy specimens in men from the United Kingdom with high-risk localized or metastatic prostate cancer (14).
This study has some limitations. It is a retrospective study using prospectively collected data from a single institution from a nation with a less racially and ethnically diverse population relative to the United States. Given the secondary, nonrandomized study design, we are unable to control for all potential known and unknown confounders. In addition, not all diagnosed patients from the primary reference cohort had the necessary clinical data or histopathology to generate MMAI scores. Furthermore, some patients had complete clinical or histopathology data but were cT3 or Gleason GG 4 to 5 at the time of biopsy and therefore could not be evaluated for the secondary endpoint of AP at RP. Patients excluded from the primary analysis cohort seemed to have slightly lower risk than those in the analysis cohort (Supplementary Table S1). Any of these limitations could introduce possible bias.
In conclusion, we found the prostate biopsy MMAI biomarker, previously shown to be prognostic for distant metastasis and PCSM in men receiving definitive radiation, to be prognostic for the post-RP endpoints of BCR and AP. This biomarker validation study using a real-world cohort further supports the use of MMAI biomarkers in men with localized prostate cancer outside the United States and Canada and treated with surgery.
Authors’ Disclosures
A. Bjartell reports personal fees from Accord, Sandoz, European Association of Urology, and IPSEN; grants and personal fees from Astellas, AstraZeneca, and Bayer; nonfinancial support from ESMO; grants from Curasight, Johnson & Johnson, and Spectracure; and other support from LIDDS Pharma, WntResearch, and Glactone Pharma outside the submitted work. V.Y.T. Liu reports other support from Artera outside the submitted work. M. Tierney reports other support from Artera outside the submitted work. T.J. Royce reports other support from Artera outside the submitted work. M. Sjöström reports grants from the Prostate Cancer Foundation and Swedish Cancer Society during the conduct of the study as well as personal fees from Astellas and Veracyte/Adelphi Targis outside the submitted work. E. Chen reports employment with Artera. A. Esteva reports other support from Artera during the conduct of the study as well as a patent for Multiple pending to Andre Esteva/Artera; in addition, A. Esteva is the founder and CEO of Artera and is employed by Artera. F.Y. Feng reports personal fees from Janssen, Bayer, Roivant Sciences, Astellas Pharma, Foundation Medicine, Varian, Novartis, Tempus, and Amgen and other support from PFS Genomics, Bristol Myers Squibb, SerImmune, and Artera outside the submitted work. No disclosures were reported by the other authors.
Authors’ Contributions
A. Bjartell: Conceptualization, resources, data curation, supervision, funding acquisition, methodology, writing–original draft, writing–review and editing. A. Krzyzanowska: Resources, data curation, investigation, writing–review and editing. V.Y.T. Liu: Conceptualization, resources, formal analysis, validation, visualization, methodology, writing–original draft, writing–review and editing. M. Tierney: Conceptualization, methodology, writing–original draft, project administration, writing–review and editing. T.J. Royce: Conceptualization, methodology, writing–original draft, writing–review and editing. M. Sjöström: Conceptualization, methodology, writing–original draft, writing–review and editing. M.M. Palominos-Rivera: Resources, data curation, investigation, writing–review and editing. E. Chen: Conceptualization, writing–review and editing. A. Kraft: Resources, data curation, writing–review and editing. A. Esteva: Writing–review and editing. F.Y. Feng: Writing–review and editing.
Acknowledgments
This study was funded by grants to A. Bjartell from The Swedish Cancer Society (Cancerfonden #2018/522 and #2021/1629), Swedish Scientific Council (Vetenskapsrådet, #2020-02017 and 2023-02671), The Research Funds at Skåne University Hospital, and The Governmental Funding through The Faculty of Medicine, Lund University (contract number 2018/810 and 2022/0105). The authors would like to thank Kristina Ekström-Holka for help with the scanning of the slides and Anna Holst, Anna Stiehm, Emelie Winell, Suzy Lindberg, Annica Löfgren, and Åsa Andersson for help with patient clinical information collection.
Note: Supplementary data for this article are available at Clinical Cancer Research Online (http://clincancerres.aacrjournals.org/).