Abstract
We explored and validated the association of postoperative carcinoembryonic antigen (CEA) with disease-free survival (DFS) and overall survival (OS) in stage III colon cancer.
Patients with stage III colon cancer from the MOSAIC and PETACC-8 trials were enrolled. The relation between CEA and outcomes was continuously modeled with the restricted cubic splines (RCS) method. Association of CEA with outcomes was assessed by the Kaplan–Meier method, with two risk groups among patients with a CEA level ≤5 ng/mL. Multivariate Cox proportional hazard models were constructed.
The CEA level was available in 1,292 (96%) and 2,477 (97%) patients in the discovery and validation cohorts. The RCS analysis confirmed that patients with a CEA level >5 ng/mL were at highest risk of recurrence or death and those with a CEA level ≤5 ng/mL presented a heterogeneous risk population. In the discovery cohort, the 3-year DFS rate was 75%, 65%, and 45% in a group of patients with CEA level of 0–1.30 ng/mL (n = 630), 1.30–5 ng/mL (n = 613), and >5 ng/mL (n = 49), respectively (P < 0.001). CEA was independently associated with endpoints. All findings were confirmed in the validation cohort.
Postoperative CEA level was highly and independently associated with DFS and OS, especially in patients with a CEA level of ≤5 ng/mL, suggesting that this cutoff is not optimal.
CEA levels should be applied more accurately in future trials and clinical practice.
Introduction
Colon cancer is the third most common cancer in men and women (1, 2). When diagnosed at a localized stage, curative management is proposed to patients including surgery and adjuvant chemotherapy if lymph node involvement is observed (stage III colon cancer). In stage III disease, fluoropyrimidine-based adjuvant treatment improved patient outcomes (3). The addition of oxaliplatin to a fluoropyrimidine chemotherapy backbone has produced additional benefit by increasing the 3-year disease-free survival (DFS) rate (4–6), and the oxaliplatin-containing combination became the worldwide standard adjuvant treatment in the setting of colon cancer. Recently, the findings of the International Duration Evaluation of Adjuvant Therapy collaboration have provided useful information in helping oncologists discuss the duration of adjuvant therapy (7–10). To optimize the value of this treatment, better tumor markers or better awareness of tumor markers to improve the recurrence risk detection rate are needed.
Carcinoembryonic antigen (CEA) is a well-known, low-cost biological tumor marker for colon cancer (11, 12), overexpressed in 90% of colorectal cancer cases (13). The European Society of Medical Oncology (ESMO) guidelines recommend CEA determination in a postoperative follow-up period (14). Previous large studies have shown that elevated preoperative CEA levels are associated with worse prognosis in all stage colon cancer patients (15) and in stage I and II colon cancer patients who did not receive adjuvant chemotherapy (16).
The results by Konishi and colleagues showed that the preoperative CEA level is not a relevant marker of recurrence in localized colon cancer if CEA is normalized after surgery (17), thus underling the importance of postoperative CEA. Another study by Lu and colleagues identified high postoperative CEA level of ≥5 ng/mL as an independent prognostic predictor of early relapse (18). The CEA upper limit of normal (ULN) levels vary between different institutions, ranging from 3.0 to 5.0 ng/mL, with a cutoff of 5.0 ng/mL used by most centers even though this value remains debated (15, 16, 19). Recently, Margalit and colleagues defined preoperative CEA of 2.35 ng/mL as an optimal cut-off point for predicting survival of stage I and II colon cancer (16). Likewise, Kim and colleagues suggested preoperative CEA of 3 ng/mL as the best cut-off value to identify stage III colon cancer patients with low and high risk of death or recurrence (19).
The aims of this post hoc analysis were (i) to assess and precisely validate the postoperative CEA prognostic value for DFS and OS in patients with stage III colon cancer receiving standard adjuvant treatments; (ii) to challenge the standard 5 ng/mL as threshold value to discriminate patients with low and high risk of recurrence or death; and (iii) to determine the additional value of postoperative CEA for the TNM classification to better stratify patients for risk of death and of recurrence.
Materials and Methods
Population
Stage III colon cancer patients for a discovery cohort were selected from the MOSAIC phase III trial (NCT00275210; refs. 4, 20). The MOSAIC study showed a survival benefit with the addition of oxaliplatin to adjuvant 5-fluorouracil (5-FU) and leucovorin (LV; the FOLFOX4 regimen) in patients with colon cancer. External validation cohort was obtained with the PETACC-8 phase III trial population (NCT00265811; ref. 21). The PETACC-8 study failed to show a survival benefit for adjuvant combination of FOLFOX and cetuximab in patients with stage III colon cancer. Both trials were performed in accordance with the Declaration of Helsinki and Good Clinical Practice guidelines and approved by the appropriate Ethics Committees.
Follow-up
In both study cohorts, the radiologic evaluation was made at randomization and every 6 months during the first 5 years of follow-up.
The MOSAIC database was locked on April 25, 2014, and the PETACC-8 database on October 27, 2016.
Data extraction
Demographics, cancer history, pathologic, clinical, biological parameters, and efficacy outcomes (DFS and OS) were prospectively collected at the time of randomization.
Postoperative CEA serum measurements
For both study cohorts, a postoperative CEA value (MOSAIC: <10 ng/mL; PETACC-8: ≤1.5× ULN) was an inclusion criterion. The serum CEA measurements were not centralized. In the PETACC-8 study, the ULN for CEA was available for all patients; the median CEA value was 5 ng/mL [interquartile range (IQR), 3.8–5].
Statistical analysis
Median (IQR) values and proportions (percentage) were used for continuous and categorical variables, respectively. Median and proportions were compared using the Wilcoxon–Mann–Whitney test and the χ2-test (or the Fisher exact test, if appropriate), respectively.
DFS was defined as the time between randomization and local/distant relapse, second colorectal/rectal occurrence, or death, whichever occurred first. Alive patients without relapse and second colorectal cancer were censored at the date of their last follow-up. OS was defined as the time between randomization and death from any cause. Patients known to be alive were censored at the date of their last follow-up.
DFS and OS were estimated using Kaplan–Meier method and described using median or rate at specific time points with their 95% confidence intervals (95% CI). Follow-up was calculated using the reverse Kaplan–Meier method (22).
When used continuously, the association between CEA and survival was investigated using the fractional polynomials method (23–25) and was validated by the restricted cubic splines method with graphical evaluation. Different cutoffs of CEA level in the discovery cohort were explored to investigate an association between CEA and survival (categorical). These were based on the percentiles (median and tercile) and optimal cut-off point (log-rank maximization) methods (26).
The association of demographic, clinical, biological, and molecular factors with survival was assessed with univariate Cox proportional hazards model. Variables with P values of less than 0.10 and/or clinically relevant variables were entered into an intermediate multivariable Cox regression model. Final multivariable model was constructed with statistically significant variables of the intermediate model. The correlation between variables was verified before construction of the multivariate models, in order to deal with potential colinearity. All multivariate models were stratified on the basis of adjuvant treatment received. Hazard proportionality was assessed for each categorical variable using the Schoenfeld residual-based test and a graphical method.
The predictive value and the discrimination ability of the final model were assessed with the Harrell's concordance index (C-index; ref. 27). The robustness of the statistical models was assessed by multiple imputation procedure (assessment of potential bias arising from missing data for parameters involved in the multivariable model; ref. 28).
The differential treatment effects on DFS and OS among CEA risk groups were evaluated with an interaction term in the Cox proportional hazards model.
All analyses were performed using SAS version 9.4 (SAS Institute, Cary, NC) and R software version 2.15.2 (R Development Core Team; http://www.r-project.org). P values of less than 0.05 were considered statistically significant, and all tests were two-sided.
Results
Characteristics of patients
In the discovery cohort, 1,347 patients with stage III colon cancer from the MOSAIC study were enrolled. Postoperative CEA was available in 1,292 (95.9%) patients; 1,243 (96.2%) had a CEA level below 5 ng/mL. The median time between surgery and CEA measurement was 4.3 weeks and the median follow-up was 9.6 years (Table 1).
. | . | Discovery cohort . | External validation cohort . | P . |
---|---|---|---|---|
. | . | N = 1,347 . | N = 2,552 . | . |
Age (years)a | <70 | 1157 (85.9%) | 2283 (89.4%) | 0.001 |
≥70 | 190 (14.1%) | 269 (10.5%) | ||
Gendera | Female | 621 (46.1%) | 1091 (42.7%) | 0.04 |
Male | 726 (53.9%) | 1461 (57.2%) | ||
Body mass index | Normal | 661 (49.2%) | 1127 (44.8%) | <0.0001 |
Underweight | 49 (3.7%) | 51 (2.03%) | ||
Overweight | 461 (34.3%) | 926 (36.8%) | ||
Obese | 172 (12.8%) | 411 (16.3%) | ||
Missing | 4 | 37 | ||
Performance status | 0–1 | 1153 (85.6%) | 2430 (99.7%) | <0.0001 |
≥2 | 194 (14.4%) | 6 (0.25%) | ||
Missing | 0 | 116 | ||
Treatment arma | FOLFOX | 672 (49.9%) | 1277 (50.0%) | |
LV5FU2 | 675 (50.1%) | - | ||
FOLFOX-cetuximab | - | 1275 (50.0%) | ||
Tumor location | Left colon | 895 (66.4%) | 1549 (61.0%) | <0.0001 |
Right colon | 451 (33.5%) | 964 (38.0%) | ||
Both | 1 (0.1%) | 26 (1.0%) | ||
Missing | 0 | 13 | ||
T stage | T1 – T2 | 119 (8.8%) | 265 (10.4%) | 0.08 |
T3 | 977 (72.6%) | 1765 (69.3%) | ||
T4 | 250 (18.6%) | 518 (20.3%) | ||
Missing | 1 | 4 | ||
N stage | N1 | 882 (65.7%) | 1595 (62.5%) | 0.05 |
N2 | 460 (34.3%) | 957 (37.5%) | ||
Missing | 5 | |||
Histoprognostic grade | G1/2 | 1085 (84.4%) | 2051 (81.4%) | 0.02 |
G3/4 | 201 (15.6%) | 469 (18.6%) | ||
Missing | 61 | 32 | ||
Intestinal perforationa | Yes | 75 (5.6%) | 121 (4.7%) | 0.26 |
Intestinal obstructiona | Yes | 260 (19.3%) | 416 (16.3%) | <0.0001 |
CEA (ng/mL) | Median (IQR) | 1.4 (1–2.3) | 1.91 (0.9–2.2) | |
CEA (ng/mL) | ≤5 | 1243 (96.2%) | 2378 (95.89%) | 0.63 |
>5 | 49 (3.8%) | 102 (4.11%) | ||
Missing | 55 | 72 | ||
CEAb (ng/mL) | 0 to 1.30a | 630 (48.8%) | 1172 (47.2%) | 0.64 |
1.30 to 5 | 613 (47.4%) | 1205 (48.6%) | ||
>5 | 49 (3.8%) | 103 (4.1%) | ||
Missing | 55 | 72 | ||
CEAc (ng/mL) | 0 to 1 | 492 (38.1%) | 888 (35.8%) | 0.36 |
1 to 2 | 429 (33.2%) | 887 (35.8%) | ||
2 to 5 | 322 (24.9%) | 602 (24.3%) | ||
>5 | 49 (3.8%) | 103 (4.1%) | ||
Missing | 55 | 72 | ||
Time between surgery and CEA measurement (week) | Median (IQR) | 4.3 (3.3–5.4) | 5.1 (4.0–6.3) | <0.0001 |
Missing | 56 | 70 | ||
BRAF and MMR status | BRAFWT/dMMR | 28 (4.8%) | 103 (6.4%) | 0.003 |
BRAFmut/pMMR | 51 (8.8%) | 126 (7.9%) | ||
BRAFmut/dMMR | 12 (2.1%) | 64 (4.0%) | ||
BRAFWT/pMMR | 488 (84.3%) | 1308 (81.7%) | ||
Missing | 768 | 951 | ||
Median follow-up time (95% CI), yearsa | 9.6 (9.5–9.8) | 5.85 (5.71–5.91) |
. | . | Discovery cohort . | External validation cohort . | P . |
---|---|---|---|---|
. | . | N = 1,347 . | N = 2,552 . | . |
Age (years)a | <70 | 1157 (85.9%) | 2283 (89.4%) | 0.001 |
≥70 | 190 (14.1%) | 269 (10.5%) | ||
Gendera | Female | 621 (46.1%) | 1091 (42.7%) | 0.04 |
Male | 726 (53.9%) | 1461 (57.2%) | ||
Body mass index | Normal | 661 (49.2%) | 1127 (44.8%) | <0.0001 |
Underweight | 49 (3.7%) | 51 (2.03%) | ||
Overweight | 461 (34.3%) | 926 (36.8%) | ||
Obese | 172 (12.8%) | 411 (16.3%) | ||
Missing | 4 | 37 | ||
Performance status | 0–1 | 1153 (85.6%) | 2430 (99.7%) | <0.0001 |
≥2 | 194 (14.4%) | 6 (0.25%) | ||
Missing | 0 | 116 | ||
Treatment arma | FOLFOX | 672 (49.9%) | 1277 (50.0%) | |
LV5FU2 | 675 (50.1%) | - | ||
FOLFOX-cetuximab | - | 1275 (50.0%) | ||
Tumor location | Left colon | 895 (66.4%) | 1549 (61.0%) | <0.0001 |
Right colon | 451 (33.5%) | 964 (38.0%) | ||
Both | 1 (0.1%) | 26 (1.0%) | ||
Missing | 0 | 13 | ||
T stage | T1 – T2 | 119 (8.8%) | 265 (10.4%) | 0.08 |
T3 | 977 (72.6%) | 1765 (69.3%) | ||
T4 | 250 (18.6%) | 518 (20.3%) | ||
Missing | 1 | 4 | ||
N stage | N1 | 882 (65.7%) | 1595 (62.5%) | 0.05 |
N2 | 460 (34.3%) | 957 (37.5%) | ||
Missing | 5 | |||
Histoprognostic grade | G1/2 | 1085 (84.4%) | 2051 (81.4%) | 0.02 |
G3/4 | 201 (15.6%) | 469 (18.6%) | ||
Missing | 61 | 32 | ||
Intestinal perforationa | Yes | 75 (5.6%) | 121 (4.7%) | 0.26 |
Intestinal obstructiona | Yes | 260 (19.3%) | 416 (16.3%) | <0.0001 |
CEA (ng/mL) | Median (IQR) | 1.4 (1–2.3) | 1.91 (0.9–2.2) | |
CEA (ng/mL) | ≤5 | 1243 (96.2%) | 2378 (95.89%) | 0.63 |
>5 | 49 (3.8%) | 102 (4.11%) | ||
Missing | 55 | 72 | ||
CEAb (ng/mL) | 0 to 1.30a | 630 (48.8%) | 1172 (47.2%) | 0.64 |
1.30 to 5 | 613 (47.4%) | 1205 (48.6%) | ||
>5 | 49 (3.8%) | 103 (4.1%) | ||
Missing | 55 | 72 | ||
CEAc (ng/mL) | 0 to 1 | 492 (38.1%) | 888 (35.8%) | 0.36 |
1 to 2 | 429 (33.2%) | 887 (35.8%) | ||
2 to 5 | 322 (24.9%) | 602 (24.3%) | ||
>5 | 49 (3.8%) | 103 (4.1%) | ||
Missing | 55 | 72 | ||
Time between surgery and CEA measurement (week) | Median (IQR) | 4.3 (3.3–5.4) | 5.1 (4.0–6.3) | <0.0001 |
Missing | 56 | 70 | ||
BRAF and MMR status | BRAFWT/dMMR | 28 (4.8%) | 103 (6.4%) | 0.003 |
BRAFmut/pMMR | 51 (8.8%) | 126 (7.9%) | ||
BRAFmut/dMMR | 12 (2.1%) | 64 (4.0%) | ||
BRAFWT/pMMR | 488 (84.3%) | 1308 (81.7%) | ||
Missing | 768 | 951 | ||
Median follow-up time (95% CI), yearsa | 9.6 (9.5–9.8) | 5.85 (5.71–5.91) |
NOTE: Variables are described as n (%) except for continuous variables, which are described as median (IQR). P values refer to the differences between the discovery and validation cohorts.
Abbreviations: dMMR, deficient mismatch repair; LV5FU2, 5-fluorouracil chemotherapy; mut, mutated; N stage,, nodal stage according to the TNM classification; pMMR, proficient mismatch repair; T stage, tumor stage according to the TNM classification; WT, wild type.
aNo missing data.
bMedian approach was used to split the population of patients with a level of CEA <5 ng/mL.
cTertile approach was used to split the population of patients with a level of CEA <5 ng/mL.
In the external validation cohort, 2,552 patients with stage III colon cancer from the PETACC-8 study were enrolled. The median time between surgery and CEA measurement was 5.1 weeks and the median follow-up was 5.85 years (Table 1).
Association of postoperative CEA in its continuous form with survival
In the discovery cohort, the restricted cubic splines approach showed that patients with a CEA level of >5 ng/mL were at highest risk of recurrence or death. Patients with a CEA level of ≤5 ng/mL did not present a homogeneous risk population. A gradual risk suggested a square root relation between CEA and DFS (Fig. 1A). The square root transformation was the best to model the relation between CEA and survival endpoints with the fractional polynomials method. Similar observations were made for the external validation cohort (Fig. 1B) and for OS endpoint (Supplementary Fig. S1). These results allowed to identify different risk populations among patients with a level of CEA lower than 5 ng/mL.
Survival according to postoperative categorical CEA
The 5 ng/mL cut-off value of CEA allowed to identified two populations with distinct DFS risk profiles (CEA ≤ 5 ng/mL population: the 3-year DFS rate of 70%; and CEA > 5 ng/mL population: the 3-year DFS rate of 45%; Fig. 2A).
Considering the important number of patients with CEA ≤ 5 ng/mL, the median and tertile approach were used to determine 2 and 3 populations in these patients (Table 1). Figure 2B shows a distinct DFS risk profile in the two median based populations identified in patients with CEA ≤ 5 ng/mL. The 3-year DFS rate was 75%, 65%, and 45% for CEA < 1.30 ng/mL, CEA 1.30 to 5 ng/mL, and CEA > 5 ng/mL groups, respectively (P < 0.0001). The optimal cut-off value determined by the Hothorn and Lausen method (Supplementary Fig. S2) was equal to 1.30 ng/mL for patients with a CEA level of ≤ 5 ng/mL, reinforcing the choice of CEA 1.30 to determine different risk populations in these patients. These results were fully confirmed in the external validation cohort (Fig. 2C and D) and were also validated for OS in both cohorts (Supplementary Fig. S3). Similar results were observed for DFS and OS with the tertile approach for both study cohorts (Supplementary Fig. S4).
Consistent observations were found in a subgroup analysis of treatments and of times between surgery and CEA measurement for both cohorts (Supplementary Figs. S5–S8).
Patients' characteristics according to CEA level groups are presented in Supplementary Tables S1 and S2.
Postoperative CEA level as independent prognostic factor for survival
In univariate and multivariate analyses, CEA level was found to be significantly associated with DFS. The independent prognostic value of CEA was also confirmed in multivariate analysis. Patients with CEA between 1.30 and 5 mg/mL had a HR of 1.67 (95% CI, 1.28–2.19) compared to those with CEA < 1.30 ng/mL (Table 2). Similar results were obtained for OS (Table 2).
. | . | . | . | Univariate . | Final multivariate N patients = 547 . | ||
---|---|---|---|---|---|---|---|
. | . | N pts . | N evt . | HR (95% CI) . | P . | HR (95% CI) . | P . |
Age | >70 | 190 | 92 | 1.27 (1.01–1.58) | 0.04 | 1.5 (1.09–2.07) | 0.01 |
Gender | Male | 726 | 294 | 0.98 (0.83–1.16) | 0.84 | ||
Body mass index | Underweight | 49 | 18 | 0.96 (0.60–1.55) | 0.14 | ||
Overweight | 461 | 207 | 1.20 (1.00–1.44) | ||||
Obese | 172 | 64 | 0.91 (0.69–1.20) | ||||
Treatment arm | LV5FU2 | 675 | 293 | 1.26 (1.07–1.49) | 0.007 | ||
Tumor location | Right colon | 452 | 187 | 1.08 (0.90–1.29) | 0.40 | ||
Histoprognostic grade | G3/4 | 201 | 90 | 1.27 (1.01–1.59) | 0.04 | ||
Performance status | ≥ 1 | 970 | 388 | 0.98 (0.81–1.18) | 0.81 | ||
Intestinal perforation | Yes | 75 | 42 | 1.68 (1.23–2.31) | 0.001 | ||
Intestinal obstruction | Yes | 260 | 125 | 1.37 (1.12–1.68) | 0.002 | ||
CEA level | 0 to 1.30 ng/mL | 630 | 212 | Ref | <0.0001 | Ref | <0.0001 |
1.30 to 5 ng/mL | 613 | 286 | 1.53 (1.28–1.83) | 1.67 (1.28–2.19) | |||
>5 ng/mL | 49 | 29 | 2.75 (1.86–4.05) | 2.68 (1.53–4.69) | |||
CEA level | 0 to 1 ng/mL | 492 | 166 | Ref | <0.0001 | ||
1 to 2 ng/mL | 429 | 175 | 1.25 (1.01–1.54) | ||||
2 to 5 ng/mL | 322 | 157 | 1.62 (1.30–2.02) | ||||
>5 ng/mL | 49 | 29 | 2.72 (1.83–4.03) | ||||
BRAF and MMR status | BRAFWT/pMMR | 4488 | 212 | Ref | 0.42 | Ref | 0.22 |
BRAFmut/pMMR | 51 | 22 | 1.08 (0.70–1.68) | 1.03 (0.65–1.64) | |||
BRAFWT/dMMR | 28 | 8 | 0.62 (0.31–1.26) | 0.65 (0.32–1.32) | |||
BRAFmut/dMMR | 12 | 3 | 0.57 (0.18–1.78) | 0.36 (0.11–1.14) | |||
T stage | T3 | 977 | 384 | 1.90 (1.30–2.80) | <0.0001 | ||
T4 | 250 | 131 | 2.79 (1.85–4.19) | ||||
N stage | N2 | 460 | 237 | 1.89 (1.59–2.24) | <0.0001 | 1.93 (1.49–2.50) | <0.0001 |
. | . | . | . | Univariate . | Final multivariate N patients = 547 . | ||
---|---|---|---|---|---|---|---|
. | . | N pts . | N evt . | HR (95% CI) . | P . | HR (95% CI) . | P . |
Age | >70 | 190 | 92 | 1.27 (1.01–1.58) | 0.04 | 1.5 (1.09–2.07) | 0.01 |
Gender | Male | 726 | 294 | 0.98 (0.83–1.16) | 0.84 | ||
Body mass index | Underweight | 49 | 18 | 0.96 (0.60–1.55) | 0.14 | ||
Overweight | 461 | 207 | 1.20 (1.00–1.44) | ||||
Obese | 172 | 64 | 0.91 (0.69–1.20) | ||||
Treatment arm | LV5FU2 | 675 | 293 | 1.26 (1.07–1.49) | 0.007 | ||
Tumor location | Right colon | 452 | 187 | 1.08 (0.90–1.29) | 0.40 | ||
Histoprognostic grade | G3/4 | 201 | 90 | 1.27 (1.01–1.59) | 0.04 | ||
Performance status | ≥ 1 | 970 | 388 | 0.98 (0.81–1.18) | 0.81 | ||
Intestinal perforation | Yes | 75 | 42 | 1.68 (1.23–2.31) | 0.001 | ||
Intestinal obstruction | Yes | 260 | 125 | 1.37 (1.12–1.68) | 0.002 | ||
CEA level | 0 to 1.30 ng/mL | 630 | 212 | Ref | <0.0001 | Ref | <0.0001 |
1.30 to 5 ng/mL | 613 | 286 | 1.53 (1.28–1.83) | 1.67 (1.28–2.19) | |||
>5 ng/mL | 49 | 29 | 2.75 (1.86–4.05) | 2.68 (1.53–4.69) | |||
CEA level | 0 to 1 ng/mL | 492 | 166 | Ref | <0.0001 | ||
1 to 2 ng/mL | 429 | 175 | 1.25 (1.01–1.54) | ||||
2 to 5 ng/mL | 322 | 157 | 1.62 (1.30–2.02) | ||||
>5 ng/mL | 49 | 29 | 2.72 (1.83–4.03) | ||||
BRAF and MMR status | BRAFWT/pMMR | 4488 | 212 | Ref | 0.42 | Ref | 0.22 |
BRAFmut/pMMR | 51 | 22 | 1.08 (0.70–1.68) | 1.03 (0.65–1.64) | |||
BRAFWT/dMMR | 28 | 8 | 0.62 (0.31–1.26) | 0.65 (0.32–1.32) | |||
BRAFmut/dMMR | 12 | 3 | 0.57 (0.18–1.78) | 0.36 (0.11–1.14) | |||
T stage | T3 | 977 | 384 | 1.90 (1.30–2.80) | <0.0001 | ||
T4 | 250 | 131 | 2.79 (1.85–4.19) | ||||
N stage | N2 | 460 | 237 | 1.89 (1.59–2.24) | <0.0001 | 1.93 (1.49–2.50) | <0.0001 |
NOTE: Univariate variables that reached statistical significance (P < 0.10) or that did not reach significance but are known to be strongly associated with DFS were included in the multivariable model. A final multivariable model (displayed here) was realized with the statistically significant variables of the intermediate model.
Abbreviations: dMMR, deficient mismatch repair; evt, events; LV5FU2, 5-fluorouracil chemotherapy; mut, mutated; N stage, nodal stage according to the TNM classification; pMMR, proficient mismatch repair; pts, patients; T stage, tumor stage according to the TNM classification; WT, wild type.
. | . | . | . | Univariate . | Final multivariate (N patients = 531) . | ||
---|---|---|---|---|---|---|---|
. | . | N pts . | N evt . | HR (95% CI) . | P . | HR (95% CI) . | P . |
Age | >70 | 190 | 82 | 1.375 (1.08–1.75) | 0.009 | 1.62 (1.13–2.31) | 0.008 |
Gender | Male | 726 | 254 | 1.046 (0.87–1.26) | 0.63 | ||
Treatment arm | LV5FU2 | 675 | 250 | 1.255 (1.04–1.51) | 0.02 | ||
Tumor location | Right colon | 452 | 165 | 1.18 (0.97–1.42) | 0.09 | ||
Histoprognostic grade | G3/4 | 201 | 85 | 1.52 (1.20–1.93) | <0.0001 | 1.91 (1.25–2.91) | 0.003 |
Performance status | ≥ 1 | 970 | 335 | 1.08 (0.88–1.33) | 0.47 | ||
Intestinal perforation | Yes | 75 | 38 | 1.886 (1.35–2.63) | <0.0001 | ||
Intestinal obstruction | Yes | 260 | 113 | 1.494 (1.21–1.85) | <0.0001 | ||
CEA level, ng/mL | < 1.30 | 630 | 179 | Ref | Ref | 0.0005 | |
1.30 to 5 | 613 | 240 | 1.50 (1.23–1.82) | <0.0001 | 1.73 (1.27–2.34) | ||
> 5 | 49 | 26 | 2.58 (1.71–3.89) | 2.35 (1.26–4.39) | |||
CEA level, ng/mL | 0 to 1 | 492 | 138 | Ref | <0.0001 | ||
1 to 2 | 429 | 146 | 1.24 (0.99–1.57) | ||||
2 to 5 | 322 | 135 | 1.67 (1.32–2.12) | ||||
> 5 | 49 | 26 | 2.60 (1.71–3.96) | ||||
BRAF and MMR status | BRAFWT/pMMR | 488 | 170 | Ref | 0.46 | Ref | 0.03 |
BRAFmut/pMMR | 51 | 20 | 1.31 (0.83–2.09) | 1.37 (0.83–2.25) | |||
BRAFWT/dMMR | 28 | 7 | 0.71 (0.33–1.51) | 0.57 (0.26–1.26) | |||
BRAFmut/dMMR | 12 | 3 | 0.73 (0.23–2.28) | 0.19 (0.04–0.78) | |||
T stage | T3 | 977 | 320 | 2.232 (1.421–3.507) | <0.0001 | 2.18 (0.95–4.97) | 0.005 |
T4 | 250 | 119 | 3.661 (2.279–5.881) | 3.30 (1 ?40–7.74) | |||
N stage | N2 | 460 | 207 | 1.96 (1.63–2.357) | <0.0001 | 1.97 (1.47–2.65) | <0.0001 |
. | . | . | . | Univariate . | Final multivariate (N patients = 531) . | ||
---|---|---|---|---|---|---|---|
. | . | N pts . | N evt . | HR (95% CI) . | P . | HR (95% CI) . | P . |
Age | >70 | 190 | 82 | 1.375 (1.08–1.75) | 0.009 | 1.62 (1.13–2.31) | 0.008 |
Gender | Male | 726 | 254 | 1.046 (0.87–1.26) | 0.63 | ||
Treatment arm | LV5FU2 | 675 | 250 | 1.255 (1.04–1.51) | 0.02 | ||
Tumor location | Right colon | 452 | 165 | 1.18 (0.97–1.42) | 0.09 | ||
Histoprognostic grade | G3/4 | 201 | 85 | 1.52 (1.20–1.93) | <0.0001 | 1.91 (1.25–2.91) | 0.003 |
Performance status | ≥ 1 | 970 | 335 | 1.08 (0.88–1.33) | 0.47 | ||
Intestinal perforation | Yes | 75 | 38 | 1.886 (1.35–2.63) | <0.0001 | ||
Intestinal obstruction | Yes | 260 | 113 | 1.494 (1.21–1.85) | <0.0001 | ||
CEA level, ng/mL | < 1.30 | 630 | 179 | Ref | Ref | 0.0005 | |
1.30 to 5 | 613 | 240 | 1.50 (1.23–1.82) | <0.0001 | 1.73 (1.27–2.34) | ||
> 5 | 49 | 26 | 2.58 (1.71–3.89) | 2.35 (1.26–4.39) | |||
CEA level, ng/mL | 0 to 1 | 492 | 138 | Ref | <0.0001 | ||
1 to 2 | 429 | 146 | 1.24 (0.99–1.57) | ||||
2 to 5 | 322 | 135 | 1.67 (1.32–2.12) | ||||
> 5 | 49 | 26 | 2.60 (1.71–3.96) | ||||
BRAF and MMR status | BRAFWT/pMMR | 488 | 170 | Ref | 0.46 | Ref | 0.03 |
BRAFmut/pMMR | 51 | 20 | 1.31 (0.83–2.09) | 1.37 (0.83–2.25) | |||
BRAFWT/dMMR | 28 | 7 | 0.71 (0.33–1.51) | 0.57 (0.26–1.26) | |||
BRAFmut/dMMR | 12 | 3 | 0.73 (0.23–2.28) | 0.19 (0.04–0.78) | |||
T stage | T3 | 977 | 320 | 2.232 (1.421–3.507) | <0.0001 | 2.18 (0.95–4.97) | 0.005 |
T4 | 250 | 119 | 3.661 (2.279–5.881) | 3.30 (1 ?40–7.74) | |||
N stage | N2 | 460 | 207 | 1.96 (1.63–2.357) | <0.0001 | 1.97 (1.47–2.65) | <0.0001 |
NOTE: Univariate variables that reached statistical significance (P < 0.10) or that did not reach significance but are known to be strongly associated with DFS were included in the multivariable model. A final multivariable model (displayed here) was realized with the statistically significant variables of the intermediate model.
Abbreviations: dMMR, deficient mismatch repair; evt, events; LV5FU2, 5-fluorouracil chemotherapy; mut, mutated; N stage, nodal stage according to the TNM classification; pMMR, proficient mismatch repair; pts, patients; T stage, tumor stage according to the TNM classification; WT, wild type.
In the sensitivity analysis, multiple imputation procedure provided analogous results in the discovery cohort (Supplementary Tables S3 and S4). All results were replicated in the external validation cohort (Table 3; Supplementary Table S5).
. | . | HR (95% CI) . | P . |
---|---|---|---|
Age | >70 | 1.27 (0.97–1.66) | 0.08 |
CEA level | <1.30 ng/mL | Ref | <0.0001 |
1.30 to 5 ng/mL | 1.40 (1.16–1.68) | ||
>5 ng/mL | 2.69 (1.89–3.82) | ||
BRAF and MMR status | BRAFWT/pMMR | Ref | 0.09 |
BRAFmut/pMMR | 1.11 (0.81–1.51) | ||
BRAFWT/dMMR | 0.88 (0.60–1.29) | ||
BRAFmut/dMMR | 0.49 (0.27–0.89) | ||
N stage | N2 | 2.18 (1.82–2.60) | <0.0001 |
. | . | HR (95% CI) . | P . |
---|---|---|---|
Age | >70 | 1.27 (0.97–1.66) | 0.08 |
CEA level | <1.30 ng/mL | Ref | <0.0001 |
1.30 to 5 ng/mL | 1.40 (1.16–1.68) | ||
>5 ng/mL | 2.69 (1.89–3.82) | ||
BRAF and MMR status | BRAFWT/pMMR | Ref | 0.09 |
BRAFmut/pMMR | 1.11 (0.81–1.51) | ||
BRAFWT/dMMR | 0.88 (0.60–1.29) | ||
BRAFmut/dMMR | 0.49 (0.27–0.89) | ||
N stage | N2 | 2.18 (1.82–2.60) | <0.0001 |
Abbreviations: dMMR, deficient mismatch repair; mut, mutated; N stage, nodal stage according to the TNM classification; pMMR, proficient mismatch repair; WT, wild type.
Additional value of postoperative CEA for DFS risk stratification within TNM risk subgroups
In the discovery cohort, 729 (54.3%) and 614 (45.7%) patients had low-risk (T3N1) and high-risk (T4 or N2) stage III colon cancer, respectively. As expected, DFS differed according to these subgroups (Supplementary Fig. S9). Among the low and high-risk subgroups, the postoperative CEA level classification allowed to identify populations with different DFS and OS risk profiles (Fig. 3; Supplementary Figs. S9 and S10). Similar results were obtained in the external validation cohort for both DFS and OS (Fig. 3; Supplementary Fig. S9).
Chemotherapy regimen benefit across CEA risk groups
In the discovery cohort, the survival benefit from the addition of oxaliplatin to 5-fluorouracil was not different across different CEA groups in term of DFS and OS. No statistically significant interaction between CEA groups and treatment the arm (5-FU/LV vs. FOLFOX) was found (P = 0.5371 and P = 0.6761, respectively; Supplementary Fig. S11).
In the validation cohort, no interaction between the CEA risk groups and the treatment arm (cetuximab-FOLFOX vs. FOLFOX) in terms of DFS and OS was found (P = 0.8629 and P = 0.2412, respectively).
Discussion
This study confirmed strong prognostic value of postoperative CEA level for DFS and OS in two large cohorts of patients with stage III colon cancer treated with adjuvant chemotherapy. Patient with a CEA level of >5 ng/mL had a worse risk profile for recurrence and death. Distinct risk profiles were identified in patients with CEA ≤ 5 ng/mL currently considered as a single population with normal CEA values.
One of the questions that arises from our results is the choice of the CEA cutoff for risk prediction. Kim and colleagues showed in their recent retrospective study of 965 patients with stage III colon cancer that the optimal cutoff for preoperative CEA was 3 ng/mL in the training and validation sets (19). Other studies evaluating preoperative CEA also showed that colon cancer patients with CEA levels inferior to 2.5 or 2.7 ng/mL had better OS and DFS (29–31). Recently, in a large study of 45,449 patients with stage I and II colon cancer not receiving adjuvant chemotherapy, preoperative CEA level of 2.35 ng/mL was defined as the best cutoff to predict survival (16). The results of these studies suggest that a CEA value of 5 ng/mL is not optimal. We identified considerable DFS and OS heterogeneity in the population with postoperative CEA ≤ 5 ng/mL, providing new data in favor of a lower CEA cutoff in the postoperative setting for stage III colon cancer. Defining a lower cutoff for postoperative CEA will allow to construct more accurate prediction models and will better stratify patients for adjuvant treatment.
Nicholson and colleagues showed that the sensitivity of a CEA level of 2.5 ng/mL is better than the standard 5 ng/mL value (82% vs. 71%), but with rather low specificity (80% vs. 88%; ref. 32) Nevertheless, in adjuvant colorectal cancer, higher sensitivity is more relevant for detecting disease recurrence. If the sensitivity is good for a lower CEA cutoff, the question about reproducibility of the measures below 5 ng/mL has to be addressed. Analytic sensitivity of most automated immunoassay systems for measuring CEA is under 1 ng/mL. For instance, analytic sensitivity of the Abbott immunoassay used in our center is 0.5 ng/mL, whereas for Beckmann immunoassays, which are also widely used, is less than 0.7 ng/mL. Thus, reliable management decisions can be made with CEA levels below 5 ng/mL.
Recently, molecular and immunologic prognostic factors in predicting colon cancer patients' prognosis have been identified (33–36). In our study, the postoperative CEA performance seems similar to these molecular/immunologic scores for DFS and OS risk stratification in patients with stage III colon cancer. These observations and the fact that CEA is a low-cost biological tumor marker reinforce its interest.
With the breakthrough of liquid biopsy, circulating tumor DNA (ctDNA) has been assessed in early-stage colon cancer (37–39). Very high prognostic value of ctDNA for OS and RFS has been showed in very small cohorts (37, 39). However, no study to date has assessed the prognostic value of ctDNA in stage III colon cancer. Currently, three studies are ongoing (NCT03416478, NCT02842203, NCT03312374). Our results suggest that prognostic value of ctDNA should absolutely be evaluated together with CEA as a covariate in multivariate analysis. Similarly, Lu and colleagues showed that combining postoperative CEA levels assessment with the presence of postoperative circulating tumor cells may be valuable tool in predicting early relapse and survival rate in patients with stage II and III colon cancer (18). Such combination was also found to be of interest for better disease prediction in the study of non–small cell lung cancer (40).
We identified the additional prognostic value of postoperative CEA for survival endpoints among conventional TNM risk classification. Recently, the results of the IDEA study suggested that shorter adjuvant chemotherapy could be administered in the subgroup of patients with T3/N1 disease (7–10). However, given that these results are still debated, postoperative CEA level might assist in clinical decision-making in this setting.
Although prediction tools already exist in localized colon cancer (41–44), these do not include postoperative CEA level. In the light of our results, a careful consideration of this measurement should be made in scoring tools to predict DFS and OS for localized colon cancer in the future.
Our study has several strengths. We evaluated postoperative CEA in adjuvant stage III colon cancer that was of more interest than preoperative CEA. This study was constructed in the framework of two large international multicenter phase III trials offering a broad spectrum of reliable parameters including molecular status at inclusion and the largest postoperative CEA assessment data in stage III colon cancer. Moreover, patients in these studies were treated with 5-fluorouracil and leucovorin in combination with oxaliplatin in majority of cases (50%). We built our analysis in a rigorous methodologic framework respecting the transparent reporting of the multivariate model TRIPOD statement (25). Showing the same results for both cohorts, we confirmed the robustness of our data.
The current study also has some limitations. Given that CEA level was an inclusion criterion in the MOSAIC and PETACC-8 trials, this analysis includes a low number of patients with high CEA. Importantly, CEA level may be affected by factors that were not available in our cohorts, mainly smoking status and other comorbidities (45). However, the robustness and the magnitude of the association observed support the notion of a cancer-derived effect on the outcomes. A second potential limitation is that in our study CEA assessment is established within different ranges, according to each manufacture, and most of the time low threshold values are considered as insufficiently sensitive. The large DFS and OS differences between subgroups of patient with CEA <5 ng/mL suggest that the findings apply across CEA values obtained from a broad range of laboratories. Despite the heterogeneity of the adjuvant regimens received and the multiples laboratories that performed the CEA measurement, postoperative CEA identified in the overall population was still relevant in all subgroups.
To conclude, in these two large cohorts, we showed that the postoperative CEA level is a highly independent prognostic factor for DFS and OS. We further confirmed that patients with CEA ≤5 ng/mL should not be considered as a single-risk population. These results support a routine postoperative CEA testing, suggesting its future interest for optimizing clinical trials design and for defining risk-adapted and personalized strategies for stage III colon cancer management.
Disclosure of Potential Conflicts of Interest
G. Folprecht reports receiving a commercial research grant from Merck-Serono (for a clinical study), has received speakers bureau honoraria from Merck-Serono, Roche/Genentech, Sanofi-Aventis, Lilly, Amgen, and is a consultant/advisory board member for Merck-Serono, Bayer, MSD, BMS, Shire, Roche/Genentech, Servier, Laboratoire HRA Pharma, Amgen, Mundipharma. J. Tabernero is a consultant/advisory board member for Array Biopharma, AstraZeneca, Bayer, BeiGene, Boehringer Ingelheim, Chugai, Genentech, Inc., Genmab A/S, Halozyme, Imugene Limited, Inflection Biosciences Limited, Ipsen, Kura Oncology, Lilly, MSD, Menarini, Merck Serono, Merrimack, Merus, Molecular Partners, Novartis, Peptomyc, Pfizer, Pharmacyclics, ProteoDesign SL, Rafael Pharmaceuticals, F. Hoffmann-La Roche Ltd, Sanofi, SeaGen, Seattle Genetics, Servier, Symphogen, Taiho, VCN Biosciences, Biocartis, Foundation Medicine, HalioDX SAS, and Roche Diagnostics. T. Hickish is a medical director at IQHealth Tech and is a consultant/advisory board member for Sobi International and Lilly. T. André has received speakers bureau honoraria from Sanofi and is a consultant/advisory board member for Roche. D. Vernerey is a consultant/advisory board member for OSE, HalioDX, Pfizer, and Jansen. No potential conflicts of interest were disclosed by the other authors.
Authors' Contributions
Conception and design: E. Auclin, J. Taieb, C. Louvet, J. Tabernero, D. Vernerey
Development of methodology: E. Auclin, J. Taieb, D. Vernerey
Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): C. Lepage, T. Aparicio, R. Faroux, E. Mini, G. Folprecht, R. Salazar, M. Banzi, C. Louvet, J.-L. Van Laethem, J. Tabernero, T. Hickish, A. de Gramont, T. André
Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): E. Auclin, J. Taieb, E. Mini, R. Salazar, M. Benetkiewicz, J. Tabernero, T. André, D. Vernerey
Writing, review, and/or revision of the manuscript: E. Auclin, J. Taieb, C. Lepage, T. Aparicio, R. Faroux, E. Mini, G. Folprecht, R. Salazar, M. Benetkiewicz, C. Louvet, J.-L. Van Laethem, J. Tabernero, A. de Gramont, T. André, D. Vernerey
Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): J. Tabernero, T. André
Study supervision: J. Tabernero, D. Vernerey
Other (providing final approval of the version to be submitted): M. Benetkiewicz
Acknowledgments
We thank Pr. Jean-Paul Feugeas and Dr. Laurent Bermont for their help regarding the automated immunoassay systems.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.