Abstract
Purpose: To establish the concept of minimum clinically meaningful outcome (mCMO) of treatment in advanced solid tumors, to establish its threshold and evaluate how many superiority trials of new antineoplastic agents pass this threshold.
Experimental Design: We chose overall survival as the primary indicator of patient benefit. Four conceptually different types of treatment effect can be identified in OS curves: HR, gains in median OS, proportional, and absolute increases at long-term OS. We postulated threshold levels for these four parameters defining the mCMO and set the bar at three different levels of required benefit: high, medium, and low. The postulated values were then studied by comparing our thresholds with the actual results of the pivotal superiority phase III trials on new drugs reporting on mature OS data.
Results: Forty-three trials on 35,419 patients in 12 cancer types on 23 novel agents met these criteria. Only two trials reached the postulated “high” thresholds for HR and median OS. The number of “positive trials” increased to eight and 15 when the bar was lowered to the “medium” and “low” levels, respectively. The same analysis was done for proportional and absolute increases in long-term OS. No trial satisfied the criteria for long-term benefit, whereas only two and nine trials satisfied both parameters for the “medium and low” required benefit levels, respectively.
Conclusions: All four OS-related parameters contribute to define the mCMO. If the bar for the mCMO is raised too much, positive trials are exceptional. Clin Cancer Res; 21(5); 1036–43. ©2014 AACR.
See related commentary by Schilsky, p. 947
This article is featured in Highlights of This Issue, p. 937
We identify here four parameters of efficacy that can be lumped under two categories: those summarizing the average benefit (HR and median gains in overall survival; OS) and those reflecting a large benefit for a minority of patients (absolute or proportional increase in long-term survivors). The translational value of this paper is only indirect: molecular predictors of efficacy, where available, impact on both categories of endpoints, but in general, emphasis is primarily on the classical parameters of HR and median values. If the predictor is strong, the long term OS-related summary parameters should receive more emphasis in clinical trials, along the line of the “raising the bar philosophy.”
Introduction
Two general problems are increasingly recognized and debated with regard to the improvements in cancer treatment: their small incremental size and the high price of the new drugs. The two problems are connected because modern clinical trials in advanced solid tumors have become larger and larger (with associated increasing costs), resulting in statistically significant findings being achieved with smaller and smaller observed treatment effects. This drug-development strategy has been based on a rationale, risk-minimizing philosophy by the pharmaceutical industry; however, we suggest that we are approaching (or have already passed) an inflection point where continued pursuit of this strategy is not optimally productive.
Modest benefits could be considered worthwhile if associated with moderate costs and toxicity, whereas a new drug with a very high cost and/or substantial toxicity is worthwhile only if it produces sizeable clinical benefits. Hence, the relevance of statistical significance has increasingly been challenged when the treatment effect is small (1, 2). The simple solution would be to raise the bar of efficacy for approving antineoplastic agents (3, 4). As supporters of this concept (3), here we try to expand it by describing a structured approach to this problem in advanced solid tumors. We also apply our derived model to a large sample of the most important pivotal trials on biologic agents published during the last 15 years, to evaluate which of them would meet the criteria for success according to various threshold values of several efficacy parameters.
Materials and Methods
The definition of minimum clinically meaningful outcome
If a new treatment is to be introduced into clinical practice in the setting of “superiority” to an existing treatment, it is not sufficient to demonstrate that it is “better” than standard therapy. It should be necessary to demonstrate that its benefits outweigh its adverse effects and costs. If the benefits are expressed in terms of overall survival (OS), the increase in survival that balances the harms/costs of the treatment represents a threshold, the minimum clinically meaningful outcome (mCMO). This value should be considered as a cutoff between what is and what is not clinically meaningful. Until the treatment effect is shown with statistical rigor to be larger than this threshold, the treatment should still be considered experimental.
Factors affecting the mCMO
The extent of the benefit identifying mCMO may be a function of three factors: the prognosis of patients with that condition, the toxicity/inconveniences of the treatment, and its cost. In addition, the extent of benefit may have different relevance according to the endpoint used (OS, PFS, or others) and the way it is expressed (HR, absolute median gains, etc.). In general, depending on the prognosis and the setting of the disease, there is agreement that OS or PFS are the endpoints to be used in advanced solid cancers, with the first one being preferred whenever practical. Here, we chose OS. Extending the approach to alternative endpoints, such as PFS, is feasible, but involves a substantial increase in the number and complexity of the issues to be addressed. For instance, to allow comparisons between the effects of different drugs and to provide estimates of benefit readily understandable by patients and health administrators, it should be possible to translate effects on PFS into effects on OS, which is far from simple. For this reason, we included in our analysis only those trials where OS mature data were reported.
Structured approach to define the mCMO
We first focused on the identification of the best candidate parameters for the definition of the mCMO. Once those parameters were identified, we defined the threshold levels for each prognostic condition. We then arbitrarily set the bar of the mCMO at three different levels of required benefit (high, medium, and low), assuming that it may be simple to adapt these thresholds to different toxicity and cost categories (low, medium, and high). For example, if a medium level of benefit is required, the threshold values could be maintained for a medium level of toxicity and cost, but these values could be raised to high if the toxicity or cost is high or lowered to low if toxicity or cost is low. In this way, this model could grossly implement all three major determinants of benefit.
Parameters defining the thresholds of mCMO
Although HR is the parameter used in the design and interpretation of clinical trials, other summary statistical indices are also important. The median gain in OS is a straightforward figure that has the advantage of being easily understandable by patients and other stakeholders. The late effects of treatment have the same advantage, no matter whether expressed in terms of absolute increase in the survival rate at 2, 3, or 5 years (usually not so impressive, typically in the range of 5%–15%) or in terms of proportional increase in OS rates at 2, 3, or 5 years (usually more impressive figures, typically in the range of 20%–50%). Therefore four OS-related parameters may contribute to define the mCMO (Fig. 1): HR, gains in median OS, absolute, and proportional gains in the long-term survival rates, representing two conceptually different types of treatment effects: a small benefit for many (SMALL) and a large benefit for few (LARGE). SMALL may be measured both in terms of HR and gains in median OS, whereas LARGE may be more appropriately measured by proportional and or absolute increase in 2, 3, or 5-year OS. Three out of four parameters (median gain, absolute, or proportional gain in OS rates) are in general easily understandable by patients. This would provide extra value by using a language that is appropriate for the patient-doctor relationship.
A “model” of Kaplan–Meier figure showing the four OS-related parameters.
Raising the bar: the choice of threshold values for mCMO
We have considered two determinants for each of these four OS-related parameters: the extent of the required benefit and the prognosis of the condition under study.
Extent of the required benefit: We have proposed three potential levels of the benefit representing the mCMO when a low, medium, or high required benefit is the desired target (Table 1). The “high” level might be used when the toxicity and/or the cost of the new drug are high, the “low” level may apply to drugs that are both nontoxic/safe and inexpensive, whereas the “medium” level could be used in intermediate cases. The threshold values proposed for the “low” level are higher than many ‘target’ values used in current ongoing phase III trials, in keeping with the general concept of “raising the bar” in oncology.
Prognosis of the condition under study: Because the mCMO also depends on the prognosis of the condition under study, we have identified four prognostic categories, based on the median OS observed with standard therapies: <9 months, 9 to 12, 12 to 18, and > 18 months. These prognostic cutoff values were selected after a series of attempts to have a sizable and rather homogeneous number of trials in each prognostic category, but still reflecting the sense of poor, medium, and good prognosis.
The three proposed levels of the required benefit (mCMO), high, medium, and low (A, B, and C) as a function of SMALL (small benefit for many patients), LARGE (large benefit for few patients) and prognosis with standard treatment.
. | Small benefit for many . | Large benefit for few . | ||
---|---|---|---|---|
Prognosis . | HR . | GAIN in median OS . | Absolute increase . | Proportional increase . |
A. High threshold for mCMO | ||||
<9 mo | 0.60 | 3 mo | 15% | 100% |
9–12 mo | 0.65 | 4 mo | 15% | 100% |
12–18 mo | 0.65 | 5 mo | 15% | 100% |
>18 mo | 0.70 | 6 mo | 15% | 100% |
B. Medium threshold for mCMO | ||||
<9 mo | 0.65 | 2.5 mo | 10% | 50% |
9–12 mo | 0.70 | 3 mo | 10% | 50% |
12–18 mo | 0.70 | 3.5 mo | 10% | 50% |
>18 mo | 0.75 | 4 mo | 10% | 50% |
C. Low threshold for mCMO | ||||
<9 mo | 0.70 | 2 mo | 5% | 25% |
9–12 mo | 0.75 | 2.5 mo | 5% | 25% |
12–18 mo | 0.75 | 3 mo | 5% | 25% |
>18 mo | 0.80 | 3.5 mo | 5% | 25% |
. | Small benefit for many . | Large benefit for few . | ||
---|---|---|---|---|
Prognosis . | HR . | GAIN in median OS . | Absolute increase . | Proportional increase . |
A. High threshold for mCMO | ||||
<9 mo | 0.60 | 3 mo | 15% | 100% |
9–12 mo | 0.65 | 4 mo | 15% | 100% |
12–18 mo | 0.65 | 5 mo | 15% | 100% |
>18 mo | 0.70 | 6 mo | 15% | 100% |
B. Medium threshold for mCMO | ||||
<9 mo | 0.65 | 2.5 mo | 10% | 50% |
9–12 mo | 0.70 | 3 mo | 10% | 50% |
12–18 mo | 0.70 | 3.5 mo | 10% | 50% |
>18 mo | 0.75 | 4 mo | 10% | 50% |
C. Low threshold for mCMO | ||||
<9 mo | 0.70 | 2 mo | 5% | 25% |
9–12 mo | 0.75 | 2.5 mo | 5% | 25% |
12–18 mo | 0.75 | 3 mo | 5% | 25% |
>18 mo | 0.80 | 3.5 mo | 5% | 25% |
NOTE: Proposed absolute and proportional increase refer to OS gains at 2 or 3 years when the prognosis is below or above 12 months, respectively.
The threshold values for the four key parameters were arbitrarily selected using the following criteria:
HR for OS: The maximum variation within each level of required benefit was set at 0.1. The HR variation within the same prognostic category was 0.05.
Gains in Median OS: The minimum and maximum survival benefit was set at 2 and 6 months, respectively, as a function of the prognostic category, to reflect the perception that no benefit of less than 2 months and any benefit more than 6 months are in general considered clinically worthwhile, regardless of the underlying prognosis, toxicity, and cost of the treatment.
Absolute gains in late OS: The 5-year OS values would have fully satisfied the concept of LARGE, but almost no trials report such late survival results in the advanced setting. Therefore, we considered 2 or 3 years as late OS time points whenever the prognosis is below or above 12 months, respectively. Ranges between 2.5% and 20% were considered, and the range between 5% and 15% was chosen because little difference was observed between 2.5% and 5% and the selection afforded by 15% was already very high.
Proportional increase in 2 or 3 year OS: After testing various ranges, values between 25% and 100% were chosen because these values reflects concepts of benefit that may be easily conveyed to the patients and to nonmedical stakeholders.
Field testing of the model: selection of the clinical trials
The postulated values defining mCMO for SMALL and LARGE were considered relative to the actual results of pivotal trials on new agents. These trials were selected according to the following criteria:
Randomized phase III
Advanced solid tumors (palliative intent of treatment)
Published in the last 15 years
Reporting OS, whether as primary or secondary endpoint.
For example, the trial on afatinib was not selected because OS data are not yet mature and only PFS data are reported in the published registration trial (5). Forty-three pivotal trials on 35,419 patients in 12 cancer types on 23 new drugs (6–49) were considered and the consequences of raising the bar of the mCMO on the outcome of the 43 trials are presented below.
Results
The consequences of raising the bar of the mCMO on the ranking of currently available drugs for advanced solid tumors: “SMALL”
Because it would be desirable to meet both criteria defining SMALL (HR for OS and gains in median OS), we have reported the number of trials that would meet both conditions (column “both” in Table 2). Only two trials reached the postulated “high” threshold values (Table 2). The number of “positive trials” increased to 8 and 15 when the bar was lowered to the “medium” and “low” levels, respectively (Table 2).
mCMO for the high (A), medium (B), and low (C) required benefit for SMALL: number of trials meeting these criteria for success out of the 43 pivotal randomized studies analyzed according to the selection criteria described in the text
. | . | Criteria . | Number of positive trials . | ||||
---|---|---|---|---|---|---|---|
Prognosis in control arm . | Trials analyzed (n) . | HR . | Median gain in OS . | HR . | Median gain in OS . | “Either” parameter . | “Both” parameters . |
A. High threshold for mCMO: SMALL | |||||||
<9 mo | 9 | 0.60 | 3 mo | 1 | 3 | 3 | 1 |
9–12 mo | 8 | 0.65 | 4 mo | 0 | 1 | 1 | 0 |
12–18 mo | 9 | 0.65 | 5 mo | 1 | 0 | 1 | 0 |
>18 mo | 17 | 0.70 | 6 mo | 2 | 2 | 3 | 1 |
Total | 43 | 4 | 6 | 8 | 2 | ||
B. Medium threshold for mCMO: SMALL | |||||||
<9 mo | 9 | 0.65 | 2.5 mo | 1 | 4 | 4 | 1 |
9–12 mo | 8 | 0.70 | 3 mo | 2 | 3 | 3 | 2 |
12–18 mo | 9 | 0.70 | 3.5 mo | 3 | 4 | 5 | 2 |
>18 mo | 17 | 0.75 | 4 mo | 3 | 6 | 6 | 3 |
Total | 43 | 9 | 17 | 18 | 8 | ||
C. Low threshold for mCMO: SMALL | |||||||
<9 mo | 9 | 0.70 | 2 mo | 4 | 5 | 5 | 4 |
9–12 mo | 8 | 0.75 | 2.5 mo | 5 | 4 | 5 | 4 |
12–18 mo | 9 | 0.75 | 3 mo | 4 | 5 | 6 | 3 |
>18 mo | 17 | 0.80 | 3.5 mo | 4 | 7 | 7 | 4 |
Total | 43 | 17 | 21 | 23 | 15 |
. | . | Criteria . | Number of positive trials . | ||||
---|---|---|---|---|---|---|---|
Prognosis in control arm . | Trials analyzed (n) . | HR . | Median gain in OS . | HR . | Median gain in OS . | “Either” parameter . | “Both” parameters . |
A. High threshold for mCMO: SMALL | |||||||
<9 mo | 9 | 0.60 | 3 mo | 1 | 3 | 3 | 1 |
9–12 mo | 8 | 0.65 | 4 mo | 0 | 1 | 1 | 0 |
12–18 mo | 9 | 0.65 | 5 mo | 1 | 0 | 1 | 0 |
>18 mo | 17 | 0.70 | 6 mo | 2 | 2 | 3 | 1 |
Total | 43 | 4 | 6 | 8 | 2 | ||
B. Medium threshold for mCMO: SMALL | |||||||
<9 mo | 9 | 0.65 | 2.5 mo | 1 | 4 | 4 | 1 |
9–12 mo | 8 | 0.70 | 3 mo | 2 | 3 | 3 | 2 |
12–18 mo | 9 | 0.70 | 3.5 mo | 3 | 4 | 5 | 2 |
>18 mo | 17 | 0.75 | 4 mo | 3 | 6 | 6 | 3 |
Total | 43 | 9 | 17 | 18 | 8 | ||
C. Low threshold for mCMO: SMALL | |||||||
<9 mo | 9 | 0.70 | 2 mo | 4 | 5 | 5 | 4 |
9–12 mo | 8 | 0.75 | 2.5 mo | 5 | 4 | 5 | 4 |
12–18 mo | 9 | 0.75 | 3 mo | 4 | 5 | 6 | 3 |
>18 mo | 17 | 0.80 | 3.5 mo | 4 | 7 | 7 | 4 |
Total | 43 | 17 | 21 | 23 | 15 |
Table 2 also reports those trials meeting either criteria (column “either” in Table 2). The corresponding figures for the three levels of required benefit were 8, 18, and 23 positive trials out of 43. In addition, the table also reports which specific parameter each trial met (columns HR and median gain in OS). In general, meeting the postulated median gains was more common than achieving the HR thresholds.
Table 3 reports which drug would pass the thresholds for the different required benefit levels. It may be noted that trastuzumab, clearly a paradigm-changing agent for breast cancer, exceeds only the ‘medium’ and ‘low’ required benefit level, based upon the pivotal registration trial data (12). This underlines how high the proposed “high” threshold levels are.
Currently available drugs and their indications for advanced solid tumors passing the postulated OS threshold values for the mCMO in the 43 pivotal trials analyzed: “SMALL”
Level of required benefit . | Prognostic group . | Drugs meeting both criteria of median gain and HR for OS . | Drugs meeting either criteria of median gain or HR for OS . |
---|---|---|---|
High | <9 mo | Cetuximab-colon (30) | Temsirolimus-kidney (MG; ref. 6); Ipilimumab-melanoma (MG; ref. 40); Cetuximab-colon (HR +MG; ref. 30) |
9–12 mo | — | Abiraterone-prostate (MG; ref. 42) | |
12–18 mo | — | Enzalutamide-prostate (HR; ref. 44) | |
>18 mo | Bevacizumab-ovarian (46) | Tdm1-breast (HR; ref. 15); Cetuximab-head and neck (MG; ref. 38); Bevacizumab-ovarian (HR +MG; ref. 46) | |
Medium | <9 mo | Cetuximab-colon (30) | Sorafenib-HCC (ref. 37); Temsirolimus-kidney (MG; ref. 6); Ipilimumab-melanoma (MG; ref. 40); Cetuximab-colon (HR +MG; ref. 30) |
9–12 mo | Vemurafenib-melanoma (39); Radium223-prostate (45) | Vemurafenib-melanoma (HR +MG; ref. 39); Radium223-prostate (HR +MG; ref. 45); Abiraterone-prostate (MG; ref. 42) | |
12–18 mo | Enzalutamide-prostate (44); Bevacizumab-colon (23) | Cabazitaxel-prostate (HR; ref. 43); Bevacizumab-colon (MG; ref. 26); Bevacizumab-colon (HR +MG; ref. 23); Sorafenib-kidney (MG; ref. 49); Enzalutamide-prostate (HR +MG; ref. 44) | |
>18 mo | Tdm1-breast (15); Bevacizumab-ovarian (46); Cetuximab-head and neck (38) | Bevacizumab-ovarian (HR +MG; ref. 46); Panitumumab-colon (MG; ref. 29); Sunitinib-kidney (MG; ref. 7); Trastuzumab-breast (MG; ref. 12); Tdm1-breast (HR +MG; ref. 15); Cetuximab-head and neck (HR +MG; ref. 38) | |
Low | <9 mo | Erlotinib-lung (17); Cetuximab-colon (30); Ipilimumab-melanoma (40); Sorafenib-hcc (37) | Erlotinib-lung (HR +MG; ref. 17); Temsirolimus-kidney (MG; ref. 6); Sorafenib-HCC (HR +MG; ref. 37); Ipilimumab-melanoma (HR +MG; ref. 40); Cetuximab-colon (HR +MG; ref. 30) |
9–12 mo | Abiraterone-prostate (42); Vemurafenib-melanoma (39); Trastuzumab-gastric (41); Radium223-prostate (45) | Radium223-prostate (HR +MG; ref. 45); Vemurafenib-melanoma (HR +MG; ref. 39); Bevacizumab-colon (HR; ref. 25); Trastuzumab-gastric (HR +MG; ref. 41); Abiraterone-prostate (HR +MG; ref. 42) | |
12–18 mo | Bevacizumab-colon (23); Bevacizumab-colon (24) Enzalutamide-prostate (44) | Cabazitaxel-prostate (HR; ref. 43); Bevacizumab-colon (MG; ref. 26); Bevacizumab-colon (HR +MG; ref. 24); Sorafenib-kidney (MG; ref. 49); Enzalutamide-prostate (HR +MG; ref. 44); Bevacizumab-colon (HR +MG; ref. 23) | |
>18 mo | Trastuzumab-breast (12); Tdm1-breast (15); Bevacizumab-ovarian (46) Cetuximab-head and neck (38) | Bevacizumab-ovarian (HR +MG; ref. 46); Tdm1-breast (HR +MG; ref. 15); Cetuximab-head and neck (HR +MG; ref. 38); Panitumumab-colon (MG; ref. 29); Sunitinib-kidney (MG; ref. 7); Cetuximab-colon (MG; ref. 31); Trastuzumab-breast (HR +MG; ref. 12) |
Level of required benefit . | Prognostic group . | Drugs meeting both criteria of median gain and HR for OS . | Drugs meeting either criteria of median gain or HR for OS . |
---|---|---|---|
High | <9 mo | Cetuximab-colon (30) | Temsirolimus-kidney (MG; ref. 6); Ipilimumab-melanoma (MG; ref. 40); Cetuximab-colon (HR +MG; ref. 30) |
9–12 mo | — | Abiraterone-prostate (MG; ref. 42) | |
12–18 mo | — | Enzalutamide-prostate (HR; ref. 44) | |
>18 mo | Bevacizumab-ovarian (46) | Tdm1-breast (HR; ref. 15); Cetuximab-head and neck (MG; ref. 38); Bevacizumab-ovarian (HR +MG; ref. 46) | |
Medium | <9 mo | Cetuximab-colon (30) | Sorafenib-HCC (ref. 37); Temsirolimus-kidney (MG; ref. 6); Ipilimumab-melanoma (MG; ref. 40); Cetuximab-colon (HR +MG; ref. 30) |
9–12 mo | Vemurafenib-melanoma (39); Radium223-prostate (45) | Vemurafenib-melanoma (HR +MG; ref. 39); Radium223-prostate (HR +MG; ref. 45); Abiraterone-prostate (MG; ref. 42) | |
12–18 mo | Enzalutamide-prostate (44); Bevacizumab-colon (23) | Cabazitaxel-prostate (HR; ref. 43); Bevacizumab-colon (MG; ref. 26); Bevacizumab-colon (HR +MG; ref. 23); Sorafenib-kidney (MG; ref. 49); Enzalutamide-prostate (HR +MG; ref. 44) | |
>18 mo | Tdm1-breast (15); Bevacizumab-ovarian (46); Cetuximab-head and neck (38) | Bevacizumab-ovarian (HR +MG; ref. 46); Panitumumab-colon (MG; ref. 29); Sunitinib-kidney (MG; ref. 7); Trastuzumab-breast (MG; ref. 12); Tdm1-breast (HR +MG; ref. 15); Cetuximab-head and neck (HR +MG; ref. 38) | |
Low | <9 mo | Erlotinib-lung (17); Cetuximab-colon (30); Ipilimumab-melanoma (40); Sorafenib-hcc (37) | Erlotinib-lung (HR +MG; ref. 17); Temsirolimus-kidney (MG; ref. 6); Sorafenib-HCC (HR +MG; ref. 37); Ipilimumab-melanoma (HR +MG; ref. 40); Cetuximab-colon (HR +MG; ref. 30) |
9–12 mo | Abiraterone-prostate (42); Vemurafenib-melanoma (39); Trastuzumab-gastric (41); Radium223-prostate (45) | Radium223-prostate (HR +MG; ref. 45); Vemurafenib-melanoma (HR +MG; ref. 39); Bevacizumab-colon (HR; ref. 25); Trastuzumab-gastric (HR +MG; ref. 41); Abiraterone-prostate (HR +MG; ref. 42) | |
12–18 mo | Bevacizumab-colon (23); Bevacizumab-colon (24) Enzalutamide-prostate (44) | Cabazitaxel-prostate (HR; ref. 43); Bevacizumab-colon (MG; ref. 26); Bevacizumab-colon (HR +MG; ref. 24); Sorafenib-kidney (MG; ref. 49); Enzalutamide-prostate (HR +MG; ref. 44); Bevacizumab-colon (HR +MG; ref. 23) | |
>18 mo | Trastuzumab-breast (12); Tdm1-breast (15); Bevacizumab-ovarian (46) Cetuximab-head and neck (38) | Bevacizumab-ovarian (HR +MG; ref. 46); Tdm1-breast (HR +MG; ref. 15); Cetuximab-head and neck (HR +MG; ref. 38); Panitumumab-colon (MG; ref. 29); Sunitinib-kidney (MG; ref. 7); Cetuximab-colon (MG; ref. 31); Trastuzumab-breast (HR +MG; ref. 12) |
Abbreviation: MG, median gain next to each agent indicates which specific parameter of SMALL was met in each trial.
The consequences of raising the bar of the mCMO on the ranking of currently available drugs for advanced solid tumors: “LARGE”
The same analysis as above was performed for LARGE, as measured by the absolute OS gains and/or proportional OS increase at 2 or 3 years (Tables 4 and 5). The analysis of these parameters was done at 2 years for conditions with poorer prognosis (<12 months median survival with standard treatment) and at 3 years for conditions with better prognosis (>12 months) on trials reporting more than 20 patients at risk at these late times. Most trials lack this information: the number of qualifying trials was 18: 11 for the <12 months groups, and seven for the better prognosis groups. The table reveals that there are no trials satisfying both the absolute and proportional increase for the “high” required benefit levels (Table 4, columns “both”). Setting the bar at the “medium and low” threshold level (Table 4, sum of the 2 columns “both”) allows two and nine trials, respectively, to reach the postulated levels. If we consider the instances where trials satisfied the threshold of either endpoint (sum of the two columns “either” in Table 4), these numbers increase to 2, 8, and 11 for the high, medium, and low levels, respectively.
mCMO for the high (A), medium (B), and low (C) required benefit for “LARGE”, measured by absolute OS increase and proportional OS increase at 2 years in poor prognosis groups (<12 months) or at 3 years in good prognosis (>12 months) groups: number of trials meeting these criteria for success among the 18 trials (of the 43 analyzed) reporting OS data on more than 20 patients at risk at the late term of 2 and 3 years
. | . | Criteria . | Number of positive trials . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | . | . | 2 y . | 3 y . | ||||||
Prognosis . | Trials . | Absolute increase . | Proportional increase . | Absolute . | Proportional . | “Either” . | “Both” . | Absolute . | Proportional . | “Either” . | “Both” . |
A. High threshold for mCMO: LARGE | |||||||||||
<9 mo | 4 | 15% | 100% | 0 | 1 | 1 | 0 | ||||
9–12 mo | 7 | 15% | 100% | 0 | 0 | 0 | 0 | ||||
12–18 mo | 0 | 15% | 100% | 0 | 0 | 0 | 0 | ||||
>18 mo | 7 | 15% | 100% | 1 | 0 | 1 | 0 | ||||
Total | 18 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | ||
B. Medium threshold for mCMO: LARGE | |||||||||||
<9 mo | 4 | 10% | 50% | 1 | 1 | 2 | 0 | ||||
9–12 mo | 7 | 10% | 50% | 2 | 4 | 4 | 2 | ||||
12–18 mo | 0 | 10% | 50% | 0 | 0 | 0 | 0 | ||||
>18 mo | 7 | 10% | 50% | 2 | 0 | 2 | 0 | ||||
Total | 18 | 3 | 5 | 6 | 2 | 2 | 0 | 2 | 0 | ||
C. Low threshold for mCMO: LARGE | |||||||||||
<9 mo | 4 | 5% | 25% | 3 | 3 | 3 | 3 | ||||
9–12 mo | 7 | 5% | 25% | 5 | 5 | 5 | 5 | ||||
12–18 mo | 0 | 5% | 25% | 0 | 0 | 0 | 0 | ||||
>18 mo | 7 | 5% | 25% | 3 | 1 | 3 | 1 | ||||
Total | 18 | 8 | 8 | 8 | 8 | 3 | 1 | 3 | 1 |
. | . | Criteria . | Number of positive trials . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | . | . | 2 y . | 3 y . | ||||||
Prognosis . | Trials . | Absolute increase . | Proportional increase . | Absolute . | Proportional . | “Either” . | “Both” . | Absolute . | Proportional . | “Either” . | “Both” . |
A. High threshold for mCMO: LARGE | |||||||||||
<9 mo | 4 | 15% | 100% | 0 | 1 | 1 | 0 | ||||
9–12 mo | 7 | 15% | 100% | 0 | 0 | 0 | 0 | ||||
12–18 mo | 0 | 15% | 100% | 0 | 0 | 0 | 0 | ||||
>18 mo | 7 | 15% | 100% | 1 | 0 | 1 | 0 | ||||
Total | 18 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | ||
B. Medium threshold for mCMO: LARGE | |||||||||||
<9 mo | 4 | 10% | 50% | 1 | 1 | 2 | 0 | ||||
9–12 mo | 7 | 10% | 50% | 2 | 4 | 4 | 2 | ||||
12–18 mo | 0 | 10% | 50% | 0 | 0 | 0 | 0 | ||||
>18 mo | 7 | 10% | 50% | 2 | 0 | 2 | 0 | ||||
Total | 18 | 3 | 5 | 6 | 2 | 2 | 0 | 2 | 0 | ||
C. Low threshold for mCMO: LARGE | |||||||||||
<9 mo | 4 | 5% | 25% | 3 | 3 | 3 | 3 | ||||
9–12 mo | 7 | 5% | 25% | 5 | 5 | 5 | 5 | ||||
12–18 mo | 0 | 5% | 25% | 0 | 0 | 0 | 0 | ||||
>18 mo | 7 | 5% | 25% | 3 | 1 | 3 | 1 | ||||
Total | 18 | 8 | 8 | 8 | 8 | 3 | 1 | 3 | 1 |
Currently available drugs and their indications for advanced solid tumors passing the postulated OS threshold values for the mCMO in the 18 pivotal trials reporting OS data on more than 20 patients at risk at the late term of 2 and 3 years: “LARGE”
Level of required benefit . | Prognostic group . | Drugs meeting both criteria of absolute and proportional OS increase . | Drugs meeting either criteria of absolute or proportional OS increase . |
---|---|---|---|
High | <9 mo | — | Nab-Paclitaxel-pancreas (P; ref. 36) |
9–12 mo | — | — | |
12–18 mo | — | — | |
>18 mo | — | Sunitinib-GIST (A; ref. 22) | |
Medium | <9 mo | — | Temsirolimus-kidney (A; ref. 6); Nab-Paclitaxel-pancreas (P; ref. 36) |
9–12 mo | Radium223-prostate (45); Erlotinib-lung (16) | Bevacizumab-lung (P; ref. 18); Radium223-prostate (A+P; ref. 45); Bevacizumab-colon (P; ref. 25); Erlotinib-lung (A+P; ref. 16) | |
12–18 mo | — | — | |
>18 mo | — | Cetuximab-head and neck (A; ref. 38); Sunitinib-GIST (A; ref. 22) | |
Low | <9 mo | Nab-Paclitaxel-pancreas (36); Temsirolimus-kidney (6); Ipilimumab-melanoma (40) | Nab-Paclitaxel-pancreas (A+P; ref. 36); Temsirolimus-kidney (A+P; ref. 6); Ipilimumab-melanoma (A+P; ref. 40) |
9–12 mo | Trastuzumab-gastric (41); Radium223-prostate (45); Bevacizumab-lung (18); Erlotinib-lung (16); Bevacizumab-colon (25) | Trastuzumab-gastric (A+P; ref. 41); Radium223-prostate (A+P; ref. 45); Bevacizumab-lung (A+P; ref. 18); Erlotinib-lung (A+P; ref. 16); Bevacizumab-colon (A+P; ref. 25) | |
12–18 mo | — | — | |
>18 mo | Sunitinib-GIST (22) | Trastuzumab-breast (A; ref. 12); Cetuximab-head and neck (A; ref. 38); Sunitinib-GIST (A+P; ref. 22) |
Level of required benefit . | Prognostic group . | Drugs meeting both criteria of absolute and proportional OS increase . | Drugs meeting either criteria of absolute or proportional OS increase . |
---|---|---|---|
High | <9 mo | — | Nab-Paclitaxel-pancreas (P; ref. 36) |
9–12 mo | — | — | |
12–18 mo | — | — | |
>18 mo | — | Sunitinib-GIST (A; ref. 22) | |
Medium | <9 mo | — | Temsirolimus-kidney (A; ref. 6); Nab-Paclitaxel-pancreas (P; ref. 36) |
9–12 mo | Radium223-prostate (45); Erlotinib-lung (16) | Bevacizumab-lung (P; ref. 18); Radium223-prostate (A+P; ref. 45); Bevacizumab-colon (P; ref. 25); Erlotinib-lung (A+P; ref. 16) | |
12–18 mo | — | — | |
>18 mo | — | Cetuximab-head and neck (A; ref. 38); Sunitinib-GIST (A; ref. 22) | |
Low | <9 mo | Nab-Paclitaxel-pancreas (36); Temsirolimus-kidney (6); Ipilimumab-melanoma (40) | Nab-Paclitaxel-pancreas (A+P; ref. 36); Temsirolimus-kidney (A+P; ref. 6); Ipilimumab-melanoma (A+P; ref. 40) |
9–12 mo | Trastuzumab-gastric (41); Radium223-prostate (45); Bevacizumab-lung (18); Erlotinib-lung (16); Bevacizumab-colon (25) | Trastuzumab-gastric (A+P; ref. 41); Radium223-prostate (A+P; ref. 45); Bevacizumab-lung (A+P; ref. 18); Erlotinib-lung (A+P; ref. 16); Bevacizumab-colon (A+P; ref. 25) | |
12–18 mo | — | — | |
>18 mo | Sunitinib-GIST (22) | Trastuzumab-breast (A; ref. 12); Cetuximab-head and neck (A; ref. 38); Sunitinib-GIST (A+P; ref. 22) |
NOTE: A and P in parenthesis next to each agent indicate which specific parameter of “LARGE” was met in each trial.
Abbreviations: A, absolute increase in OS; P, proportional increase in OS.
Discussion
The current trend of conducting ever larger trials looking for smaller and smaller benefits that are statistically significant, but clinically marginal, has been strongly criticized (2–4, 50). We hope that the concepts proposed here set the stage for a new, more informed starting point for debate on these complex and controversial issues in three ways.
By specifically defining the new concept of the mCMO, the proposal goes beyond the vague concepts of “clinically worthwhile,” “clinically relevant,” “large deltas,” etc.
By clarifying that both LARGE and SMALL are valuable concepts.
By providing a tentative set of thresholds for the four OS-related parameters for ranking the clinical value of currently available drugs.
It should be clear that the approach we are suggesting does not represent an attempt to model the effect of treatments on the prognosis of patients with cancer. In general, the treatment effects observed in a randomized clinical trial are compatible with a variety of statistical models, including those that assume a substantial heterogeneity in treatment effects across patients. For instance, a moderate increase in median OS may conceal a large effect in a small group of patients, with no effect in the remaining patients. What we propose here is simply an operational model that can prove useful to establish criteria for defining the clinical relevance of a new treatment.
It is quite obvious that patients and doctors would prefer pursuing LARGE rather than SMALL. However, experience in clinical oncology suggests two key lessons in this regard: (i) because cancer treatment advances are “incremental,” their cumulative effects are missed if SMALL is rejected a priory and (ii), in some instances, the detection of LARGE benefits in molecularly defined subgroups of patients did not derive directly from preclinical or early clinical studies, but this recognition was initiated by the retrospective identification of these molecular parameters within studies demonstrating SMALL (see the entire anti-EGFR story in advanced colorectal cancer). Hence, both types of the mCMO are relevant and should be pursued with the hope that LARGE will be pursued with increased frequency in molecularly defined populations.
This model suffers recognized limitations
First and most important, there is no question that the future of oncology is to navigate toward a molecular classification of cancer. This will imply recognizing driving mutations that will hopefully be targetable by new drugs. Pursuing SMALL in these conditions is clearly inappropriate. However, the SMALL philosophy, and consequently, conducting large trials on unselected patients populations, is still more prudent whenever no clear indications about the molecular determinants of treatment effect are available. On the other hand, it has been suggested that different trial strategies, such as smaller sample sizes, but relaxed alpha values for trials in molecularly defined subgroups, may provide larger benefits for patients in the long run (51). Simply “raising the bar” and aiming at larger treatment effect in smaller studies may lead to the discarding of treatments of worthwhile, though not outstanding, efficacy.
Second, our model considers OS-related parameters only. This choice derives from the need for simplicity. However, a major challenge in this regard is the relevance of crossover in studies in which PFS is the primary endpoint. The simple solution could be to adapt the required threshold level to lessen the requirements for improvement based on a certain percentage of cross-over. However, this would add an additional level of complexity because the extent of crossover is difficult to control. For example, crizotinib, a practice-changing biologic for non–small cell lung cancer (21), does not appear in the table as a success because, despite a huge PFS gain over chemotherapy alone, OS outcome has not been affected so far due to extensive preplanned cross-over. In other trials, cross-over is more limited, preserving the impact of the new agent on OS (12).
Third, prognosis is a continuum. Lumping conditions with 4 to 5 months median OS (late lines in advanced colorectal cancer) with others in the 8 to 9 months median OS range (advanced gastric cancer), imply per se 100% difference. The same can be said for conditions of much longer MST such as advanced renal cell carcinomas that now have a median OS of longer than 30 months and second-line breast cancer (much shorter median OS). With such heterogeneities, additional prognostic categories could be recommendable, again at the cost of increasing the already high complexity of the model.
Fourth, too few trials report the late effect of treatment on OS; and too often, the patients at risk at the predefined time of analysis of 2 and 3 years are too few (we accepted the very low number of 20 to have a sizable sample of trials). This limits substantially the power of this model to ascertain the parameters of LARGE. The present system of trial design, conduct, analysis, and reporting is not clearly focused on looking for LARGE. The reasons for this are understandable (time and size of trials issues). However, either prolonging the follow-up time of the trials to allow data maturation or dedicating a second publication to long-term results is to be encouraged.
We believe that the basic principles (mCMO, SMALL, and LARGE) as well as the structured approach presented here provide a critical refinement to the “raising the bar philosophy” recently emphasized. We also recognize that additional fine tuning or more substantial adjustments to these concepts or threshold values by all stakeholders will improve the model further.
Disclosure of Potential Conflicts of Interest
A.F. Sobrero reports receiving speakers bureau honoraria from and is a consultant/advisory board member for Amgen, Bayer, Merck, Roche, and Sanofi. P. Bruzzi reports receiving speakers bureau honoraria from Bristol-Myers Squibb, Novartis, and Roche. No potential conflicts of interest were disclosed by the other authors.
Authors' Contributions
Conception and design: A.F. Sobrero, P. Bruzzi
Development of methodology: A.F. Sobrero, A. Pastorino, P. Bruzzi
Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): A.F. Sobrero, A. Pastorino
Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): A.F. Sobrero, A. Pastorino, D.J. Sargent
Writing, review, and/or revision of the manuscript: A.F. Sobrero, D.J. Sargent, P. Bruzzi
Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): A.F. Sobrero, A. Pastorino
Study supervision: A.F. Sobrero
Acknowledgments
We thank Lee M. Ellis, MD, for his review of the manuscript and insightful comments.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.