Abstract
Competing risks are events in which at least one precludes the observation of the other, such as toxicity and death. This commentary discusses and distinguishes between the two common types of competing risk analyses, the Kaplan–Meier and cumulative incidence curves. Clin Cancer Res; 18(8); 2127–9. ©2012 AACR.
See commentary on Dignam et al., p. 2301
In this issue of Clinical Cancer Research, Dignam and colleagues (1) discuss how to choose competing risks regression models to evaluate the association between predictors and cause-specific failures of interest. They differentiate the situations in which this choice is important from those in which it makes little difference, using a Radiation Therapy Oncology Group trial of prostate cancer as a nice example. The point of this commentary is to present some background to the issue and, as the title implies, to show how the two principal types of competing risks analyses' interpretations differ from each other. I restrict myself to the one-sample case and conclude that, though each analysis can be the right thing to do, they answer different questions. This conclusion also extends to the multivariable models that are the subject of Dignam and colleagues' investigations.
Consider a treatment failure associated with a time, such as late toxicity due to radiation or death in remission after bone marrow transplant in patients with acute lymphoblastic leukemia (ALL), both discussed below. Each failure is subject to a so-called competing risk, or censoring, meaning that some other event interferes with our observation of it. The competing risk may be the happy circumstance of the study ending with a patient lacking evidence of toxicity or death, or it may be a different failure such as death in the first case or recurrence in the second. Regardless, it must be accounted for in estimating failure rates. The first way of doing so, in terms of priority and popularity, is the traditional Kaplan–Meier estimate (2). It estimates the probability of a failure in the absence of the competing risk; it mathematically factors it out. The second and increasingly common method of assessing failure rates is by using the cumulative incidence curve to estimate the probability of failures actually observed in patients who are subject to censoring by the competing risk (3). The crux of the distinction between the italicized phrases lies in the effect of the competing risk on the estimates. It does not change the Kaplan–Meier curve (besides reducing the effective sample size), but by removing patients from being subject to failure, it will lower the cumulative incidence. Two examples should render this point intelligible.
The first example consists of what academics prefer to call a “spirited discussion” in the pages of the International Journal of Radiation Oncology. In 1994, Caplan and colleagues (4) launched a firm critique of the Kaplan–Meier curve for estimating rates of cause-specific failures, such as late complications of radiation. They pointed out that the Kaplan–Meier estimates of component causes of failure would sum to more than the Kaplan–Meier estimate of any cause. This is correct; to take an extreme example, with sufficient follow-up, the Kaplan–Meier estimate of cause of death due to heart disease would eventually equal one, or nearly so, as would the estimates of death due to cancer, kidney disease, or other causes. These estimates clearly sum to more than unity, the all-cause long-term probability of death (5). This is because the Kaplan–Meier curve estimates probabilities of one failure in the absence of any others. On the other hand, the cumulative incidence curves of each of these causes of death will sum to the cumulative incidence of any death. Caplan and colleagues concluded that Kaplan–Meier analysis of cause-specific failure “should be avoided…in favor of appropriate methods [cumulative incidence curves].” Bentzen and colleagues (6) and Denham and colleagues (7) provided an equally strong endorsement of Kaplan–Meier in their rebuttal. Their logic, summarized by Bentzen and colleagues (6), was that “The reason is that this type of estimate [the cumulative incidence function] is not specific for the endpoint of interest but is affected by the risk of the competing events as well.” That is, if a late complication is uncommon because most patients die of their disease, then the cumulative incidence will be low, whereas the Kaplan–Meier could be high; it is important to remember that Kaplan–Meier estimates late complication rates only among survivors. (The only point that all authors held in common was their dislike of giving crude proportions, that is, the total fraction of patients who had late toxicities. These figures are not adjusted for follow-up in any way; thus, shorter studies have lower rates.)
Which method is preferable? It depends on the use to which the results are to be put. Chappell (8), in an attempt to get in the last word, asked readers to consider a nonce example in which a population of patients has an 80% mortality rate during the first year of disease with no subsequent events. Suppose also that all survivors exhibit complications by 3 years. Then, cumulative incidence would give a 20% estimate of gross toxicity at 3 years, along with an 80% estimate of death. That is, only 20% of patients will have toxicities because the rest died too soon. Kaplan–Meier, on the other hand, would estimate a 100% toxicity rate among patients who survive 3 years, as well as the 80% mortality. This is no contradiction; the choice depends on the status of the competing risk, death. To a radiation biologist or other theorist seeking to characterize the toxicity process (perhaps out of interest in applying the same regimen to a different group of patients with better prognoses), death is a nuisance to be factored out, and Kaplan–Meier would be the right method. But to a patient or physician, death is a quite relevant event, and the low chance of toxicity as estimated by cumulative incidence might be worth possibly extending quality or quantity of life.
I end with a numerical example taken from Klein and Moeschberger (9), to whom I recommend the reader who wants a more thorough explanation of the data and calculations. They consider outcomes among 38 patients who received bone marrow transplant for ALL. The time scale is days after transplant, and one quantity of interest is the chance of a patient dying in remission. The opportunity for this failure to occur is clearly obviated by recurrence as a competing risk. The Kaplan–Meier and cumulative incidence estimates are plotted in Fig. 1 (because Kaplan–Meier curves traditionally start at one and decrease to zero, one minus Kaplan–Meier is shown here for comparability). We see that, as is generally the case, Kaplan–Meier exceeds cumulative incidence. This is because Kaplan–Meier estimates the probability of death in remission, assuming no relapse occurs, and cumulative incidence estimates the probability of death in remission among all patients whose pattern of relapses is similar to that observed in our data. The cumulative incidence curve is naturally lower because relapsed patients count as a “success” (in terms of death in relapse), whereas for the Kaplan–Meier curve they are eliminated from consideration. The preference again depends upon the question that is being asked and the purposes to which the answer will be put. Cumulative incidence gives the “real-world” chances of an event, and Kaplan–Meier is more informative about the medical processes involved in treatment-related mortality.
Dignam and colleagues extend the Kaplan–Meier and cumulative incidence methods to include covariates, which they label the “Cox CHR” and “Fine-Gray SHR” models, respectively (1). I hope that the reader has survived this commentary with some intuition about the Kaplan–Meier and cumulative incidence forms of survival-curve estimation and, thus enlightened, proceeds to read Dignam and colleagues' important work on the topic.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.