Predictive biomarkers can play a key role in individualized disease monitoring. Unfortunately, the use of biomarkers in clinical settings has thus far been limited. We have previously shown that mechanism-based pharmacokinetic/pharmacodynamic modeling enables integration of nonvalidated biomarker data to provide predictive model-based biomarkers for response classification. The biomarker model we developed incorporates an underlying latent variable (disease) representing (unobserved) tumor size dynamics, which is assumed to drive biomarker production and to be influenced by exposure to treatment. Here, we show that by integrating CT scan data, the population model can be expanded to include patient outcome. Moreover, we show that in conjunction with routine medical monitoring data, the population model can support accurate individual predictions of outcome. Our combined model predicts that a change in disease of 29.2% (relative standard error 20%) between two consecutive CT scans (i.e., 6–8 weeks) gives a probability of disease progression of 50%. We apply this framework to an external dataset containing biomarker data from 22 small cell lung cancer patients (four patients progressing during follow-up). Using only data up until the end of treatment (a total of 137 lactate dehydrogenase and 77 neuron-specific enolase observations), the statistical framework prospectively identified 75% of the individuals as having a predictable outcome in follow-up visits. This included two of the four patients who eventually progressed. In all identified individuals, the model-predicted outcomes matched the observed outcomes. This framework allows at risk patients to be identified early and therapeutic intervention/monitoring to be adjusted individually, which may improve overall patient survival. Cancer Res; 75(12); 2416–25. ©2015 AACR.

Lung cancer is the leading cause of cancer-related mortality around the world (1–3). In the United States and Europe, small cell lung cancer (SCLC) accounts for 13% to 20% of all new cases and deaths from lung cancer (4, 5). Treatment of SCLC is based on chemotherapy and radiotherapy. In the past 30 years, few therapeutic strategies have been successful in clinical trials (6) and many important questions regarding effective treatment strategies remain open (7, 8). These include the duration of the treatment, the role of maintenance and consolidation therapies, the role of chemoradiotherapy, alternative combination chemotherapy agents, and dose intensification. Although response rates (RR) to first-line treatment (generally combinations of platinum compound and etoposide) are high, RRs to second-line treatment are generally lower due to rapid onset of resistance (9). Among other factors, RRs to second-line treatment depend on the time elapsed between the end of first-line treatment and the detection of disease recurrence (9). Therefore, early identification of the individual risk of disease recurrence is a key component of effective clinical management.

Prognosis of untreated patients with SCLC is poor, with a median survival of 2 to 4 months after diagnosis. Currently, the main prognostic factor of SCLC is the disease stage at diagnosis (10). The most widely used staging classification is the dichotomous system developed by the Veterans Administration Lung Study Group. This method classifies patients with limited-stage disease (LD; those with tumor confined to the hemithorax of origin, the mediastinum, or the supraclavicular lymph nodes) and extensive-stage disease (ED; those whose tumor has spread outside these limits; ref. 11). SCLC is characterized by its fast growth and rapid onset of widespread metastases where approximately 30% of patients with SCLC are diagnosed as limited-stage and the remaining 70% as extensive-stage (4). The median survival for LD SCLC is 14 to 20 months, and the 2- and 5-year survival rates are 30% and 10%, respectively. For ED SCLC, median survival is 9 to 12 months, and survival after 2 years occurs in less than 5% of patients (4). In addition to stage at diagnosis, performance status and other clinical parameters, such as age, gender, and number and location of metastatic tumors, have been proposed as predictive and prognostic factors (10, 12–14).

Existing methods to detect disease progression in SCLC and most solid tumors generally rely on computed CT scans, where the change in the sum of longest diameters (SLD) is assessed. In particular, the RECIST criteria define disease progression as change of 20% in SLD or appearance of new metastases (15). Recommendations during follow-up of SCLC include a CT scan every 2 to 3 months (9, 16). This follow-up strategy, however, means that significant disease progression can already occur even before the recurrent disease is detected. In a retrospective study, Sugiyama and colleagues (17) showed that a more intensive follow-up strategy in SCLC patients that had achieved response to first-line treatment resulted in significantly higher postrelapse survival and overall median survival compared with nonintensive follow-up patients. However, more intensive follow-up programs may not be necessary for all patients because early identification of high-risk patients may enable targeted monitoring programs to ultimately improve individual patient survival.

The use of circulating biomarkers has been proposed for predicting patient outcome. However, despite the easy collection of those biomarkers in many tumor types, their use has had limited uptake due to the lack of validated markers of disease progression (18). One of the reasons is that establishing the link between biomarkers and tumor progression is not a trivial task. The prevailing strategy has been to use simple univariate and multivariate analysis to determine biomarker relevance. With SCLC, elevation of serum lactate dehydrogenase (LDH) is found in half of patients, and up to 85% of patients with ED (10, 19). Some studies have also proposed other biomarkers that may have prognostic significance, namely neuron-specific enolase (NSE), chromogranin, circulating tumor cells, and progastrin-releasing peptide (20–22). However, the prognostic and predictive value of these biomarkers has not been widely accepted in the context of standard clinical practice and therefore their use as predictive biomarkers has been limited (23). The empirical methods used to assess these biomarkers, however, may not provide an accurate representation of their utility because appropriate assessment of the predictive capacity of biomarkers should be in the context of mechanistic models (24–26). Indeed, we have previously shown that when these biomarkers are integrated into a mechanistic model of disease time-course, model-based disease predictions showed high correlation with outcome (27). These models allow for time-dependent and nonlinear relationships between biomarker and outcome to be accounted, which may not be detected by a simple assessment of correlation. Mechanistic models also allow data from multiple biomarkers to be integrated together to obtain improved predictors of response. With SCLC, we found that model-based disease predictions showed higher correlation with outcome than either biomarker individually.

The traditional approach for using markers for individual prediction generally relies on the comparison of biomarker levels with an established cutoff value (e.g., derived from receiver operating characteristic curve analysis). Indeed, our previously developed SCLC biomarker model could have been used in this manner for individual prediction. However, with cutoff approaches, individuals with similar disease profiles may have very different predictions of outcome if they lie close to the cutoff value. This unsatisfactory situation arises because a cutoff approach describes risk as a dichotomous rather than continuous variable. In contrast, a probabilistic model-based approach allows the continuous disease-risk relationship to be accurately modeled and provides quantitative predictions of individual risk. Patients predicted to have high probabilities of disease progression may therefore be provided with alternative therapeutic strategies while patients predicted to have less certain outcomes may be more intensively monitored.

On the basis of the above considerations, the main objective is to develop a framework that allows at-risk patients to be identified early so that therapeutic intervention and monitoring can be individualized. We use our previously reported biomarker model (27) to develop and illustrate this framework in SCLC.

Data

Two datasets were available: a training and an external dataset. The training data were identical to the data used to develop and assess the previously reported biomarker model (27). The external dataset was retrospectively collected from the medical records of 22 patients diagnosed in the University Hospital of Navarra (Pamplona, Spain) between 2012 and 2014. We included all treatment-naïve SCLC patients (as determined by histologic examination) receiving first-line chemotherapy (i.e., etoposide plus either cisplatin or carboplatin). No patients satisfying these inclusion criteria were excluded from the analysis. Figure 1A depicts standard treatment and follow-up periods. Treatment typically included six chemotherapy cycles given every 3 weeks. Some patients also received concomitant radiotherapy in the primary tumor (hemithorax). Tumor assessments were performed with CT before the commencement of therapy, between the second and third cycles of chemotherapy, at the end of the entire course of chemotherapy, and every second/third month thereafter in follow-up examinations. Clinical outcome was categorized according to modified RECIST criteria, where the sum of the longest diameters is compared with the value of the previous CT scan. Because the medical records did not contain the exact tumor size measurements, only the derived RECIST categories were used in the data analysis. Biomarker (LDH and NSE) samples used to externally validate the model were obtained during the treatment period. The time courses of LDH and NSE are shown in Supplementary Fig. S1. The external dataset included a total of 137 LDH and 77 NSE observations during treatment period and a total of 78 CT scans, obtained during both treatment and follow-up periods. Out of the 22 patients, 7 patients (31.8%) had CT scans with disease progression (with three patients progressing just after finishing first-line treatment). A more detailed description of the data can be found in Table 1 that lists also the patients and study characteristics of the training dataset.

Figure 1.

A, schematic representation of typical pharmacologic and disease monitoring protocol in SCLC patients. Vertical rectangles correspond to CT scans (tumor assessment); gray vertical arrows correspond to each chemotherapy (CT) cycle (etoposide plus cisplatin or carboplatin) and horizontal gray arrow depicts concomitant radiotherapy. B, schematic representation of the developed framework. Light shaded area corresponds to already published work (27) where darker shaded areas are developed within this article. VPC, visual predictive check; ROC, receiver operating characteristic curve.

Figure 1.

A, schematic representation of typical pharmacologic and disease monitoring protocol in SCLC patients. Vertical rectangles correspond to CT scans (tumor assessment); gray vertical arrows correspond to each chemotherapy (CT) cycle (etoposide plus cisplatin or carboplatin) and horizontal gray arrow depicts concomitant radiotherapy. B, schematic representation of the developed framework. Light shaded area corresponds to already published work (27) where darker shaded areas are developed within this article. VPC, visual predictive check; ROC, receiver operating characteristic curve.

Close modal
Table 1.

Patient characteristics in the training dataset (data used to develop the model) and the external dataset (data from new patients, used to externally validate the approach)

Training datasetExternal dataset
Limited disease (n = 18)Extensive disease (n = 42)Limited disease (n = 6)Extensive disease (n = 16)
n%n%n%n%
Age, y 
 Mean (range) 67.8 (48–78) — 61.0 (40–85) — 59.8 (53–81) — 57.4 (44–76) — 
Gender 
 Male 13 72.2 38 90.5 83.3 12 75 
 Female 27.8 9.5 16.7 25 
 ≤2 9.5 18.8 
LDH (UI/L) 
 Median (range) 237 (176–592) — 322 (163–3,480) — 224 (127–493) — 434 (130–5,696) — 
NSE, ng/mL 
 Median (range) 30.9 (9.85–156) — 67.7 (11.5–2,534) — 11.2 (5.09–30.3) — 14.3 (7.14–377) — 
Treatment strategy 
 Etoposide + cisplatin 50 31 73.8 66.7 15 93.8 
 Etoposide + carboplatin 50 11 26.2 33.3 6.2 
 Thoracic irradiation 16 88.9 13 30.9 83.3 43.7 
G-CSF coadministration 50 21 50 33.3 50 
CT scans Median (range) weeks # DP Median (range) weeks # DP Median (range) weeks # DP Median (range) weeks # DP 
During first-line treatment 8.4 (6–22) 8.5 (2.6–21) 6.3 (5.9–11) 7.0 (6–17) 
End of first-line treatment 22 (10–27) 20 (6.0–23) 17 (12–22) 13 (11–32) 
First follow-up 30 (22–48) 29 (19–37) 14 28 (22–66) 28 (19–45) 
Second follow-up 42 (32–53) 43 (31–92) 44 (27–80) 44 (36–52) 
Training datasetExternal dataset
Limited disease (n = 18)Extensive disease (n = 42)Limited disease (n = 6)Extensive disease (n = 16)
n%n%n%n%
Age, y 
 Mean (range) 67.8 (48–78) — 61.0 (40–85) — 59.8 (53–81) — 57.4 (44–76) — 
Gender 
 Male 13 72.2 38 90.5 83.3 12 75 
 Female 27.8 9.5 16.7 25 
 ≤2 9.5 18.8 
LDH (UI/L) 
 Median (range) 237 (176–592) — 322 (163–3,480) — 224 (127–493) — 434 (130–5,696) — 
NSE, ng/mL 
 Median (range) 30.9 (9.85–156) — 67.7 (11.5–2,534) — 11.2 (5.09–30.3) — 14.3 (7.14–377) — 
Treatment strategy 
 Etoposide + cisplatin 50 31 73.8 66.7 15 93.8 
 Etoposide + carboplatin 50 11 26.2 33.3 6.2 
 Thoracic irradiation 16 88.9 13 30.9 83.3 43.7 
G-CSF coadministration 50 21 50 33.3 50 
CT scans Median (range) weeks # DP Median (range) weeks # DP Median (range) weeks # DP Median (range) weeks # DP 
During first-line treatment 8.4 (6–22) 8.5 (2.6–21) 6.3 (5.9–11) 7.0 (6–17) 
End of first-line treatment 22 (10–27) 20 (6.0–23) 17 (12–22) 13 (11–32) 
First follow-up 30 (22–48) 29 (19–37) 14 28 (22–66) 28 (19–45) 
Second follow-up 42 (32–53) 43 (31–92) 44 (27–80) 44 (36–52) 

NOTE: ECOG performance status based on the Eastern Cooperative Oncology Group; CT scans report tumor assessments according to RECIST v1.1., defined according to previous CT scan.

Abbreviation: DP, disease progression.

Overview

We have previously developed a population model describing the dynamics of LDH and NSE in SCLC patients (27). A schematic view of the published biomarker model and model equations can be found in Supplementary Fig. S2. In this study, we extend this biomarker model with a logistic model to link model predictions from the biomarker model with clinical outcome (observed disease progression). We then perform an external validation of both the biomarker and the logistic models. We then used the combined biomarker/logistic model to predict, at the end of first-line treatment, individual probabilities of disease progression in follow-up periods. Figure 1B shows a schematic diagram of the workflow.

Biomarker model and external validation

Our previously published semi-mechanistic model for NSE and LDH dynamics has not yet been assessed using an external dataset (27). We therefore used the newly acquired external dataset to perform an external validation of biomarker predictions in these new patients.

External validation of the biomarker model at the individual level was based on basic goodness of fit plots. Individual parameters for the new set of patients were obtained in NONMEM 7.2 (28) by evaluating the empirical Bayes estimates (EBE). EBEs are conditional estimates of an individual's model parameters given the parameter estimates from the population model (27), the individual observations (LDH and NSE concentrations), and covariate information (radiotherapy and granulocyte colony-stimulating factor therapy) from patients in the external dataset.

The external validation at the population level was based on simulation-based diagnostics, namely prediction- and variability-corrected visual predictive checks (pvc-VPC; ref. 29) as implemented in PsN (30). Briefly, VPCs evaluate the model's ability to describe the central tendency and variability in the observed data by comparison of the observed quantiles to the quantiles of the simulated datasets. We produced pvc-VPCs by simulating 1,000 replicate datasets with the same characteristics of the external dataset (i.e., patient numbers, sample times, intervention, and covariate information) by sampling new sets of individual parameters from the estimated population parameter distributions (27).

Model for RECIST outcome

RECIST data were dichotomized into two categories (disease progression and non-disease progression) and analyzed using by logistic regression. We fit a linear logit model to relate the probability of having disease progression at the CT scans to the model predicted changes in disease (Eq. 1) between each CT scan and its previous CT scan. The logit transformation (Eq. 2) ensures that the probability of having disease progression as outcome of a CT scan will fall between 0 and 1.

where Di,j is the predicted disease level for patient j at observation point (CT scan) i, and ΔDi,j is the predicted change in disease level for patient j between observation point i and i + 1 (i.e., the following CT scan, approximately 8 weeks later). P(DPij∣η) represents the probability of disease progression in the jth individual at the ith measurement (CT scan), η is a subject-level random effect (normally distributed with mean 0 and variance OMEGA) describing interpatient variability in the probability of disease progression, θ1 is the intercept parameter and θ2 is the slope parameter. To obtain meaningful initial parameter estimates, θ1 and θ2, were parameterized in terms of ΔD80 and ΔD50, which describe the ΔD predicted to cause a 50% and 80% probability of disease progression [P(DP)], respectively. Thus, the intercept and the slope take the form of θ1 = −ΔD50 × θ2 and θ2 = log(4)/(ΔD80 − ΔD50).

The logistic model was obtained using the CT scans from treatment and follow-up periods of patients belonging to the training dataset. Model parameters were estimated in NONMEM 7.2 using the SAEM algorithm (28). Model evaluation was based on the construction of VPCs, in a similar manner to the biomarker model evaluation. The VPC compared the observed proportion of patients showing disease progression in each CT scan with the model-predicted distribution of the same proportion attained in 1,000 simulated datasets. Because a model may perform well at the population level (i.e., it may predict well the proportion of patients progressing at each CT scan) but it may perform poorly with individual patient prediction, we also evaluated the predictive accuracy of the model at the individual level. This was achieved by applying the framework detailed in the next section (individual early prediction of disease progression) but modified to use all data available (i.e., biomarker and CT scans outcomes). The final output was a graphical comparison of the predicted individual distribution of P(DP) with the actual outcome of each CT scan.

Individual early prediction of disease progression

To simulate decision making after completion of first-line treatment, we evaluated P(DP) predictions of the first two follow-up CT scans (i.e., the period 6 months after end of treatment) using data up to the end of first-line treatment, by comparing the predicted P(DP) distributions at the first and the second follow-up CT scans with the observed CT scan outcomes.

Biomarker (LDH and NSE) samples from patients in the external dataset obtained during the treatment period were used to create patient-specific disease profiles during treatment and follow-up using the Bayesian algorithm in NONMEM 7.2 (28). Changes in the distribution of predicted disease levels were computed in accordance with Eq. 1. The logistic model was then used to convert predicted changes in disease levels to probabilities of disease progression at the times where CT scans were collected (Eq. 2). In order to account for uncertainty in population and individual model parameters, we obtained the full uncertainty distribution of P(DP) predictions for each individual using Markov chain Monte Carlo (MCMC) evaluation in NONMEM 7.2 (28). In total, 1,000 MCMC samples were obtained from the stationary distribution with Gibbs sampling to obtain individual uncertainty distributions.

Biomarker model and external validation

Results of the external validation (individual and population level) of the previously developed biomarker model (27) are shown in Fig. 2 and indicate that individual profiles (Fig. 2A), typical tendencies, and data dispersion (Fig. 2B) are well captured by the model.

Figure 2.

External model validation at the individual level (A) and at the population level (B) using new patient data (external dataset). A, LDH observations (green circles), with individual predictions (green lines) on the left y-axis, and NSE observations (orange squares), with the individual predictions (orange lines) on the right y-axis for nine selected patients. Observations and predictions for both biomarkers are log-transformed. Vertical black arrows, chemotherapy administration. B, pcv-VPC of LDH (left) and NSE (right) for the final model against chemotherapy cycles. Cycle F corresponds to follow-up measurement. Dots, individual observations (LDH and NSE concentrations). Solid black lines represent the 5th, 50th, and 95th percentiles of the observed data. Shaded areas are the 95% prediction intervals based on simulated data (n = 1,000) for the corresponding percentiles.

Figure 2.

External model validation at the individual level (A) and at the population level (B) using new patient data (external dataset). A, LDH observations (green circles), with individual predictions (green lines) on the left y-axis, and NSE observations (orange squares), with the individual predictions (orange lines) on the right y-axis for nine selected patients. Observations and predictions for both biomarkers are log-transformed. Vertical black arrows, chemotherapy administration. B, pcv-VPC of LDH (left) and NSE (right) for the final model against chemotherapy cycles. Cycle F corresponds to follow-up measurement. Dots, individual observations (LDH and NSE concentrations). Solid black lines represent the 5th, 50th, and 95th percentiles of the observed data. Shaded areas are the 95% prediction intervals based on simulated data (n = 1,000) for the corresponding percentiles.

Close modal

Model for RECIST outcome

Predicted P(DP) was obtained by logistic regression where the expression for the logit included a linear function of ΔDi,j and ΔDi,j is the change in predicted disease between two consecutive CT scans using the biomarker model (27). Estimation of interpatient variability in the logistic model was not supported by the data. The estimates of the transformed parameters (see Materials and Methods) ΔD50 and ΔD80 were 0.292 and 0.510 with relative standard errors of 20.2% and 30.2%, respectively.

Figure 3A shows the relationship between the typical probability of disease progression and ΔDi,j. In Fig. 3B, the performance of the logistic model at the population level is shown for the training and external datasets. The predictive accuracy of the model at the individual level is shown in Supplementary Fig. S3. Results indicate adequate model performance in both cases.

Figure 3.

A, logistic regression model linking predicted changes (on the x-axis) in disease with probability of disease progression at a CT scan (on the y-axis). B, VPCs for the combined biomarker/logistic model where the observed proportion of patients with observed disease progression at each CT scan is compared with proportion obtained with simulated data (n = 1,000) for the training dataset (top) and the external dataset (bottom). CT scan #1 was taken during treatment, CT scan #2 corresponds to the tumor assessment obtained at the end of first-line treatment, and CT scans >#2 were obtained in the follow-up period. Dots represent observed proportion. Dark gray shared areas are the 90% prediction intervals, light gray shared areas are interquartile ranges based on simulated data, and black line represents the median of the simulated data.

Figure 3.

A, logistic regression model linking predicted changes (on the x-axis) in disease with probability of disease progression at a CT scan (on the y-axis). B, VPCs for the combined biomarker/logistic model where the observed proportion of patients with observed disease progression at each CT scan is compared with proportion obtained with simulated data (n = 1,000) for the training dataset (top) and the external dataset (bottom). CT scan #1 was taken during treatment, CT scan #2 corresponds to the tumor assessment obtained at the end of first-line treatment, and CT scans >#2 were obtained in the follow-up period. Dots represent observed proportion. Dark gray shared areas are the 90% prediction intervals, light gray shared areas are interquartile ranges based on simulated data, and black line represents the median of the simulated data.

Close modal

Individual early prediction of disease progression

P(DP) predictions for follow-up visits were obtained for new SCLC cancer patients (external dataset) assuming that only treatment period data were available (see Fig. 1). Figure 4 shows the schematic of the Bayesian framework developed for prediction. P(DP) uncertainty distributions were obtained using the full nonparametric distribution of the predicted disease in accordance with Eq. 2 (Fig. 4B–D).

Figure 4.

Schematic illustration of algorithm used to obtain predicted probabilities of progression at first and second follow-up scans based on biomarker samples from treatment period. A, LDH and NSE observations from a new patient were obtained during treatment period. B, the full posterior distribution of predicted disease level for this patient was generated using a Full Bayesian MCMC analysis using the population biomarker model as a prior. C, distribution of changes in predicted disease levels between two CT scans are computed. D, distribution of predicted changes in disease are translated to distribution of P(PD) at each CT scan.

Figure 4.

Schematic illustration of algorithm used to obtain predicted probabilities of progression at first and second follow-up scans based on biomarker samples from treatment period. A, LDH and NSE observations from a new patient were obtained during treatment period. B, the full posterior distribution of predicted disease level for this patient was generated using a Full Bayesian MCMC analysis using the population biomarker model as a prior. C, distribution of changes in predicted disease levels between two CT scans are computed. D, distribution of predicted changes in disease are translated to distribution of P(PD) at each CT scan.

Close modal

Figure 5 shows the results comparing the predicted P(DP) with the observed patient status (i.e., disease progression vs. non-disease progression) at the first and second follow-up scans. For demonstration purposes, P(DP) predictions higher than 80% were deemed “high confidence” or “reliable” predictions of a disease progression outcome. Similarly, P(DP) predictions lower than 20% were deemed “high confidence” predictions of a non-disease progression outcome. Out of 24 CT scan measurements, 75% of the P(DP) predictions were prospectively identified as “high confidence” predictions. As can be seen in Fig. 5, all “high confidence” P(DP) predictions correctly predicted follow-up outcomes with 100% accuracy using treatment only data. The remaining 25% of the P(DP) predictions were deemed to be of insufficient confidence to provide a reliable prediction of outcome. A total of 4 individuals experienced disease progression in the two follow-up scans. Two of these four outcomes were prospectively identified as having “high confidence” P(DP) predictions.

Figure 5.

Predicted probabilities of progression at the first two follow-up CT scans for patients in external dataset obtained from their biomarker data collected up to end of first-line treatment. Empty rhombuses represent patients that did not have disease progression in the CT scan and filled squares depict patients with observed disease progression at each CT scan. Dotted and solid lines depict the 25th and 75th quantiles corresponding to the individual posterior distribution.

Figure 5.

Predicted probabilities of progression at the first two follow-up CT scans for patients in external dataset obtained from their biomarker data collected up to end of first-line treatment. Empty rhombuses represent patients that did not have disease progression in the CT scan and filled squares depict patients with observed disease progression at each CT scan. Dotted and solid lines depict the 25th and 75th quantiles corresponding to the individual posterior distribution.

Close modal

In the past decade, great advances have been made in the area of personalized medicine, thanks to pharmacogenomics and targeted therapies (31–33). Likewise, great efforts have been made to develop new predictive biomarkers to personalize therapy to individuals (34, 35). In this work, we used known biomarkers together with probabilistic nonlinear mixed-effects models (NLME) within a framework to improve personalized disease monitoring.

The disease we have investigated together with the current monitoring strategy would traditionally be viewed as challenging for prediction due to the heterogeneity in response and highly dynamic nature of SCLC (36). On a conceptual level, the predictive capability of the model may be restricted by the ability to infer underlying disease processes from the available data. However, our success with predicting SCLC outcome suggests that the integrated NLME approach (combining individual biomarker and scan data) may have wider utility to other cancers even when available biomarkers have not historically been considered as predictive markers. In fact, NLME models have been used in the past to describe circulating biomarker dynamics and clinical outcome in different cancer indications. Some examples include the use of mathematical models in prostate cancer to personalize vaccination regimens to stabilize prostate-specific antigen levels (37, 38), the use of CA125 as a predictive marker of ovarian cancer (39), the kinetic modeling of human chorionic gonadotropin as an early predictor of methotrexate resistance in low-risk gestational trophoblastic neoplasia patients (40), and the use of soluble VEGF receptor 3 to monitor adverse events and clinical response in patients with imatinib-resistant gastrointestinal stromal tumors (41, 42). A key strength of NLME models is that they provide a description of individual variability in population level trends. NLME modeling also provides a rigorous statistical basis to individual prediction using Bayesian methodology, so that information from available individual level observations is balanced with knowledge and uncertainty in population level trends (43, 44).

The external dataset was used to evaluate the predictive accuracy of the previously reported biomarker model (27). Interestingly, NSE values in Table 1 are lower in the external dataset compared with the training dataset. This is likely due to differences in diagnostic techniques used between the two datasets leading to the Will Rogers phenomenon (45), where stage migration due to new methods of diagnosis leads to a systemic bias in measures of central tendency. Patients in the training dataset were staged using imaging techniques while patients included in the external dataset (more recently diagnosed) were staged according to the PET (Positron emission tomography) technique. Due to the highly metabolic nature of SCLC, it has been shown that the use of PET can increase staging accuracy, resulting in a higher percentage of patients upstaged from limited- to extensive-disease compared with patients downstaged from extensive- to limited-disease (46). Nevertheless, the biomarker and the RECIST model were capable to describe observations from the external dataset.

After successful external validation of the combined biomarker/logistic model, we investigated the feasibility of using data up to end of first-line treatment to obtain early prediction of disease progression in follow-up visits. Although we defined thresholds for “high confidence” P(DP) predictions to assess overall predictive accuracy in our external dataset, we do not recommend establishing arbitrary thresholds, such as P(DP)>80%, in clinical practice. This is because personalized dosing decisions should account for all available individual information, including patient characteristics, available treatments, and their expected tolerability and efficacy. We therefore envisage quantitative prediction of probability of progression as being one factor for oncologists to consider.

The surprising result of this investigation was that biomarker data gathered only during the treatment period was sufficient to provide reliable predictions for 75% of follow-up tumor assessment results. Moreover, disease progression was confidently predicted at the end of treatment for two out of the four cases present in the external dataset. Although the sample size here is limited, this suggests that the approach can be used to optimize the timing of commencement of second-line therapy in a significant proportion of patients. As an example, ID patient 107 completed first-line treatment on week 13 and was detected to be in disease progression in week 24. As shown in Fig. 5, our framework could have confidently predicted this outcome from the treatment-only data. Figure 6 corroborates this by showing that the model-predicted disease profile was increasing up to 13 weeks prior to the follow-up CT scan, a period where the patient was receiving no treatment. These model predictions could have been made at the end of first-line treatment, providing oncologists with the option to continue first-line therapy or commence second-line therapy immediately at the end of first-line treatment for this particular patient.

Figure 6.

Predicted disease profile for representative patient who had high predicted probability of progression at the first follow-up CT scan (ID patient 107). Shaded area represents posterior distribution of disease predictions. Treatment for this patient was given between time 0 and 13 weeks. Vertical lines represent CT scans (week 7, CT scan during treatment; week 13, CT scan at the end of first-line treatment; week 24, first follow-up CT scan).

Figure 6.

Predicted disease profile for representative patient who had high predicted probability of progression at the first follow-up CT scan (ID patient 107). Shaded area represents posterior distribution of disease predictions. Treatment for this patient was given between time 0 and 13 weeks. Vertical lines represent CT scans (week 7, CT scan during treatment; week 13, CT scan at the end of first-line treatment; week 24, first follow-up CT scan).

Close modal

For the 25% of assessments where P(DP) were not deemed to be of “high confidence” (i.e., 20% < P(DP) < 80%), the framework fails to provide reliable predictions of outcome. However, physicians can still derive relevant information from the model: there is insufficient individual-level information for these patients and therefore these patients should be considered “at risk.” Intensive monitoring plans can then be followed to gather more individual level information. For example, for ID 100 (Fig 5), who progressed at the first follow-up scan, the uncertainty intervals indicate that the true P(DP) may be above 80%, so the decision could be taken to continue weekly or biweekly visits for additional biomarker monitoring as a precautionary measure. Another example of a “non-high confidence” prediction is patient 104, who had an expected P(DP) at second follow-up scan of 30.3% and progressed at this scan. Although this P(DP) does not fall into the P(DP)>80% range, 30.3% is not a small probability of progression and therefore this patient should also be considered at risk and monitored carefully. It is the opinion of the authors that although many patients were predicted not to progress with high confidence, a reduced monitoring schedule relative to existing practices would not be advisable due to the reliance of the RECIST criteria on an approximately fixed interval between CT scans.

On a statistical level, the main distinction between the proposed framework and traditional binary classification systems is that the output is a continuous probability measure associated with a binary state rather than a binary state itself. The consequence of unreliable predictions in binary classification systems is the potential for inappropriate treatment decisions for individuals. As discussed, though the primary strength of the proposed framework is the ability to identify a priori individuals with reliable/unreliable predictions and tailor treatment/monitoring strategies as appropriate, providing value to physicians even when there is significant individual uncertainty in underlying predictor variables.

Limitations

Several limitations have already been stated in this discussion section and in the discussion section of Buil-Bruna and colleagues (27). All limitations applying to the previously reported biomarker model also apply here. Briefly, the limited numbers of patients available for the training dataset means that the ability to accurately quantify interpatient variability effects is reduced. Individual prediction in some instances may therefore be limited if the patient is atypical of individuals in the training dataset. A K-PD model was used to describe the relationship between treatment and drug effects due to the lack of pharmacokinetic information. The absence of PK information likely hampers the interpretation of the variability seen in the response. There was also insufficient information to incorporate differences in radiotherapy administration into the model.

In this work, the latent disease variable representing tumor size dynamics that was used to predict disease progression was inferred from two biomarkers. Although our results suggest that predictive biomarkers allow inference of the latent disease variable, actual tumor size measurements, when available, should be more informative. In fact, a recent review article summarizes several population models describing tumor growth dynamics using tumor size measurement data (47). However, medical records of the SCLC patients did not include exact tumor size measurements, which is a frequent occurrence with routinely collected medical data. In addition, there is a high degree of heterogeneity in the number, time, and frequency of samples (biomarker and CT scans) between patients, which translates into heterogeneity in the precision of individual P(DP) predictions. The proposed framework, however, accounts for these differences by quantifying the uncertainty in P(DP) predictions.

Avenues for further research

Optimal design methodology (48) could be applied to identify the optimum number and time of measurements to narrow prediction uncertainty.

We modeled the RECIST endpoint as a binary variable for simplicity. However, the additional step of modeling the four different RECIST categories (i.e., complete response, partial response, stable disease, and disease progression) may further increase the individual predictive accuracy.

Efficacy and toxicity associated with second-line therapies have not been formally considered in this analysis. Therefore, it is currently not possible to quantify what early commencement of second-line therapy would mean in terms of expected patient outcome (e.g., increased survival times or remission rates). In order to quantify the added value of early treatment in terms of measureable endpoints, a model-based analysis of efficacy and toxicity endpoints from second-line therapy would be required.

We recommend the use of the proposed prediction framework to support proactive personalized medicine in SCLC patients during and following first-line treatment. Development of the model-based framework required only patient medical record data and did not require commissioning of a new study. The feasibility of the approach should be investigated with other cancers.

No potential conflicts of interest were disclosed.

This work does not necessarily represent the view of all DDMoRe partners.

Conception and design: N. Buil-Bruna, T. Sahota, J.-M. López-Picazo, B. Ribba, I.F. Trocóniz

Development of methodology: N. Buil-Bruna, J.-M. López-Picazo, B. Ribba, I.F. Trocóniz

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): J.-M. López-Picazo, M. Moreno-Jiménez, S. Martín-Algarra

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): N. Buil-Bruna, T. Sahota, B. Ribba, I.F. Trocóniz

Writing, review, and/or revision of the manuscript: N. Buil-Bruna, T. Sahota, J.-M. López-Picazo, M. Moreno-Jiménez, S. Martín-Algarra, B. Ribba, I.F. Trocóniz

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): N. Buil-Bruna, I.F. Trocóniz

Study supervision: J.-M. López-Picazo, S. Martín-Algarra, I.F. Trocóniz

The authors thank M.C. Martínez-Cosgaya, I. Echenique-Gubía. and M.J. Martínez Alvarez-Nava for their kind help with data collection from University Clinic of Navarra.

N. Buil-Bruna was supported by a predoctoral fellowship from Asociación de Amigos de la Universidad de Navarra. This work was supported by the Innovative Medicines Initiative Joint Undertaking under grant agreement no. 115156, resources of which are composed of financial contributions from the European Union's Seventh Framework Programme (FP7/2007-2013) and EFPIA companies' in-kind contribution. The DDMoRe project is also supported by financial contribution from academic and SME partners.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Ferlay
J
,
Steliarova-Foucher
E
,
Lortet-Tieulent
J
,
Rosso
S
,
Coebergh
J
,
Comber
H
, et al
Cancer incidence and mortality patterns in europe: estimates for 40 countries in 2012
.
Eur J Cancer
2013
;
49
:
1374
403
.
2.
Ferlay
J
,
Soerjomataram
I
,
Dikshit
R
,
Eser
S
,
Mathers
C
,
Rebelo
M
, et al
Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012
.
Int J Cancer
2015
;
136
:
E359
86
.
3.
Siegel
R
,
Naishadham
D
,
Jemal
A
. 
Cancer statistics, 2013
.
CA Cancer J Clin
2013
;
63
:
11
30
.
4.
Govindan
R
,
Page
N
,
Morgensztern
D
,
Read
W
,
Tierney
R
,
Vlahiotis
A
, et al
Changing epidemiology of small-cell lung cancer in the united states over the last 30 years: analysis of the surveillance, epidemiologic, and end results database
.
J Clin Oncol
2006
;
24
:
4539
44
.
5.
Janssen-Heijnen
ML
,
Coebergh
JW
. 
The changing epidemiology of lung cancer in Europe
.
Lung Cancer
2003
;
41
:
245
58
.
6.
Lally
BE
,
Urbanic
JJ
,
Blackstock
AW
,
Miller
AA
,
Perry
MC
. 
Small cell lung cancer: have we made any progress over the last 25 years?
Oncologist
2007
;
12
:
1096
104
.
7.
Small cell lung cancer treatment [Internet]
.
[updated 2014 Aug 6; cited 2014 Dec 11]. Available from
: http://www.cancer.gov/cancertopics/pdq/treatment/small-cell-lung/healthprofessional/page6.
8.
Sgambato
A
,
Casaluce
F
,
Maione
P
,
Rossi
A
,
Sacco
PC
,
Panzone
F
, et al
Medical treatment of small cell lung cancer: state of the art and new development
.
Expert Opin Pharmacother
2013
;
14
:
2019
31
.
9.
Fruh
M
,
De Ruysscher
D
,
Popat
S
,
Crino
L
,
Peters
S
,
Felip
E
, et al
Small-cell lung cancer (SCLC): ESMO clinical practice guidelines for diagnosis, treatment and follow-up
.
Ann Oncol
2013
;
24
(
Suppl 6
):
vi99
105
.
10.
Paesmans
M
,
Sculier
JP
,
Lecomte
J
,
Thiriaux
J
,
Libert
P
,
Sergysels
R
, et al
Prognostic factors for patients with small cell lung carcinoma: analysis of a series of 763 patients included in 4 consecutive prospective trials with a minimum follow-up of 5 years
.
Cancer
2000
;
89
:
523
33
.
11.
Zelen
M
. 
Keynote address on biostatistics and data retrieval
.
Cancer Chemother Rep
1973
;
4
:
31
42
.
12.
Spiegelman
D
,
Maurer
LH
,
Ware
JH
,
Perry
MC
,
Chahinian
AP
,
Comis
R
, et al
Prognostic factors in small-cell carcinoma of the lung: an analysis of 1,521 patients
.
J Clin Oncol
1989
;
7
:
344
54
.
13.
Gronowitz
JS
,
Bergström
R
,
Noˇu
E
,
Påhlman
S
,
Brodin
O
,
Nilsson
S
, et al
Clinical and serologic markers of stage and prognosis in small cell lung cancer. a multivariate analysis
.
Cancer
1990
;
66
:
722
32
.
14.
Siu
LL
,
Shepherd
FA
,
Murray
N
,
Feld
R
,
Pater
J
,
Zee
B
. 
Influence of age on the treatment of limited-stage small-cell lung cancer
.
J Clin Oncol
1996
;
14
:
821
8
.
15.
Eisenhauer
E
,
Therasse
P
,
Bogaerts
J
,
Schwartz
L
,
Sargent
D
,
Ford
R
, et al
New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1)
.
Eur J Cancer
2009
;
45
:
228
47
.
16.
National comprehensive cancer network
. 
Practice guidelines for small-cell lung cancer
.
Rockledge, PA
:
National Comprehensive Cancer Network
; 
2000
.
17.
Sugiyama
T
,
Hirose
T
,
Hosaka
T
,
Kusumoto
S
,
Nakashima
M
,
Yamaoka
T
, et al
Effectiveness of intensive follow-up after response in patients with small cell lung cancer
.
Lung Cancer
2008
;
59
:
255
61
.
18.
Diamandis
EP
. 
Cancer biomarkers: can we turn recent failures into success?
J Natl Cancer Inst
2010
;
102
:
1462
7
.
19.
Cerny
T
,
Anderson
H
,
Bramwell
V
,
Thatcher
N
,
Blair
V
. 
Pretreatment prognostic factors and scoring system in 407 small-cell lung cancer patients
.
Int J Cancer
1987
;
39
:
146
9
.
20.
Molina
R
,
Filella
X
,
Auge
JM
. 
ProGRP: a new biomarker for small cell lung cancer
.
Clin Biochem
2004
;
37
:
505
11
.
21.
Holdenrieder
S
,
von Pawel
J
,
Dankelmann
E
,
Duell
T
,
Faderl
B
,
Markus
A
, et al
Nucleosomes, ProGRP, NSE, CYFRA 21-1, and CEA in monitoring first-line chemotherapy of small cell lung cancer
.
Clin Cancer Res
2008
;
14
:
7813
21
.
22.
Stovold
R
,
Blackhall
F
,
Meredith
S
,
Hou
J
,
Dive
C
,
White
A
. 
Biomarkers for small cell lung cancer: neuroendocrine, epithelial and circulating tumour cells
.
Lung Cancer
2012
;
76
:
263
8
.
23.
Harmsma
M
,
Schutte
B
,
Ramaekers
F
. 
Serum markers in small cell lung cancer: opportunities for improvement
.
Biochim Biophys Acta
2013
;
1836
:
255
72
.
24.
Agur
Z
,
Elishmereni
M
,
Kheifetz
Y
. 
Personalizing oncology treatments by predicting drug efficacy, side-effects, and improved therapy: mathematics, statistics, and their integration
.
Wiley Interdiscip Rev Syst Biol Med
2014
;
6
:
239
53
.
25.
Almufti
R
,
Wilbaux
M
,
Oza
A
,
Henin
E
,
Freyer
G
,
Tod
M
, et al
A critical review of the analytical approaches for circulating tumor biomarker kinetics during treatment
.
Ann Oncol
2014
;
25
:
41
56
.
26.
Keizer
RJ
,
Schellens
JH
,
Beijnen
J
,
Huitema
AD
. 
Pharmacodynamic biomarkers in model-based drug development in oncology
.
Curr Clin Pharmacol
2011
;
6
:
30
40
.
27.
Buil-Bruna
N
,
López-Picazo
J
,
Moreno-Jiménez
M
,
Martín-Algarra
S
,
Ribba
B
,
Trocóniz
IF
. 
A population pharmacodynamic model for lactate dehydrogenase and neuron specific enolase to predict tumor progression in small cell lung cancer patients
.
AAPS J
2014
;
16
:
609
19
.
28.
Bauer
R
. 
NONMEM users guide introduction to NONMEM 7.2. 0. ICON Development Solutions Ellicott City, MD;
2011
.
29.
Bergstrand
M
,
Hooker
AC
,
Wallin
JE
,
Karlsson
MO
. 
Prediction-corrected visual predictive checks for diagnosing nonlinear mixed-effects models
.
AAPS J
2011
;
13
:
143
51
.
30.
Lindbom
L
,
Pihlgren
P
,
Jonsson
N
. 
PsN-toolkit—a collection of computer intensive statistical methods for non-linear mixed effect modeling using NONMEM
.
Comput Methods Programs Biomed
2005
;
79
:
241
57
.
31.
Chan
IS
,
Ginsburg
GS
. 
Personalized medicine: progress and promise
.
Annu Rev Genomics Hum Genet
2011
;
12
:
217
44
.
32.
Crews
KR
,
Hicks
JK
,
Pui
C
,
Relling
MV
,
Evans
WE
. 
Pharmacogenomics and individualized medicine: translating science into practice
.
Clin Pharmacol Ther
2012
;
92
:
467
75
.
33.
Evans
WE
,
Relling
MV
. 
Moving towards individualized medicine with pharmacogenomics
.
Nature
2004
;
429
:
464
8
.
34.
Kalia
M
. 
Personalized oncology: recent advances and future challenges
.
Metab Clin Exp
2013
;
62
:
S11
4
.
35.
Duffy
MJ
,
O'Donovan
N
,
Crown
J
. 
Use of molecular markers for predicting therapy response in cancer patients
.
Cancer Treat Rev
2011
;
37
:
151
9
.
36.
van Meerbeeck
JP
,
Fennell
DA
,
De Ruysscher
DK
. 
Small-cell lung cancer
.
Lancet
2011
;
378
:
1741
55
.
37.
Kogan
Y
,
Halevi–Tobias
K
,
Elishmereni
M
,
Vuk-Pavlović
S
,
Agur
Z
. 
Reconsidering the paradigm of cancer immunotherapy by computationally aided real-time personalization
.
Cancer Res
2012
;
72
:
2218
27
.
38.
Kronik
N
,
Kogan
Y
,
Elishmereni
M
,
Halevi-Tobias
K
,
Vuk-Pavlović
S
,
Agur
Z
. 
Predicting outcomes of prostate cancer immunotherapy by personalized mathematical models
.
PLoS ONE
2010
;
5
:
e15482
.
39.
Wilbaux
M
,
Hénin
E
,
Oza
A
,
Colomban
O
,
Pujade-Lauraine
E
,
Freyer
G
, et al
Dynamic modeling in ovarian cancer: an original approach linking early changes in modeled longitudinal CA-125 kinetics and survival to help decisions in early drug development
.
Gynecol Oncol
2014
;
133
:
460
6
.
40.
You
B
,
Harvey
R
,
Henin
E
,
Mitchell
H
,
Golfier
F
,
Savage
P
, et al
Early prediction of treatment resistance in low-risk gestational trophoblastic neoplasia using population kinetic modelling of hCG measurements
.
Br J Cancer
2013
;
108
:
1810
6
.
41.
Hansson
E
,
Ma
G
,
Amantea
M
,
French
J
,
Milligan
P
,
Friberg
L
, et al
PKPD modeling of predictors for adverse effects and overall survival in sunitinib-treated patients with GIST
.
CPT Pharmacometrics Syst Pharmacol
2013
;
2
:
e85
.
42.
Hansson
E
,
Amantea
M
,
Westwood
P
,
Milligan
P
,
Houk
B
,
French
J
, et al
PKPD modeling of VEGF, sVEGFR-2, sVEGFR-3, and sKIT as predictors of tumor dynamics and overall survival following sunitinib treatment in GIST
.
CPT Pharmacometrics Syst Pharmacol
2013
;
2
:
e84
.
43.
Davidian
M
,
Giltinan
DM
. 
Nonlinear models for repeated measurement data
.
New York
:
Chapman & Hall
; 
1995
.
44.
Gelman
A
,
Carlin
JB
,
Stern
HS
,
Dunson
DB
,
Vehtari
A
,
Rubin
DB
. 
Bayesian data analysis
. 2nd ed.
New York
:
Chapman & Hall
; 
2004
.
45.
Feinstein
AR
,
Sosin
DM
,
Wells
CK
. 
The Will Rogers phenomenon: stage migration and new diagnostic techniques as a source of misleading statistics for survival in cancer
.
N Engl J Med
1985
;
312
:
1604
8
.
46.
Kalemkerian
GP
. 
Staging and imaging of small cell lung cancer
.
Cancer Imaging
2012
;
11
:
253
8
.
47.
Ribba
B
,
Holford
N
,
Magni
P
,
Trocóniz
I
,
Gueorguieva
I
,
Girard
P
, et al
A review of mixed-effects models of tumor growth and effects of anticancer drug treatment used in population analysis
.
CPT Pharmacometrics Syst Pharmacol
2014
;
3
:
e113
.
48.
Fedorov
VV
. 
Theory of optimal experiments
.
New York: Academic Press
; 
1972
.

Supplementary data