Experimental evidence indicates that tobacco smoke acts both as an initiator and a promoter in lung carcinogenesis. We used the two-stage clonal expansion model incorporating the ideas of initiation, promotion, and malignant conversion to analyze lung cancer mortality in three large cohorts, the British Doctors' cohort and the two American Cancer Society cohorts, to determine how smoking habits influence age-specific lung cancer rates via these mechanisms. Likelihood ratio tests indicate that smoking-related promotion is the dominant model mechanism associated with lung cancer mortality in all cohorts. Smoking-related initiation is less important than promotion but interacts synergistically with it. Although no information on ex-smokers is available in these data, the model with estimated variables can be used to project risks among ex-smokers. These projected risks are in good agreement with the risk among ex-smokers derived from other studies. We present 10-year projected risks for current and former smokers adjusted for competing causes of mortality. The importance of smoking duration on lung cancer risk in these cohorts is a direct consequence of promotion. Intervention and treatment strategies should focus on promotion as the primary etiologic mechanism in lung carcinogenesis.

The primary cause of lung cancer is exposure to tobacco smoke (1-3) followed (not necessarily in order) by radon, environmental and workplace carcinogens, nutrition, genetic susceptibility, and unknown factors that may be expressed as period and birth cohort effects.

Numerous carcinogenic components have been identified in the particulate and gas phases of cigarette smoke. Animal experimentation and other assays have thus far identified ∼40 human or animal carcinogens in the ≥4,000 chemical compounds that have been identified in tobacco smoke (3). Major lung carcinogens in smoke include some of the polycyclic aromatic hydrocarbons, such as benzo[a]pyrene, as well as some tobacco-specific N-nitrosamines. These chemicals, and perhaps other compounds that have not been identified, may act synergistically as initiators, promoters, or both (4, 5).

The goal of this study is to analyze lung cancer mortality in three cohorts, the British Doctors' cohort and the two American Cancer Society cohorts, with detailed information on cigarette smoking habits. For the analyses, we used a multistage model for carcinogenesis that incorporates the biological notions of initiation, promotion, and malignant conversion. Smoking may influence any of the biological processes in this model. Maximum likelihood methods are used to optimize the models to the cohort data, determine which processes are influenced by smoking, and find the corresponding dose-response relationships for the affected processes. The estimated model variables are used to predict 10-year lung cancer risks for hypothetical smoking scenarios, including the risk among smokers who quit. Thus, the models can be thought of as analogues for lung cancer of the Gail model (6) for breast cancer.

Subjects

The two American Cancer Society Cancer Prevention Study (CPS) cohorts, CPS-I and CPS-II, each included >1 million individuals. The CPS-I cohort was followed for 12 years beginning in 1959. The CPS-I data used in this study consist of White male and female current and never smokers as reported in the Smoking and Tobacco Control, Monograph 8, Chapter 3, Appendix B (7). The data provide information by age and duration by 5-year bins and by smoking rate as “never smokers,” 1 to 9, 10 to 19, 20, 21 to 39, and ≥40 cigarettes per day bins. The analysis of CPS-I White males included 960,220 person-years with 215 lung cancers among never smokers and 1,669,126 person-years and 3,437 lung cancers among current smokers. The analysis of CPS-I White females included 4,030,263 person-years with 573 lung cancers among never smokers and 1,628,783 person-years with 600 lung cancers among current smokers.

The CPS-II study began recruitment in 1982. We analyzed CPS-II data with 6-year follow-up as tabulated in the Smoking and Tobacco Control, Monograph 8, Chapter 4, Appendix 5 (7). The data for White male and female current and former smokers excluded persons with prevalent cancer other than nonmelanoma skin cancer. (This differed slightly from the CPS-I subcohort where all White males and females were included.) The available data are limited to individuals ages ≥50 years in 5-year age bins, including never smokers, and current smokers restricted to either 20 or 40 cigarettes per day and at least 30 years' duration by 5-year bins. The analysis of CPS-II White males included 559,156 person-years and 76 lung cancers among never smokers and 179,689 person-years and 734 lung cancers among current smokers. The analysis of CPS-II White females included 1,475,943 person-years and 192 lung cancer deaths among never smokers and 165,057 person-years and 374 lung cancers among current smokers. The British Doctors' cohort (8, 9) consisted of 34,439 male doctors who replied to a postal questionnaire in 1951. This cohort has been followed by Doll et al. for 50 years (9). However, we only had access to categorical data presented at 20 years' follow-up for mortality from lung cancer and other causes (8). The data are binned by 5-year age groups and smoking intensity for current smokers categorized as never smokers and current smokers of 1 to 4, 5 to 9, …, 30 to 34, and 35 to 40 cigarettes per day. As the age at initiation of smoking was unavailable for the British Doctors, we assumed it to be age 19 years as in Doll and Peto (8) and Moolgavkar et al. (10). The sensitivity of the model fit to this choice of initiation age was checked by systematically varying the age by 1-year increments. The consistency with CPS-I also suggests that this assumption is reasonable. The analysis of male British Doctors included 69,908 person-years and 6 lung cancers among never smokers and 109,353 person-years and 195 lung cancers among current smokers. The data were not pooled across cohorts in any of our analyses.

The three cohorts combined include 3,547,452 person-years of observation and 4,663 deaths from lung cancer among males and 7,300,046 person-years of observation and 1,739 deaths from lung cancer among females. The available CPS-I data are more informative than for the other cohorts because of more person-years factored into more extensive age, dose, and duration categories and longer follow-up than for CPS-II. The results presented here focus more on the male cohorts where variable estimates are more tightly constrained because of the larger fraction of smokers, higher average consumption per smoker, and greater number of lung cancers. We were also concerned that there might be more reporting bias for females than for males in the early cohorts when smoking by females was less socially acceptable.

A limitation of these data is that no information on ex-smokers is available. Despite this, we are able to make projections of lung cancer risk among smokers who quit as is described below.

Two-Stage Clonal Expansion Model

We use a model that explicitly acknowledges three distinct phases in the process of carcinogenesis. In the first phase (initiation), a susceptible stem cell acquires one or more mutations resulting in an initiated cell, which has partially escaped growth control. The second phase (promotion) is the clonal expansion of initiated cells. Promotion is an extremely efficient way of bringing about carcinogenesis because clonal expansion results in increased populations of cells that have already acquired some of the genetic alterations on the pathway to malignancy. Finally, in the last phase (malignant conversion), one of the initiated cells acquires the further genetic changes required to convert it into a malignant cell. There is considerable evidence that most human malignancies go through these three phases. Environmental agents, such as radiation and tobacco smoke, influence carcinogenesis via their effects on one or more phases of the process. The two-stage clonal expansion (TSCE) model is the simplest stochastic mathematical model of the initiation, promotion, and malignant conversion paradigm of carcinogenesis (11-16), with the “stages” corresponding to the “mutational” phases of initiation and malignant conversion.

The hazard (incidence) function generated by the TSCE model and its use for analyses of epidemiologic data have been described in several publications (12-16). In an earlier analysis of the British Doctors' data (10), we used a mathematically tractable approximation to the hazard function. We believe now, however, that the approximate form of the hazard function can yield misleading results and should not be used. One specific property of the exact hazard function, not possessed by commonly used approximations (17-19), is particularly relevant to the analyses here and that is the following. After exposure to a carcinogen ends, for example, after smokers quit, the exact hazard function ultimately approaches the hazard function in the unexposed. That is, even without invoking tissue repair, the incidence of lung cancer among former smokers would be expected to approach the incidence in nonsmokers. It is this property of the hazard function that allows us to predict risk among ex-smokers, although in the cohorts analyzed here we have information only on nonsmokers and continuing smokers.

The key variables of the model are as follows: ν is the rate of initiation, α and β are the rates of cell division and apoptosis of initiated cells, respectively, and μ is the rate of malignant conversion of initiated cells. In fitting the model to data, we use the variables ν, μ, α, and αβ, the net rate of promotion (see Table 1). The effect of smoking on cancer rates is modeled via a dose-response function on each of these variables. Progression from the first malignant cell to death from lung cancer is modeled either as a constant lag time or as a two-variable Γ function to represent the distribution of lag times from first malignant cell to death from lung cancer in the population. Likelihood construction and variable identifiability are discussed in recent articles (16, 20). Likelihood ratio tests are used to determine the importance of individual variables. Profile likelihood-based 95% confidence intervals (95% CI) were consistent with those constructed by Markov chain Monte Carlo (MCMC) methods using the Metropolis-Hastings algorithm.

Table 1.

Reduced model dose-response applied to all cohorts

Initial scale   
    X = 107  Assume 107 normal stem cells in both lungs* 
    ν0 = μ0  Equate background initiation, malignant conversion rates* 
Background variables for cohort   
Rate Variable  
    α0 p1 Background or nonsmoking initiated cell division rate* (per cell per year) 
    g0 = α0β0μ0 p2 Background or nonsmoking net initiated cell promotion rate (per cell per year) 
    ν0 = μ0 p3 Background or nonsmoking initiation, malignant conversion* (per cell per year) 
Reduced model dose-response for cohort   
    νi = ν0(1 + p4), p4 = 0 for nonsmokers  Initiation rate* (per cell per year) 
    gi = g0(1 + p5dip6 Net initiated cell promotion rate (per cell per year) 
    αi = α0(1 + p5dip6 Initiated cell division rate* (per cell per year) 
    μi = μ0  Malignant conversion, no dose-response* (per cell per year) 
Initial scale   
    X = 107  Assume 107 normal stem cells in both lungs* 
    ν0 = μ0  Equate background initiation, malignant conversion rates* 
Background variables for cohort   
Rate Variable  
    α0 p1 Background or nonsmoking initiated cell division rate* (per cell per year) 
    g0 = α0β0μ0 p2 Background or nonsmoking net initiated cell promotion rate (per cell per year) 
    ν0 = μ0 p3 Background or nonsmoking initiation, malignant conversion* (per cell per year) 
Reduced model dose-response for cohort   
    νi = ν0(1 + p4), p4 = 0 for nonsmokers  Initiation rate* (per cell per year) 
    gi = g0(1 + p5dip6 Net initiated cell promotion rate (per cell per year) 
    αi = α0(1 + p5dip6 Initiated cell division rate* (per cell per year) 
    μi = μ0  Malignant conversion, no dose-response* (per cell per year) 

NOTE: A full model was first tested where initiation, promotion, and malignant conversion are assumed to be potentially modified by smoking through flexible dose-response parameterization of the form: a(1 + bxc). Likelihood ratio tests were used to eliminate variables that were not significant for any of the cohorts, resulting in the reduced model.

*

Only the product νX is identifiable. Thus, without loss of generality, we assume X = 107 normal stem cells at risk. This number may be changed by reciprocal change in ν. A separate identifiability condition relates variables α, μ, and ν. Thus, without loss of generality, we set μ0 = ν0. These variables may be rescaled without changing the model fit. See ref. 20.

The TSCE model allows calculation of age-specific risk as a function of the smoking intensity (cigarettes per day) throughout life. The likelihood contribution made by each individual depends on the entire covariate history for that individual (i.e., age at start of smoking, daily intensity of smoking, and duration of smoking). If information on ex-smokers were available, the likelihood would depend in addition on age of quitting smoking and time since quitting smoking.

Ten-Year Risk Predictions

Models optimized for males and females in the CPS-I and CPS-II cohorts were used to predict 10-year risk estimates from dying of lung cancer. Competing causes of mortality were adjusted using standard actuarial methods for multiple decrement life tables. All cause annual risk estimates were extracted from the National Center for Health Statistics (21). We used the 1969 to 1971 age-specific and sex-specific life tables for CPS-I and the 1989 to 1991 life tables for CPS-II. 95% CIs were calculated by sampling model variables from a MCMC calculation using the Metropolis-Hastings algorithm.

The male and female cohorts from CPS-I and CPS-II and the male British Doctors' cohort were analyzed individually and jointly in all combinations. Each analysis began with a full model where the biological variables of the TSCE model were allowed to have a flexible dose-response pattern of the form: a(1 + bxc). Likelihood ratio tests were used to eliminate variables that were not significant for any cohort, resulting in a reduced model, shown in Table 1.

For each cohort fitted separately, the reduced model has six variables, of which three determine background rates and three determine smoking dose-response. The six-variable reduced model provides good fits to lung cancer among never and current smokers according to dose, duration, and age. Moreover, we are able to make predictions for former smokers. We also attempted to find a single model that would describe all three cohorts simultaneously; however, we were unable to do so. We did find, however, that a single model with a total of seven variables (three common dose-response and two common background variables; separate background α in the two cohorts) did an excellent job of describing the contemporaneous British Doctors' and CPS-I cohorts simultaneously. By way of comparison, a recent analysis of CPS-I and British Doctors' data by Knoke et al. used what they call an extended Doll and Peto model requiring four variables for each cohort to describe smokers plus two variables to describe CPS-I nonsmokers separately, with no predictive ability to describe the risk for ex-smokers (22). For the British Doctors' data, Knoke et al. fit lung cancer mortality only among continuing smokers. Thus, Knoke et al. require 10 variables to describe lung cancer among smokers in the two cohorts and lung cancer among CPS-I nonsmokers. Our combined model for the CPS-I and British Doctors' cohorts requires only seven variables to describe lung cancer among both smokers and nonsmokers in both cohorts. Moreover, as shown in Fig. 3, we are able to make predictions of risk among ex-smokers, which are in good agreement with the risks estimated in other studies.

The model fits to the different cohorts for both genders were generally similar in showing a significant dose-response for tobacco affecting promotion among all cohorts, a significant dose-response for initiation only in the contemporaneous CPS-I and British Doctors' cohorts, and a nonsignificant effect for malignant conversion in all cohorts. Most of the corresponding variable estimates were statistically indistinguishable in comparing the male CPS-I and British Doctors that were recruited around the same time. The male CPS-II cohort, recruited ∼20 years later, required different variables than for the earlier cohorts, with significantly increased dose-response for promotion compared with the other two male cohorts and nonsignificant dose-response for initiation. There are significant gender differences in each cohort. Females were qualitatively similar to males in that the CPS-II female cohort had higher dose-response estimates for promotion and small dose-response estimates for initiation, whereas the earlier CPS-I female cohort had significant dose-response for both initiation and promotion. Table 2 shows variable estimates and MCMC 95% CI for reduced model fits to males and females in each cohort. We calculated 95% CI for model variables based on MCMC runs of a million cycles for each cohort.

Table 2.

Reduced model variable estimates for males and females in the CPS-I and CPS-II cohorts and males in British Doctors' cohort [MLE (MCMC 95% CI)]

p1
p2
p3
p4
p5
p6
Background cell division rate (per cell per year)Background net proliferation rate (per cell per year)Background initiation and malignant conversion rate (per cell per year)Tobacco initiation rateTobacco promotion rate coefficientTobacco promotion rate power
CPS-I males 22.65 (17.35-31.31) 0.075 (0.065-0.085)* 1.40 × 10−7 (1.12 × 10−7-1.76 × 10−7)* 1.79 (1.11-3.07)* 0.21 (0.15-0.28)* 0.47 (0.42-0.54)* 
British Doctors 5.87 (2.66-12.89) 0.075 (0.065-0.085)* 1.40 × 10−7 (1.12 × 10−7-1.76 × 10−7)* 1.79 (1.11-3.07)* 0.21 (0.15-0.28)* 0.47 (0.42-0.54)* 
CPS-II males 7.70 (6.45-12.99) 0.090 (0.071-0.106) 7.16 × 10−8 (4.60 × 10−8-1.21 × 10−7∼0.00 (∼0.00-1.76) 0.60 (0.43-0.91) 0.22 (0.12-0.30) 
CPS-I females 71.56 (49.08-100.0) 0.086 (0.073-0.101) 8.93 × 10−8 (6.50 × 10−8-1.19 × 10−71.23 (0.32-2.80) 0.04 (0.02-0.07) 0.98 (0.80-1.15) 
CPS-II females 15.82 (13.39-42.12) 0.071 (0.055-0.088) 1.07 × 10−7 (6.97 × 10−8-1.62 × 10−70.02 (∼0.00-12.50) 0.50 (0.27-0.86) 0.32 (0.14-0.40) 
p1
p2
p3
p4
p5
p6
Background cell division rate (per cell per year)Background net proliferation rate (per cell per year)Background initiation and malignant conversion rate (per cell per year)Tobacco initiation rateTobacco promotion rate coefficientTobacco promotion rate power
CPS-I males 22.65 (17.35-31.31) 0.075 (0.065-0.085)* 1.40 × 10−7 (1.12 × 10−7-1.76 × 10−7)* 1.79 (1.11-3.07)* 0.21 (0.15-0.28)* 0.47 (0.42-0.54)* 
British Doctors 5.87 (2.66-12.89) 0.075 (0.065-0.085)* 1.40 × 10−7 (1.12 × 10−7-1.76 × 10−7)* 1.79 (1.11-3.07)* 0.21 (0.15-0.28)* 0.47 (0.42-0.54)* 
CPS-II males 7.70 (6.45-12.99) 0.090 (0.071-0.106) 7.16 × 10−8 (4.60 × 10−8-1.21 × 10−7∼0.00 (∼0.00-1.76) 0.60 (0.43-0.91) 0.22 (0.12-0.30) 
CPS-I females 71.56 (49.08-100.0) 0.086 (0.073-0.101) 8.93 × 10−8 (6.50 × 10−8-1.19 × 10−71.23 (0.32-2.80) 0.04 (0.02-0.07) 0.98 (0.80-1.15) 
CPS-II females 15.82 (13.39-42.12) 0.071 (0.055-0.088) 1.07 × 10−7 (6.97 × 10−8-1.62 × 10−70.02 (∼0.00-12.50) 0.50 (0.27-0.86) 0.32 (0.14-0.40) 

NOTE: Estimates for net cell proliferation rates are tightly constrained, whereas estimates of cell division rates are not due to compensatory cell death.

*

Shared variables in joint fit to White males in CPS-I and British Doctors' cohorts.

We found smoking-related promotion to be the most significant mechanism associated with age-specific lung cancer mortality in all cohorts. Promotion consists of two components, the net cell proliferation rate and the division rate for initiated stem cells. In all cohorts, except for CPS-I females, both the net proliferation rate and the division rate of initiated cells were found to have significant nonlinear (concave downward) dose-response as a function of increasing smoking rate. A joint model for males in CPS-I and the British Doctors using the same dose-response, but differing only in the background cell division rates, provided excellent descriptions of lung cancer mortality in each cohort as judged by fits to the data binned by age, dose, and duration. The higher background rate of division of initiated cells in the CPS-I cohort implies a higher extinction rate for initiated cells, leading to a lower asymptotic hazard, and a slower increase in risk several decades after smoking begins. Dose-response estimates for net cell proliferation rates for CPS-II are elevated significantly above those of the earlier cohorts over the full smoking range, but estimates of initiation are significantly lower, with no significant dose-response for initiation in CPS-II. These dose-response curves along with 95% MCMC CIs are shown in the Appendix.

Fits by dose, duration, and age and residual plots are shown for CPS-I White males in Figs. 1 and 2. The other cohorts were also fit well by the models as shown in the Appendix. The age-specific lung cancer risks binned by dose and duration for CPS-II are generally elevated above the risks for CPS-I at older ages.

Figure 1.

CPS-I White male mortality rates for never and current smokers with 12 years' follow-up by (A) dose and age and (B) duration and age. Solid line, MLE from joint reduced model fit to CPS-I and British Doctors; dashed lines, MCMC 95% CI; circles, ratio of observed lung cancer deaths to person-years at risk in 5-year age bins, with bars based on Poisson assumptions.

Figure 1.

CPS-I White male mortality rates for never and current smokers with 12 years' follow-up by (A) dose and age and (B) duration and age. Solid line, MLE from joint reduced model fit to CPS-I and British Doctors; dashed lines, MCMC 95% CI; circles, ratio of observed lung cancer deaths to person-years at risk in 5-year age bins, with bars based on Poisson assumptions.

Close modal
Figure 2.

Residuals for CPS-I White male never and current smokers by age and smoking intensity. The good fit of the model to the data is indicated by a lack of a significant trend as a function of age in each of the dose groups. Similar results are seen for residuals plotted by duration and age (data not shown).

Figure 2.

Residuals for CPS-I White male never and current smokers by age and smoking intensity. The good fit of the model to the data is indicated by a lack of a significant trend as a function of age in each of the dose groups. Similar results are seen for residuals plotted by duration and age (data not shown).

Close modal

Figure 3 shows predictions of the effects of smoking cessation based on the fit of the combined model for CPS-I and the British Doctors' cohorts. We assumed smoking began at age 19 years at either 20 or 40 cigarettes per day and compared the risk following stopping smoking at age 30 or 50 years to the risk for individuals continuing to smoke at the same rates. These results are seen to be in reasonably good agreement with relative risks for ex-smokers as a function of time since smoking cessation observed in Great Britain by Peto et al. (23) and in two large U.S. cohorts, the Nurses' Health Study of women and Health Professionals Study of men (24).

Figure 3.

Relative risks among ex-smokers predicted by our model fit to the British Doctors' cohort and comparison with relative risks in Great Britain estimated by Peto et al. (23). Predicted decrease in risk for smokers beginning to smoke at age 19 years and continuing at 20 cigarettes per day (dashed lines) or 40 cigarettes per day (solid lines) who quit smoking at either age 30 or 50 years relative to smokers continuing at the same intensity. Horizontal lines, observed risk ratios at various time intervals (<10, 10-19, 20-29, and >29 years) after quitting reported by Peto et al. based on combined information from national statistics and two large case-control studies (23). Note that our predictions are based on information on continuing and lifelong nonsmokers; we had no data on ex-smokers.

Figure 3.

Relative risks among ex-smokers predicted by our model fit to the British Doctors' cohort and comparison with relative risks in Great Britain estimated by Peto et al. (23). Predicted decrease in risk for smokers beginning to smoke at age 19 years and continuing at 20 cigarettes per day (dashed lines) or 40 cigarettes per day (solid lines) who quit smoking at either age 30 or 50 years relative to smokers continuing at the same intensity. Horizontal lines, observed risk ratios at various time intervals (<10, 10-19, 20-29, and >29 years) after quitting reported by Peto et al. based on combined information from national statistics and two large case-control studies (23). Note that our predictions are based on information on continuing and lifelong nonsmokers; we had no data on ex-smokers.

Close modal

Ten-year risk predictions with MCMC 95% CIs are shown in Table 3 for different smoking patterns among continuing smokers and for smokers who quit at the beginning of the 10-year risk-projection period. These calculations may overestimate the 10-year risk of dying from lung cancer for heavy smokers, because population-based annual life tables (21) were used to adjust for competing risk. (Separate life tables specific to different smoking levels were not available.) The calculation of risk for smokers who quit at the beginning of the 10-year interval was made by assuming dose-response functions return to background levels when smoking stops. Risk estimates have generally increased between CPS-I and CPS-II, with the largest increases seen among females. The risk estimates for CPS-II are somewhat less than recent estimates based on modeling using data from the Carotene and Retinol Efficacy Trial (25).

Table 3.

Ten-year risk projections for smokers who smoke for 25, 40, or 50 years and continue to smoke or quit at ages 55, 65, or 75 years based on models for White male and female smokers in the CPS-I and CPSII cohorts [% risk (95% CI)]

25 y
40 y
50 y
QuitStill smokingQuitStill smokingQuitStill smoking
CPS-I males: 1 pack daily smokers       
    55 0.7 (0.6-0.7) 0.8 (0.7-0.8) 1.8 (1.7-1.9) 1.9 (1.9-2.0) 
    65 1.1 (1.0-1.2) 1.2 (1.1-1.3) 2.4 (2.3-2.6) 2.6 (2.4-2.8) 3.9 (3.7-4.1) 4.2 (4.0-4.4) 
    75 1.4 (1.3-1.6) 1.5 (1.3-1.7) 2.7 (2.4-3.0) 2.8 (2.5-3.2) 3.9 (3.6-4.2) 4.1 (3.7-4.4) 
CPS-I males: 2 packs daily smokers       
    55 1.2 (1.1-1.2) 1.3 (1.2-1.4) 3.4 (3.2-3.6) 3.9 (3.6-4.1) 
    65 1.8 (1.6-1.9) 2.0 (1.8-2.2) 4.2 (3.9-4.6) 4.6 (4.2-5.0) 6.3 (5.8-6.8) 6.8 (6.3-7.4) 
    75 2.2 (1.9-2.4) 2.4 (2.1-2.6) 4.2 (3.6-4.8) 4.5 (3.9-5.1) 5.6 (5.0-6.3) 5.9 (5.3-6.6) 
CPS-II males: 1 pack daily smokers       
    55 0.9 (0.7-1.0) 1.0 (0.9-1.2) 2.9 (2.5-3.2) 3.3 (2.9-3.7) 
    65 1.7 (1.5-1.9) 2.0 (1.7-2.2) 4.7 (4.2-5.2) 5.3 (4.7-5.8) 6.6 (5.7-7.6) 7.2 (6.3-8.3) 
    75 2.6 (2.1-3.1) 2.9 (2.4-3.4) 5.7 (4.9-6.4) 6.2 (5.3-7.0) 6.9 (5.9 8.0) 7.4 (6.3-8.5) 
CPS-II males: 2 packs daily smokers       
    55 1.2 (1.1-1.4) 1.5 (1.3-1.7) 4.3 (3.7-4.8) 5.0 (4.3-5.6) 
    65 2.3 (1.9-2.7) 2.7 (2.3-3.2) 6.4 (5.6-7.1) 7.1 (6.3-8.0) 7.1 (6.4-9.4) 8.5 (7.0-10.4) 
    75 3.4 (2.7-4.0) 3.8 (3.1-4.5) 7.1 (5.9-8.1) 7.7 (6.4-8.8) 7.6 (6.2-9.2) 8.1 (6.6-9.9) 
CPS-I females: 1 pack daily smokers       
    55 0.4 (0.3-0.4) 0.4 (0.4-0.5) 0.8 (0.7-0.9) 0.8 (0.7-0.9) 
    65 0.6 (0.5-0.7) 0.7 (0.6-0.7) 1.0 (0.9-1.1) 1.1 (1.0-1.2) 1.4 (1.1-1.7) 1.5 (1.2-1.8) 
    75 0.7 (0.6-0.9) 0.8 (0.6-0.9) 1.1 (0.9-1.3) 1.1 (0.9-1.4) 1.3 (1.1-1.6) 1.4 (1.1-1.7) 
CPS-I females: 2 packs daily smokers       
    55 1.0 (0.8-1.2) 1.2 (0.9-1.4) 1.8 (1.4-2.2) 2.0 (1.5-2.5) 
    65 1.3 (1.1-1.6) 1.5 (1.2-1.8) 1.9 (1.4-2.4) 2.1 (1.6-2.7) 2.0 (1.3-2.8) 2.2 (1.5-3.1) 
    75 1.4 (1.0-1.7) 1.5 (1.1-1.9) 1.6 (1.2-2.2) 1.8 (1.3-2.4) 1.7 (1.1-2.4) 1.8 (1.2-2.6) 
CPS-II females: 1 pack daily smokers       
    55 0.6 (0.5-0.7) 0.7 (0.6-0.8) 1.8 (1.5-2.1) 2.0 (1.7-2.3) 
    65 1.1 (0.9-1.2) 1.2 (1.0-1.4) 2.6 (2.2-3.0) 2.9 (2.5-3.4) 3.8 (3.2-4.6) 4.2 (3.5-5.1) 
    75 1.5 (1.2-1.9) 1.7 (1.3-2.1) 3.2 (2.4-3.9) 3.5 (2.6-4.2) 4.2 (3.4-5.1) 4.5 (3.7-5.6) 
CPS-II females: 2 packs daily smokers       
    55 1.0 (0.8-1.2) 1.1 (0.9-1.4) 2.9 (2.3-3.5) 3.3 (2.6-4.0) 
    65 1.6 (1.2-2.0) 1.9 (1.4-2.3) 3.9 (3.1-4.7) 4.4 (3.5-5.3) 4.9 (3.8-6.5) 5.5 (4.2-7.2) 
    75 2.2 (1.6-2.7) 2.5 (1.7-3.2) 4.4 (3.2-5.6) 4.8 (3.5-6.2) 5.0 (3.8-6.7) 5.4 (4.1-7.3) 
25 y
40 y
50 y
QuitStill smokingQuitStill smokingQuitStill smoking
CPS-I males: 1 pack daily smokers       
    55 0.7 (0.6-0.7) 0.8 (0.7-0.8) 1.8 (1.7-1.9) 1.9 (1.9-2.0) 
    65 1.1 (1.0-1.2) 1.2 (1.1-1.3) 2.4 (2.3-2.6) 2.6 (2.4-2.8) 3.9 (3.7-4.1) 4.2 (4.0-4.4) 
    75 1.4 (1.3-1.6) 1.5 (1.3-1.7) 2.7 (2.4-3.0) 2.8 (2.5-3.2) 3.9 (3.6-4.2) 4.1 (3.7-4.4) 
CPS-I males: 2 packs daily smokers       
    55 1.2 (1.1-1.2) 1.3 (1.2-1.4) 3.4 (3.2-3.6) 3.9 (3.6-4.1) 
    65 1.8 (1.6-1.9) 2.0 (1.8-2.2) 4.2 (3.9-4.6) 4.6 (4.2-5.0) 6.3 (5.8-6.8) 6.8 (6.3-7.4) 
    75 2.2 (1.9-2.4) 2.4 (2.1-2.6) 4.2 (3.6-4.8) 4.5 (3.9-5.1) 5.6 (5.0-6.3) 5.9 (5.3-6.6) 
CPS-II males: 1 pack daily smokers       
    55 0.9 (0.7-1.0) 1.0 (0.9-1.2) 2.9 (2.5-3.2) 3.3 (2.9-3.7) 
    65 1.7 (1.5-1.9) 2.0 (1.7-2.2) 4.7 (4.2-5.2) 5.3 (4.7-5.8) 6.6 (5.7-7.6) 7.2 (6.3-8.3) 
    75 2.6 (2.1-3.1) 2.9 (2.4-3.4) 5.7 (4.9-6.4) 6.2 (5.3-7.0) 6.9 (5.9 8.0) 7.4 (6.3-8.5) 
CPS-II males: 2 packs daily smokers       
    55 1.2 (1.1-1.4) 1.5 (1.3-1.7) 4.3 (3.7-4.8) 5.0 (4.3-5.6) 
    65 2.3 (1.9-2.7) 2.7 (2.3-3.2) 6.4 (5.6-7.1) 7.1 (6.3-8.0) 7.1 (6.4-9.4) 8.5 (7.0-10.4) 
    75 3.4 (2.7-4.0) 3.8 (3.1-4.5) 7.1 (5.9-8.1) 7.7 (6.4-8.8) 7.6 (6.2-9.2) 8.1 (6.6-9.9) 
CPS-I females: 1 pack daily smokers       
    55 0.4 (0.3-0.4) 0.4 (0.4-0.5) 0.8 (0.7-0.9) 0.8 (0.7-0.9) 
    65 0.6 (0.5-0.7) 0.7 (0.6-0.7) 1.0 (0.9-1.1) 1.1 (1.0-1.2) 1.4 (1.1-1.7) 1.5 (1.2-1.8) 
    75 0.7 (0.6-0.9) 0.8 (0.6-0.9) 1.1 (0.9-1.3) 1.1 (0.9-1.4) 1.3 (1.1-1.6) 1.4 (1.1-1.7) 
CPS-I females: 2 packs daily smokers       
    55 1.0 (0.8-1.2) 1.2 (0.9-1.4) 1.8 (1.4-2.2) 2.0 (1.5-2.5) 
    65 1.3 (1.1-1.6) 1.5 (1.2-1.8) 1.9 (1.4-2.4) 2.1 (1.6-2.7) 2.0 (1.3-2.8) 2.2 (1.5-3.1) 
    75 1.4 (1.0-1.7) 1.5 (1.1-1.9) 1.6 (1.2-2.2) 1.8 (1.3-2.4) 1.7 (1.1-2.4) 1.8 (1.2-2.6) 
CPS-II females: 1 pack daily smokers       
    55 0.6 (0.5-0.7) 0.7 (0.6-0.8) 1.8 (1.5-2.1) 2.0 (1.7-2.3) 
    65 1.1 (0.9-1.2) 1.2 (1.0-1.4) 2.6 (2.2-3.0) 2.9 (2.5-3.4) 3.8 (3.2-4.6) 4.2 (3.5-5.1) 
    75 1.5 (1.2-1.9) 1.7 (1.3-2.1) 3.2 (2.4-3.9) 3.5 (2.6-4.2) 4.2 (3.4-5.1) 4.5 (3.7-5.6) 
CPS-II females: 2 packs daily smokers       
    55 1.0 (0.8-1.2) 1.1 (0.9-1.4) 2.9 (2.3-3.5) 3.3 (2.6-4.0) 
    65 1.6 (1.2-2.0) 1.9 (1.4-2.3) 3.9 (3.1-4.7) 4.4 (3.5-5.3) 4.9 (3.8-6.5) 5.5 (4.2-7.2) 
    75 2.2 (1.6-2.7) 2.5 (1.7-3.2) 4.4 (3.2-5.6) 4.8 (3.5-6.2) 5.0 (3.8-6.7) 5.4 (4.1-7.3) 

NOTE: Life tables are used to adjust for death from competing causes. Model-based 10-year risks are shown for each smoking pattern, with MCMC 95% CIs. Projections for individuals who quit smoking and continue to abstain for the following 10 years assume that model variables revert to background values following smoking cessation. Asterisks are placed in cells that correspond to unrealistically early ages for starting smoking. These rates are not generalizable and are probably lower than expected for the general population, as the cohorts were generally more educated and healthy, especially CPS-II where prevalent cancers were excluded.

Multistage Models

We are not suggesting that detailed biological mechanisms can be inferred from epidemiologic data using multistage carcinogenesis models. However, temporal patterns of risk are strongly influenced by whether mutation rates or rates of net cell proliferation are affected by an environmental agent. These temporal patterns, in turn, determine the maximum likelihood estimates (MLE) and therefore inferences regarding whether mutation rates or promotion or both are affected by an environmental agent. In our experience (26), both broad inferences regarding mechanism and the hazard functions are insensitive to choice of model, providing promotion is considered explicitly in the models.

For analyses of epidemiologic data, it is important to note that the exact hazard functions of rather general multistage models possess properties that are not reflected in the empirical hazard functions. We emphasize here that for these properties to hold the exact hazard function must be used; commonly used approximations, such as the power law approximation to the Armitage-Doll multistage model, do not possess these properties.

The first property is that the hazard tends to level off at old age. This is a prediction of the exact solution for any multistage model (17-19) and is in agreement with epidemiologic data for cancers of many types. Alternative explanations have been offered for the falloff in risk, including underascertainment of cancers among the elderly. Although ascertainment bias may contribute to an apparent leveling of risk at old age, the leveling is predicted by the multistage models.

A second property is that the hazard following cessation of exposure to a carcinogenic agent will eventually approach the hazard for individuals never exposed to the agent. Thus, the risk for ex-smokers should eventually approach that of never smokers, consistent with observed trends for ex-smokers. This prediction is based on the assumption that mutation and promotion rates return to background levels after smoking stops. The reversion of the hazard function for lung cancer among ex-smokers to nonsmoker levels is often attributed to repair of lung lesions. Repair may well play a role; however, this behavior of the hazard function is predicted by multistage carcinogenesis (17-19).

A third property is that carcinogenic agents that affect different steps in the multistage process lead to distinct patterns of risk following exposure. An agent that affects initiation, an early step, will show greater delay before an increase in risk than an agent that affects promotion or malignant conversion. Agents affecting promotion tend to give a prolonged “plateau” of increased risk. Agents that affect malignant conversion produce a relatively prompt increase in risk following exposure. Previous articles have examined the temporal evolution of risk following changes in intensity of exposure to carcinogens but only within the framework of the approximate form of the Armitage-Doll model (27-29).

Finally, multistage modeling allows analysis of data with complex exposure patterns to multiple agents that would be difficult or impossible using standard statistical techniques. The entire covariate history can be explicitly considered for exposure to multiple carcinogenic agents. Thus, for each agent, age at start of exposure, intensity of exposure, age at stop of exposure, and time since stop of exposure are all used for construction of the likelihood. An example of analyses of lung cancer mortality with complex exposure patterns to three lung carcinogens is provided by our recent analysis of the Chinese Tin Miners' data (26).

An understanding of the importance of different mechanisms in carcinogenesis may suggest better approaches to prevention, intervention, and treatment. This study identifies promotion as the dominant mechanism influencing tobacco-induced lung cancer risk in all cohorts. Based on this finding, it may be worth assessing if pharmacologic intervention could be targeted toward slowing cigarette-induced promotion among smokers. Recent analyses (13, 19) suggest that the antipromoting effects of cyclo-oxygenase-2 inhibitors in colon cancer is an efficient way to decrease risk even late in life.

It is certainly possible that some of the differences in risk we see between cohorts may be due to factors other than differences in reported smoking behavior or changing cigarette composition. For example, unreported differences in ramping-up during the uptake of cigarette smoking may have occurred between the earlier and the later cohorts. This could potentially influence risk patterns that we associate with a decreasing role of initiation and increasing role of promotion in conjunction with changes in cigarette composition.

Previous Analyses Using Multistage Models

The predictions in this article are based on a particular multistage model of carcinogenesis. Other forms of the model have been fitted to observational data for lung cancer, particularly the Armitage-Doll multistage model (29-35). Different models can offer similar statistical fits to observed data while having qualitatively different characteristics. Ideally, the selection of an epidemiologic model should be based on an understanding of the biology. For lung cancer, there is good evidence for initiation and promotion steps, suggesting the use of a model that includes clonal expansion. Further, we strongly advocate the use of exact solutions of whichever multistage model is used. As noted above and discussed in more detail elsewhere (16-19), commonly used approximations, such as the power law approximation to the Armitage-Doll model, have properties that are qualitatively quite different from those of the exact hazard functions.

In a recent article, Knoke et al. have suggested that the TSCE model provides a poor description of certain features of the data (22). There could be two reasons for their findings. First, they considered only the approximate version of the model, which was used by one of us in an article written ∼15 years ago. Since then, we have emphasized the importance of using the exact solution in several recent articles, some of them analyzing other lung cancer data sets. Second, Knoke et al. assume linear dose-response relationships for the variables of the model. In fact, in this article, we find that the dose-response relationship for promotion is strongly nonlinear. Analyses of residuals show that the TSCE model provides excellent descriptions of all three data sets considered here (see Fig. 2). Moreover, without information on ex-smokers, we are able to make predictions of risk among ex-smokers quitting at different ages that are in good qualitative agreement with the information available from other sources (refs. 9, 23, 24; see Fig. 3).

Mechanisms in Smoking-Related Lung Cancer

The analyses indicate that promotion is the most important mechanism (both in terms of statistical significance and attributable risk) in smoking-related lung cancer among males and females in all cohorts. Analysis of the earlier CPS-I and British Doctors' cohorts indicates that initiation is also significant, but with a smaller effect on lung cancer mortality, gaining importance primarily through a synergistic interaction with promotion. Analyses of the CPS-II male and female cohorts show an increased dose-response for promotion, with nonsignificant estimates for initiation, although an effect of initiation is not excluded. The difference in dose-response for initiation between CPS-I and CPS-II is significant (P = 0.01), even when background initiation rates and all other variables are allowed to differ. The apparent increase in the role of promotion and decrease in the role of initiation in the later cohorts may be associated with changes in cigarette composition.

We ran several sensitivity analyses to test if the shorter (6-year) follow-up for CPS-II, or the exclusion of prevalent cancers, could contribute to decreased power to detect a dose-response effect for initiation. We analyzed 6-year follow-up data for CPS-I, finding a significant dose-response for both initiation and promotion consistent with estimates using the 12-year CPS-I follow-up. Estimates for initiation using 6-year follow-up data for CPS-I are significantly larger than for the 6-year follow-up data for CPS-II as found in the analysis of the 12-year CPS-I follow-up data. We also analyzed 6-year CPS-II data for males of all races, including prevalent cancers, finding a nonsignificant dose-response for initiation (although a reduced effect of initiation is not excluded) consistent with the CPS-II analysis for White males, excluding prevalent cancers.

Smoking seems to cause significant and remarkably consistent increases in promotion and initiation as functions of dose in the contemporaneous CPS-I and British Doctors' cohorts. An influence on initiation is consistent with evidence for mutagenic effects of known carcinogens in tobacco, such as the nitrosamines and the polycyclic aromatic hydrocarbons, which are strong initiators (4). Some of these compounds also seem to be important as promoters (36-39). Other compounds in cigarette smoke condensate, such as catechol, are also co-carcinogens, affecting both initiation and promotion (4), but there seem to be many pure initiators and pure promoters. Promotion drives the clonal expansion of initiated cells that leads to recognizable premalignant lesions in lung cancer (38-40).

Promotion includes two components: net cell proliferation and cell division of initiated cells. An increase in the net cell proliferation rate of initiated cells is associated with higher cancer risk, as this is just the mean rate at which initiated clones expand. However, given a fixed net cell proliferation rate, a decrease in the cell division rate implies a lower extinction rate for initiated clones and thus a higher asymptotic risk at older ages. This occurs for the British Doctors who have a three to four times lower estimate for the background rate of cell division than the estimate for males in CPS-I and have correspondingly higher risks at older ages.

The effects of dose-response on initiation and promotion are highly synergistic in the CPS-I and British Doctors' cohorts (see Fig. 3S in the Appendix). With a dose-response only on initiation (at the rate estimated in the joint model), the excess risk would be very modest, rising slowly at older ages. The effect of dose-response only on promotion leads to a steep increase in risk around age 60 years followed by a plateau. The influence of promotion alone seems much more important than initiation, taken by itself. This is consistent with evidence cited above, indicating that the nitrosamines and the polycyclic aromatic hydrocarbons play major roles as promoters in carcinogenesis. Although the polycyclic aromatic hydrocarbons have been considered to be the main carcinogens in cigarettes, their concentration in cigarette condensate is far too low for the initiating effect to drive carcinogenesis (38). With dose-response affecting both initiation and promotion, the risk is increased much more than additively. This is consistent with the concept of the synergistic interaction between co-carcinogens, where initiation and promotion occur simultaneously (4).

As discussed in Materials and Methods, recruitment for CPS-I began in fall 1959, with most cohort members born between 1880 and 1919, whereas recruitment for CPS-II began in fall 1982 with most subjects born between 1900 and 1939. Over this period, there were significant changes in the availability of machine-manufactured cigarettes, cigarette composition, social norms, and other factors, such as nutrition, which could contribute to differences in the patterns of lung cancer mortality. Although the CPS-I and CPS-II studies used comparable techniques, there are some potentially important differences in regional coverage and follow-up. CPS-I enrolled subjects from 25 states, whereas CPS-II included subjects from all 50 states, the District of Columbia, Puerto Rico, and Guam. Reporting of smoking histories may differ across studies and calendar periods, possibly causing changes in variable estimates.

The models show qualitatively consistent changes between the earlier and the later cohorts for both males and females. Risks for a given smoking dose and duration seem to have increased, counter to what might be expected because of the introduction of low-tar and low-nicotine cigarettes. Indeed, the models indicate that initiation rates associated with smoking have significantly decreased, but increased promotion due to smoking has increased the overall risk.

The differences in dose-response may suggest an influence of changes in cigarette composition. An investigation of >170 types of cigarettes from the United States and Europe found no correlation between tar delivery and mainstream smoke concentration of nitrosamines (41). Tumor promoting activity, as measured by gap junctional intercellular communication, also does not closely reflect the tar yield of low-tar and high-tar cigarettes (42). Smokers of low-tar and low-nicotine cigarettes partially compensate by inhaling higher volumes of smoke (43). With enhanced smoke intake, smokers of low-tar and low-nicotine cigarettes may have elevated intake of nitrosamines and/or enhanced tumor promotion compared with smokers of higher nicotine cigarettes.

The apparent reduction in smoking-related initiation may also be associated with the properties of low-tar cigarettes. A study of mutagenicity of tobacco smoke from a series of 16 low-tar (1-10 mg tar) cigarettes showed significantly reduced mutagenicity (by ∼30%) by the Ames test for mainstream smoke from most of the low-tar cigarettes compared with high-tar (23 mg) cigarettes (44). Comparison of estimated initiation and promotion dose-response between CPS-I and CPS-II may be complicated by a shift from squamous cell carcinoma as the major subtype to increasing rates of adenocarcinoma.

Projection of Risk

The class of models used for data analyses in this article can be used to predict the risks of lung cancer in various subgroups. Because the models are fully parametric, it is clear how they can be used to project risks among nonsmokers and continuing smokers. It is less clear how the models can be used to predict risks among ex-smokers. This prediction depends on a property of the exact hazard functions generated by multistage models and which is not shared by the approximate solutions that have hitherto been widely used for analyses of data. When smokers quit, their lungs are no longer exposed to the initiating and promoting agents contained in tobacco smoke, and epidemiologic data indicate that lung cancer incidence rates among ex-smokers approach those among nonsmokers over a period of 15 to 20 years. Biologically, there are two possibilities, not mutually exclusive. (a) After exposure to the initiating and promoting agents in cigarette smokes stops, the rates of initiation and promotion revert to the background rates among nonsmokers and this reversion is sufficient to ultimately cause the incidence rates to revert to background rates. (b) Repair of damaged tissue occurs, for example, promoted lesions regress, and is responsible for the reversion of rates.

Previous analyses of data using multistage models by Freedman and Navidi have found that risk declines after smoking ceases (45). They suggested that repair or heterogeneity may underlie the declining risk but concluded that the first biological scenario above does not explain the decline in risk among ex-smokers. These analyses have relied, however, on approximate solutions to the hazard (incidence) functions. The exact solution, which has been used in this article and our other recent articles, makes entirely different predictions. It predicts that, after exposure to a carcinogen stops, the incidence rates approach the rates among the unexposed asymptotically. Although this result seems to be counterintuitive, a simple explanation is as follows. Because of the stochastic nature of the process of multistage carcinogenesis, there is considerable heterogeneity among any group of smokers with respect to the number and size distribution of initiated lesions. Some fortunate individuals may smoke for years without developing any initiated lesions. If the entire group now gives up smoking, the first individuals to develop lung cancer are those with the largest burden of initiated cells. Ultimately, over the course of many years, the only individuals remaining are those who, through great good fortune, had few or no initiated lesions as a result of their past smoking habits. These individuals are at the same risk as nonsmokers; therefore, the incidence rates now approach the rates among nonsmokers. The exact hazard function of multistage models captures this stochastic variation in risk among ex-smokers; the widely used approximations do not.

Promotion seems to be the dominant etiologic mechanism in smoking-related lung carcinogenesis. Significantly increased risk from cigarette smoking among both male and female smokers is seen in the CPS-II cohort when compared with the earlier CPS-I and British Doctors' cohorts. The increased risk is associated with significant changes in dose-response, including a significant increase in promotion, and significant decrease in initiation for a given smoking intensity. Intervention and treatment strategies may benefit from targeting promotion as a primary mechanism in lung carcinogenesis.

Figure S1: 

Figure S1:

British doctors male mortality rates with 20 years follow-up by: (A) dose and age, (B) duration and age. Solid line, maximum likelihood estimate from joint reduced model fit to CPS-I and British doctors, dashed lines, MCMC 95% CI. Circles show the ratio of observed lung cancer deaths to person years at risk in 5-year age bins, with bars based on Poisson assumptions.

Figure S1:

British doctors male mortality rates with 20 years follow-up by: (A) dose and age, (B) duration and age. Solid line, maximum likelihood estimate from joint reduced model fit to CPS-I and British doctors, dashed lines, MCMC 95% CI. Circles show the ratio of observed lung cancer deaths to person years at risk in 5-year age bins, with bars based on Poisson assumptions.

Close modal

Figure S2: 

Figure S2:

Estimates of the dose-response for (A) the net cell proliferation rate, and (B) initiation for the joint reduced model for white male CPS-I and British doctors (solid lines), and for CPS-II (dashed lines). The initiation and promotion rates corresponding to a given smoking rate are assumed to continue for the smoking duration. Separate optimization of the British doctors gives (marginally significant) different dose-response for net cell proliferation (dotted lines). The British doctors initiation rate underlies the joint fit in (B). The symmetric cell division rates are scaled using different background parameters for CPS-I and the British doctors, but share the same dose-response shape as for net cell proliferation. Gray shaded regions, MCMC 95% CI for net cell proliferation and initiation rates as a function of smoking rate, calculated using the full model for CPS-I and the British doctors cohorts. Bars are located at central points in the categorical dose ranges, at 0, 5, 15, 20, 30 and 42 cigarettes per day for CPS-I (solid bars), and at 0, 20, and 40 cigarettes per day for CPS-II (dashed bars), based on MCMC calculations.

Figure S2:

Estimates of the dose-response for (A) the net cell proliferation rate, and (B) initiation for the joint reduced model for white male CPS-I and British doctors (solid lines), and for CPS-II (dashed lines). The initiation and promotion rates corresponding to a given smoking rate are assumed to continue for the smoking duration. Separate optimization of the British doctors gives (marginally significant) different dose-response for net cell proliferation (dotted lines). The British doctors initiation rate underlies the joint fit in (B). The symmetric cell division rates are scaled using different background parameters for CPS-I and the British doctors, but share the same dose-response shape as for net cell proliferation. Gray shaded regions, MCMC 95% CI for net cell proliferation and initiation rates as a function of smoking rate, calculated using the full model for CPS-I and the British doctors cohorts. Bars are located at central points in the categorical dose ranges, at 0, 5, 15, 20, 30 and 42 cigarettes per day for CPS-I (solid bars), and at 0, 20, and 40 cigarettes per day for CPS-II (dashed bars), based on MCMC calculations.

Close modal

Figure S3: 

Figure S3:

Contribution to hazard from background (solid lines), and dose-response on initiation (dotted lines), promotion (dashed lines), or both processes (dot-dash lines), using the joint reduced model for white male CPS-I (thick lines) and the British doctors (thin lines) cohorts. The hazards are plotted for a hypothetical White male from CPS-I or British doctors' cohort smoking 40 cigarettes per day beginning at age 15.

Figure S3:

Contribution to hazard from background (solid lines), and dose-response on initiation (dotted lines), promotion (dashed lines), or both processes (dot-dash lines), using the joint reduced model for white male CPS-I (thick lines) and the British doctors (thin lines) cohorts. The hazards are plotted for a hypothetical White male from CPS-I or British doctors' cohort smoking 40 cigarettes per day beginning at age 15.

Close modal

Figure S4: 

Figure S4:

Comparison of joint reduced model for White male CPS-I and British doctors with model for White male CPS-II data with 6 years follow-up. (A and B) show rates aggregated by dose and age, and smoking duration and age, respectively. Dotted lines show the prediction of the final CPS-I and British doctors smoking model. Dashed line shows predictions using the same dose-response parameters as above, but with re-optimized background parameters. Solid line shows the predictions of a fully re-optimized final model fit to the CPS-II data.

Figure S4:

Comparison of joint reduced model for White male CPS-I and British doctors with model for White male CPS-II data with 6 years follow-up. (A and B) show rates aggregated by dose and age, and smoking duration and age, respectively. Dotted lines show the prediction of the final CPS-I and British doctors smoking model. Dashed line shows predictions using the same dose-response parameters as above, but with re-optimized background parameters. Solid line shows the predictions of a fully re-optimized final model fit to the CPS-II data.

Close modal

Grant support: National Cancer Institute through a Cancer Intervention and Surveillance Modeling Network grant U01 CA097415 and Australian National Health and Medical Research Council postdoctoral fellowship (M.S. Clements).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

We thank Dr. Rob Boer and the Cancer Intervention and Surveillance Modeling Network Lung Cancer Group for useful suggestions.

1
Thun MJ, Henley SJ, Calle EE. Tobacco use and cancer: an epidemiologic perspective for geneticists.
Oncogene
2002
;
21
:
7307
–25.
2
Burns DM, Benowitz NL. Risks associated with smoking cigarettes with low machine-measured yields of tar and nicotine. Smoking and Tobacco Control, Monograph 13, NIH Publ No. 02-5047; 2001 Feb.
3
U.S. Department of Health and Human Services. The health benefits of smoking cessation: a report of the Surgeon General. Rockville (MD): U.S. Department of Health and Human Services, Public Health Service, Centers for Disease Control, Center for Chronic Disease Prevention and Health Promotion, Office on Smoking and Health; 1989.
4
Rubin H. Synergistic mechanisms in carcinogenesis by polycyclic aromatic hydrocarbons and by tobacco smoke: a bio-historical perspective with updates.
Carcinogenesis
2001
;
22
:
1903
–30.
5
Hoffmann D, Djordjevic MV, Hoffmann I. The changing cigarette.
Prev Med
1997
;
26
:
427
–34.
6
Gail MH, Brinton LA, Byar DP, et al. Projecting individualized probabilities of developing breast cancer for White females who are being examined annually.
J Natl Cancer Inst
1989
;
81
:
1879
–86.
7
Burns DM, Garfinkel L, Samet JM. Changes in cigarette related disease risks and their implication for prevention and control. Smoking and Tobacco Control, Monograph 8, NIH Publ No. 97-4213; 1997 Feb.
8
Doll R, Peto R. Cigarette smoking and bronchial carcinoma: dose rate and time relationships among regular smokers and life-long nonsmokers.
J Epidemiol Community Health
1978
;
32
:
303
–13.
9
Doll R, Peto R, Boreham J, Sutherland I. Mortality in relation to smoking: 50 years' observations on male British Doctors.
Br Med J
2004
;
328
:
1519
.
10
Moolgavkar SH, Dewanji A, Luebeck G. Cigarette smoking and lung cancer: a reanalysis of the British Doctors' data.
J Natl Cancer Inst
1989
;
81
:
415
–20.
11
Moolgavkar SH, Knudson AG. Mutation and cancer: a model for human carcinogenesis.
J Natl Cancer Inst
1981
;
66
:
1037
–52.
12
Moolgavkar SH, Luebeck EG. Two-event model for carcinogenesis: biological, mathematical and statistical considerations.
Risk Anal
1990
;
10
:
323
–41.
13
Luebeck EG, Moolgavkar SH. Multistage carcinogenesis and the incidence of colorectal cancer.
Proc Natl Acad Sci U S A
2002
;
99
:
15095
–100.
14
Moolgavkar SH, Dewanji A, Venzon DJ. A stochastic two-stage model for cancer risk assessment I: the hazard function and the probability of tumor.
Risk Anal
1988
;
8
:
383
–92.
15
Moolgavkar SH, Venzon DJ. Two-event models for carcinogenesis: incidence curves for childhood and adult tumors.
Math Biosci
1979
;
47
:
55
–77.
16
Heidenreich WF, Luebeck EG, Moolgavkar SH. Some properties of the hazard function of the two-mutation clonal expansion model.
Risk Anal
1997
;
17
:
391
–9.
17
Moolgavkar SH. Commentary: Fifty years of the multistage model: remarks on a landmark paper [advanced access published 2004 Aug 19]. Int J Epidemiol.
18
Moolgavkar SH, Krewski D, Schwarz M. Mechanisms of carcinogenesis and biologically based models for estimation and prediction of risk.
IARC Sci Publ
1999
;
131
:
179
–237.
19
Moolgavkar SH, Luebeck EG. Multistage carcinogenesis and the incidence of human cancer.
Genes Chromosomes Cancer
2003
;
38
:
302
–6.
20
Hazelton WD, Luebeck EG, Heidenreich WF, Moolgavkar SH. Analysis of a cohort of Chinese tin miners with arsenic, radon, cigarette and pipe smoke exposures using the biologically-based two-stage clonal expansion model.
Radiat Res
2001
;
156
:
78
–94.
21
National Center for Health Statistics. U.S. decennial life tables for 1969-71; 1989-91. Vol. 1, No. 1. U.S. Life Tables (PHS) 75-1150; 97-1150-1 [accessed 2004 May 10]. Hyattsville (MD): Public Health Service. Available at: http://www.cdc.gov/nchs/products/pubs/pubd/lftbls/lftbls.htm.
22
Knoke JD, Shanks TG, Vauughn JW, Thun MJ, Burns DM. Lung cancer mortality is related to age in addition to duration and intensity of cigarette smoking: an analysis of CPS-I data.
Cancer Epidemiol Biomarkers Prev
2004
;
13
:
949
–57.
23
Peto R, Darby S, Deo H, Silcocks P, Whitley E, Doll R. Smoking, smoking cessation, and lung cancer in the UK since 1950: combination of national statistics with two case-control studies.
BMJ
2000
;
321
:
323
–9.
24
Bain C, Feskanich D, Speizer FE, et al. Lung cancer rates in men and women with comparable histories of smoking.
J Natl Cancer Inst
2004
;
96
:
826
–34.
25
Bach PB, Kattan MW, Thornquist MD, et al. Variations in lung cancer risk among smokers.
J Natl Cancer Inst
2003
;
95
:
470
–8.
26
Hazelton WD, Moolgavkar SH, Curtis SB, Zielinski JM, Ashmore JP, Krewski D. Biologically based analysis of lung cancer incidence in a large Canadian occupational cohort with low-dose ionizing radiation exposure, and comparison with Japanese atomic bomb survivors. J Toxicol Environ Health A. In press.
27
Whittemore AS. The age distribution of human cancer for carcinogenic exposure of varying intensity.
Am J Epidemiol
1977
;
106
:
418
–32.
28
Day NE, Brown CC. Multistage models and the primary prevention of cancer.
J Natl Cancer Inst
1980
;
64
:
977
–89.
29
Brown CC, Chu KC. Use of multistage models to infer stage affected by carcinogenic exposure: example of lung cancer and cigarette smoking.
J Chronic Dis
1987
;
40
:
171
–9S.
30
Armitage P, Doll R. The age distribution of cancer and a multistage theory of carcinogenesis.
Br J Cancer
1954
;
8
:
1
–12.
31
Lee PN. Studying the relationship of smoking to lung cancer using the multistage model of carcinogenesis: a review. 1995. Available at: http://www.pnlee.co.uk/reflist.htm.
32
Clements MS. Lung cancer mortality prediction using multi-stage population smoking models [thesis for Doctor of Philosophy]. University of Sydney; 2001.
33
Heidenreich WF, Luebeck EG, Hazelton WD, Paretzke HG, Moolgavkar SH. Multistage models and the incidence of cancer in the cohort of atomic bomb survivors.
Radiat Res
2002
;
158
:
607
–14.
34
Flanders WD, Lally CA, Zhu B-P, Henley SJ, Thun MJ. Lung cancer mortality in relation to age, duration of smoking, and daily cigarette consumption: results from Cancer Prevention Study II.
Cancer Res
2003
;
63
:
6556
–62.
35
Heidenreich WF, Wellman J, Jacob P, Wichmann HE. Mechanistic modeling in large case control studies of lung cancer risk from smoking.
Stat Med
2002
;
21
:
3055
–70.
36
West LA, Brognard J, Clark AS, et al. Rapid Akt activation by nicotine and a tobacco carcinogen modulates the phenotype of normal human epithelial cells.
J Clin Invest
2003
;
111
:
81
–90.
37
Schuller HM, Tithof PK, Williams M, Plummer H III. The tobacco-specific carcinogen 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone is a β-adrenergic agonist and stimulates DNA synthesis in lung adenocarcinoma via β-adrenergic receptor-mediated release of arachidonic acid.
Cancer Res
1991
;
59
:
4510
–5.
38
Rubin H. Selective clonal expansion and microenvironmental permissiveness in tobacco carcinogenesis.
Oncogene
2002
;
21
:
7392
–411.
39
Melikian AA, Hecht SS, Hoffmann D. Mechanistic studies of tobacco carcinogenesis in mouse epidermis and lung tissues.
Prog Clin Biol Res
1989
;
298
:
331
–45.
40
Moro-Sibilot D, Jeanmart M, Lantuejoul S, et al. Cigarette smoking preinvasive bronchial lesions, and autofluorescence bronchoscopy.
Chest
2002
;
122
:
1902
–8.
41
Fischer S, Spiegelhalder B, Preussmann R. Tobacco-specific nitrosamines in commercial cigarettes: possibilities for reducing exposure. IARC Sci Publ 1991;489–92.
42
Vang O, Wallin H, Autrup H. Inhibition of intracellular communication by condensates of high and low tar cigarettes.
Arch Toxicol
1995
;
69
:
415
–20.
43
Frost C, Fullerton FM, Stephen AM, et al. The tar reduction study: randomized trial of the effect of cigarette tar yield reduction on compensatory smoking.
Thorax
1995
;
50
:
1038
–43.
44
Chortyk OT, Chamberlain WJ. A study of the mutagenicity of tobacco smoke from low-tar cigarettes.
Arch Environ Health
1990
;
45
:
237
–44.
45
Freedman DA, Navidi WC. Ex-smokers and the multistage model for lung cancer.
Epidemiology
1990
;
1
:
21
–9.