Background: China's lung cancer crude death rate has increased 6.9-fold from 1973 to 2014. During this time, the country experienced extremely rapid economic growth and social change. It is important to understand the effects of risk factors on lung cancer mortality (LCM) for better allocation of limited resources of cancer prevention and control in China.

Methods: Using three nationwide mortality surveys from 1973 to 2005, Global Health Data Exchange data in 2013, three nationwide smoking surveys from 1984 to 2013, four population censuses from 1964 to 2000, and other datasets, we have compiled datasets and developed spatial random effect models to assess the association of various area-level–contributing factors on LCM. Spatial scan statistics are used to detect high-risk clusters of LCM.

Results: LCM is higher in urban and more industrialized areas (RR = 1.17) compared with those in rural areas. The level of industrial development's effect is higher for men, which accounts for about 70% of all LCM. Smoking is positively associated with regional variation of LCM rates, and the effect is higher for women than for men.

Conclusions: The geographic pattern of high LCM in China is different from that of Western countries. LCM is positively associated with higher socioeconomic status, with more urbanized areas at a higher level of industrial development.

Impact: There is a need to further explore additional risk in the high-risk clusters. The study is about China, but this situation may happen in other countries experiencing rapid industrialization and other developing countries. Cancer Epidemiol Biomarkers Prev; 26(4); 569–77. ©2017 AACR.

See all the articles in this CEBP Focus section, “Geospatial Approaches to Cancer Control and Population Sciences.”

Lung cancer is the leading cause of cancer-related death in China and has had the fastest increase in death rate among all cancers in the past 40 years. The crude death rate has increased 6.9-fold from 5.46 (1973–1975) to 43.03 per 100,000 and led to approximately 588,500 deaths in 2014 (1–3). Lung cancer accounted for a higher percentage of cancer-related deaths in China (26.8%) when compared with the world average (20%) in 2013 (4, 5). It was reported that the sharp increase of lung cancer mortality (LCM) for both men and women is clearly out of proportion to the population increase and is the least affected by demographic factors among all cancers in China (6, 7).

The regional disparity of LCM across China is very significant and ranges widely, from the highest rate of 66.3 in Liaoning to the lowest rate of 2.9 per 100,000 in Tibet in 2013 (8). Many factors, including cigarette smoking, exposure to carcinogens, environmental pollution, and socioeconomic factors, may directly or indirectly affect LCM, as well as the spatial variation of the rates. In this study, we aim to elucidate the most significant factors that affect the geographical variation of LCM rates in China.

Studies have shown that smoking is the main cause of LCM responsible for approximately 90% of all LCM. (4, 9–12) Researchers have also reported significant and increasing attributable risk of LCM associated with smoking in China (6, 12–16). Although 51.1% to 62.8% of Chinese men and 2.4% to 7% of women were smokers in 1984 to 2013 (13, 14), data show that smoking rate has been 3% to 15.2% lower in urban areas (13, 15, 16), but the LCM rate has been consistently 4.2 to 15.3 per 100,000 higher (17). Hence, in addition to smoking, this project aims to find other regional factors that may have significant contributions to the regional variation of LCM. Exposure to carcinogens has been recognized as one of the causes of lung cancer (9, 18, 19), such as metal mining and asbestos fibers (20–24), as well as household air pollution (2, 25–27). However, the effects are specific to a group of people in specific locations. Air pollution can cause lung cancer (28–32). The International Agency for Research on Cancer (Lyon, France) states that exposures to outdoor air pollution are linked to increased cancer risk in humans (33) and that the top three causes of air pollution are transportation, stationary power generation, and industrial emission (34). In contrast with postindustrial Western countries, China has experienced rapid industrialization since the 1950s. The sharp increase of energy consumption, transportation, and construction has resulted in serious air pollution and frequent smog problems in major cities (35, 36).

Regional socioeconomic status (SES) and its relationship to cancer is a relatively new field (37). Singh and colleagues (38) found that the relationship of SES and lung cancer reversed for men, higher male LCM in higher SES areas in the 1950s and lower LCM among the higher SES areas in 1998. Gomez and colleagues (37) reviewed 34 articles that studied the impact of neighborhood SES factors on cancer. They found that numerous associations of varying directions and magnitudes have been reported. European studies have demonstrated similar results in recent years although Vanthomme and colleagues reported that LCM was previously higher in high SES groups (39–41). Although SES may not be a direct determinant of LCM, evidence shows that it can be “highly correlated with differences in outcome and related to unknown environmental influences that cause cancer” (41, 42).

Small regional or case–control analyses have been performed for the attributable risk factors of lung cancer (43, 44); few in-depth studies have been conducted on the relationship between comprehensive regional factors and spatial disparity of LCM in a developing country. Study over an extended time period is rather scarce in a developing country, likely due to limited data availability.

Using the largest datasets from three nationwide retrospective mortality surveys from 1973 to 2005, three nationwide smoking surveys from 1984 to 2013, the Global Health Data Exchange in 2013, censuses from 1964 to 2000, and SES data, this research investigates the relationship between area factors and the spatial variation of LCM in men and women at both provincial and county levels over the past 40 years in China. The results offer policy implications on lung cancer control, prevention, and intervention not only in China but also in other developing countries.

Data

Outcome measure.

This research uses China's two nationwide retrospective cancer mortality surveys (1973–1975 and 1990–1992), the causes of death survey (2004–2005), and the 2013 mortality data (8). Table 1 lists the details of the first three surveys. In addition to the 1973–1975 survey that covered 96.7% of 2,473 counties in China (45), the 1990–1992 and 2004–2005 surveys selected 263 and 158 counties, respectively, using a multistage stratified cluster random sampling method (17, 46, 47). Permanent residents of the selected counties, 848.7 million in 1973–1975, 111.7 million in 1990–1992, and 71.4 million in 2004–2005, were surveyed. To ensure accuracy and completeness, each of the surveys used several uniform questionnaires and organized advisory and quality control groups at the county, provincial, and national levels (17, 46, 48). All 241,650 lung cancer deaths were recorded according to the International Classification of Diseases (ICD) codes and verified by trained survey personnel or medical specialists in the advisory and quality control groups. The LCM data used in this research have the underlying cause of death as “Trachea, bronchus, lung cancers” of the ICD-10 C33-C34 for 2004 to 2005 and ICD-9 162 in the two previous surveys (17, 46, 48).

Table 1.

Summary of China's three retrospective mortality surveys, 1973–2005

Survey1973–19751990–19922004–2005
Years covered 
Percentage of population surveyed 96.7% 10% 6% 
Average annual population surveyed 849 million 112 million 71.4 million 
Number of sampling sites (county or city)a 2,392 263 158 
Total number of lung cancer deaths among surveyed population 139,011 58,646 43,993 
Cancer death diagnosed at or above town/township level hospital 92.54% 97.4% 98.54% 
Cancer death diagnosed at or above county level hospital 79.15% 91% 93.85% 
Antemortem diagnosis was made by method Level I or Level IIb 70.5% 89.6% 85.16% 
Result of spot rechecks on mistakes in diagnosis and missing cases About 5% <5% <5% 
Total lung cancer crude death rate (per 100,000 person-year) 5.46 17.54 30.83 
County-level lung cancer crude death rate (per 100,000 person-year) 
 Minimum: male/female 0/0 2.15/0 0/0 
 Median: male/female 4.67/2.28 19.15/8.25 35.45/15.14 
 Mean: male/female 6.06/3.02 22.76/10.25 39.40/18.30 
 Maximum: male/female 73.21/23.32 93.95/41.80 98.24/83.46 
Survey1973–19751990–19922004–2005
Years covered 
Percentage of population surveyed 96.7% 10% 6% 
Average annual population surveyed 849 million 112 million 71.4 million 
Number of sampling sites (county or city)a 2,392 263 158 
Total number of lung cancer deaths among surveyed population 139,011 58,646 43,993 
Cancer death diagnosed at or above town/township level hospital 92.54% 97.4% 98.54% 
Cancer death diagnosed at or above county level hospital 79.15% 91% 93.85% 
Antemortem diagnosis was made by method Level I or Level IIb 70.5% 89.6% 85.16% 
Result of spot rechecks on mistakes in diagnosis and missing cases About 5% <5% <5% 
Total lung cancer crude death rate (per 100,000 person-year) 5.46 17.54 30.83 
County-level lung cancer crude death rate (per 100,000 person-year) 
 Minimum: male/female 0/0 2.15/0 0/0 
 Median: male/female 4.67/2.28 19.15/8.25 35.45/15.14 
 Mean: male/female 6.06/3.02 22.76/10.25 39.40/18.30 
 Maximum: male/female 73.21/23.32 93.95/41.80 98.24/83.46 

NOTE: Data source: refs. 45–48.

aThe surveys were carried out in all 29 provinces, autonomous regions, and municipalities of mainland China in 1973–1975, 27 provincial-level regions (including the addition of Hainan province) excluding Tibet Autonomous Region, Xinjiang Uygur Autonomous Region, and Qinghai province in 1990–1992, and all 31 provincial-level regions (including the addition of Chongqing municipality) in 2004–2005.

bLevel I, diagnosis was made by pathologic or cytologic examination of tissue specimens, including bone marrow; Level II, diagnosis was made by X-ray, ultrasound, isotope, scans, endoscopy, exploratory laparotomy (without biopsy), immunologic markers, or chemical laboratory tests.

Covariates.

To assess the impact of smoking on LCM, smoking prevalence rates were extracted from two nationwide smoking surveys in 1984 and 1996. The 1984 survey included 519,600 people ages 15 and older (random sample of 5/10,000) in 29 provinces in China (15). The 1996 survey (16) sampled the population in 145 sites of the Disease Surveillance Points System in 30 provinces. More than 1,000 households per surveillance site at the county level were surveyed by a randomized 3-stage sampling of households.

The province-level socioeconomic data are obtained from the 1986 and 1997 China Statistical Yearbook to match the years of the smoking surveys in 1984 and 1996 (49, 50). County-level socioeconomic measures come from the census data, which provide the most accurate county-level data in China. Details on the socioeconomic data are presented in the Supplementary Material and Supplementary Table S1.

Multiple linear regression analysis at the provincial level

The provincial-level analysis includes 14 variables in the following categories: income, employment, education, medical resources, smoking prevalence, and pollution (Supplementary Table S1). An initial correlation analysis (Supplementary Table S2) is performed to screen for variables that may have strong correlation with LCM rates. Area factors highly related to LCM with low collinearity are selected and included in the next step of analysis. A multiple linear regression is performed to estimate the effects of the area factors on LCM rates. To allow for long lag period between exposure to LCM, the 1990–1992 LCM rates are regressed on the 1984–1985 area factors, and rates for 2004–2005 and 2013 are regressed on the 1984–1985 and 1996 area factors. Among the factors of interest, those identified as highly correlated to LCM are included in the multiple linear regression.

As most of the provincial-level regions in China are very large in both population and area, the provincial-level analysis may have covered a lot of variations at finer resolution. With the available county-level data from the mortality surveys and censuses, the county-level analyses were conducted in more details.

Poisson regression and cluster analysis at the county level

The number of deaths due to lung cancer is available by sex and county for each of the three mortality surveys. The number of deaths is assumed to follow a Poisson probability distribution, with expected deaths modeled as a function of population, the covariates, and spatial random effects between counties. Detail on the Poisson regression model is presented in Supplementary Material.

Results from the Poisson regression model explain the impact of each contributing area factor to LCM. What cannot be attributed to the contributing factors is revealed via residuals. Further investigation into the residuals points to areas where the contributing factors cannot fully explain the elevated LCM rates. A cluster analysis is performed on the original death counts (without model adjustment), as well as on the model-adjusted counts using the spatial scan statistics in the freely available software SaTScanTM (51). Details are presented in Supplementary Material.

Figure 1 maps the crude LCM rate (deaths per 100,000 people) by county, sex, and survey year. For each of the three surveys, male rate is higher than female rate. Male LCM accounts for about 70% of all LCM in the three surveys. High rates tend to be clustered around major cities like Beijing and Shanghai.

Figure 1.

LCM crude rate by county, sex, and survey year.

Figure 1.

LCM crude rate by county, sex, and survey year.

Close modal

The provincial level stepwise model selection identifies factors that are statistically significant (P < 0.05) on LCM (Table 2). Results show that female smoking prevalence is positively associated with female LCM rate in all the analyses; male smoking has no significant effect on male LCM. Industrial development, GDP level, solid and water pollution, and literacy level are all positively linked to LCM. The early measure of density of medical personnel is positively linked to LCM, and the later measure is negatively linked to LCM. The adjusted R2 ranges between 0.344 and 0.855 in the provincial regression models.

Table 2.

Results of multiple linear regression (point estimate and 95% CI) of province-level area factors (1984–1996) on LCM rates (1990–2013)

1990–1992 lung cancer death rate (26 regions)2004–2005 lung cancer death rate (29 regions)2013 lung cancer death rate (29 regions)
MaleFemaleMaleFemaleMaleFemale
Area level factora and data sourceSlope (95% CI)Slope (95% CI)Slope (95% CI)Slope (95% CI)Slope (95% CI)Slope (95% CI)
1984–1985 China Statistical Yearbook 
 % Female smoker  0.81b (0.60–1.02)  1.17b (0.78–1.56)  0.68b (0.32–1.05) 
 % Ind. labor 1.39c (0.51–2.26)  1.83b (1.35–2.31)    
 % Serv. labor −2.76a (−4.08 to −1.44)      
 Cons. p.c. 0.08c (0.03–0.136)      
 GDP p.c.  0.007b (0.004–0.009)     
 Med. persn.   1.05b (0.55–1.55) 0.45c (0.13–0.76) 1.98b (1.31–2.65) 0.66b (0.40–0.91) 
 Waste water p.c.    0.12c (0.05–0.19)   
 Solid waste p.c.     13.01c (3.27–22.75) 6.52c (1.98–11.08) 
 Adj. R2 0.786 0.855 0.736 0.736 0.649 0.768 
1996 China Statistical Yearbook 
 % Female smoker    1.08c (0.25–1.92)   
 % Ind. labor   3.42b (2.59–4.26) 1.38b (0.91–1.86)   
 Med. persn.   −3.34c (−5.96 to −0.72)  −3.83c (−7.29 to −0.38)  
 Literacy     1.23b (0.71–1.74) 0.47b (0.23–0.71) 
 Adj. R2   0.748 0.608 0.430 0.344 
1990–1992 lung cancer death rate (26 regions)2004–2005 lung cancer death rate (29 regions)2013 lung cancer death rate (29 regions)
MaleFemaleMaleFemaleMaleFemale
Area level factora and data sourceSlope (95% CI)Slope (95% CI)Slope (95% CI)Slope (95% CI)Slope (95% CI)Slope (95% CI)
1984–1985 China Statistical Yearbook 
 % Female smoker  0.81b (0.60–1.02)  1.17b (0.78–1.56)  0.68b (0.32–1.05) 
 % Ind. labor 1.39c (0.51–2.26)  1.83b (1.35–2.31)    
 % Serv. labor −2.76a (−4.08 to −1.44)      
 Cons. p.c. 0.08c (0.03–0.136)      
 GDP p.c.  0.007b (0.004–0.009)     
 Med. persn.   1.05b (0.55–1.55) 0.45c (0.13–0.76) 1.98b (1.31–2.65) 0.66b (0.40–0.91) 
 Waste water p.c.    0.12c (0.05–0.19)   
 Solid waste p.c.     13.01c (3.27–22.75) 6.52c (1.98–11.08) 
 Adj. R2 0.786 0.855 0.736 0.736 0.649 0.768 
1996 China Statistical Yearbook 
 % Female smoker    1.08c (0.25–1.92)   
 % Ind. labor   3.42b (2.59–4.26) 1.38b (0.91–1.86)   
 Med. persn.   −3.34c (−5.96 to −0.72)  −3.83c (−7.29 to −0.38)  
 Literacy     1.23b (0.71–1.74) 0.47b (0.23–0.71) 
 Adj. R2   0.748 0.608 0.430 0.344 

a% of active female smokers among female population (% female smoker), % of industry labor force in population (% ind. labor), % of service labor force in population (% serv. labor), consumption per capita (cons p.c.), GDP per capita (GDP p.c.), medical personnel per 1,000 persons (med. persn.), % of literate people in population 12 years and over (literacy), and solid waste and waste water per capita (waste water p.c. and solid waste p.c.).

bP < 0.001.

c0.001 < P < 0.05.

Table 3 summarizes the results from the county level Poisson regression. Death count by county and sex is modeled with regional risk factors prior to the mortality survey year. For the 1973–1975 mortality, percentage of nonagriculture population has an increased risk of LCM for both males and females. On average, there was a 1% increase in LCM RR for every one percentage point increase of nonagriculture population for both sexes. The same factor was also positively correlated with LCM throughout the two later surveys for both sexes. During the 1990–1992 survey, LCM increased for urban areas, with higher high school education attainment level, percentage of industry labors, and smoking prevalence. Male LCM has more significant contributing factors when compared with female LCM. The men in urban areas have a LCM risk 17% higher than their counterparts in rural areas, whereas the risk for women is not significantly different between urban and rural areas. Smoking increases LCM rates in a region for both male and female, and the impact on female is higher than that on male. For every 1% point increase in smoking prevalence, male lung cancer RR increases 1% and female lung cancer RR increases 4%. Both the percentage of nonagriculture population in 1964 and the percentage of industry labor in 1990 were positively and significantly correlated with LCM in later years, but the 1982 measure of industry labor is not significant for either male or female data. The impact of percentage industry labor on LCM is slightly higher among males than females, with an estimated RR = 1.028 and the 95% confidence interval (CI), 1.013–1.042 for males, compared with RR = 1.015 and the 95% CI, 1.004–1.026 for females. During the 2004–2005 survey, counties were grouped as in the Eastern, Central, or Western regions in China (see Supplementary Fig. S1). Compared with Western China, Eastern China has higher RR of 1.29 (95% CI, 1.07–1.56) for male LCM and 1.21 (95% CI, 0.97–1.51) for female LCM. Central China did not show a significantly higher risk than Western China. Other factors that contributed to increased LCM risk were economic development level (measured by % of nonagriculture population in the 1964 census or % of industry labor in the later censuses) and higher smoking prevalence. Again, effect of smoking on regional variation of LCM is higher among women than among men (RR = 1.04 for female vs. RR = 1.009 for male). Across the three surveys, the pseudo R2 ranges between 0.68 and 0.92, indicating a reasonably good model fit for the mixed effect model in the Poisson regression.

Table 3.

Estimated RR (point and 95% CI) on effect of area-level risk factors on lung cancer deaths by county, sex, and year, China

1973–1975 (n = 2,110)1990–1992 (n = 241)2004–2005 (n = 158)
MaleFemaleMaleFemaleMaleFemale
Area-level factor and data sourcePoint (95% CI)Point (95% CI)Point (95% CI)Point (95% CI)Point (95% CI)Point (95% CI)
1964 Census % Nonagr pop 1.013a(1.012–1.013) 1.012a (1.012–1.013) 1.005b (1.001–1.009) 1.007a (1.004–1.010) 1.004c (1.000–1.007) 1.006a (1.003–1.009) 
1982 Census Urban   1.174c (0.982–1.402) NSd (NS) NS (NS) NS (NS) 
 HS edu   1.012c (0.999–1.025) NS (NS) NS (NS) NS (NS) 
 % Ind. labor   NS (NS) NS (NS) 1.011b (1.001–1.021) NS (NS) 
1984 Smoking Survey Smoking   1.013b (1.002–1.023) 1.042a (1.033–1.052) 1.009c (0.998–1.021) 1.040a (1.023–1.056) 
1990 Census % Ind. labor   1.028a (1.013–1.042) 1.015b (1.004–1.026) NS (NS) NS (NS) 
2004 Mortality Survey East     1.291a (1.067–1.563) 1.212c (0.972–1.512) 
Pseudo R2  0.728 0.676 0.887 0.924 0.881 0.904 
1973–1975 (n = 2,110)1990–1992 (n = 241)2004–2005 (n = 158)
MaleFemaleMaleFemaleMaleFemale
Area-level factor and data sourcePoint (95% CI)Point (95% CI)Point (95% CI)Point (95% CI)Point (95% CI)Point (95% CI)
1964 Census % Nonagr pop 1.013a(1.012–1.013) 1.012a (1.012–1.013) 1.005b (1.001–1.009) 1.007a (1.004–1.010) 1.004c (1.000–1.007) 1.006a (1.003–1.009) 
1982 Census Urban   1.174c (0.982–1.402) NSd (NS) NS (NS) NS (NS) 
 HS edu   1.012c (0.999–1.025) NS (NS) NS (NS) NS (NS) 
 % Ind. labor   NS (NS) NS (NS) 1.011b (1.001–1.021) NS (NS) 
1984 Smoking Survey Smoking   1.013b (1.002–1.023) 1.042a (1.033–1.052) 1.009c (0.998–1.021) 1.040a (1.023–1.056) 
1990 Census % Ind. labor   1.028a (1.013–1.042) 1.015b (1.004–1.026) NS (NS) NS (NS) 
2004 Mortality Survey East     1.291a (1.067–1.563) 1.212c (0.972–1.512) 
Pseudo R2  0.728 0.676 0.887 0.924 0.881 0.904 

Abbreviations: HS edu, high school population; ind. labor, industrial labor; nonagr pop, nonagriculture population.

aP < 0.001.

b0.001 < P < 0.05.

c0.05 < P < 0.1.

dNS, P > 0.1.

Figure 2 maps the clusters of high LCM risk without and with the Poisson regression model for the three sources of survey data. These are all statistically significant clusters with P < 0.05. Table 4 summarizes the male and female clusters in the 1973–1975 survey. Without model adjustment, five areas are identified as high-risk clusters for male LCM (Fig. 2, top left). They are listed as Clusters #1 through #5 with descending RR. Shanghai and the surrounding area have the highest RR of 3.59. The other clusters are all in major cities. With model adjustment that considers the percentage of nonagriculture population in the 1964 census, Cluster #1 is split into two, one smaller and more concentrated in the Shanghai area and the other “spilled” over to the neighboring Jiangxi Province (see Supplementary Fig. S1) in the more rural areas. A possible explanation is that although the crude mortality rate is not high in Jiangxi, when considering the percentage of the nonagriculture population, the lung cancer risk in Jiangxi is about 33% higher than the areas not in the clusters. Similarly, “shift” or “spillover” is observed in other clusters except for Cluster #3. The RR in the original Cluster #3 is 1.92. After model adjustment, the cluster is split into two smaller ones, both with smaller RRs. For the female LCM data (Fig. 2, bottom left), 7 clusters are identified without the model adjustment. With model adjustment, one cluster (#5) disappears as it is mainly associated with the percentage of nonagriculture population. Three new clusters (#8–#10) appear as the percentage of nonagriculture population is not high enough to explain the elevated female LCM in those areas. It is also worth noting that the RR gets smaller after model adjustment for most of the clusters, and the remaining RR would be the result of a model for additional risk factors not considered already.

Figure 2.

Sigificant high-risk lung cancer mortality clusters detected without and with model adjustment, by sex and survey year.

Figure 2.

Sigificant high-risk lung cancer mortality clusters detected without and with model adjustment, by sex and survey year.

Close modal
Table 4.

High-risk clusters for male and female LCM, 1973–1975, without and with model adjustment (columns are cluster #, major provinces, observed death count, expected death count, and RR)

Without model adjustmentWith model adjustment
Cluster (# of counties)Major provinceDeathExpectRRCluster (#of counties)Major provinceDeathExpectRR
Male 
 1 (42) Shanghai 4,157 1,307 3.59 1-A (16) Shanghai 2,262 1,052 2.26 
     1-B (88) Zhejiang, Jiangxi 1,106 843 1.33 
 2 (2) Guangdong 227 106 2.15 2 (13) Guangdong 439 328 1.34 
 3 (85) Liaoning, Jilin 2,821 1,549 1.92 3-A (10) Liaoning 470 340 1.39 
     3-B (10) Liaoning, Jilin 225 154 1.47 
 4 (89) Beijing, Tianjin, Shandong, Hebei 2,694 1,550 1.82 4 (3) Shandong 1,891 1,044 1.87 
 5 (122) Shanxi, Henan 2,092 1,523 1.41 5-A (78)  1,344 1,023 1.33 
     5-B (88)  1,106 843 1.33 
Female 
 1 (17) Shanghai 1,293 404 3.45 1 (30) Shanghai 1,212 704 1.80 
 2 (58) Tianjin 1,216 503 2.57 2 (73) Tianjin, Hebei 765 569 1.37 
 3 (52) Liaoning, Shandong 1,105 507 2.29 3 (10) Liaoning 205 134 1.54 
 4 (16) Shandong 353 173 2.07 4 (59) Shandong 1,006 535 1.96 
 5 (26) Jilin 503 254 2.02      
 6 (89) Beijing, Hebei 785 497 1.62 6 (73) Beijing, Hebei 765 569 1.37 
 7 (33) Zhejiang 411 299 1.39 7 (33) Zhejiang 392 267 1.48 
     8 (2) Chongqing 138 61 2.28 
     9 (7) Guangdong 192 111 1.74 
     10 (29) Shaanxi 168 84 2.01 
Without model adjustmentWith model adjustment
Cluster (# of counties)Major provinceDeathExpectRRCluster (#of counties)Major provinceDeathExpectRR
Male 
 1 (42) Shanghai 4,157 1,307 3.59 1-A (16) Shanghai 2,262 1,052 2.26 
     1-B (88) Zhejiang, Jiangxi 1,106 843 1.33 
 2 (2) Guangdong 227 106 2.15 2 (13) Guangdong 439 328 1.34 
 3 (85) Liaoning, Jilin 2,821 1,549 1.92 3-A (10) Liaoning 470 340 1.39 
     3-B (10) Liaoning, Jilin 225 154 1.47 
 4 (89) Beijing, Tianjin, Shandong, Hebei 2,694 1,550 1.82 4 (3) Shandong 1,891 1,044 1.87 
 5 (122) Shanxi, Henan 2,092 1,523 1.41 5-A (78)  1,344 1,023 1.33 
     5-B (88)  1,106 843 1.33 
Female 
 1 (17) Shanghai 1,293 404 3.45 1 (30) Shanghai 1,212 704 1.80 
 2 (58) Tianjin 1,216 503 2.57 2 (73) Tianjin, Hebei 765 569 1.37 
 3 (52) Liaoning, Shandong 1,105 507 2.29 3 (10) Liaoning 205 134 1.54 
 4 (16) Shandong 353 173 2.07 4 (59) Shandong 1,006 535 1.96 
 5 (26) Jilin 503 254 2.02      
 6 (89) Beijing, Hebei 785 497 1.62 6 (73) Beijing, Hebei 765 569 1.37 
 7 (33) Zhejiang 411 299 1.39 7 (33) Zhejiang 392 267 1.48 
     8 (2) Chongqing 138 61 2.28 
     9 (7) Guangdong 192 111 1.74 
     10 (29) Shaanxi 168 84 2.01 

As the 1990–1992 and 2004–2005 surveys sampled noncongruent counties that were determined to be representative of province, the locations of these two datasets appear to be isolated counties in the map. The middle and the right panels in Fig. 2 are clusters for the 1990–1992 and 2004–2005 surveys. Without model adjustment, they all reveal a large cluster that spans from the Northeast to Shanghai. With model adjustment, the cluster shrinks to either mainly Shandong (female) or from Beijing to Shanghai (male). This implies that for the vast area of Northeast China, where there has been rapid industrial development, the high LCM rate is most likely explained by the high industrial development level. In the middle part of East China (from Beijing to Shanghai), the industrial development level is not high enough to fully explain the high LCM rate, and further research is needed to explore more factors not considered in our study.

The geographic pattern of high LCM in China is different from the current pattern in Europe and the United States, where higher LCM is associated with lower SES. However, during the economic expansion of post–World War II, LCM in the United States was higher in areas with higher SES. (38) It is possible that the trend of LCM in different level of SES in China is following the trend of the LCM in the United States, with a lag consistent to the lag in economic development. During the initial industrialization in a traditional agricultural society, higher cancer rates existed in more affluent areas where people are exposed to more pollution. Such areas also experience a faster increase of energy consumption and significant lifestyle changes from outdoor physical activities to indoor machine operation. In this period, people in more affluent areas may have more resources to buy and consume tobacco products. The pollution exposure and smoking behavior may interact and result in more complicated LCM patterns. These interactions may be responsible for a portion of LCM and at least partially explain the sharp increase and regional variation of LCM in the past 40 years in China.

Gender is an important factor that influences the impact of other regional covariates. In China, most men smoke. The average regional men's smoking rate remained around 60% in this study, with very little variation among regions. In a more recent report of 2010 data (52), men's smoking was still as high as 53%. Results in this study show that the impact of industrial development level on LCM is stronger than it is for women. A possible explanation is that in a work environment dominated by men, workers are exposed to air pollution as well as smoking and secondhand smoking, and the interaction makes the impact stronger. For women, the impact of smoking on the regional variation of LCM is stronger than that for men. Because of traditional views on women's behavior in China, smoking among women is relatively rare, but smoking among women varies widely between regions, largely due to local customs in some regions and rural areas. Regional female smoking rate ranges between 2% and 24% in the 1984 smoking survey. It is possible where female smoking rate is high, other confounders interact with smoking to increase the female LCM. Americans have more than halved smoking rates over the past five decades (53) through relentless pursuit of tobacco control; it is a long way to go on China's public view, policy making, and collective effort to control tobacco use, and it will remain an important task to monitor and report trends of LCM and its regional variations.

The results are particularly significant because China's resident registration system keeps the vast majority of the population in a place for decades. China's industrial development was highly controlled by the government for more than four decades after 1949. Under this central command system, more than 80% of the industrial output was produced by state-owned industries, and industrial laborers were permanent employees working in the same enterprises until retirement. Although rural industrialization and migrant workers gained momentum after the mid-1990s (54), the influence on the regional LCM rates in this research should be minimal because lung cancer's long latency period would require a period of at least 20 years from the date of exposure before assessing the risk of cancer. (55) Given China's 1.3 billion population and relatively stable regional inhabitants during our study period, the spatial variation of LCM explained by regional industrialization should be more reliable and provide a better understanding of the effect of early-stage industrialization on LCM.

Further study is necessary to identify the locations where the LCM may occur in the future so that early action for prevention and treatment measures can be taken. There is also a need to explore other risk factors that are not included in this study, especially in areas that are identified as high clusters with model adjustment. This study is about China, but this situation may happen in other countries experiencing rapid industrialization, such as India and other developing countries.

This study is not without limitations. First, the LCM rate is crude rate instead of age-adjusted rate, making it difficult to compare with other studies. The crude rate is used mainly because it is difficult to choose a standard population. Over the four decades of the study period, China implemented a family planning policy aiming to control population increase. The breadth and strength of implementing the policy, though, varies widely, making the population profiles complicated between regions. Using any standard population, age adjustment unavoidably under- or overestimates the LCM burden in certain areas, making it subjective to evaluate regional LCM burdens. On the basis of this consideration, crude rate is analyzed to evaluate the absolute burden on LCM and help the cancer control and prevention effort in allocating resources where they are mostly needed. Another possible issue is the sparseness of the county units in the two later surveys. As described, the county survey units were selected from the whole spectrum of LCM severity in a province, so that they provide a complete picture of LCM in their respective province. They are actually samples within a province representing the whole province's LCM status. As such, the resulting clusters actually span across provinces instead of counties. Any analysis of a sample needs to consider the sampling errors in the statistical model, which will bring extra computation burden. For simplicity and interpretation of results, this study does not seek approaches that involve survey design or sampling error. With the creation and development of cancer surveillance systems in China, it is with hope that population-based cancer surveillance data will become available in the near future, so that small-area spatial analysis based on county-level data is possible.

A spatial mixed effect model is applied to explain differences in LCM in China by gender, economic development level, and smoking behavior, Factors that significantly impact LCM are identified. Regions with high risk for LCM are identified. Those that remain to be high LCM clusters after model adjustment deserve further exploration in other risk factors.

This study shows that the Eastern China has a higher risk for LCM than the Western China and that the risk is higher in urban areas. At the provincial level, smoking increases the risk of LCM for women, and the association of regional variation of smoking rates to male LCM is not significant. At the county level, smoking increases the risk of LCM for both men and women, and the impact on the female regional variation of LCM rate is about 4 times as high as that of males. Both province and county level analysis show that a higher level of industrial development is associated with elevated LCM rates, with higher impact on males, which account for about 70% of all LCM. Higher education levels are linked to higher LCM rates, more for men than for women. High-risk areas are clustered in major cities and more affluent areas. After adjusting for a model that considers smoking, urban/rural status, region, education level, and industrial development level, high-risk clusters either shrink to smaller areas or shift to nearby areas. Additional factors contributing to high LCM risk is a topic for further epidemiologic study.

No potential conflicts of interest were disclosed.

Conception and design: X. Shen, L. Wang, L. Zhu

Development of methodology: X. Shen, L. Wang, L. Zhu

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): X. Shen, L. Wang

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): X. Shen, L. Wang, L. Zhu

Writing, review, and/or revision of the manuscript: X. Shen, L. Zhu

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): X. Shen, L. Wang

Study supervision: X. Shen, L. Wang, L. Zhu

X. Shen received the following grants: Connecticut State University SystemAAUP Faculty Research Grant (2011 and 2012) and Central Connecticut State University School of Arts and Sciences Deans Research Initiative Grant (2011).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Zeng
H
,
Zheng
R
,
Zhang
S
,
He
J
,
Chen
W
. 
Lung cancer incidence and mortality in China, 2008
.
Thoracic Cancer
2013
;
4
:
53
8
.
2.
Seow
W
,
Hu
W
,
Vermeulen
R
,
Hosgood
HD
 III
,
Downward
G
,
Chapman
R
, et al
Household air pollution and lung cancer in China: a review of studies in Xuanwei
.
Chinese J Cancer
2014
;
33
:
471
5
.
3.
Chinese Center for Disease Control and Prevention
. 
National Disease Surveillance System Causes of Death Statistics, 2014
.
Beijing, China
:
People's Medical Publishing House
; 
2014
.
4.
Cancer Research UK
. 
How many people die from lung cancer? In Lung Cancer and Smoking
.
London, United Kingdom
:
Cancer Research UK
; 
2017
.
Available from
: http://info.cancerresearchuk.org/cancerstats/keyfacts/lung-cancer.
5.
Chinese Center for Disease Control and Prevention
. 
China causes of death registry dataset
.
Beijing, China
:
Chinese Center for Disease Control and Prevention
; 
2016
.
6.
Yang
G-H
,
Wang
J-F
,
Wan
X
,
Wang
L-J
,
Chen
A-P
. 
Quantitative analysis of factors affected mortality trend in Chinese, 2002
.
Zhonghua Liu Xing Bing Xue Za Zhi
2005
;
26
:
934
38
.
7.
Zhi
XY
,
Zou
XN
,
Hu
M
,
Jiang
Y
,
Jia
MM
,
Yang
GH
. 
Increased lung cancer mortality rates in the Chinese population from 1973–1975 to 2004–2005: An adverse health effect from exposure to smoking
.
Cancer
2015
;
121
:
3107
12
.
8.
Institute for Health Metrics and Evaluation
. 
Global burden of disease
.
Seattle, WA
:
University of Washington
; 
2016
.
Available from
: http://www.healthdata.org.
9.
Doll
R
,
Hill
AB
. 
Smoking and carcinoma of the lung
.
BMJ
1950
;
2
:
739
48
.
10.
Doll
R
,
Peto
R
. 
The causes of cancer; quantitative estimates of avoidable risks of cancer in the United States today
.
J Natl Cancer Inst
1981
;
66
:
1191
308
.
11.
U.S. Department of Health and Human Services
.
Surgeon general's report—the health consequences of smoking
.
Rockville, MD
:
NCI
; 
2004
.
12.
Centers for Disease Control and Prevention
.
Lung cancer awareness
.
Atlanta, GA
: Centers for Disease Control and Prevention; 
2017
.
Available from
: http://www.cdc.gov/Features/LungCancer.
13.
Yang
G
,
Hu
A
,
editors
.
Smoke control and China's future
.
Beijing, China
:
Economic Daily Publisher
; 
2011
.
14.
Liang
X
,
Chinese Center for Disease Control and Prevention
.
2015 China Adult Tobacco Survey Report
.
Beijing, China
:
People's Medical Publishing House
; 
2016
.
15.
Committee of the National Patriotic Health Campaign
,
Ministry of Health PRC
,
editors
.
1984 National Survey of Smoking (1984 nian quanguo xiyan chouyang diaocha ziliao huibian)
.
Beijing, China
:
People's Health Publishing House
; 
1988
.
16.
Office of Committee of the National Patriotic Health Campaign, Department of Disease Control, Chinese Ministry of Health, Organizations Responsible for National Prevalence Survey, Chinese Academy of Preventive Medicine, Chinese Association of Smoking and Health
.
Smoking and Health in China, 1996 National Prevalence Survey of Smoking Pattern
.
Beijing, China
:
Chinese Science and Technology Press
; 
1997
.
17.
National Office for Cancer Prevention and Control, National Central for Cancer Registry, Disease Prevention and Control Bureau, Ministry of Health China
,
editors
.
Chinese Cancer Mortality Report, The Third National Retrospective Survey of Deaths by Cause
.
Beijing, China
:
People's Medical Publishing House
; 
2010
.
18.
Menck
HR
,
Casagrande
JT
,
Henderson
BE
. 
Industrial air pollution: possible effect on lung cancer
.
Science
1974
;
183
:
210
2
.
19.
Higginson
J
. 
Cancer and environment: Higginson speaks out
.
Science
1979
;
205
:
1363
4
.
20.
Fan
YG
,
Hu
P
,
Jiang
Y
,
Chang
RS
,
Yao
SX
,
Wang
W
, et al
Association between sputum atypia and lung cancer risk in an occupational cohort in Yunnan, China
.
Chest
2009
;
135
:
778
85
.
21.
Checkoway
H
,
Ray
RM
,
Lundin
JI
,
Astrakianakis
G
,
Seixas
NS
,
Camp
JE
, et al
Lung cancer and occupational exposures other than cotton dust and endotoxin among women textile workers in Shanghai, China
.
Occup Environ Med
2011
;
68
:
425
9
.
22.
Gao
PF
,
Chen
BT
,
Hearl
FJ
,
McCawley
MA
,
Schwerha
DJ
,
Odencrantz
J
, et al
Estimating factors to convert Chinese ‘total dust' measurements to ACGIH respirable concentrations in metal mines and pottery industries
.
Ann Occup Hyg
2000
;
44
:
251
7
.
23.
Maeda
M
,
Nishimura
Y
,
Kumagai
N
,
Hayashi
H
,
Hatayama
T
,
Katoh
M
, et al
Dysregulation of the immunesystem caused by silica and asbestos
.
J Immunotoxicol
2010
;
7
:
268
78
.
24.
Hashim
D
,
Gaughan
D
,
Boffetta
P
,
Lucchini
RG
. 
Baseline Serum β-carotene concentration and mortality among long-term asbestos-exposed insulators
.
Cancer Epidemiol Biomarkers Prev
2015
;
24
:
555
60
.
25.
Hosgood
HD
 III
,
Song
M
,
Hsiung
C
,
Yin
Z
,
Shu
X
,
Wang
Z
, et al
Interactions between household air pollution and GWAS-identified lung cancer susceptibility markers in the Female Lung Cancer Consortium in Asia (FLCCA)
.
Hum Genet
2015
;
134
:
333
41
.
26.
Hosgood
HD
 III
,
Wei
H
,
Sapkota
A
,
Choudhury
I
,
Bruce
N
,
Smith
K
, et al
Household coal use and lung cancer: systematic review and meta-analysis of case-control studies, with an emphasis on geographic variation
.
Int J Epidemiol
2011
;
40
:
719
28
.
27.
Hosgood
HD
 III
,
Chapman
R
,
He
X
,
Hu
W
,
Tian
L
,
Liu
L
, et al
History of lung disease and risk of lung cancer in a population with high household fuel combustion exposures in rural China
.
Lung Cancer
2013
;
81
:
343
6
.
28.
Tie
X
,
Wu
D
,
Brasseur
G
. 
Lung cancer mortality and exposure to atmospheric aerosol particles; Guangzhou, China
.
Atmospheric Environ
2009
;
43
:
2375
7
.
29.
Cao
J
,
Yang
CX
,
Li
JX
,
Chen
RJ
,
Chen
BH
,
Gu
DF
, et al
Association between long-term exposure to outdoor air pollution and mortality in China: a cohort study
.
J Hazard Mater
2011
;
186
:
1594
600
.
30.
Xu
ZY
,
Yu
DG
,
Jing
LB
,
Xu
XP
. 
Air pollution and daily mortality in Shenyang, China
.
Arch Environ Health
2000
;
55
:
115
20
.
31.
Du
YX
,
Cha
Q
,
Chen
XW
,
Chen
YZ
,
Huang
LF
,
Feng
ZZ
, et al
An epidemiological study of risk factors for lung cancer in Guangzhou, China
.
Lung Cancer
1996
;
14
:
S9
S37
.
32.
Wong
CM
,
Tsang
H
,
Lai
HK
,
Thomas
GN
,
Lam
KB
,
Chan
KP
, et al
Cancer mortality risks from long-term exposure to ambient fine particle
.
Cancer Epidemiol Biomarkers Prev
2016
;
25
:
839
45
.
33.
Loomis
D
,
Grosse
Y
,
Lauby-Secretan
B
,
El Ghissassi
F
,
Bouvard
V
,
Benbrahim-Tallaa
L
, et al
The carcinogenicity of outdoor air pollution
.
Lancet Oncol
2013
;
14
:
1262
3
.
34.
World Health Organization.
Outdoor air pollution
.
Lyon, France
:
International Agency for Research on Cancer
; 
2016
.
35.
China State Statistical Bureau, Department of Comprehensive Statistics, editors
.
China Compendium of Statistics 1949–2008
.
Beijing, China
:
China Statistics Press
; 
2009
.
36.
China State Statistical Bureau
.
China Statistical Yearbook, 2016
.
Beijing, China
:
China Statistical Publishing House
; 
2016
.
37.
Gomez
SL
,
Shariff-Marco
S
,
Rouen
MD
,
Keegan
THM
,
Yen
IH
,
Mujahid
M
, et al
The impact of neighborhood social and built environment factors across the cancer continuum: current research, methodologic considerations, and future directions
.
Cancer
2015
;
121
:
2314
30
.
38.
Singh
G
,
Miller
B
,
Hankey
B
. 
Changing area socioeconomic patterns in U.S. cancer mortality, 1950–1998: Part II–Lung and colorectal cancers
.
J Natl Cancer Inst
2002
;
94
:
916
25
.
39.
Bryere
J
,
Dejardin
O
,
Bouvier
V
,
Colonna
M
,
Guizard
A-V
,
Troussard
X
, et al
Socioeconomic environment and cancer incidence: a French population-based study in Normandy
.
BMC Cancer
2014
;
14
:
1
10
.
40.
Merletti
F
,
Galassi
C
,
Spadea
T
. 
The socioeconomic determinants of cancer
.
Environ Health
2011
;
10
:
1
7
.
41.
Vanthomme
K
,
Vandenheede
H
,
Hagedoorn
P
,
Gadeyne
S
. 
Socioeconomic disparities in lung cancer mortality in Belgian men and women (2001-2011): does it matter who you live with?
BMC Public Health
2016
;
16
:
493
.
42.
Elk
R
,
Landrine
H
,
editors
.
Cancer disparities, causes and evidence-based solutions
.
New York, NY
:
Springer
; 
2012
.
43.
Wang
M
,
Song
H
,
Chen
WQ
,
Lu
CY
,
Hu
QS
,
Ren
ZF
, et al
Cancer mortality in a Chinese population surrounding a multi-metal sulphide mine in Guangdong province: an ecologic study
.
BMC Public Health
2011
;
11
.
44.
Cui
J
,
Li
H
,
Ma
Q
,
Diao
Y-T
,
Yin
C
,
Cui
Y-c
, et al
Change trend analysis of lung cancer mortality during period of 1970-2005 in Shandong province
.
Cancer Res Prev Treat
2010
;
37
:
838
41
.
45.
National Office for Cancer Prevention and Control
.
Chinese Cancer Mortality Research
.
Beijing, China
:
People's Medical Publishing House
; 
1980
.
46.
National Office for Cancer Prevention and Control
.
Chinese Cancer Mortality Research (1990–1992) I & II
.
Beijing, China
:
People's Medical Publishing House
; 
2008
.
47.
Chen
Z
,
editor
.
Report of the Third National Retrospective Sampling Survey of Deaths by Cause
.
Beijing, China
:
Peking Union Medical College Press
; 
2008
.
48.
Li
JY
,
Liu
BQ
,
Li
GY
,
Chen
ZJ
,
Sun
XD
,
Rong
SD
. 
Atlas of cancer mortality in china an aid for cancer control and research
.
Int J Epidemiol
1981
;
10
:
127
33
.
49.
China State Statistical Bureau
.
China Statistical Yearbook, 1997
.
Beijing, China
:
China Statistical Publishing House
; 
1997
.
50.
China State Statistical Bureau
.
China Statistical Yearbook, 1986
.
Beijing, China
:
China Statistical Publishing House
; 
1986
.
51.
Kulldorff
M
,
editor
.
SaTScan User Guide for version
; 
2014
.
52.
Li
Q
,
Hsia
J
,
Yang
G
. 
Prevalence of smoking in China in 2010
.
N Engl J Med
2011
;
364
:
2469
70
.
53.
U.S. Department of Health and Human Services
.
The health consequences of Ssmoking - 50 years of progress: A Report of the Surgeon General
.
Atlanta, GA
:
Centers for Disease Control and Prevention
; 
2014
.
54.
Shen
X
. 
Spatial inequality of rural industrial development in China, 1989–1994
.
J Rural Studies
1999
;
15
:
179
99
.
55.
Smith
D
. 
Re: "Occupational exposure and lung cancer risk: a population-based case-referent study in Sweden"
.
Am J Epidemiol
2001
;
153
:
1028
9
.