Abstract
Background: Inflammatory breast cancer (IBC) is a rare and highly aggressive form of primary breast cancer. Little is known about the risk factors for IBC, specifically the association with socioeconomic position (SEP).
Methods: The association between breast cancer type (IBC vs. non-IBC) with county-level SEP in the Surveillance, Epidemiology, and End Results database for cases diagnosed from 2000 to 2007 was examined. County-level SEP characteristics included metropolitan versus non-metropolitan residence, percentage below the poverty level, percentage less than high-school graduate, and an index combining the poverty and high-school variables. IBC and non-IBC age-adjusted incidence rates were calculated, stratified on SEP and race/ethnicity. The odds of IBC versus non-IBC given a particular SEP characteristic, adjusting for age and race/ethnicity, was examined through fitting of hierarchical logistic regression models (HLM).
Results: Incidence rates for IBC generally increased as SEP decreased, whereas the opposite was found for non-IBC. HLM results showed that low SEP is associated with higher odds of IBC: highest (≥20%) versus lowest (<10%) persons below the poverty level [OR (95% confidence interval, CI) = 1.25 (1.09–1.43)]; highest (>28.76%) versus lowest (≤15.99%) persons less than high-school graduate [OR (95% CI) = 1.25 (1.10–1.42)]; and low SEP as measured by poverty-high school index versus high SEP [OR (95% CI)= 1.26 (1.11–1.44)].
Conclusion: Overall breast cancer has been found to be positively associated with SEP, whereas in this analysis, IBC was associated with decreasing SEP.
Impact: Studies focused on understanding the disparity in IBC incidence, as well as interventions to eliminate these differences are needed. Cancer Epidemiol Biomarkers Prev; 21(1); 155–65. ©2011 AACR.
Introduction
Breast cancer is a heterogeneous disease, characterized by distinct tumor subtypes thought to correspond to different etiologies (1–5). Inflammatory breast cancer (IBC) is a rare and highly aggressive form of primary breast cancer (6–11). Although risk factors for IBC remain largely unknown, some studies have shown different risk factor profiles for IBC as compared with non-IBC cases (12–15).
Breast cancer incidence in the United States is related to socioeconomic position (SEP; ref. 16), being greater among women with higher education and income (17, 18) and among women residing in communities with higher average levels of education and income (19–23). Although some studies have found much of this association can be explained by known breast cancer risk factors (17, 22), a study examining both individual- and community-level SEP revealed that after adjusting for individual SEP and breast cancer risk factors, women living in the highest SEP communities continued to have greater odds of having breast cancer than women living in the lowest SEP communities, suggesting community-level effects on breast cancer risk (16). Robert and colleagues hypothesized that these community effects could independently affect breast cancer risk through various pathways including more access to mammograms (leading to more breast cancer detection) and community norms such as exogenous hormone use, alcohol intake, and diet (16).
Higher incidence of overall breast cancer in urban areas, both in the United States and internationally, has been reported for many years (24–28). Residence is also related to SEP, with rural residents in the United States generally having lower income, less education, and lower health insurance coverage than their urban counterparts (29).
Given the lack of knowledge about factors associated with IBC incidence and the evidence that some overall breast cancer risk factors may not have the same effect on IBC risk, the aim of this study was to examine the association of county-level SEP measures to IBC and non-IBC incidence in the U.S. Surveillance, Epidemiology, and End Results (SEER) database linked to 2000 U.S. Census county-attribute data.
Materials and Methods
Data source
The SEER 17 Registries database linked to 2000 U.S. county attributes was used for this analysis (30). The population-linked data set includes all breast cancer cases from 2000 to 2007 for the following SEER registries: Atlanta, Connecticut, Detroit, Hawaii, Iowa, New Mexico, San-Francisco-Oakland, Seattle-Puget Sound, Utah, Los Angeles, San Jose-Monterey, rural Georgia, the Alaska Native Tumor Registry, Greater California, Kentucky, Louisiana, and New Jersey (31). The U.S. SEER database covers approximately 26% of the U.S. population, including 23% of African Americans, 40% of Hispanics, 42% of American Indians and Alaska Natives, 53% of Asians, and 70% of Hawaiian/Pacific Islanders (32).
Individual-level measures
The outcome variable for this analysis was diagnosis of a first malignant primary breast cancer [International Classification of Diseases for Oncology (ICD-O-3) = C500-C509] as IBC or non-IBC. To be certain all IBC cases were captured, a comprehensive case definition was used where a breast cancer case having any one of the following codes assigned to the SEER variables below was classified as IBC (6, 15, 33–35):
Site and morphology. Histologic type ICD-O-3 (2000–2007) = 8530 (“Inflammatory Carcinoma”; refs. 36, 37)
Stage - TNM. Derived AJCC [American Joint Committee on Cancer] T, 6th ed (for cases 2004+) = T4d (“Inflammatory Carcinoma”; refs. 36, 38)
Extent of disease [EOD] - CS.CS extension (for cases 2004+) = 71-73 (36, 39)
71: “Diagnosis of inflammatory carcinoma without a clinical description of inflammation, erythema, edema, peau d'orange, etc., of more than 50% of the breast, with or without dermal lymphatic infiltration. Inflammatory carcinoma, NOS.”
72: “Diagnosis of inflammatory carcinoma with a clinical description of inflammation, erythema, edema, peau d'orange, etc., of less than or equal to 50% of the breast, with or without dermal lymphatic infiltration.”
73: “Diagnosis of inflammatory carcinoma with a clinical description of inflammation, erythema, edema, peau d'orange, etc., of more than 50% of the breast, with or without dermal lymphatic infiltration.”
Extent of disease - Historic. EOD 10 - extent (for cases 2000–2003) = 70 (36, 40).
70: “Inflammatory carcinoma, including diffuse (beyond that directly overlying the tumor) dermal lymphatic permeation or infiltration.”
All other histologic types were considered non-IBC. Breast cancers assigned to histologic codes 9590 and greater (lymphomas) were not included.
Age at diagnosis was analyzed as a continuous variable. A merged race and ethnicity variable with the following categories was also examined: Non-Hispanic White, Black, Hispanic White, Asian/Pacific Islander, and American Indian/Alaska Native.
County-level measures
Four county-level measures of SEP derived from the 2000 U.S. census were used in this analysis (41). Counties were divided into metropolitan (metro) versus non-metro areas based on their 2003 U.S. Department of Agriculture Rural-Urban Continuum code (RUCC), as has been done in previous cancer studies (42–45). Codes 1–3 were defined as metro counties, whereas codes 4–9 defined non-metro counties (code definitions below).
Rural-Urban Continuum Code Definitions for 2003 (46, 47):
Metro counties:
1 (Counties in metro areas of ≥1 million population)
2 (Counties in metro areas of 250,000–1 million population)
3 (Counties in metro areas of <250,000 population)
Non-metro counties:
4 (Urban population of ≥20,000, adjacent to a metro area)
5 (Urban population of ≥20,000, not adjacent to a metro area)
6 (Urban population of 2,500–19,999, adjacent to a metro area)
7 (Urban population of 2,500–19,999, not adjacent to a metro area)
8 (Completely rural or <2,500 urban population, adjacent to a metro area)
9 (Completely rural or <2,500 urban population, not adjacent to a metro area)
Percentage of persons within a county living below the federal poverty level was divided into 3 categories: <10%, 10%–19.99%, and ≥20%. This measure has been used in other SEER-based studies of SEP variation and disparities in cancer incidence and outcomes (48, 49). Percentage of the population below the poverty level has several advantages as a SEP measure. It is easily understood, being based on readily interpretable variables with a priori cutoff points, with areas having ≥20% poverty generally considered “poverty areas” in census publications and studies using census data (48–51). Percentage below the poverty level also takes into account a family's size and age structure and is directly tied to the person's ability to buy a representative basket of goods and services, being updated annually by the U.S. Census Bureau to reflect changes in the Consumer Price Index (48, 52).
Education, specifically completion of high school measured at both the individual and aggregate level, has been previously shown to be positively associated with overall breast cancer incidence (19, 53). To examine the county-level effect of high-school education on IBC incidence, percentage of adults (older than 25 years) in a county who did not graduate from high school was divided into quartiles on the basis of the distribution of the variable across all counties in the United States using the SEER county attributes database (54). The quartiles were as follows: ≤15.99%, 16%–20.80%, 20.81%–28.76%, and >28.76%.
To examine the joint effect of poverty and education on the odds IBC versus non-IBC incidence, a composite measure of these two variables was also created. The percentage below high-school graduate quartiles were combined with the 3-category poverty variable to create high, middle, and low SEP categories as follows: high SEP, <10.00% poverty and <15.99% less than high-school graduate; low SEP,(i) 10%–19.99% poverty and >28.76% less than high-school graduate or (ii) ≥20% poverty and >28.76% less than-high school graduate or (iii) ≥20% poverty and 20.81%–28.76% less than high-school graduate, with the middle SEP group being all remaining combinations. A similar index was used previously in a study examining overall breast cancer survival in the SEER database (43).
Statistical analysis
Statistical software developed by the SEER program (SEER*stat version 6.6.2, National Cancer Institute, Bethesda, MD) readily allows for the calculation of incidence rates using the incidence data set linked to county attribute and population data (55). Female adult (older than 18 years) IBC and non-IBC incidence rates (IR) for the period from 2000 to 2007 were calculated. IRs were directly age-adjusted to the 2000 U.S. standard population (56, 57) and further stratified on the following county-level SEP measures: metro versus non-metro residence, percentage below poverty, and percentage less than high-school graduate, as well as race/ethnicity. Upper and lower 95% confidence intervals (CI) around the IR ratios were used to test for statistical significance. A 95% IR ratio (CI not including 1) was indicative of the rates being statistically significantly different at the 0.05 significance level (58). The 95% CIs for the IRs and IR ratios were calculated using the method described in the work of Tiwari and colleagues (59). While SEER*stat allows for the calculation of incidence rates stratified by county attribute measures, it does not allow for the calculation of incidence rates by a variable created from the merging of two county attributes, such as the poverty-high school index.
To directly compare the association of IBC and non-IBC cases with SEP, hierarchical logistic regression models were fit using the SAS Glimmix procedure (version 9.1; SAS Institute Inc.; ref. 60). As women who reside in the same county may have similar, unmeasured characteristics, and therefore be more alike than individual women across different counties, it is important to take this intraclass correlation into account as ignoring it may lead to underestimation of the regression coefficient's SE (61, 62). A hierarchical modeling structure allows for the examination of both individual- and county-level fixed effects while accounting for the effect due to clustering within counties through the addition of a random intercept, leading to more accurate SE estimates (61–63).
Four separate HLMs were fit, with each model including one county-level SEP measure as well as age at diagnosis and merged race/ethnicity as independent variables. Age at diagnosis and race/ethnicity were included in all models due to their previously reported association with the incidence of different histopathologic types of breast cancer, specifically IBC (1, 6, 7, 15, 33, 64–66). All HLMs modeled the log ORs of IBC. The general model is outlined below where the county intercept becomes a linear combination of a grand mean (α), the county SEP fixed effect (γ), and county random effects (μ), with β1 and β2 being individual fixed effects (63):
Results
First primary breast cancer cases (n = 346,211) in women older than 18 years diagnosed from 2000 to 2007 were available for this analysis; 39 cases missing county-level information and 2,545 cases missing race/ethnicity were excluded, leaving 343,627 cases. After these exclusions, there were 5,536 IBC and 338,091 non-IBC cases included in this analysis. In the model examining metro versus non-metro counties, a further 394 cases were excluded because of missing RUCC, leaving 5,525 IBC and 337,708 non-IBC cases for analysis.
Table 1 describes the study population characteristics. The mean age at diagnosis for IBC was 58.0 versus 60.7 years for non-IBC. The majority of IBC cases were of Non-Hispanic White race/ethnicity (67.0%) followed by Black (14.6%), Hispanic White (12.7%), Asian/Pacific Islander (5.1%), and American Indian/Alaska Native (0.7%). The comparable percentages for non-IBC were 75.0%, 9.1%, 8.6%, 6.9%, and 0.4%; 22.6% of IBC cases were classified as high SEP using the poverty-high school index, whereas 51.4% were middle SEP and 26.0% low SEP. Using the same index, 27.3% of non-IBC cases were considered high SEP, 50.8% middle SEP, and 21.8% low SEP.
Table 2 provides the age-adjusted IRs (per 100,000) and IR ratios for IBC according to SEP characteristics and race/ethnicity and Table 3 provides the corresponding results for non-IBC. There was no difference in the overall IBC IRs for metro versus non-metro residence. When these rates were stratified by race/ethnicity, the only significant difference found was for Blacks (metro IR = 3.4, non-metro IR = 4.8). The IBC IRs for counties where >10% of the residents were below the poverty level were generally significantly higher for all races combined, Non-Hispanic Whites, and Blacks. No significant difference was seen in IBC poverty stratified rates for Hispanic Whites, Asians/Pacific Islanders, or American Indians/Alaska Natives. The IBC IRs for counties where a higher percentage of the population had not graduated from high school followed a similar pattern as the poverty variable.
Non-IBC IRs for those living in non-metro counties were significantly lower for all races combined and Non-Hispanic Whites but significantly higher for Asians/Pacific Islanders and American Indians/Alaska Natives race/ethnicity groups. There was no significant difference in the metro and non-metro IRs for Blacks and Hispanic Whites. Non-IBC IRs for those living in counties where >10% of the residents were below the poverty level were significantly lower for all races combined, Non-Hispanic Whites, Hispanic Whites, Asians/Pacific Islanders, and American Indians/Alaska Natives, with no significant difference in rates observed for Blacks. The non-IBC IRs for counties where a higher percentage of the population had not graduated from high school followed a similar pattern as the poverty variable.
Table 4 gives the ORs and 95% CIs for the 4 HLMs. In these models, which account for the clustering of cases within counties, after adjustment for age at diagnosis and race/ethnicity, residing in a high poverty county (≥20%), a county in the highest quartile of persons with less than a high school degree, and a low SEP county based on the poverty-high school index was significantly associated with higher odds of IBC [OR (95% CI) = 1.25 (1.09–1.43), 1.25 (1.10–1.42), and 1.26 (1.11–1.44), respectively]. Younger age and Black, Hispanic White, and American Indian/Alaska Native race/ethnicity were also significantly associated with higher odds of IBC in all models, whereas Asian/Pacific Islander race/ethnicity was significantly associated with lower odds of IBC.
Discussion
Contrary to the previously reported positive association between urban residence, SEP, and overall breast cancer occurrence (16–28), this study found that living in a high poverty county (≥20%), a county with a high percentage of less than high-school graduates, and residing in a low SEP county as defined by the poverty-high school index were significantly associated with IBC, even after adjustment for age at diagnosis and race/ethnicity.
Prior studies examining overall breast cancer occurrence have found it to be associated with urban residence and higher SEP (16–28). However, the majority of breast cancers are non-IBC and thus determining IBC's association with SEP based on studies of all breast cancer types is difficult. This study used an inclusive definition to separately characterize IBC from all other breast cancer types and then directly compared IBC with non-IBC through use of HLMs adjusting for age and race/ethnicity, to specifically examine SEP and its association with IBC as a distinct breast cancer entity.
Residing in a county with a large percentage of persons below the poverty level, less than high-school graduates, and in the low SEP group of the poverty-high school index were all associated with IBC in this analysis, suggesting that poverty and education are capturing similar aspects of SEP that affect IBC incidence. As SEP measures, poverty and education both act as a summary measure of a county's SEP and can be compared over time and across U.S. geographic areas (50). While poverty and education are correlated, as each has been shown to be related to overall breast cancer incidence, they were included as separate SEP measures as well as in a combined index in this analysis (16–23, 67). Metro versus non-metro area of residence at diagnosis captures various characteristics that can be directly and/or indirectly related to an individual's health, such as population density, geographic isolation, exposure to agriculture, industrial or commercial complexes, and proximity and access to health care services (68, 69). Furthermore, there are many ways to classify counties based on characteristics such as administrative units, land use, and economic concepts (68). It is possible the definition of metro versus non-metro used in this analysis does not capture specific factors that may be related to IBC and thus explain the lack of association seen.
Although risk factors for IBC remain largely unknown, some studies have shown different risk factor profiles for patients with IBC as compared with patients with non-IBC. Chang and colleagues found that high BMI was significantly associated with increased risk of IBC, regardless of menopausal status (12). This is in contrast to overall breast cancer, where higher premenopausal weight has been shown to reduce risk (70). Chang and colleagues also found patients with IBC were more likely to be premenopausal and have younger age at menarche and first birth than patients with non-IBC and non–breast cancer (12).
A study conducted in France by Le and colleagues found that patients with IBC had a lower educational level, a higher BMI, a longer cumulative duration of breastfeeding, and included a greater proportion of non-European women than patients with non-IBC (13). A recent study of Egyptian breast cancer cases found that patients with IBC had significantly lower parity than patients with non-IBC (14). Furthermore, a 2010 study based in Tunisia reported a rural predominance of IBC among the cases studied and hypothesized that the reduction in IBC seen in that country was due, in part, to increasing SEP (71).
In this analysis, IBC was associated with younger age and Black race/ethnicity, whereas Asian/Pacific Islander race/ethnicity was associated with lower odds of IBC, as found in previous studies (1, 6, 7, 15, 33, 64–66). White Hispanic and American Indian/Alaska Native race/ethnicity were also found to be significantly associated with IBC. A previous study showed no difference in the age-adjusted IBC IR between Hispanic and non-Hispanic women for cases diagnosed from 1994 to 1998 reported to the North American Association of Central Cancer Registries (64). However, this study did not classify Hispanic origin as mutually exclusive from other race/ethnicities and used the more restrictive ICD-O-3 8530 code to define IBC (64). Younger age at IBC diagnosis has been reported for American Indian/Alaska Native women as compared with White women (6, 64), although no studies which directly compared the IRs or proportion of IBC between American Indian/Alaska Native women and other race/ethnicities were located.
The strengths of this study include the use of the U.S. SEER database, 5 mutually exclusive race/ethnicity categories, a comprehensive definition of IBC, and a hierarchical modeling structure. The SEER program is considered the standard for cancer registry data quality worldwide (72). Quality control studies, including case-finding, recoding, and reliability studies, are continually conducted by the SEER program to ensure data included in the registries are accurate and collected and recorded in a uniform and timely manner across all registries (72). As IBC is a relatively rare diagnosis, the U.S. SEER database, which covers 26% of the U.S. population residing in varying regions and geographic areas with overrepresentation of minority groups, allows for the stratification of IBC incidence by SEP and race/ethnicity categories (32).
Previous studies have been limited to reporting IBC rates and proportions for a limited number of race/ethnicity categories, usually for White, Black, and Other, due to small numbers of cases as well as the manner in which these data were recorded by SEER (1, 7, 15, 33, 73). Beginning with the November 2005 SEER data submission, the algorithms for creating the race recode variables within the SEER database were revised, allowing for the examination of incidence for 4 race categories: White, Black, American Indian/Alaska Native, and Asian/Pacific Islander, as well as Hispanic ethnicity (73). The race and Hispanic ethnicity data can also now be merged to create mutually exclusive race/ethnicity categories (73). This allowed for the current analysis to report results for 5 mutually exclusive race/ethnicity categories, as opposed to the more limited race/ethnicity analyses in previous IBC studies.
IBC studies have been hampered by lack of a standard case definition (6, 11, 74). Previous studies have used the ICD-O code 8530 designation to define IBC (7, 64, 65, 75, 76). ICD-O code 8530 is a pathologic designation requiring plugging of the dermal lymphatics with tumor emboli and does not consider clinical skin changes (6). However, this conservative IBC definition is not consistent with the current AJCC staging manual guidelines and may underestimate the true incidence of IBC (6, 7, 64, 77). SEER EOD codes are based on a combined clinical and operative/pathologic assessment abstracted from the pathology report and allow for identification of IBC cases that do not have ICD-O code 8530 as the pathologic diagnosis for years prior to 2004 (7). From 2004 forward, SEER includes a variable with derived AJCC staging, which allows for the identification of IBC cases defined as the primary tumor designation of “T4d” (35, 36). The comprehensive IBC definition used in this article is similar to that used in recent IBC studies (6, 15, 33–35). Use of this definition should help ensure less misclassification of IBC cases to the non-IBC group. Finally, use of a hierarchical modeling structure allows for the calculation of more accurate SEs, thus adding confidence to any significant results found (61–63).
A few limitations should be noted when interpreting the results of this analysis. Though SEER data are broadly representative of the U.S. population, cases recorded in the SEER database are more likely to be foreign born and urban than the U.S. population as measured in the 2000 census (78). There are also a relatively small number of American Indian/Alaska Native IBC cases available for this analysis (n = 39), which is reflected in the wide 95% CIs around the IR and OR estimates for this race/ethnicity category. However, because of the U.S. SEER's large size and more than 30 years of follow-up, it is generally considered to accurately represent the overall U.S. cancer population (78).
Another limitation is the lack of individual-level SEP information in the U.S. SEER database and the inherent ecologic bias in interpreting the results of this analysis at the individual level. Any associations seen between county-level SEP and IBC occurrence in this analysis may not necessarily be observed were individual-level SEP available and used in the analysis (79). Therefore, the results of this analysis are better interpreted at the contextual level, that is, the effect being measured is that of residing in a county with a particular SEP characteristic, not that of the breast cancer cases' individual SEP. However, a study comparing census-level SEP measures with individual-level measures found they were similarly associated with individual-level health outcomes (52). Furthermore, the U.S. SEER database linked to U.S. census data provides a unique opportunity to conduct analyses stratified by race/ethnicity and county-level SEP measures on a relatively large number of IBC and non-IBC cases.
Overall breast cancer has been found to be positively associated with SEP, whereas in this analysis, IBC was associated with decreasing SEP. One explanation for these results is that women of lower SEP have less access to health care that would lead to early detection and the resultant neglected breast cancer develops into IBC. Some earlier IBC work suggested that it may be a subtype of locally advanced breast cancer rather than a distinct entity (80). However, the majority of recent studies on the epidemiology, clinical and prognostic characteristics, biology, and molecular genetics of IBC suggest it is likely a distinct biologic entity from other breast cancer (10, 12–15, 35, 81–84). Another explanation is that breast cancers occurring in women of lower SEP presenting with skin involvement are misdiagnosed as IBC (85, 86). However, there is little literature, especially in the United States, suggesting women of lower SEP are at higher risk for IBC, so it is unlikely that clinicians would be more likely to look for and diagnose (or misdiagnose) IBC disproportionately in women of lower SEP.
These results are in keeping with a growing amount of evidence showing IBC likely has a different risk factor profile than other breast cancers and is a distinct biologic entity (10, 12–15, 35, 81–84). Few studies have examined the epidemiology of rarer forms of breast cancer such as IBC, though studies that have suggested the general breast cancer risk profile may not hold for rarer breast cancer subtypes (1, 65, 87). Further investigation into the etiology of IBC is needed to elucidate risk factors for the disease that would help guide prevention and screening programs, especially studies which examine individual- and community-level associations between multiple SEP measures and IBC incidence. However, these results also indicate studies designed to investigate why the disparity of higher incidence of IBC in lower SEP groups and racial/ethnic minorities is observed, as well as potential interventions to eliminate these differences, are called for. Furthermore, because treatment is especially urgent in IBC, design and implementation of strategies that would promote earlier IBC diagnosis among lower SEP groups and racial/ethnic minorities, which traditionally experience less access to early detection programs, would likely have a direct and favorable impact on their prognosis.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Acknowledgments
The authors thank Dr. William Anderson at the Division of Cancer Epidemiology and Genetics, National Cancer Institute, for providing expertise on IBC and the SEER database, as well as the merged race-ethnicity SEER*stat variable used in this analysis.
Grant Support
This work was supported by a Rackham Merit Fellowship (J.A. Schlichting) from the University of Michigan and the Cancer Epidemiology Education in Special Populations Program of the University of Michigan (CA R25 112383). Additional funding was received from the Avon Foundation (A.S. Soliman and S.D. Merajver), the Breast Cancer Research Foundation (S.D. Merajver), and the Debbie Strange-Browne Inflammatory Breast Cancer Foundation (S.D. Merajver).
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.