Background:

Epidemiologic analyses of sarcoma are limited by the heterogeneity and rarity of the disease. Utilizing population-based surveillance data enabled us to evaluate the contribution of census tract-level socioeconomic status (CT-SES) and race/ethnicity on sarcoma incidence rates.

Methods:

We utilized the Surveillance, Epidemiology, and End Results program to evaluate associations between CT-SES and race/ethnicity on the incidence rates of sarcoma. Incidence rate ratios and 99% confidence intervals were estimated from quasi-Poisson models. All models were stratified by broad age groups (pediatric: <20 years, adult: 20–65 years, older adult: 65+ years) and adjusted for sex, age, and year of diagnosis. Within each age group, we conducted analyses stratified by somatic genome (fusion-positive and fusion-negative sarcomas) and for subtypes with >200 total cases. A P value less than 0.01 was considered statistically significant.

Results:

We included 55,415 sarcoma cases in 35 sarcoma subtype–age group combinations. Increasing CT-SES was statistically significantly associated with 11 subtype–age group combinations, primarily in the older age group strata (8 subtypes), whereas malignant peripheral nerve sheath tumors in adults were associated with decreasing CT-SES. Nearly every sarcoma subtype–age group combination displayed racial/ethnic disparities in incidence that were independent of CT-SES.

Conclusions:

We found race/ethnicity to be more frequently associated with sarcoma incidence than CT-SES. Our findings suggest that genetic variation associated with ancestry may play a stronger role than area-level SES-related factors in the etiology of sarcoma.

Impact:

These findings provide direction for future etiologic studies of sarcomas.

Sarcomas, which can be broadly categorized as soft-tissue sarcomas (STS) or bone sarcomas (BS), comprise a group of over 50 exceedingly rare and heterogeneous mesenchymal neoplasms (1). The overall incidence of STS and BS is fewer than 5 cases per 100,000 persons per year, although substantial variation in incidence by subtype, age, sex, and race has been described (2–5). Several factors are associated with sarcoma risk (6). These include genetic predisposition from both rare variants (7–9), some of which underlie cancer predisposition syndromes (10), and common variants (11–13). Other studies have observed sarcoma development to be associated with environmental [e.g., phenoxy herbicide exposure (14, 15), radiotherapy (16, 17)], infectious (18), and perinatal factors [e.g., birth weight (19), parental age (20)]. However, the etiology for the vast majority of sarcomas remains unknown, largely because the rarity and heterogeneity of the disease preclude sufficiently powered etiologic studies (6).

Evaluating socioeconomic status (SES) and racial/ethnic disparities in incidence across sarcoma subtypes may provide clues into their etiology. SES can serve as a proxy for locally varying environmental and lifestyle factors that influence health (21), whereas racial/ethnic disparities that are independent of SES may indicate that genetic variants associated with ancestry contribute to tumor development. However, prior analyses of SES and sarcoma incidence are either limited to analyses of childhood cases or to those that combined distinct subtypes into a single group (22–25), and few analyses of race-specific incidence patterns considered probable confounding by SES (2, 5). It is therefore difficult to properly attribute differences observed to environment or to genetic variants associated with ancestry.

In this study, we sought to utilize the population-based Surveillance, Epidemiology, and End Results (SEER) program to investigate the independent associations of census tract-level SES (CT-SES) and race/ethnicity with the incidence rate of individually rare subtypes of sarcoma diagnosed across the age span.

Study population

Sarcoma cases were identified from the SEER Census Tract-Level SES database, a specialized database that includes cancer cases diagnosed between 2000 and 2015 in the catchment area of 16 SEER registries (26). The Alaska Native and Louisiana tumor registries are excluded by SEER due to confidentiality concerns and the impact of Hurricane Katrina on population estimates, respectively (27). We identified a sarcoma as any microscopically confirmed, initial primary malignant tumor with an International Classification of Disease for Oncology, 3rd Edition (ICD-O-3) histology code recognized by the 2013 World Health Organization's (WHO) Classification of Tumours of Soft Tissue and Bone (28). We then categorized sarcomas into subtypes according to WHO classifications and the recommendations of an expert sarcoma oncologist (B.J. Weigel) and pathologist (P. Murugan; Supplementary Table S1). Kaposi sarcoma was excluded because it primarily arises in the setting of HIV infection (18). In addition, we classified sarcoma subtypes by their somatic genome as either fusion-positive (F+) or fusion-negative (F−) according to published data (Supplementary Table S2; refs. 29, 30). The (F+) category included subtypes with somatic genomes characterized by simple chromosomal translocations, whereas the (F−) category included subtypes with somatic genomes characterized by complex chromosomal rearrangements (31). We opted to categorize sarcoma subtypes by their presumed fusion status because evidence suggests that (F−) sarcomas are more likely than (F+) sarcomas to arise from genetic susceptibility (29, 32–34). Cases diagnosed in the year 2000 were excluded because they were not coded under the ICD-O-3 guidelines introduced in 2001 (35).

SES, race/ethnicity, and other covariates

SES was assessed using a composite index of SES measured at the level of the census tract, defined as a small geographic unit (∼4,000 people) designed to be homogeneous with respect to population characteristics, economic status, and living conditions (27). The CT-SES index was generated from a factor analysis of seven SES characteristics identified by Yost and colleagues (36). Cases were mapped to a census tract based upon their address at diagnosis (36). CT-SES indices assigned to each census tract were then categorized by SEER into quintiles of equal population size, with the first quintile (Q1) representing the lowest CT-SES and the fifth quintile (Q5) representing the highest (26).

Cases were identified as having Hispanic ethnicity by SEER based upon the North American Association for Central Cancer Registries Hispanic-Latino identification algorithm (37). Those without a Hispanic ethnicity were categorized into mutually exclusive race categories as either non-Hispanic (NH)-White, NH-Black, or American Indian/Alaskan Native or Asian Pacific Islander (AIAN/API); the AIAN and API race categories were combined due to sample size constraints. Age at diagnosis was categorized into nine age groups (0–9 years, 10–19 years, 20–29 years, 30–39 years, 40–49 years, 50–64 years, 65–74 years, 75–84 years, and 85+).

The population denominator estimates provided by SEER were stratified by single calendar year, sex, age, race, and Hispanic ethnicity. SEER allocates multiracial populations into one of the four single race categories with a probability proportional to the size of that race in the population (26).

After excluding cases with missing CT-SES (n = 914) or race/ethnicity (n = 775) variables, we analyzed 97% of the starting dataset (55,415 of 57,044 cases).

Statistical analyses

Incidence rate ratios (IRR) and 99% confidence intervals (CI) for CT-SES and race/ethnicity, adjusted for age at diagnosis, sex, and year of diagnosis, were estimated from quasi-Poisson models, which handle overdispersion by assuming the variance is a linear function of the mean (38). In all models, log-transformed population denominator estimates served as the offset term. Analyses were stratified by broad age group categories (pediatric <20 years; adult 20–65 years; older adult >65 years) to account for possible differences in sarcoma etiology. Within each age group strata, associations were assessed for sarcomas classified by fusion status [either (F+) or (F−)], as well as for any subtype with more than 200 total cases, a cutoff that was set a priori to maintain adequate power. For each sarcoma subtype, CT-SES was assessed as either a categorical or ordinal variable, with the P value from the ordinal variable used as a test for trend. For ease of interpretation, we have focused our reporting on the results obtained from analyzing CT-SES as an ordinal variable (Fig. 1). In some instances, these results appeared to obscure a nonlinear association apparent in the analysis of CT-SES as a categorical variable. We, therefore, present results from analyses of CT-SES as a categorical variable in Supplementary Table S3.

Figure 1.

Multivariable-adjusted IRR for sarcoma by CT-SES stratified by sarcoma subtype and age group, SEER 16 registries (2001–2015). The IRR was obtained by evaluating CT-SES as an ordinal variable, and error bars represent 99% CIs. Estimates are adjusted for race/ethnicity, age at diagnosis, sex, and year. The analysis of malignant chordoma in the older adult strata did not include race/ethnicity as a covariate due to insufficient sample sizes (i.e., <5 NH-Black cases). ARMS, Alveolar RMS; ES, Ewing sarcoma; UPS, undifferentiated pleomorphic sarcoma.

Figure 1.

Multivariable-adjusted IRR for sarcoma by CT-SES stratified by sarcoma subtype and age group, SEER 16 registries (2001–2015). The IRR was obtained by evaluating CT-SES as an ordinal variable, and error bars represent 99% CIs. Estimates are adjusted for race/ethnicity, age at diagnosis, sex, and year. The analysis of malignant chordoma in the older adult strata did not include race/ethnicity as a covariate due to insufficient sample sizes (i.e., <5 NH-Black cases). ARMS, Alveolar RMS; ES, Ewing sarcoma; UPS, undifferentiated pleomorphic sarcoma.

Close modal

A P value less than 0.01 was considered statistically significant. This threshold adjusted for the multiple primary hypothesis tests evaluated within each sarcoma subtype and age group strata (i.e., the ordinal CT-SES and 3 race/ethnicity comparisons) and controls the overall Type I error rate at approximately 0.05. All reported P values are two-sided. Datasets were created using SEER*Stat 8.3.6 (39), and analyses were performed in R version 3.4.4 (40).

Table 1 presents the distribution of sarcoma cases by demographic and clinical characteristics. Overall, 55,415 initial primary malignant sarcoma cases were identified (48,348 STS and 7,067 BS). Among all sarcoma cases, 64% were NH-White, 12% were NH-Black, 16% were of Hispanic ethnicity, and 9% were AIAN/API. The distribution of sarcoma cases from lowest to highest SES quintile was 17%, 18%, 20%, 22%, and 24%, respectively.

Table 1.

Distribution of sarcoma cases by demographic and clinical characteristics, SEER 16 registries (2001–2015).

Sex, %Race, %Census tract SES index, %
TotalMFNHWNHBHAIAN/APIQ1Q2Q3Q4Q5
All ages at diagnosis 
 BS 7,067 57 43 63 22 17 19 20 21 24 
 STS 48,348 51 49 64 12 15 17 18 20 22 24 
 (F−) sarcomas 17,391 59 41 67 17 16 18 20 21 24 
 (F+) sarcomas 13,662 53 47 58 14 20 17 19 20 21 23 
Pediatric (<20 years) 
 Osteosarcoma 1,569 56 44 45 15 32 20 22 20 19 19 
 ES of bone 931 61 39 65 25 15 19 23 18 25 
 ARMS 489 54 46 45 17 28 10 18 22 18 19 23 
 Embryonal RMS 749 61 39 55 14 25 18 20 19 19 24 
 F/MF tumors 518 52 48 48 20 28 19 22 16 21 22 
 Synovial sarcoma 302 51 49 52 31 18 19 20 20 23 
 ES of soft tissue 352 56 44 57 32 16 19 22 21 22 
 (F−) sarcomas 2,078 56 44 46 14 32 20 22 19 20 20 
 (F+) sarcomas 2,762 56 44 55 10 27 17 20 20 19 23 
Adult (20–65 years) 
 Osteosarcoma 1,203 56 45 54 13 23 21 18 20 19 21 
 ES of bone 450 64 36 73 19 18 20 19 20 23 
 Chondrosarcoma 1,699 53 48 72 17 15 18 20 22 25 
 Malignant chordoma 448 61 39 65 21 16 17 18 22 28 
 Other RMS 372 58 42 52 16 22 10 20 19 21 20 20 
 F/MF tumors 5,581 48 52 57 19 15 18 18 20 21 24 
 Synovial sarcoma 1,366 54 46 56 10 26 19 20 19 21 21 
 ES of soft tissue 442 52 48 62 22 12 18 21 22 20 20 
 MPNST 1,048 55 45 58 15 18 22 19 20 20 18 
 DSRCT 225 84 16 53 20 20 18 23 23 16 20 
 GIST 3,966 56 44 54 18 14 14 19 18 20 21 23 
 Leiomyosarcoma 6,036 27 73 59 16 17 17 19 19 23 22 
 Liposarcoma 4,796 60 40 63 20 16 18 20 22 25 
 Malignant vascular tumors 1,027 52 48 63 12 16 10 16 19 21 22 23 
 Epithelioid sarcoma 386 57 43 62 11 19 17 18 18 24 23 
 UPS 1,571 62 38 70 15 16 17 20 22 26 
 Unclassified sarcomas 2,218 54 46 61 13 17 19 18 20 20 22 
 (F−) sarcomas 9,114 58 42 64 10 18 17 18 20 21 24 
 (F+) sarcomas 9,284 52 48 57 15 19 18 19 20 21 23 
Older adult (65+ years) 
 Chondrosarcoma 537 53 47 83 10 13 18 18 24 27 
 Malignant chordoma 230 62 38 NR NR NR NR 12 17 16 23 32 
 Other RMS 237 43 57 74 14 17 19 24 26 
 F/MF tumors 1,455 50 50 74 11 12 19 20 23 26 
 MPNST 252 55 45 75 10 17 18 21 20 24 
 GIST 3,309 48 52 62 15 14 17 19 18 22 23 
 Leiomyosarcoma 3,220 41 59 75 11 16 19 20 21 24 
 Liposarcoma 2,721 62 38 74 12 14 18 20 23 26 
 Malignant vascular tumors 1,078 58 42 76 11 14 16 21 26 23 
 UPS 2,568 68 32 85 12 17 21 22 27 
 Unclassified sarcomas 2,064 55 45 80 14 18 20 23 25 
 (F−) sarcomas 6,199 62 38 79 13 18 21 23 26 
 (F+) sarcomas 1,616 51 49 73 11 15 19 17 24 25 
Sex, %Race, %Census tract SES index, %
TotalMFNHWNHBHAIAN/APIQ1Q2Q3Q4Q5
All ages at diagnosis 
 BS 7,067 57 43 63 22 17 19 20 21 24 
 STS 48,348 51 49 64 12 15 17 18 20 22 24 
 (F−) sarcomas 17,391 59 41 67 17 16 18 20 21 24 
 (F+) sarcomas 13,662 53 47 58 14 20 17 19 20 21 23 
Pediatric (<20 years) 
 Osteosarcoma 1,569 56 44 45 15 32 20 22 20 19 19 
 ES of bone 931 61 39 65 25 15 19 23 18 25 
 ARMS 489 54 46 45 17 28 10 18 22 18 19 23 
 Embryonal RMS 749 61 39 55 14 25 18 20 19 19 24 
 F/MF tumors 518 52 48 48 20 28 19 22 16 21 22 
 Synovial sarcoma 302 51 49 52 31 18 19 20 20 23 
 ES of soft tissue 352 56 44 57 32 16 19 22 21 22 
 (F−) sarcomas 2,078 56 44 46 14 32 20 22 19 20 20 
 (F+) sarcomas 2,762 56 44 55 10 27 17 20 20 19 23 
Adult (20–65 years) 
 Osteosarcoma 1,203 56 45 54 13 23 21 18 20 19 21 
 ES of bone 450 64 36 73 19 18 20 19 20 23 
 Chondrosarcoma 1,699 53 48 72 17 15 18 20 22 25 
 Malignant chordoma 448 61 39 65 21 16 17 18 22 28 
 Other RMS 372 58 42 52 16 22 10 20 19 21 20 20 
 F/MF tumors 5,581 48 52 57 19 15 18 18 20 21 24 
 Synovial sarcoma 1,366 54 46 56 10 26 19 20 19 21 21 
 ES of soft tissue 442 52 48 62 22 12 18 21 22 20 20 
 MPNST 1,048 55 45 58 15 18 22 19 20 20 18 
 DSRCT 225 84 16 53 20 20 18 23 23 16 20 
 GIST 3,966 56 44 54 18 14 14 19 18 20 21 23 
 Leiomyosarcoma 6,036 27 73 59 16 17 17 19 19 23 22 
 Liposarcoma 4,796 60 40 63 20 16 18 20 22 25 
 Malignant vascular tumors 1,027 52 48 63 12 16 10 16 19 21 22 23 
 Epithelioid sarcoma 386 57 43 62 11 19 17 18 18 24 23 
 UPS 1,571 62 38 70 15 16 17 20 22 26 
 Unclassified sarcomas 2,218 54 46 61 13 17 19 18 20 20 22 
 (F−) sarcomas 9,114 58 42 64 10 18 17 18 20 21 24 
 (F+) sarcomas 9,284 52 48 57 15 19 18 19 20 21 23 
Older adult (65+ years) 
 Chondrosarcoma 537 53 47 83 10 13 18 18 24 27 
 Malignant chordoma 230 62 38 NR NR NR NR 12 17 16 23 32 
 Other RMS 237 43 57 74 14 17 19 24 26 
 F/MF tumors 1,455 50 50 74 11 12 19 20 23 26 
 MPNST 252 55 45 75 10 17 18 21 20 24 
 GIST 3,309 48 52 62 15 14 17 19 18 22 23 
 Leiomyosarcoma 3,220 41 59 75 11 16 19 20 21 24 
 Liposarcoma 2,721 62 38 74 12 14 18 20 23 26 
 Malignant vascular tumors 1,078 58 42 76 11 14 16 21 26 23 
 UPS 2,568 68 32 85 12 17 21 22 27 
 Unclassified sarcomas 2,064 55 45 80 14 18 20 23 25 
 (F−) sarcomas 6,199 62 38 79 13 18 21 23 26 
 (F+) sarcomas 1,616 51 49 73 11 15 19 17 24 25 

Note: Q1 corresponds to lowest small-area SES quintile; Q5 corresponds to highest.

Abbreviations: ARMS, alveolar RMS; B, Black; ES, Ewing sarcoma; F, Female; H, Hispanic; M, Male; NH, Non-Hispanic; NR, not reported due to insufficient sample sizes (<5 cases in a cell); UPS, undifferentiated pleomorphic sarcoma; W, White.

SES and sarcoma incidence

Figure 1 shows the IRRs for CT-SES, stratified by sarcoma category and age group strata (values are given in Supplementary Table S3). In the pediatric age group, a positive trend in the incidence of fibroblastic/myofibroblastic (F/MF) tumors was observed across CT-SES quintiles (IRR, 1.09; 99% CI, 1.00–1.18; P value = 0.007). However, the results from CT-SES evaluated as an ordinal variable may have obscured a potential nonlinear association in the incidence of F/MF tumors; the IRR comparing CT-SES Q2 versus Q1 (IRR, 1.37; 99% CI, 0.97–1.94) appeared higher than the IRR comparing CT-SES Q3 versus Q1 (IRR, 1.04; 99% CI, 0.71–1.52; Supplementary Table S3). The incidence rates for other sarcoma subtypes evaluated in the pediatric age group were not associated with CT-SES.

In the adult age group strata, F/MF tumors and liposarcoma showed evidence of having a higher incidence in higher CT-SES quintiles (P value < 0.001 for both), whereas malignant peripheral nerve sheath tumors (MPNST) showed the opposite trend with every increase in CT-SES quintile associated with a 7% lower incidence rate (99% CI, 0.87–0.99; P value = 0.002). The other subtypes evaluated in adults were not associated with CT-SES, including all of the BS subtypes evaluated.

In the older adult age group strata, most subtypes (8 of 11 evaluated) showed a trend of increased incidence with increasing CT-SES quintiles that reached statistical significance, including those subtypes that were not significantly associated with CT-SES in adults. Other rhabdomyosarcoma (RMS), MPNST, and malignant vascular tumors were the only subtypes evaluated in older adults that were not associated with CT-SES.

In all age groups, we observed an increasing incidence of (F+) sarcomas with increasing CT-SES (P value < 0.001 in all age groups). A positive trend was also observed for (F−) sarcomas in the older adult aged strata (P value < 0.001), but the trends in the adult and pediatric age groups were not statistically significant (P values = 0.02 and 0.08, respectively).

Race/ethnicity and sarcoma incidence

Figure 2 shows the IRRs for race and ethnicity, stratified by sarcoma category and age group strata (values are given in Supplementary Table S4). In the pediatric age group, the IRRs observed in NH-Black children compared with NH-White children differed according to sarcoma subtype, with statistically significant IRRs observed for five of the seven subtypes evaluated. Heterogeneous results were also observed among children of Hispanic ethnicity, although we found fewer significant results (three of the seven subtypes evaluated) and smaller effect sizes than those comparing incidence rates in NH-Black to NH-White children. Among AIAN/API children, the IRRs that reached statistical significance (three of seven subtypes) were all less than 1, with some subtypes showing incidence rates that were nearly 50% lower in AIAN/API compared with NH-White children (e.g., Ewing sarcoma of bone, embryonal RMS, and F/MF tumors).

Figure 2.

Multivariable-adjusted incidence rate ratios for sarcoma by race/ethnicity stratified by sarcoma subtype and age group, SEER 16 registries (2001–2015). Estimates are adjusted for CT-SES (ordinal), age at diagnosis, sex, and year. Error bars represent 99% CIs. Multivariable analysis of malignant chordoma in older adult strata did not include race/ethnicity as a covariate due to insufficient sample sizes (i.e., <5 NHB cases). ARMS, alveolar RMS; ES, Ewing sarcoma; UPS, undifferentiated pleomorphic sarcoma. NH-White is the reference category.

Figure 2.

Multivariable-adjusted incidence rate ratios for sarcoma by race/ethnicity stratified by sarcoma subtype and age group, SEER 16 registries (2001–2015). Estimates are adjusted for CT-SES (ordinal), age at diagnosis, sex, and year. Error bars represent 99% CIs. Multivariable analysis of malignant chordoma in older adult strata did not include race/ethnicity as a covariate due to insufficient sample sizes (i.e., <5 NHB cases). ARMS, alveolar RMS; ES, Ewing sarcoma; UPS, undifferentiated pleomorphic sarcoma. NH-White is the reference category.

Close modal

In adults, several STS subtypes had incidence rates in NH-Blacks that were substantially higher (IRR > 1.60) than those observed in NH-Whites, including other RMS, F/MF tumors, desmoplastic small round cell tumor (DSRCT), gastrointestinal stromal tumor (GIST), and leiomyosarcoma. The incidence rates for several subtypes were also significantly different in adults of Hispanic ethnicity compared with NH-White adults (6 of 17 subtypes evaluated), although we again observed weaker associations than those comparing the incidence rates in NH-Black adults with NH-White adults. Among adults identified as AIAN/API, we observed incidence rates that were significantly lower than those observed in NH-White adults for several sarcoma subtypes (9 of 17 subtypes evaluated). A notable exception is GIST, the only subtype evaluated to show a significantly higher incidence rate in AIAN/API compared with NH-White adults (IRR, 1.48; 99% CI, 1.31–1.68).

Several of the results observed in older adults (>65 years at diagnosis) were dissimilar to those in adults. For example, the incidence of undifferentiated pleomorphic sarcoma was lower in older adults identified as NH-Black (IRR, 0.48; 99% CI, 0.35–0.64) and Hispanic (IRR, 0.65; 99% CI, 0.52–0.8) compared with NH-White older adults, but showed no difference in incidence across these racial/ethnic categories in adults. Conversely, results among AIAN/API older adults largely concurred with those observed in younger aged individuals. Incidence rates in AIAN/API older adults that reached statistical significance (5 of 10 subtypes evaluated) were primarily lower than those observed for NH-Whites, except for GIST, which had an incidence rate that was 78% higher in AIAN/API compared with NH-White older adults (99% CI, 1.56–2.04).

In all age groups, the incidence of (F−) and (F+) sarcomas in NH-Black or Hispanic individuals compared with NH-White individuals were heterogeneous. Conversely, the comparative incidence rates in AIAN/API individuals that reached statistical significance consistently showed lower incidence rates for both (F−) and (F+) sarcomas compared with NH-Whites in all age groups.

In the current study, we found race/ethnicity to be more often associated with sarcoma incidence than CT-SES. Specifically, of the 35 subtype–age group combinations evaluated for an association with CT-SES, 12 were statistically significant, and most occurred in the older age group strata (8 subtypes), with all, except MPNST in adults, having a positive trend in incidence across CT-SES levels. Conversely, nearly every subtype–age group combination displayed racial/ethnic differences in incidence rates that were independent of CT-SES. Notably strong associations included an over two-fold increase in the incidence of GIST among NH-Black compared with NH-White adults. Overall, our findings suggest that genetic variation associated with ancestry may play a stronger role than area-level SES-related factors in the etiology of STS and BS subtypes.

Prior results regarding the association between SES and sarcoma incidence are limited. A large SEER analysis (n = 1,576 STS and 1,113 BS) found that children ages 0 to 19 years living in higher-income counties of the United States had a higher incidence of STS but not BS (22), whereas other case–control studies using maternal or household education as a proxy for SES have found mostly null associations (24, 41). In prior analyses, however, individual subtypes of sarcoma were not separately evaluated. With regards to race/ethnicity, several race-specific incidence patterns have been reported, including the dramatic patterns reported herein among minority populations diagnosed with Ewing sarcoma (42–44), GIST (45, 46), DSRCT (47), and leiomyosarcoma (2). Prior analyses, however, have failed to comprehensively account for likely confounding by SES. The results of our analysis build upon prior work by demonstrating the independent associations of CT-SES and race/ethnicity on the incidence of subtypes of sarcoma throughout the age span.

Area-level SES is speculated to represent complex interactions between environmental, social, and cultural influences on health; therefore, we cannot determine which specific aspect may have driven our observed associations. Several possible risk factors for sarcoma have been identified in prior studies (6), including those we presume to be correlated with both a high SES [e.g., older parental age (20)] and low SES [e.g., residential proximity to industrial facilities (48, 49)], but few of those nominated to date have been definitively established as risk factors (6). Nonetheless, the variation in incidence across CT-SES supports the possibility that exogenous or lifestyle factors contribute to the development of at least some subtypes of sarcoma. Moreover, that results were heterogeneous across subtypes and age groups highlights the likelihood of etiologic differences in the tumors evaluated (i.e., environmental vs. genetic factors).

For example, it is hypothesized that individuals diagnosed with (F−) sarcomas more often harbor rare, pathogenic germline genetic variants than those with (F +) sarcomas, as evidenced by the increased frequency with which (F−) sarcoma cases are diagnosed in cancer predisposition syndromes and as secondary cancers (29, 32–34). That the association with CT-SES appeared weaker in (F−) sarcomas compared with (F+) sarcomas in the pediatric and adult age groups supports this hypothesis, as tumors that are unrelated to SES are perhaps more likely influenced by germline genetics. In the older adult stratum, however, we note dissimilar findings; CT-SES was associated with both classes of tumors and nearly every subtype evaluated (8 of 11). We speculate that CT-SES is unrelated to (F−) sarcomas and other subtypes in younger patients due to the increased frequency with which germline genetic predisposition contributes to sarcoma development early in life (7, 50).

In the pediatric and adult age groups, the incidence of several sarcoma subtypes was unrelated to CT-SES but demonstrated dramatic race-specific incidence patterns. We speculate that these findings may be driven in part by differences in the frequency of genetic variants across ancestries. For example, the binding affinity for EWS-FLI1, the fusion oncoprotein characteristic of Ewing sarcoma (30), depends upon the length of GGAA microsatellite repeats (51), which is polymorphic between populations of primarily European and African ancestries (52). This is consistent with the substantially higher incidence of Ewing sarcoma in NH-White individuals compared with NH-Black individuals reported here and elsewhere (42–44). Whether genetic variants that are associated with ancestry underlie associations with other subtypes is unclear. We note, however, that racial and ethnic categories are imperfect measures of genomic ancestry (53), and that other social, cultural, or lifestyle factors which are unaccounted for by area-level SES may have confounded our observations. The use of admixture mapping (54, 55) may help to resolve the potential mechanisms underlying our observations.

To our knowledge, ours is the largest study to investigate the independent contributions of SES and race/ethnicity on sarcoma incidence. Utilizing the SEER program enabled us to evaluate associations among individually rare subtypes of sarcoma within a racially and ethnically diverse population with reasonably adequate statistical power. In addition, the use of a comprehensive and widely available SES variable measured at the census tract-level allowed us to control analyses for possible environmental and lifestyle factors, and facilitated informed speculations on the relative contribution of environment versus genetics in the development of these rare groups of cancers.

Nevertheless, we acknowledge several limitations. Although we included only microscopically confirmed sarcomas, there may still be misclassification of tumor subsets represented in our study. Accurately diagnosing sarcomas remains a clinical challenge (56, 57), and recording their diagnostic codes into tumor registries is subject to error (58, 59). It is possible that misclassification may be greater among individuals with lower SES, as they may have reduced access to specialty sarcoma centers or clinical trials. Furthermore, we classified subtypes as fusion-positive or -negative according to the predominant somatic mutations in each sarcoma based on prior literature (29, 30), but tumors categorized as (F +) may have lacked a fusion gene, and some (F−) sarcoma subtypes may harbor fusion genes yet to be discovered (60). We also note that area-level SES does not represent individual-level SES, but rather captures the SES characteristics of one's environment that may also influence health (61). SEER includes individual-level insurance status for cases but does not include it in calculations of population denominator estimates. This precluded us from using insurance status as a measure of SES in our analysis of incidence rates. Other individual-level SES characteristics (e.g., income or educational attainment) are not publicly available from SEER. Furthermore, the CT-SES measure was assigned to a case based upon their address at diagnosis. This may not account for potentially long latency periods between exposure and diagnosis, during which an individual may have moved between areas of high or low SES. Furthermore, we note that our analysis could not account for environmental exposures that may be associated with sarcoma development, but which are unrelated to CT-SES. Finally, given this is a broad study of rare cancers across the age spectrum, a large number of statistical tests were performed. Some significant findings may therefore be due to chance.

Through this large and comprehensive analysis, we observed sarcoma incidence rates to be more often associated with race/ethnicity than CT-SES. Although future etiologic analyses—particularly genetic studies—are needed to confirm our findings, our results suggest that genetics play a greater role than environmental factors on the etiology of several subtypes of sarcoma.

B.J. Diessner reports grants from National Institute of Arthritis and Musculoskeletal and Skin Diseases at the National Institutes of Health (T32 AR05938 - Musculoskeletal Training Grant) during the conduct of the study. J.N. Poynter reports grants from NIH during the conduct of the study. No potential conflicts of interest were disclosed by the other authors.

B.J. Diessner: Conceptualization, data curation, software, formal analysis, visualization, methodology, writing–original draft, project administration. B.J. Weigel: Conceptualization, supervision, methodology, writing–review and editing. P. Murugan: Conceptualization, methodology, writing–review and editing. L. Zhang: Conceptualization, supervision, methodology, writing–review and editing. J.N. Poynter: Conceptualization, supervision, methodology, writing–review and editing. L.G. Spector: Conceptualization, supervision, methodology, writing–review and editing.

This work was supported by the National Institute of Arthritis and Musculoskeletal and Skin Diseases at the National Institutes of Health “Musculoskeletal Training Grant” (T32 AR05938; to B.J. Diessner).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Helman
LJ
,
Meltzer
P
. 
Mechanisms of sarcoma development
.
Nat Rev Cancer
2003
;
3
:
685
94
.
2.
Toro
JR
,
Travis
LB
,
Hongyu
JW
,
Zhu
K
,
Fletcher
CDM
,
Devesa
SS
. 
Incidence patterns of soft tissue sarcomas, regardless of primary site, in the Surveillance, Epidemiology and End Results program, 1978–2001: an analysis of 26,758 cases
.
Int J Cancer
2006
;
119
:
2922
30
.
3.
Ng
VY
,
Scharschmidt
TJ
,
Mayerson
JL
,
Fisher
JL
. 
Incidence and survival in sarcoma in the United States: a focus on musculoskeletal lesions
.
Anticancer Res
2013
;
33
:
2597
604
.
4.
Ferrari
A
,
Sultan
I
,
Huang
TT
,
Rodriguez-Galindo
C
,
Shehadeh
A
,
Meazza
C
, et al
Soft tissue sarcoma across the age spectrum: a population-based study from the Surveillance Epidemiology and End Results database
.
Pediatr Blood Cancer
2011
;
57
:
943
9
.
5.
Rouhani
P
,
Fletcher
CDM
,
Devesa
SS
,
Toro
JR
. 
Cutaneous soft tissue sarcoma incidence patterns in the U.S.: an analysis of 12,114 cases
.
Cancer
2008
;
113
:
616
27
.
6.
Burningham
Z
,
Hashibe
M
,
Spector
L
,
Schiffman
JD
. 
The epidemiology of sarcoma
.
Embase Nature Rev Rheumatol
2011
;
2
:
1
16
.
7.
Ballinger
ML
,
Goode
DL
,
Ray-Coquard
I
,
James
PA
,
Mitchell
G
,
Niedermayr
E
, et al
Monogenic and polygenic determinants of sarcoma risk: an international genetic study
.
Lancet Oncol
2016
;
17
:
1261
71
.
8.
Chan
SH
,
Lim
WK
,
Ishak
NDB
,
Li
ST
,
Goh
WL
,
Tan
GS
, et al
Germline mutations in cancer predisposition genes are frequent in sporadic sarcomas
.
Sci Rep
2017
;
7
:
1
8
.
9.
Zhang
J
,
Walsh
MF
,
Wu
G
,
Edmonson
MN
,
Gruber
TA
,
Easton
J
, et al
Germline mutations in predisposition genes in pediatric cancer
.
N Engl J Med
2015
;
373
:
2336
46
.
10.
Farid
M
,
Ngeow
J
. 
Sarcomas associated with genetic cancer predisposition syndromes: a review
.
Oncologist
2016
;
21
:
1002
13
.
11.
Savage
SA
,
Mirabello
L
,
Wang
Z
,
Gastier-Foster
JM
,
Gorlick
R
,
Khanna
C
, et al
Genome-wide association study identifies two susceptibility loci for osteosarcoma
.
Nat Genet
2013
;
45
:
799
803
.
12.
Postel-Vinay
S
,
Véron
AS
,
Tirode
F
,
Pierron
G
,
Reynaud
S
,
Kovar
H
, et al
Common variants near TARDBP and EGR2 are associated with susceptibility to Ewing sarcoma
.
Nat Genet
2012
;
44
:
323
9
.
13.
Machiela
MJ
,
Grünewald
TGP
,
Surdez
D
,
Reynaud
S
,
Mirabeau
O
,
Karlins
E
, et al
Genome-wide association study identifies multiple new loci associated with Ewing sarcoma susceptibility
.
Nat Commun
2018
;
9
:
1
8
.
14.
Hardell
L
,
Sandström
A
. 
Case-control study: soft-tissue sarcomas and exposure to phenoxyacetic acids or chlorophenols
.
Br J Cancer
1979
;
39
:
711
7
.
15.
Eriksson
M
,
Hardell
L
,
Moller
T
. 
Soft-tissue sarcomas and exposure to chemical substances: a case-referent study
.
Br J Ind Med
1981
;
38
:
27
33
.
16.
Berrington de Gonzalez
A
,
Kutsenko
A
,
Rajaraman
P
. 
Sarcoma risk after radiation exposure
.
Clin Sarcoma Res
2012
;
2
:
1
8
.
17.
Virtanen
A
,
Pukkala
E
,
Auvinen
A
. 
Incidence of bone and soft tissue sarcoma after radiotherapy: a cohort study of 295,712 Finnish cancer patients
.
Int J Cancer
2006
;
118
:
1017
21
.
18.
Renwick
N
,
Halaby
T
,
Weverling
GJ
,
Dukers
NH
,
Simpson
GR
,
Coutinho
RA
, et al
Seroconversion for human herpesvirus 8 during HIV infection is highly predictive of Kaposi's sarcoma
.
Aids
1998
;
12
:
2481
8
.
19.
Ognjanovic
S
,
Carozza
SE
,
Chow
EJ
,
Fox
EE
,
Horel
S
,
McLaughlin
CC
, et al
Birth characteristics and the risk of childhood rhabdomyosarcoma based on histological subtype
.
Br J Cancer
2010
;
102
:
227
31
.
20.
Johnson
KJ
,
Carozza
SE
,
Chow
EJ
,
Fox
EE
,
Horel
S
,
McLaughlin
CC
, et al
Parental age and risk of childhood cancer: a pooled analysis
.
Epidemiology
2009
;
20
:
475
83
.
21.
Seeman
TE
,
Crimmins
E
. 
Social environment effects on health and aging integrating epidemiologic and demographic approaches and perspectives
.
Ann N Y Acad Sci
2001
;
954
:
88
117
.
22.
Pan
IJ
,
Daniels
JL
,
Zhu
K
. 
Poverty and childhood cancer incidence in the United States
.
Cancer Causes Control
2010
;
21
:
1139
45
.
23.
Alston
RD
,
Rowan
S
,
Eden
TOB
,
Moran
A
,
Birch
JM
. 
Cancer incidence patterns by region and socioeconomic deprivation in teenagers and young adults in England
.
Br J Cancer
2007
;
96
:
1760
6
.
24.
Kehm
RD
,
Spector
LG
,
Poynter
JN
,
Vock
DM
,
Osypuk
TL
. 
Socioeconomic status and childhood cancer incidence: a population-based multilevel analysis
.
Am J Epidemiol
2018
;
187
:
982
91
.
25.
Hampras
SS
,
Moysich
KB
,
Marimuthu
SP
,
Ravid
V
,
Jayaprakash
V
. 
Socioeconomic factors and the risk for sarcoma
.
Eur J Cancer Prev
2014
;
23
:
560
5
.
26.
Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat database: census tract-level SES and rurality database (2000–2015), National Cancer Institute, DCCPS, Surveillance Research Program, based on the November 2018 submission
.
Available from
: www.seer.cancer.gov.
27.
Yu
M
,
Tatalovich
Z
,
Gibson
JT
,
Cronin
KA
. 
Using a composite index of socioeconomic status to investigate health disparities while protecting the confidentiality of cancer registry data
.
Cancer Causes Control
2014
;
25
:
81
92
.
28.
Fletcher
CDM
,
Bridge
JA
,
Beidge
JA
,
Hogendoom PCW
MF
.
WHO classification of tumours of soft tissue and bone
.
Lyon (France)
:
IARC
; 
2013
.
29.
Lupo
PJ
,
Brown
AL
,
Hettmer
S
. 
Second malignancy risk among pediatric, adolescent, and young adult survivors of fusion-positive and fusion-negative sarcomas: results from the SEER database, 1992 through 2012
.
Cancer
2016
;
122
:
3492
500
.
30.
Perry
JA
,
Seong
BKA
,
Stegmaier
K
. 
Biology and therapy of dominant fusion oncoproteins involving transcription factor and chromatin regulators in sarcomas
.
Annu Rev Cancer Biol
2019
;
3
:
299
321
.
31.
Bovée
JVMG
,
Hogendoorn
PCW
. 
Molecular pathology of sarcomas: concepts and clinical implications
.
Virchows Arch
2010
;
456
:
193
9
.
32.
Lupo
PJ
,
Danysh
HE
,
Plon
SE
,
Curtin
K
,
Malkin
D
,
Hettmer
S
, et al
Family history of cancer and childhood rhabdomyosarcoma: a report from the Children's Oncology Group and the Utah Population Database
.
Cancer Med
2015
;
4
:
781
90
.
33.
Archer
NM
,
Amorim
RP
,
Naves
R
,
Hettmer
S
,
Diller
LR
,
Ribeiro
KB
, et al
An increased risk of second malignant neoplasms after rhabdomyosarcoma: population-based evidence for a cancer predisposition syndrome?
Pediatr Blood Cancer
2016
;
63
:
196
201
.
34.
Pinto
N
,
Hawkins
DS
. 
Second malignant neoplasms in rhabdomyosarcoma: victims of our own success or an underlying genetic predisposition syndrome?
Pediatr Blood Cancer
2016
;
63
:
189
90
.
35.
NAACCR ICD-O-3 Implementation Work Group.
North American Association of Central Cancer Registries guidelines for ICD-O-3 implementation
.
Springfield (IL)
:
North American Association of Central Cancer Registries
; 
2000
.
Available from:
https://www.naaccr.org/wp-content/uploads/2016/11/ICDO3Final-Implementation-Guide.pdf.
36.
Yost
K
,
Perkins
C
,
Cohen
R
,
Morris
C
,
Wright
W
. 
Socioeconomic status and breast cancer incidence in California for different race/ethnic groups
.
Cancer Causes Control
2001
;
12
:
703
11
.
37.
NAACCR Race and Ethnicity Work Group
.
NAACCR guideline for enhancing Hispanic/Latino identification: revised NAACCR Hispanic/Latino identification algorithm [NHIA v2.2.1]
.
Springfield (IL)
:
North American Association of Central Cancer Registries
; 
2011
.
38.
Ver Hoef
JM
,
Boveng
PL
. 
Quasi-Poisson vs. negative binomial regression: how should we model overdispersed count data?
Ecology
2007
;
88
:
2766
72
.
39.
Surveillance Research Program, National Cancer Institute. SEER*Stat Software version 8.3.6
.
Available from
: https://seer.cancer.gov/seerstat.
40.
R core team
.
R: a language and environment for statistical computing
.
Vienna (Austria)
:
R Foundation for Statistical Computing
; 
2016
. Available from: https://www.R-project.org.
41.
Carozza
SE
,
Puumala
SE
,
Chow
EJ
,
Fox
EE
,
Horel
S
,
Johnson
KJ
, et al
Parental educational attainment as an indicator of socioeconomic status and risk of childhood cancers
.
Br J Cancer
2010
;
103
:
136
42
.
42.
Jawad
MU
,
Cheung
MC
,
Min
ES
,
Schneiderbauer
MM
,
Koniaris
LG
,
Scully
SP
. 
Ewing sarcoma demonstrates racial disparities in incidence-related and sex-related differences in outcome: an analysis of 1631 cases from the SEER database, 1973–2005
.
Cancer
2009
;
115
:
3526
36
.
43.
Polednak
AP
. 
Primary bone cancer incidence in black and white residents of New York state
.
Cancer
1985
;
55
:
2883
8
.
44.
Parkin
DM
,
Stiller
CA
,
Draper
GJ
,
Bieber
CA
. 
The international incidence of childhood cancer
.
Int J Cancer
1988
;
42
:
511
20
.
45.
Søreide
K
,
Sandvik
OM
,
Søreide
JA
,
Giljaca
V
,
Jureckova
A
,
Bulusu
VR
. 
Global epidemiology of gastrointestinal stromal tumours (GIST): a systematic review of population-based cohort studies
.
Cancer Epidemiol
2016
;
40
:
39
46
.
46.
Ma
GL
,
Murphy
JD
,
Martinez
ME
,
Sicklick
JK
. 
Epidemiology of gastrointestinal stromal tumors in the era of histology codes: results of a population-based study
.
Cancer Epidemiol Biomarkers Prev
2015
;
24
:
298
302
.
47.
Worch
J
,
Cyrus
J
,
Goldsby
R
,
Matthay
KK
,
Neuhaus
J
,
DuBois
SG
. 
Racial differences in the incidence of mesenchymal tumors associated with EWSR1 translocation
.
Cancer Epidemiol Biomarkers Prev
2011
;
20
:
449
53
.
48.
Zambon
P
,
Ricci
P
,
Bovo
E
,
Casula
A
,
Gattolin
M
,
Fiore
AR
, et al
Sarcoma risk and dioxin emissions from incinerators and industrial plants: a population-based case-control study (Italy)
.
Environ Heal A Glob Access Sci Source
2007
;
6
:
1
10
.
49.
García-Pérez
J
,
Morales-Piga
A
,
Gómez-Barroso
D
,
Tamayo-Uria
I
,
Pardo Romaguera
E
,
López-Abente
G
, et al
Risk of bone tumors in children and residential proximity to industrial and urban areas: new findings from a case-control study
.
Sci Total Environ
2017
;
579
:
1333
42
.
50.
Mirabello
L
,
Yeager
M
,
Mai
PL
,
Gastier-Foster
JM
,
Gorlick
R
,
Khanna
C
, et al
Germline TP53 variants and susceptibility to osteosarcoma
.
J Natl Cancer Inst
2015
;
107
:
1
4
.
51.
Gangwal
K
,
Sankar
S
,
Hollenhorst
PC
,
Kinsey
M
,
Haroldsen
SC
,
Shah
AA
, et al
Microsatellites as EWS/FLI response elements in Ewing's sarcoma
.
Proc Natl Acad Sci U S A
2008
;
105
:
10149
54
.
52.
Beck
R
,
Monument
MJ
,
Watkins
WS
,
Smith
R
,
Boucher
KM
,
Schiffman
JD
, et al
EWS/FLI-responsive GGAA microsatellites exhibit polymorphic differences between European and African populations
.
Cancer Genet
2012
;
205
:
304
12
.
53.
Mersha
TB
,
Abebe
T
. 
Self-reported race/ethnicity in the age of genomic research: its potential impact on understanding health disparities
.
Hum Genomics
2015
;
9
:
1
15
.
54.
Winkler
CA
,
Nelson
GW
,
Smith
MW
. 
Admixture mapping comes of age
.
Annu Rev Genomics Hum Genet
2010
;
11
:
65
89
.
55.
Shriner
D
. 
Overview of admixture mapping
.
Curr Protoc Hum Genet
2017
;
94
:
1
23
.
56.
Ray-coquard
I
,
Montesco
MC
,
Coindre
JM
,
Dei tos
AP
,
Lurkin
A
,
Ranchère-vince
D
, et al
Sarcoma: concordance between initial diagnosis and centralized expert review in a population-based study within three European regions
.
Ann Oncol
2012
;
23
:
2442
9
.
57.
Thway
K
,
Fisher
C
. 
Histopathological diagnostic discrepancies in soft tissue tumours referred to a specialist centre
.
Sarcoma
2009
;
125
:
2926
34
.
58.
Lyu
HG
,
Haider
AH
,
Landman
AB
,
Raut
CP
. 
The opportunities and shortcomings of using big data and national databases for sarcoma research
.
Cancer
2019
;
125
:
2926
34
.
59.
Lyu
HG
,
Stein
LA
,
Saadat L
V
,
Phicil
SN
,
Haider
A
,
Raut
CP
. 
Assessment of the accuracy of disease coding among patients diagnosed with sarcoma
.
JAMA Oncol
2018
;
4
:
1293
5
.
60.
Mertens
F
,
Antonescu
CR
,
Mitelman
F
. 
Gene fusions in soft tissue tumors: Recurrent and overlapping pathogenetic themes
.
Genes Chromosom Cancer
2016
;
55
:
291
310
.
61.
Krieger
N
,
Williams
DR
,
Moss
NE
. 
Measuring social class in US public health research: concepts, methodologies, and guidelines
.
Annu Rev Public Health
1997
;
18
:
341
78
.