Abstract
Purpose: Ionizing radiation is a well-established carcinogen in rodent models and a risk factor associated with human cancer. We developed a mouse model that captures radiation effects on host biology by transplanting unirradiated Trp53-null mammary tissue to sham or irradiated hosts. Gene expression profiles of tumors that arose in irradiated mice are distinct from those that arose in naïve hosts. We asked whether expression metaprofiles could discern radiation-preceded human cancer or be informative in sporadic breast cancers.
Experimental Design: Affymetrix microarray gene expression data from 56 Trp53-null mammary tumors were used to define gene profiles and a centroid that discriminates tumors arising in irradiated hosts. These were applied to publicly available human cancer datasets.
Results: Host irradiation induces a metaprofile consisting of gene modules representing stem cells, cell motility, macrophages, and autophagy. Human orthologs of the host irradiation metaprofile discriminated between radiation-preceded and sporadic human thyroid cancers. An irradiated host centroid was strongly associated with estrogen receptor–negative breast cancer. When applied to sporadic human breast cancers, the irradiated host metaprofile strongly associated with basal-like and claudin-low breast cancer intrinsic subtypes. Comparing host irradiation in the context of TGF-β levels showed that inflammation was robustly associated with claudin-low tumors.
Conclusions: Detection of radiation-preceded human cancer by the irradiated host metaprofile raises possibilities of assessing human cancer etiology. Moreover, the association of the irradiated host metaprofiles with estrogen receptor–negative status and claudin-low subtype suggests that host processes similar to those induced by radiation underlie sporadic cancers. Clin Cancer Res; 19(6); 1353–62. ©2013 AACR.
Radiation has long been established as a risk factor for breast cancer, with recent evidence suggesting that it promotes estrogen receptor (ER)-negative cancers in mice and humans. Human breast cancer is comprised of at least 6 transcriptional subtypes, each with distinct molecular programs. We report that the human orthologs of a mouse mammary tumor transcriptional program elicited by prior host irradiation can be used to segregate radiation-preceded cancers. Moreover, this program is associated with ER-negative human breast cancer subtypes, suggesting that sporadic cancer is promoted by similar biologic processes.
Introduction
Ionizing radiation is one of very few environmental exposures unequivocally associated with increased cancer risk in humans (1), particularly in thyroid and breast cancer following exposure at a young age (2, 3). Breast cancer increases in women who survived the atomic bombs (4), received diagnostic radiation for tuberculosis (5), or were treated with radiation for benign breast disease (6). Twenty percent of women treated with radiation for Hodgkin lymphoma develop breast cancer before the age of 40 years (7). Breast cancer is a complex disease that consists of at least 6 intrinsic subtypes identified by gene expression profiling (8–10) that can be prognostic (11, 12). Recent studies suggest that prior exposure to radiation promotes aggressive, estrogen receptor (ER)-negative tumors (13–15).
Radiation is a complete carcinogen able to both initiate and promote cancer. Initiation, thought to be due to oncogenic mutations from misrepaired double DNA breaks, is widely believed to be the critical event for radiation carcinogenesis (16), however host systemic and stromal responses can also contribute to the carcinogenic potential of radiation (17–20). To test whether host biology contributes to radiation carcinogenesis, we established a radiation chimera model that separates radiation effects on the host from those on the target epithelium (21). In this model, the mammary gland is cleared of endogenous epithelium in host mice, which are subsequently irradiated and then transplanted orthotopically with nonmalignant Trp53-null mammary tissue (22), which has many similarities to human breast cancer, including progression from preneoplastic lesions to ductal carcinoma in situ to tumors with diverse histopathologies (23, 24). Even though host irradiation occurred many months before tumor development and the mammary epithelium was never irradiated, the course of Trp53-null carcinogenesis is significantly altered by host irradiation as evidenced by decreased tumor latency and more rapid tumor growth rate. Unexpectedly, host irradiation also increased the proportion of ER-negative tumors. Expression profiles of Trp53-null tumors arising in an irradiated host compared with those arising in nonirradiated hosts were also distinct, suggesting that the biology elicited by radiation has long-lasting effects on tumor development (22). Network analysis of the irradiated host signature implicated 2 critical factors, TGF-β and mammary stem cells that were validated with additional experiments, suggesting that the irradiated microenvironment promotes breast cancer by at least 2 distinct mechanisms (22).
Detailed understanding of the basis for cancer characteristics and clinical behaviors in irradiated populations could improve risk predictions and may uncover means to reduce risk. We speculated that expression metaprofiles might discern radiation-preceded human cancer and be informative in sporadic breast cancers. We used bioinformatics to evaluate the value of murine-irradiated host signatures for classifying radiation-preceded human cancers and its associations with sporadic breast cancer.
Materials and Methods
Data from Affymetrix mouse Genechip MG-430 2.0 arrays from our prior study (22), GSE18216 NCBI Gene Expression Omnibus database accession number, were used, in addition to 8 additional samples (merged under GSE42742). Background was normalized using robust multichip average algorithm (25), R software v2.10.1, with widgets specific to the Affymetrix platform. Unsupervised hierarchical clustering using Gene Cluster v3.0 software was visualized using Java TreeView v1.1.4r3 software. Data was mean-centered; gene clustering was done by an uncentered correlation and array clustering was done by Spearman rank correlation under complete linkage. Pathways were identified with Ingenuity Pathway Analysis (IPA) or ConceptGen (http://conceptgen.ncibi.org/core/conceptGen/index.jsp).
Irradiated murine host signature
Significance of analysis of microarray (SAM) used a 2-class analysis with 100 permutations per comparison of the reference class to the target class, followed by a fold change cutoff of 1.5 (26). To increase stringency, a secondary, or “tandem,” bootstrapping was done by running the above SAM analysis iteratively, removing one sample from the reference class each time, including an iteration that removed no samples, to generate a list of genes regulated ≥1.5-fold present in 80% of the secondary SAM analyses.
Microarray data from 1,608 human breast tumors from the study of Ringnér and colleagues and 337 untreated human breast cancer from the study of Prat and colleagues (27, 28) classified into molecular subtypes were used for cross-species comparison (28). Human orthologs of murine genes present on the human array platforms were used to cluster human microarray data using gene cluster as above. Genes in human microarray data were filtered by criteria of log2 value >5 in 80% of the samples, before isolating the genes for clustering. In some analyses, the validity of clustering was tested against the performance of 10,000 randomly selected gene sets of the same size. Spearman correlation and complete linkage were used to assess the distribution of irradiated and sham samples in the first divisions of the clustering dendrogram, excluding dendrograms with less than 5 samples in these branches.
Irradiated host centroid
To construct the irradiated host centroid, gene expression data (n = 32) were filtered to include only genes with an expression above background in more than 75% of the samples. A t test–based P value of the median-centered probes present in the irradiated host signature was calculated for each gene based on separation of samples on host irradiation status. Gene expression centroids were calculated for probes with P < 0.01, leaving 133 of the original 323 probes. For classification of human tumors, mouse probes were translated to human genes using cross-referenced Entrez Gene IDs, leaving 72 genes in the centroid. Breast cancers were assayed for nearest centroid classification using Spearman correlation for the irradiated (IR host) and the sham (S-host) gene expression centroids.
Results
Host irradiation induces a distinct metaprofile in Trp53-null murine mammary tumors
Expression profiles from 56 tumors arising from Trp53-null fragments transplanted to cleared mammary glands of control or previously irradiated wild-type (WT) or Tgfb1 heterozygote 3-month-old mice were previously reported (22). Subsequently, Herschkowitz and colleagues showed that the transcriptional profiles of Trp53-null tumors can be classified into molecular subtypes, including two basal-like classes, luminal, claudin-low similar to human breast cancer, and a subtype unique to this model (29). To determine whether host irradiation affected the spectrum of intrinsic molecular subtypes, a intrinsic gene list, previously defined for mouse tumors (30), was used for hierarchical clustering analysis of expression profiles from these 56 tumors with 187 murine mammary tumors, including 50 other Trp53-null tumors, and 10 mouse mammary glands. SigClust (31) was used to assess the significance of tumor clustering and objectively determine significant groups/subtypes (Fig. 1; Supplementary Table S1). SigClust assigned these newly analyzed 56 Trp53-null tumors to basal-like (5 of 56), claudin-low (14 of 56), luminal (19 of 56), and p53-null (6 of 56) intrinsic subtypes. Two clusters of tumors, mostly from irradiated hosts (11 of 12), were unclassified by this method. The distribution of tumor subtypes as a function of either host irradiation and/or host genotype was not significantly different as determined by χ2 (data not shown).
Yet host irradiation confers a distinct expression signature on tumor transcriptomes (22). Because tumors arising in irradiated hosts were not enriched in a particular tumor subtype, we concluded that the gene lists that define tumors arising in irradiated hosts are metaprofiles that overlay intrinsic subtype. To further explore this biology, we generated an irradiated host profile list of 323 genes (Supplementary Table S2) significantly regulated by at least 1.5-fold in at least 80% of the secondary SAM bootstraps. As expected, tumors were clustered according to prior host irradiation (Fig. 2A). IPA implicated inflammation as a key process associated with the irradiated host environment. The top IPA interaction network included inflammatory response, cell-to-cell signaling and interaction, and organismal survival (Fig. 2B; IPA score 54). Specific inflammatory programs included proliferation of T lymphocytes (P = 8.5E-5, 21 genes), chemotaxis (P = 3.1E-4, 20 genes), and cell movement of phagocytes (P = 0.002, 16 genes). Four gene networks representing 2 cell types, stem cells and macrophages, and 2 processes, motility and autophagy, were evident (Fig. 2C).
The irradiated host signature segregates radiation-preceded human cancer
To test whether this host biology is applicable to other experimental models, we applied this gene list to published data from sporadic or radiation-preceded rat sarcomas (32) and murine Ptch1-mutant medulloblastoma (33). Most radiation-induced sarcoma and medulloblastoma were clustered by this profile (Supplementary Fig. S1A and S1B).
Encouraged by this evidence that the biology captured by the radiation chimera is useful across tumor types and species, we searched for expression profile microarray of human sporadic cancers compared with those preceded by radiation. Most radiation-preceded tumor microarray data sets are not amenable to analysis due to sample size or platform differences (15, 32, 34–36). We found 2 sets: radiation-preceded papillary thyroid carcinomas (36) and radiotherapy-associated sarcomas (Fig. 2D and E; ref. 32). Clustering using the subset of human gene orthologs present in the murine irradiated host signature resulted in segregation of sporadic from radiation-preceded cancers. Permutation analysis showed that segregation by the genes from the irradiated host was significantly better than randomly selected genes in radiation-preceded thyroid cancers (P < 0.02). This analysis suggests that host response to radiation, as defined by the radiation chimera model, also significantly affects the transcriptome of cancers arising in irradiated humans.
Association of irradiated host metaprofile and human breast cancer intrinsic subtypes
We devised a metric by which to classify breast cancers as similar to tumors from irradiated hosts by making a centroid classifier from the 323 irradiated host signature, defined herein. One centroid represents tumors from nonirradiated WT hosts and the second represents tumors from irradiated WT hosts, both based on 72 of the most differentially expressed genes of the 323 gene list. As expected, the 72-gene centroid (Table S3) discriminates Trp53-null tumors from sham-irradiated hosts and irradiated hosts (Fig. 3A). Tumor ER status, based on immunostaining, was distributed independently (Fig. 3B), even though ER-negative breast tumors arose more frequently in irradiated hosts (22), which is consistent with our previous report that a distinct signature discriminated ER-negative tumors arising in an irradiated host (22). We used the centroid to assign human breast cancers compiled by Ringnér and colleagues (28) according to similarity to sham or irradiated host centroid. Most ER-positive breast cancers associated with the sham host signature, whereas ER-negative breast cancers were strongly associated with irradiated host signature (Fig. 3C). This suggests that the transcriptome in sporadic ER-negative, basal-like human breast cancer is influenced by tissue processes similar to those that promote ER-negative tumorigenesis in the radiation chimera murine model. Several intrinsic subtypes are represented in ER-negative breast cancer, as evident in the dataset from the study of Prat and colleagues (10). Principle component analysis of the 72 genes in the centroid within the Prat UNC-337 dataset showed that claudin-low tumors were most strongly associated with the irradiated host centroid (Fig. 3D).
Metaprofiles are gene expression modules consisting of co-expressed genes that represent key biologic processes that have prognostic or treatment-predictive power for cancer (37, 38). Equipped with a gene list that had many orthologs present on human microarray platforms, we next determined how the biology represented in the irradiated host metaprofile applied to 2 datasets of sporadic human breast cancers. The first consists of 1,608 breast cancers (28). Human orthologs of 182 of the 323 irradiated host gene list clustered tumors into 2 major groups, each of which had 2 subgroups (Fig. 4A). Each of the 4 major subgroups was enriched with a particular intrinsic subtype, which were associated with different upregulated genes (Supplementary Table S4).
Subgroup 1A consisted of predominantly luminal A and B breast cancers that have an intermediate prognosis and disease-free survival compared with the other groups (Fig. 4B and C). Group 1B contained a block of basal-like breast cancers and ERBB2 tumors and fared poorly, as is consistent with prior reports for these intrinsic subtypes (10, 39). Group 2C contained many normal-like tumors and luminal subtypes and had a similar overall survival and relapse-free survival as group 1A. Notably, group 2D consisted of a mix of tumors that exhibited longer overall and disease-free survival.
It is thought that the poor prognosis of basal-like and luminal B subtypes is due to increased proliferation, as indicated by elevated expression of proliferation-related genes. To test the extent to which proliferation was driving the ability of irradiated host gene list to cluster the compiled breast cancer profiles, we removed 64 genes that were identified as being involved in “proliferation of eukaryotic cells” as annotated in IPA (Supplementary Table S5). The remaining 118 genes of the irradiated host gene list still segregated breast cancers into 4 main subgroups enriched for molecular subtypes (Supplementary Fig. S2A). Without proliferation genes, the luminal B cancer no longer shared the main bifurcation with the basal-like subcluster. Even so, 1B was both strongly enriched in basal-like breast cancer and had a much worse relapse-free survival than the other subgroups, suggesting that the biology elicited by irradiated host is an important factor in prognosis of these tumors.
We next tested the use of the irradiated host gene list in the data set from the study of Prat and colleagues (10), which classified 337 human breast cancers (UNC337) into 6 intrinsic subtypes with the addition of the sixth type characterized as claudin-low. Using 203 human orthologs of the irradiated host genes clustered basal-like tumors, normal-like tumors, and 2 distinct groups of claudin-low tumors (Fig. 5A). Notably, claudin-low and basal-like tumors were on different arms.
Three gene clusters appear to define the clustering of the subtypes (Supplementary Table S6). Cluster a contains 16 genes involved in tumorigenesis (P = 8.9E-4); cluster b contains 19 genes involved in immune response (P = 5.3E-8); and cluster c contains 29 genes involved in genetic disorders (P = 2.3E-3). Expression of each of these gene clusters was significantly different among the 6 subtypes (Fig. 5B–D). Principle component analysis of these data indicates that basal-like breast cancers are enriched for genes in cluster a, claudin-low are enriched in cluster b, and both tumors are depleted of genes in cluster c.
Claudin-low tumors exhibit epithelial-to-mesenchymal transition (EMT) features and are enriched in genes associated with stem cell biology (10). TGF-β is a key driver of EMT and mediates various aspects of stem cell biology and is highly induced by ionizing radiation (reviewed in ref. 40). The radiation chimera experiment conducted in Tgfb1 heterozygote hosts showed that the effect of host irradiation on tumor latency and growth rates was TGF-β-dependent, whereas the effect on ER status was not (22). We speculated that the profiles of tumors arising in irradiated Tgfb1 heterozygote hosts compared with those arising in control mice would be informative. SAM was done using a 2-class analysis with 100 permutations per comparison of the reference class to the target class, followed by a fold change cutoff of 1.5, followed by a secondary “tandem,” bootstrapping (26). Interestingly, tumors arising in unirradiated hosts of either genotype were indistinguishable using this method. A list of 199 genes that were present in 100% of the secondary SAM analyses were able to segregate tumors of irradiated Tgfb1 heterozygote hosts from those of nonirradiated heterozygote hosts under unsupervised hierarchical clustering, independent of ER status or histopathology subtype (Fig. 6A). It did not do so when applied to tumors from WT hosts (Fig. 6B). Thus, host irradiation in conjunction with host TGF-β levels elicits distinct transcriptional biologies of Trp53-null tumors.
IPA revealed enrichment for cancer-related genes, along with inflammatory processes such as recruitment and activation of lymphocytes and phagocytes (all P < 0.005). Gene enrichment analysis using the ConceptGen database identified extracellular matrix programs and activation of monocytes, macrophages, and dendritic cells (all P < 0.005; data not shown). Eight genes, 7 of which are “stem-related”, are present in gene lists from both irradiated WT and Tgfb1 heterozygote mice. However, 5 of 7 genes are oppositely regulated between the 2 profiles. Trp63, Igfbp2, and Id4 are upregulated in the irradiated host signature from WT-irradiated mice but downregulated in that from Tgfb1 heterozygote mice. This inverse pattern indicates that TGF-β is a critical component of the radiation response. For example, Cd133, a marker of progenitor cells and cancer-initiating cells in several cancer types including breast cancer (41–43), is present only in profile from Tgfb1 heterozygote mice.
We then applied this gene list to the UNC337 breast cancers. This list clustered breast cancer into 2 arms that represented roughly basal-like, ER-negative cancers, and luminal, ER-positive cancers (Fig. 6C). In contrast to clustering using the WT-irradiated host signature, claudin-low and basal-like were no longer in distinct arms, indicating that the biology resulting from TGF-β provides an important distinction between ER-negative basal-like and claudin-low tumors (Supplementary Table S6). Together, these analyses suggest that processes promoting cancer in the irradiated mouse are strongly associated with spontaneous basal-like and claudin-low human breast cancer, the latter of which is particularly influenced by TGF-β-associated inflammatory processes.
Conclusions
Here, we show that a gene signature derived from a murine mammary radiation chimera model is informative in both radiation-preceded and sporadic human cancer, underscoring the contribution of host biology during cancer evolution. The Trp53-null tumor subtype distribution was not particularly affected by host irradiation; rather, a distinct tumor microenvironmental transcriptome signature could be discerned, suggesting that tumors were “imprinted” by prior host radiation exposure. Together, our analyses support the idea that the radiation response of the microenvironment is a significant component of the carcinogenic process. Moreover, segregation using this signature suggests that similar processes may underlie the development of radiation-preceded human cancer and specific subtypes of apparently sporadic breast cancers.
The 323 irradiated host signature identified herein was enriched for genes indicative of inflammation, including a macrophage module, suggesting that either the recruitment or activation of inflammatory cells may underlie the effect of radiation on cancer. The human gene orthologs of a centroid classified breast cancers into distributions that suggest that a subgroup of ER-negative, basal-like intrinsic subtypes were like Trp53-null tumors that arose in the irradiated hosts. The relevance of the Trp53-null mammary model is supported by recent report of the Cancer Genome Atlas network on the molecular portraits of human breast cancer (44). The study group found that 80% of basal-like breast cancers harbored mutations in TP53, most of which were nonsense and frame shift mutations. The ER-negative, TP53-mutant, basal-like subtype was distinct from the other subtypes across all mRNA, miRNA, sequencing, and DNA copy number array platforms, suggesting that perhaps similar mechanisms may be detected in radiation-preceded cancer. The human orthologs of the irradiated murine host signature clustered 2 breast cancers datasets into groups with distinct outcomes and discriminated between closely related basal-like and claudin-low breast cancers. Of particular interest is that TGF-β-mediated inflammatory processes strongly define the claudin-low breast cancers (45).
The signature derived from the radiation chimera model also provided important insights into features of sporadic human breast cancer. Several recent studies have turned attention to the stroma to derive prognostic value by using expression profiling of stromal and extratumoral tissues (46). Using microdissected stroma from breast cancer, Finak and colleagues showed that a stroma-derived prognostic predictor stratifies disease outcome based on a signature of immune mediators, hypoxia, and angiogenesis (47). Analysis of the expression profiles from invasive breast cancer and ductal carcinoma in situ provides evidence that stromal biology is a key determinant of progression (48). Consistent with this, the presence of distinct subtypes of microenvironment, an active versus inactive cancer-adjacent microenvironment, influences the aggressiveness and outcome of ER-positive human breast cancers (46). Our data suggests that host biology induced in the radiation chimera model has strong parallels to the biology that underlies aggressive, ER-negative sporadic breast cancers.
Disclosure of Potential Conflicts of Interest
C.M. Perou has employment (other than primary affiliation; e.g., consulting) in University Genomics as board member and BioClassifier LLC as board member and has ownership interest (including patents) in University Genomics and BioClassifier LLC. No potential conflicts of interest were disclosed by the other authors.
Authors' Contributions
Conception and design: D.H. Nguyen, J.-H. Mao, M.H. Barcellos-Hoff
Development of methodology: D.H. Nguyen, A. Balmain, J.-H. Mao
Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): A. Balmain
Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): D.H. Nguyen, E. Fredlund, W. Zhao, C.M. Perou, A. Balmain, J.-H. Mao, M.H. Barcellos-Hoff
Writing, review, and/or revision of the manuscript: D.H. Nguyen, E. Fredlund, C.M. Perou, A. Balmain, J.-H. Mao, M.H. Barcellos-Hoff
Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): D.H. Nguyen, J.-H. Mao
Study supervision: J.-H. Mao, M.H. Barcellos-Hoff
Grant Support
This research was supported by funding from NASA Specialized Center for Research in Radiation Health Effects, NNX09AM52G, and the Department of Energy, Office of Biological and Environmental Research program on Low-Dose Radiation. C.M. Perou and W. Zhao received support from the NCI Breast SPORE program P50-CA58223, by RO1-CA138255 and RO1-CA148761, and by the Breast Cancer Research Foundation.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.