Global transcript analysis is increasingly used to describe cancer taxonomies beyond the microscopic reach of the eye. Diagnostic and prognostic portraits are formulated by ranking cancers according to transcriptional proximity. However, the role that distinct biological factors play in defining these portraits remains undefined. It is likely that the transcriptional repertoire of cancers depends, on one hand, on the anamnestic retention of their ontogenesis and, on the other, on the emergence of novel expression patterns related to oncogenesis. We compared the transcriptional profile of primary renal cell cancers (RCCs) with that of normal kidney tissue and several epithelial cancers of nonrenal origin to weigh the contribution that ontogeny and oncogenesis make in molding their genetic profile. Unsupervised global transcript analysis demonstrated that RCCs retain transcriptional signatures related to their ontogeny and cluster close to normal renal epithelium. When renal lineage-associated genes are removed from the analysis and cancer-specific genes are analyzed, RCCs segregate with other cancers with limited lineage specificity underlying a predominance of the oncogenic process over lineage specificity. However, a RCC-specific set of oncogenesis-related genes was identified and surprisingly shared by sarcomas. In summary, the transcriptional portrait of primary RCCs is largely dominated by ontogeny. Genes responsible for lineage specificity may represent poor molecular targets for immune or drug therapy. Most genes associated with oncogenesis are shared with other cancers and may represent better therapeutic targets. Finally, a small subset of genes is associated with lineage-specific oncogenesis, and these may provide information regarding the biological behavior of RCCs and facilitate diagnostic classification of RCCs.

There is substantial evidence that global transcript analysis can differentiate tumors into subclasses beyond the discriminatory power of histopathological observation, and such a distinction may have diagnostic and prognostic value (1, 2, 3, 4, 5, 6, 7, 8, 9). Whereas in some cases, molecular subclasses may portray distinct disease taxonomies associated with specific biological behaviors and/or susceptibility to therapeutic intervention, it is possible that temporal changes associated with the dedifferentiation of cancer may segregate taxonomically identical entities into separate groups according to their genetic profile. For instance, we noticed that melanoma metastases can segregate into two molecular subclasses originally believed to represent distinct disease taxonomies (2). However, serial analysis of identical metastases through fine-needle aspiration biopsies suggested that the two subclasses were most likely the result of temporal changes of individual lesion phenotype because material obtained from the same lesions could cluster within either subgroup with a unilateral shift of the later samples toward a less differentiated phenotype (7, 10). We therefore hypothesized that global transcript analysis is influenced by the role that the anamnestic expression of lineage-specific genes may play at different time points of differentiation. However, the previously described study compared melanoma metastases with cultured normal epithelial melanocytes, stretching dedifferentiation analysis to the limit. In addition, the genetic profile of in vitro cultured melanocytes could severely misrepresent “normality” due to nonphysiologic culture conditions.

In this study, we entertained an analysis of the role that ontogeny plays in cancer using as a model primary renal cell cancer (RCCs) tissues paired with normal renal tissue subjected to identical surgical manipulation and experimental preparation. The transcriptional analysis of the paired specimens was subsequently compared with archival frozen samples of melanoma metastases, representing a putative extreme of diversity with regard to ontogenesis and neoplastic progression, and with various primary epithelial cancers to frame the boundaries of similarities and discrepancies in the transcriptional program.

Tissue Procurement

Seventeen autologous RCCs and normal kidney samples were collected at the Department of Urology of The Johannes Gutenberg University (Mainz, Germany). Specimens were collected as the result of routine operative procedures, and portions were frozen for subsequent analysis, whereas the remaining tissue was used for pathological confirmation. Tissue procurement followed standard ethical procedures according to institutional policy. Details regarding the specimens studied are shown in Table 1. One normal kidney sample (CHTN-5) and nonrenal samples were variably collected during therapy of patients referred at the Surgery Branch, National Cancer Institute (Bethesda, MD) or through the US Tissue Network (Philadelphia, PA). In addition, the following nonrenal specimens were used for the analysis: seven melanoma metastases (Mel); five primary esophageal adenocarcinomas (EO-T); two primary ovarian cancers (OT); three primary soft-tissue sarcomas (CHTN-sarcoma); two primary breast cancers (CHTN-breast); one endometrial cancer (CHTN-endometrial); one colon cancer (CHTN-colon Ad); and one laryngeal cancer (CHTN-larynx).

RNA Preparation, Amplification, and Labeling

Total RNA was extracted from frozen material using Trizol reagent (Invitrogen, Carlsbad, CA) and amplified into antisense RNA (aRNA) as described previously (7, 11, 12, 13). Although the quantity of starting total RNA was in most cases sufficient for cDNA array hybridization, we have shown repeatedly that the fidelity of aRNA hybridization is at least equal and likely superior to that of total RNA for transcriptional profiling due to lack of contaminant ribosomal and tRNA (11, 14). Therefore, we used aRNA to increase the consistency of results, particularly when low-quality total RNA was documented by Agilent Bioanalyzer 2000 (Agilent Technologies, Palo Alto, CA). After amplification, aRNA quality was tested with the Agilent Bioanalyzer (12). High-quality aRNA was obtained from 14 RCCs and 12 normal kidney samples that were subsequently subjected to analysis. Of them, 10 constituted RCCs and normal kidney pairs from the same surgical specimen (Table 1).

Similarly, total RNA from peripheral blood mononuclear cells pooled from six normal donors was extracted and amplified to serve as constant reference (7, 11, 12, 13). Test and reference RNAs were labeled with Cy5 (red) and Cy3 (green) and cohybridized to a costum-made17.5K cDNA microarray.4 Microarrays were printed at the Immunogenetics Section, Department of Transfusion Medicine, Clinical Center, National Institutes of Health (Bethesda, MD) with a configuration of 32 × 24 × 23 and contained 17,500 elements. Clones used for printing included a combination of the Research Genetics RG_HsKG_031901 8k clone set and 9,000 clones selected from the RG_Hs_seq_ver_070700 40k clone set. The 17,500 spots included 12,072 uniquely named genes, 875 duplicated genes, and about 4,000 expression sequence tags.

Data Analysis

Quality Validation.

All statistical analyses were performed using the log2-based ratios normalizing the medial log2 ratio value across the array equal to zero. Validation and reproducibility were performed using our internal reference concordance system, based on the expectation that results obtained through the hybridization of the same test and reference material in different experiments should perfectly collimate. The level of concordance was measured by periodically rehybridizing the same arbitrarily selected test sample (A375 melanoma cell line) with the reference sample. High concordance in gene expression predicts that ratios in different experimental conditions are highly reproducible. With this goal, we analyzed seven forward and seven reciprocally labeled replicate array experiments that were hybridized periodically every other 25 cDNA array slides within each printing. SDs across those 14 arrays were analyzed after labeling swap, and discordant genes due to consistent labeling bias were excluded. This analysis demonstrated a >95% concordance level. Nonconcordant genes due to reproducible or random biases were excluded from subsequent analysis (15).

Unsupervised Analysis.

Principal component analysis (PCA) was performed with the Partek Pro software program (Partek Inc., St. Charles, MO). Unsupervised clustering was performed according to the Pearson correlation method of Eisen et al.(16) and visualized with Treeview software (Stanford University, Stanford, CA). Genomic portraits were depicted according to the central method for display, using a normalization factor as suggested by Ross et al.(17). Details about different unsupervised tests are discussed in the respective results section.

Supervised Analysis.

Identification of class-specific genes was performed using a paired or an unpaired two-tailed Student’s t test, as appropriate. The same analyses were performed using unpaired Wilcoxon’s nonparametric assessment and provided the same conclusions (data not shown). Identification of relatively consistent gene expression in various data sets was performed by comparing the median log2 ratio values over mean log2 ratio values for each subset of samples among the genes found to be significantly overexpressed in cancer specimens compared with normal kidney samples. For each subset analyzed, consistently expressed genes were considered those that had a median/mean ratio ≥ 1 because at least 50% of the samples exhibited an average expression value equal to or above the average expression value for that subset. Sarcoma samples were excluded from the analysis, and their statistics are presented separately. Three subsets of genes were explored. The first subset included genes preferentially expressed by RCCs compared with other tumors (Δ > 2 Oth, where Δ = median log2 ratio for RCCs − median log2 ratio for other tumors, and Oth = median log2 ratio for other tumors). The second subset included genes concordantly expressed by RCCs and other tumors (Δ ≤ 2 Oth, where Δ = median log2 ratio for RCCs − median log2 ratio for other tumors, and Oth = median log2 ratio for other tumors). The third subset included genes preferentially expressed by tumors other than RCCs (−Δ ≤ RCCs, where Δ = median log2 ratio for RCCs − median log2 ratio for other tumors, and RCCs = median log2 ratio for RCCs).

Influence of Ontogenesis on the Transcriptional Program of RCCs.

PCA was applied to the complete data set that included approximately 16,000 genes after automatic filtering of poor-quality spots (Fig. 1,A). This analysis clustered together all normal renal samples. RCCs samples clustered close to normal renal tissue, gradually spreading from this “normality” hub. Interestingly, three soft-tissue sarcomas approached the expression profile of the RCCs most distant from the normal renal tissue cluster. All other primary epithelial cancers clustered progressively away from the RCCs and sarcomas. Melanoma metastases clustered quite distant from all other samples (data not shown). Unsupervised analysis was performed also, after filtering genes with at least 85% presence in all experiments and at least one experiment with a ratio of >5. This high-stringency strategy focused the analysis on 5,098 genes most likely to play a significant role in either ontogenesis (genes expressed in both RCCs and normal renal tissues) or oncogenesis (genes expressed in RCCs and other cancers) Unsupervised hierarchical clustering largely confirmed the PCA analysis and identified three renal tissue-related clusters (Fig. 1 B). One cluster included nine RCCs specimens (cluster a) and two other subclusters closer to each other. One of them included a combination of three RCCs and five normal kidney samples plus the normal kidney sample obtained from the US Tissue Network (cluster b). A third cluster included eight normal kidney samples and one RCCs (R2828 TU) representing the only chromophobe tumor studied (cluster c). Finally, one of the RCCs samples (R2833 TU) clustered away from renal tissues together with other epithelial cancers.

Although the distribution of samples in the three clusters suggested that cluster a contained a more undifferentiated type of RCCs than cluster b, no correlation was noted between the distribution of the RCCs samples in either cluster and their tumor-node-metastasis (TNM) staging or histologic grading (Table 1). Interestingly, three primary soft-tissue sarcomas were included in cluster a in agreement with the PCA analysis.

All other primary cancers of nonrenal lineage segregated from the transcriptional profile of renal samples (Fig. 1 B). Therefore, primary RCCs clustered closer to normal kidney tissue than to other epithelial nonurologic cancers, underlining the predominance of ontogeny on their global expression pattern. As predictable, melanoma metastases segregated from all other specimens either because of the dedifferentiation that accompanies the metastatic process or because of their neuroectodermal origin. Melanoma metastases were combined with the data set only in this preliminary analysis to portray a “biological extreme” that could help frame the transcriptional profile of the primary cancers studied here between the highly differentiated profile of normal kidney tissue and the highly undifferentiated profile of metastatic tumors. Melanoma metastases were not further analyzed because this study was aimed at analysis of the differentiation pattern of primary RCCs compared with that of other primary epithelial nonurologic cancers.

To separate ontogeny from oncogenesis, a supervised analysis was performed to identify genes among the complete data set that were differentially expressed between RCCs included in cluster a and normal kidney. Genes were identified at an arbitrarily set significance threshold of P2 < 0.001 (two-tailed, unpaired Student’s t test). RCCs samples included in groups b and c were excluded with the assumption that a partial overlap of expression of RCCs-specific and renal tissue-specific genes was at the basis of their clustering in proximity to normal renal samples. This analysis identified 350 clones responsible for the segregation of RCCs and normal kidney specimens. Obviously, other genes related to ontogenesis still expressed by RCCs in group a could not be identified by this strategy. Therefore, the information portrays a partial picture still likely to be sufficient to provide a sense of the effect of lineage-specific gene expression. A high percentage of the genes appeared to be up-regulated in clusters b and c (green vertical bars in Fig. 1,B) that included mostly normal kidney samples. Thus, genes progressively down-regulated during neoplastic dedifferentiation were most likely related to ontogeny. Another set of genes was up-regulated predominantly in RCCs lesions and coordinately expressed by other epithelial cancers of nonrenal lineage, therefore representing genes associated with oncogenesis (red vertical bars in Fig. 1,C). Unsupervised reclustering of the experimental samples based on the 350 clones enhanced the separation between RCCs and normal kidney approximating RCCs to other epithelial cancers, emphasizing the predominance of shared patterns of gene expression when genes responsible for lineage descent are removed (Fig. 2 A).

In an independent analysis, the same 9 RCCs samples in cluster a were compared with all other epithelial primary cancers using a significance cutoff identical to the one used for their comparison against the 13 normal kidney specimens (P2 < 0.001, two-tailed unpaired Student’s t test) that identified 350 differentially expressed genes. This analysis identified 716 transcripts differentially expressed between RCCs in group a and other epithelial cancers. This difference was not attributable to different statistical power between the two comparisons because the number of samples in both groups compared with RCCs in group a was similar (14 normal kidney samples and 12 nonrenal epithelial cancers compared with a constant number of RCCs). The number of genes differentially expressed between RCCs and other epithelial cancers (716 genes) was twice the number of genes differentially expressed between the RCCs lesions in group a and normal kidney epithelium (350 genes). This finding underlines the closer proximity of the least differentiated primary RCCs to their lineage compared with other cancers when their global transcriptional profile is monitored.

An analysis using the same significance cutoff was performed on 10 paired RCCs lesions and normal kidney samples derived from the same surgical specimens. This paired analysis identified 170 differentially expressed genes (the smaller number is due to the smaller sample size), mostly overlapping those identified with the previous analyses, and included a majority of genes overexpressed in normal renal tissue (data not shown).

Influence of the General Process of Oncogenesis on the Transcriptional Program of RCCs.

To identify genes associated with oncogenesis independently of lineage, we looked for genes expressed differentially between all primary tumors (RCCs and non-RCCs) and normal kidney with the same arbitrary significance cutoff P2 of <0.001 (two-tailed Student’s t test). This process identified 1,347 genes. Some of the differences identified could be attributed to a predominant effect of genes differentially expressed between nonrenal tumors and normal kidney tissue still due to lineage diversity. Therefore, we separated the recovered 1,347 genes into those that were differentially expressed between RCCs and other cancers (possibly because RCCs retained lineage-specific expression patterns) and those that were not differentially expressed between the same two groups. This process identified 276 genes that were differentially expressed at the <0.001 level between RCCs and other primary cancers, whereas the remainder were above that threshold. Further analysis of these genes demonstrated that they were related to renal lineage specificity, and their inclusion in the data set was due to the strong differences between epithelial cancers and normal kidneys. However, few genes were truly specific for renal oncogenesis, and those were rescued with a separate analysis discussed later. Because the range of significance between <0.001 and 0.01 was felt to be ambiguous for the separation of genes belonging to either group, these genes were also excluded from the analysis. Thus, another 233 genes were excluded from the data set, and the remaining 850 genes were left as putative oncogenesis-specific genes. No information about normal tissues from the other tumors was available. Therefore, some of the differences found might still be due to the presence of lineage-specific genes related to ontogeny of other cancers; however, this is unlikely because these cancers originated from different epithelial tissues.

Among the 850 clones differentially up-regulated in cancer independently of histologic background, 176 clones included expression tag sequences or other genes with unknown function. Therefore, we eliminated them from further analysis and focused our attention on the remaining 674 clones with functional annotations (Fig. 2,B). As expected, unsupervised reshuffling of tissue samples based on these genes (see the dendrogram in Fig. 2 C) segregated all normal renal samples into a separate cluster that also included one RCCs (R2838 TU; red bar). Primary epithelial tumors segregated into two main subclusters, one exclusively including seven RCCs (blue bar) and a second one that included all other tumors including sarcomas and the remaining six RCCs lesions (maroon bar). Therefore, even the elimination of genes differentially expressed between RCCs and other epithelial cancers did not completely abrogate the lineage specificity, at least for some RCCs. The first cluster exclusive for RCCs included all of the RCCs samples from groups b and c (R2834 TU, R2839 TU, and R2828 TU) except for R2838 TU that clustered together with normal renal tissue. The other cluster included exclusively RCCs that belonged to the least differentiated cluster a and R2833 TU that also clustered in the previous analyses with tumors other than RCCs, emphasizing the lower level of differentiation of these samples.

Of the 674 genes specific for oncogenesis, 329 were down-regulated in cancer tissues compared with normal kidney, and most of them represented lineage-specific genes associated with renal function. In the end, 345 genes were identified that were specifically up-regulated in cancerous tissues compared with normal renal tissue.

Interestingly, we noted substantial heterogeneity in the expression of these genes in different cancers independent of lineage. Thus, we examined which of the oncogenesis-related genes were consistently expressed in all cancers or more frequently expressed in RCCs lesions or cancers of different histology. This was achieved by identifying within each group the genes with the highest median ratio. Due to their ambiguous biological behavior, sarcomas were excluded from the analysis. Those genes with a log2 ratio between median and mean value ≥ 1 within each category were consistently expressed. This parameter identified genes that were expressed above the threshold level in at least 50% of cancers within a given category.

This exercise identified 132 genes whose median expression was similar or higher than their mean level of expression. Of those, 41 were categorized as expressed in RCCs samples, 43 were concordantly expressed by most cancers, and 48 were preferentially expressed by nonurologic epithelial tumors (Supplementary Tables 1–3, respectively).

Influence of RCCs-specific Oncogenesis on the Transcriptional Program of RCCs.

A separate analysis was performed to identify genes uniquely expressed by RCCs and unrelated to their renal heritage. This separate analysis was performed to rescue genes possibly missed by the previous analysis due to their down-regulation in RCCs or due to the dilution of their significance level by combining in the analysis the total number of primary cancers. The complete data set was reanalyzed by comparing 14 RCCs samples against the other 12 epithelial cancers. First, this was performed by studying all RCCs cancers independently of the relationship among themselves according to the unsupervised clustering shown in Fig. 1 B. The comparison between the two sets of cancers was performed based on a two-tailed unpaired Student’s t test. This analysis identified 1,114 clones differentially expressed by the two sets of cancers. Ontogenesis-related genes were subsequently excluded by subtracting genes coexpressed between RCCs and normal renal tissue (P2 ≥ 0.001). The large majority of genes dissipated with this test, suggesting that differences between RCCs and other cancers are predominantly due to lineage specificity. However, 43 clones remained that were highly specific for RCCs and differentially expressed between RCCs and other cancer as well as normal renal tissue.

Genes Commonly Expressed by Cluster a RCCs and Sarcomas.

To explain the close association between RCCs in cluster a and sarcomas, genes differentially expressed by these tissues were compared with those of other tumors excluding melanoma metastases and other RCCs that did not cluster in cluster a. This analysis identified 597 genes differentially expressed at the 0.001 level (two-tailed t test). The large majority of these genes were coordinately expressed by normal renal tissue, suggesting that the proximity of sarcomas to RCCs is due to a particular expression profile in common with normal tissues (compared with other cancer) or specifically to normal renal tissue. For identification of genes specifically coordinately expressed by RCCs in cluster a and sarcomas, the expression profile of both tissue types was compared with normal renal tissue at the P2 value of 0.001 (two-tailed t test). In this fashion, 48 clones were identified, of which only a few were up-regulated (Fig. 3). The genes almost totally overlapped the RCCs-specific genes identified with the previous analyses.

Expression of Glutathione S-Transferase and Carbonic Anhydrase in RCCs.

Because of their established relevance in the context of RCCs, we specifically analyzed the expression of the glutathione S-transferase (GST) and carbonic anhydrase (CA) gene products (8, 18; Fig. 4). As expected, GST A 2 and 3 were expressed in some RCCs and in normal kidney tissue. Similarly, CA II and IX were preferentially expressed by RCCs, including the only chromophobe RCCs (R2828 TU) that expressed high levels of CA II, as suggested by others (8). Interestingly, whereas CA IX expression was shared by other primary epithelial cancers (Supplementary Table 2), CA II appeared to be selectively expressed by RCCs and sarcomas and minimally expressed by normal renal parenchyma.

Expression of Genes Reported by Others to Have Prognostic Significance.

Takahashi et al.(19) have described two molecular phenotypes of clear cell RCCs associated with divergent prognosis. We therefore searched our database for genes identified as prognostic markers by that study. A significant proportion of them were present in our arrays and found to be significantly up-regulated or down-regulated in some clear cell RCCs. As reported, these genes were RCCs specific and, even in this small series, separated clear cell RCCs into two subclasses (Fig. 5). Interestingly, we observed that the clear cell RCCs subclass expressing good prognostic indicators contained all of the RCCs that had appeared as most differentiated clustering in group b or c, suggesting that this relatively benign phenotype combines the expression of oncogenesis-related good indicators with a less-differentiated transcriptional program.

The molecular subclassification of kidney tumors and the discovery of new diagnostic markers were described previously by others and are beyond the purpose of this work (4, 5, 8, 19). Our goal, rather, was to explain the common observation of cancers of similar histology segregating onto different molecular subgroups. For instance, Takahashi et al.(19) described molecular subgroups of RCCs that could prognosticate the natural history of the RCCs. That study, however, did not distinguish whether the two subclasses represented distinct disease taxonomies or different stages of a genetically similar disease. In a previous study, we postulated that melanomas could be subcategorized into different molecular classes based on their transcriptional profile (2). Subsequently, analysis of serial fine-needle aspirates obtained from identical lesions demonstrated a unidirectional shift of molecular profiles of the same lesions from a more to less differentiated phenotype along with disease progression, raising questions about a rigid classification of morphologically identical diseases based on one time point observations (7). In this study, therefore, we measured the role that the anamnestic expression of lineage-specific genes plays in determining the genetic classification of morphologically similar RCCs. We relied predominantly on clear cell RCCs specimens (12 of the 14 specimens analyzed) representing the most common type of RCCs, resulting mostly from the relatively homogenous effects of VHL gene inactivation. In addition, a chromophobic RCCs (R2828 TU) and a chromophilic RCCs (R2858 TU) were included.

The first finding was that primary RCCs segregate into at least two molecular subgroups. Such subgroups are portrayed in Fig. 1,A by cluster a containing only RCCs samples and the closely related clusters b and c with a mixture of RCCs and normal renal specimens including the chromophobe RCCs (R2828 TU) consistent with the relatively high level of differentiation of this tumor (4). Interestingly, both normal kidney and tumor tissues of the samples 2838 and 2839 clustered into group b, whereas other normal and tumor samples are quite distant in Fig. 1 B. We truly have no exact explanation for the specific pairing of these two tumor/normal tissue samples, although we and others have seen it before in other experimental situations. The best explanation we have is that in conditions of relatively good differentiation, primary tumors tend to align genetically with the normal tissue of origin because of the individual’s genetic background. This hypothesis has never been properly tested, and a much larger population would be necessary to demonstrate this point. We are, however, quite confident that the similarities noted are not artifactual, based on RNA quality analysis and pathological testing (the samples from which RNA was extracted were composed almost exclusively of tumor or normal cells, respectively, in a proportion similar to other samples). The genes responsible for the differences noted between the RCCs of cluster a and those of the other two clusters belonged predominantly to the renal lineage and were mostly coexpressed by the latter groups of RCCs and normal renal tissues, suggesting that global transcript separated RCCs based on their level of differentiation. Conversely, genes overexpressed by RCCs of group a represented a minority and were coordinately expressed by the other RCCs and other primary epithelial cancers. Thus, one could conclude that if no information was available about the transcriptional profile of paired normal tissues, it would have been difficult to explain the separation of the two major classes of RCCs (cluster a compared with the close subcluster b or c), and it might have been tempting to attribute such differences to separate taxonomies of disease as described previously in the context of metastatic melanoma (2) before serially analyzing identical lesions during disease progression (7). In conclusion, the identification of molecular subcategories of disease may be influenced by the level of dedifferentiation of a given cancer rather than portray a distinct disease taxonomy.

The second observation suggests that removal of lineage-specific genes segregated primary RCCs together with other nonurological cancers because at least some mechanisms of oncogenesis are shared by most cancers. Thus, this study suggests that most genes responsible for segregating cancers of various histology into molecular subclasses are related to the ontogeny of the individual cancers, whereas only a small proportion is responsible for the subclassification of cancers into different molecular entities when ontogeny is removed.

The third important observation demonstrated that although the exclusion of lineage-specific genes closely approximates RCCs with other nonurologic epithelial cancers, several genes (signatures) could be identified as specific for RCCs and, surprisingly, soft-tissue sarcomas. Several of the genes have been described previously by suppression subtractive hybridization (20) and functional genomics (19) to be strongly associated with RCCs oncogenesis, such as, for instance, lysyl oxidase and ceruloplasmin. Ceruloplasmin is secreted by human clear cell carcinoma cells in patients (21) and relevant experimental models (22). Caveolin-1 overexpression has been associated with poor prognosis in RCCs (23). Enolase 2 (neuronal γ-enolase) is a specific serum marker for RCCs (24), and experimental models have shown that an isozyme switch from α-enolase to γ-enolase occurs during carcinogenesis (25), an observation that we confirmed in this study comparing normal renal tissue with RCCs. Similarly, complement component 1 was characterized a long time ago as a risk factor for the development of RCCs (26). Collagen type V has been directly associated with the VHL/hypoxia pathway, but its role in oncogenesis has not been determined (27), whereas collagen type VI is commonly expressed in RCCs and minimally expressed by normal kidney after development (28). Interestingly, both collagen type V and VI were preferentially expressed in sarcomas, although they were coexpressed by RCCs (27). Annexin IV has been recently identified as selectively expressed by RCCs using combined two-dimensional gel electrophoresis (29). CD68 has been associated with extensive infiltration of RCCs tissue by macrophages (30) and, together with the expression of major histocompatibility complex molecules, has been previously demonstrated to be strongly increased in RCCs tissues compared with normal renal tissue and benign renal tumors (31). In addition, insulin-like growth factor-binding protein 3 coexpressed specifically by RCCs in group a and sarcomas (Fig. 3) has been described previously by gene expression profiling (8).

Recent studies suggested that GST A is expressed in the proximal convoluted tubules of normal renal parenchyma and that expression of GST A is retained mostly by clear cell RCCs (8). In fact, GST A 2 and 3 were coordinately expressed in most clear cell RCCs (8 of 13) but were not expressed by the chromophobe (R2828 TU) and chromophilic (R2858) RCCs as predicted by others (8). GST was also expressed by normal renal tissue but not by sarcomas, in line with the anamnestic expression of these genes.

CA IX is an independent predictor of survival in RCCs (18), whereas CA II is a reliable marker specific for chromophobe RCCs and erratically expressed by clear cell RCCs (30%) (8). CA II is also expressed in a small proportion of distal convoluted tubules of normal renal parenchyma. Indeed, CA II transcript could be found in normal kidney tissue, suggesting that expression of CA II transcript is related to the ontogenesis of RCCs. Although the highest level of CA II expression was noted in chromophobe RCCs, several clear cell RCCs expressed this gene, suggesting that, at least at the RNA level, there is overlap between the two types of RCCs (8). Interestingly, CA IX expression was specific for RCCs compared with normal renal tissue, confirming that this gene may be a better target for drug or immune therapy (32). Interestingly, the expression of CA IX was shared by other primary epithelial cancers but not melanoma metastases, suggesting that the expression of this gene is at least partly associated with oncogenesis and that it may not be solely responsible for the immune responsiveness of these diseases to immune therapy.

Other genes such as phosphofructokinase have been associated with childhood nephroblastomas, but not with adult RCCs (33). In addition, scavenger receptors have been identified on RCCs cell lines in the past, although no further work has been performed on this subject (34). Surprisingly, tapasin appeared to be overexpressed in RCCs, which is in contrast with immunohistochemical findings using paraffin-embedded tumor material (35). It is possible that the increased mRNA tapasin levels are not associated with enhanced protein expression or that the mRNA is produced by normal cells infiltrating RCCs lesions.

Other genes that have been identified in clear cell RCCs were found in this study to also be coexpressed by primary tumors of other histology such as pro-collagen-lysine, 2-oxoglutarate 5-dioxygenase 2 (8).

Others have compared the transcriptional profile of clear cell RCCs with clinical staging and outcome (19). Surprisingly, transcriptional profiling did not differentiate between stage I/II and stage III/IV lesions. On the contrary, and most relevantly, it was found that genetic profiling could segregate clear cell RCCs into two categories, one with a very poor 5-year survival, and the other with a good prognosis. This a very interesting finding suggesting that tumor biology is better represented at the transcriptional level. We also found no correlation between clinical staging and transcriptional profiling; however, analysis of genes whose expression is associated with good prognosis in RCCs (Fig. 5) suggested that such genes are preferentially expressed by cancers with a genetic profile closer to normal kidney.

In summary, this study is largely confirmatory of other studies in which RCCs-associated markers have been described. However, the three-way comparison between RCCs, normal renal tissue, and assorted nonurologic primary epithelial cancers allowed the differentiation of genes expressed by RCCs as a remnant of their origin from normal kidney from those directly related to the general process of oncogenesis. This differentiation is important because different categories may have diverse biological, diagnostic, and therapeutic value. Genes responsible for lineage specificity may represent poor molecular targets for immune or drug therapy. Most genes associated with oncogenesis are shared with other cancers and may represent better therapeutic targets. Finally, a subset of gene associated with oncogenesis is lineage specific and may provide information regarding the specific biological behavior of individual cancers and facilitate their diagnostic classification. More generally, this study suggests that in large proportion, the molecular portraits of primary tumors are dependent on their ontogeny, and therefore caution should be applied when defining distinct molecularly defined subcategories of cancer.

Fig. 1.

A, PCA depicting the special distribution of normal kidney samples (R NN and CHTN-R NN), RCCs (R TU), sarcomas (CHTU Sarcoma), breast cancers (CHTU Breast Ca), colon cancers (CHTU Colon Ca), endometrial cancers (CHTU Endom. Ca), laryngeal carcinomas (CHTU Laryng. Ca), and ovarian cancers (OT TU). PCA analysis was performed on the complete data set. Melanoma metastases were excluded to simplify the graph because they clustered far away from all other tissues. B, Eisen’s clustering of the same samples including melanoma metastases based on 5,098 genes filtered from the complete data set as described in Results. The three subclasses of renal samples are shown by the blue (group a), yellow (group b), and red (group c) vertical bars. C, supervised description of the same cluster presenting the 350 genes responsible for the segregation of RCCs in group a from normal kidney samples (see Results section). Renal-related clusters according to B are shown as horizontal bars over the clustering portrait. Vertical bars underline signature of genes overexpressed in RCCs compared with normal kidney (red bars) and underexpressed (green bars).

Fig. 1.

A, PCA depicting the special distribution of normal kidney samples (R NN and CHTN-R NN), RCCs (R TU), sarcomas (CHTU Sarcoma), breast cancers (CHTU Breast Ca), colon cancers (CHTU Colon Ca), endometrial cancers (CHTU Endom. Ca), laryngeal carcinomas (CHTU Laryng. Ca), and ovarian cancers (OT TU). PCA analysis was performed on the complete data set. Melanoma metastases were excluded to simplify the graph because they clustered far away from all other tissues. B, Eisen’s clustering of the same samples including melanoma metastases based on 5,098 genes filtered from the complete data set as described in Results. The three subclasses of renal samples are shown by the blue (group a), yellow (group b), and red (group c) vertical bars. C, supervised description of the same cluster presenting the 350 genes responsible for the segregation of RCCs in group a from normal kidney samples (see Results section). Renal-related clusters according to B are shown as horizontal bars over the clustering portrait. Vertical bars underline signature of genes overexpressed in RCCs compared with normal kidney (red bars) and underexpressed (green bars).

Close modal
Fig. 2.

A, clustering of all primary cancer samples and normal kidney samples based on the 350 genes described in Fig. 1,C (see Results section). Three major categories are defined: one containing normal renal samples (red vertical bar), one predominantly including RCCs (blue vertical bar), and a third group predominantly including other primary epithelial tumors (maroon vertical bar). B, Treeview image of 674 clones believed to be differentially expressed between primary cancers and normal kidney tissue in relation to the general process of oncogenesis (see Results section). C, clustering of normal kidney and primary cancers according to the genes shown in B. Three subclasses were identified: one including normal kidneys (red bar), one containing only RCCs and including the “differentiated” samples according to Fig. 1 B (blue bar), and a third including all other primary epithelial cancers, sarcomas, and some less differentiated RCCs (maroon bar).

Fig. 2.

A, clustering of all primary cancer samples and normal kidney samples based on the 350 genes described in Fig. 1,C (see Results section). Three major categories are defined: one containing normal renal samples (red vertical bar), one predominantly including RCCs (blue vertical bar), and a third group predominantly including other primary epithelial tumors (maroon vertical bar). B, Treeview image of 674 clones believed to be differentially expressed between primary cancers and normal kidney tissue in relation to the general process of oncogenesis (see Results section). C, clustering of normal kidney and primary cancers according to the genes shown in B. Three subclasses were identified: one including normal kidneys (red bar), one containing only RCCs and including the “differentiated” samples according to Fig. 1 B (blue bar), and a third including all other primary epithelial cancers, sarcomas, and some less differentiated RCCs (maroon bar).

Close modal
Fig. 3.

Supervised clusterogram including 48 clones differentially expressed specifically by RCCs and sarcomas (see Results section). These clones partially overlap with those identified by a separate analysis and presented in Supplementary Table 1. The order of the specimens is identical to the one shown by the unsupervised dendrogram shown in Fig. 1 B with the exclusion of melanoma metastases. Gene annotations are shown to the right of the clusterogram.

Fig. 3.

Supervised clusterogram including 48 clones differentially expressed specifically by RCCs and sarcomas (see Results section). These clones partially overlap with those identified by a separate analysis and presented in Supplementary Table 1. The order of the specimens is identical to the one shown by the unsupervised dendrogram shown in Fig. 1 B with the exclusion of melanoma metastases. Gene annotations are shown to the right of the clusterogram.

Close modal
Fig. 4.

Selected gene clusterogram depicting the expression of different GST and CA gene products (see Results section). The order of the specimens is identical to the one shown by the unsupervised dendrogram shown in Fig. 1 B. Gene annotations are shown to the right of the clusterogram.

Fig. 4.

Selected gene clusterogram depicting the expression of different GST and CA gene products (see Results section). The order of the specimens is identical to the one shown by the unsupervised dendrogram shown in Fig. 1 B. Gene annotations are shown to the right of the clusterogram.

Close modal
Fig. 5.

Selected gene clusterogram based on genes reported to be predictors of good prognosis of RCCs (19) present in our arrays. Gene annotations are shown to the right of the clusterogram.

Fig. 5.

Selected gene clusterogram based on genes reported to be predictors of good prognosis of RCCs (19) present in our arrays. Gene annotations are shown to the right of the clusterogram.

Close modal

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

Note: Supplementary data for this article can be found at Cancer Research Online (http://cancerres.aacrjournals.org).

Requests for reprints: Francesco M. Marincola, Department of Transfusion Medicine, Clinical Center, Building 10, Room 1C-711, 10 Center Drive MSC 1502, Bethesda, MD 20892-1502. Phone: 301-496-9702; Fax: 301-594-1981; E-mail: [email protected]

4

http://nciarray.nci.nih.gov/gal_files/index.shtml.

Table 1

Histologic and pathologic characteristics of the specimens analyzed

IDSubtypeSize (cm)StageGradeGroup
2733 NN/TU Clear cell 7.5 pT3b, N0, Mx 
2820 NN/TU Clear cell 8.5 pT2, pN0, M0 
2825 NN/TU Clear cell 4.5 pT1, N0, M0 
2826 NN/TU Clear cell pT3b, pN0, M0 
2828 NN/TU Chromophobe 2.8 pT1, N0, M0 
2829 NN/TU Clear cell pT3a, N0, M0 
2832 NN/TU Clear cell pT3a, pN0, pM1 NA 
2833 NN/TU Clear cell 16.5 pT3b, pN0, M0 – 
2834 NN/TU Clear cell 11 pT2, pN0, M0 
2835 NN/TU Chromophilic 3.5 pT3a, pN1, M13–4 NA 
2837 NN/TU Clear cell 5.5 pT1, N0, M0 
2838 NN/TU Clear cell ND pT3b, pN2, M0 
2839 NN/TU Clear cell pT3b, pN0, M0 
2843 NN/TU Leiomyosarcoma ND  NA 
2858 NN/TU Chromophilic pT3a, pNx, pMx 
2865 NN/TU Clear cell pT4, pN2, G3 
2875 NN/TU Clear cell  pT1, Nx, Mx 
IDSubtypeSize (cm)StageGradeGroup
2733 NN/TU Clear cell 7.5 pT3b, N0, Mx 
2820 NN/TU Clear cell 8.5 pT2, pN0, M0 
2825 NN/TU Clear cell 4.5 pT1, N0, M0 
2826 NN/TU Clear cell pT3b, pN0, M0 
2828 NN/TU Chromophobe 2.8 pT1, N0, M0 
2829 NN/TU Clear cell pT3a, N0, M0 
2832 NN/TU Clear cell pT3a, pN0, pM1 NA 
2833 NN/TU Clear cell 16.5 pT3b, pN0, M0 – 
2834 NN/TU Clear cell 11 pT2, pN0, M0 
2835 NN/TU Chromophilic 3.5 pT3a, pN1, M13–4 NA 
2837 NN/TU Clear cell 5.5 pT1, N0, M0 
2838 NN/TU Clear cell ND pT3b, pN2, M0 
2839 NN/TU Clear cell pT3b, pN0, M0 
2843 NN/TU Leiomyosarcoma ND  NA 
2858 NN/TU Chromophilic pT3a, pNx, pMx 
2865 NN/TU Clear cell pT4, pN2, G3 
2875 NN/TU Clear cell  pT1, Nx, Mx 

NOTE. Bold type indicates the ability to extract and amplify high-quality aRNA for microarray hybridization. Group refers to the assignment of the RCCs according to the unsupervised analysis shown in Fig. 1 B.

Abbreviations: NN, normal kidney; TU, tumor; ND, not determined; ?, size unknown; NA, not applicable (no tumor specimen used for analysis).

1
Alizadeh AA, Eisen MB, Davis RE, et al Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling.
Nature (Lond)
2000
;
403
:
503
-11.
2
Bittner M, Meltzer P, Chen Y, et al Molecular classification of cutaneous malignant melanoma by gene expression: shifting from a continuous spectrum to distinct biologic entities.
Nature (Lond)
2000
;
406
:
536
-40.
3
Perou CM, Sertle T, Eisen MB, et al Molecular portraits of human breast tumours.
Nature (Lond)
2000
;
406
:
747
-52.
4
Boer JM, Huber WK, Sultmann H, et al Identification and classification of differentially expressed genes in renal cell carcinoma by expression profiling on a global human 31,500-element cDNA array.
Genome Res
2001
;
11
:
1861
-70.
5
Young AN, Amin MB, Moreno CS, et al Expression profiling of renal epithelial neoplasms: a method for tumor classification and discovery of diagnostic molecular markers.
Am J Pathol
2001
;
158
:
1639
-51.
6
Van’t Veer LJ, Dai H, van de Vijver MJ, et al Gene expression profiling predicts clinical outcome of breast cancer.
Nature (Lond)
2002
;
415
:
530
-6.
7
Wang E, Miller LD, Ohnmacht GA, et al Prospective molecular profiling of subcutaneous melanoma metastases suggests classifiers of immune responsiveness.
Cancer Res
2002
;
62
:
3581
-6.
8
Takahashi M, Yang XJ, Sugimura J, et al Molecular subclassification of kidney tumors and the discovery of new diagnostic markers.
Oncogene
2003
;
22
:
6810
-8.
9
Wang E, Panelli MC, Marincola FM Genomic analysis of cancer.
Princ Pract Oncol
2003
;
17
:
1
-16.
10
Marincola FM, Wang E, Herlyn M, Seliger B, Ferrone S Tumors as elusive targets of T cell-directed immunotherapy.
Trends Immunol
2003
;
24
:
334
-41.
11
Wang E, Miller L, Ohnmacht GA, Liu E, Marincola FM High fidelity mRNA amplification for gene profiling using cDNA microarrays.
Nat Biotechnol
2000
;
17
:
457
-9.
12
Wang E, Marincola FM Amplification of small quantities of mRNA for transcript analysis Bowtell D Sambrook J eds. .
DNA arrays: a molecular cloning manual
2002
p. 204
-13. Cold Spring Harbor Laboratory Press Cold Spring Harbor, NY
13
Panelli MC, Wang E, Phan G, et al. Genetic profiling of peripheral mononuclear cells and melanoma metastases in response to systemic interleukin-2 administration. Genome Biol. Available at: http://genomebiology.com/2002/3/7/research/0035.
14
Feldman AL, Costouros NG, Wang E, et al Advantages of mRNA amplification for microarray analysis.
Biotechniques
2002
;
33
:
906
-14.
15
Jin P, Zhao Y, Ngalame Y, et al Looking for reference genes in a broad panel of gene expression array experiments.
BMC Genomics
2004
;
5
:
55
16
Eisen MB, Spellman PT, Brown PO, Botstein D Cluster analysis and display of genome-wide expression patterns.
Proc Natl Acad Sci USA
1998
;
95
:
14863
-8.
17
Ross DT, Scherf U, Eisen MB, et al Systematic variation in gene expression patterns in human cancer cell lines.
Nat Genet
2000
;
24
:
227
-35.
18
Bui MH, Seligson D, Han KR, et al Carbonic anhydrase IX is an independent predictor of survival in advanced renal clear cell carcinoma: implications for prognosis and therapy.
Clin Cancer Res
2003
;
9
:
802
-11.
19
Takahashi M, Rhodes DR, Furge KA, et al Gene expression profiling of clear cell renal cell carcinoma: gene identification and prognostic classification.
Proc Natl Acad Sci USA
2001
;
98
:
9754
-9.
20
Stassar MJ, Devitt G, Brosius M, et al Identification of human renal cell carcinoma associated genes by suppression subtractive hybridization.
Br J Cancer
2001
;
85
:
1372
-82.
21
Pejovic M, Djordjevic V, Ignjatovic I, Stamenic T, Stefanovic V Serum levels of some acute phase proteins in kidney and urinary tract urothelial cancers.
Int Urol Nephrol
1997
;
29
:
427
-32.
22
Saito K, Saito T, Draganac PS, et al Secretion of ceruloplasmin by a human clear cell carcinoma maintained in nude mice.
Biochem Med
1985
;
33
:
45
-52.
23
Campbell L, Gumbleton M, Griffiths DF Caveolin-1 overexpression predicts poor disease-free survival of patients with clinically confined renal cell carcinoma.
Br J Cancer
2003
;
89
:
1909
-13.
24
Yaman O, Baltaci S, Arikan N, et al Serum neuron specific enolase: can it be a tumour marker for renal cell carcinoma?.
Int Urol Nephrol
1996
;
28
:
207
-10.
25
Takashi M, Sakata T, Inaguma Y, Kato K Elevated concentrations of gamma-enolase in renal cell tumors in rats: similarity to renal cell carcinoma in man.
Urol Res
1996
;
24
:
375
-9.
26
Germenis A, Dimopoulos MA, Fertakis A, Dimopoulos C Genetic markers in renal adenocarcinoma.
J Urol
1984
;
132
:
173
-4.
27
Wykoff CC, Pugh CW, Maxwell PH, Harris AL, Ratcliffe PJ Identification of novel hypoxia dependent and independent target genes of the von Hippel-Lindau (VHL) tumour suppressor by mRNA differential expression profiling.
Oncogene
2000
;
19
:
6297
-305.
28
Lohi J, Leivo I, Oivula J, Lehto VP, Virtanen I Extracellular matrix in renal cell carcinomas.
Histol Histopathol
1998
;
13
:
785
-96.
29
Unwin RD, Harnden P, Pappin D, et al Serological and proteomic evaluation of antibody responses in the identification of tumor antigens in renal cell carcinoma.
Proteomics
2003
;
3
:
45
-55.
30
Hemmerlein B, Markus A, Wehner M, et al Expression of acute and late-stage inflammatory antigens, c-fms, CSF-1, and human monocytic serine esterase 1, in tumor-associated macrophages of renal cell carcinomas.
Cancer Immunol Immunother
2000
;
49
:
485
-92.
31
Ohmori T, Okada K, Arita N, et al Characteristics of MHC antigen expression and tumor-infiltrating mononuclear cells in renal cell adenomas and carcinomas.
Histol Histopathol
1995
;
10
:
789
-94.
32
Mukouyama H, Janzen NK, Hernandez JM, et al Generation of kidney cancer-specific antitumor immune responses using peripheral blood monocytes transduced with a recombinant adenovirus encoding carbonic anhydrase 9.
Clin Cancer Res
2004
;
10
:
1421
-9.
33
Rousseau-Merck MF, Cottreau D, Kahn A Isozyme pattern in serially xenotransplanted childhood tumors.
Cancer Res
1984
;
44
:
1163
-6.
34
Miki S, Matsumoto A, Nakamura Y, et al Expression of scavenger receptors on renal cell carcinoma cells in vitro.
Biochem Biophys Res Commun
1992
;
189
:
1323
-8.
35
Eskander ED, Harvey HA, Givant E, Lipton A Phase I study combining tumor necrosis factor with interferon-alpha and interleukin-2.
Am J Clin Oncol
1997
;
20
:
511
-4.

Supplementary data