Embryonal carcinoma is a histologic subgroup of testicular germ cell tumors (TGCTs), and its cells may follow differentiation lineages in a manner similar to early embryogenesis. To acquire new knowledge about the transcriptional programs operating in this tumor development model, we used 22k oligo DNA microarrays to analyze normal and neoplastic tissue samples from human testis. Additionally, retinoic acid–induced in vitro differentiation was studied in relevant cell lines. We identified genes characterizing each of the known histologic subtypes, adding up to a total set of 687 differentially expressed genes. Among these, there was a significant overrepresentation of gene categories, such as genomic imprinting and gene transcripts associated to embryonic stem cells. Selection for genes highly expressed in the undifferentiated embryonal carcinomas resulted in the identification of 58 genes, including pluripotency markers, such as the homeobox genes NANOG and POU5F1 (OCT3/4), as well as GAL, DPPA4, and NALP7. Interestingly, abundant expression of several of the pluripotency genes was also detected in precursor lesions and seminomas. By use of tissue microarrays containing 510 clinical testicular samples, GAL and POU5F1 were up-regulated in TGCT also at the protein level and hence validated as diagnostic markers for undifferentiated tumor cells. The present study shows the unique gene expression profiles of each histologic subtype of TGCT from which we have identified deregulated components in selected processes operating in normal development, such as WNT signaling and DNA methylation.

In many respects, germ cell tumorigenesis resembles early embryogenesis (1). The most common type of human germ cell tumor, testicular germ cell tumor (TGCT) of adolescents and young adults, develops from premalignant and noninvasive intratubular germ cell neoplasia (IGCN, also called carcinoma in situ) and is histologically classified into seminoma, nonseminoma, or a combination of the two (2). Whereas seminomas morphologically resemble IGCN, nonseminomas are a more heterogeneous group of tumors. In addition to the undifferentiated and pluripotent embryonal carcinomas, which have a capacity to differentiate into various lineages, the nonseminomas include yolk sac tumors and choriocarcinomas with extraembryonic differentiation as well as teratomas with somatic differentiation (Fig. 1A and B).

Figure 1.

Relationship between embryogenesis and testicular tumorigenesis. A, human early embryogenesis. B, developmental relationship between the various histologic subtypes included in the gene expression analysis. Colors correspond to the related stages in the embryogenesis. C, relatedness of the samples illustrated by principal component analysis with input of the 1,575 genes that had at least three samples with expression levels deviating >4-fold from the median across the sample set. The second, third, and sixth components are shown. The first component mostly separates normal versus tumor and is not included in the plot as it does not contribute to the subgrouping of tumor samples. D, number of genes that deviates >3-fold in expression levels between the various in vivo and in vitro histologic subgroups. The median expression levels within each group were used. N, normal testis; I, IGCN; S, seminoma; E, embryonal carcinoma; C, choriocarcinoma; Y, yolk sac tumor; T, teratoma; U, undifferentiated cell line; D, differentiated cell line.

Figure 1.

Relationship between embryogenesis and testicular tumorigenesis. A, human early embryogenesis. B, developmental relationship between the various histologic subtypes included in the gene expression analysis. Colors correspond to the related stages in the embryogenesis. C, relatedness of the samples illustrated by principal component analysis with input of the 1,575 genes that had at least three samples with expression levels deviating >4-fold from the median across the sample set. The second, third, and sixth components are shown. The first component mostly separates normal versus tumor and is not included in the plot as it does not contribute to the subgrouping of tumor samples. D, number of genes that deviates >3-fold in expression levels between the various in vivo and in vitro histologic subgroups. The median expression levels within each group were used. N, normal testis; I, IGCN; S, seminoma; E, embryonal carcinoma; C, choriocarcinoma; Y, yolk sac tumor; T, teratoma; U, undifferentiated cell line; D, differentiated cell line.

Close modal

Several cell lines derived from TGCT express features typical of embryonal carcinomas (3). A few of these cell lines, like the pluripotent NTERA2, differentiate extensively in culture in response to morphogens like retinoic acid (RA; ref. 4). When xenografted, they spontaneously differentiate into teratomas (5). Thus, induction of differentiation of NTERA2 cells is a model for the in vivo differentiation of embryonal carcinomas (Fig. 1). However, other established human embryonal carcinoma cell lines, such as the 2102Ep, are relatively nullipotent and remain in the undifferentiated state even after RA treatment (6); they form xenografts of pure embryonal carcinoma histology (7).

To explore the transcriptional programs during germ cell tumorigenesis, we studied samples of each histologic subtype of TGCT, testicular parenchyma from healthy individuals, and samples taken from testis with IGCN. We also tested the RA-induced in vitro differentiation of the two embryonal carcinoma cell lines NTERA2 and 2102Ep. Particular attention was paid on dysregulation of gene categories known to be related to the early embryogenesis, such as the homeobox gene family, genes specifically expressed in embryonic stem cells, genes involved in the WNT, transforming growth factor-β (TGF-β), and Notch signaling pathways, and genes related to DNA methylation and imprinting.

Tissue samples and cell culture. The tumors were histologically classified according to the WHO recommendations (2). Only tissue samples containing a single histologic component from H&E-stained sections of the frozen tissue were selected (Table 1). Included were normal testicular parenchyma (n = 3), premalignant IGCN (n = 3), and the malignant histologic subtypes seminoma (n = 3), embryonal carcinoma (n = 5), yolk sac tumor (n = 4), choriocarcinoma (n = 1), and teratoma (n = 4). Two of the three normal testicular tissue samples were from organ donors (22 and 29 years) with no known history of cancer. The third morphologically normal tissue was obtained adjacent to a TGCT. The present use of the human clinical material is approved by our institution, the Norwegian Radium Hospital, University Division, and the Regional Committee for Medical Research Ethics, The Health Region South, Norway.

Table 1.

Sample set

Sample IDHistologyAdditional information
N_9013 Normal testicular parenchyma Healthy individual 
N_9014 Normal testicular parenchyma Healthy individual 
N_0140 Normal testicular parenchyma Adjacent to tumor 
CIS_3493 IGCN  
CIS_3879 IGCN  
CIS_1692 IGCN  
Sem_0737 Seminoma  
Sem_1113 Seminoma  
Sem_1748 Seminoma  
EC_0502 Embryonal carcinoma  
EC_0564 Embryonal carcinoma  
EC_1017 Embryonal carcinoma  
EC_1740 Embryonal carcinoma  
EC_1838 Embryonal carcinoma  
Cc_0915 Choriocarcinoma  
YST_0216 Yolk sac tumor  
YST_0307 Yolk sac tumor  
YST_0738 Yolk sac tumor  
YST_2110 Yolk sac tumor  
Ter_0691 Teratoma  
Ter_0696 Teratoma  
Ter_1282 Teratoma  
Ter_2201 Teratoma  
EC_9018_0 Cell line, embryonal carcinoma NTERA2, without RA 
EC_9018_3 Cell line, embryonal carcinoma + differentiated cells NTERA2, 3 d with RA 
EC_9018_7 Cell line, embryonal carcinoma + differentiated cells NTERA2, 7 d with RA 
EC_9019_0 Cell line, embryonal carcinoma 2102Ep, without RA 
EC_9019_3 Cell line, embryonal carcinoma 2102Ep, 3 d RA 
EC_9019_7 Cell line, embryonal carcinoma 2102Ep, 7 d RA 
Self_self_9999 Reference vs reference  
Sample IDHistologyAdditional information
N_9013 Normal testicular parenchyma Healthy individual 
N_9014 Normal testicular parenchyma Healthy individual 
N_0140 Normal testicular parenchyma Adjacent to tumor 
CIS_3493 IGCN  
CIS_3879 IGCN  
CIS_1692 IGCN  
Sem_0737 Seminoma  
Sem_1113 Seminoma  
Sem_1748 Seminoma  
EC_0502 Embryonal carcinoma  
EC_0564 Embryonal carcinoma  
EC_1017 Embryonal carcinoma  
EC_1740 Embryonal carcinoma  
EC_1838 Embryonal carcinoma  
Cc_0915 Choriocarcinoma  
YST_0216 Yolk sac tumor  
YST_0307 Yolk sac tumor  
YST_0738 Yolk sac tumor  
YST_2110 Yolk sac tumor  
Ter_0691 Teratoma  
Ter_0696 Teratoma  
Ter_1282 Teratoma  
Ter_2201 Teratoma  
EC_9018_0 Cell line, embryonal carcinoma NTERA2, without RA 
EC_9018_3 Cell line, embryonal carcinoma + differentiated cells NTERA2, 3 d with RA 
EC_9018_7 Cell line, embryonal carcinoma + differentiated cells NTERA2, 7 d with RA 
EC_9019_0 Cell line, embryonal carcinoma 2102Ep, without RA 
EC_9019_3 Cell line, embryonal carcinoma 2102Ep, 3 d RA 
EC_9019_7 Cell line, embryonal carcinoma 2102Ep, 7 d RA 
Self_self_9999 Reference vs reference  

Cultures of the TGCT cell lines NTERA2/clone D1 and 2102Ep/clone 2A6 were maintained at high cell densities (>5 × 106 cells per 75 cm2 tissue culture flask) and incubated at 37°C under a humidified atmosphere of 10% CO2 in air in DMEM high glucose formulation (Life Technologies, Paisley, United Kingdom) supplemented with 10% FCS. NTERA2 cells were harvested for subculture by scraping with glass beads (5). 2102Ep cells were harvested using 0.25% trypsin in calcium- and magnesium-free Dulbecco's PBS containing 1 mmol/L EDTA (trypsin-EDTA; ref. 4). To induce differentiation, cells were harvested by using trypsin-EDTA and seeded at 106 per 75 cm2 tissue culture flask in medium containing 10−5 mol/L all-trans RA (Eastman-Kodak, Rochester, NY; ref. 4). For both NTERA2 and 2102Ep, cell pellets for RNA isolation were made at time points 0, 3, and 7 days of culturing with RA.

Gene expression microarrays. We used Agilent Human 1A oligomicroarrays (GEO accession no. GPL885, Agilent Technologies, Palo Alto, CA) containing 60-mer DNA probes synthesized in situ in a 22k format. Of 19,061 spots, 18,086 are noncontrols and there are 17,086 unique transcript sequences, matching to 15,989 unique human genes.8

8

TIGR Resourcerer 10.0 July 2004 Release (http://www.tigr.org/tigr-scripts/magic/r1.pl).

Probe preparations, hybridizations, image generation, and image analyses were done according to the manufacturer's protocol. Briefly, we obtained the total RNA fraction from ground tissue samples (in liquid N2) and cell culture pellets by using the Trizol reagent (Life Technologies, Rockville, MD). The RNA quality was evaluated by use of the Agilent 2100 Bioanalyzer (Agilent Technologies). Labeled cDNA was synthesized from 20 μg RNA (Fluorescent Direct Label kit, Agilent Technologies) in the presence of cyanine 3-dCTP for the test sample and cyanine 5-dCTP (Perkin-Elmer Life Sciences, Boston, MA) for the common reference, consisting of a pool of 10 human cell lines, including one from embryonal carcinoma (Universal Human Reference RNA, Stratagene, La Jolla, CA). Differentially labeled test and reference samples were mixed with Agilent control targets before hybridization onto the oligomicroarrays for 17 hours at 60°C in a rotating oven.

Data processing and statistics. The fluorescence intensities at the targets were detected by a laser confocal scanner (Agilent Technologies), and resulting images were processed using the Feature Extraction software, version 6.1.1.1 (Agilent Technologies). This included defining the spots, measuring intensities, flagging spots with inadequate measurements, subtracting local background, and LOWESS dye normalization. For spots that were not flagged as having inadequate measurements, ratios (sample over reference) of the processed intensities were used further. The measured ratios of all genes and samples were divided by the median of the ratios of the three normal samples before log2 transformation. This was done to facilitate the interpretation of the expression values, because now the sign of the value indicates relative up-regulation or down-regulation in the sample compared with the expression level in normal testis.

For the 60-mer DNA sequences that are represented more than once on the array (100 sequences are present in 10 times abundance), the medians were used for further analyses. The total gene expression matrix contained after this adjustment results for 17,086 unique sequences, of which 259 had technically inadequate measurements in >5 of the 29 samples and were excluded from the analyses. Thus, we ended up with 16,817 unique and valid transcript sequences represented by 15,756 individual gene symbols. For further analyses, missing values were estimated by the k-nearest neighbor imputation (k = 10).

Principal component analysis was done by input of the 1,575 genes that were altered in at least three samples to at least a 4-fold deviation from the median across the sample set (J-Express Pro version 2.0, MolMine, Bergen, Norway). Hierarchical clustering for improved visualization in Figs. 2 and 3 was done by average linkage of Euclidean distance measures (J-Express Pro).

Figure 2.

Histologic subtype-specific gene expression. The 10 most significantly overexpressed genes from each histologic subtype are shown. The genes were derived from individual significance analyses of microarrays comparing each of the histologic subgroup against the rest of the analyzed tissues. Clustering of genes within each subgroup was done for improved visualization.

Figure 2.

Histologic subtype-specific gene expression. The 10 most significantly overexpressed genes from each histologic subtype are shown. The genes were derived from individual significance analyses of microarrays comparing each of the histologic subgroup against the rest of the analyzed tissues. Clustering of genes within each subgroup was done for improved visualization.

Close modal
Figure 3.

Specific molecular signature for pluripotent and undifferentiated embryonal carcinoma cells. A stepwise gene selection algorithm was used to identify the genes (see text). The expression data are clustered for improved visualization. For the 2102Ep and NTERA2 cell lines, the numbers of days treated with RA are indicated.

Figure 3.

Specific molecular signature for pluripotent and undifferentiated embryonal carcinoma cells. A stepwise gene selection algorithm was used to identify the genes (see text). The expression data are clustered for improved visualization. For the 2102Ep and NTERA2 cell lines, the numbers of days treated with RA are indicated.

Close modal

Significance analysis of microarrays (SAM; ref. 8) was used to extract significantly differentially expressed genes characterizing each of the histologic subtypes. Here, the same false discovery rate (FDR) for all subtypes could not be used as, for example, a 1% FDR resulted in from 0 significant genes for the IGCN to 780 for the normal samples. Rather, we selected the same number of up-regulated and down-regulated genes from each histologic subtype (the 80 most up-regulated genes and the 20 most down-regulated genes) to the interhistologic subtype-regulated gene list.

In the first step of finding discriminator genes for the undifferentiated and pluripotent phenotype, a SAM analysis of embryonal carcinomas versus differentiated tissues was done (1.84% FDR; d = 0.965), resulting in 217 significant genes. These were again filtered, leaving only genes that were significant in the same direction from a second SAM analysis (10% FDR) comparing undifferentiated cell lines (the three samples from 2102Ep and the untreated sample from NTERA2) in one group and cell lines where differentiation were induced (the two treated NTERA2 samples) in the other.

Functional classes of gene products were assigned according to the Gene Ontology (GO) Consortium database.9

9

Gene Ontology Consortium downloadable database as of June 2004 (http://www.godatabase.org/dev/database/).

The total list of gene symbols with valid DNA microarray measurements and our lists of significant genes were along with the June 2004 version of the GO database uploaded into the GoMiner software (9). Here, the null hypothesis is that genes are evenly distributed among the GO categories represented by the genes on the microarray, and Fisher's exact tests are done to test the significance level for enrichment of genes within each GO category. Also for the pairwise comparisons of the self-curated gene lists (Table 2), Fisher's exact tests were used to estimate the statistical significance levels.

Table 2.

Overlapping genes between significant gene lists from the present study and gene categories known to be involved in the embryogenesis

CategoryCommon genes within 687 interhistologic subtype-regulated genesPCommon genes within the 68 genes with specific expression changes in the undifferentiated histologyP
Homeobox genes (209) EMX2, HESX1, I_1221473, NANOG, PAX3, PAX5, PCBD, POU5F1, SHOX2, TCF8, TLX2 0.30 HESX1, MOX2, NANOG, POU5F1 0.01 
Specifically expressed in embryonic stem cells (83) CYP26A1, DNMT3B, FABP5, GAL, GDF3, HMGB2, MGST1, NANOG, POU5F1, SEMA6A, SPS, TK1 3 × 10−4 DNMT3B, EIF4A1, GAL, GDF3, NANOG, POU5F1, SEMA6A 1 × 10−7 
WNT signaling components (79) AES, DKK1, FRAT2, FZD4, FZD7, TCF7L1, TCF7L2 0.26 FRAT2, WNT5A 0.05 
WNT signaling target genes (52) FZD7, ID2, SLC2A1 0.40  1.00 
TGF-β signaling (102) BMP2, GDF10, GDF3, ID2, INHA, MAPK4, TGFBI 0.16 GDF3 0.36 
Notch signaling (9)  1.00  1.00 
DNA methylation (16) DNMT3B, DNMT3L 0.15 DNMT3B, DNMT3L 2 × 10−3 
Genomic imprinting (6) DNMT3B, DNMT3L 0.03 DNMT3B, DNMT3L 3 × 10−4 
Imprinted genes (32)  1.00  1.00 
CategoryCommon genes within 687 interhistologic subtype-regulated genesPCommon genes within the 68 genes with specific expression changes in the undifferentiated histologyP
Homeobox genes (209) EMX2, HESX1, I_1221473, NANOG, PAX3, PAX5, PCBD, POU5F1, SHOX2, TCF8, TLX2 0.30 HESX1, MOX2, NANOG, POU5F1 0.01 
Specifically expressed in embryonic stem cells (83) CYP26A1, DNMT3B, FABP5, GAL, GDF3, HMGB2, MGST1, NANOG, POU5F1, SEMA6A, SPS, TK1 3 × 10−4 DNMT3B, EIF4A1, GAL, GDF3, NANOG, POU5F1, SEMA6A 1 × 10−7 
WNT signaling components (79) AES, DKK1, FRAT2, FZD4, FZD7, TCF7L1, TCF7L2 0.26 FRAT2, WNT5A 0.05 
WNT signaling target genes (52) FZD7, ID2, SLC2A1 0.40  1.00 
TGF-β signaling (102) BMP2, GDF10, GDF3, ID2, INHA, MAPK4, TGFBI 0.16 GDF3 0.36 
Notch signaling (9)  1.00  1.00 
DNA methylation (16) DNMT3B, DNMT3L 0.15 DNMT3B, DNMT3L 2 × 10−3 
Genomic imprinting (6) DNMT3B, DNMT3L 0.03 DNMT3B, DNMT3L 3 × 10−4 
Imprinted genes (32)  1.00  1.00 

NOTE: Enrichment analyses were also done (Fisher's exact Ps). In parentheses are the numbers of genes present on the microarray within each category. The actual genes making up each category list can be found in Supplementary Table S3.

Protein expression on tissue microarrays.In situ protein expression analyses of GAL and POU5F1 were done using immunohistochemistry on tissue microarrays containing 510 testicular tissue cores covering all histologic subtypes and clinical stages of TGCT (10). Tissue microarray sections were stained by immunohistochemistry using the EnVision+ system, peroxidase-horseradish peroxidase by following the manufacturer's protocol (DakoCytomation, Glostrup, Denmark). Briefly, one tissue microarray section for each antibody was deparaffinized and rehydrated, and high temperature antigen retrieval was done by microwave oven at 900 W. The endogenous peroxidase activity was blocked before incubation with the polyclonal antibodies GAL (C-20, sc-16413, 1:100, 2 μg IgG/mL, Santa Cruz Biotechnology, Inc., Santa Cruz, CA) and POU5F1 (H-134, sc-9081, 1:100, 2 μg IgG/mL, Santa Cruz Biotechnology) for 30 minutes at room temperature. Next, the sections were incubated with labeled polymer for 30 minutes and with the peroxidase substrate 3′3-diaminobenzidine tetrahydrochloride for 10 minutes before the sections were counterstained using hematoxylin, dehydrated, and mounted. Negative results on controls using IgG from nonimmunized rabbit and goat were achieved.

For GAL, cases with granular staining in the cytoplasm and/or the extracellular spaces were scored as positive. For POU5F1, tissue cores with staining in >5% of the relevant nuclei were scored as positive.

Gene expression profiles. Testicular tumorigenesis was investigated by gene expression profiling of normal testis, premalignant IGCN, the various malignant histologic subtypes of TGCT (seminoma, embryonal carcinoma, choriocarcinoma, yolk sac tumor, and teratoma), and two TGCT cell lines before and after differentiation (Fig. 1B). DNA microarrays containing oligo 60-mers of 17,086 well-annotated genes and transcripts were used to measure mRNA levels relative to a common universal human reference RNA. Raw data were deposited to the Gene Expression Omnibus public repository for microarray data (GEO accession no. GSE1818)10

according to the Minimum Information About a Microarray Experiment (MIAME) recommendations for recording and reporting microarray-based gene expression data (11).

To illustrate the relationship between expression profiles of the individual samples, principal component analysis was carried out (Fig. 1C). Here, samples from the same histologic subtypes appear in the vicinity of each other, demonstrating, as expected, that part of the variability in gene expression profiles of TGCTs is dependent on histologic type. A measure of the actual differences between the individual histologic subtypes is shown in the matrix of numbers of genes with expression medians deviating >3-fold between each pair of all combinations of the histologic subtypes (Fig. 1D). For example, there are 292 genes with a >3-fold higher expression value in seminomas than in embryonal carcinomas and vice versa and 182 genes that were more highly expressed by the embryonal carcinomas compared with seminomas. Interestingly, the pluripotent, undifferentiated cell lines have a similar number of genes differentially expressed from seminomas as they have from the embryonal carcinomas, the subtype that they are generally believed to model.

Histologic subtype-specific gene expression. We identified sets of differentially expressed genes within each of the histologic subgroups compared with the rest of the samples (individual SAM analyses; ref. 8). The top 10 up-regulated genes within each subgroup are shown in Fig. 2. From each of the seven histologic subtypes, 80 genes with the most significant up-regulation and the 20 most down-regulated genes were listed as interhistologic subtype-regulated genes (Supplementary Table S1). As 13 genes appeared for more than one histologic subtype, the complete set contained altogether 687 genes and not the expected 700. The GOs of the differentially expressed genes were explored, and several GO categories were significantly enriched within the interhistologic subtype-regulated gene list (Table 3; Supplementary Table S2). Significance levels of enrichment of genes belonging to embryogenesis-related gene categories were also calculated (Table 2). The 687 interhistologic subtype-regulated genes were particularly enriched among embryonic stem cell–specific genes (12 of 83, P = 2 × 10−4) identified by a DNA microarray study (12).

Table 3.

Significantly enriched (P < 0.01) GO terms within the interhistologic subtype-regulated genes (n = 687)

GO termGO IDChanged/totalPs
Biological process    
    Lipid transport 6869 9/49 0.0003 
    Hormone metabolism 42445 7/37 0.0013 
        C21-steroid hormone metabolism 8207 5/13 0.0002 
    Lipid metabolism 6629 29/336 0.0012 
        Steroid metabolism 8202 12/88 0.0007 
            Cholesterol metabolism 8203 6/38 0.0074 
    Copper ion homeostasis 6878 3/5 0.0009 
    Gonad development 8406 5/18 0.0011 
    Hormone biosynthesis 42446 5/18 0.0011 
        C21-steroid hormone biosynthesis 6700 4/12 0.0017 
    Chromatin assembly/disassembly 6333 8/52 0.0025 
        Nucleosome assembly 6334 5/23 0.0035 
    Lipid biosynthesis 8610 14/129 0.0025 
        Steroid biosynthesis 6694 7/42 0.0028 
    Icosanoid metabolism 6690 4/19 0.0100 
        Prostanoid metabolism 6692 3/8 0.0046 
            Prostaglandin metabolism 6693 3/8 0.0046 
    Glycosylceramide metabolism 6677 2/3 0.0062 
    Phosphocreatine metabolism 6603 2/3 0.0062 
    Cell cycle arrest 7050 4/17 0.0066 
Cellular component    
    Extracellular 5576 106/1,706 0.0008 
        Extracellular matrix 5578 22/243 0.0027 
            Collagen 5581 9/32 <0.0001 
            Sheet-forming collagen 30935 2/2 0.0021 
                Collagen type IV 5587 2/2 0.0021 
        Extracellular space 5615 96/1,541 0.0013 
    Chromatin 785 10/81 0.0040 
        Nucleosome 786 4/14 0.0031 
        Nuclear heterochromatin 5720 3/9 0.0067 
    Clathrin coat of coated pit 30132 3/8 0.0046 
    Vesicular fraction 42598 7/50 0.0077 
        Microsome 5792 7/50 0.0077 
Molecular function    
    Extracellular matrix structural constituent 5201 11/55 <0.0001 
        Extracellular matrix structural constituent conferring tensile strength 30020 8/30 <0.0001 
    Transferase activity, transferring alkyl or aryl (other than methyl) groups 16765 6/39 0.0084 
        Glutathione transferase activity 4364 5/13 0.0002 
    Galactosidase activity 15925 3/4 0.0004 
        β-Galactosidase activity 4565 2/2 0.0021 
    Intramolecular oxidoreductase activity 16860 5/27 0.0072 
        Intramolecular oxidoreductase activity, transposing C = C bonds 16863 3/8 0.0046 
            Steroid δ-isomerase activity 4769 2/2 0.0021 
        Prostaglandin D synthase activity 4667 2/3 0.0062 
    Oxidoreductase activity, acting on NADH or NADPH, nitrogenous group as acceptor 16657 2/2 0.0021 
        GMP reductase activity 3920 2/2 0.0021 
    Carbonyl reductase (NADPH) activity 4090 2/2 0.0021 
    Steroid dehydrogenase activity 16229 3/9 0.0067 
        3-β-Hydroxy-δ5-steroid dehydrogenase activity 3854 2/2 0.0021 
    Reduced folate carrier activity 8518 2/2 0.0021 
    Heparin binding 8201 7/40 0.0021 
    Copper ion binding 5507 6/30 0.0022 
    Folic acid binding 5542 3/7 0.0030 
    Lipid transporter activity 5319 6/34 0.0042 
    Creatine kinase activity 4111 2/3 0.0062 
    Diphosphoinositol-polyphosphate diphosphatase activity 8486 2/3 0.0062 
    Cyclic AMP–dependent protein kinase regulator activity 8603 3/9 0.0067 
GO termGO IDChanged/totalPs
Biological process    
    Lipid transport 6869 9/49 0.0003 
    Hormone metabolism 42445 7/37 0.0013 
        C21-steroid hormone metabolism 8207 5/13 0.0002 
    Lipid metabolism 6629 29/336 0.0012 
        Steroid metabolism 8202 12/88 0.0007 
            Cholesterol metabolism 8203 6/38 0.0074 
    Copper ion homeostasis 6878 3/5 0.0009 
    Gonad development 8406 5/18 0.0011 
    Hormone biosynthesis 42446 5/18 0.0011 
        C21-steroid hormone biosynthesis 6700 4/12 0.0017 
    Chromatin assembly/disassembly 6333 8/52 0.0025 
        Nucleosome assembly 6334 5/23 0.0035 
    Lipid biosynthesis 8610 14/129 0.0025 
        Steroid biosynthesis 6694 7/42 0.0028 
    Icosanoid metabolism 6690 4/19 0.0100 
        Prostanoid metabolism 6692 3/8 0.0046 
            Prostaglandin metabolism 6693 3/8 0.0046 
    Glycosylceramide metabolism 6677 2/3 0.0062 
    Phosphocreatine metabolism 6603 2/3 0.0062 
    Cell cycle arrest 7050 4/17 0.0066 
Cellular component    
    Extracellular 5576 106/1,706 0.0008 
        Extracellular matrix 5578 22/243 0.0027 
            Collagen 5581 9/32 <0.0001 
            Sheet-forming collagen 30935 2/2 0.0021 
                Collagen type IV 5587 2/2 0.0021 
        Extracellular space 5615 96/1,541 0.0013 
    Chromatin 785 10/81 0.0040 
        Nucleosome 786 4/14 0.0031 
        Nuclear heterochromatin 5720 3/9 0.0067 
    Clathrin coat of coated pit 30132 3/8 0.0046 
    Vesicular fraction 42598 7/50 0.0077 
        Microsome 5792 7/50 0.0077 
Molecular function    
    Extracellular matrix structural constituent 5201 11/55 <0.0001 
        Extracellular matrix structural constituent conferring tensile strength 30020 8/30 <0.0001 
    Transferase activity, transferring alkyl or aryl (other than methyl) groups 16765 6/39 0.0084 
        Glutathione transferase activity 4364 5/13 0.0002 
    Galactosidase activity 15925 3/4 0.0004 
        β-Galactosidase activity 4565 2/2 0.0021 
    Intramolecular oxidoreductase activity 16860 5/27 0.0072 
        Intramolecular oxidoreductase activity, transposing C = C bonds 16863 3/8 0.0046 
            Steroid δ-isomerase activity 4769 2/2 0.0021 
        Prostaglandin D synthase activity 4667 2/3 0.0062 
    Oxidoreductase activity, acting on NADH or NADPH, nitrogenous group as acceptor 16657 2/2 0.0021 
        GMP reductase activity 3920 2/2 0.0021 
    Carbonyl reductase (NADPH) activity 4090 2/2 0.0021 
    Steroid dehydrogenase activity 16229 3/9 0.0067 
        3-β-Hydroxy-δ5-steroid dehydrogenase activity 3854 2/2 0.0021 
    Reduced folate carrier activity 8518 2/2 0.0021 
    Heparin binding 8201 7/40 0.0021 
    Copper ion binding 5507 6/30 0.0022 
    Folic acid binding 5542 3/7 0.0030 
    Lipid transporter activity 5319 6/34 0.0042 
    Creatine kinase activity 4111 2/3 0.0062 
    Diphosphoinositol-polyphosphate diphosphatase activity 8486 2/3 0.0062 
    Cyclic AMP–dependent protein kinase regulator activity 8603 3/9 0.0067 

NOTE: Complete list and the actually dysregulated genes belonging to each of the GO terms can be found in Supplementary Table S2.

Comparison of embryonal carcinomas and seminomas. By comparing the data from seminomas and embryonal carcinomas, we extracted a list of 169 genes significantly differentially expressed between these histologic types (SAM, 2% FDR; Supplementary Table S4). Thirty-nine of these genes were specifically expressed in embryonal carcinomas compared with seminomas, and among these, GAL was the most significantly differentially expressed gene. Some of the other significant genes have been linked previously to TGCT [i.e., GDF3 (13, 14), DNMT3B (15), and BCAT (16)]. Among the 130 genes with higher expression in seminomas, PRAME (preferentially expressed antigen in melanoma) was the most significantly different, and the list also included the previously published SLC43A1 (POV1; refs. 17, 18) and KIT (1921). To ascertain how the TGCT cell lines express these 169 genes, a hierarchical cluster analysis showed that, as might be anticipated, the cell lines cluster more tightly to the embryonal carcinomas than to the seminomas (Supplementary Fig. S1). However, several of the genes with highest expression in the seminomas compared with the embryonal carcinomas are also highly expressed in the cell lines. This suggests that the TGCT cell lines 2102Ep and NTERA2 have a transcriptional phenotype that to some extent lies between those of seminoma and embryonal carcinoma histologic subtypes.

Undifferentiated and pluripotency-specific gene expression. To identify genes that are robustly differentially expressed between undifferentiated and pluripotent embryonal carcinoma and differentiated tissues or cell lines, a stepwise gene selection was implemented. First, SAM analysis was done, comparing the two extreme in vivo groups in terms of differentiation [i.e., the highly undifferentiated embryonal carcinomas (n = 5) versus the differentiated tissue types (n = 12; normal testis, yolk sac tumor, choriocarcinoma, and teratoma)]. This resulted in 176 genes that were most highly expressed in the embryonal carcinomas and 41 genes that were most highly expressed in the differentiated tissues (Table 4; Supplementary Table S5). The same analysis was also done, including only malignant tissue samples, and again, the same five genes, GAL, POU5F1, NANOG, DPPA4, and MT1H, appeared as the most significantly overexpressed in embryonal carcinoma (data not shown). The 217 (176 + 41) genes that were overall differentially expressed between the tissue samples were filtered further, leaving only those genes showing up-regulation or down-regulation in the in vitro models of TGCT differentiation (altered in the same direction after RA treatment for the pluripotent NTERA2 but remained unchanged in the relatively nullipotent 2102Ep). This left us with 68 genes, 58 that are characteristic of the undifferentiated embryonic stem cell–like state and 10 that are characteristic of the differentiated derivatives (Fig. 3). Significantly enriched GO terms assigned to these 68 genes are listed in Table 5. The involvement of these genes in various gene function categories related to developmental biology are found from Table 2. Gene categories, such as DNA methylation genes, homeobox genes, and genes highly expressed in embryonic stem cells, were particularly abundant among the 68 genes specifically expressed in undifferentiated embryonal carcinomas.

Table 4.

The most significantly up-regulated and down-regulated genes (top 15 of each; complete list from SAM analysis in Supplementary Table S5) in the undifferentiated and pluripotent embryonal carcinomas compared with the differentiated tissue samples (normal testis and the differentiated components of TGCT choriocarcinoma, yolk sac tumor, and teratoma)

Gene symbolGene named score
Highly expressed in embryonal carcinomas   
    GAL Galanin 8.1 
    POU5F1 POU domain, class 5, transcription factor 1 7.8 
    NANOG Nanog homeobox 6.8 
    DPPA4 Developmental pluripotency associated 4 6.7 
    MT1H Metallothionein 1H 6.6 
    FLJ10884 Hypothetical protein FLJ10884 6.3 
    BCAT1 Branched chain aminotransferase 1, cytosolic 5.9 
    DNMT3B DNA (cytosine-5)-methyltransferase 3β 5.9 
    APEH N-acylaminoacyl-peptide hydrolase 5.8 
    MRS2L MRS2-like, magnesium homeostasis factor (Saccharomyces cerevisiae5.7 
    JARID2 Jumonji, AT-rich interactive domain 2 5.7 
    VSNL1 Visinin-like 1 5.6 
    MT2A Metallothionein 2A 5.5 
    MT1F Metallothionein 1F (functional) 5.4 
    DNMT3L DNA (cytosine-5)-methyltransferase 3-like 5.0 
Down-regulated in embryonal carcinomas   
    MGC5576 Hypothetical protein MGC5576 −4.7 
    TTC3 Tetratricopeptide repeat domain 3 −4.3 
    C9orf154 Chromosome 9 open reading frame 154 −4.2 
    FLJ13615 Hypothetical protein FLJ13615 −4.1 
    GPSM1 G-protein signaling modulator 1 (AGS3-like, Caenorhabditis elegans−4.0 
    PPP3CA Protein phosphatase 3, catalytic subunit, α isoform −3.9 
    MGC:13379 HSPC244 −3.9 
    GPSM1 G-protein signaling modulator 1 (AGS3-like, C. elegans−3.9 
    TCEA2 Transcription elongation factor A (SII), 2 −3.8 
    KIAA1223 KIAA1223 protein −3.8 
    CD99 CD99 antigen −3.7 
    PPA2 Inorganic pyrophosphatase 2 −3.7 
    SMARCA1 SWI/SNF related, matrix-associated, actin-dependent regulator of chromatin, subfamily A, member 1 −3.7 
    LAMB2 Laminin β2 (laminin S) −3.6 
    SEC15L SEC15 (S. cerevisiae)-like −3.6 
Gene symbolGene named score
Highly expressed in embryonal carcinomas   
    GAL Galanin 8.1 
    POU5F1 POU domain, class 5, transcription factor 1 7.8 
    NANOG Nanog homeobox 6.8 
    DPPA4 Developmental pluripotency associated 4 6.7 
    MT1H Metallothionein 1H 6.6 
    FLJ10884 Hypothetical protein FLJ10884 6.3 
    BCAT1 Branched chain aminotransferase 1, cytosolic 5.9 
    DNMT3B DNA (cytosine-5)-methyltransferase 3β 5.9 
    APEH N-acylaminoacyl-peptide hydrolase 5.8 
    MRS2L MRS2-like, magnesium homeostasis factor (Saccharomyces cerevisiae5.7 
    JARID2 Jumonji, AT-rich interactive domain 2 5.7 
    VSNL1 Visinin-like 1 5.6 
    MT2A Metallothionein 2A 5.5 
    MT1F Metallothionein 1F (functional) 5.4 
    DNMT3L DNA (cytosine-5)-methyltransferase 3-like 5.0 
Down-regulated in embryonal carcinomas   
    MGC5576 Hypothetical protein MGC5576 −4.7 
    TTC3 Tetratricopeptide repeat domain 3 −4.3 
    C9orf154 Chromosome 9 open reading frame 154 −4.2 
    FLJ13615 Hypothetical protein FLJ13615 −4.1 
    GPSM1 G-protein signaling modulator 1 (AGS3-like, Caenorhabditis elegans−4.0 
    PPP3CA Protein phosphatase 3, catalytic subunit, α isoform −3.9 
    MGC:13379 HSPC244 −3.9 
    GPSM1 G-protein signaling modulator 1 (AGS3-like, C. elegans−3.9 
    TCEA2 Transcription elongation factor A (SII), 2 −3.8 
    KIAA1223 KIAA1223 protein −3.8 
    CD99 CD99 antigen −3.7 
    PPA2 Inorganic pyrophosphatase 2 −3.7 
    SMARCA1 SWI/SNF related, matrix-associated, actin-dependent regulator of chromatin, subfamily A, member 1 −3.7 
    LAMB2 Laminin β2 (laminin S) −3.6 
    SEC15L SEC15 (S. cerevisiae)-like −3.6 
Table 5.

The most significantly enriched GO classes related to the 68 genes up-regulated or down-regulated genes in the pluripotent phenotype

GO termGO IDTotal genes in categoryOverexpressedUnderexpressedPs for enrichment
Imprinting 6349 DNMT3B, DNMT3L  0.0003 
Nuclear heterochromatin 5720 DNMT3B, DNMT3L  0.0007 
Nuclear chromatin 790 14 DNMT3B, DNMT3L  0.0018 
Heterochromatin 792 16 DNMT3B, DNMT3L  0.0024 
Large ribosomal subunit 15934 74 MRPL18, RPL18A, RPL28  0.0047 
Steroid binding 5496 27 HSD17B4 HES1 0.0068 
Chromosome 5694 165 DNMT3B, DNMT3L, HUS1, TERF1  0.0068 
Development 7275 1,423 ADORA2E, CD9, DNMT3B, DNMT3L, GAL, GDF3, HESX1, JARID2, LST1, POU5F1, SEMA6A HES1, WNT5A 0.0090 
G-protein-coupled receptor binding 1664 35 GAL WNT5A 0.0112 
Cytosolic large ribosomal subunit 5842 35 RPL18A, RPL28  0.0112 
Regulation of gene expression, epigenetic 40029 39 DNMT3B, DNMT3L  0.0138 
Nuclear chromosome 228 39 DNMT3B, DNMT3L  0.0138 
Neurogenesis 7399 317 GAL, HESX1, POU5F1, SEMA6A HES1 0.0145 
Steroid biosynthesis 6694 42 HSD17B4 HES1 0.0159 
Brain development 7420 52 HESX1, POU5F1  0.0238 
Embryonic development 9790 142 DNMT3L, POU5F1 WNT5A 0.0273 
Protein targeting 6605 146 HSD17B4, POU5F1 PPP3CA 0.0293 
Embryonic morphogenesis 9795 61 POU5F1 WNT5A 0.0320 
Ribosome 5840 154 MRPL18, RPL18A, RPL28  0.0336 
Structural constituent of ribosome 3735 154 MRPL18, RPL18A, RPL28  0.0336 
Cytosolic ribosome (sensu Eukarya) 5830 64 RPL18A, RPL28  0.0349 
Nucleic acid binding 3676 1,529 APEH, DNMT3B, EIF4A1, HESX1, JARID2, NANOG, NARS, POU5F1, RPL18A, RPL28, TERF1 HES1 0.0380 
Central nervous system development 7417 69 HESX1, POU5F1  0.0401 
Protein metabolism 19538 1,589 APEH, COX11, EIF4A1, FKBP4, HSD17B4, HUS1, NARS, PIM2, POU5F1, RPL18A, RPL28 PPP3CA 0.0494 
GO termGO IDTotal genes in categoryOverexpressedUnderexpressedPs for enrichment
Imprinting 6349 DNMT3B, DNMT3L  0.0003 
Nuclear heterochromatin 5720 DNMT3B, DNMT3L  0.0007 
Nuclear chromatin 790 14 DNMT3B, DNMT3L  0.0018 
Heterochromatin 792 16 DNMT3B, DNMT3L  0.0024 
Large ribosomal subunit 15934 74 MRPL18, RPL18A, RPL28  0.0047 
Steroid binding 5496 27 HSD17B4 HES1 0.0068 
Chromosome 5694 165 DNMT3B, DNMT3L, HUS1, TERF1  0.0068 
Development 7275 1,423 ADORA2E, CD9, DNMT3B, DNMT3L, GAL, GDF3, HESX1, JARID2, LST1, POU5F1, SEMA6A HES1, WNT5A 0.0090 
G-protein-coupled receptor binding 1664 35 GAL WNT5A 0.0112 
Cytosolic large ribosomal subunit 5842 35 RPL18A, RPL28  0.0112 
Regulation of gene expression, epigenetic 40029 39 DNMT3B, DNMT3L  0.0138 
Nuclear chromosome 228 39 DNMT3B, DNMT3L  0.0138 
Neurogenesis 7399 317 GAL, HESX1, POU5F1, SEMA6A HES1 0.0145 
Steroid biosynthesis 6694 42 HSD17B4 HES1 0.0159 
Brain development 7420 52 HESX1, POU5F1  0.0238 
Embryonic development 9790 142 DNMT3L, POU5F1 WNT5A 0.0273 
Protein targeting 6605 146 HSD17B4, POU5F1 PPP3CA 0.0293 
Embryonic morphogenesis 9795 61 POU5F1 WNT5A 0.0320 
Ribosome 5840 154 MRPL18, RPL18A, RPL28  0.0336 
Structural constituent of ribosome 3735 154 MRPL18, RPL18A, RPL28  0.0336 
Cytosolic ribosome (sensu Eukarya) 5830 64 RPL18A, RPL28  0.0349 
Nucleic acid binding 3676 1,529 APEH, DNMT3B, EIF4A1, HESX1, JARID2, NANOG, NARS, POU5F1, RPL18A, RPL28, TERF1 HES1 0.0380 
Central nervous system development 7417 69 HESX1, POU5F1  0.0401 
Protein metabolism 19538 1,589 APEH, COX11, EIF4A1, FKBP4, HSD17B4, HUS1, NARS, PIM2, POU5F1, RPL18A, RPL28 PPP3CA 0.0494 

NOTE: Only significant GO categories with at least two changed genes are shown.

Intratubular germ cell neoplasia versus normal testis. A comparison of normal testis and premalignant IGCN yielded 96 significantly differentially expressed genes (SAM, 1.6% FDR; Supplementary Fig. S2). The pluripotency-specific genes NANOG, DPPA4, and POU5F1 as well as the KIT gene were all among the 12 most significantly up-regulated genes in the premalignant IGCN.

Protein expression of GAL and POU5F1. The in situ protein expression of GAL and POU5F1, two genes identified as significantly overexpressed in embryonal carcinoma, were analyzed on tissue microarrays containing 510 tissue samples from normal testis, IGCN, and the various histologic subgroups of TGCT. Both GAL and POU5F1 were negative for all normal testis samples but were positive for 21% and 54% of the overall tumor samples, respectively, and both with significant differences between various histologic subtypes (Fig. 4). Interestingly, the frequency of POU5F1-positive seminomas (63%) was similar to that of embryonal carcinomas (65%), whereas for GAL this was different (8% positive seminomas and 34% positive embryonal carcinomas; P = 5 × 10−6).

Figure 4.

In situ protein expression of GAL and POU5F1. A tissue microarray containing 510 tissue samples from normal testis, IGCN, and the various histologic subgroups of TGCT was used. Histograms of the frequencies of positive tumors according to histologic subtypes are shown for GAL (A) and POU5F1 (B). Bars, SE. Examples of immunohistochemical detection of GAL and POU5F1 are also shown, where the frames' colors indicate histologic subgroups as color coded in the histograms.

Figure 4.

In situ protein expression of GAL and POU5F1. A tissue microarray containing 510 tissue samples from normal testis, IGCN, and the various histologic subgroups of TGCT was used. Histograms of the frequencies of positive tumors according to histologic subtypes are shown for GAL (A) and POU5F1 (B). Bars, SE. Examples of immunohistochemical detection of GAL and POU5F1 are also shown, where the frames' colors indicate histologic subgroups as color coded in the histograms.

Close modal

Gene expression studies in human testicular germ cell tumors. We have explored the transcriptional programs altered during the early embryo-like development of TGCT and present gene expression profiles from both an in vivo and an in vitro setting from pluripotent and undifferentiated tumor subtypes to those more differentiated. To our knowledge, this is the first comprehensive DNA microarray study analyzing all the malignant histologic subtypes of TGCT as well as premalignant IGCN and normal testis. The applied oligomicroarrays, covering most annotated genes in our genome, and the universal human reference RNA are commercially available; thus, the current expression data can readily be compared with and supplemented with other expression data. The first study applying microarrays to TGCT reported expression of genes located to chromosome 17 as well as of proto-oncogenes located genome-wide (17). Following this, additional microarray studies of primary TGCT have been published, either focusing into specific genomic regions (16, 22) or concentrating on selected histologic subtypes (15, 18). These studies were all done by use of different platforms, mostly in-house made DNA microarrays and/or reference RNA samples, and cross-comparisons between studies are thus not a trivial issue.

Histologic subtype-specific gene expression and parallels to early embryogenesis. The transcriptional profiling of individual histologic subtypes using SAM analyses revealed mostly novel genes with respect to testicular tumorigenesis. However, some previously known biological and clinical markers also appeared and add support to the importance of the resulting genes, reliability of the technology, and strength of the statistics [e.g., α-fetoprotein (AFP) was highly expressed in yolk sac tumors, and subunits of the chorionic gonadotropin (CGB5 and CGA) were highly expressed in choriocarcinomas (Fig. 2)]. In addition, when comparing normal testis and IGCN (Supplementary Fig. S2), several genes that are expressed in the IGCN samples, such as KIT and the pluripotency markers NANOG and POU5F1, were also identified in a recent DNA microarray study focusing on IGCN (23). Among the novel genes to this study, DPPA4 was specifically expressed in IGCN compared with normal testis, and interestingly, the expression of these pluripotency-specific genes was also seen in seminomas (Fig. 3).

When we searched for gene expression patterns characterizing the pluripotent and undifferentiated phenotype of embryonal carcinomas, we did therefore not include IGCN and seminomas in any of the groups compared to avoid exclusion of the pluripotency-specific markers. The 68 resulting genes with specific expression pattern in the undifferentiated and pluripotent phenotype of embryonal carcinomas included 7 genes that were reported previously as enriched in embryonic stem cells (P = 1 × 10−7; Table 2; ref. 12), and 2 of these, NANOG and POU5F1, were also abundantly present in IGCN and seminomas. Both these genes are known as key regulators of pluripotency (2426), and IGCN and seminoma cells might therefore have similar pluripotent capabilities as embryonal carcinoma cells. Also on the protein level, as seen from immunohistochemistry on the tissue microarrays, POU5F1 was present in most IGCN and seminomas. This was not the case for GAL, which was highly associated to embryonal carcinomas compared with seminomas. However, a few seminomas were also positive for GAL, which lends support to the existence of a distinct subgroup of seminomas that resembles embryonal carcinomas on the molecular level.11

11

Hofer et al., unpublished data.

For the nonseminomas, morphologic parallels to early embryogenesis are described. In the same way that embryonal carcinomas share many characteristics with the inner cell mass of blastocysts (27), the choriocarcinomas parallel the trophoblastic and syncytiotrophoblastic cells of the placenta, yolk sac tumors parallel the endodermal differentiation giving rise to the yolk sac, and teratomas parallel the differentiation into somatic tissues of all three primary germ layers (2). Hence, transcriptional programs of the early embryogenesis are likely to be altered in testicular tumorigenesis. The enrichment of embryonic stem cell–specific genes in testicular tumorigenesis validates this morphologic link to early embryogenesis also on the gene transcription level. Hence, by presenting gene expression patterns specific to the histologic subtypes of human TGCT (Fig. 2), we also provide novel transcriptional information relevant to the parallel differentiation stages of human development.

Pluripotent embryonal carcinomas are considered to be the malignant counterparts of embryonic stem cells (28). Embryonic stem cells are again potential mediators to regenerative medicine, but as transplants may be rejected from the host's immune system, it would be advantageous if personalized embryonic stem cells could be made from human adult stem cells by dedifferentiation back to the embryonic stage without the need for cloning (29). The identified embryonal carcinoma–specific transcripts (as in Fig. 3) can potentially work as reprogramming factors that lead to dedifferentiation of adult cells into personalized embryonic stem cells. Human embryonic stem cells have thus far been considered pluripotent, rather than totipotent, because of their limited differentiation potency into extraembryonic membranes and tissue types (30). Recently, however, they were shown to have the capability to differentiate into trophoblast cells (31), an observation that may broaden the definition of their developmental potential. However, as TGCTs often contain tissue types with extraembryonic morphology (yolk sac tumor and choriocarcinoma), the current histologic subtype-specific gene expression profiles (Fig. 2) may indeed contain information that can be useful in the search for ways to make the embryonic stem cells differentiate into these lineages; e.g., by overexpression of genes specifically expressed in the TGCT subtype paralleling the desired lineage differentiation.

Gene categories involved in early embryogenesis. Knowledge from the field of embryogenesis may also be used to understand germ cell tumorigenesis, and we have identified subsets of genes that are also known to play a role in early embryonic development (Table 2). For example, all the four most significantly overexpressed genes in the undifferentiated embryonal carcinomas, POU5F1, GAL, NANOG, and DPPA4 (Table 4), are known as pluripotency-associated genes (12). For WNT signaling, we found seven components of this pathway within the interhistologic subtype-regulated genes list (Table 2). Except FRAT2, which was highest expressed in seminomas and embryonal carcinomas, these WNT signaling components (AES, DKK1, FZD4, TCF7L1, and TCF7L2) were highest expressed in yolk sac tumors and teratomas. Further, the three significantly altered target genes of this signaling pathway, FZD7, ID2, and, SLC2A1, were also up-regulated in the yolk sac tumors and teratomas. Thus, activation of the WNT signaling pathway may be a key step in the transition of embryonal carcinomas into differentiated nonseminomas.

High expression of the de novo DNA methyltransferase DNMT3B and its homologue involved in establishment of imprinting, DNMT3L, was significantly associated with the embryonal carcinoma subtype (Fig. 3; Tables 2 and 4). The third homologue, DNMT3A, had a similar expression profile not reaching statistical significance. This discovery of an up-regulated DNA methylation machinery in embryonal carcinomas provide a malignant parallel to the complete epigenetic reprogramming events taking place in the inner cell mass of the blastocysts (32, 33).

In vivo versus in vitro clues to potential drug targets. The RA-induced differentiation of embryonal carcinoma cell lines in this study is used to model the in vivo differentiation of embryonal carcinomas. The shared transcriptional changes between the in vivo and the in vitro systems (Fig. 3) are of special interest as they may be specific and/or necessary to make embryonal carcinoma differentiate and play biologically as well as clinically important roles. In fact, one of the genes sorted out by this approach, NALP7, was recently confirmed to play a functional oncogenic role in TGCT (34). In the treatment of TGCT, it could therefore be advantageous to repress the genes specifically expressed in the undifferentiated cells (e.g., GAL, POU5F1, NANOG, and DPPA4), as growth and malignant potential could be repressed by forcing the cells into terminal differentiation. Further intriguing, cancer drugs targeting genes that in the normal setting are specifically expressed during the blastocyst stage should in the postembryonal body be specific to tumor cells.

The current data could also facilitate cancer drug discovery by use of the genes always expressed in the differentiated tissues but not in the undifferentiated ones (e.g., MGC5576, TTC3, or C9orf154; Table 4), genes that may be used as general markers of differentiation. Their gene products can be used in drug screening programs where the agents causing expression of these selective markers in embryonal carcinoma cells are considered candidate cancer drugs.

In conclusion, we have presented the first comprehensive gene expression profiling of all histologic subtypes of TGCT as well as of precursor lesions and normal testis. This pinpointed specifically expressed genes within all analyzed subgroups. We also uncovered novel transcriptional information about tumorigenesis and its functional relation to normal development. The key genes described, as GAL and POU5F1, will be useful in molecularly assisted diagnosis as well as serve as potential drug targets.

Note: Supplementary data for this article are available at Cancer Research Online (http://cancerres.aacrjournals.org/).

Grant support: Norwegian Cancer Society grant A95068 (R.A. Lothe, R.I. Skotheim, and G.E. Lind), Nordic Academy for Advanced Study (R.I. Skotheim), and Biotechnology and Biological Sciences Research Council and Yorkshire Cancer Research (P.W. Andrews).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

We thank Christine Pigott for the cell culturing work and Tuula Airaksinen for the demonstration of DNA microarray protocols.

1
Teilum G. Comparative pathology—tumors of germ cell origin. In: Special tumors of ovary and testis. 2nd ed. Copenhagen: Munksgaard; 1976. p. 32–119.
2
Mostofi FK, Sesterhenn IA. WHO International histological classification of tumours: histological typing of testis tumours. 2nd ed. Berlin: Springer-Verlag; 1998.
3
Andrews PW, Casper J, Damjanov I, et al. Comparative analysis of cell surface antigens expressed by cell lines derived from human germ cell tumours.
Int J Cancer
1996
;
66
:
806
–16.
4
Andrews PW. Retinoic acid induces neuronal differentiation of a cloned human embryonal carcinoma cell line in vitro.
Dev Biol
1984
;
103
:
285
–93.
5
Andrews PW, Damjanov I, Simon D, et al. Pluripotent embryonal carcinoma clones derived from the human teratocarcinoma cell line Tera-2. Differentiation in vivo and in vitro.
Lab Invest
1984
;
50
:
147
–62.
6
Matthaei KI, Andrews PW, Bronson DL. Retinoic acid fails to induce differentiation in human teratocarcinoma cell lines that express high levels of a cellular receptor protein.
Exp Cell Res
1983
;
143
:
471
–4.
7
Andrews PW, Bronson DL, Benham F, Strickland S, Knowles BB. A comparative study of eight cell lines derived from human testicular teratocarcinoma.
Int J Cancer
1980
;
26
:
269
–80.
8
Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response.
Proc Natl Acad Sci U S A
2001
;
98
:
5116
–21.
9
Zeeberg BR, Feng W, Wang G, et al. GoMiner: a resource for biological interpretation of genomic and proteomic data.
Genome Biol
2003
;
4
:
R28
.
10
Skotheim RI, Abeler VM, Nesland JM, et al. Candidate genes for testicular cancer evaluated by in situ protein expression analyses on tissue microarrays.
Neoplasia
2003
;
5
:
397
–404.
11
Brazma A, Hingamp P, Quackenbush J, et al. Minimum information about a microarray experiment (MIAME)—toward standards for microarray data.
Nat Genet
2001
;
29
:
365
–71.
12
Bhattacharya B, Miura T, Brandenberg R, et al. Gene expression in human embryonic stem cell lines: unique molecular signature.
Blood
2004
;
103
:
2956
–64.
13
Caricasole AAD, van Schaik RHN, Zeinstra LM, et al. Human growth-differentiation factor 3 (hGDF3): developmental regulation in human teratocarcinoma cell lines and expression in primary testicular germ cell tumours.
Oncogene
1998
;
16
:
95
–103.
14
Clark AT, Rodriguez RT, Bodnar MS, et al. Human STELLAR, NANOG, and GDF3 genes are expressed in pluripotent cells and map to chromosome 12p13, a hotspot for teratocarcinoma.
Stem Cells
2004
;
22
:
169
–79.
15
Sperger JM, Chen X, Draper JS, et al. Gene expression patterns in human embryonic stem cells and human pluripotent germ cell tumors.
Proc Natl Acad Sci U S A
2003
;
100
:
13350
–5.
16
Rodriguez S, Jafer O, Goker H, et al. Expression profile of genes from 12p in testicular germ cell tumors of adolescents and adults associated with i(12p) and amplification at 12p11.2-p12.1.
Oncogene
2003
;
22
:
1880
–91.
17
Skotheim RI, Monni O, Mousses S, et al. New insights into testicular germ cell tumorigenesis from gene expression profiling.
Cancer Res
2002
;
62
:
2359
–64.
18
Okada K, Katagiri T, Tsunoda T, et al. Analysis of gene-expression profiles in testicular seminomas using a genome-wide cDNA microarray.
Int J Oncol
2003
;
23
:
1615
–35.
19
Tian Q, Frierson HF Jr, Krystal GW, Moskaluk CA. Activating c-kit gene mutations in human germ cell tumors.
Am J Pathol
1999
;
154
:
1643
–7.
20
Looijenga LH, De Leeuw H, van Oorschot M, et al. Stem cell factor receptor (c-KIT) codon 816 mutations predict development of bilateral testicular germ-cell tumors.
Cancer Res
2003
;
63
:
7674
–8.
21
Kemmer K, Corless CL, Fletcher JA, et al. KIT mutations are common in testicular seminomas.
Am J Pathol
2004
;
164
:
305
–13.
22
Bourdon V, Naef F, Rao PH, et al. Genomic and expression analysis of the 12p11-p12 amplicon using EST arrays identifies two novel amplified and overexpressed genes.
Cancer Res
2002
;
62
:
6218
–23.
23
Almstrup K, Hoei-Hansen CE, Wirkner U, et al. Embryonic stem cell-like features of testicular carcinoma in situ revealed by genome-wide gene expression profiling.
Cancer Res
2004
;
64
:
4736
–43.
24
Nichols J, Zevnik B, Anastassiadis K, et al. Formation of pluripotent stem cells in the mammalian embryo depends on the POU transcription factor Oct4.
Cell
1998
;
95
:
379
–91.
25
Chambers I, Colby D, Robertson M, et al. Functional expression cloning of Nanog, a pluripotency sustaining factor in embryonic stem cells.
Cell
2003
;
113
:
643
–55.
26
Mitsui K, Tokuzawa Y, Itoh H, et al. The homeoprotein Nanog is required for maintenance of pluripotency in mouse epiblast and ES cells.
Cell
2003
;
113
:
631
–42.
27
Henderson JK, Draper JS, Baillie HS, et al. Preimplantation human embryos and embryonic stem cells show comparable expression of stage-specific embryonic antigens.
Stem Cells
2002
;
20
:
329
–37.
28
Andrews PW. From teratocarcinomas to embryonic stem cells.
Philos Trans R Soc Lond B Biol Sci
2002
;
357
:
405
–17.
29
Dennis C. Developmental reprogramming: take a cell, any cell.
Nature
2003
;
426
:
490
–1.
30
Thomson JA, Itskovitz-Eldor J, Shapiro SS, et al. Embryonic stem cell lines derived from human blastocysts.
Science
1998
;
282
:
1145
–7.
31
Xu RH, Chen X, Li DS, et al. BMP4 initiates human embryonic stem cell differentiation to trophoblast.
Nat Biotechnol
2002
;
20
:
1261
–4.
32
Monk M, Boubelik M, Lehnert S. Temporal and regional changes in DNA methylation in the embryonic, extraembryonic and germ cell lineages during mouse embryo development.
Development
1987
;
99
:
371
–82.
33
Kierszenbaum AL. Genomic imprinting and epigenetic reprogramming: unearthing the garden of forking paths.
Mol Reprod Dev
2002
;
63
:
269
–72.
34
Okada K, Hirota E, Mizutani Y, et al. Oncogenic role of NALP7 in testicular seminomas.
Cancer Sci
2004
;
95
:
949
–54.