The ability to induce pluripotent stem cells from committed, somatic human cells provides tremendous potential for regenerative medicine. However, there is a defined neoplastic potential inherent to such reprogramming that must be understood and may provide a model for understanding key events in tumorigenesis. Using genome-wide assays, we identify cancer-related epigenetic abnormalities that arise early during reprogramming and persist in induced pluripotent stem cell (iPS) clones. These include hundreds of abnormal gene silencing events, patterns of aberrant responses to epigenetic-modifying drugs resembling those for cancer cells, and presence in iPS and partially reprogrammed cells of cancer-specific gene promoter DNA methylation alterations. Our findings suggest that by studying the process of induced reprogramming, we may gain significant insight into the origins of epigenetic gene silencing associated with human tumorigenesis, and add to means of assessing iPS for safety. Cancer Res; 70(19); 7662–73. ©2010 AACR.

Induction of pluripotent stem cells (iPS) from committed, somatic cells encompasses exciting biology with tremendous potential for regenerative medicine (1, 2). However, such cells are currently generated using multiple inducing factors with oncogenic potential (36), and mice generated with iPS have increased tumorigenicity and mortality (7). Moreover, fully reprogrammed human iPS phenotype arise only as rare clonal populations (≤0.1%) among partially reprogrammed cells some of which display an immortalized phenotype (8). Thus, many new metrics are needed to characterize iPS (4, 9).

Induction of iPS from committed cells necessitates that a myriad of epigenetic parameters must be reversed and properly reestablished. Genome-wide studies of chromatin and DNA methylation patterns (10, 11) indicate that this is globally done remarkably well including required reversal of promoter DNA methylation and reexpression of pluripotency-related genes such as OCT4 (POU5F1), both of which mark successful reprogramming (10, 12). However, multiple loci can fail in this reversal of DNA methylation (10, 12), and use of drug-induced DNA demethylation can improve efficiency of obtaining iPS (10, 13, 14). Another implication of such abnormal loci is that the relatively short duration of the reprogramming process (2–3 weeks; refs. 1, 2, 4, 10) might implicate epigenetic mechanisms in the neoplastic potential of iPS. A recent study compared iPS to cancer (15), but the most cancer-specific, proximal promoter epigenetic changes accompanying abnormal gene expression were not outlined. We now address this issue using genome-wide analysis approaches to match gene silencing associated with DNA hypermethylation of normally unmethylated CpG island–containing promoters (16). We identify cancer-related epigenetic abnormalities associated with the timing, and degree, of inducing cellular reprogramming.

Reprogramming protocols

MP2 and MP4 cells were reprogrammed from IMR90 fibroblasts by lentiviral vectors, and retroviral vectors were used for all other protocols (17).

Expression data

Global gene expression was analyzed using Agilent whole human genome, 4x44K, microarrays as previously described (18). Heat map color schemes are based on hierarchical clustering by Ward's algorithm and standard Euclidean distance on log-transformed expression measures adjusted for cutoffs for silent (red) and active (green) genes as previously determined (19).

Genome-wide DNA methylation analysis

We used the Infinium (Illumina, Inc.) platform (20) to analyze bisulfite-treated DNA (EZ DNA Methylation kit, Zymo Research). In this hybridization procedure, β values are generated as the signal of methylation-specific probe over the sum of the signals of the methylation- and unmethylated-specific probes and assigning a score of 1.0 for full methylation of a specific CpG site, 0 for absence of methylation, and 0 ≤ β ≤ 1 for all signals between (21). Probes with poor overall signals (P > 0.05) were removed from analysis. For all genes, only probes positioned from −1,000 to +200 bp around transcription start site (TSS) are analyzed. In vitro, DNA-methylated, genomic DNA (IVD) and DNA from DKO cells genetically deleted for DNA methyltransferases (DNMT) 1 and 3b (22) serve as fully methylated and unmethylated controls, respectively. Heat maps are based on hierarchical clustering of β values using Euclidean distance and Ward's algorithm, and all probes were mapped to the genome (National Center for Biotechnology Information Build 36.3) using the bowtie algorithm and ultrafast and memory-efficient alignment of short DNA sequences (Genome Biology, 10, R25) with genome annotation via the matching release of the Ensembl database. X-linked genes were removed from analyses.

Drug responses

Drug-responsive genes were selected from Agilent expression arrays (see Supplementary Fig. S3) based on a 1.41-fold expression (0.5 on a log2 scale) difference between mock versus 5-aza-2′-deoxycytidine (DAC)– or trichostatin A (TSA)–treated samples and classified as responsive to DAC alone but not TSA, to TSA alone but not DAC, and to both DAC alone and TSA alone.

Characteristics of reprogrammed human cell lines

We first examined tumor xenografts from reprogrammed clones (Supplementary Table S1A) for pluripotent potential and neoplastic features. The highest cancer potential was for clone MP4, a fibroblast line induced by lentiviral introduction of OCT4, SOX2, NANOG, LIN28, and SV40 T-antigen (OSNLT; refs. 1, 17) and known to be only partially reprogrammed (17). MP4 cells express undifferentiated markers such as OCT4 but not TRA-1-60, are refractory to differentiation induction in vitro and in vivo (i.e., nullipotent), and possess an abnormal karyotype. Mouse xenografts show high nuclear to cytoplasmic ratio, an extremely high mitotic rate, areas of necrosis, and the histology of a primitive, aggressive, mesenchymal tumor (Fig. 1A). In comparison, clone MP2, prepared identically to MP4 and a typical iPS expressing TRA-1-60 and all classic iPS markers (17), forms what seems to be a benign, multilineage teratoma (Fig. 1A, top right). However, this clone has an abnormal karyotype, and on closer examination, the teratoma contains small foci suggesting malignancy, including regions where cells infiltrate host skeletal muscle bundles (Fig. 1A, bottom right).

Figure 1.

Features of teratomas from reprogrammed cells. A, immunohistochemical stains (top right) for epithelial cytokeratins or H&E-stained sections from MP4 (left, top and bottom) and MP2 tumors (right, top and bottom). MP4 is positive only for the mesenchymal marker vimentin (data not shown) and has areas of necrosis (pink area; top left), a malignant feature. Higher magnification (bottom left; original magnification, ×600) shows multiple mitotic figures (arrowheads) and abnormal nuclear shapes. MP2 (top right) exhibits alternating areas of epithelial and mesenchymal differentiation, as evidenced by cytokeratin staining (brown), but has small areas with cells infiltrating host skeletal muscle bundles (pink staining; bottom right). B, differentiated tissues in tumors from H9 ESC and iPS clones MB45 and MR45. C, morphologic correlates of malignancy in iPS clones (from left to right, MB45, MR45, MMW2, and MR31), including severe nuclear atypia, host muscle infiltration, abnormally high mitotic rates, and necrosis.

Figure 1.

Features of teratomas from reprogrammed cells. A, immunohistochemical stains (top right) for epithelial cytokeratins or H&E-stained sections from MP4 (left, top and bottom) and MP2 tumors (right, top and bottom). MP4 is positive only for the mesenchymal marker vimentin (data not shown) and has areas of necrosis (pink area; top left), a malignant feature. Higher magnification (bottom left; original magnification, ×600) shows multiple mitotic figures (arrowheads) and abnormal nuclear shapes. MP2 (top right) exhibits alternating areas of epithelial and mesenchymal differentiation, as evidenced by cytokeratin staining (brown), but has small areas with cells infiltrating host skeletal muscle bundles (pink staining; bottom right). B, differentiated tissues in tumors from H9 ESC and iPS clones MB45 and MR45. C, morphologic correlates of malignancy in iPS clones (from left to right, MB45, MR45, MMW2, and MR31), including severe nuclear atypia, host muscle infiltration, abnormally high mitotic rates, and necrosis.

Close modal

We then performed blinded comparison (B.S. and D.M.B.) of teratomas (Fig. 1; Supplementary Table S1B) from the conventional embryonic stem cell (ESC) lines H1, H9, and SC233 versus from multiple additional pluripotent iPS with normal karyotypes and metrics of fully reprogrammed human iPS (23), including expression of the embryonic markers AP, OCT4, and TRA-1-60. These include MR45 and MR46, generated from IMR90 fibroblasts by retrovirally introducing OCT4, SOX2, KLF4 and c-MYC (OSKM; ref. 2), MB41, MB45, MMW1, and MMW2 generated with the same retroviral vectors but from mesenchymal stem cells (MSC), and two iPS lines (MR31 and MR32) retrovirally derived from IMR90 fibroblasts using three factors (OSK, without c-MYC). All formed mouse xenografts with differentiated cell types of multiple embryonic germ layers (Fig. 1B). However, the successfully reprogrammed iPS cell xenografts show a range of degree of maturation (from 0.067 to 0.231 lineage structures/areas examined) of defined structures such as cartilage, bone, and intestine (Fig. 1C), which is lower than the range of values (0.338–0.97) for three ESC-derived teratomas (Fig. 1B and C; Supplementary Table S1B). Most importantly, all iPS teratomas examined have foci with malignant-like characteristics, which include focal necrosis, nuclear pleomorphism, aberrantly high mitotic rates, and infiltration into the mouse musculature (examples shown in Fig. 1C; Supplementary Table S1B). Such foci were generally not found in the three ESC-derived teratomas (Supplementary Table S1B), save for one small area of focal necrosis in the teratoma derived from H1 ESC.

Overall gene expression profiles of reprogrammed clones

We compared, using full transcriptome Agilent arrays, global gene expression between representative reprogrammed clones versus ESCs and found, as have others (10), highly similar, but not identical, patterns. Even the partially reprogrammed MP4 clone deviated only slightly from the pluripotent clones and ESC and clustered separately from four human cancer cell lines (Supplementary Fig. S1). However, focus on genes highly expressed in ESC revealed distinct differences. Compared with IMR90 fibroblasts pluripotent clones MR46 and MR45 significantly upregulated to levels in ESC, 12 of 16 and 13 of 16 such genes, respectively (Supplementary Table S3), MP2 increased only 11 of 16 above levels in parent fibroblasts, and only 8 to levels in ESC. The malignant, partially reprogrammed MP4 properly increased only 5 of the 16 genes with respect to starting fibroblasts and only 6 reached levels in ESC. In addition, partially reprogrammed clone, MP4, expressed both OCT4 and c-MYC due to incomplete repression of introduced transgenes, significantly higher than in ESC (Supplementary Table S3).

Abnormal gene silencing and gene responses to epigenetic-modifying agents associated with cellular reprogramming

We next linked, in reprogrammed clones, gene expression to epigenetic aspects of neoplasia with respect to abnormal gene silencing events during reprogramming. In pluripotent clones MR46, MR45, and MP2, normal silencing events (Fig. 2A, left) dominate over abnormal silencing events for both genes that should be silent in fibroblasts but activated in ESC, or which should be active in both cell types (Fig. 2A, middle and right), whereas the opposite is true for the tumorigenic MP4 cells (Fig. 2B and C; Supplementary Table S4). However, whereas MP4 has the highest number of abnormal silencing events (∼800), even the best-performing iPS clone, MR46, has ∼100 such genes (Fig. 2B and C, combined; Supplementary Table S4). These numbers may be characteristic of iPS because analysis of the best-reprogrammed mouse iPS clone [MCV8.1, subclone 8.1 (24) from a recent study of Mikkelsen and colleagues (10)] revealed 418 abnormally silenced genes (Supplementary Fig. S2).

Figure 2.

Abnormal epigenetic gene silencing during reprogramming. A, induced silencing genes (left and middle) have basal expression in reprogrammed clones >2-fold reduced when compared with the IMR90 parental line, and below an arbitrary silent cutoff point determined for the microarray in previous studies (19). This silencing is normal (left) if expression levels in the reprogrammed clone are within 0.5 log (1.4-fold) of that observed in H9 ES cells and abnormal if expression in the iPS clone differs by >0.5 log (1.4-fold) from ESC cells. Abnormal retained silencing (right): genes silent in the fibroblasts and iPS clone, which should have been activated to within 0.5 log of the expression in ESC. B, normal (blue columns) and abnormal (green columns) induced silencing genes in MR46, MR45, MP2, and MP4 clones. C, abnormally retained silenced genes in MR46, MR45, MP2, and MP4 cells. D, response of abnormally silenced genes to DAC and TSA (from B) in MR46, MR45, MP2, and MP4 clones. Blue columns, reactivated by DAC alone; green columns, reactivated by TSA alone; orange columns, activated by either DAC or TSA alone. *, P < 0.05, Fisher's exact test, compared with expected values for the entire genome. E, drug responsiveness of genes with abnormal retained silencing (from C) in MR46, MR45, MP2, and MP4. Color coding and statistical testing as in D.

Figure 2.

Abnormal epigenetic gene silencing during reprogramming. A, induced silencing genes (left and middle) have basal expression in reprogrammed clones >2-fold reduced when compared with the IMR90 parental line, and below an arbitrary silent cutoff point determined for the microarray in previous studies (19). This silencing is normal (left) if expression levels in the reprogrammed clone are within 0.5 log (1.4-fold) of that observed in H9 ES cells and abnormal if expression in the iPS clone differs by >0.5 log (1.4-fold) from ESC cells. Abnormal retained silencing (right): genes silent in the fibroblasts and iPS clone, which should have been activated to within 0.5 log of the expression in ESC. B, normal (blue columns) and abnormal (green columns) induced silencing genes in MR46, MR45, MP2, and MP4 clones. C, abnormally retained silenced genes in MR46, MR45, MP2, and MP4 cells. D, response of abnormally silenced genes to DAC and TSA (from B) in MR46, MR45, MP2, and MP4 clones. Blue columns, reactivated by DAC alone; green columns, reactivated by TSA alone; orange columns, activated by either DAC or TSA alone. *, P < 0.05, Fisher's exact test, compared with expected values for the entire genome. E, drug responsiveness of genes with abnormal retained silencing (from C) in MR46, MR45, MP2, and MP4. Color coding and statistical testing as in D.

Close modal

In cancer, DNA hypermethylation of CpG island promoters (16) is a prime candidate for mediating abnormal gene silencing, and several abnormally silenced genes in reprogrammed clones are known to have such changes, including CDKN2B (Supplementary Ref. 1), LXN (Supplementary Refs. 2, 3), TIMP3 (Supplementary Refs. 2, 4), and PYCARD (Supplementary Ref. 5). We thus investigated global gene expression responses to both the DNA demethylation agent DAC and the histone deacetylase (HDAC) inhibitor TSA. DAC effectively reexpresses genes in cancer with densely hypermethylated, promoter CpG islands, whereas TSA alone does not (see Supplementary Fig. S3 for array expression responses; ref. 18). The results strikingly separate cancer cells, iPS cells, and especially the partially reprogrammed MP4 clone from parent fibroblasts, adult MSC, and ESC. Addition of either DAC or TSA (Fig. 2D) reactivates between 67% (MR46) and 84% (MR45) of the abnormal silenced genes in Fig. 2A. A dramatic finding is that, in partially reprogrammed clone MP4, and to a lesser degree other iPS (Fig. 2D), more induced silent genes are responsive only to DAC alone, or both DAC and TSA (left to right, Fig. 2D), but not to TSA alone, and this is also true, to a slightly less extent (Fig. 2E), for abnormal retained silencing genes (those in Fig. 2A, right). This is true for both CpG island and non-CpG island–containing genes (Fig. 3A and B, respectively). The pattern for MP4 cells begins to resemble that for the cancer cell lines, which have an extraordinary dominance of DAC alone– versus TSA alone–responsive genes. Moreover, genes in normal ESC, adult MSC, and committed bone progenitor cells (osteoblasts) have a starkly dominant response to TSA alone (Fig. 3A and B). These patterns also hold when ∼400 CpG island–containing genes responsive to DAC alone in the U2OS osteosarcoma cells (Fig. 3C) and HT1080 fibrosarcoma cells (Fig. 3D) serve as their own controls (i.e., for these genes, TSA responses dominate in normal cells, DAC responses dominate in cancer cells, and the reprogrammed clones show a mixed response to DAC and TSA consisting of a predominantly DAC response most apparent in the MP4 clone). Finally, normal committed IMR90 fibroblasts have more of a mixture of DAC- and TSA-responsive genes, but with responses still skewed toward TSA (Fig. 3A and B), and the increased frequency of DAC-responsive genes even separates all of the fully reprogrammed clones from ESC and adult MSC (Fig. 3A and B).

Figure 3.

DAC- and TSA-responsive genes in reprogrammed clones. A, expression response of all silent CpG island–containing genes to DAC (blue columns), TSA (green columns), or both DAC and TSA alone (orange columns). B, response of all silent, non-CpG island–containing genes as in A. C, overall drug responsiveness of 430 silent genes for U2OS (i.e., which are not expressed in this cell line), responsive to DAC alone, but not TSA alone, in the other cell lines shown. Color coding as in A and B. D, same analysis as in C for the 435 silent genes responsive to DAC alone in HT1080 fibrosarcoma cells.

Figure 3.

DAC- and TSA-responsive genes in reprogrammed clones. A, expression response of all silent CpG island–containing genes to DAC (blue columns), TSA (green columns), or both DAC and TSA alone (orange columns). B, response of all silent, non-CpG island–containing genes as in A. C, overall drug responsiveness of 430 silent genes for U2OS (i.e., which are not expressed in this cell line), responsive to DAC alone, but not TSA alone, in the other cell lines shown. Color coding as in A and B. D, same analysis as in C for the 435 silent genes responsive to DAC alone in HT1080 fibrosarcoma cells.

Close modal

Taken together, the data indicate that reprogrammed somatic cells can acquire both abnormal gene silencing events and aberrant responses to epigenetic-modifying drugs that deviate from normal and are reminiscent of changes observed in cancer cells.

Cancer-related promoter DNA methylation changes arise early during reprogramming

To further examine abnormal gene silencing and cancer-specific promoter DNA methylation on a global scale, we used the Infinium platform (25), which queries ∼27,000 CpG sites, most, but not all, in annotated promoter CpG islands. When focused on analyzing CpGs located from −1,000 to +200 bp from gene TSS, we find, as have others in genome-wide DNA methylation studies (10, 26, 27), that the vast majority (∼90%) of ∼7,500 loci in ∼5,700 different genes with well-annotated promoter CpG islands are unmethylated in all normal cells (ESC, MSC, and fibroblasts; Supplementary Fig. S4). In contrast, for ∼3,750 probes in ∼800 autosomal genes not containing dense CpG island promoters, CpG poor promoters are far more methylated, with a varied tissue pattern, in normal cells (Supplementary Fig. S5), consistent with studies of others (12, 26) and verifying a long-held biological premise. Finally, consistent with our previous studies (18), gene promoter CpG island hypermethylation in cancer is starkly apparent (Supplementary Fig. S4). The number of unmethylated CpG island promoters falls to ∼78% in four adult cancer cell lines and to ∼87% in two germline teratocarcinoma lines representing 900 hypermethylated genes in the former and >200 in the latter. Non-CpG island promoters have not been as carefully examined in cancer, but the cancer cell lines still cluster separately from normal cells largely because of multiple genes that have gained methylation but also due to many that have lost normal methylation (Supplementary Fig. S5). Importantly, for cancer cell lines such as HCT116 colorectal cells, the Infinium analyses correctly identified (data not shown) ∼90% of genes validated in our laboratory to have cancer-specific DNA hypermethylation (18).

With this above background, we examined not only our reprogrammed cell clones but also pools of cells (Supplementary Table S1 for listing) from early stages (days 6–18) following insertion of four factors into both our fibroblast and MSC parent cells and after introduction of OCT4 alone into fibroblasts. In these pools, cells are in a dynamic meta-stable state with a wide range of reprogramming stages (10, 28), wherein most clones do not exhibit ESC morphology and many cells can be cultured indefinitely. Overall, cells in these pools maintain the normally unmethylated state of CpG island promoters, harboring many less abnormalities than seen in cancer cells (Supplementary Fig. S4). However, we observe that among normally unmethylated CpG island genes, 50 show abnormal methylation in one or more of the early cell pools by day 6, and 38 have abnormally added methylation in individual reprogrammed clones (Fig. 4A; Table 1, individual clone genes; Supplementary Table S5, pool genes). Although the genes generally differ between the pools and clones, ∼15% are shared between the two. Of 10 clones examined, including partially reprogrammed MP4 cells, only 2 (MMW1 and MMW2) failed to exhibit a hypermethylated gene, whereas the remainder contained between 3 (MR46) and 17 (MB45). Significantly, >60% of all these genes (P = 3.055 × 10−7, Fisher's exact test) in the clones and 50% (P = 7.302 × 10−5, Fisher's exact test) in the pools are also hypermethylated in one or more cancer cell lines (Fig. 4A; Table 1 and Supplementary Table S5, individual genes). Finally, and very importantly, even pooled cells with introduction of OCT4 alone contained from 12 to 16 hypermethylated genes, and again, ∼50% of these were also hypermethylated in at least one cancer cell line (Fig. 4A; Supplementary Table S5).

Figure 4.

DNA methylation analysis of all cells studied. A, heat map for methylation β scores from the Infinium platform (color bar at the top left; green, lowest methylation; red, highest methylation) for all aberrantly methylated genes with a CpG island. X axis, sample clustering (top) and samples (bottom); Y axis, gene clustering (left) and individual genes (right; those hypermethylated in iPS clones and/or reprogrammed pools are also designated in Table 1 and Supplementary Table S4, respectively). Samples analyzed are as follows: MR46, MR45, MP2, and MP4 clones; parent IMR90 fibroblasts; parent adult MS (MSC-A and MSC-B); adult osteoblasts; ESC lines H1ES and H9ES; cancer cell lines (U2OS, HT1080, MDA-MB-231, and HCT116); reprogrammed clones (MB41, MB45, MMW1, and MMW2) derived from MSC using retrovirally introduced OCT4, SOX2, KLF4, and c-MYC (OSKM); two iPS lines (MR31 and MR32) retrovirally derived from IMR90 fibroblasts using three factors (OSK, without c-MYC); teratocarcinoma lines Tera2+EV (empty plasmid vector) and 2102EP; reprogramming pools of IMR90 cells at days 6, 12, and 18 (TR, technical repeat; BR, biological repeat) after retroviral infection using the OSKM cocktail (IMR904F-d6, IMR904F-d12, and IMR904F-d18, respectively) and at day 12 after introduction of OCT4 alone (IMR901F-d12); pools of reprogramming cells using adult MSC as the parental line, and after 6 and 18 d for OSKM introduction (MSC-B-4F-d6 and MSC-B-4F-d16); parent IMR90 cells infected by lentivirus with T antigen alone (IMR90T); IMR90 infected with green fluorescent protein (IMR90+GFP-RV); and IMR90 cells infected with four factors grown for only 6 days (IMR904F-d6). B, heat map for non-CpG island genes that either gained or lost methylation in clones or reprogramming pools relative to all normal cell types. All samples as for A, and abnormal iPS and/or reprogrammed pool genes are in Supplementary Tables S5 and S6, respectively.

Figure 4.

DNA methylation analysis of all cells studied. A, heat map for methylation β scores from the Infinium platform (color bar at the top left; green, lowest methylation; red, highest methylation) for all aberrantly methylated genes with a CpG island. X axis, sample clustering (top) and samples (bottom); Y axis, gene clustering (left) and individual genes (right; those hypermethylated in iPS clones and/or reprogrammed pools are also designated in Table 1 and Supplementary Table S4, respectively). Samples analyzed are as follows: MR46, MR45, MP2, and MP4 clones; parent IMR90 fibroblasts; parent adult MS (MSC-A and MSC-B); adult osteoblasts; ESC lines H1ES and H9ES; cancer cell lines (U2OS, HT1080, MDA-MB-231, and HCT116); reprogrammed clones (MB41, MB45, MMW1, and MMW2) derived from MSC using retrovirally introduced OCT4, SOX2, KLF4, and c-MYC (OSKM); two iPS lines (MR31 and MR32) retrovirally derived from IMR90 fibroblasts using three factors (OSK, without c-MYC); teratocarcinoma lines Tera2+EV (empty plasmid vector) and 2102EP; reprogramming pools of IMR90 cells at days 6, 12, and 18 (TR, technical repeat; BR, biological repeat) after retroviral infection using the OSKM cocktail (IMR904F-d6, IMR904F-d12, and IMR904F-d18, respectively) and at day 12 after introduction of OCT4 alone (IMR901F-d12); pools of reprogramming cells using adult MSC as the parental line, and after 6 and 18 d for OSKM introduction (MSC-B-4F-d6 and MSC-B-4F-d16); parent IMR90 cells infected by lentivirus with T antigen alone (IMR90T); IMR90 infected with green fluorescent protein (IMR90+GFP-RV); and IMR90 cells infected with four factors grown for only 6 days (IMR904F-d6). B, heat map for non-CpG island genes that either gained or lost methylation in clones or reprogramming pools relative to all normal cell types. All samples as for A, and abnormal iPS and/or reprogrammed pool genes are in Supplementary Tables S5 and S6, respectively.

Close modal
Table 1.

Genes with abnormally methylated CpG island promoters in clones, partially reprogrammed pools, or cancers

graphic
 
graphic
 
graphic
 
graphic
 

NOTE: Dark squares, β values above 0.45; light gray squares, values ranging to 0.45.

The fully reprogrammed iPS clones generally also behave very well with respect to non-CpG island genes. Thus, for all clones, 87% to 95% of such promoters properly either gain or lose DNA methylation properly relative to ESC (Fig. 4B). Interestingly, and as might be expected, most of the genes in these groups do not make the changes required for iPS in the reprogramming pools of fibroblasts or MSC, generally retaining the methylation patterns of the starting parent cells (Fig. 4B). Again, despite the global proper behavior, multiple non-CpG island genes, relative to normal cells, both gain and lose promoter methylation. Thirteen genes abnormally gain methylation in the clones (Supplementary Table S6), and another 13 in the pools (Supplementary Table S7), whereas 56 and 37 lose methylation respectfully (Supplementary Tables S6 and S7). Particularly, MP4 contains 37 abnormal genes: 5 having abnormal gain and 32 having abnormal losses of methylation (Fig. 4B; Supplementary Tables S6 and S7). A high percentage, 60% for the clones (P = 1.645 × 10−12, Fisher's exact test) and 43% for the pools (P = 5.141 × 10−8), of the abnormal gains and losses of DNA methylation also appear in one or more of the cancer cell lines (Supplementary Tables S6 and S7). Finally, for MP4, only 35% of non-CpG island genes properly gained or lost DNA methylation relative to ESC.

Importantly, many of the above abnormal CpG island genes are hypermethylated in primary human cancers and have roles in malignancy and/or embryonic cell fate patterning. GATA4 (Supplementary Ref. 6), central to proper endodermal differentiation, is hypermethylated in multiple cancer types; O6-MGMT (Supplementary Refs. 7, 8) is also hypermethylated in cancer and loss of function impedes DNA repair; TCERG1L has been recently identified as a low-frequency mutated gene in colon cancer (Supplementary Ref. 9) and is DNA hypermethylated in virtually all such tumors12

12J.-M. Yi, N. Ahuja, S.B. Baylin, L. Van Neste, J.G. Herman, unpublished data.

; SPRY2 plays a role in development and differentiation through negatively regulating the extracellular signal-regulated kinase pathway and is hypermethylated in cancers with prognostic significance (Supplementary Ref. 10); SOX1 is DNA hypermethylated in cancer and also plays key roles in development (Supplementary Ref. 11); BMP4 is an important morphogen for mesenchymal development and is hypermethylated in breast cancer (Supplementary Ref. 12); and HOXA9 is an important cell pattern control gene frequently hypermethylated in breast cancer (Supplementary Ref. 13). LXN, hypermethylated in tumorigenic clone MP4, contains a weak CpG island and is hypermethylated in several cancer types (Supplementary Refs. 2, 3).

For selection of these important genes, we verified Infinium results by performing the methylation-specific PCR (MSP) assay (29), querying four to six CpG sites positioned near the Infinium probes yielding hypermethylation values. Although Infinium probes query small numbers of CpGs in the islands, and development of abnormal methylation may be quite partial over the short time of reprogramming, 7 of 10 genes were methylated by MSP, and at one of two promoter sites queried, TCERG1L is fully methylated in iPS clones MP2 and MR45 (Supplementary Fig. S6A and B).

Abnormally silenced genes, chromatin, and cellular reprogramming

The small fraction of DAC-responsive (Supplementary Fig. S3), silenced genes in iPS and reprogramming pools that are DNA methylated in reprogrammed cells is surprising. Although this could represent failure to detect very partial DNA methylation in queried promoters, it may involve links between abnormal gene silencing, DNA methylation in cancer, and chromatin states of embryonic cells. We (30) and others (31, 32) have found that high percentages of DNA hypermethylated genes in cancer are marked by polycomb group silencing proteins (PcG) in embryonic cells. Such genes may have an abnormal progression from PcG marking to promoter DNA hypermethylation in cancer (27, 30). In ESC, such promoters are not methylated but rather maintained in a low poised expression state by PcG regulation (3335) and “bivalent” chromatin constituted by transcriptional positive (H3K4me2 and me3) and negative (the PcG mark, H3K27me3) histone modifications (36). Interestingly, abnormally low expression genes in partially reprogrammed cells from mice have bivalent chromatin (28), and aggressive human cancers have increased expression of PcG genes (37) to levels such as those found in our iPS cells (Supplementary Table S2).

We thus queried the promoter occupancy of our abnormally silenced genes in iPS clones, reprogramming pools, and cancer cell lines (Table 1; Supplementary Tables S4 and S5) in databases of others for embryonic cells (3740). Although there is no significant enrichment at the promoters of these genes for occupancy by the iPS reprogramming factors themselves (37), save for NANOG at the promoters of the DNA hypermethylated genes in Table 1 and Supplementary Table S5, in the clones and partially reprogrammed pools, there is an enrichment for PcG promoter marking (Fig. 5A). This is confirmed by local chromatin immunoprecipitation (ChIP) of the PcG mark H3K27me3 for promoter regions of selected genes in ESC, iPS clone MP2, and the partially reprogrammed clone MP4 (Fig. 5B).

Figure 5.

Gene promoter occupancy by transcription factors and PcG-repressive histone modification (H3K27me3). A, percentage of genes with transcription factor (OCT4, SOX2, NANOG, and c-MYC) and H3K27me3 promoter occupancy, in tiling arrays of others (3740), for all abnormally silenced genes in Fig. 2, abnormally silenced and DAC-responsive genes in Fig. 3, abnormally DNA-methylated genes in Table 1 and Supplementary Tables S4 to S6, and DAC-responsive, silenced genes in cancer cell lines. B, real-time ChIP of H3K27me3 at promoters of selected, abnormally silenced genes that are reactivated by DAC, but are not DNA hypermethylated, in MP2 iPS, partially reprogrammed MP4, H9, and ESC. Triangles, H3K27me3 enrichment (log scale) relative to a value of 1.0 (horizontal line) for parent IMR90 fibroblasts. Primer sequences and positions provided online.

Figure 5.

Gene promoter occupancy by transcription factors and PcG-repressive histone modification (H3K27me3). A, percentage of genes with transcription factor (OCT4, SOX2, NANOG, and c-MYC) and H3K27me3 promoter occupancy, in tiling arrays of others (3740), for all abnormally silenced genes in Fig. 2, abnormally silenced and DAC-responsive genes in Fig. 3, abnormally DNA-methylated genes in Table 1 and Supplementary Tables S4 to S6, and DAC-responsive, silenced genes in cancer cell lines. B, real-time ChIP of H3K27me3 at promoters of selected, abnormally silenced genes that are reactivated by DAC, but are not DNA hypermethylated, in MP2 iPS, partially reprogrammed MP4, H9, and ESC. Triangles, H3K27me3 enrichment (log scale) relative to a value of 1.0 (horizontal line) for parent IMR90 fibroblasts. Primer sequences and positions provided online.

Close modal

It seems that during the reprogramming process, iPS may be prone to abnormal epigenetic changes characteristic of neoplasia. Our results differ from a recent study (15) in which differential changes both between cancer and normal cells, and iPS and normal ESC, were thought confined to “shores” or regions upstream from promoter CpG islands. In contrast, we find that cancer-specific promoter CpG island hypermethylation is easily visualized on the Infinium arrays in hundreds of genes in the cancer cells (Fig. 4; Supplementary Figs. S4 and S5). These changes can be seen, to a lesser degree, in iPS and partially reprogrammed cells (Table 1; Supplementary Table S4; Fig. 4A).

The abnormal gene silencing events in reprogramming could well involve the inducing factors themselves. We find just one essential reprogramming factor, OCT4 (4, 41), that produces cancer-specific epigenetic changes, and this may help explain why its forced overexpression in the germline of mice produces a striking neoplastic phenotype in skin and intestine (42).

A dramatic finding in our study concerns the dominance of gene responsiveness to TSA in ESC but to DAC in cancer cells and iPS. The chromatin of ESC especially exists in a far more open, and histone-acetylated, pattern than in more committed embryonic and adult cells (43). Many genes in ESC may be in a poised state for induction in response to commitment signals and are balanced between very active targeting by histone acetyltransferases and removal of acetylation by HDACs. Blocking HDACs would then shift the dynamics to allow rapid promoter acetylation to enhance expression of such genes. The more dominant response to DAC in iPS, and especially cancer cells, may reflect an abnormally repressed state of genes, which prevents normal cell differentiation and/or full progression of cells during reprogramming to the ESC epigenetic state.

iPS and cancer cells do differ dramatically in that the Infinium assay validates that a majority (>70%) of basally silent genes that respond to DAC alone but not to TSA (i.e., candidate genes in Supplementary Fig. S3) have promoter DNA methylation in cancer cells but not in iPS cells. A key explanation may lie in the potential molecular progression from PcG promoter occupancy to DNA promoter hypermethylation that may be ongoing in cancer as suggested by ourselves and others (27). This progression may involve targeting of DNMTs to involved genes (27) and resultant DAC reexpression of even non–DNA-methylated genes. DNMTs bind corepressors, experimentally can repress gene promoters independently of catalyzing DNA methylation, and can act as scaffolding proteins by using regions distal to their DNA methylation catalytic sites (4446). To inhibit DNMTs, DAC incorporates into DNA and covalently bind to DNMTs, resulting in removal of DNMT1 from the soluble nuclear protein pool (47) and its degradation (48). Thus, DAC may reexpress genes with or without DNA methylation as indicated by our recent result for overexpressing a polycomb constituent in teratocarcinoma cells (49).

In summary, epigenetic changes may contribute to neoplastic potential of iPS. Mapping these may help provide constructive parameters to monitor the preparation and use of such cells, guide use of epigenetic-modifying drugs to enhance efficiency of obtaining iPS (10), and help derive gene marker panels for the above purposes. Finally, cellular reprogramming may provide a model to study how epigenetic abnormalities may be central to the origins of cancer and whether reprogramming might play a role in the formation of key subpopulations of cancer cells. For example, cellular reprogramming might underlie the fact that neoplasia can initiate even in mature populations of normal cells (50).

S.B.B. consults for OncoMethylome Sciences. MSP is licensed to OncoMethylome Sciences in agreement with Johns Hopkins University (JHU), and S.B.B. and JHU are entitled to royalty shares received from sales.

We thank Dr. Saul Sharkis and members of the Baylin and Cheng labs for reading the manuscript and for helpful discussions and Kathy Bender for manuscript preparation.

Grant Support: NIH grants CA116160 and UO1 HL099775 (S.B.B.) and State of Maryland TEDCO.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1
Yu
J
,
Vodyanik
MA
,
Smuga-Otto
K
, et al
. 
Induced pluripotent stem cell lines derived from human somatic cells
.
Science
2007
;
318
:
1917
20
.
2
Takahashi
K
,
Tanabe
K
,
Ohnuki
M
, et al
. 
Induction of pluripotent stem cells from adult human fibroblasts by defined factors
.
Cell
2007
;
131
:
861
72
.
3
Loh
YH
,
Agarwal
S
,
Park
IH
, et al
. 
Generation of induced pluripotent stem cells from human blood
.
Blood
2009
;
113
:
5476
9
.
4
Hochedlinger
K
,
Plath
K
. 
Epigenetic reprogramming and induced pluripotency
.
Development
2009
;
136
:
509
23
.
5
Yu
J
,
Hu
K
,
Smuga-Otto
K
, et al
. 
Human induced pluripotent stem cells free of vector and transgene sequences
.
Science
2009
;
324
:
797
801
.
6
Kaji
K
,
Norrby
K
,
Paca
A
,
Mileikovsky
M
,
Mohseni
P
,
Woltjen
K
. 
Virus-free induction of pluripotency and subsequent excision of reprogramming factors
.
Nature
2009
;
458
:
771
5
.
7
Okita
K
,
Ichisaka
T
,
Yamanaka
S
. 
Generation of germline-competent induced pluripotent stem cells
.
Nature
2007
;
448
:
313
7
.
8
Yamanaka
S
. 
A fresh look at iPS cells
.
Cell
2009
;
137
:
13
7
.
9
Daley
GQ
,
Lensch
MW
,
Jaenisch
R
,
Meissner
A
,
Plath
K
,
Yamanaka
S
. 
Broader implications of defining standards for the pluripotency of iPSCs
.
Cell Stem Cell
2009
;
4
:
200
1
.
10
Mikkelsen
TS
,
Hanna
J
,
Zhang
X
, et al
. 
Dissecting direct reprogramming through integrative genomic analysis
.
Nature
2008
;
454
:
49
55
.
11
Lister
R
,
Pelizzola
M
,
Dowen
RH
, et al
. 
Human DNA methylomes at base resolution show widespread epigenomic differences
.
Nature
2009
;
462
:
315
22
.
12
Meissner
A
,
Mikkelsen
TS
,
Gu
H
, et al
. 
Genome-scale DNA methylation maps of pluripotent and differentiated cells
.
Nature
2008
;
454
:
766
70
.
13
Huangfu
D
,
Maehr
R
,
Guo
W
, et al
. 
Induction of pluripotent stem cells by defined factors is greatly improved by small-molecule compounds
.
Nat Biotechnol
2008
;
26
:
795
7
.
14
Shi
Y
,
Desponts
C
,
Do
JT
,
Hahm
HS
,
Scholer
HR
,
Ding
S
. 
Induction of pluripotent stem cells from mouse embryonic fibroblasts by Oct4 and Klf4 with small-molecule compounds
.
Cell Stem Cell
2008
;
3
:
568
74
.
15
Doi
A
,
Park
IH
,
Wen
B
, et al
. 
Differential methylation of tissue- and cancer-specific CpG island shores distinguishes human induced pluripotent stem cells, embryonic stem cells and fibroblasts
.
Nat Genet
2009
;
41
:
1350
3
.
16
Jones
PA
,
Baylin
SB
. 
The epigenomics of cancer
.
Cell
2007
;
128
:
683
92
.
17
Mali
P
,
Ye
Z
,
Hommond
HH
, et al
. 
Improved efficiency and pace of generating induced pluripotent stem cells from human adult and fetal fibroblasts
.
Stem Cells
2008
;
26
:
1998
2005
.
18
Schuebel
KE
,
Chen
W
,
Cope
L
, et al
. 
Comparing the DNA hypermethylome with gene mutations in human colorectal cancer
.
PLoS Genet
2007
;
3
:
1709
23
.
19
McGarvey
K
,
Van Neste
L
,
Cope
L
, et al
. 
Defining a chromatin pattern that characterizes DNA hypermethylated genes in colon cancer cells
.
Cancer Res
2008
;
68
:
5753
9
.
20
Bibikova
M
,
Fan
JB
. 
GoldenGate assay for DNA methylation profiling
.
Methods Mol Biol
2009
;
507
:
149
63
.
21
Network
CGAR
. 
Comprehensive genomic characterization defines human glioblastoma genes and core pathways
.
Nature
2008
;
455
:
1061
8
.
22
Rhee
I
,
Bachman
KE
,
Park
BH
, et al
. 
DNMT1 and DNMT3b cooperate to silence genes in human cancer cells
.
Nature
2002
;
416
:
552
6
.
23
Chan
EM
,
Ratanasirintrawoot
S
,
Park
IH
, et al
. 
Live cell imaging distinguishes bona fide human iPS cells from partially reprogrammed cells
.
Nat Biotechnol
2009
;
27
:
1033
7
.
24
Meissner
A
,
Wernig
M
,
Jaenisch
R
. 
Direct reprogramming of genetically unmodified fibroblasts into pluripotent stem cells
.
Nat Biotechnol
2007
;
25
:
1177
81
.
25
Butler
H
,
Ragoussis
J
. 
BeadArray-based genotyping
.
Methods Mol Biol
2008
;
439
:
53
74
.
26
Straussman
R
,
Nejman
D
,
Roberts
D
, et al
. 
Developmental programming of CpG island methylation profiles in the human genome
.
Nat Struct Mol Biol
2009
;
16
:
564
71
.
27
Cedar
H
,
Bergman
Y
. 
Linking DNA methylation and histone modification: patterns and paradigms
.
Nat Rev Genet
2009
;
10
:
295
304
.
28
Sridharan
R
,
Tchieu
J
,
Mason
MJ
, et al
. 
Role of the murine reprogramming factors in the induction of pluripotency
.
Cell
2009
;
136
:
364
77
.
29
Herman
JG
,
Graff
JR
,
Myohanen
S
,
Nelkin
BD
,
Baylin
SB
. 
Methylation-specific PCR: a novel PCR assay for methylation status of CpG islands
.
Proc Natl Acad Sci U S A
1996
;
93
:
9821
6
.
30
Ohm
JE
,
McGarvey
KM
,
Yu
X
, et al
. 
A stem cell-like chromatin pattern may predispose tumor suppressor genes to DNA hypermethylation and heritable silencing
.
Nat Genet
2007
;
39
:
237
42
.
31
Widschwendter
M
,
Fiegl
H
,
Egle
D
, et al
. 
Epigenetic stem cell signature in cancer
.
Nat Genet
2007
;
39
:
157
8
.
32
Schlesinger
Y
,
Straussman
R
,
Keshet
I
, et al
. 
Polycomb-mediated methylation on Lys27 of histone H3 pre-marks genes for de novo methylation in cancer
.
Nat Genet
2007
;
39
:
232
6
.
33
Lund
AH
,
van Lohuizen
M
. 
Polycomb complexes and silencing mechanisms
.
Curr Opin Genet Dev
2004
;
16
:
1
8
.
34
Otte
AP
,
Kwaks
TH
. 
Gene repression by Polycomb group protein complexes: a distinct complex for every occasion?
Curr Opin Genet Dev
2003
;
13
:
448
54
.
35
Kuzmichev
A
,
Margueron
R
,
Vaquero
A
, et al
. 
Composition and histone substrates of polycomb repressive group complexes change during cellular differentiation
.
Proc Natl Acad Sci U S A
2005
;
102
:
1859
64
.
36
Chi
AS
,
Bernstein
BE
. 
Developmental biology. Pluripotent chromatin state
.
Science
2009
;
323
:
220
1
.
37
Ben-Porath
I
,
Thomson
MW
,
Carey
VJ
, et al
. 
An embryonic stem cell-like gene expression signature in poorly differentiated aggressive human tumors
.
Nat Genet
2008
;
40
:
499
507
.
38
Lee
TI
,
Jenner
RG
,
Boyer
LA
, et al
. 
Control of developmental regulators by polycomb in human embryonic stem cells
.
Cell
2006
;
125
:
301
13
.
39
Boyer
LA
,
Plath
K
,
Zeitlinger
J
, et al
. 
Polycomb complexes repress developmental regulators in murine embryonic stem cells
.
Nature
2006
;
441
:
349
53
.
40
Mikkelsen
TS
,
Ku
M
,
Jaffe
DB
, et al
. 
Genome-wide maps of chromatin state in pluripotent and lineage-committed cells
.
Nature
2007
;
448
:
553
60
.
41
Nakagawa
M
,
Koyanagi
M
,
Tanabe
K
, et al
. 
Generation of induced pluripotent stem cells without Myc from mouse and human fibroblasts
.
Nat Biotechnol
2007
;
26
:
101
6
.
42
Hochedlinger
K
,
Yamada
Y
,
Beard
C
,
Jaenisch
R
. 
Ectopic expression of Oct-4 blocks progenitor-cell differentiation and causes dysplasia in epithelial tissues
.
Cell
2005
;
121
:
465
77
.
43
Meshorer
E
,
Misteli
T
. 
Chromatin in pluripotent embryonic stem cells and differentiation
.
Nat Rev Mol Cell Biol
2006
;
7
:
540
6
.
44
Fuks
F
,
Burgers
WA
,
Godin
N
,
Kasai
M
,
Kouzarides
T
. 
Dnmt3a binds deacetylases and is recruited by a sequence-specific repressor to silence transcription
.
EMBO J
2001
;
20
:
2536
44
.
45
Rountree
MR
,
Bachman
KE
,
Baylin
SB
. 
DNMT1 binds HDAC2 and a new co-repressor, DMAP1, to form a complex at replication foci
.
Nat Genet
2000
;
25
:
269
77
.
46
Robertson
KD
,
Ait-Si-Ali
S
,
Yokochi
T
,
Wade
PA
,
Jones
PL
,
Wolffe
AP
. 
DNMT1 forms a complex with rb, E2F1 and HDAC1 and represses transcription from E2F-responsive promoters
.
Nat Genet
2000
;
25
:
338
42
.
47
Cheng
JC
,
Weisenberger
DJ
,
Gonzales
FA
, et al
. 
Continuous zebularine treatment effectively sustains demethylation in human bladder cancer cells
.
Mol Cell Biol
2004
;
24
:
1270
8
.
48
Datta
J
,
Ghoshal
K
,
Denny
WA
, et al
. 
A new class of quinoline-based DNA hypomethylating agents reactivates tumor suppressor genes by blocking DNA methyltransferase 1 activity and inducing its degradation
.
Cancer Res
2009
;
69
:
4277
85
.
49
Mohammad
HP
,
Cai
Y
,
McGarvey
KM
, et al
. 
Polycomb CBX7 promotes initiation of heritable repression of genes frequently silenced with cancer-specific DNA hypermethylation
.
Cancer Res
2009
;
69
:
6322
30
.
50
Gidekel Friedlander
SY
,
Chu
GC
,
Snyder
EL
, et al
. 
Context-dependent transformation of adult pancreatic cells by oncogenic K-Ras
.
Cancer Cell
2009
;
16
:
379
89
.