Purpose: The causative genes for autosomal dominantly inherited familial adenomatous polyposis (FAP) and hereditary nonpolyposis colorectal cancer have been well characterized. There is, however, another 10% to 15% of early onset colorectal cancers (CRC) in which the genetic components are unclear. In this study, we used microarray technology to systematically search for differentially expressed genes in early onset CRC.

Experimental Design: Young patients with non–FAP or non–hereditary nonpolyposis colorectal cancer, and healthy controls were age- (≤50 years old), ethnicity- (Chinese), and tissue-matched. RNAs extracted from colonic mucosa specimens were analyzed using GeneChip U133-Plus 2.0 Array.

Results: Seven genes, CYR61, UCHL1, FOS, FOS B, EGR1, VIP, and KRT24, were consistently up-regulated in the mucosa of all six patients compared with the mucosa from four healthy controls. The overexpression of these genes was independently validated with a testing set of six patients and six healthy controls. Principal component analysis clustered the healthy control specimens separately from the patient specimens. Real-time PCR quantification with SYBR-Green on nine other patient specimens not previously used in microarray assays confirmed the up-regulation of these seven genes. These genes function in a multitude of biological processes ranging from transcription, angiogenesis, adhesion, and inflammatory regulation to protein catabolism in various cellular compartments, from extracellular to the nucleus. They integrate known tumorigenesis (Wnt, PI3K, MAP kinase, hypoxia, G protein–coupled receptor), neurologic, insulin-signaling, and NFAT-immune pathways into an intricate biological network.

Conclusions: The data suggest that the patient's mucosa is primed for tumorigenesis when cellular homeostasis is disrupted, and that the seven overexpressed genes could potentially predict early onset CRC.

Colorectal cancer (CRC) is one of the leading cancers in the developed world. In Singapore, it has the highest incidence among cancers and is the second leading cause of cancer death (1). Despite the recent advent in therapeutic intervention, the most promising strategy to reduce mortality and morbidity from the disease is still early detection (2). A study on twins has estimated that up to 35% of CRCs could be attributed to genetic factors (3). Germ line mutations in adenomatous polyposis coli and mismatch repair genes are responsible for the highly penetrant, and autosomal, dominantly inherited familial adenomatous polyposis (FAP) and hereditary nonpolyposis colorectal cancer (HNPCC), respectively. Nevertheless, FAP and HNPCC comprise only 5% to 10% of total CRC incidence. There are at least another 10% to 15% of early onset CRCs for which the genetic susceptibility is unclear (4).

Accumulating evidence has alluded to the role of many low-penetrant modifying genes, such as the G870A single nucleotide polymorphism in cyclin D1, in increasing CRC risk (5, 6). However, no systematic study to ascertain the extent and cumulative risk purported to these susceptibility genes has been reported.

In recent years, microarray technology has increasingly been used to analyze various tumors including CRC (711). The microarray assay is a powerful tool for systematic and combinatorial search for molecular markers of cancer classification and outcome prediction (12). To our knowledge, however, no whole genome expression profiling of the germ line of patients with early onset compared with healthy controls has hitherto been done, probably because of the paucity of such well-defined sample sets.

In the present study, we did a genome-wide expression profile using complementary RNA from normal-appearing colonic mucosa of young patients without a family history of the disease and compared that to the profile of age-, ethnicity-, and tissue-matched healthy controls. The aim was to ascertain a panel of differentially expressed genes that could potentially serve as a susceptibility gene set for early onset CRC.

Patients and healthy control specimens. Tumor specimens and adjacent grossly normal-appearing tissue at least 8 cm away were routinely collected and archived from patients undergoing colorectal resection at the Singapore General Hospital. Young (≤50 years old) Chinese patients whose tumors were classified as microsatellite-stable were included in this retrospective study. These patients did not have clinicopathologic features (retrieved from medical records) that fit the Bethesda criteria for HNPCC. Furthermore, the mucosa from these patients harbored three or fewer adenomatous polyps. Hence, these patients were young onset but non–FAP and non-HNPCC. All patients, except one, had early stage (Dukes A/B) tumors at the time of the operation. Only one patient whose tumor was Dukes C was included in this study. All tumors, except one, were left-sided (to the left of the splenic flexure).

Five to seven pinch biopsies from several locations throughout the colon were obtained from Chinese individuals (≤50 years old) undergoing colonoscopic examination and were found to have no polyps and no known family history or previous CRC incidence: these were designated as healthy controls. These healthy controls underwent colonoscopies for reasons ranging from abdominal pain, change in bowel habits, and bleeding. All those with primary bleeding were found to have hemorrhoids.

Both patient and healthy control specimens were snap-frozen in liquid nitrogen within 30 min of removal from the colon and were stored at −80°C. This study was approved by the Ethics Committee of Singapore General Hospital.

Microsatellite instability assay. The HNPCC microsatellite instability test kit from Roche Molecular Biochemicals (Penzberg, Germany) was used to determine the microsatellite instability status of the patients' tumors. The panel included five microsatellite loci, BAT25, BAT26, D5S346, D17S250, and D2S123. Briefly, ∼200 ng of DNA from matched mucosa and tumor tissues from each patient were used in each multiplex PCR reaction according to the manufacturer's protocol. A microsatellite-stable tumor is a tumor with identical profiles in all five loci compared with the matched mucosa tissue. A microsatellite-unstable tumor is a tumor with two or more loci with different profiles from mucosa.

Genome-wide expression analysis. Patients and healthy controls were divided into a training set and a testing set with approximately equal numbers. The training set was used to build a gene set which was subsequently validated using the testing set. Total RNA was extracted from each specimen and biotinylated complementary RNA targets were prepared with 5 μg of total RNA according to the manufacturer's protocols (Affymetrix, Santa Clara, CA). Only RNA of high integrity (intensity of 28S/18S rRNA ∼1.5) were used. No specimen pooling was done because the ultimate aim of the study was to identify biomarkers consistently differentially expressed across all individual patients compared with healthy controls. Targets were hybridized to GeneChip Human Genome U133 Plus 2.0 Arrays for 16 h.

The hybridized arrays were washed and stained with streptavidin/phycoerythrin conjugate and anti-streptavidin antibody in Affymetrix Fluidic Station 450, and scanned with GeneChip scanner 3000. Signals were analyzed with the GeneChip operating system software. Chips were rejected if the marginal gene call was >2% and the 3′/5′ ratio of the reference gene, human glyceraldehyde-3-phosphate dehydrogenase, was >3.0. Detection (“present”, “absent”, or “marginal”), change (“increased”, “decreased”, or “no change”), and fold change status (−log 2) between the signals generated from the mucosa of patients compared with the mucosa of healthy controls were analyzed with the Affymetrix data mining tool. The cutoff criterion for gene calls was at least a 4-fold difference in expression between any two profiles compared, to avoid gene calls attributable to allelic imbalances or stochastic fluctuations. Differential expressions due to allelic differences were generally in the order of 1- to 4-fold (13). The microarray data set was submitted to the GEO repository (GSE4107) at http://www.ncbi.nlm.nih.gov/geo/info/linking.html.

Quantitative real-time PCR analysis. First-strand cDNA was synthesized from total RNA using the SuperScript First-Strand Synthesis System with SuperScript II reverse transcriptase according to the protocols of the manufacturer (Invitrogen, Carlsbad, CA). The cDNA generated was used as a template in real-time PCR reactions with QuantiTect SYBR-Green PCR master-mix (Qiagen, Hilden, Germany) and were run on an ABI PRISM 7700 machine. All PCR reactions were done in duplicate. Validation experiments verified that the efficiencies of amplification of the seven target genes and glyceraldehyde-3-phosphate dehydrogenase, the internal reference, were approximately equal. Hence, relative quantification or fold change between patients and the mean of four healthy controls for each gene (2−ΔΔCt) were determined using the comparative Ct method. The assay was done on all 12 patient specimens previously arrayed and in 9 other patient specimens (five females and four males) similarly selected.

Statistical methods. Affymetrix data files were imported into “Spotfire” for further analysis. After filtering off genes (35%) that were not present in any of the specimens and outlying signal intensities with a magnitude >40,000 (1.1%), the remaining 35,120 probe sets were normalized by Z score calculation and analyzed by t test, hierarchical clustering, and principal component analysis. All tests of statistical significance were two-sided and differences were taken as significant at P < 0.05.

General data management and regression plot between fold change generated by the microarray and quantitative real-time PCR assays were done using Microsoft Excel program. Differentially expressed probe sets were fed into the NetAffy web site and the Gene Ontology database for further sequence information and functional annotations. Additional pathway analyses were done with the BiblioSphere PathwayEdition database (version 6.12) from Genomatix GmbH (Munich, Germany).

Expression profiling identifies differentially expressed probe sets and patient subgroups. The complementary RNAs generated from both healthy controls and patient specimens were of high and comparable integrity. The mean percentage of transcripts scored as present and marginal were very similar between healthy controls and patients (Table 1). Furthermore, the mean 3′/5′ ratios of the internal control and human glyceraldehyde-3-phosphate dehydrogenase for both the healthy controls and patient specimens were below the threshold of 3.0 (Table 1).

Table 1.

Characteristics of patients and healthy controls microarrayed

FeaturesPatients (n = 12)Healthy controls (n = 10)
Age, mean (range) 40.9 (22-49) 39.5 (27-48) 
Sex, female/male 7:5 5:5 
Microarray report   
    Present gene call (mean %) 36.6 37.0 
    Marginal gene call (mean %) 1.5 1.4 
    Glyceraldehyde-3-phosphate dehydrogenase 3′/5′, mean (range) 2.08 (1.19-2.50) 1.74 (1.27-2.40) 
FeaturesPatients (n = 12)Healthy controls (n = 10)
Age, mean (range) 40.9 (22-49) 39.5 (27-48) 
Sex, female/male 7:5 5:5 
Microarray report   
    Present gene call (mean %) 36.6 37.0 
    Marginal gene call (mean %) 1.5 1.4 
    Glyceraldehyde-3-phosphate dehydrogenase 3′/5′, mean (range) 2.08 (1.19-2.50) 1.74 (1.27-2.40) 

After filtering for detection (present) and change (increased) status, fold change analyses of patient specimens (px) compared with healthy controls (signalpx/signalmeanHC) were done using the data mining tool. Eight probe sets were consistently up-regulated at least 4-fold in all six patient specimens compared with the mean of the four healthy controls. In fact, for specimen 3416, the fold-change for one of the probe sets (representing FOS B) was >1,000-fold that of the four healthy controls (Table 3).

To validate this gene set, we arrayed a further testing set of six healthy controls and six patient specimens. The same eight probe sets were found to be consistently up-regulated in this testing set. All patient specimens exhibited at least 4-fold up-regulation in all eight probe sets compared with the mean of the six healthy controls except for one probe set (representing KRT24) in specimens 3838 (0.2-fold) and 3894 (3.3-fold), respectively (Table 3). This anomaly could have been due to the loss of the particular transcript in these specimens.

There were 9,628 probe sets differentially expressed between patients and healthy controls according to t test analyses using the Spotfire software. Hierarchical clustering (using Euclidean distance) was then done (Fig. 1A). Specimens from patients and healthy controls were grouped into separate clusters. Interestingly, specimens from patients in cluster 3 (columns 16-22) whose tumors had clinical features associated with higher malignancies, or were later found to have metastasized, were grouped separately from the rest of the patients in cluster 2 (columns 11-15). For example, one of these patients in cluster 3 whose tumor was Dukes B (T3N0) at the time of the operation was subsequently found to have lung and liver metastases 9 and 18 months later, respectively. Thus, genome-wide expression profiling was able to differentiate the mucosa of patients from healthy controls as well as the mucosa of patients whose tumors were of higher malignancy from those of lower malignancy. All eight probe sets identified by fold change analyses were among the 9,628 probe sets identified by t test analyses.

Fig. 1.

Hierarchical clustering (A) and principal component analysis (B) for all specimens (n = 22). A, rows, differentially expressed genes; columns, specimens [left to right: columns 1-10, mucosa specimens from healthy controls (cluster 1); columns 11-22, mucosa specimens from patients (clusters 2 and 3)]. Red and green, relative high and low expression, respectively; *, specimen arrayed in the training set. B, Eigenvalues indicate that four principal components account for 95% of the variability; red circles, 10 healthy controls; blue circles, 12 patient specimens.

Fig. 1.

Hierarchical clustering (A) and principal component analysis (B) for all specimens (n = 22). A, rows, differentially expressed genes; columns, specimens [left to right: columns 1-10, mucosa specimens from healthy controls (cluster 1); columns 11-22, mucosa specimens from patients (clusters 2 and 3)]. Red and green, relative high and low expression, respectively; *, specimen arrayed in the training set. B, Eigenvalues indicate that four principal components account for 95% of the variability; red circles, 10 healthy controls; blue circles, 12 patient specimens.

Close modal

We then did principal component analysis to verify whether these eight probe sets could group the specimens correctly. Normalized principal component scores clustered all 10 healthy controls together, whereas the patient specimens were more heterogeneous but never overlapped with the healthy controls (Fig. 1B). K-means clustering (data not shown) also indicated that patient specimens clustered separately from the healthy controls. Therefore, the data suggests that these eight probe sets could differentiate the mucosa of patients from that of healthy controls. Eigenvector analysis indicates that the seven genes contributed equally to the principal components.

Data mining the eight up-regulated probe sets. These eight probe sets were fed into the NetAffy web site and Gene Ontology database for further annotation. They were found to comprise seven genes, i.e., cysteine-rich angiogenic inducer 61 (CYR61), ubiquitin COOH terminus esterase L1 (UCHL1), FBJ murine osteosarcoma viral oncogene homologue (FOS), murine osteosarcoma viral oncogene homologue B (FOS B), early growth response 1 (EGR1), vasoactive intestinal peptide (VIP), and keratin 24 (KRT24). CYR61 was represented by two probe sets. These seven genes were found to participate in various biological processes in multiple cellular compartments, ranging from the nucleus to the extracellular domain (1424). Moreover, they were implicated in carcinogenesis to varying degrees (Table 2). When fed into the BiblioSphere PathwayEdition database, six genes were shown to link distinct signaling pathways into an intricate biological network (Fig. 2). The seventh gene, KRT24, is a novel gene that is separately linked to the Naegeli-Franceschetti-Jadassohn syndrome (24).

Table 2.

Seven consistently up-regulated genes in normal-appearing mucosa of human early onset CRC patients

No.GenesBiological process/molecular functionCellular compartmentImplication in carcinogenesis
(1) CYR61 Insulin-like growth factor–binding protein, regulation of cell growth, adhesion, angiogenesis, chemotaxis, wound healing, and morphogenesis Extracellular Overexpression of Cyr61 activates integrin-linked kinase-mediated Akt and β-catenin-TCF signaling pathway 
(2) UCHL1 Ubiquitin-dependent protein catabolism Cytoplasm UCHL1 is highly expressed in lung cancer. May contribute to p27 degradation 
(3) FOS DNA binding, regulation of transcription from RNA Pol II; cell growth/maintenance, inflammatory response, development Nucleus FOS, FOS B, FOS L1, and FOS L2 are proto-oncogenes which are leucine zipper proteins that dimerize with proteins of the JUN family, forming transcription factor complex activating protein-1 
(4) FOS B  Nucleus  
(5) EGR1 Regulation of transcription—zinc ion binding Nucleus Neovascularization, microvascular endothelial cell growth, tumor angiogenesis, and tumor growth are processes critically dependent on Egr1 
(6) VIP G protein–coupled receptor protein signaling, positive regulation of cell proliferation Extracellular VIP increases proliferation of cancer cells, elevates cyclic AMP and expression of c-fos, c-jun, c-myc, and vascular endothelial growth factor 
(7) KRT24 Structural molecular activity (intermediate filament). Influence cellular response to proapoptotic signals and the routing of membrane proteins in polarized epithelial cells Membrane Cytokeratin 20 and 7 protein and RNA expression are up-regulated in tumor cells. KRT is up-regulated during wound repair and in diseased skin. Mutations underlie several epithelial fragility disorders 
No.GenesBiological process/molecular functionCellular compartmentImplication in carcinogenesis
(1) CYR61 Insulin-like growth factor–binding protein, regulation of cell growth, adhesion, angiogenesis, chemotaxis, wound healing, and morphogenesis Extracellular Overexpression of Cyr61 activates integrin-linked kinase-mediated Akt and β-catenin-TCF signaling pathway 
(2) UCHL1 Ubiquitin-dependent protein catabolism Cytoplasm UCHL1 is highly expressed in lung cancer. May contribute to p27 degradation 
(3) FOS DNA binding, regulation of transcription from RNA Pol II; cell growth/maintenance, inflammatory response, development Nucleus FOS, FOS B, FOS L1, and FOS L2 are proto-oncogenes which are leucine zipper proteins that dimerize with proteins of the JUN family, forming transcription factor complex activating protein-1 
(4) FOS B  Nucleus  
(5) EGR1 Regulation of transcription—zinc ion binding Nucleus Neovascularization, microvascular endothelial cell growth, tumor angiogenesis, and tumor growth are processes critically dependent on Egr1 
(6) VIP G protein–coupled receptor protein signaling, positive regulation of cell proliferation Extracellular VIP increases proliferation of cancer cells, elevates cyclic AMP and expression of c-fos, c-jun, c-myc, and vascular endothelial growth factor 
(7) KRT24 Structural molecular activity (intermediate filament). Influence cellular response to proapoptotic signals and the routing of membrane proteins in polarized epithelial cells Membrane Cytokeratin 20 and 7 protein and RNA expression are up-regulated in tumor cells. KRT is up-regulated during wound repair and in diseased skin. Mutations underlie several epithelial fragility disorders 
Fig. 2.

Pathway analysis by Bibliosphere PathwayEdition database. Seven probe sets comprising six genes integrate diverse signaling and metabolic pathways into a biological network.

Fig. 2.

Pathway analysis by Bibliosphere PathwayEdition database. Seven probe sets comprising six genes integrate diverse signaling and metabolic pathways into a biological network.

Close modal

Validation of the seven-gene set with quantitative real-time PCR. To verify that the seven genes were indeed up-regulated in normal-appearing mucosa of patients compared with the healthy controls and were not an experimental artifact attributable to the microarray platform, a second independent assay was conducted. Quantitative real-time PCR done on all 12 patient specimens previously arrayed, and 9 other patient specimens, indicated that the seven genes were consistently up-regulated in all 21 patient specimens. In fact, real-time PCR seemed to be a more sensitive assay than microarray assays, as the fold change analysis for the former was generally higher than the latter (Table 3), which is in agreement with the findings of earlier studies (25). Regression plot analyses for the 12 patient specimens also shows that, except for a few outliers, the fold change by both microarray and real-time PCR assays for the seven genes independently (Fig. 3A-G) or combined (Fig. 3H) were positively correlated.

Table 3.

Fold change comparison in mucosa of early onset patients by microarray and real-time PCR assays

Specimen no.Genes
CYR61
EGR1
FOS
FOSB
UCHL1
VIP
KRT24
RT-PCRMicroarrayRT-PCRMicrorrayRT-PCRMicroarrayRT-PCRMicroarrayRT-PCRMicroarrayRT-PCRMicroarrayRT-PCRMicroarray
3335 111.6 44.0 30.9 14.9 184.3 68.7 67.7 72.4 39.9 14.1 74.6 77.5 698.8 8.3 
3416 359.0 70.6 158.3 36.4 414.8 131.6 1,409.8 1,099.8 67.3 19.8 81.7 39.3 1,917.5 15.7 
3446 171.6 37.3 37.3 11.6 83.9 45.7 115.1 124.1 22.4 9.6 40.2 23.4 1,132.3 14.3 
3578 561.3 86.6 125.0 15.7 306.8 48.8 248.4 149.5 29.9 9.2 34.1 14.7 1,300.6 13.3 
3798 107.8* 131.4* 32.9 30.9 64.3 37.7 281.4 306.2 11.4* 45.2* 5.1* 227.1* 61.0* 22.8* 
3950 40.9 24.5 10.4 6.7 25.5 15.2 47.2 32.8 10.5 5.4 35.7 23.3 403.1 6.6 
2965 379.7 87.9 24.4 21.1 87.3 23.5 58.3 102.7 21.7 4.0 94.4 35.5 27.2* 6.4* 
3838 161.3 60.6 25.1 17.5 129.1 28.1 58.7 62.4 128.1 30.9 81.4 46.7 6.0 0.2 
3216 78.5 30.9 15.2 11.7 219.4 43.8 101.5 84.0 14.5 5.9 20.0 11.8 135.5 7.2 
3894 288.8 90.2 32.2 23.9 149.9 24.5 127.1 135.4 24.8 11.3 68.9 34.1 202.3 3.3 
4145 66.2 30.3 32.9 8.5 82.4 41.9 134.8 85.9 9.8 6.3 22.7 8.5 896.1 13.1 
4160 446.6 65.6 58.3 13.9 80.4 22.2 131.6 45.4 54.1 16.0 299.5 55.4 527.3 6.5 
Specimen no.Genes
CYR61
EGR1
FOS
FOSB
UCHL1
VIP
KRT24
RT-PCRMicroarrayRT-PCRMicrorrayRT-PCRMicroarrayRT-PCRMicroarrayRT-PCRMicroarrayRT-PCRMicroarrayRT-PCRMicroarray
3335 111.6 44.0 30.9 14.9 184.3 68.7 67.7 72.4 39.9 14.1 74.6 77.5 698.8 8.3 
3416 359.0 70.6 158.3 36.4 414.8 131.6 1,409.8 1,099.8 67.3 19.8 81.7 39.3 1,917.5 15.7 
3446 171.6 37.3 37.3 11.6 83.9 45.7 115.1 124.1 22.4 9.6 40.2 23.4 1,132.3 14.3 
3578 561.3 86.6 125.0 15.7 306.8 48.8 248.4 149.5 29.9 9.2 34.1 14.7 1,300.6 13.3 
3798 107.8* 131.4* 32.9 30.9 64.3 37.7 281.4 306.2 11.4* 45.2* 5.1* 227.1* 61.0* 22.8* 
3950 40.9 24.5 10.4 6.7 25.5 15.2 47.2 32.8 10.5 5.4 35.7 23.3 403.1 6.6 
2965 379.7 87.9 24.4 21.1 87.3 23.5 58.3 102.7 21.7 4.0 94.4 35.5 27.2* 6.4* 
3838 161.3 60.6 25.1 17.5 129.1 28.1 58.7 62.4 128.1 30.9 81.4 46.7 6.0 0.2 
3216 78.5 30.9 15.2 11.7 219.4 43.8 101.5 84.0 14.5 5.9 20.0 11.8 135.5 7.2 
3894 288.8 90.2 32.2 23.9 149.9 24.5 127.1 135.4 24.8 11.3 68.9 34.1 202.3 3.3 
4145 66.2 30.3 32.9 8.5 82.4 41.9 134.8 85.9 9.8 6.3 22.7 8.5 896.1 13.1 
4160 446.6 65.6 58.3 13.9 80.4 22.2 131.6 45.4 54.1 16.0 299.5 55.4 527.3 6.5 

Abbreviation: RT-PCR, real-time PCR assays.

*

Indicates outliers.

Fig. 3.

Regression plots for fold change by microarray (Y-axis) and quantitative real-time PCR (X-axis) assays for the seven genes individually (A-G) or combined (H). ▴, outliers (A, E-G); outliers are points >2 SD away from the regression line; ▪ and ♦, KRT24 and all other genes, respectively (H).

Fig. 3.

Regression plots for fold change by microarray (Y-axis) and quantitative real-time PCR (X-axis) assays for the seven genes individually (A-G) or combined (H). ▴, outliers (A, E-G); outliers are points >2 SD away from the regression line; ▪ and ♦, KRT24 and all other genes, respectively (H).

Close modal

Hitherto, it was generally assumed that normal-appearing mucosa adjacent to a tumor in the patient's colon was metabolically normal. Previous studies have often used such “matched” tissue as a baseline for comparisons with tumor. We show in the present study that the gene expression levels in morphologically normal-appearing mucosa from patients with cancer were vastly different from that of healthy controls, suggesting that the mucosa of patients were actually not normal but were already “primed” for carcinogenesis (Fig. 1A).

Stringent criteria were adopted to select for potential markers of early onset CRC without dominant family history. Patients and healthy controls were homogeneous with respect to age and ethnicity, and were tissue-matched to minimize artifacts attributable to age, ethnicity, or tissue-specific expression. Furthermore, only non–FAP and non-HNPCC patient specimens were included to minimize sample heterogeneity. To minimize variability due to stochastic fluctuation (26), the signal from each patient was compared with the mean of several healthy controls. The order in which the healthy controls and patients specimens were arrayed was also randomized to avoid bias from signals that could inadvertently be introduced by the machine (27).

Seven consistently up-regulated genes selected from a training set were validated by a completely independent testing set. The microarray data were further validated by quantitative real-time PCR assay (Table 3; Fig. 3), as suggested by the conclusions of earlier studies (9, 10).

These seven genes play key roles in diverse biological processes (Table 2) such as transcription regulation (FOS, FOS B, EGR1), angiogenesis (EGR1, VIP, CYR61), apoptosis (KRT-24), adhesive, migratory, and inflammatory signaling (CYR61, KRT24), and protein catabolism (UCHL1). Furthermore, these genes integrate not only known colorectal tumorigenesis pathways such as the Wnt, PI3K, MAP kinase, hypoxia, and G protein–coupled receptor signaling pathways, but also the insulin-signaling (glucose metabolism), the Ca2+/NFAT immune, and the human gastrointestinal nervous system via the UCHL1 (PGP9.5)-VIP-EGR1 link (Fig. 2). The latter pathways have generally not been previously regarded as important for the transformation to the cancerous state. Thus, the data suggests that a general disruption of signaling and metabolic pathways is essential for tumorigenesis to occur.

This is consistent with accumulating evidence that early onset CRC among young patients without a dominant family history of the disease was a result of the cumulative effects of many susceptibility genes (4, 28) rather than highly penetrant mutations in a tumor suppressor or oncogene. Although these seven genes may not necessarily be the same genes in which combined effects initiate the tumorigenic process, this panel of genes was the most sensitive to perturbation, and hence, could potentially serve as biomarkers for early changes in the mucosa of individuals at risk. In this regard, it is interesting to note that although FOS and EGR1 (Fig. 2) seem to be the signal transducers or transcription factors that link the various pathways into a network, the other five genes contribute equally to the principal components of the analyses. The data thus imply that all seven genes could potentially be targets for therapeutic intervention.

The only specimen from a patient with a right-sided tumor (Fig. 1B) was within the patients' cluster in the principal component analysis, suggesting that the seven genes were able to differentiate the mucosa of a patient from healthy controls irrespective of the eventual site of tumor development.

To our knowledge, there is only one previous study reporting differential expression by quantitative real-time PCR from 5 out of 15 genes examined in the mucosa of patients compared with healthy controls (29). The patients and controls were not age- and ethnicity-matched, and the 15 genes were not preselected by whole genome profiling but were culled from previous reports. We examined the expression of these five genes in our series. Only cyclooxygenase-2 and proliferating peroxisome-activating receptor-γ were up-regulated and down-regulated, respectively, in all patient specimens. These two genes were not included in our list because they were <4-fold differentially expressed in some of the patient specimens compared with healthy controls.

In summary, we have shown in this study that the expression of a large proportion of genes from the mucosa of patients differed from that of the healthy controls. Use of patients' matched mucosa as a baseline in gene expression studies for carcinoma is likely to underestimate the degree of altered regulation. We further show that seven genes were consistently up-regulated in all patients' mucosa compared with healthy controls using two independent assays, microarray assays and quantitative real-time PCR. These genes are key players in multiple biological processes in several cellular compartments and link diverse pathways into a biological network. The overexpression of these genes could serve as the signal for cell-wide disintegration of homeostasis and “priming” of the colonic epithelium for the cancerous state. Therefore, the findings imply that these seven genes could potentially serve as a susceptibility gene set for early onset CRC.

Grant support: In part by the National Medical Research Council (NMRC/0988/2005), the Biomedical Research Council (BMRC/03/1/27/19/257) of Singapore, a SingHealth Cluster grant (EX060/2004), and the Department of Clinical Research, Singapore General Hospital (P.Y. Cheah).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The authors thank Dr. Michelle Tan of the Department of Clinical Research and Dr. S.C. Liew of the Center for Remote Imaging, Sensing, and Processing, National University of Singapore for technical assistance, and the Singapore Polyposis Registry for clinical data and family history retrieval.

1
Chia KS, Seow A, Lee HP, Shanmugaratnam K. Cancer incidence in Singapore 1993–1997. Singapore Cancer Registry Report No. 5, 2000.
2
Vogelstein B, Kinzler KW. Cancer genes and the pathways they control.
Nat Med
2004
;
10
:
789
–99.
3
Lichtenstein P, Holm NV, Verkasalo PK, et al. Environmental and heritable factors in the causation of cancer—analysis of cohorts of twins from Sweden, Denmark and Finland.
N Engl J Med
2000
;
343
:
78
–85.
4
Houlston RS, Tomlinson IPM. Polymorphisms and colorectal tumor risk.
Gastroenterology
2001
;
121
:
282
–301.
5
Lewis RC, Bostick RM, Xie D, et al. Polymorphism of the cyclin D1 gene, CCND1, and risk for incident sporadic colorectal adenomas.
Cancer Res
2003
;
63
:
8549
–53.
6
Hong Y, Eu KW, Seow-Choen F, Fook-Chong S, Cheah PY. The GG genotype of cyclin D1 G870A polymorphism is associated with increased risk and advanced colorectal cancer in Singapore patients.
Eur J Cancer
2005
;
41
:
1037
–44.
7
Notterman DA, Alon R, Sierk AJ, Levine AJ. Transcriptional gene expression profiles of colorectal adenoma, adenocarcinoma, and normal tissue examined by oligonucleotide arrays.
Cancer Res
2001
;
61
:
3124
–30.
8
Zou T, Selaru FM, Xu Y, et al. Application of cDNA microarrays to generate a molecular taxonomy capable of distinguishing between colon cancer and normal colon.
Oncogene
2002
;
21
:
4855
–62.
9
Birkenkamp-Demtroder K, Christensen LL, Olesen SH, et al. Gene expression in colorectal cancer.
Cancer Res
2002
;
62
:
4352
–63.
10
Chiu S, Hsieh F, Chen S, Chen C, Shu H, Li H. Clinicopathologic correlation of up-regulated genes identified using cDNA microarray and real-time reverse transcription-PCR in human colorectal cancer.
Cancer Epidemiol Biomarkers Prev
2005
;
14
:
437
–43.
11
Watanabe T, Komuro Y, Kiyomatsu T, et al. Prediction of sensitivity of rectal cancer cells in response to preoperative radiotherapy by DNA microarray analysis of gene expression profiles.
Cancer Res
2006
;
66
:
3370
–4.
12
Wang Y, Jatkoe T, Zhang Y, et al. Gene expression profiles and molecular markers to predict recurrence of Dukes' B colon cancer.
J Clin Oncol
2004
;
22
:
1564
–71.
13
Lo HS, Wang Z, Hu Y, et al. Allelic variation in gene expression is common in the human genome.
Genome Res
2003
;
13
:
1855
–62.
14
Tsai MS, Bogart DF, Castaneda JM, Li P, Lupu R. Cyr61 promotes breast tumorigenesis and cancer progression.
Oncogene
2002
;
21
:
8178
–85.
15
Xie D, Yin D, Tong X, et al. Cyr61 is overexpressed in gliomas and involved in integrin-linked kinase-mediated Akt and β-catenin-TCF/Lef signaling pathways.
Cancer Res
2004
;
64
:
1987
–96.
16
Caballero OL, Resto V, Patturajan M, et al. Interaction and colocalization of PGP9.5 with JAB1 and p27(Kip1).
Oncogene
2002
;
21
:
3003
–10.
17
Anlauf M, Schafer MK, Eiden L, Weihe E. Chemical coding of the human gastrointestinal nervous system: cholinergic, VIPergic, and catecholaminergic phenotypes.
J Comp Neurol
2003
;
459
:
90
–111.
18
Monje P, Marinissen MJ, Gutkind JS. Phosphorylation of the carboxyl-terminal transactivation domain of c-Fos by extracellular signal-regulated kinase mediates the transcriptional activation of AP-1 and cellular transformation induced by platelet-derived growth factor.
Mol Cell Biol
2003
;
23
:
7030
–43.
19
Bergman MR, Cheng S, Honbo N, Piacentini L, Karliner JS, Lovett DH. A functional activating protein 1 (AP-1) site regulates matrix metalloproteinase 2 (MMP-2) transcription by cardiac cells through interactions with Jun B-Fra1 and Jun B-Fos B heterodimers.
Biochem J
2003
;
369
:
485
–96.
20
Fujino H, Xu W, Regan JW. Prostaglandin E2 induced functional expression of early growth response factor-1 by EP4, but not EP2, prostanoid receptors via the phosphatidylinositol 3-kinase and extracellular signal-regulated kinases.
J Biol Chem
2003
;
278
:
12151
–6.
21
Fahmy RG, Dass CR, Sun LQ, Chesterman CN, Khachigian LM. Transcription factor Egr-1 supports FGF-dependent angiogenesis during neovascularization and tumor growth.
Nat Med
2003
;
9
:
1026
–32.
22
Hayez N, Harfi I, Lema-Kisoka R, Svoboda M, Corazza F, Sariban E. The neuropeptide vasoactive intestinal peptide (VIP) and pituitary adenylate cyclase activating polypeptide (PACAP) modulate several biochemical pathways in human leukemic myeloid cells.
J Neuroimmunol
2004
;
149
:
167
–81.
23
Tjwa ET, Bradley JM, Keenan CM, Kroese AB, Sharkey KA. Interleukin-1β activates specific populations of enteric neurons and enteric glia in the guinea pig ileum and colon.
Am J Physiol Gastrointest Liver Physiol
2003
;
285
:
G1268
–76.
24
Sprecher E, Itin P, Whittock NV, et al. Refined mapping of Naegeli-Franceschetti-Jadassohn syndrome to a 6 cM interval on chromosome 17q11.2-q21 and investigation of candidate genes.
J Invest Dermatol
2002
;
119
:
692
–8.
25
Smith MW, Walters K, Korth MJ, et al. Gene expression patterns that correlate with hepatitis C and early progression to fibrosis in liver transplant recipients.
Gastroenterology
2006
;
130
:
179
–87.
26
Raser JM, O'Shea EK. Noise in gene expression: origins, consequences, and control.
Science
2005
;
309
:
2010
–3.
27
Ransohoff DF. Bias as a threat to the validity of cancer molecular-marker research.
Nat Rev Cancer
2005
;
5
:
142
–8.
28
Ahmed FE. Gene-gene, gene-environment & multiple interactions in colorectal cancer.
J Environ Sci Health C Environ Carcinog Ecotoxicol Rev
2006
;
24
:
1
–101.
29
Chen L, Hao C, Chiu YSY, et al. Alteration of gene expression in normal-appearing colon mucosa of APCmin mice and human cancer patients.
Cancer Res
2004
;
64
:
3694
–700.