Understanding molecular alterations in colorectal cancer (CRC) is needed to define newbiomarkers and treatment targets. We used oligonucleotide microarrays tomonitor gene expression of about 6,800 known genes and 35,000 expressed sequence tags (ESTs) on five pools (four to six samples in each pool) of total RNA from left-sided sporadic colorectal carcinomas. We compared normal tissue to carcinoma tissue from Dukes’ stages A−D (noninvasive to distant metastasis) and identified 908 known genes and 4,155 ESTs that changed remarkably from normal to tumor tissue. Based on intensive filtering 226 known genes and 157 ESTs were found to be highly relevant for CRC. The alteration of known genes was confirmed in >70% of the cases by array analysis of 25 single samples. Two-way hierarchical average linkage cluster analysis clustered normal tissue together with Dukes’ A, clustered Dukes’ B with Dukes’ C, and clustered Dukes’ D separately. Real-time PCR of 10 known genes and 5 ESTs demonstrated excellent reproducibility of the array-based findings. The most frequently altered genes belonged to functional categories of metabolism (22%), transcription and translation (11%), and cellular processes (9%). Fifteen nuclear encoded mitochondrial proteins were all down-regulated in CRC. We identified several chromosomal locations with clusters of either potential oncogenes or potential tumor suppressors. Some of these, such as aminopeptidase N/CD13 and sigma B3 protein on chromosome 15q25, coincided with a high frequency of loss of heterozygosity. The genes and ESTs presented in this study encode new potential tumor markers as well as potential novel therapeutic targets for prevention or therapy of CRC.

CRC4 is one of the most frequent cancers in the Western world. The prognosis in advanced cases is poor, and more than one-third of the patients will die from progressive disease because the overall survival is about 40% (15–65%) after 5 years (1). Current treatment is only curing a fraction of the patients and has the best effect on the early-stage disease. CRC was one of the first major epithelial cancers in which molecular alterations were described to occur in a systematic fashion during disease progression (2). It was hoped that this knowledge, e.g., of p53 inactivation, could form the basis for new treatments, and several of these, such as gene therapy, have been tried without any improvement in survival. Therefore, new studies aimed at identifying further molecular alterations in CRC and thereby new diagnostic biomarkers and new treatment targets are needed.

Normal cell growth depends on a balanced expression of growth-promoting and growth-suppressing genes. If growth-promoting genes are activated to a state of hyperfunction, either by mutation or quantity, they are termed “oncogenes,” which exert a positive effect on cell growth. One common example is K-ras, which is activated in CRC and many other tumors (3). Growth-suppressing genes are defined as “tumor suppressors” (4) and are commonly lost following Knudson’s “two-hit” hypothesis (loss of one allele and inactivation of the other allele by mutation or promoter methylation; Ref. 5). The best-characterized tumor suppressors are p53 on chromosome 17p13.1, which shows a LOH frequency of 75% in CRC, Smad4/DPC4 (deleted in pancreatic cancer locus 4) on chromosome 18q21, and APC (adenomatous polyposis coli) on chromosome 5q21–22 (6, 7, 8). Most frequent genomic losses in CRC are found at chromosome 18q21 (9), and most frequent gains are found at 20q13 as determined by comparative genomic hybridization (10). Identification of new oncogenes and tumor suppressors would be of benefit to our understanding of the biology of CRC, and they might constitute new targets for therapy.

Several techniques have been used to monitor gene expression, but most of the methods used are time- and labor-intensive. The recently developed microarray technique permits us to investigate the expression of thousands of genes within a single patient sample. Recently, Kitahara et al.(11) and Notterman et al.(12) published expression array-based studies analyzing CRC tumors with corresponding noncancerous colonic epithelia. They identified important new clusters of genes that showed alterations in cancer tissue. In the present study, we have used a similar methodological approach based on Affymetrix GeneChip microarrays with immobilized oligonucleotide probes, and we monitored the expression of 6,500 known genes and 35,000 ESTs, representing unknown genes or genes with unknown function. Our aim was to identify candidate tumor suppressor genes and oncogenes relevant for CRC development. Furthermore, genes with a specific behavior at certain Dukes’ stages were identified as potential classifying genes for those stages.

We found that 908 known genes and 4155 ESTs changed vastly from normal to tumor tissue or in one of the Dukes’ stages. Based on filtering of these data, we ended up with identification of 226 known genes and 157 ESTs of very high relevance for CRC, covering a spectrum of candidate oncogenes and tumor suppressors as well as classifiers. The data were validated by microarray analyses of an independent set of 25 CRC samples and by real-time PCR of the samples used for pool analysis.

Hierarchical cluster analyses clustered normal tissue with Dukes’ A and clustered Dukes’ B with Dukes’ C. Dukes’ D was separated from the other tissues but was connected with the Dukes’ B/C cluster, clearly pointing out the difference between normal tissue and invasive tumor tissue.

The genes discovered were assigned to functional classes and to chromosomal location to gain insight into the functional status of CRC cells and to identify chromosomal “hot spots”. Some of the hot spots for down-regulation were shown by microsatellite-based LOH analysis to be frequently lost and were compared with a large microsatellite-based study on 55 cases of sporadic cancer from Finnish patients that showed common loss of regions possibly harboring tumor suppressors.

The genes presented in this study not only represent new biomarkers but also represent potential targets for application of chemical genetics such as, for example, target-based screenings (13). According to the functional categories presented in this study, the genes selected could be screened rapidly with a limited number of ligands of interest. Application of such target-based screens will probably identify potential novel therapeutic targets usable for prevention or therapy of CRC.

A detailed description of materials and methods is presented as online supplementary data.

Tissue Samples, Patient Information, and RNA Isolation.

Left-sided colorectal tumor samples from the upper rectum, sigmorectum, or sigmoideum from Dukes’ stages A−D were obtained fresh from surgery, taken from the luminal aspect of the tumors in the surgical specimens, transferred immediately to a solution containing SDS and guanidinium isothiocyanate, snap-frozen in liquid nitrogen, and stored at −80°C. The Dukes’ classification of the clinical stage of disease was applied according to the following criteria: (a) Dukes’ A, tumor confined to the bowel wall without penetration of the muscularis propria (equivalent to stage I or modified As-Co stage A and B1); (b) Dukes’ B, tumor has penetrated the muscle wall and possibly infiltrated the pericolic or colorectal fat, with no detectable metastatic lymph nodes (equivalent to stage II or As-Co stage B2); (c) Dukes’ C, lymph node metastasis is detectable (equivalent to stage III or As-Co stages C1-C2); and (d) Dukes’ D, metastases detected in distant organs (e.g., liver; equivalent to stage IV or As-Co stage D). Paired “normal” control samples were obtained from the oral resection margins of the operative specimens by taking mucosal biopsies from the luminal aspect of the bowel wall. Informed consent was obtained from patients to use their specimens and clinicopathological data for research purposes. All tumors were sporadic because none of the patients belonged to families with hereditable CRC or other cancers. The project was approved by the local scientific ethical committee.

Total RNA was isolated from about 100 mg of single tissue samples using a Polytron homogenizer or FastPrep FP120 (Bio101 Savant) followed by treatment with RNAzol (WAK-Chemie Medical) according to the manufacturer’s instructions. Five pools were made from equal amounts of total RNA from the following tissues: (a) normal (n = 6; median patient age, 69 years; tissue from oral resection margin of three Dukes’ stage A and three Dukes’ stage B tumors,); (b) Dukes’ stage A tumors (n = 5; median patient age, 72 years); (c) Dukes’ stage B (n = 6; median patient age, 74 years); (d) Dukes’ stage C (n = 6; median patient age, 67 years); and (e) Dukes’ stage D (n = 4; median patient age, 61 years). An independent set of samples that had not been included in the pools was used for single GeneChip analysis. This included five samples each from normal tissue [tissue from oral resection margin of three Dukes’ stage B tumors, one Dukes’ stage A tumor, and one Dukes’ stage C tumor (median patient age, 68 years)], Dukes’ stage A tumors (median patient age, 68 years), Dukes’ stage B tumors (median patient age, 79 years), Dukes’ stage C tumors (median patient age, 68 years), and Dukes’ stage D tumors (median patient age, 55 years). Detailed clinical information on the samples as well as the approximate percentages of the volume fractions of tumor cells and stromal cells, estimated semiquantitatively by an experienced pathologist using paraffin-embedded diagnostic tissue sections (4–12 sections/tumor), are shown in Supplementary Table 7. All samples had a volume fraction showing at least >53% malignant tumor cells, and two-thirds of the samples had a volume fraction showing >80% malignant tumor cells. The diagnostic samples from paraffin-embedded sections contained both normal and tumor tissue as well as transmural tissue from the colon wall (submucosa and muscularis). The estimated percentage of tumor cells is a conservative estimate because tissue used for RNA extraction was from the most superficial tumor-rich areas, avoiding most of the deeper stroma-containing layers, and the percentage of tumor cells is probably higher in the arrayed samples as in the screened, paraffin-embedded, diagnostic histological tissue sections.

cRNA Preparation.

Reverse transcription was performed on 12 μg of pooled total RNA for 1 h at 42°C using a T7-oligo(dT)24 primer and Superscript II reverse transcriptase (Life Technologies, Inc.). Second-strand cDNA synthesis was performed for 2 h at 16°C using Escherichia coli DNA polymerase I, DNA ligase, and RNase H (Life Technologies, Inc.), followed by incubation in 50 mm NaOH/0.1 mm EDTA for 10 min at 65°C, leading to degradation of rRNA and tRNA. After phenol-chloroform extraction in vitro, transcription was performed for 6 h at 37°C using Bio-16-UTP (Enzo), Bio-11-CTP (Enzo), and T7-Megascript Kit (Ambion). The cRNA was purified on RNeasy spin columns (Qiagen) followed by fragmentation for 35 min at 95°C.

GeneChip Analyses.

Samples were analyzed on GeneChips (Affymetrix Inc., Santa Clara, CA) HuGeneFL (Hum6.8k all) with 6,800 genes and on EST chips 35K (subA−D) with 35,000 ESTs. To check the quality of each sample with regard to GAPDH and β-actin, 15 μg of labeled cRNA were run on Test2 arrays (Affymetrix). Expression CHIPs were hybridized with 15 μg of labeled cRNA for 16 h at 45°C under rotation. CHIPs were stained in an Affymetrix Fluidics station with streptavidin/phycoerythrin, followed by staining with an antistreptavidin antibody and streptavidin/phycoerythrin. The CHIPs were scanned with a HP-Laserscanner, and the data were analyzed with GeneCHIP software and Microarray Suite Software 4.0 (Affymetrix). Each microarray was scaled to “150” as described by Thykjaer et al.(14). Expression patterns of normal tissue were compared with carcinoma tissue derived from Dukes’ stage A−D.

Data Analysis of Pools and Selection of Candidate Genes.

A total of 42,843 data sets (7,129 data sets from HU6800 arrays and 35,714 datasets from 35K subA−D EST arrays) were sorted according to stringent criteria (see overview in Fig. 1). For filter 1, data sets were excluded if (a) the Abs Call was absent (A) in all five pools; (b) the Diff Call was not changed (NC) in all four comparisons or the Diff Call was not changed in three of four comparisons and one comparison was called marginal decreased or marginal increased (MD or MI); or (c) it was marked as AFFX internal control. In the remaining data sets, the absolute value of the sort score had to be ≥0.5 in at least one of the four comparisons. This resulted in 908 known genes (12.7%) and 4,155 ESTs (11.6%; data not shown). For filter 2, candidates were selected by a combination of absolute analysis (Abs Call: P, present; M, medium present) and comparison analysis (Diff Call: NC, not changed; I, increased; D, decreased; MI medium increased; and MD, medium decreased). As a convention, losses of expression from normal to all Dukes’ stages [represented by PAAAA, present (P) in normal and absent (A) in Dukes’ A−D] always had to be accompanied by Diff Call D in all Dukes’ stages and gains of expression by Diff Call I in all Dukes’ stages’ (APPPP). Losses of expression at one or more Dukes’ stages had to be accompanied by Diff Call D at that stage (e.g., PPAPP). For the known genes, the exclusion limit of the Avg Diff was arbitrarily set to ≥50, and for the ESTs, it was set to be ≥300, referring to Avg Diff of (a) normal data if decreased or (b) tumor data if increased from normal to tumor. Fold change criteria of genes present in normal and tumor (PPPPP) were set to be ≥3- or ≤−3-fold (≥ 5- or ≤−5-fold for ESTs), respectively in at least one of the comparisons. Avg Diff and fold change criteria for the selection of ESTs were arbitrarily set to a higher value to exclude false positives. Also, ESTs below these criteria are regarded as candidates of interest and will be subjected to a closer investigation in the future.

Correction.

Twenty-nine genes with fold changes between <3 and >−3 and 21 ESTs with fold changes between <5 and >−5, respectively, were nevertheless regarded as important candidates because they were accompanied by Diff Calls decrease or increase in all four comparisons. This resulted in the most remarkable 226 known genes (3.1%) and 157 ESTs (0.4%; Supplementary Table 4, a and b, respectively). Purported or hypothetical proteins derived from ESTs were identified using the NCBI UniGene database.5 Functional categories of known genes or EST-derived homologous proteins were attributed using databases OMIM, SWISS-PROT and TrEMBL.5

Data Analysis of Single GeneChips.

RNA from 25 single samples (normal colon tissue and Dukes’ stages A−D, five samples each, not included in the pools) was analyzed on HuGeneFL (Hum6.8k) GeneChips. One hundred comparison analyses corresponding to 25 comparisons for each Dukes’ stage were run comparing each normal sample versus each tumor sample using Affymetrix Microarray suite 4.0, Microdatabase 2.0, and Datamining Tool 2.0 (DMT 2.0). t tests and Mann-Whitney tests with an exclusion limit of P < 0.05 and based on Avg Diff analyses were made using Datamining Tool 2.0 software from Affymetrix. Genes showing an increase or decrease in >52% of all comparisons within at least one Dukes’ stage (>13 of 25 comparisons) were selected. (Detailed results on this array set will be published elsewhere).6

Two-way Hierarchical Average Linkage Cluster Analysis.

Using the log Avg Diff of selected genes and EST obtained from pool analysis, clustering was performed by Cluster 2.11 and Treeview 1.5 [Life Sciences Division, Lawrence Berkeley National Laboratory, Department of Molecular and Cellular Biology, University of California at Berkeley, Berkeley, CA (15)].

Real-time PCR.

cDNA was synthesized from single samples analyzed previously as pools on GeneChips. Reverse transcription was performed on 1 μg of total RNA for 1 h at 42°C using a T7-oligo(dT)24 primer and Superscript II reverse transcriptase (Life Technologies, Inc.). Identical amounts of cDNA from each normal sample or each tumor were pooled, and real-time PCR analysis was performed on selected genes using the primers shown in Supplementary Table 1. Triple determinations were performed on a Lightcycler using the FastStart DNA Master SYBR Green I kit (Roche), and GAPDH levels were determined using Lightcycler Primer Set Human GAPDH (Search LC GmbH). All samples were normalized to GAPDH as described by Mensink et al.(16). Avg Diffs from GeneChip analyses were compared with the normalized real-time PCR data.

Microsatellite Analysis.

DNA was extracted from microdissected tumor and normal colon tissue. Microdissection was performed using ×100 magnification in a microscope and serial sections of 4-μm-thick tissue-TEK or 10-μm-thick paraffin-embedded tissues. Sections were rinsed in Xylol, and DNA was extracted using a Puregene DNA extraction kit (Gentra Systems, Minneapolis, MN). DNA from the microdissected tissue was analyzed for allelic deletions (LOH) using microsatellite markers. Sequences of the fluorescence-labeled primers for nine different chromosomal locations are listed in Supplementary Table 2. PCR products were analyzed using an ABI Prism 377 DNA sequencer and Genescan software. LOH was defined as a loss of one allele in case of heterozygosity. In case of homozygosity, scoring was done by peak height, and a decrease of >50% was defined as LOH. MSI was defined as the presence of new bands after PCR amplification of tumor DNA that were not present in corresponding normal DNA. If MSI was present, data could not be used for LOH detection.

Analysis of 5′ CpG Island Methylation.

The 5′ CpG island methylation of selected down-regulated genes (i.e., H1F2, H2BFB, sigma 3B, and hsp70) was examined using a PCR-based methylation assay (17, 18). One μg of genomic DNA from nonmicrodissected tumor and corresponding normal tissue was digested with one of the methylation-sensitive restriction enzymes HapII (identical to HpaII), CfoI, or FnuDII or with the methylation-insensitive restriction enzyme MspI for 2 h using 10 units of restriction enzyme. Subsequently, 50 ng of digested or undigested DNA were amplified using the primers listed in Supplementary Table 3. Exon 2 of the p16 gene has been shown to be methylated in bladder cell lines (17). A methylation analysis of this exon in the bladder cell line T24 was carried out as a positive control for detection of methylation by the PCR-based methylation assay.

Promoter Sequencing.

Approximately 500 bp upstream from the start codon of the promoter region of Acyl-CoA dehydrogenase (Z80345 SCAD) were sequenced as described previously by Gregersen et al.(19) to find an explanation for down-regulation of expression of this gene. Apart from a known polymorphism at position −171, no mutation was detected in DNA from normal and microdissected tumor tissue of 16 different patients (data not shown).

Alteration in Expression of Genes and ESTs at Dukes’ A, B, C, and D in CRC.

By using oligonucleotide arrays, we analyzed the expression level of approximately 6,500 known genes and 35,000 ESTs in biopsies from the left colon (sigmoid and upper rectum). We compared pools of RNA from tumors to a pool of RNA from normal biopsies taken from the oral resection margin. Each pool consisted of equal amounts of four to six samples with identical histology and staging.

In an effort to reduce the very large data set, we applied different filtering methods (Fig. 1). As the first step, we eliminated genes and ESTs that were absent in all samples and genes that did not show a variation from normal to tumors (see “Materials and Methods”). That left us with approximately 12% of the initial genes and ESTs. We then applied a set of more stringent criteria, resulting in a reduced number of informative alterations (Fig. 1). In the supplementary data, the 226 known genes (Supplementary Table 4a) and 157 ESTs (Supplementary Table 4b) are listed that fulfilled all criteria. The most interesting genes and ESTs from the selected candidates that have been validated with single sample analysis are shown in Table 1 (53 genes) and Table 2 (46 ESTs).

Known Genes.

The group of 226 known genes that alter their expression from normal to cancer was subdivided into candidate tumor suppressors that are lost or much reduced in cancer tissue (70 genes) and candidate oncogenes that are gained de novo or much increased in cancer tissue (88 genes). A third set of genes showed a behavior that was related to only one or two of the Dukes’ stages, peaking at this stage or being severely reduced at this stage. Those genes were named candidate Dukes’ classifiers (68 genes; 7 for Dukes’ A, 18 for Dukes’ B, 26 for Dukes’ C, and 17 for Dukes’ D) because they may be used to identify a specific stage of the disease.

To validate the gene alterations on an independent microarray-based data set, we analyzed 25 single samples from normal mucosa and Dukes’ A, B, C, and D (5 samples each), all from the same location in the left colon. The data from the pools were cross-validated against these 25 arrays and showed matching alterations in 72%, corresponding to 161 genes. The criterion used was conformity in >52% of samples (>13 of 25 comparisons within at least one Dukes’ stage; t test and Mann-Whitney test, P < 0.05). The genes fulfilling this criterion are marked with an asterisk in Supplementary Table 4, a and b. Of these, 73 genes should be regarded as general tumor markers because they showed conformity in >52% of comparisons of consecutive Dukes’ stages (28 genes in Dukes’ A, B, C, and D; 40 genes in Dukes’ A, B, and C; and 5 genes in Dukes’ B, C, and D). The correlation was surprisingly good in the case of 90 genes because their alteration could be reproduced in >80% of comparisons in at least one Dukes’ stage (>20 of the 25 comparisons made). In all of the nonconfirmed cases, an alteration was detected, but it was either only valid for <52% of comparisons or not significant upon Mann-Whitney analysis.

To validate the expression alterations using an independent method, we used real-time PCR on pools of 4–6 RNA samples/Dukes’ stage analyzed previously on microarrays (Fig. 2). We used triple determinations and normalization based on GAPDH level. A remarkably good correlation for the 10 known genes and 5 ESTs analyzed was found between the two methods as shown in Fig. 2, indicating that the array-based determinations were highly reproducible.

ESTs.

In case of the ESTs, we detected 45 candidate tumor suppressors and 32 candidate oncogenes as well as 81 ESTs (Supplementary Table 4b) that could be used as classifiers of Dukes’ stages (15 Dukes’ A, 18 Dukes’ B, 16 Dukes’ C, 24 Dukes’ D, 3 Dukes’ A and B, and 5 Dukes’ B and C). In the case of the ESTs, no cross-validation was done with single arrays; however, five of the ESTs were analyzed by the independent real-time PCR method (Fig. 2). As with the known genes, a remarkably good correlation was found between the alterations detected by the two methods on the same RNA sample.

Functional Categories.

The identity of 93 of the 157 ESTs was established using the NCBI Unigene database as either already known genes or genes encoding hypothetical proteins. The function of the proteins encoded by these 93 genes as well as by the 226 known genes was estimated by the OMIM, SWISS-PROT, and TrEMBL databases. Based on the classification of Lander et al. using 12 functional categories (20), these functions were divided into 15 main functional categories and 15 subcategories, resulting in 29 functional groups (miscellaneous group excluded) used as subheadings in Supplementary Table 4, a and b, and in Fig. 3. Multifunctional proteins were categorized according to their most important function (e.g., an ion transporter is found under transporters, although it is also a membrane protein), and proteins with unknown function were listed in a separate group. For the calculations that follow, we only used genes with a known function because the purported function ascribed to some ESTs may still be error-prone. The numerically most prominent group of genes that change expression during cancer progression of the colon encode proteins related to metabolism (22%), in particular, mitochondrial metabolism, followed by genes encoding proteins related to transcription and translation (11%), cellular processes (9%) including cell cycle proteins and proteins involved in growth and differentiation, cell adhesion (8%), protein folding and degradation (7%), transport (6%), immune system (6%), and nucleic acid interaction (6%). Remarkably, proteins related to apoptosis or signaling and signal transduction were only rarely altered. For some groups, it was remarkable that the alterations detected were mainly either up-regulation or down-regulation (Fig. 3). Most of the genes encoding proteins related to cell cycle, methylation, DNA and RNA metabolism, translation, cell adhesion, or proteases were up-regulated. Genes encoding proteins that were mainly down-regulated belonged to the groups membrane and protein trafficking, lipid metabolism, and membrane proteins and kinases/phosphorylases. Nuclear-encoded mitochondrial proteins showed a distinct behavior because the 15 genes encoding these were all down-regulated in at least one of the tumor stages (Table 3). Fourteen of these 15 genes were cross-validated on the 25 single samples and confirmed to be decreased significantly in all samples (P ≤ 0.05). One of these, the SCAD gene, was also shown to be decreased by real-time PCR validation (Fig. 2).

Two-way Hierarchical Average Linkage Cluster Analysis.

Using the log Avg Diff of pool analysis, a hierarchical cluster analysis clustered normal tissue together with Dukes’ A and clustered Dukes’ B together with Dukes’ C. Dukes’ D was separated from the other tissues but was connected with the Dukes’ B/C cluster, clearly pointing out the difference between normal tissue and invasive tumor tissue (Fig. 4). Six selected clusters are shown in Fig. 4, and the complete cluster analysis is shown in Supplementary Fig. 1. Gene clustering did not clearly identify larger gene families.

Chromosomal Location.

We used the NCBI UniSTS database to precisely map as many of the genes and ESTs as possible. Interestingly, many of the candidate oncogenes and candidate tumor suppressors clustered to well-defined chromosomal regions (Table 4). A detailed mapping of each of these hot spots for expression alteration is shown in Supplementary Table 5. Apart from clusters of genes (marked with a black line) showing a similar alteration in expression, some genes encoding candidate oncogenes and candidate tumor suppressors were located next to each other on the same chromosomal arm, as seen, for example, at chromosomes 4q and 6p (Supplementary Table 5). For several locations, it was found that genes within a cluster were very close or arranged next to each other, as shown in an exemplary fashion in Table 5 for chromosomes 5, 10, 12, 13, 15, and 19, respectively, and in Supplementary Table 5. One possible meaning of this could be a common regulation of genes close to each other.

Examination of Mechanisms Leading to Altered Expression.

Several mechanisms can lead to altered expression of a gene, and we decided to look at two of these, methylation of proximal promoter CpG islands and loss of an allele. Both mechanisms can lead to a reduced gene expression; methylation can do so because of reduced binding of transcription factors, and loss of an allele can do so because of a gene dose effect. We searched databases for promoter CpG islands for four genes, H1F2, H2BFB, sigma 3B, and hsp70, that all were vastly reduced in tumor tissue compared with normal tissues. All four promoter sites were methylated at specific sites as detected by a PCR-based methylation assay and validated by sequencing of PCR products (data not shown). However, we did not detect any difference in this pattern between tumor tissue and normal tissue as shown in Supplementary Table 6. Because we did not use microdissected tissue for analysis, methylation involvement in down-regulation cannot be excluded here.

Allelic loss in Danish samples was determined by the use of microsatellites on microdissected tumor tissue followed by comparison with normal tissue in 24 sets of tissues, 16 of which had been analyzed previously on single GeneChips. Analysis of a locus close to TN and trans-Golgi p230 on chromosome 3p22-p21.3 showed 25% LOH, and analysis of a locus close to Cdx1 homeobox transcription factor on chromosome 5q32-q33 showed 17% LOH. Analysis of a locus close to ESTs AA171913 and AA151674 (carbonic anhydrase XII) on chromosome 15q22 showed 13% LOH, and analysis of a locus close to sigma 3B and aminopeptidase N/CD13 on chromosome 15q25 showed 28% LOH. No LOH was found on chromosome 6p21.3, where tenascin-XB1 is located. We concluded that loss of an allele could be one of several mechanisms involved in the down-regulation in these chromosomal areas. Through collaboration, we got access to a Finnish CRC patient material in which LOH had been scored throughout the genome by the use of 372 microsatellites in 55 patients with sporadic CRC (21). Ten chromosomal areas (1p36, 4q21, 5q31, 6p19, 12q13, 14q, 15q11, 17p13, 18q11, and 22q13) showed LOH in >25% of the patients and had adjacent candidate genes detected on microarrays. A comparison with the present expression analysis data showed that in all 10 cases, some or all adjacent candidate genes showed down-regulation. As an example, the microsatellites D15S153 and D15S127 showed 38% and 33% LOH, respectively, in the Finnish material and surrounded the candidate tumor suppressors sigma 3B and aminopeptidase N/CD13. This correlated well with our own findings described above. Microsatellites D17S849 and D17S938 were found to show 52% and 56% LOH, respectively, in the Finnish samples and surrounded the candidate suppressors very-long-chain acyl-CoA dehydrogenase (VLCAD) and EST AA447145 (KIAA0399 protein) as shown in Supplementary Table 5. These four genes are located relatively close to their corresponding marker, which increases the likelihood that these genes are lost in carcinogenesis. We hypothesize that matching regions of LOH and decreased or lost gene expression could represent hot spots for new tumor suppressors.

Oligonucleotide microarray analyses of pools of approximately 6,500 known genes and 35,000 ESTs led to the identification of 226 known genes (3.1%) and 157 ESTs (0.4%) that seemed to be of interest for CRC. For the known genes, 72% could be reproduced by microarray analysis of 25 different single tumor samples not included in the previously analyzed pools, and for both known genes and ESTs, it was possible to find a remarkable correlation with real-time PCR. The number of varying genes detected agrees with recent findings, in which cluster analysis and mathematical selection of expression profiles identified about 2.5% and 1.8% of the genes detected on the microarray to differ between adenocarcinoma and normal colon tissue (12). Our samples for array analysis were not microdissected as in a previous publication (11) and did contain RNA from stromal components such as histiocytes, endothelial cells, muscle cells, and inflammatory cells. However, because of the recent literature on the important signaling between tumor cells and surrounding stromal and inflammatory cells (22), we believe that it would lead to a substantial loss of information if only tumor cells were analyzed. Furthermore, it was recently shown that working with tissue samples that were not microdissected did not influence the ability to cluster adenomas, carcinomas, and normal tissue into distinct trunks (10). In that study, 18 adenocarcinomas and paired normal samples were analyzed (12) using the same microarray system as in the present study. Although analyses were applied to CRC tumors in general without referring to the grading and location of the samples, and candidate genes were selected according to cluster analyses based on fold changes, those data are highly consistent with our data. Examples of up-regulated genes in common are melanoma growth-stimulatory activity (MGSA), transcription factor IIIa (TFIIIA), and S-adenosylhomocysteine hydrolase (AHCY). Examples of down-regulated genes in common are guanylin, TN, adipsin/complement factor D, or carbonic anhydrase IV. Some of those genes have also previously been confirmed by serial analysis of gene expression. Beyond the candidates in common, we present several additional candidates validated by single sample analyses to change significantly, such as MDP4/MDP7 microsomal dipeptidase, aminopeptidase N/CD13, or NGAL/LCN2 lipocalin 2 (neutrophil gelatinase-associated lipocalin). Analyses of EST chips resulted in known genes and hypothetical proteins with and without predicted functions. Those hypothetical proteins could be of use as new molecular markers for CRC and deserve further investigation.

Some of the selected candidate genes can be regarded as progression markers and molecular predictors (changes in at least two consecutive Dukes’ stages) or Dukes’ classifiers (major changes in one Dukes’ stage only). Examples of progression markers are phosphoenolpyruvate carboxykinase (PCK1) and monocyte-derived neutrophil chemotactic factor (MDNCF; IL-8). The latter, which is an angiogenic cytokine, was shown in a CRC cell line-based study to be produced by the tumor cells (23).

Furthermore, our results on the dramatic reduction in expression of molecules like CgA and TN correspond to previous findings. CgA is a neuroendocrine differentiation marker, and only a minority of stage III and stage IV CRC patients (11% and 22%, respectively) showed a positive staining for CgA (24). TN, a stromal component of tumors and a participant in proteolytic processes through its binding to plasminogen, is regarded as a tumor suppressor. A low plasma TN level is related to a shortened survival (25, 26). The good correspondence between our data and those from previous studies indicates the validity of the large number of new alterations that we detect in gene expression. We only analyzed samples from the left colon and upper rectum because of the large difference often found between the left and the right side of colon (27).7

The human genome contains about 30,000–70,000 protein-encoding genes. Lander et al.(20) classified about 40% of these proteins into 12 functional categories. In the present study, the distribution on functional categories was, in some cases, far from what could be expected. As an example, genes involved in cellular processes constitute approximately 2.2% of the genome but 9% of the genes that show changes during CRC progression. This indicates a cancer-specific change of certain functional groups rather than a selection due to the distribution in the genome.

Remarkably, many genes that showed a decrease or loss of expression in at least one Dukes’ stage were mitochondrial proteins. Microarray analysis of 25 CRC single samples confirmed that 14 of 15 selected genes found by pool analyses showed a statistically significant decrease from normal tissue to tumor in at least one Dukes’ stage (t test, P < 0.05). Three of those genes were found to be decreased at all Dukes’ stages. Rhodanese (thiosulfate sulfurtransferase) is involved in forming iron-sulfur complexes and cyanide detoxification and catalyzes the transfer of the sulfane atom of thiosulfate to cyanide to form sulfite and thiocyanate (PROSITE PDOC00322). 3-Hydroxy-3-methylglutaryl CoA synthase (HMG-CoA synthase) catalyzes the condensation of acetyl-CoA with acetoacetyl-CoA to produce HMG-CoA and CoA (PROSITE PDOC00942). The mitochondrial form is responsible for ketone body biosynthesis. SCAD (acyl-CoA dehydrogenase) was found to be significantly decreased in all Dukes’ stages [Dukes’ A, 7-fold (P = 0.018); Dukes’ B, 5-fold (P = 0.008); Dukes’ C, 17-fold (P = 0.002); and Dukes’ D, 4-fold (P = 0.024)]. SCAD is a FAD flavoprotein and catalyzes the β-oxidation of Butyryl-CoA to Acetyl-CoA (PROSITE PDOC00070), and SCAD deficiency results in an increase of butyric acid. Butyrate is the primary source of energy for colonocytes and was shown to act in a contradictory fashion, depending on the availability of other energy sources: at low concentrations, butyrate stimulates growth under glucose- and pyruvate-depleted conditions; whereas it causes apoptosis at the same concentrations in the presence of glucose and pyruvate (28). We hypothesize that one possible cause of CRC carcinogenesis might be located in the mitochondria. Mitochondrial DNA accumulates more damage due to less efficient repair systems in the mitochondria compared with those in the nucleus. Although all of the selected mitochondrial proteins are nuclear encoded, the mitochondrial function is altered directly by expression changes of nuclear-encoded proteins involved in electron transport and oxidative phosphorylation and altered indirectly because oxidative phosphorylation is linked to many pathways of intermediary metabolism, as discussed by Augenlicht and Heerdt (29).

Decreased RNA transcription could be due to hypermethylation of the proximal promoter as shown previously, e.g., the mismatch repair enzyme MLH1 (30). A similar mechanism could cause the down-regulation of the candidate tumor suppressors described in the present study. Consequently, we investigated the 5′ CpG island methylation of four selected down-regulated genes, two ESTs (i.e., H1F2 and H2BFB) and two known genes (i.e., sigma 3B and hsp70). The results of the methylation analyses demonstrated that the 5′ region of all four genes is methylated at specific sites in both normal and tumor tissue. However, we did not detect any difference in this pattern between tumor tissue and normal tissue.

Chromosomal instability and MSI in sporadic CRC are thought to constitute two major pathways for CRC progression. With regard to chromosomal instability, the combination of loss of an allele and inactivation of the other allele by methylation or mutation is considered a general mechanism for inactivation of tumor suppressors. As we identified clusters of down-regulated genes at certain chromosomal locations, we hypothesized that loss of an allele in the same location would further strengthen the likelihood that we had identified candidate tumor suppressors. Thus, we made two approaches: one in which we analyzed microsatellites located in those areas; and one in which we compared our locations with a large study of 55 Finnish sporadic tumor patients scrutinized with 372 microsatellites. Due to the diversity of the genetic background, those comparisons are not universally valid, but the colocation of clusters of candidate suppressors and a raised LOH frequency might strengthen the likelihood of defining new tumor suppressor locations.

Surprisingly, we found several genes of interest located on chromosome 15q, although this is reported to be a region infrequently affected by alterations in CRC. Neogenin at 15q22.3-q23 has been reported to be generally involved in genetic disorders (31, 32), and Park et al.(33) proposed thrombospondin 1 (THBS1) to be a new tumor suppressor gene on chromosome 15q21.1. THBS1 (U12471) was found to be down-regulated (Avg Diff = 83 in normal tissue; pattern, PAPAA; decreased in Dukes A–D). Other genes, e.g., integrin ITA3, a receptor for THBS1, are concomitantly decreasing. Down-regulation of expression of EST AA171913 (carbonic anhydrase XII, CA12) on chromosome 15q22 and aminopeptidase N/CD13 (APN/CD13), sigma 3B protein (adaptor-related protein complex AP-3, sigma 2 subunit) and CIB (calcium and integrin-binding protein) on chromosome 15q25-q26 coincides with a LOH frequency of 38% and 33% in Finnish samples and 13% and 28% in our Danish samples, respectively. Thus far, neither information about the role of chromosome 15q25 nor an involvement of APN/CD13 or sigma 3B protein in CRC carcinogenesis is found in the literature, but according to the data provided here, both genes might resemble novel potential tumor suppressors. APN/CD13 is a member of the peptidase family M1 (zinc-binding metalloproteinase), resembling a type II membrane surface antigen glycoprotein, and catalyzes final protein degradation by removal of single amino acids of small peptides. ANP/CD13, described to be expressed on colon carcinoma Caco-2 cells (34), was found to be substantially decreased in renal cancer tissues (35), and, surprisingly, highly expressed APN/CD13 probably plays a role in the invasion and metastasis of prostate cancer cells (36). Sigma 3B protein, which facilitates the budding of vesicles from the Golgi membrane, is involved in trafficking to lysosomes and might play a role in the recognition of intracellular, tyrosine-based sorting signals (37, 38). Keeping in mind that the basic definition of a tumor suppressor reveals one copy to be lost and the other to be inactivated by methylation or mutation, future research should be directed toward methylation analysis and sequencing of these genes.

It is remarkable that both up-and down-regulated genes occurred in clusters along the chromosomes. In a few cases, this was separated by individual genes behaving in a contradictory fashion, but this does not remove the impression of coregulation of downstream-located genes. Whether this is due to the use of common transcription factors, opening or closing of the double strand, or other events is not known at present. This area deserves additional study because it might reveal important mechanisms for tumor progression.

As can be seen from Supplementary Table 4a, most known genes (70%) that are up-regulated or down-regulated do so during the transition from normal to early-stage Dukes’ A tumors: 37 genes are generated de novo (pattern, APPPP), 30 are lost from normal to tumor (26 PAAAA and 4 PPAAA), 51 are increased in all Dukes’ stages, and 40 are decreased in all Dukes’ stages. Far fewer genes are changing their level of expression during the progression through the different Dukes’ stages. This indicates that the basic properties of tumor cells are acquired in the early tumor stages and that only minor changes, probably those involving the stromal components (22), are needed later on. From a therapeutic point of view, this is important because the same targets seem to be present at most Dukes’ stages.

Fig. 1.

Flow sheet demonstrating the filtering of 42,843 datasets (see “Materials and Methods” for details).

Fig. 1.

Flow sheet demonstrating the filtering of 42,843 datasets (see “Materials and Methods” for details).

Close modal
Fig. 2.

Comparison of GeneChip analyses with real-time PCR analyses. Expression analyses are shown of 10 selected genes and 5 ESTs using pools of normal tissue and Dukes’ stages A−D. Y axis, expression intensities. •, Avg Diffs from GeneChip analyses. Numbers in parentheses represent Abs Calls A (absent). The five-letter code (e.g., PAAAA) represents the expression pattern for tissues (P, present; A, absent). ▪, average of triple determinations normalized to GAPDH from real-time PCR. The error bars indicate the variations between the single determinations. Bars are not visible when triple determinations resulted in nearly identical results.

Fig. 2.

Comparison of GeneChip analyses with real-time PCR analyses. Expression analyses are shown of 10 selected genes and 5 ESTs using pools of normal tissue and Dukes’ stages A−D. Y axis, expression intensities. •, Avg Diffs from GeneChip analyses. Numbers in parentheses represent Abs Calls A (absent). The five-letter code (e.g., PAAAA) represents the expression pattern for tissues (P, present; A, absent). ▪, average of triple determinations normalized to GAPDH from real-time PCR. The error bars indicate the variations between the single determinations. Bars are not visible when triple determinations resulted in nearly identical results.

Close modal
Fig. 3.

Functional categories of known genes with increasing (striated) or decreasing (black) expression from normal tissue to tumor in at least one Dukes’ stage. The majority of genes involved in cellular processes, transcription and translation, protein folding and degradation, cell adhesion, immune response and inflammation, collagens, methylation, and nucleic acid interaction show increasing expression. The majority of proteins involved in Golgi/endoplasmic reticulum-associated protein trafficking and metabolism, especially all of the mitochondrial located proteins, show decreasing expression. Details are shown in Supplementary Table 4a.

Fig. 3.

Functional categories of known genes with increasing (striated) or decreasing (black) expression from normal tissue to tumor in at least one Dukes’ stage. The majority of genes involved in cellular processes, transcription and translation, protein folding and degradation, cell adhesion, immune response and inflammation, collagens, methylation, and nucleic acid interaction show increasing expression. The majority of proteins involved in Golgi/endoplasmic reticulum-associated protein trafficking and metabolism, especially all of the mitochondrial located proteins, show decreasing expression. Details are shown in Supplementary Table 4a.

Close modal
Fig. 4.

Two-way hierarchical average linkage cluster analysis of logarithmic transformed gene expressions selected as described in “Material and Methods.” Rows are individual genes, columns are pools of colorectal normal and tumor tissue. Each square in the matrix represents the expression level of a single gene in a single pool. Yellow and blue indicate an expression level above and below, respectively, the median of that gene across all of the samples. Six selected clusters are shown: A, genes with high expression in normal tissue and low expression in Dukes’ B and C; B, genes with low expression in normal tissue and increased expression in tumors; C, genes with low expression in Dukes’ B; D, genes with high expression in Dukes’ B; E, genes with low expression in Dukes’ C; and F, genes with low expression in Dukes’ D. The complete cluster is presented as Supplementary Fig. 1 [available at Cancer Research Online (http://cancerres.aacrjournals.org)].

Fig. 4.

Two-way hierarchical average linkage cluster analysis of logarithmic transformed gene expressions selected as described in “Material and Methods.” Rows are individual genes, columns are pools of colorectal normal and tumor tissue. Each square in the matrix represents the expression level of a single gene in a single pool. Yellow and blue indicate an expression level above and below, respectively, the median of that gene across all of the samples. Six selected clusters are shown: A, genes with high expression in normal tissue and low expression in Dukes’ B and C; B, genes with low expression in normal tissue and increased expression in tumors; C, genes with low expression in Dukes’ B; D, genes with high expression in Dukes’ B; E, genes with low expression in Dukes’ C; and F, genes with low expression in Dukes’ D. The complete cluster is presented as Supplementary Fig. 1 [available at Cancer Research Online (http://cancerres.aacrjournals.org)].

Close modal

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1

Supported in part by funds from the Karen Elise Jensen Foundation, The Danish Research Council, The University of Aarhus, AROS Applied Biotechnology Aps (Aarhus, Denmark), and GlaxoSmithKline Plc (Rixensart, Belgium).

2

Supplementary data for this article is available at Cancer Research Online (http://cancerres.aacrjournals.org).

4

The abbreviations used are: CRC, colorectal cancer; EST, expressed sequence tag; LOH, loss of heterozygosity; MSI, microsatellite instability; As-Co, Astler-Coller; GAPDH, glyceraldehyde-3-phosphate dehydrogenase; Abs Call, absolute call; Diff Call, difference call; Avg Diff, average difference; NCBI, National Center for Biotechnology Information; CgA, chromogranin A; TN, tetranectin.

5

The URLs referred to are: NCBI’s UniGene database (http://www.ncbi.nlm.nih.gov/UniGene); OMIM (http://www.ncbi.nlm.nih.gov:80/entrez/query.fcgi?db=OMIM); SWISS-PROT (release 39.21, 13-Jun-2001) and TrEMBL (release 16.13 of 08-Jun-2001; http://www.expasy.ch/cgi-bin/sprot-search-ful; GeneMap’99 (http://www.ncbi.nlm.nih.gov/genemap).

6

C. Møller Frederiksen, S. Knudsen, S. Laurberg, and T. F. Ørntoft. Classification of Dukes’ B and C colorectal cancers using expression arrays, manuscript in preparation.

7

K. Birkenkamp-Demtroder, and T. F. Ørntoft, unpublished observations.

Table 1

The 53 most interesting known genes that show up- or down-regulation in CRC when compared with normal colon

All the genes have been validated by single analyses on HUGeneFL arrays with known genes.

Accession no.Gene nameAbs CallAvg DiffFCa% LOHc
NbABCDN-AN-BN-CN-D
1.1 Cell cycle             
 S78187 CDC25 cell division cycle 25B APPPP absd 603 436 627 1041      
1.2 Growth/proliferation/differentiation             
 X14253 CRIPTO-1 growth factor; TDGF1 APPPP abs 532 268 293 370      
 L19183 MAC30 insulin-like-growth factor binding protein 7; IGFBP7 APPPP abs 128 125 224 155      
 J03915 CgA PAAAA 831 abs abs abs abs     24 
 U52101 EMP3 or YMP epithelial membrane protein 3 PAAAA 459 abs abs abs abs     14 
2.0 Apoptosis             
 U33286 CAS chromosome segregation gene homologue PPPPP 86 558 364 340 436 6.5 4.2 4.0 5.1  
3.1 Transcription             
 HG4312-HT4582 TFIIIA transcription factor IIIa APPPP abs 837 1163 948 908      
 L02785 Colon mucosa-associated (DRA) PPPAP 2978 1691 161 abs 678 −1.8 −18.5  −4.4 20 
4.2 Proteases             
 J05257 MDP4, MDP7 microsomal dipeptidase APPPP abs 1606 1102 1403 989      
 X57766 Stromelysin-3/matrix metalloproteinase 11 STMY3 APPPP abs 643 1007 1116 1023      
 L09708 Complement component 2 (C2); C3/C5 convertase APPPP abs 515 578 625 669     26 
 X54667 Cystatin S-CST4 APPPP abs 336 192 352 106      
 M22324 Aminopeptidase N/CD13 PAAAA 657 abs abs abs abs     33 
6.0 Transport             
 X99133 NGAL/LCN2 lipocalin 2 PPPPP 327 2536 1081 1606 2272 7.8 3.3 4.9 6.9  
 S75256 LCN2 lipocalin 2 HNL-neutrophil lipocalin or NGAL PPPPP 361 3453 1389 2491 2925 9.6 3.8 6.9 8.1  
 U14528 DTD sulfate transporter PAAAA 397 abs abs abs abs     26 
 U28249 MAT8 PPAAA 233 47 abs abs abs −5.0    10 
7.0 Cell adhesion             
 M77349 TGFBI or BIGH3 PPPPP 426 1742 2476 1641 2086 4.1 5.8 3.9 4.9  
 U58516 Lactadherin; BA46 breast epithelial antigen PPPPP 169 668 664 740 676 4.0 3.9 4.4 4.0  
 HG2850–HT4814 Biliary glycoprotein/CEACAM1 PPPAP 489 176 123 abs 218 −2.8 −4.0  −2.2 14 
 X98311 CEACAM7 PPPPP 2456 790 192 356 1004 −3.1 −12.8 −6.9 −2.4 14 
 L08010 Reg gene homologue APPPP abs 1165 762 294 3147      
 X64559 TNA tetranectin PAAAA 235 abs abs abs abs     17 
8.0 Membrane & protein trafficking (Golgi/ER associated)             
 M57763 AKF6 ADP-ribosylation factor 6 PAAAA 209 abs abs abs abs      
 U77643 K12 protein precursor, SECTM1 PPAAA 429 121 abs abs abs −3.5     
 X99459 AP3S2 sigma 3B protein PPPAP 722 350 293 abs 383 −2.1 −2.5  −1.9 33 
9.0 Immune system/inflammation             
 X54489 MGSA melanoma growth-stimulatory activity APPPP abs 731 202 330 725      
 M84526 Adipsin, complement factor D CDA (EST) PAAAA 822 abs abs abs abs     15 
 D84239 IgG Fc-binding protein PPPPP 3755 1328 215 586 535 −2.8 −17.5 −6.4 −7.0 10 
10.0 Kinase/phosphorylase             
 X60188 ERK1 protein serine/threonine kinase MAP3 kinase PPPAP 576 366 270 abs 264 −1.6 −2.1  −2.2 14 
 L05144 PCK1 phosphoenolpyruvate carboxykinase PPPPP 2630 532 314 146 783 −4.9 −8.4 −18.0 −3.4 19 
 M16364 CKB creatine kinase-B brain PPPPP 4089 921 575 1855 1113 −4.4 −7.1 −2.2 −3.7 24 
 U85611 PRKDCIP or KIP or CIB PPPPP 1983 588 616 1067 628 −3.4 −3.2 −1.9 −3.2 33 
11.0 Membrane proteins             
 D15049 PTPRH protein tyrosine phosphatase receptor type H PPAAA 277 43 abs abs abs −6.4    11 
12.0 Cytoskeletal/structural             
 Y00503 Keratin 19 PPPPP 1557 709 591 591 597 −2.2 −2.6 −2.6 −2.6 26 
13.0 Metabolism             
 Z70295 GCAP-II guanylate cyclase activator 2B; Uroguanylin; UGNPAAAA 572 abs abs abs abs     29 
 J00306 SMST somatostatin I PAAAA 516 abs abs abs abs     15 
 Z47553 Flavin-containing monooxygenase 5 (FMO5) PAAAA 110 abs abs abs abs     14 
 M83088 Phosphoglucomutase 1 (PGM1) PPAPP 1096 685 abs 592 512 −1.6  −1.9 −2.1 19 
 M97496 GUCA1B guanylate cyclase activator 1B/guanylin PPAPP 4983 390 abs 35 309 −12.8   −16.1 19 
 U29091 SBP selenium-binding protein PPPAP 1849 1635 564 abs 337 −1.1 −3.3  −5.5 14 
 Z80345 SCAD acyl-coenzyme A dehydrogenase (ACADS) PAAAA 326 abs abs abs abs     17 
 L04490 NADH-ubiquinone oxidoreductase subunit PPAPP 683 566 abs 363 317 −1.2  −1.9 −2.2 23 
 X83618 HMG-COA synthase PPAPP 2196 704 abs 942 694 −3.1  −2.3 −3.2 19 
 J04469 CKMT mitochondrial creatine kinase PPPPP 2173 673 484 694 684 −3.2 −4.5 −3.1 −3.2 33 
 M22430 RASF-A PLA2 phospholipase A2, group IIA PPPPP 4983 2169 1305 811 1104 −2.3 −3.8 −6.1 −4.5 29 
 M10050 L-FABP liver fatty acid-binding protein 1 PPPPP 5360 2134 1054 1391 1114 −2.5 −5.1 −3.9 −4.8 
 U67963 HU-K5 lysophospholipase homologue PPPPP 1944 657 722 770 686 −3.0 −2.7 −2.5 −2.8 23 
 M94856 PA-FABP fatty acid-binding protein 5 homologue PPPPP 576 330 261 272 273 −1.7 −2.2 −2.1 −2.1 38 
 M61832 S-Adenosylhomocysteine hydrolase (AHCY) APPPP abs 882 984 817 1009      
 Z26491 Catechol O-methyltransferase PPPPP 123 362 259 294 541 2.9 2.1 2.4 4.4  
 M12759 Step II splicing factor SLU7; Ig J chain gene PPPAP 2193 707 153 abs 148 −3.1 −14.3  −14.8 22 
 M32053 H19 RNA APPPP abs 72 1400 4498 147      
Accession no.Gene nameAbs CallAvg DiffFCa% LOHc
NbABCDN-AN-BN-CN-D
1.1 Cell cycle             
 S78187 CDC25 cell division cycle 25B APPPP absd 603 436 627 1041      
1.2 Growth/proliferation/differentiation             
 X14253 CRIPTO-1 growth factor; TDGF1 APPPP abs 532 268 293 370      
 L19183 MAC30 insulin-like-growth factor binding protein 7; IGFBP7 APPPP abs 128 125 224 155      
 J03915 CgA PAAAA 831 abs abs abs abs     24 
 U52101 EMP3 or YMP epithelial membrane protein 3 PAAAA 459 abs abs abs abs     14 
2.0 Apoptosis             
 U33286 CAS chromosome segregation gene homologue PPPPP 86 558 364 340 436 6.5 4.2 4.0 5.1  
3.1 Transcription             
 HG4312-HT4582 TFIIIA transcription factor IIIa APPPP abs 837 1163 948 908      
 L02785 Colon mucosa-associated (DRA) PPPAP 2978 1691 161 abs 678 −1.8 −18.5  −4.4 20 
4.2 Proteases             
 J05257 MDP4, MDP7 microsomal dipeptidase APPPP abs 1606 1102 1403 989      
 X57766 Stromelysin-3/matrix metalloproteinase 11 STMY3 APPPP abs 643 1007 1116 1023      
 L09708 Complement component 2 (C2); C3/C5 convertase APPPP abs 515 578 625 669     26 
 X54667 Cystatin S-CST4 APPPP abs 336 192 352 106      
 M22324 Aminopeptidase N/CD13 PAAAA 657 abs abs abs abs     33 
6.0 Transport             
 X99133 NGAL/LCN2 lipocalin 2 PPPPP 327 2536 1081 1606 2272 7.8 3.3 4.9 6.9  
 S75256 LCN2 lipocalin 2 HNL-neutrophil lipocalin or NGAL PPPPP 361 3453 1389 2491 2925 9.6 3.8 6.9 8.1  
 U14528 DTD sulfate transporter PAAAA 397 abs abs abs abs     26 
 U28249 MAT8 PPAAA 233 47 abs abs abs −5.0    10 
7.0 Cell adhesion             
 M77349 TGFBI or BIGH3 PPPPP 426 1742 2476 1641 2086 4.1 5.8 3.9 4.9  
 U58516 Lactadherin; BA46 breast epithelial antigen PPPPP 169 668 664 740 676 4.0 3.9 4.4 4.0  
 HG2850–HT4814 Biliary glycoprotein/CEACAM1 PPPAP 489 176 123 abs 218 −2.8 −4.0  −2.2 14 
 X98311 CEACAM7 PPPPP 2456 790 192 356 1004 −3.1 −12.8 −6.9 −2.4 14 
 L08010 Reg gene homologue APPPP abs 1165 762 294 3147      
 X64559 TNA tetranectin PAAAA 235 abs abs abs abs     17 
8.0 Membrane & protein trafficking (Golgi/ER associated)             
 M57763 AKF6 ADP-ribosylation factor 6 PAAAA 209 abs abs abs abs      
 U77643 K12 protein precursor, SECTM1 PPAAA 429 121 abs abs abs −3.5     
 X99459 AP3S2 sigma 3B protein PPPAP 722 350 293 abs 383 −2.1 −2.5  −1.9 33 
9.0 Immune system/inflammation             
 X54489 MGSA melanoma growth-stimulatory activity APPPP abs 731 202 330 725      
 M84526 Adipsin, complement factor D CDA (EST) PAAAA 822 abs abs abs abs     15 
 D84239 IgG Fc-binding protein PPPPP 3755 1328 215 586 535 −2.8 −17.5 −6.4 −7.0 10 
10.0 Kinase/phosphorylase             
 X60188 ERK1 protein serine/threonine kinase MAP3 kinase PPPAP 576 366 270 abs 264 −1.6 −2.1  −2.2 14 
 L05144 PCK1 phosphoenolpyruvate carboxykinase PPPPP 2630 532 314 146 783 −4.9 −8.4 −18.0 −3.4 19 
 M16364 CKB creatine kinase-B brain PPPPP 4089 921 575 1855 1113 −4.4 −7.1 −2.2 −3.7 24 
 U85611 PRKDCIP or KIP or CIB PPPPP 1983 588 616 1067 628 −3.4 −3.2 −1.9 −3.2 33 
11.0 Membrane proteins             
 D15049 PTPRH protein tyrosine phosphatase receptor type H PPAAA 277 43 abs abs abs −6.4    11 
12.0 Cytoskeletal/structural             
 Y00503 Keratin 19 PPPPP 1557 709 591 591 597 −2.2 −2.6 −2.6 −2.6 26 
13.0 Metabolism             
 Z70295 GCAP-II guanylate cyclase activator 2B; Uroguanylin; UGNPAAAA 572 abs abs abs abs     29 
 J00306 SMST somatostatin I PAAAA 516 abs abs abs abs     15 
 Z47553 Flavin-containing monooxygenase 5 (FMO5) PAAAA 110 abs abs abs abs     14 
 M83088 Phosphoglucomutase 1 (PGM1) PPAPP 1096 685 abs 592 512 −1.6  −1.9 −2.1 19 
 M97496 GUCA1B guanylate cyclase activator 1B/guanylin PPAPP 4983 390 abs 35 309 −12.8   −16.1 19 
 U29091 SBP selenium-binding protein PPPAP 1849 1635 564 abs 337 −1.1 −3.3  −5.5 14 
 Z80345 SCAD acyl-coenzyme A dehydrogenase (ACADS) PAAAA 326 abs abs abs abs     17 
 L04490 NADH-ubiquinone oxidoreductase subunit PPAPP 683 566 abs 363 317 −1.2  −1.9 −2.2 23 
 X83618 HMG-COA synthase PPAPP 2196 704 abs 942 694 −3.1  −2.3 −3.2 19 
 J04469 CKMT mitochondrial creatine kinase PPPPP 2173 673 484 694 684 −3.2 −4.5 −3.1 −3.2 33 
 M22430 RASF-A PLA2 phospholipase A2, group IIA PPPPP 4983 2169 1305 811 1104 −2.3 −3.8 −6.1 −4.5 29 
 M10050 L-FABP liver fatty acid-binding protein 1 PPPPP 5360 2134 1054 1391 1114 −2.5 −5.1 −3.9 −4.8 
 U67963 HU-K5 lysophospholipase homologue PPPPP 1944 657 722 770 686 −3.0 −2.7 −2.5 −2.8 23 
 M94856 PA-FABP fatty acid-binding protein 5 homologue PPPPP 576 330 261 272 273 −1.7 −2.2 −2.1 −2.1 38 
 M61832 S-Adenosylhomocysteine hydrolase (AHCY) APPPP abs 882 984 817 1009      
 Z26491 Catechol O-methyltransferase PPPPP 123 362 259 294 541 2.9 2.1 2.4 4.4  
 M12759 Step II splicing factor SLU7; Ig J chain gene PPPAP 2193 707 153 abs 148 −3.1 −14.3  −14.8 22 
 M32053 H19 RNA APPPP abs 72 1400 4498 147      
a

FC, fold change calculated on Avg Diff.

b

N, normal; A–D, Dukes’ stage A–D, respectively.

c

Percentage of LOH found in 55 Finnish sporadic CRC samples for a microsatellite marker closest to the gene.

d

abs, Abs Call absent.

Table 2

The 46 most interesting hypothetical proteins from ESTs (predicted functions) that show up- or down-regulation in CRC when compared with normal colon

Accession no.Protein name/descriptionAbs CallAvg DiffFCa calculated% LOHc
NbABCDN-AN-BN-CN-D
AA075642 AJ243224 DMBT1/8kb.2 protein APPPP absd 3364 893 5271 7085      
AA232508 CGI-89 protein; hypothetical protein APPPP abs 2511 2138 1942 3179      
AA609013 Microsomal dipeptidase APPPP abs 2176 3117 3163 2447      
N33920 Diubiquitin and/or ubiquitin-like protein FAT10 APPPP abs 1173 811 425 524      
AA007218 FARP 1 FERM, RhoGEF (ARHGEF) APPPP abs 902 878 884 711      
AA024482 Unnamed protein product DKFZP434G032 protein APPPP abs 828 445 360 443      
AA443793 32% similar to KIAA0561 protein APPPP abs 781 594 847 473      
AA428964 Kallikrein 10 serine protease-like protease APPPP abs 559 1894 969 1037      
T52813 Putative lymphocyte G0/G1 switch protein 2 APPPP abs 471 1827 1597 604      
H27498 SNC73 and protein Tro alpha1 H. myeloma also IgG chain AAPPP abs abs 6246 9295 7876      
N50971 AXIN2 or conductin or Axil PPPPP 129 1031 581 761 599 8.0 4.5 5.9 4.6  
AA194833 CLDN-1 claudin-1 or SEMP1 PPPPP 185 999 1316 328 1281 5.4 7.1 1.8 6.9  
AA121315 KIAA1077 protein PPPPP 429 1764 3493 1724 1220 4.1 8.1 4.0 2.8  
N22015 Hypothetical protein PPPPP 638 3335 3608 2944 2893 5.2 5.7 4.6 4.5  
H04768 EST PAAAA 1078 abs abs abs abs     33 
Z39652 EST PAAAA 959 abs abs abs abs      
T47089 TNXB tenascin-XB1 PAAAA 815 abs abs abs abs     26 
W31906 CAA76365 secretagogin PAAAA 736 abs abs abs abs      
R70212 CD79A antigen, isoform 2 precursor PPAAA 718 227 abs abs abs −3.2    14 
R01646 Homology to mouse Pcbp1-poly(rC)-binding protein 1 PAAAA 607 abs abs abs abs     30 
N33927 H2B histone family, member B PPAAA 599 312 abs abs abs −1.9    26 
AA099820 EST (BAC clone AC016778) PAAAA 587 abs abs abs abs      
T90190 Histone family, member 2 PPAAA 574 102 abs abs abs −5.6    26 
AA319615 Secretory carrier membrane protein 2 (SCAMP2) PAAAA 568 abs abs abs abs      
T40995 ADH γ 2 subunit (aa 1–375) PPAAA 1973 841 abs abs abs −2.3    34 
T71132 VIPR1 vasoactive intestinal polypeptide receptor 1 PPAAA 325 131 abs abs abs −2.5    17 
H07011 Tetraspan NET-6 mRNA; transmembrane 4 superfamily PAAAA 324 abs abs abs abs     24 
N30796 HT018 protein PPPAA 1338 584 217 abs abs −2.3 −6.2   31 
AA447145 KIAA0399 protein PPPAA 374 123 64 abs abs −3.0 −5.8   56 
W86850 CTP mitochondrial citrate transporter, solute carrier family 20 PPPPA 644 273 −63 474 abs −2.4 −10.2 −1.4  24 
AA058357 Carcinoembryonic antigen-related cell adhesion molecule 7 PPPPP 5524 1757 232 460 1392 −3.1 −23.8 −12.0 −4.0 14 
T90492 Step II splicing factor SLU7 PPPPP 4498 1829 468 321 623 −2.5 −9.6 −14.0 −7.2 22 
M12759 Step II splicing factor SLU7; IgJ chain gene PPPPP 2628 686 239 107 190 −3.8 −11.0 −24.6 −13.8 22 
AA405715 ST2 syntenin-2 protein PPPPP 4188 1080 542 521 1120 −3.9 −7.7 −8.0 −3.7 24 
N80129 MT-11 protein; metallothionein IX; metallothionein 1L PPPPP 3623 1009 323 516 737 −3.6 −11.2 −7.0 −4.9 18 
M12272 ADH3 alcohol dehydrogenase class I γ subunit PPPPP 3368 904 266 120 290 −3.7 −12.7 −28.1 −11.6 34 
AA253330 Hypothetical protein PPPPP 2590 740 448 420 697 −3.5 −5.8 −6.2 −3.7 38 
M83670 Carbonic anhydrase IV PPPPP 2138 432 249 342 249 −4.9 −8.6 −6.3 −8.6 20 
AA151674 HUMCAIVA carbonic anhydrase IV PPPPP 1820 357 188 116 327 −5.1 −9.7 −15.7 −5.6 38 
Y09616 CES 2 carboxylesterase 2e PPPPP 1776 605 299 260 333 −2.9 −5.9 −6.8 −5.3 14 
N23665 L11708 Estradiol-17 β-dehydrogenase 2 PPPPP 1150 246 84 68 143 −4.7 −13.7 −16.9 −8.0 25 
AA171913 Carbonic anhydrase XII (CA12) PPPPP 1117 148 78 85 148 −7.5 −14.3 −13.1 −7.5 38 
AA432130 KIAA0867 protein MondoA PPPPP 1007 475 226 534 437 −2.1 −4.5 −1.9 −2.3 31 
T24011 Nuclear protein H731—programmed cell death 4 PDCD4 PPPPP 900 317 177 157 371 −2.8 −5.1 −5.7 −2.4 13 
AA242824 EST ac005233 PAC clone PPPPP 710 178 246 144 188 −4.0 −2.9 −4.9 −3.8 26 
AA429253 YOTIAO-A kinase (PRKA) anchor protein 9 PPPPP 555 279 103 111 213 −2.0 −5.4 −5.0 −2.6 18 
Accession no.Protein name/descriptionAbs CallAvg DiffFCa calculated% LOHc
NbABCDN-AN-BN-CN-D
AA075642 AJ243224 DMBT1/8kb.2 protein APPPP absd 3364 893 5271 7085      
AA232508 CGI-89 protein; hypothetical protein APPPP abs 2511 2138 1942 3179      
AA609013 Microsomal dipeptidase APPPP abs 2176 3117 3163 2447      
N33920 Diubiquitin and/or ubiquitin-like protein FAT10 APPPP abs 1173 811 425 524      
AA007218 FARP 1 FERM, RhoGEF (ARHGEF) APPPP abs 902 878 884 711      
AA024482 Unnamed protein product DKFZP434G032 protein APPPP abs 828 445 360 443      
AA443793 32% similar to KIAA0561 protein APPPP abs 781 594 847 473      
AA428964 Kallikrein 10 serine protease-like protease APPPP abs 559 1894 969 1037      
T52813 Putative lymphocyte G0/G1 switch protein 2 APPPP abs 471 1827 1597 604      
H27498 SNC73 and protein Tro alpha1 H. myeloma also IgG chain AAPPP abs abs 6246 9295 7876      
N50971 AXIN2 or conductin or Axil PPPPP 129 1031 581 761 599 8.0 4.5 5.9 4.6  
AA194833 CLDN-1 claudin-1 or SEMP1 PPPPP 185 999 1316 328 1281 5.4 7.1 1.8 6.9  
AA121315 KIAA1077 protein PPPPP 429 1764 3493 1724 1220 4.1 8.1 4.0 2.8  
N22015 Hypothetical protein PPPPP 638 3335 3608 2944 2893 5.2 5.7 4.6 4.5  
H04768 EST PAAAA 1078 abs abs abs abs     33 
Z39652 EST PAAAA 959 abs abs abs abs      
T47089 TNXB tenascin-XB1 PAAAA 815 abs abs abs abs     26 
W31906 CAA76365 secretagogin PAAAA 736 abs abs abs abs      
R70212 CD79A antigen, isoform 2 precursor PPAAA 718 227 abs abs abs −3.2    14 
R01646 Homology to mouse Pcbp1-poly(rC)-binding protein 1 PAAAA 607 abs abs abs abs     30 
N33927 H2B histone family, member B PPAAA 599 312 abs abs abs −1.9    26 
AA099820 EST (BAC clone AC016778) PAAAA 587 abs abs abs abs      
T90190 Histone family, member 2 PPAAA 574 102 abs abs abs −5.6    26 
AA319615 Secretory carrier membrane protein 2 (SCAMP2) PAAAA 568 abs abs abs abs      
T40995 ADH γ 2 subunit (aa 1–375) PPAAA 1973 841 abs abs abs −2.3    34 
T71132 VIPR1 vasoactive intestinal polypeptide receptor 1 PPAAA 325 131 abs abs abs −2.5    17 
H07011 Tetraspan NET-6 mRNA; transmembrane 4 superfamily PAAAA 324 abs abs abs abs     24 
N30796 HT018 protein PPPAA 1338 584 217 abs abs −2.3 −6.2   31 
AA447145 KIAA0399 protein PPPAA 374 123 64 abs abs −3.0 −5.8   56 
W86850 CTP mitochondrial citrate transporter, solute carrier family 20 PPPPA 644 273 −63 474 abs −2.4 −10.2 −1.4  24 
AA058357 Carcinoembryonic antigen-related cell adhesion molecule 7 PPPPP 5524 1757 232 460 1392 −3.1 −23.8 −12.0 −4.0 14 
T90492 Step II splicing factor SLU7 PPPPP 4498 1829 468 321 623 −2.5 −9.6 −14.0 −7.2 22 
M12759 Step II splicing factor SLU7; IgJ chain gene PPPPP 2628 686 239 107 190 −3.8 −11.0 −24.6 −13.8 22 
AA405715 ST2 syntenin-2 protein PPPPP 4188 1080 542 521 1120 −3.9 −7.7 −8.0 −3.7 24 
N80129 MT-11 protein; metallothionein IX; metallothionein 1L PPPPP 3623 1009 323 516 737 −3.6 −11.2 −7.0 −4.9 18 
M12272 ADH3 alcohol dehydrogenase class I γ subunit PPPPP 3368 904 266 120 290 −3.7 −12.7 −28.1 −11.6 34 
AA253330 Hypothetical protein PPPPP 2590 740 448 420 697 −3.5 −5.8 −6.2 −3.7 38 
M83670 Carbonic anhydrase IV PPPPP 2138 432 249 342 249 −4.9 −8.6 −6.3 −8.6 20 
AA151674 HUMCAIVA carbonic anhydrase IV PPPPP 1820 357 188 116 327 −5.1 −9.7 −15.7 −5.6 38 
Y09616 CES 2 carboxylesterase 2e PPPPP 1776 605 299 260 333 −2.9 −5.9 −6.8 −5.3 14 
N23665 L11708 Estradiol-17 β-dehydrogenase 2 PPPPP 1150 246 84 68 143 −4.7 −13.7 −16.9 −8.0 25 
AA171913 Carbonic anhydrase XII (CA12) PPPPP 1117 148 78 85 148 −7.5 −14.3 −13.1 −7.5 38 
AA432130 KIAA0867 protein MondoA PPPPP 1007 475 226 534 437 −2.1 −4.5 −1.9 −2.3 31 
T24011 Nuclear protein H731—programmed cell death 4 PDCD4 PPPPP 900 317 177 157 371 −2.8 −5.1 −5.7 −2.4 13 
AA242824 EST ac005233 PAC clone PPPPP 710 178 246 144 188 −4.0 −2.9 −4.9 −3.8 26 
AA429253 YOTIAO-A kinase (PRKA) anchor protein 9 PPPPP 555 279 103 111 213 −2.0 −5.4 −5.0 −2.6 18 
a

FC, fold change calculated on Avg Diff.

b

N, normal; A–D, Dukes’ stage A–D, respectively.

c

Percentage of LOH found in 55 Finnish sporadic CRC samples for a microsatellite marker closest to the EST or gene.

d

abs, Abs Call absent.

e

Validated by single analyses on HUGeneFL-arrays with known genes.

Table 3

The 15 nuclear-encoded mitochondrial located proteins (known genes) that show up- or down-regulation in CRC when compared with normal colon

Accession no.Gene nameAbs CallAvg DiffFCa% LOHcChromosome
NbABCDN-AN-BN-CN-D
Z80345 *d SCAD acyl-coenzyme A dehydrogenase PAAAA 326 abse abs abs abs     17 12q22-qter 
D10511 ACAT 1 mitochondrial acetoacetyl-CoA thiolase PPAPP 198 126 abs 93 145 −1.6  −2.1 −1.4 26 11q22.3–q23.1 
L04490 NADH-ubiquinone oxidoreductase subunit PPAPP 683 566 abs 363 317 −1.2  −1.9 −2.2 23 12p13.3 
X83618 HMG-COA synthase PPAPP 2196 704 abs 942 694 −3.1  −2.3 −3.2 19 1p13–p12 
D16294 Mitochondrial 3-oxoacyl-CoA thiolase PPPAP 529 274 85 abs 524 −1.9 −6.2  −1.0 62 18q 
D16481 Mitochondrial 3-ketoacyl-CoA thiolase β-subunit PPPAP 268 236 214 abs 241 −1.1 −1.3  −1.1 20 2p23 
M37104 Mitochondrial ATPase coupling factor 6 subunit (ATP5A) PPPAP 373 221 167 abs 236 −1.7 −2.2  −1.6 15 19p13.2 
U49352 DECR1 2,4-dienoyl-CoA-reductase, mitochondrial PPPAP 101 58 58 abs 116 −1.7 −1.7  −0.9 30 8q21.3 
U78678  Thioredoxin PPPAP 411 228 343 abs 219 −1.8 −1.2  −1.9 24 22q13.1 
D14710 ATP5A1 ATP synthase α PPPPP 3580 940 640 1168 1691 −3.8 −5.6 −3.1 −2.1  18q12-q21 
D87292 TST thiosulfate sulfurtransferase (rhodanese) PPPPP 5066 2675 1466 2269 1832 −1.9 −3.5 −2.2 −2.8 24 22q11.2-qter 
J04469 CKMT mitochondrial creatine kinase PPPPP 2173 673 484 694 684 −3.2 −4.5 −3.1 −3.2 33 15q15 
M19483 ATP5B ATP synthase PPPPP 2993 1793 1756 1508 1329 −1.7 −1.7 −2.0 −2.3  12p13-qter 
U16660 ECH1 PPPPP 1377 667 532 648 577 −2.1 −2.6 −2.1 −2.4 10 19q13.1 
L21936 SDH2 succinate dehydrogenase flavoprotein subunit PPPPP 882 604 414 624 491 −1.5 −2.1 −1.4 −1.8 20 5p15 
Accession no.Gene nameAbs CallAvg DiffFCa% LOHcChromosome
NbABCDN-AN-BN-CN-D
Z80345 *d SCAD acyl-coenzyme A dehydrogenase PAAAA 326 abse abs abs abs     17 12q22-qter 
D10511 ACAT 1 mitochondrial acetoacetyl-CoA thiolase PPAPP 198 126 abs 93 145 −1.6  −2.1 −1.4 26 11q22.3–q23.1 
L04490 NADH-ubiquinone oxidoreductase subunit PPAPP 683 566 abs 363 317 −1.2  −1.9 −2.2 23 12p13.3 
X83618 HMG-COA synthase PPAPP 2196 704 abs 942 694 −3.1  −2.3 −3.2 19 1p13–p12 
D16294 Mitochondrial 3-oxoacyl-CoA thiolase PPPAP 529 274 85 abs 524 −1.9 −6.2  −1.0 62 18q 
D16481 Mitochondrial 3-ketoacyl-CoA thiolase β-subunit PPPAP 268 236 214 abs 241 −1.1 −1.3  −1.1 20 2p23 
M37104 Mitochondrial ATPase coupling factor 6 subunit (ATP5A) PPPAP 373 221 167 abs 236 −1.7 −2.2  −1.6 15 19p13.2 
U49352 DECR1 2,4-dienoyl-CoA-reductase, mitochondrial PPPAP 101 58 58 abs 116 −1.7 −1.7  −0.9 30 8q21.3 
U78678  Thioredoxin PPPAP 411 228 343 abs 219 −1.8 −1.2  −1.9 24 22q13.1 
D14710 ATP5A1 ATP synthase α PPPPP 3580 940 640 1168 1691 −3.8 −5.6 −3.1 −2.1  18q12-q21 
D87292 TST thiosulfate sulfurtransferase (rhodanese) PPPPP 5066 2675 1466 2269 1832 −1.9 −3.5 −2.2 −2.8 24 22q11.2-qter 
J04469 CKMT mitochondrial creatine kinase PPPPP 2173 673 484 694 684 −3.2 −4.5 −3.1 −3.2 33 15q15 
M19483 ATP5B ATP synthase PPPPP 2993 1793 1756 1508 1329 −1.7 −1.7 −2.0 −2.3  12p13-qter 
U16660 ECH1 PPPPP 1377 667 532 648 577 −2.1 −2.6 −2.1 −2.4 10 19q13.1 
L21936 SDH2 succinate dehydrogenase flavoprotein subunit PPPPP 882 604 414 624 491 −1.5 −2.1 −1.4 −1.8 20 5p15 
a

FC, fold change calculated on Avg Diff.

b

N, normal; A–D, Dukes’ stage A–D, respectively.

c

Percentage of LOH found in 55 Finnish sporadic CRC samples for a microsatellite marker closest to the gene.

d

Asterisk, validated by single analyses on HUGeneFL arrays with known genes.

e

abs, Abs Call absent.

Table 4

Chromosomal localization of candidate tumor suppressors and oncogenes

ChromosomeCandidate suppressorsCandidate oncogenes
CytoBandMban              bCytoBandMbn
1q21 144.07    
3p22–p21.3 42.48    
3q26–q28 181.2    
4q21–q23 70.4 4q12–q13 70.3 
5q31–q32 141.7 5q13–q14 85.8 
6p21.3 29.1 6p21.3 35.3 
   8q11–8q22 86.16 
   9q34 119.4 
10 10q24 102.11    
11    11q13 65.7 
11 11q13 64.5 11p15 0.27 
12 12q22qter 130.1 12q22qter 129.1 
13    13q11–q13 29.1 
14 14q32 96.6    
15 15q22 62.1    
15 15q25 87.6    
16 16p11–p13 24.7    
16 16q22 70.2 16q24 91.4 
17 17p13-pte1 3.8 17q11-qter 64 
18 18q11-q12 21.2    
19 19q13 53.3 11 19p13-pter 12.4 
20    20p13–p11 39.6 
20    20q12–q13 44.4 
   Xq28 134.57 
ChromosomeCandidate suppressorsCandidate oncogenes
CytoBandMban              bCytoBandMbn
1q21 144.07    
3p22–p21.3 42.48    
3q26–q28 181.2    
4q21–q23 70.4 4q12–q13 70.3 
5q31–q32 141.7 5q13–q14 85.8 
6p21.3 29.1 6p21.3 35.3 
   8q11–8q22 86.16 
   9q34 119.4 
10 10q24 102.11    
11    11q13 65.7 
11 11q13 64.5 11p15 0.27 
12 12q22qter 130.1 12q22qter 129.1 
13    13q11–q13 29.1 
14 14q32 96.6    
15 15q22 62.1    
15 15q25 87.6    
16 16p11–p13 24.7    
16 16q22 70.2 16q24 91.4 
17 17p13-pte1 3.8 17q11-qter 64 
18 18q11-q12 21.2    
19 19q13 53.3 11 19p13-pter 12.4 
20    20p13–p11 39.6 
20    20q12–q13 44.4 
   Xq28 134.57 
a

Position in megabases from p-arm given for one gene of special interest.

b

Number of genes in one cluster.

Table 5

Comparison of Finnish LOH data and Danish RNA expression data

Markera% LOHbΔcMbdPositioneCytobandAccession no.Gene nameAbs Callsg
Chromosome 5          
 D5S471 31  115.6 115638887–115639129      
  Down 140.5 140552690–140552841 5q31.1 X13334  CD14 antigen PPPAPg 
  Down 140.7 140730922–140731102 5q31 Z11793  Selenoprotein P PPPAP 
  Down 141.5 141569970–141570137 5q31–q33 U51095  Cdx1 homeobox transcription factor 1 PPPPP 
  Down 141.7 141766842–141766971 5q32–q33.1 U14528  DTD sulfate transporter PAAAA 
 D5S436 26  144.0 144057662–144057905      
          
Chromosome 10          
 D10S1651 13  106.0 106064646–106064841      
  Down 102.3 102358989–110826641 10q24 T24011  Programmed cell death 4 PDCD4 PPPPP 
  Down 110.8 110826642–117550954 10q25.1–q25.2 U67319 *f Lice2 β cysteine protease/CASP7 caspase 7 PPPAA 
  Down 110.8 110826642–119792392 10q25 U15932 DUSP5 dual-specificity phosphatase 5 PPMAP 
  Down 111.7 111712764–111712904 10q24 D87435  GBF1 Golgi-specific brefeldin A resistance factor 1 PPAPP 
 D10S212 24  139.4 139449782–139449970      
          
Chromosome 12          
 D12S85 23  48.4 48482271–48482395      
  Down 52.0 52020469–52020594 12q13 M74491  ARF3 ADP-ribosylation factor 3 PPPPP 
  Down 56.1 56118928–56119150 12q12–q13 X99140  HHB5 type II intermediate filament of hair keratin PAAAA 
  Down 56.6 56643298–56643411 12q13 X52943  ATF7 activating transcription factor 7 PAAAA 
  Down 57.3 57398306–57398699 12q13 U29700  AMHR2 anti-Mullerian hormone type II receptor PAAAA 
  Down 59.2 59290126–59290293 12q13.3 X62535 Diacylglycerol kinase PPPPP 
  Down 64.0 64000244–64000357 12q14.1–q21.1 M35252 CO-029; transmembrane 4 superfamily member 3 PPPPP 
 D12S78 19  102.9 102912296–102912484      
          
Chromosome 13          
 D13S265 30  86.2 86269933–86270056      
  Up 104.4 104428691–104428952 13q32 S79219 Metastasis-associated gene PCCA PPPPP 
  Up 114.6 114681770–114681987 13q34 X05610  Type IV collagen α-2 chain PPPPP 
  Up 114.8 114822797–114823045 13q34 U58090 Cullin 4a APPPA 
  Up 115.6 115689304–115689458 13qtel U18291 APC6 or CDC16Hs cell division cycle 16 PPPPP 
  Up 115.7 111577914–111578014 13q34 M26576 α-1 Collagen type IV gene, exon 52 PPPPP 
 D13S285 34  116.4 116474326–116474421      
          
Chromosome 15          
 D15S994 33  40.3 40247885–40248102      
  Down 46.8 46889444–46889569 15q21 H04768  EST unknown protein PAAAA 
  Down 62.0 62056942–62057079 15q11.2 AA253330  NM_017689 hypothetical protein PPPPP 
  Down 62.1 62112166–62112357 15q22 AA171913  EST carbonic anhydrase XII (CA12) PPPPP 
  Down 62.1 62112166–62112357 15q22 AA151674  EST carbonic anhydrase XII (CA12) PPPPP 
 D15S153 38  65.2 65283718–65283929 15q11.2     
  Up 75.9 75951465–75951610 15q N71781  KIAA1199 protein function unknown APPPP 
  Up 84.4 84433834–84434018 15q25 U58516  Lactadherin/BA46 breast epithelial antigen PPPPP 
  Up 85.5 85588470–85588626 15q25 D45556  CAB98208 hypothetical protein AAAPA 
  Down 87.6 87696272–87696386 15q25–q26 M22324  Aminopeptidase N/CD13 PAAAA 
  Down 87.6 87666605–87666835 15q25 X99459  AP3S2 sigma 3B protein PPPAP 
 D15S127 33  88.6 88688163–88688292      
          
Chromosome 19          
 D19S209 14  31.0 31043823–31044004      
  Down 46.8 46851275–46851437 19pter–p13.3 U28249  MAT8/FXYD3/FXYD PPAAA 
  Down 47.0 47089599–47089721 19q13 AD000684  FIP/USF2 upstream transcription factor 2 PPPPP 
  Down 48.1 48134757–48135013 19 X04106 Calpain 4 calcium-dependent protease PPPPP 
 D19S220 10  49.9 49975902–49976174      
  Down 50.5 50508282–50508382 19q13.1 U16660  ECH1 Δ3,5-Δ2,4-dienoyl-CoA-isomerase PPPPP 
  Down 52.1 52102936–52103060 19q13.1 D84239  IgG Fc-binding protein PPPPP 
  Down 53.3 53357635–56108028 19q13.2 X98311  Carcinoembryonic antigen CGM2 PPPPP 
  Down 53.3 53357635–56108028 19q13.2 AA058357  EST carcinoembryonic antigen-related PPPPP 
  Down 55.1 55137658–55137779 19q13.2 HG2850–HT4814  Biliary glycoprotein PPPAP 
Markera% LOHbΔcMbdPositioneCytobandAccession no.Gene nameAbs Callsg
Chromosome 5          
 D5S471 31  115.6 115638887–115639129      
  Down 140.5 140552690–140552841 5q31.1 X13334  CD14 antigen PPPAPg 
  Down 140.7 140730922–140731102 5q31 Z11793  Selenoprotein P PPPAP 
  Down 141.5 141569970–141570137 5q31–q33 U51095  Cdx1 homeobox transcription factor 1 PPPPP 
  Down 141.7 141766842–141766971 5q32–q33.1 U14528  DTD sulfate transporter PAAAA 
 D5S436 26  144.0 144057662–144057905      
          
Chromosome 10          
 D10S1651 13  106.0 106064646–106064841      
  Down 102.3 102358989–110826641 10q24 T24011  Programmed cell death 4 PDCD4 PPPPP 
  Down 110.8 110826642–117550954 10q25.1–q25.2 U67319 *f Lice2 β cysteine protease/CASP7 caspase 7 PPPAA 
  Down 110.8 110826642–119792392 10q25 U15932 DUSP5 dual-specificity phosphatase 5 PPMAP 
  Down 111.7 111712764–111712904 10q24 D87435  GBF1 Golgi-specific brefeldin A resistance factor 1 PPAPP 
 D10S212 24  139.4 139449782–139449970      
          
Chromosome 12          
 D12S85 23  48.4 48482271–48482395      
  Down 52.0 52020469–52020594 12q13 M74491  ARF3 ADP-ribosylation factor 3 PPPPP 
  Down 56.1 56118928–56119150 12q12–q13 X99140  HHB5 type II intermediate filament of hair keratin PAAAA 
  Down 56.6 56643298–56643411 12q13 X52943  ATF7 activating transcription factor 7 PAAAA 
  Down 57.3 57398306–57398699 12q13 U29700  AMHR2 anti-Mullerian hormone type II receptor PAAAA 
  Down 59.2 59290126–59290293 12q13.3 X62535 Diacylglycerol kinase PPPPP 
  Down 64.0 64000244–64000357 12q14.1–q21.1 M35252 CO-029; transmembrane 4 superfamily member 3 PPPPP 
 D12S78 19  102.9 102912296–102912484      
          
Chromosome 13          
 D13S265 30  86.2 86269933–86270056      
  Up 104.4 104428691–104428952 13q32 S79219 Metastasis-associated gene PCCA PPPPP 
  Up 114.6 114681770–114681987 13q34 X05610  Type IV collagen α-2 chain PPPPP 
  Up 114.8 114822797–114823045 13q34 U58090 Cullin 4a APPPA 
  Up 115.6 115689304–115689458 13qtel U18291 APC6 or CDC16Hs cell division cycle 16 PPPPP 
  Up 115.7 111577914–111578014 13q34 M26576 α-1 Collagen type IV gene, exon 52 PPPPP 
 D13S285 34  116.4 116474326–116474421      
          
Chromosome 15          
 D15S994 33  40.3 40247885–40248102      
  Down 46.8 46889444–46889569 15q21 H04768  EST unknown protein PAAAA 
  Down 62.0 62056942–62057079 15q11.2 AA253330  NM_017689 hypothetical protein PPPPP 
  Down 62.1 62112166–62112357 15q22 AA171913  EST carbonic anhydrase XII (CA12) PPPPP 
  Down 62.1 62112166–62112357 15q22 AA151674  EST carbonic anhydrase XII (CA12) PPPPP 
 D15S153 38  65.2 65283718–65283929 15q11.2     
  Up 75.9 75951465–75951610 15q N71781  KIAA1199 protein function unknown APPPP 
  Up 84.4 84433834–84434018 15q25 U58516  Lactadherin/BA46 breast epithelial antigen PPPPP 
  Up 85.5 85588470–85588626 15q25 D45556  CAB98208 hypothetical protein AAAPA 
  Down 87.6 87696272–87696386 15q25–q26 M22324  Aminopeptidase N/CD13 PAAAA 
  Down 87.6 87666605–87666835 15q25 X99459  AP3S2 sigma 3B protein PPPAP 
 D15S127 33  88.6 88688163–88688292      
          
Chromosome 19          
 D19S209 14  31.0 31043823–31044004      
  Down 46.8 46851275–46851437 19pter–p13.3 U28249  MAT8/FXYD3/FXYD PPAAA 
  Down 47.0 47089599–47089721 19q13 AD000684  FIP/USF2 upstream transcription factor 2 PPPPP 
  Down 48.1 48134757–48135013 19 X04106 Calpain 4 calcium-dependent protease PPPPP 
 D19S220 10  49.9 49975902–49976174      
  Down 50.5 50508282–50508382 19q13.1 U16660  ECH1 Δ3,5-Δ2,4-dienoyl-CoA-isomerase PPPPP 
  Down 52.1 52102936–52103060 19q13.1 D84239  IgG Fc-binding protein PPPPP 
  Down 53.3 53357635–56108028 19q13.2 X98311  Carcinoembryonic antigen CGM2 PPPPP 
  Down 53.3 53357635–56108028 19q13.2 AA058357  EST carcinoembryonic antigen-related PPPPP 
  Down 55.1 55137658–55137779 19q13.2 HG2850–HT4814  Biliary glycoprotein PPPAP 
a

Microsatellite marker.

b

Percentage of LOH found in 55 Finnish sporadic CRC samples for a microsatellite marker closest to the EST or gene.

c

Expression change (up- or down-regulation).

d

Mb, megabases from p-arm.

e

Gene position in nucleotides from the p-arm of the chromosome.

f

Asterisk, Genes not selected as candidates.

g

The expression pattern of normal tissues and Dukes’ A, B, C, and D using A for absent and P for present.

We thank Bente Devantier, Hanne Steen, Inge-Lis Thorsen, and Susanne Bruun for excellent technical assistance; project nurses Birgitte Gustafsson and Edith Kirkedahl for the collection of CRC tissue samples; and Margrethe Kjeldsen (Research Unit for Molecular Medicine, Skejby, Aarhus, Denmark) for sequencing of the SCAD promoter region.

1
Pohl C., Hombach A., Kruis W. Chronic inflammatory bowel disease and cancer.
Hepatogastroenterology
,
47
:
57
-70,  
2000
.
2
Fearon E. R., Hamilton S. R., Vogelstein B. Clonal analysis of human colorectal tumors.
Science (Wash. DC)
,
238
:
193
-197,  
1987
.
3
Forrester K., Almoguera C., Han K., Grizzle W. E., Perucho M. Detection of high incidence of K-ras oncogenes during human colon tumorigenesis.
Nature (Lond.)
,
327
:
298
-303,  
1987
.
4
DeVita V. T., Jr. Hellman S. Rosenberg S. A. eds. .
Cancer—Principles and Practice of Oncology
, Sixth Edition Lippincott Williams & Wilkins Philadelphia  
2001
.
5
Knudson A. G. Hereditary cancer: two hits revisited.
J. Cancer Res. Clin. Oncol.
,
122
:
135
-140,  
1996
.
6
Baker S. J., Fearon E. R., Nigro J. M., Hamilton S. R., Preisinger A. C., Jessup J. M., van Tuinen P., Ledbetter D. H., Barker D. F., Nakamura Y. Chromosome 17 deletions and p53 gene mutations in colorectal carcinomas.
Science (Wash. DC)
,
244
:
217
-221,  
1989
.
7
Chiao P. J., Hunt K. K., Grau A. M., Abramian A., Fleming J., Zhang W., Breslin T., Abbruzzese J. L., Evans D. B. Tumor suppressor gene Smad4/DPC4, its downstream target genes, and regulation of cell cycle.
Ann. N. Y. Acad. Sci.
,
880
:
31
-37,  
1999
.
8
Nishisho I., Nakamura Y., Miyoshi Y., Miki Y., Ando H., Horii A., Koyama K., Utsunomiya J., Baba S., Hedge P. Mutations of chromosome 5q21 genes in FAP and colorectal cancer patients.
Science (Wash. DC)
,
253
:
665
-669,  
1991
.
9
Fearon E. R., Cho K. R., Nigro J. M., Kern S. E., Simons J. W., Ruppert J. M., Hamilton S. R., Preisinger A. C., Thomas G., Kinzler K. W. Identification of a chromosome 18q gene that is altered in colorectal cancers.
Science (Wash. DC)
,
247
:
49
-56,  
1990
.
10
Korn W. M., Yasutake T., Kuo W. L., Warren R. S., Collins C., Tomita M., Gray J., Waldman F. M. Chromosome arm 20q gains and other genomic alterations in colorectal cancer metastatic to liver, as analyzed by comparative genomic hybridization and fluorescence in situ hybridization.
Genes Chromosomes Cancer
,
25
:
82
-90,  
1999
.
11
Kitahara O., Furukawa Y., Tanaka T., Kihara C., Ono K., Yanagawa R., Nita M. E., Takagi T., Nakamura Y., Tsunoda T. Alterations of gene expression during colorectal carcinogenesis revealed by cDNA microarrays after laser-capture microdissection of tumor tissues and normal epithelia.
Cancer Res.
,
61
:
3544
-3549,  
2001
.
12
Notterman D. A., Alon U., Sierk A. J., Levine A. J. Transcriptional gene expression profiles of colorectal adenoma, adenocarcinoma, and normal tissue examined by oligonucleotide arrays.
Cancer Res.
,
61
:
3124
-3130,  
2001
.
13
Stockwell B. R. Chemical genetics: ligand-based discovery of gene function.
Nat. Rev. Genet.
,
1
:
116
-125,  
2000
.
14
Thykjaer T., Workman C., Kruhoffer M., Demtroder K., Wolf H., Andersen L. D., Frederiksen C. M., Knudsen S., Orntoft T. F. Identification of gene expression patterns in superficial and invasive human bladder cancer.
Cancer Res.
,
61
:
2492
-2499,  
2001
.
15
Eisen M. B., Spellman P. T., Brown P. O., Botstein D. Cluster analysis and display of genome-wide expression patterns.
Proc. Natl. Acad. Sci. USA
,
95
:
14863
-14868,  
1998
.
16
Mensink E., van de Locht A., Schattenberg A., Linders E., Schaap N., Geurts v. K., De Witte T. Quantitation of minimal residual disease in Philadelphia chromosome positive chronic myeloid leukaemia patients using real-time quantitative RT-PCR.
Br. J. Haematol.
,
102
:
768
-774,  
1998
.
17
Gonzalez-Zulueta M., Bender C. M., Yang A. S., Nguyen T., Beart R. W., Van Tornout J. M., Jones P. A. Methylation of the 5′ CpG island of the p16/CDKN2 tumor suppressor gene in normal and transformed human tissues correlates with gene silencing.
Cancer Res.
,
55
:
4531
-4535,  
1995
.
18
Zhang S. J., Endo S., Ichikawa T., Washiyama K., Kumanishi T. Frequent deletion and 5′ CpG island methylation of the p16 gene in primary malignant lymphoma of the brain.
Cancer Res.
,
58
:
1231
-1237,  
1998
.
19
Gregersen N., Winter V. S., Corydon M. J., Corydon T. J., Rinaldo P., Ribes A., Martinez G., Bennett M. J., Vianey-Saban C., Bhala A., Hale D. E., Lehnert W., Kmoch S., Roig M., Riudor E., Eiberg H., Andresen B. S., Bross P., Bolund L. A., Kolvraa S. Identification of four new mutations in the short-chain acyl-CoA dehydrogenase (SCAD) gene in two patients: one of the variant alleles, 511C→T, is present at an unexpectedly high frequency in the general population, as was the case for 625G→A, together conferring susceptibility to ethylmalonic aciduria.
Hum. Mol. Genet.
,
7
:
619
-627,  
1998
.
20
Lander E. S., Linton L. M., Birren B., Nusbaum C., Zody M. C., Baldwin J., Devon K., Dewar K., Doyle M., FitzHugh W., et al Initial sequencing and analysis of the human genome.
Nature (Lond.)
,
409
:
860
-921,  
2001
.
21
Laiho P., Launonen V., Lahermo P., Esteller M., Guo M., Herman J. G., Mecklin J-P., Järvinen H., Sistonen P., Kim K-M., Shibata D., Houlston R. S., Aaltonen L. A. Low-level microsatellite instability in most colorectal carcinomas.
Cancer Res.
,
62
:
1166
-1170,  
2002
.
22
Liotta L. A., Kohn E. C. The microenvironment of the tumour-host interface.
Nature (Lond.)
,
411
:
375
-379,  
2001
.
23
Wigmore S. J., Maingay J. P., Fearon K. C., Ross J. A. Endogenous production of IL-8 by human colorectal cancer cells and its regulation by cytokines.
Int. J. Oncol.
,
18
:
467
-473,  
2001
.
24
Grabowski P., Schindler I., Anagnostopoulos I., Foss H. D., Riecken E. O., Mansmann U., Stein H., Berger G., Buhr H. J., Scherubl H. Neuroendocrine differentiation is a relevant prognostic factor in stage III-IV colorectal cancer.
Eur. J. Gastroenterol. Hepatol.
,
13
:
405
-411,  
2001
.
25
Hogdall C. K., Christiansen M., Norgaard-Pedersen B., Bentzen S. M., Kronborg O., Clemmensen I. Plasma tetranectin and colorectal cancer.
Eur. J. Cancer
,
31A
:
888
-894,  
1995
.
26
Hogdall C. K. Human tetranectin: methodological and clinical studies.
APMIS Suppl.
,
86
:
1
-31,  
1998
.
27
Fric P., Sovova V., Sloncova E., Lojda Z., Jirasek A., Cermak J. Different expression of some molecular markers in sporadic cancer of the left and right colon.
Eur. J. Cancer Prev.
,
9
:
265
-268,  
2000
.
28
Singh B., Halestrap A. P., Paraskeva C. Butyrate can act as a stimulator of growth or inducer of apoptosis in human colonic epithelial cell lines depending on the presence of alternative energy sources.
Carcinogenesis (Lond.)
,
18
:
1265
-1270,  
1997
.
29
Augenlicht L. H., Heerdt B. G. Mitochondria: integrators in tumorigenesis?.
Nat. Genet.
,
28
:
104
-105,  
2001
.
30
Cunningham J. M., Christensen E. R., Tester D. J., Kim C. Y., Roche P. C., Burgart L. J., Thibodeau S. N. Hypermethylation of the hMLH1 promoter in colon cancer with microsatellite instability.
Cancer Res.
,
58
:
3455
-3460,  
1998
.
31
Vielmetter J., Chen X. N., Miskevich F., Lane R. P., Yamakawa K., Korenberg J. R., Dreyer W. J. Molecular characterization of human neogenin, a DCC-related protein, and the mapping of its gene (NEO1) to chromosomal position 15q22.3-q23.
Genomics
,
41
:
414
-421,  
1997
.
32
Meyerhardt J. A., Look A. T., Bigner S. H., Fearon E. R. Identification and characterization of neogenin, a DCC-related gene.
Oncogene
,
14
:
1129
-1136,  
1997
.
33
Park W. S., Park J. Y., Oh R. R., Yoo N. J., Lee S. H., Shin M. S., Lee H. K., Han S., Yoon S. K., Kim S. Y., Choi C., Kim P. J., Oh S. T., Lee J. Y. A distinct tumor suppressor gene locus on chromosome 15q21.1 in sporadic form of colorectal cancer.
Cancer Res.
,
60
:
70
-73,  
2000
.
34
Stange T., Kettmann U., Holzhausen H. J., Riemann D. Expression patterns of the ectopeptidases aminopeptidase N/CD13 and dipeptidyl peptidase IV/CD26: immunoultrastructural topographic localization on different types of cultured cells.
Acta Histochem.
,
100
:
157
-169,  
1998
.
35
Ishii K., Usui S., Yamamoto H., Sugimura Y., Tatematsu M., Hirano K. Decreases of metallothionein and aminopeptidase N in renal cancer tissues.
J. Biochem.
,
129
:
253
-258,  
2001
.
36
Ishii K., Usui S., Sugimura Y., Yoshida S., Hioki T., Tatematsu M., Yamamoto H., Hirano K. Aminopeptidase N regulated by zinc in human prostate participates in tumor cell invasion.
Int. J. Cancer
,
92
:
49
-54,  
2001
.
37
Simpson F., Peden A. A., Christopoulou L., Robinson M. S. Characterization of the adaptor-related protein complex. AP-3.
J. Cell Biol.
,
137
:
835
-845,  
1997
.
38
Scales S. J., Gomez M., Kreis T. E. Coat proteins regulating membrane traffic.
Int. Rev. Cytol.
,
195
:
67
-144,  
2000
.