Genetic changes underlie tumor progression and may lead to cancer-specific expression of critical genes. Over 1100 publications have described the use of comparative genomic hybridization (CGH) to analyze the pattern of copy number alterations in cancer, but very few of the genes affected are known. Here, we performed high-resolution CGH analysis on cDNA microarrays in breast cancer and directly compared copy number and mRNA expression levels of 13,824 genes to quantitate the impact of genomic changes on gene expression. We identified and mapped the boundaries of 24 independent amplicons, ranging in size from 0.2 to 12 Mb. Throughout the genome, both high- and low-level copy number changes had a substantial impact on gene expression, with 44% of the highly amplified genes showing overexpression and 10.5% of the highly overexpressed genes being amplified. Statistical analysis with random permutation tests identified 270 genes whose expression levels across 14 samples were systematically attributable to gene amplification. These included most previously described amplified genes in breast cancer and many novel targets for genomic alterations, including the HOXB7 gene, the presence of which in a novel amplicon at 17q21.3 was validated in 10.2% of primary breast cancers and associated with poor patient prognosis. In conclusion, CGH on cDNA microarrays revealed hundreds of novel genes whose overexpression is attributable to gene amplification. These genes may provide insights to the clonal evolution and progression of breast cancer and highlight promising therapeutic targets.

Gene expression patterns revealed by cDNA microarrays have facilitated classification of cancers into biologically distinct categories, some of which may explain the clinical behavior of the tumors (1, 2, 3, 4, 5, 6). Despite this progress in diagnostic classification, the molecular mechanisms underlying gene expression patterns in cancer have remained elusive, and the utility of gene expression profiling in the identification of specific therapeutic targets remains limited.

Accumulation of genetic defects is thought to underlie the clonal evolution of cancer. Identification of the genes that mediate the effects of genetic changes may be important by highlighting transcripts that are actively involved in tumor progression. Such transcripts and their encoded proteins would be ideal targets for anticancer therapies, as demonstrated by the clinical success of new therapies against amplified oncogenes, such as ERBB2 and EGFR(7, 8), in breast cancer and other solid tumors. Besides amplifications of known oncogenes, over 20 recurrent regions of DNA amplification have been mapped in breast cancer by CGH5(9, 10). However, these amplicons are often large and poorly defined, and their impact on gene expression remains unknown.

We hypothesized that genome-wide identification of those gene expression changes that are attributable to underlying gene copy number alterations would highlight transcripts that are actively involved in the causation or maintenance of the malignant phenotype. To identify such transcripts, we applied a combination of cDNA and CGH microarrays to: (a) determine the global impact that gene copy number variation plays in breast cancer development and progression; and (b) identify and characterize those genes whose mRNA expression is most significantly associated with amplification of the corresponding genomic template.

Breast Cancer Cell Lines.

Fourteen breast cancer cell lines (BT-20, BT-474, HCC1428, Hs578t, MCF7, MDA-361, MDA-436, MDA-453, MDA-468, SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the American Type Culture Collection (Manassas, VA). Cells were grown under recommended culture conditions. Genomic DNA and mRNA were isolated using standard protocols.

Copy Number and Expression Analyses by cDNA Microarrays.

The preparation and printing of the 13,824 cDNA clones on glass slides were performed as described (11, 12, 13). Of these clones, 244 represented uncharacterized expressed sequence tags, and the remainder corresponded to known genes. CGH experiments on cDNA microarrays were done as described (14, 15). Briefly, 20 μg of genomic DNA from breast cancer cell lines and normal human WBCs were digested for 14–18 h with AluI and RsaI (Life Technologies, Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six μg of digested cell line DNAs were labeled with Cy3-dUTP (Amersham Pharmacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization (14, 15) and posthybridization washes (13) were done as described. For the expression analyses, a standard reference (Universal Human Reference RNA; Stratagene, La Jolla, CA) was used in all experiments. Forty μg of reference RNA were labeled with Cy3-dUTP and 3.5 μg of test mRNA with Cy5-dUTP, and the labeled cDNAs were hybridized on microarrays as described (13, 15). For both microarray analyses, a laser confocal scanner (Agilent Technologies, Palo Alto, CA) was used to measure the fluorescence intensities at the target locations using the DEARRAY software (16). After background subtraction, average intensities at each clone in the test hybridization were divided by the average intensity of the corresponding clone in the control hybridization. For the copy number analysis, the ratios were normalized on the basis of the distribution of ratios of all targets on the array and for the expression analysis on the basis of 88 housekeeping genes, which were spotted four times onto the array. Low quality measurements (i.e., copy number data with mean reference intensity <100 fluorescent units, and expression data with both test and reference intensity <100 fluorescent units and/or with spot size <50 units) were excluded from the analysis and were treated as missing values. The distributions of fluorescence ratios were used to define cutpoints for increased/decreased copy number. Genes with CGH ratio >1.43 (representing the upper 5% of the CGH ratios across all experiments) were considered to be amplified, and genes with ratio <0.73 (representing the lower 5%) were considered to be deleted.

Statistical Analysis of CGH and cDNA Microarray Data.

To evaluate the influence of copy number alterations on gene expression, we applied the following statistical approach. CGH and cDNA calibrated intensity ratios were log-transformed and normalized using median centering of the values in each cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were median centered. For each gene, the CGH data were represented by a vector that was labeled 1 for amplification (ratio, >1.43) and 0 for no amplification. Amplification was correlated with gene expression using the signal-to-noise statistics (1). We calculated a weight, wg, for each gene as follows:

where mg1, ςg1 and mg0, ςg0 denote the means and SDs for the expression levels for amplified and nonamplified cell lines, respectively. To assess the statistical significance of each weight, we performed 10,000 random permutations of the label vector. The probability that a gene had a larger or equal weight by random permutation than the original weight was denoted by α. A low α (<0.05) indicates a strong association between gene expression and amplification.

Genomic Localization of cDNA Clones and Amplicon Mapping.

Each cDNA clone on the microarray was assigned to a Unigene cluster using the Unigene Build 141.6 A database of genomic sequence alignment information for mRNA sequences was created from the August 2001 freeze of the University of California Santa Cruz’s GoldenPath database.7 The chromosome and bp positions for each cDNA clone were then retrieved by relating these data sets. Amplicons were defined as a CGH copy number ratio >2.0 in at least two adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three adjacent clones in a single cell line. The amplicon start and end positions were extended to include neighboring nonamplified clones (ratio, <1.5). The amplicon size determination was partially dependent on local clone density.

FISH.

Dual-color interphase FISH to breast cancer cell lines was done as described (17). Bacterial artificial chromosome clone RP11-361K8 was labeled with SpectrumOrange (Vysis, Downers Grove, IL), and Spectrum- Orange-labeled probe for EGFR was obtained from Vysis. SpectrumGreen-labeled chromosome 7 and 17 centromere probes (Vysis) were used as a reference. A tissue microarray containing 612 formalin-fixed, paraffin-embedded primary breast cancers (17) was applied in FISH analyses as described (18). The use of these specimens was approved by the Ethics Committee of the University of Basel and by the NIH. Specimens containing a 2-fold or higher increase in the number of test probe signals, as compared with corresponding centromere signals, in at least 10% of the tumor cells were considered to be amplified. Survival analysis was performed using the Kaplan-Meier method and the log-rank test.

RT-PCR.

The HOXB7 expression level was determined relative to GAPDH. Reverse transcription and PCR amplification were performed using Access RT-PCR System (Promega Corp., Madison, WI) with 10 ng of mRNA as a template. HOXB7 primers were 5′-GAGCAGAGGGACTCGGACTT-3′ and 5′-GCGTCAGGTAGCGATTGTAG-3′.

Global Effect of Copy Number on Gene Expression.

13,824 arrayed cDNA clones were applied for analysis of gene expression and gene copy number (CGH microarrays) in 14 breast cancer cell lines. The results illustrate a considerable influence of copy number on gene expression patterns. Up to 44% of the highly amplified transcripts (CGH ratio, >2.5) were overexpressed (i.e., belonged to the global upper 7% of expression ratios), compared with only 6% for genes with normal copy number levels (Fig. 1A). Conversely, 10.5% of the transcripts with high-level expression (cDNA ratio, >10) showed increased copy number (Fig. 1,B). Low-level copy number increases and decreases were also associated with similar, although less dramatic, outcomes on gene expression (Fig. 1).

Identification of Distinct Breast Cancer Amplicons.

Base-pair locations obtained for 11,994 cDNAs (86.8%) were used to plot copy number changes as a function of genomic position (Fig. 2, Supplement Fig. A). The average spacing of clones throughout the genome was 267 kb. This high-resolution mapping identified 24 independent breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 1). Several amplification sites detected previously by chromosomal CGH were validated, with 1q21, 17q12–q21.2, 17q22–q23, 20q13.1, and 20q13.2 regions being most commonly amplified. Furthermore, the boundaries of these amplicons were precisely delineated. In addition, novel amplicons were identified at 9p13 (38.65–39.25 Mb), and 17q21.3 (52.47–55.80 Mb).

Direct Identification of Putative Amplification Target Genes.

The cDNA/CGH microarray technique enables the direct correlation of copy number and expression data on a gene-by-gene basis throughout the genome. We directly annotated high-resolution CGH plots with gene expression data using color coding. Fig. 2,C shows that most of the amplified genes in the MCF-7 breast cancer cell line at 1p13, 17q22–q23, and 20q13 were highly overexpressed. A view of chromosome 7 in the MDA-468 cell line implicates EGFR as the most highly overexpressed and amplified gene at 7p11–p12 (Fig. 3,A). In BT-474, the two known amplicons at 17q12 and 17q22–q23 contained numerous highly overexpressed genes (Fig. 3,B). In addition, several genes, including the homeobox genes HOXB2 and HOXB7, were highly amplified in a previously undescribed independent amplicon at 17q21.3. HOXB7 was systematically amplified (as validated by FISH, Fig. 3 B, inset) as well as overexpressed (as verified by RT-PCR, data not shown) in BT-474, UACC812, and ZR-75-30 cells. Furthermore, this novel amplification was validated to be present in 10.2% of 363 primary breast cancers by FISH to a tissue microarray and was associated with poor prognosis of the patients (P = 0.001).

Statistical Identification and Characterization of 270 Highly Expressed Genes in Amplicons.

Statistical comparison of expression levels of all genes as a function of gene amplification identified 270 genes whose expression was significantly influenced by copy number across all 14 cell lines (Fig. 4, Supplemental Fig. B). According to the gene ontology data,8 91 of the 270 genes represented hypothetical proteins or genes with no functional annotation, whereas 179 had associated functional information available. Of these, 151 (84%) are implicated in apoptosis, cell proliferation, signal transduction, and transcription, whereas 28 (16%) had functional annotations that could not be directly linked with cancer.

The importance of recurrent gene and chromosome copy number changes in the development and progression of solid tumors has been characterized in >1000 publications applying CGH9(9, 10), as well as in a large number of other molecular cytogenetic, cytogenetic, and molecular genetic studies. The effects of these somatic genetic changes on gene expression levels have remained largely unknown, although a few studies have explored gene expression changes occurring in specific amplicons (15, 19, 20, 21). Here, we applied genome-wide cDNA microarrays to identify transcripts whose expression changes were attributable to underlying gene copy number alterations in breast cancer.

The overall impact of copy number on gene expression patterns was substantial with the most dramatic effects seen in the case of high-level copy number increase. Low-level copy number gains and losses also had a significant influence on expression levels of genes in the regions affected, but these effects were more subtle on a gene-by-gene basis than those of high-level amplifications. However, the impact of low-level gains on the dysregulation of gene expression patterns in cancer may be equally important if not more important than that of high-level amplifications. Aneuploidy and low-level gains and losses of chromosomal arms represent the most common types of genetic alterations in breast and other cancers and, therefore, have an influence on many genes. Our results in breast cancer extend the recent studies on the impact of aneuploidy on global gene expression patterns in yeast cells, acute myeloid leukemia, and a prostate cancer model system (22, 23, 24).

The CGH microarray analysis identified 24 independent breast cancer amplicons. We defined the precise boundaries for many amplicons detected previously by chromosomal CGH (9, 10, 25, 26) and also discovered novel amplicons that had not been detected previously, presumably because of their small size (only 1–2 Mb) or close proximity to other larger amplicons. One of these novel amplicons involved the homeobox gene region at 17q21.3 and led to the overexpression of the HOXB7 and HOXB2 genes. The homeodomain transcription factors are known to be key regulators of embryonic development and have been occasionally reported to undergo aberrant expression in cancer (27, 28). HOXB7 transfection induced cell proliferation in melanoma, breast, and ovarian cancer cells and increased tumorigenicity and angiogenesis in breast cancer (29, 30, 31, 32). The present results imply that gene amplification may be a prominent mechanism for overexpressing HOXB7 in breast cancer and suggest that HOXB7 contributes to tumor progression and confers an aggressive disease phenotype in breast cancer. This view is supported by our finding of amplification of HOXB7 in 10% of 363 primary breast cancers, as well as an association of amplification with poor prognosis of the patients.

We carried out a systematic search to identify genes whose expression levels across all 14 cell lines were attributable to amplification status. Statistical analysis revealed 270 such genes (representing ∼2% of all genes on the array), including not only previously described amplified genes, such as HER-2, MYC, EGFR, ribosomal protein s6 kinase, and AIB3, but also numerous novel genes such as NRAS-related gene (1p13), syndecan-2 (8q22), and bone morphogenic protein (20q13.1), whose activation by amplification may similarly promote breast cancer progression. Most of the 270 genes have not been implicated previously in breast cancer development and suggest novel pathogenetic mechanisms. Although we would not expect all of them to be causally involved, it is intriguing that 84% of the genes with associated functional information were implicated in apoptosis, cell proliferation, signal transduction, transcription, or other cellular processes that could directly imply a possible role in cancer progression. Therefore, a detailed characterization of these genes may provide biological insights to breast cancer progression and might lead to the development of novel therapeutic strategies.

In summary, we demonstrate application of cDNA microarrays to the analysis of both copy number and expression levels of over 12,000 transcripts throughout the breast cancer genome, roughly once every 267 kb. This analysis provided: (a) evidence of a prominent global influence of copy number changes on gene expression levels; (b) a high-resolution map of 24 independent amplicons in breast cancer; and (c) identification of a set of 270 genes, the overexpression of which was statistically attributable to gene amplification. Characterization of a novel amplicon at 17q21.3 implicated amplification and overexpression of the HOXB7 gene in breast cancer, including a clinical association between HOXB7 amplification and poor patient prognosis. Overall, our results illustrate how the identification of genes activated by gene amplification provides a powerful approach to highlight genes with an important role in cancer as well as to prioritize and validate putative targets for therapy development.

Fig. 1.

Impact of gene copy number on global gene expression levels. A, percentage of over- and underexpressed genes (Y axis) according to copy number ratios (X axis). Threshold values used for over- and underexpression were >2.184 (global upper 7% of the cDNA ratios) and <0.4826 (global lower 7% of the expression ratios). B, percentage of amplified and deleted genes according to expression ratios. Threshold values for amplification and deletion were >1.5 and <0.7.

Fig. 1.

Impact of gene copy number on global gene expression levels. A, percentage of over- and underexpressed genes (Y axis) according to copy number ratios (X axis). Threshold values used for over- and underexpression were >2.184 (global upper 7% of the cDNA ratios) and <0.4826 (global lower 7% of the expression ratios). B, percentage of amplified and deleted genes according to expression ratios. Threshold values for amplification and deletion were >1.5 and <0.7.

Close modal
Fig. 2.

Genome-wide copy number and expression analysis in the MCF-7 breast cancer cell line. A, chromosomal CGH analysis of MCF-7. The copy number ratio profile (blue line) across the entire genome from 1p telomere to Xq telomere is shown along with ±1 SD (orange lines). The black horizontal line indicates a ratio of 1.0; red line, a ratio of 0.8; and green line, a ratio of 1.2. B–C, genome-wide copy number analysis in MCF-7 by CGH on cDNA microarray. The copy number ratios were plotted as a function of the position of the cDNA clones along the human genome. In B, individual data points are connected with a line, and a moving median of 10 adjacent clones is shown. Red horizontal line, the copy number ratio of 1.0. In C, individual data points are labeled by color coding according to cDNA expression ratios. The bright red dots indicate the upper 2%, and dark red dots, the next 5% of the expression ratios in MCF-7 cells (overexpressed genes); bright green dots indicate the lowest 2%, and dark green dots, the next 5% of the expression ratios (underexpressed genes); the rest of the observations are shown with black crosses. The chromosome numbers are shown at the bottom of the figure, and chromosome boundaries are indicated with a dashed line.

Fig. 2.

Genome-wide copy number and expression analysis in the MCF-7 breast cancer cell line. A, chromosomal CGH analysis of MCF-7. The copy number ratio profile (blue line) across the entire genome from 1p telomere to Xq telomere is shown along with ±1 SD (orange lines). The black horizontal line indicates a ratio of 1.0; red line, a ratio of 0.8; and green line, a ratio of 1.2. B–C, genome-wide copy number analysis in MCF-7 by CGH on cDNA microarray. The copy number ratios were plotted as a function of the position of the cDNA clones along the human genome. In B, individual data points are connected with a line, and a moving median of 10 adjacent clones is shown. Red horizontal line, the copy number ratio of 1.0. In C, individual data points are labeled by color coding according to cDNA expression ratios. The bright red dots indicate the upper 2%, and dark red dots, the next 5% of the expression ratios in MCF-7 cells (overexpressed genes); bright green dots indicate the lowest 2%, and dark green dots, the next 5% of the expression ratios (underexpressed genes); the rest of the observations are shown with black crosses. The chromosome numbers are shown at the bottom of the figure, and chromosome boundaries are indicated with a dashed line.

Close modal
Fig. 3.

Annotation of gene expression data on CGH microarray profiles. A, genes in the 7p11-p12 amplicon in the MDA-468 cell line are highly expressed (red dots) and include the EGFR oncogene. B, several genes in the 17q12, 17q21.3, and 17q23 amplicons in the BT-474 breast cancer cell line are highly overexpressed (red) and include the HOXB7 gene. The data labels and color coding are as indicated for Fig. 2 C. Insets show chromosomal CGH profiles for the corresponding chromosomes and validation of the increased copy number by interphase FISH using EGFR (red) and chromosome 7 centromere probe (green) to MDA-468 (A) and HOXB7-specific probe (red) and chromosome 17 centromere (green) to BT-474 cells (B).

Fig. 3.

Annotation of gene expression data on CGH microarray profiles. A, genes in the 7p11-p12 amplicon in the MDA-468 cell line are highly expressed (red dots) and include the EGFR oncogene. B, several genes in the 17q12, 17q21.3, and 17q23 amplicons in the BT-474 breast cancer cell line are highly overexpressed (red) and include the HOXB7 gene. The data labels and color coding are as indicated for Fig. 2 C. Insets show chromosomal CGH profiles for the corresponding chromosomes and validation of the increased copy number by interphase FISH using EGFR (red) and chromosome 7 centromere probe (green) to MDA-468 (A) and HOXB7-specific probe (red) and chromosome 17 centromere (green) to BT-474 cells (B).

Close modal
Fig. 4.

List of 50 genes with a statistically significant correlation (α value <0.05) between gene copy number and gene expression. Name, chromosomal location, and the α value for each gene are indicated. The genes have been ordered according to their position in the genome. The color maps on the right illustrate the copy number and expression ratio patterns in the 14 cell lines. The key to the color code is shown at the bottom of the graph. Gray squares, missing values. The complete list of 270 genes is shown in supplemental Fig. B.

Fig. 4.

List of 50 genes with a statistically significant correlation (α value <0.05) between gene copy number and gene expression. Name, chromosomal location, and the α value for each gene are indicated. The genes have been ordered according to their position in the genome. The color maps on the right illustrate the copy number and expression ratio patterns in the 14 cell lines. The key to the color code is shown at the bottom of the graph. Gray squares, missing values. The complete list of 270 genes is shown in supplemental Fig. B.

Close modal

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1

Supported in part by the Academy of Finland, Emil Aaltonen Foundation, the Finnish Cancer Society, the Pirkanmaa Cancer Society, the Pirkanmaa Cultural Foundation, the Finnish Breast Cancer Group, the Foundation for the Development of Laboratory Medicine, the Medical Research Fund of the Tampere University Hospital, the Foundation for Commercial and Technical Sciences, and the Swedish Research Council.

2

Supplementary data for this article are available at Cancer Research Online (http://cancerres.aacrjournals.org).

5

The abbreviations used are: CGH, comparative genomic hybridization; FISH, fluorescence in situ hybridization; RT-PCR, reverse transcription-PCR.

6

Internet address: http://research.nhgri.nih.gov/microarray/downloadable_cdna.html.

7

Internet address: www.genome.ucsc.edu.

8

Internet address: http://www.geneontology.org/.

9

Internet address: http://www.ncbi.nlm.nih.gov/entrez.

Table 1

Summary of independent amplicons in 14 breast cancer cell lines by CGH microarray

LocationStart (Mb)End (Mb)Size (Mb)
1p13 132.79 132.94 0.2 
1q21 173.92 177.25 3.3 
1q22 179.28 179.57 0.3 
3p14 71.94 74.66 2.7 
7p12.1–7p11.2 55.62 60.95 5.3 
7q31 125.73 130.96 5.2 
7q32 140.01 140.68 0.7 
8q21.11–8q21.13 86.45 92.46 6.0 
8q21.3 98.45 103.05 4.6 
8q23.3–8q24.14 129.88 142.15 12.3 
8q24.22 151.21 152.16 1.0 
9p13 38.65 39.25 0.6 
13q22–q31 77.15 81.38 4.2 
16q22 86.70 87.62 0.9 
17q11 29.30 30.85 1.6 
17q12–q21.2 39.79 42.80 3.0 
17q21.32–q21.33 52.47 55.80 3.3 
17q22–q23.3 63.81 69.70 5.9 
17q23.3–q24.3 69.93 74.99 5.1 
19q13 40.63 41.40 0.8 
20q11.22 34.59 35.85 1.3 
20q13.12 44.00 45.62 1.6 
20q13.12–q13.13 46.45 49.43 3.0 
20q13.2–q13.32 51.32 59.12 7.8 
LocationStart (Mb)End (Mb)Size (Mb)
1p13 132.79 132.94 0.2 
1q21 173.92 177.25 3.3 
1q22 179.28 179.57 0.3 
3p14 71.94 74.66 2.7 
7p12.1–7p11.2 55.62 60.95 5.3 
7q31 125.73 130.96 5.2 
7q32 140.01 140.68 0.7 
8q21.11–8q21.13 86.45 92.46 6.0 
8q21.3 98.45 103.05 4.6 
8q23.3–8q24.14 129.88 142.15 12.3 
8q24.22 151.21 152.16 1.0 
9p13 38.65 39.25 0.6 
13q22–q31 77.15 81.38 4.2 
16q22 86.70 87.62 0.9 
17q11 29.30 30.85 1.6 
17q12–q21.2 39.79 42.80 3.0 
17q21.32–q21.33 52.47 55.80 3.3 
17q22–q23.3 63.81 69.70 5.9 
17q23.3–q24.3 69.93 74.99 5.1 
19q13 40.63 41.40 0.8 
20q11.22 34.59 35.85 1.3 
20q13.12 44.00 45.62 1.6 
20q13.12–q13.13 46.45 49.43 3.0 
20q13.2–q13.32 51.32 59.12 7.8 
1
Golub T. R., Slonim D. K., Tamayo P., Huard C., Gaasenbeek M., Mesirov J. P., Coller H., Loh M. L., Downing J. R., Caligiuri M. A., Bloomfield C. D., Lander E. S. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.
Science (Wash. DC)
,
286
:
531
-537,  
1999
.
2
Alizadeh A. A., Eisen M. B., Davis R. E., Ma C., Lossos I. S., Rosenwald A., Boldrick J. C., Sabet H., Tran T., Yu X., et al Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling.
Nature (Lond.)
,
403
:
503
-511,  
2000
.
3
Bittner M., Meltzer P., Chen Y., Jiang Y., Seftor E., Hendrix M., Radmacher M., Simon R., Yakhini Z., Ben-Dor A., et al Molecular classification of cutaneous malignant melanoma by gene expression profiling.
Nature (Lond.)
,
406
:
536
-540,  
2000
.
4
Perou C. M., Sorlie T., Eisen M. B., van de Rijn M., Jeffrey S. S., Rees C. A., Pollack J. R., Ross D. T., Johnsen H., Akslen L. A., et al Molecular portraits of human breast tumours.
Nature (Lond.)
,
406
:
747
-752,  
2000
.
5
Dhanasekaran S. M., Barrette T. R., Ghosh D., Shah R., Varambally S., Kurachi K., Pienta K. J., Rubin M. A., Chinnaiyan A. M. Delineation of prognostic biomarkers in prostate cancer.
Nature (Lond.)
,
412
:
822
-826,  
2001
.
6
Sorlie T., Perou C. M., Tibshirani R., Aas T., Geisler S., Johnsen H., Hastie T., Eisen M. B., van de Rijn M., Jeffrey S. S., et al Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications.
Proc. Natl. Acad. Sci. USA
,
98
:
10869
-10874,  
2001
.
7
Ross J. S., Fletcher J. A. The HER-2/neu oncogene: prognostic factor, predictive factor and target for therapy.
Semin. Cancer Biol.
,
9
:
125
-138,  
1999
.
8
Arteaga C. L. The epidermal growth factor receptor: from mutant oncogene in nonhuman cancers to therapeutic target in human neoplasia.
J. Clin. Oncol.
,
19
:
32
-40,  
2001
.
9
Knuutila S., Bjorkqvist A. M., Autio K., Tarkkanen M., Wolf M., Monni O., Szymanska J., Larramendy M. L., Tapper J., Pere H., El-Rifai W., et al DNA copy number amplifications in human neoplasms: review of comparative genomic hybridization studies.
Am. J. Pathol.
,
152
:
1107
-1123,  
1998
.
10
Knuutila S., Autio K., Aalto Y. Online access to CGH data of DNA sequence copy number changes.
Am. J. Pathol.
,
157
:
689
2000
.
11
DeRisi J., Penland L., Brown P. O., Bittner M. L., Meltzer P. S., Ray M., Chen Y., Su Y. A., Trent J. M. Use of a cDNA microarray to analyse gene expression patterns in human cancer.
Nat. Genet.
,
14
:
457
-460,  
1996
.
12
Shalon D., Smith S. J., Brown P. O. A DNA microarray system for analyzing complex DNA samples using two-color fluorescent probe hybridization.
Genome Res.
,
6
:
639
-645,  
1996
.
13
Mousses S., Bittner M. L., Chen Y., Dougherty E. R., Baxevanis A., Meltzer P. S., Trent J. M. Gene expression analysis by cDNA microarrays Livesey F. J. Hunt S. P. eds. .
Functional Genomics
,
113
-137, Oxford University Press Oxford  
2000
.
14
Pollack J. R., Perou C. M., Alizadeh A. A., Eisen M. B., Pergamenschikov A., Williams C. F., Jeffrey S. S., Botstein D., Brown P. O. Genome-wide analysis of DNA copy-number changes using cDNA microarrays.
Nat. Genet.
,
23
:
41
-46,  
1999
.
15
Monni O., Bärlund M., Mousses S., Kononen J., Sauter G., Heiskanen M., Paavola P., Avela K., Chen Y., Bittner M. L., Kallioniemi A. Comprehensive copy number and gene expression profiling of the 17q23 amplicon in human breast cancer.
Proc. Natl. Acad. Sci. USA
,
98
:
5711
-5716,  
2001
.
16
Chen Y., Dougherty E. R., Bittner M. L. Ratio-based decisions and the quantitative analysis of cDNA microarray images.
J. Biomed. Optics
,
2
:
364
-374,  
1997
.
17
Bärlund M., Forozan F., Kononen J., Bubendorf L., Chen Y., Bittner M. L., Torhorst J., Haas P., Bucher C., Sauter G., et al Detecting activation of ribosomal protein S6 kinase by complementary DNA and tissue microarray analysis.
J. Natl. Cancer Inst.
,
92
:
1252
-1259,  
2000
.
18
Andersen C. L., Hostetter G., Grigoryan A., Sauter G., Kallioniemi A. Improved procedure for fluorescence in situ hybridization on tissue microarrays.
Cytometry
,
45
:
83
-86,  
2001
.
19
Kauraniemi P., Bärlund M., Monni O., Kallioniemi A. New amplified and highly expressed genes discovered in the ERBB2 amplicon in breast cancer by cDNA microarrays.
Cancer Res.
,
61
:
8235
-8240,  
2001
.
20
Clark J., Edwards S., John M., Flohr P., Gordon T., Maillard K., Giddings I., Brown C., Bagherzadeh A., Campbell C., Shipley J., Wooster R., Cooper C. S. Identification of amplified and expressed genes in breast cancer by comparative hybridization onto microarrays of randomly selected cDNA clones.
Genes Chromosomes Cancer
,
34
:
104
-114,  
2002
.
21
Varis A., Wolf M., Monni O., Vakkari M. L., Kokkola A., Moskaluk C., Frierson H., Powell S. M., Knuutila S., Kallioniemi A., El-Rifai W. Targets of gene amplification and overexpression at 17q in gastric cancer.
Cancer Res.
,
62
:
2625
-2629,  
2002
.
22
Hughes T. R., Roberts C. J., Dai H., Jones A. R., Meyer M. R., Slade D., Burchard J., Dow S., Ward T. R., Kidd M. J., Friend S. H., Marton M. J. Widespread aneuploidy revealed by DNA microarray expression profiling.
Nat. Genet.
,
25
:
333
-337,  
2000
.
23
Virtaneva K., Wright F. A., Tanner S. M., Yuan B., Lemon W. J., Caligiuri M. A., Bloomfield C. D., de La Chapelle A., Krahe R. Expression profiling reveals fundamental biological differences in acute myeloid leukemia with isolated trisomy 8 and normal cytogenetics.
Proc. Natl. Acad. Sci. USA
,
98
:
1124
-1129,  
2001
.
24
Phillips J. L., Hayward S. W., Wang Y., Vasselli J., Pavlovich C., Padilla-Nash H., Pezullo J. R., Ghadimi B. M., Grossfeld G. D., Rivera A., Linehan W. M., Cunha G. R., Ried T. The consequences of chromosomal aneuploidy on gene expression profiles in a cell line model for prostate carcinogenesis.
Cancer Res.
,
61
:
8143
-8149,  
2001
.
25
Bärlund M., Tirkkonen M., Forozan F., Tanner M. M., Kallioniemi O. P., Kallioniemi A. Increased copy number at 17q22–q24 by CGH in breast cancer is due to high-level amplification of two separate regions.
Genes Chromosomes Cancer
,
20
:
372
-376,  
1997
.
26
Tanner M. M., Tirkkonen M., Kallioniemi A., Isola J., Kuukasjärvi T., Collins C., Kowbel D., Guan X. Y., Trent J., Gray J. W., Meltzer P., Kallioniemi O. P. Independent amplification and frequent co-amplification of three nonsyntenic regions on the long arm of chromosome 20 in human breast cancer.
Cancer Res.
,
56
:
3441
-3445,  
1996
.
27
Cillo C., Faiella A., Cantile M., Boncinelli E. Homeobox genes and cancer.
Exp. Cell Res.
,
248
:
1
-9,  
1999
.
28
Cillo C., Cantile M., Faiella A., Boncinelli E. Homeobox genes in normal and malignant cells.
J. Cell. Physiol.
,
188
:
161
-169,  
2001
.
29
Care A., Silvani A., Meccia E., Mattia G., Stoppacciaro A., Parmiani G., Peschle C., Colombo M. P. HOXB7 constitutively activates basic fibroblast growth factor in melanomas.
Mol. Cell. Biol.
,
16
:
4842
-4851,  
1996
.
30
Care A., Silvani A., Meccia E., Mattia G., Peschle C., Colombo M. P. Transduction of the SkBr3 breast carcinoma cell line with the HOXB7 gene induces bFGF expression, increases cell proliferation and reduces growth factor dependence.
Oncogene
,
16
:
3285
-3289,  
1998
.
31
Care A., Felicetti F., Meccia E., Bottero L., Parenza M., Stoppacciaro A., Peschle C., Colombo M. P. HOXB7: a key factor for tumor-associated angiogenic switch.
Cancer Res.
,
61
:
6532
-6539,  
2001
.
32
Naora H., Yang Y. Q., Montz F. J., Seidman J. D., Kurman R. J., Roden R. B. A serologically identified tumor antigen encoded by a homeobox gene promotes growth of ovarian epithelial cells.
Proc. Natl. Acad. Sci. USA
,
98
:
4060
-4065,  
2001
.

Supplementary data