The array-based comparative genomic hybridization using microarrayed bacterial artificial chromosome clones allows high-resolution analysis of genome-wide copy number changes in tumors. To analyze the genetic alterations of primary lung adenocarcinoma in a high-throughput way, we used laser-capture microdissection of cancer cells and array comparative genomic hybridization focusing on 800 chromosomal loci containing cancer-related genes. We identified a large number of chromosomal numerical alterations, including frequent amplifications on 7p12, 11q13, 12q14-15, and 17q21, and two homozygous deletions on 9p21 and one on 8p23. Unsupervised hierarchical clustering analysis of multiple alterations revealed three subgroups of lung adenocarcinoma that were characterized by the accumulation of distinct genetic alterations and associated with smoking history and gender. The mutation status of the epidermal growth factor receptor (EGFR) gene was significantly associated with specific genetic alterations and supervised clustering analysis based on EGFR gene mutations elucidated a subgroup including all EGFR gene mutated tumors, which showed significantly shorter disease-free survival. Our results suggest that there exist multiple molecular carcinogenesis pathways in lung adenocarcinoma that may associate with smoking habits and gender, and that genetic cancer profiling will reveal previously uncharacterized genetic heterogeneity of cancer and be beneficial in estimating patient prognosis and discovering novel cancer-related genes including therapeutic targets.

Lung cancer is one of the most lethal and increasing cancers in Western countries as well as in Japan (1). Lung cancer is histopathologically divided in two subgroups, small cell lung carcinoma and non–small cell lung carcinoma, and lung adenocarcinoma comprises >40% of the latter (1).

Previous genetic analyses using allelotyping, comparative genomic hybridization (CGH), or the candidate gene approach revealed many genomic (genetic and epigenetic) alterations of tumor suppressor genes (such as p53, p16INK4a, FHIT, LKB1, and PTEN) and oncogenes [such as K-ras, B-RAF, MYC, epidermal growth factor receptor (EGFR), and ERBB2] as well as many chromosomal imbalances (such as on 3p, 8p, 9p, 17p, 18q, and 19p) in lung adenocarcinomas (210). However, overall understanding of genomic alterations in lung adenocarcinomas is far from complete and analysis of the relationship between the overall profile and combinations of genetic alterations with clinicopathologic parameters is still lacking. Recently, genome-wide gene expression analyses have uncovered a novel dimension of cancer profiling and helped define the nature of the heterogeneous subgroups of lung adenocarcinoma, each of which shows distinct tumor histology and patient prognosis (1114). However, it is unclear whether there exist multiple genomic pathways in lung adenocarcinoma because of the lack of a genome-wide view of genetic alterations. It is clinically important to examine the correlations of certain molecular-genetic pathways with cancer cell traits relating to patient prognosis or chemotherapy sensitivity because it is possible that genetic alteration profiling may predict tumor recurrence/metastasis or sensitivity to molecular-target therapies as well as mRNA or protein expression profiles do (1517).

The recently developed array-based CGH method using microarrayed bacterial artificial chromosome clones allows high-resolution analysis of genome-wide copy number changes in various tumors (18, 19). To define and analyze the genetic alterations of lung adenocarcinoma in a more detailed way, we used the array CGH technique and laser-capture microdissection of cancer cells, a combination that we have successfully used in other tumor types (20).9

9

Unpublished data. Shibata T, Hosoda F, Ohki M, Hirohashi S.

Using these two powerful methodologies enabled us to detect a considerable number of novel chromosomal numerical alterations in primary lung adenocarcinoma. Unsupervised and supervised hierarchical clustering analyses of genome-wide genetic alterations revealed the presence of heterogeneous groups of lung adenocarcinoma, which are characterized by specific combinations of genetic alterations, varying EGFR gene mutation status, and tumor recurrence rates.

Patient materials. Surgical specimens of 55 lung adenocarcinoma patients who had been diagnosed and undergone operation between June 2001 and May 2002 at the National Cancer Center Hospital were examined. Fragments of tumor and corresponding normal lung tissue were taken immediately after surgery, fixed with 100% methanol, and embedded in paraffin. This study was approved by the institutional review boards of the National Cancer Center. The clinicopathologic data of the patients are shown in Table 1.

Table 1.

The clinicopathologic characteristics and oncogenic mutation profiles of patients and tumors

No. casesFrequency (%)
Total no. patient 55  
Mean age (range) 62.3 (35-79)  
Gender   
    Male 28 50.9 
    Female 27 49.1 
Smoking history   
    Never 27 49.1 
    Former 10 18.2 
    Current 18 32.7 
Tumor differentiation   
    Well 20 36.4 
    Moderate 25 45.5 
    Poor 10 18.1 
Stage*   
    I (IA and IB) 24 43.6 
    II (IIA and IIB) 10.9 
    III (IIIA and IIIB) 21 38.2 
EGFR mutation 26 47 
    Exon 18 G719S 1.8 
    Exon 19 Del746-750 14.5 
    Exon 19 Del747-752 3.6 
    Exon 19 Del747-752insS 1.8 
    Exon 21 L858R 14 25.5 
K-ras mutation 11 
    Codon 12 7.3 
    Codon 13 1.8 
    Codon 61 1.8 
No. casesFrequency (%)
Total no. patient 55  
Mean age (range) 62.3 (35-79)  
Gender   
    Male 28 50.9 
    Female 27 49.1 
Smoking history   
    Never 27 49.1 
    Former 10 18.2 
    Current 18 32.7 
Tumor differentiation   
    Well 20 36.4 
    Moderate 25 45.5 
    Poor 10 18.1 
Stage*   
    I (IA and IB) 24 43.6 
    II (IIA and IIB) 10.9 
    III (IIIA and IIIB) 21 38.2 
EGFR mutation 26 47 
    Exon 18 G719S 1.8 
    Exon 19 Del746-750 14.5 
    Exon 19 Del747-752 3.6 
    Exon 19 Del747-752insS 1.8 
    Exon 21 L858R 14 25.5 
K-ras mutation 11 
    Codon 12 7.3 
    Codon 13 1.8 
    Codon 61 1.8 
*

Clinical stage of four cases was not evaluated.

Laser-capture microdissection and whole-genome amplification. Laser-capture microdissection was done using LM200 (Arcuturus, Mount View, CA) as described (21). Only cancer cells were microdissected and lymphocytes, fibroblasts, and endothelial cells were carefully excluded. Corresponding normal lung epithelial cells were similarly microdissected and used as reference. To amplify the genomic DNA fragments, we used an adaptor-ligated whole-genome PCR as previously reported (22).

Array-based comparative genomic hybridization. This study used a custom-made CGH array called “MCG CancerArray-800 ver.2,” which consists of 800 duplicated bacterial artificial chromosome clones corresponding to various chromosomal loci that have been reported or considered to be altered in various human cancers (20, 23). Details of hybridization procedures have been previously reported (20). Sixteen-bit fluorescence intensity TIF images were obtained using a scanner (FLA8000, Fuji Film, Tokyo, Japan) and analyzed using GenePix Pro 5.0 (Axon Instruments, Inc., Foster City, CA). Thresholds for chromosomal gain (ratio >1.25) and loss (ratio <0.75) were determined by “normal versus normal experiments” (23, 24). We also validated our array CGH data by other methods. Loss of the 17p13 locus was confirmed by loss of heterozygosity of the p53 gene, which is located within that bacterial artificial chromosome using a microsatellite marker (TP53CA). Gene amplification of a representative gene, cyclin D1, was validated by fluorescence in situ hybridization analysis (24). We applied multiplex ligation-dependent probe amplification (MLPA) to validate ourarray data. Copy number alterations of multiple loci were analyzed using MLPA-SALSA kit (MRC-Holland, Amsterdam, the Netherlands) as per the recommendation of the manufacturer (25). Size and quantity of PCR products were calculated by Gene Mapper software (Version 3.5, Applied Biosystems, Tokyo, Japan) and copy number was determined by the ratio to the average of five normal control experiments.

Mutational analysis. We amplified exons 18, 19, 21, and 23 of the EGFR gene; exons 2 and 3 (covering codons 12, 13, and 61) of the K-ras gene; exons 20 and 21 of the ERBB2 gene; and exons 10, 14, 16, 17, 18, 19, and 20 of the MET gene from microdissected tumor and corresponding normal DNA samples with PCR using High Fidelity Taq polymerase (Roche, Mannheim, Germany) and appropriate primers (primer sequences are available on request). All PCR products were purified and analyzed by sequencing. PCR products showing deletions were subcloned in TA-vector (Invitrogen, Carlsbad, CA) and sequenced.

Immunohistochemical analysis. Four-micrometer sections of formalin-fixed, paraffin-embedded specimens of lung adenocarcinoma were stained with an anti-MET mouse monoclonal antibody (×100 dilution, Zymed, San Francicso, CA) as the suppliers recommended.

Statistical analysis. Two-dimensional hierarchical clustering analysis of the samples and signal ratios was done using the Impressionist (Gene Data, Basel, Switzerland) and GeneMaths (Applied Maths, Sint-Martens-Latem, Belgium) software programs as described (26, 27). Data were standardized by dividing by the root means and dendrograms were produced using the Pearson Correlation algorithm. For supervised clustering, we first selected loci that were significantly different between EGFR wild-type and mutated tumors based on the average ratio by Student's t test. We then used a machine-learning method, in which the leave-one-out cross-validation was done with all combinations of loci and multiple independent classifier algorithms, and selected 46 loci that could discriminate EGFR mutation status most accurately to classify the tumors. The Kaplan-Meier method was used to estimate the probability of disease-free survival. Cox proportional hazards regression model and multivariate analysis were done to detect the association between the presence of chromosomal alterations and disease-free survival. Log-rank analysis was used to assess the significance of the difference between subgroups.

Array-based comparative genomic hybridization analysis of primary lung adenocarcinoma. We analyzed 55 cases of lung adenocarcinoma by array-based CGH and the chromosomal alteration profiles of 800 loci are shown in Fig. 1. We identified 32 loci that were lost in >40% of cases (Table 2). Among them, the 9p21 locus containing the p16INK4a gene and the 17p13.1 locus containing the p53 gene were lost in 54% and 40% of analyzed cases, respectively. We found homozygous deletions of three loci, including two on 9p21 and one on 8p23.3. We also identified 19 loci that were gained in >50% of cases (Table 3) and recurrent (>4 cases) amplifications (>4 copies) on 12q14-15 (9 of 55, 16.3%) followed by 7p12.3 (5 of 55), 11q13 (5 of 55), 17q12 (5 of 55), 1p36.1 (4 of 55), 1q21 (4 of 55), 5p15 (4 of 55), 7q31 (4 of 55), 8q24 (4 of 55), 14q12 (4 of 55), and 17q21.2 (4 of 55). These included genes previously reported to be amplified in lung cancer, such as the cyclin D1 (11q13), EGFR (7p12.3), and ERBB2 (17q21.2) genes (28, 29). We further validated copy number alterations on 8q24.3, 17q21.2, 3p21, and 17p13.1 by MLPA method. Chromosomal copy number changes (both gains and losses) detected by array CGH corresponded to those by MLPA (Fig. 1C).

Fig. 1.

Copy number alteration profiles of primary lung adenocarcinoma. Frequencies (%) of chromosomal losses (A) and gains (B) detected by array CGH analyses are plotted from chromosome 1p (left) to Y (right). Each chromosome is represented by underlining boxes (short and long arms are indicated by closed and open boxes, respectively). Chromosomal arms that contain frequent losses or gains are also indicated. C, validation of array CGH analysis. Distributions of copy number on 8q24.3, 17q21.2, 3p21, and 17p13.1 loci detected by MLPA method in five normal samples (Normal), tumors with alterations detected by array CGH (Amp/Gain or Loss), and tumors without alteration by array CGH method (No change) were shown. D and E, genetic classification of lung adenocarcinoma. D, number of loci as a function of the percentage of altered loci in lung adenocarcinoma. Two modes, which can be separated by the vertical dotted line, can be observed. Loci within the first mode (altered in <20% of cases) were considered random, whereas those in the second mode (altered in >20% of cases) were considered pathogenic. E, unsupervised genetic profiling of lung adenocarcinoma. Fifty-five lung adenocarcinomas were clustered hierarchically on the basis of copy number changes in 397 loci. Multiple lost (green) or gained (red) loci are observed in individual tumors. Overall pattern of standardized gene copy numbers and the cluster tree of individual tumors are shown. Tumors were clustered in three subgroups (clusters A, B, and C). Loci that are specific for (black letters) or shared with (blue letters) each subgroup are indicated on the right side.

Fig. 1.

Copy number alteration profiles of primary lung adenocarcinoma. Frequencies (%) of chromosomal losses (A) and gains (B) detected by array CGH analyses are plotted from chromosome 1p (left) to Y (right). Each chromosome is represented by underlining boxes (short and long arms are indicated by closed and open boxes, respectively). Chromosomal arms that contain frequent losses or gains are also indicated. C, validation of array CGH analysis. Distributions of copy number on 8q24.3, 17q21.2, 3p21, and 17p13.1 loci detected by MLPA method in five normal samples (Normal), tumors with alterations detected by array CGH (Amp/Gain or Loss), and tumors without alteration by array CGH method (No change) were shown. D and E, genetic classification of lung adenocarcinoma. D, number of loci as a function of the percentage of altered loci in lung adenocarcinoma. Two modes, which can be separated by the vertical dotted line, can be observed. Loci within the first mode (altered in <20% of cases) were considered random, whereas those in the second mode (altered in >20% of cases) were considered pathogenic. E, unsupervised genetic profiling of lung adenocarcinoma. Fifty-five lung adenocarcinomas were clustered hierarchically on the basis of copy number changes in 397 loci. Multiple lost (green) or gained (red) loci are observed in individual tumors. Overall pattern of standardized gene copy numbers and the cluster tree of individual tumors are shown. Tumors were clustered in three subgroups (clusters A, B, and C). Loci that are specific for (black letters) or shared with (blue letters) each subgroup are indicated on the right side.

Close modal
Table 2.

Loci frequently lost in primary lung adenocarcinoma

Chromosomal locationCovered candidate geneChromosomal loss in lung adenocarcinoma (%)
9p22 MLLT3 56.4 
9p21 p16INK4a54.5 
9p21 TEK 54.5 
18q23d CTDP1 54.5 
9p23 GASC1 52.7 
9p21.3 MTAP 52.7 
15q25 NTRK3 50.9 
13q14.1 FKHR 50.9 
18q21 SMAD4* 50.9 
8p22 NAT2 50.9 
18q21.3 PI5 50.9 
18q21 GRP 50.9 
18q22 BCL2 50.9 
8p22 LZTS1* 49.1 
15q12 SNRPN 47.3 
8p23.3 D8S504 47.3 
18q21.3 SCCA1 47.3 
8p22 N33 45.5 
13q11-12 FGF9 45.5 
8p22-11 NRG1 43.6 
13q22.1 KLF12 43.6 
Xq28 MAGEA2 41.8 
3p24.3 THRB* 41.8 
3p14.2 FHIT* 41.8 
8p22-21.3 DLC1 41.8 
8p23.1 AAC1 41.8 
17p11.2 RH68621 40 
13q14.1 LCP1 40 
8p22-8p21 TNFRSF10B 40 
13q33 EFNB2 40 
17p13.1 RCV1 40 
18q21.3 FVT1 40 
Chromosomal locationCovered candidate geneChromosomal loss in lung adenocarcinoma (%)
9p22 MLLT3 56.4 
9p21 p16INK4a54.5 
9p21 TEK 54.5 
18q23d CTDP1 54.5 
9p23 GASC1 52.7 
9p21.3 MTAP 52.7 
15q25 NTRK3 50.9 
13q14.1 FKHR 50.9 
18q21 SMAD4* 50.9 
8p22 NAT2 50.9 
18q21.3 PI5 50.9 
18q21 GRP 50.9 
18q22 BCL2 50.9 
8p22 LZTS1* 49.1 
15q12 SNRPN 47.3 
8p23.3 D8S504 47.3 
18q21.3 SCCA1 47.3 
8p22 N33 45.5 
13q11-12 FGF9 45.5 
8p22-11 NRG1 43.6 
13q22.1 KLF12 43.6 
Xq28 MAGEA2 41.8 
3p24.3 THRB* 41.8 
3p14.2 FHIT* 41.8 
8p22-21.3 DLC1 41.8 
8p23.1 AAC1 41.8 
17p11.2 RH68621 40 
13q14.1 LCP1 40 
8p22-8p21 TNFRSF10B 40 
13q33 EFNB2 40 
17p13.1 RCV1 40 
18q21.3 FVT1 40 
*

Loss of heterozygosity or mutations, and

aberrant expression was previously reported in lung cancer (35).

Table 3.

Loci frequently gained in primary lung adenocarcinoma

Chromosomal locationCovered candidate geneChromosomal gain in lung adenocarcinoma (%)
17q25 MAFG 67.3 
1q21 MUC1* 63.6 
1q21 MCL1* 61.8 
7p21 IL6 58.2 
1q21 ARHGEF2 58.2 
16p13.3 ABCA3 56.4 
17q11 ITGB4 56.4 
20q13 Livin-2 56.4 
5p15 TERT 54.5 
8q24 GLI4 54.5 
16p13.3 IGFALS 54.5 
17q24-25 GRB2 54.5 
1q21 AF1Q 54.5 
1q23.1 PMF1 54.5 
12q24 stSG8935 52.7 
17q12 PPARBP 52.7 
17q25 Survivin* 50.9 
8q24 RECQL4 50.9 
11q12-13 RELA 50.9 
Chromosomal locationCovered candidate geneChromosomal gain in lung adenocarcinoma (%)
17q25 MAFG 67.3 
1q21 MUC1* 63.6 
1q21 MCL1* 61.8 
7p21 IL6 58.2 
1q21 ARHGEF2 58.2 
16p13.3 ABCA3 56.4 
17q11 ITGB4 56.4 
20q13 Livin-2 56.4 
5p15 TERT 54.5 
8q24 GLI4 54.5 
16p13.3 IGFALS 54.5 
17q24-25 GRB2 54.5 
1q21 AF1Q 54.5 
1q23.1 PMF1 54.5 
12q24 stSG8935 52.7 
17q12 PPARBP 52.7 
17q25 Survivin* 50.9 
8q24 RECQL4 50.9 
11q12-13 RELA 50.9 
*

Aberrant expression was previously reported in lung cancer (35).

Unsupervised hierarchical clustering of array comparative genomic hybridization data. To examine whether there exist multiple carcinogenesis pathways in lung adenocarcinoma, we attempted two-dimensional hierarchical profiling of the chromosomal alterations detected. We first plotted the number of loci showing various incidences of alterations and found that there exist two peaks (loci altered in 10-15% and 20-25% of cases; Fig. 1D). We assumed that alterations appearing in <20% of cases reflect mostly random alterations as observed in genome-wide allelotyping analyses (30), whereas alterations affecting >20% of cases probably represent nonrandom (cancer-specific) alterations. Therefore, to exclude random changes that may be caused by the intrinsic genetic instability of cancer, we selected the loci that were affected in >25% of analyzed cases (397 loci in total) and did the unsupervised hierarchical clustering analysis. When analyzed by using loci affected in >5% and 15% of cases, we obtained almost the same classification as described below (data not shown).

Our hierarchical clustering yielded three distinct subclasses of primary lung adenocarcinoma (clusters A, B, and C shown in Fig. 1E). Cluster B exhibited significantly fewer genetic alterations (losses and gains) in all examined clones than the other two clusters; the average number of alterations (losses, gains, total) in cluster A was 102, 141, and 244, respectively; in B 43, 81, and 125; and in C 85, 131, and 216 (A versus B: P < 0.0001; B versus C: P < 0.0001). Frequencies of the various lost or gained loci were significantly different among these three cluster groups (P < 0.01). Cluster A was characterized by gains on 1p32-26, 4p16.3, 11p15, 12q13-14, 16p11.2-13.3, 17q11.1-25, 19q13.2, 20p11, 20q11.2, and 22q12.2 and losses on 1p22, 6q26, 10q24-26, 13q22.1-34, 15q21-25, and 18p11.2. Cluster C significantly showed gains on 5p12-14.3, 7p12.3-21.1, 7q22, 7q31, 8q12-21, and 14q11-24, and losses on 1q23.3-41, 10q22.1, and Xq. Some loci were similarly altered in both clusters A and C, including losses on 3p21-24, 6q26, 8q24.3, 9q21, 10p15, 10q11, 10q26, 15q21.1, 15q26.1, and 19p13.3 (containing LKB1), and gains on 5p15, 6p21, 7p21-22, 7q21, 8q21, 8q22-24 (containing MYC), 9q21-22, 11q13 (containing cyclin D1), and 20q13.1. Losses on 3p14 (containing FHIT), 8p22-23.3, 9p21 (containing p16INK4a), 13q11-34, 17p13.1 (containing p53), 18q21, and gains on 1q21-23, 1q42, 7p15, 17q12, 17q21.2, and 17q25 were observed in all subgroups with similar frequency. Two alterations (a gain on 19q13.1 and a loss on 22q12.2) were more frequently observed in cluster B than in clusters A and C. The above classification into cluster groups showed significant correlation with the patients' smoking history (P < 0.01) and gender (P < 0.001); cluster A frequently contained female patients without any smoking history (female: 17 of 20 cases and never smoker: 14 of 20 cases), whereas cluster B included male patients with current or former smoking history (male: 11 of 15 cases and smoker: 11 of 15 cases). Cluster C included more male patients (male: 14 of 20 cases), but showed no significant association with smoking history (smoker: 11 of 20 cases). No significant differences were observed between the groups with regard to other clinical features (histologic differentiation, clinical stage, and disease-free survival). Multivariate analysis revealed that two chromosomal alterations showed significant association with disease-free survival: a loss on 13q14.1 (P = 0.01, hazard ratio, 3.21; 95% confidence interval, 1.30-7.91) and a gain on 8q24.2 (P = 0.02, hazard ratio, 2.92; 95% confidence interval, 1.16-7.37).

It has been reported that somatic mutations of the K-ras, EGFR, and ERBB2 genes are frequent in lung adenocarcinoma (2, 810, 3133). We attempted to determine the correlation of these oncogenic mutations with the above classification. We sequenced exons covering the kinase domain of the EGFR gene and found somatic mutations in 26 cases (47%; Table 1). EGFR mutations were more frequently observed in never-smoker patients (P < 0.001) and in the A and C cluster groups (P = 0.01 and 0.02). We detected K-ras activating mutations in six cases (11%; Table 1) and EGFR and K-ras mutations were mutually exclusive in our cases as reported by others (31, 32). No mutation in the reported exons of the ERBB2 gene was detected.

Supervised clustering analysis revealed correlation of EGFR gene mutation with specific genetic alterations. Tumors with EGFR mutations showed significantly more genetic alterations (losses and gains) than those without EGFR mutations; the average numbers of alterations (losses, gains, total) were 68, 110, and 178 in EGFR wild-type tumors and 93, 134, and 227 in EGFR mutated tumors, respectively (wild-type versus mutated P = 0.01, 0.03, and 0.01). We found 58 loci that showed significant differences in the frequency of copy number alterations between EGFR wild-type and mutated tumors. To further examine the genetic profile of EGFR mutated tumors, we classified lung adenocarcinomas based on their EGFR mutation status with the use of supervised hierarchical clustering. We preformed a machine-learning method with leave-one-out cross-validation and selected 46 loci that could discriminate EGFR mutation status. EGFR wild-type and mutated tumors were clustered in distinct groups using the ratios of 46 selected loci (Fig. 2A). Tumors carrying K-ras mutations were segregated from the EGFR mutant branch (Fig. 2A). Interestingly, some cases without EGFR mutation were clustered with the mutant branch, and tumors carrying EGFR mutations were separated in two subbranches (EGFR-MUT-A and EGFR-MUT-B; Fig. 2A). One branch (EGFR-MUT-A) was characterized by amplification of 12q14 or 1p36.1, whereas the other (EGFR-MUT-B) contained frequent amplification of 7p12.3 (containing the EGFR gene), 1q44-23, 5p12, 14q31, and 16p13.3. Poorly differentiated tumors were significantly (P = 0.01) segregated in the EGFR-MUT-B subgroup (EGFR-MUT-A: 1 of 15 cases and EGFR-MUT-B: 5 of 21 cases).

Fig. 2.

Supervised genetic profiling of EGFR mutation–related loci. A, Hierarchical clustering determined copy number change patterns against 46 loci that were identified using the training testing, cross-validation analysis. Lost (green) or gained (red) loci were indicated in individual tumors. Tumors are classified into two branches (EGFR-WT and EGFR-MUT). All EGFR mutated tumors (red spot) are clustered in the EGFR-MUT branch. Most tumors without EGFR mutations (blue spot) are clustered with the EGFR-WT branch, although some are clustered with the EGFR-MUT branch. Six tumors carrying K-ras mutations (yellow spot) are clustered with the EGFR-WT branch and four MET-amplified tumors (green spot) are clustered with the EGFR-MUT branch. The EGFR-MUT branch is subdivided into two subgroups (EGFR-MUT-A and EGFR-MUT-B) with distinctive genetic changes. B, genetic profiles and patient disease-free survival. Relationship between patients' disease-free survival and genetic classification based on the EGFR mutation–related loci. EGFR-WT and EGFR-MUT groups were significantly different (P = 0.01).

Fig. 2.

Supervised genetic profiling of EGFR mutation–related loci. A, Hierarchical clustering determined copy number change patterns against 46 loci that were identified using the training testing, cross-validation analysis. Lost (green) or gained (red) loci were indicated in individual tumors. Tumors are classified into two branches (EGFR-WT and EGFR-MUT). All EGFR mutated tumors (red spot) are clustered in the EGFR-MUT branch. Most tumors without EGFR mutations (blue spot) are clustered with the EGFR-WT branch, although some are clustered with the EGFR-MUT branch. Six tumors carrying K-ras mutations (yellow spot) are clustered with the EGFR-WT branch and four MET-amplified tumors (green spot) are clustered with the EGFR-MUT branch. The EGFR-MUT branch is subdivided into two subgroups (EGFR-MUT-A and EGFR-MUT-B) with distinctive genetic changes. B, genetic profiles and patient disease-free survival. Relationship between patients' disease-free survival and genetic classification based on the EGFR mutation–related loci. EGFR-WT and EGFR-MUT groups were significantly different (P = 0.01).

Close modal

The EGFR wild-type tumors that were clustered with the EGFR-MUT branch led us to hypothesize that aberrant activation of tyrosine kinases other than EGFR have an effect equivalent to that of EGFR mutations in these tumors. We examined the copy number changes of loci containing oncogenic receptor–type tyrosine kinases (FGFR1, FGFR2, FGFR3, PDGFR, KIT, MET, ERBB2, FLT3, NTRK1, and NTRK3) detected by our arrays. We found that amplification of a locus containing the MET gene (7q31) was observed in EGFR wild-type tumors (three of four amplified cases; a representative case was shown in Fig. 3A) and that these tumors were clustered with the EGFR-MUT branch (Fig. 2A). Overexpression of MET protein was immunohistochemically detected in 24% (13 of 55) of cases including all cases with amplification of the MET gene (Fig. 3B), although there was no significant association between MET overexpression and the clustering. To determine whether somatic mutations of this kinase also occur in lung adenocarcinoma, we sequenced the exons of the MET gene, which have been reported to exhibit activating mutations in various tumors (3436); however, no mutation was found in any of examined 55 cases.

Fig. 3.

Amplification and overexpression of MET in lung adenocarcinoma. A, a chromosomal copy number alteration profile of a lung adenocarcinoma with an amplification of MET locus. Each signal ratio (tumor/normal) of 800 examined loci was plotted from chromosome 1p (left) to Y (right). Each chromosome is represented by underlining boxes as in Fig. 1A. Chromosomal arms (7q, 8p, and 11q) with high-level amplifications were indicated. More than 6-fold amplification of a bacterial artificial chromosome containing the MET gene (7q21) was detected (arrow). This tumor also showed amplifications on 8p and 11q (arrowheads). B, immunohistochemical analysis of MET protein expression in the same case (×200). Four-micrometer sections of formalin-fixed, paraffin-embedded specimens were stained with H&E (left) and an anti-MET monoclonal antibody (right). Overexpression of the MET protein in tumor cells (T) was observed compared with the surrounding noncancerous lung tissue (N). All MET-amplified tumors exhibited prominent membranous and cytoplasmic expression. Bar, 200 μm.

Fig. 3.

Amplification and overexpression of MET in lung adenocarcinoma. A, a chromosomal copy number alteration profile of a lung adenocarcinoma with an amplification of MET locus. Each signal ratio (tumor/normal) of 800 examined loci was plotted from chromosome 1p (left) to Y (right). Each chromosome is represented by underlining boxes as in Fig. 1A. Chromosomal arms (7q, 8p, and 11q) with high-level amplifications were indicated. More than 6-fold amplification of a bacterial artificial chromosome containing the MET gene (7q21) was detected (arrow). This tumor also showed amplifications on 8p and 11q (arrowheads). B, immunohistochemical analysis of MET protein expression in the same case (×200). Four-micrometer sections of formalin-fixed, paraffin-embedded specimens were stained with H&E (left) and an anti-MET monoclonal antibody (right). Overexpression of the MET protein in tumor cells (T) was observed compared with the surrounding noncancerous lung tissue (N). All MET-amplified tumors exhibited prominent membranous and cytoplasmic expression. Bar, 200 μm.

Close modal

We further examined whether this classification is of clinical significance. Kaplan-Meier plots showed a statistically significant difference in disease-free survival between the two groups (EGFR-WT and EGFR-MUT; log-rank analysis, P = 0.01; Fig. 2B) although EGFR mutation status alone did not (log-rank analysis, P = 0.06; data not shown).

This study is the first high-resolution copy number analyses of primary lung adenocarcinoma by array CGH method. To extract common and specific genetic alterations, we collected and analyzed 55 cases of primary lung adenocarcinoma by combining the array-based CGH analysis with laser-capture microdissection of tumor cells. Importantly, our results were validated by the elucidation of frequent alterations of previously reported cancer-related genes in lung adenocarcinoma, including losses on the p16INK4a, p53, and FHIT loci, and amplifications on the cyclin D1 and EGFR loci. Moreover, we elucidated novel frequent alterations in small chromosomal regions such as losses on 13q11-14 and 15q12-25 and gains on 17q25, 1q21, and 16p13.3, which have not been detected by previous studies. Novel recurrent amplification, which may be a landmark for the existence of oncogenes, was also detected on loci, including 1p36.1, 1q21, 5p15, 12q14-15, and 14q12. We also found a homozygous deletion on 8p23.3 accompanied with frequent chromosomal loss and identification of candidate tumor suppressor genes in this locus is in progress.

It has been argued that there are distinct subclasses of lung adenocarcinoma by histopathologic observations and recent gene expression profilings (2, 13). Girard et al. (30) reported the possibility of classification of lung cancer by genome-wide allelotyping although their study only examined lung cancer cell lines and could not discriminate between copy number gain and loss. In our study, we analyzed primary lung adenocarcinoma and used unsupervised hierarchical cluster analysis to identify three groups of lung adenocarcinoma based on their distinct genetic changes. Among them, two subclasses (clusters A and C, 20 of 55 cases and 36.3% of total cases, respectively) shared many genetic alterations but also had changes unique to each other. This implied that they may be derived from a common precursor and diverge via the acquisition of specific genetic alterations during tumor development. In contrast, the third subclass (cluster B, 15 of 55 cases, 27.3%) showed characteristically fewer genetic alterations than the other two. Although one would expect this group to consist of tumors of an earlier stage, it contained tumors that varied clinically (from stage I to stage III) and there was no significant correlation of histopathologic features with the above classification. Interestingly, this clustering classification is significantly associated with smoking habits, suggesting that the specific carcinogen exposure may affect overall genetic profile of lung cancer. We propose three possibilities for the carcinogenesis process in the third group; these tumors may predominantly acquire (a) genetic alterations not covered by our arrays, although they contain most of the known cancer-related genes; (b) genetic alterations that do not involve copy number changes, such as balanced chromosomal translocations or microsatellite instability (37, 38); or (c) epigenetic alterations such as aberrant methylation of gene promoters, which have been reported to associate with smoking history (39). Further analysis of this group, focusing on the above mechanisms, will provide a more complete view of lung carcinogenesis.

Because the EGFR gene is frequently altered in lung adenocarcinoma and its mutation status is correlated to the sensitivity to the specific inhibitor, Gefitinib (810), we assumed that the EGFR pathway plays important roles in lung cancer and examined whether EGFR mutated tumors have any genetic characteristics in nature. We detected the EGFR gene mutations at similar frequency as reported (31, 32, 40) and the presence of somatic mutations was significantly associated with never-smoking history as previous studies reported (810, 31, 32, 40). We detected K-ras mutations relatively less frequent than previously reported (41) but comparatively to other study (42) probably because our analyzed cases contained more female and nonsmokers. We found that EGFR mutation and K-ras mutation were mutually exclusive as reported (31, 32, 40) and this finding is consistent with the notion that activation of both EGFR and K-ras stimulates the same downstream pathway (43).

We identified 58 loci whose alterations significantly correlated with the presence of EGFR mutations. It is interesting to note that amplification of the EGFR gene itself is significantly observed in EGFR mutated tumors, indicating that both somatic mutation and amplification of the EGFR gene simultaneously occur in part of lung adenocarcinoma. Using these selected loci, we classified the tumors by supervised hierarchical clustering. This classification revealed two groups: one containing only EGFR wild-type tumors (EGFR-WT) and the other (EGFR-MUT) containing all EGFR mutated and some EGFR wild-type tumors. Because the EGFR wild-type tumors that were grouped with the EGFR-MUT group shared similar genetic alterations with the EGFR mutated tumors, we hypothesized that they may have unknown genetic alterations complementary to EGFR activation and subsequently examined loci containing oncogenic receptor–type tyrosine kinases in our arrays. We found that a locus (7q21) containing the MET gene was amplified in part of these EGFR wild-type tumors and immunohistochemically validated overexpression of MET protein in these tumors. MET was shown to be implicated in ras-mediated tumorigenicity (44, 45) and activated in many tumors (3436). Although the number of cases with MET amplification is small in this study, it is tempting to speculate that amplification of the MET gene may play a role similar to EGFR mutation in lung adenocarcinoma. Recently, somatic alterations of the MET gene were detected in lung cancer and pharmacologic inhibitors specific to the MET kinase have been reported (4648). Our results also support the idea that the MET oncoprotein is a potent new candidate for therapeutic target in lung adenocarcinoma although there was no somatic mutation in the analyzed exons of our cases. In our cases, there are seven tumors without either EGFR mutations or MET amplification in the EGFR-MUT group. Somatic mutations in the kinase domain of ERBB2 were reported in EGFR wild-type lung adenocarcinomas (7, 33). Therefore, we searched for ERBB2 mutations in all 55 cases and found no somatic mutations, suggesting that other oncogenic kinases might be involved in these tumors.

EGFR mutation status could not predict tumor recurrence, which is consistent with a previous report on the insignificant relationship between EGFR mutation and patient prognosis (40). However, we found that EGFR-MUT group, which is revealed by genetic classification, showed significantly shorter disease-free survival than EGFR-WT group. Our results imply the possibility that specific combinations of genetic alterations (genetic code) selected by genome-wide analysis could evaluate tumor characteristics and estimation of such codes would be applicable for diagnostic purposes. Our classification also revealed that there are two genetically distinctive subgroups in the EGFR mutated lung adenocarcinoma, which were associated with tumor histologic differentiation. Because Gefitinib is one of the most promising molecular target drugs against lung cancer and molecular mechanisms determining its efficacy are still unclear (49), further analysis of a larger cohort is warranted to determine any possible relationship of genetic profiling with sensitivity to chemotherapeutic agents, including tyrosine kinase inhibitors.

Grant support: Grant-in-Aid for the Comprehensive 10-Year Strategy for Cancer Control from the Ministry of Health, Labor and Welfare, Japan.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1
Travis WD, Colby TV, Corrin B, Shimosato Y, Brambilla E. World Health Organization international histological classification of tumors. New York: Springer-Verlag;1999. p. 5–26.
2
Osada H, Takahashi T.
Oncogene
2002
;
21
:
7421
–34.
3
Sekido Y, Fong KM, Minna JD.
Annu Rev Med
2003
;
54
:
73
–87.
4
Sanchez-Cepedes M.
Lung Cancer
2003
;
40
:
111
–21.
5
Balsara BR, Testa JR.
Oncogene
2002
;
21
:
6877
–83.
6
Naoki K, Chen TH, Richards WG, Sugarbaker DJ, Meyerson M.
Cancer Res
2002
;
62
:
7001
–3.
7
Stephens P, Hunter C, Bignell G, et al.
Nature
2004
;
431
:
525
–6.
8
Lynch TJ, Bell DW, Sordella R, et al.
N Engl J Med
2004
;
350
:
2129
–39.
9
Paez JG, Janne PA, Lee JC, et al.
Science
2004
;
304
:
1497
–500.
10
Pao W, Miller V, Zakowski M, et al.
Proc Natl Acad Sci U S A
2004
;
101
:
13306
–11.
11
Bhattacharjee A, Richards WG, Staunton J, et al. Classification of human lung carcinoma by mRNA expression profiling reveals distinct adenocarcinoma subclasses.
Proc Natl Acad Sci U S A
2001
;
98
:
13790
–5.
12
Garber EM, Troyanskaya OG, Schluens K, et al. Diversity of gene expression in adenocarcinoma of the lung.
Proc Natl Acad U S A
2001
;
98
:
13784
–9.
13
Beer DG, Kardia SL, Huang CC, et al. Gene-expression profiles predict survival of patients with lung adenocarcinoma.
Nat Med
2002
;
8
:
816
–24.
14
Meyerson M, Franklin WA, Kelley MJ. Molecular classification and molecular genetics of human lung cancers.
Semin Oncol
2004
;
31
:
4
–19.
15
Druker BJ, Tamura S, Buchdunger E, et al. Effects of a selective inhibitor of the Abl tyrosine kinase on the growth of Bcr-Abl positive cells.
Nat Med
1996
;
2
:
561
–6.
16
Wreesmann VB, Sieczka EM, Socci ND, et al. Genome-wide profiling of papillary thyroid cancer identifies MUC1 as an independent prognostic marker.
Cancer Res
2004
;
64
:
3780
–9.
17
Risques RA, Moreno V, Ribas M, Marcuello E, Capella G, Peinado MA. Genetic pathways and genome-wide determinants of clinical outcome in colorectal cancer.
Cancer Res
2003
;
63
:
7206
–14.
18
Nakao K, Mehta KR, Fridlyand J, et al. High-resolution analysis of DNA copy number alterations in colorectal cancer by array-based comparative genomic hybridization.
Carcinogenesis
2004
;
25
:
1345
–57.
19
Aguirre AJ, Brennan C, Bailey G, et al. High-resolution characterization of the pancreatic adenocarcinoma genome.
Proc Natl Acad Sci U S A
2004
;
101
:
9067
–72.
20
Sonoda I, Imoto I, Inoue J, et al. Frequent silencing of low density lipoprotein receptor–related protein 1B (LRP1B) expression by genetic and epigenetic mechanisms in esophageal squamous cell carcinoma.
Cancer Res
2004
;
64
:
3741
–7.
21
Emmert-Buck MR, Bonner RF, Smith PD, et al. Laser capture microdissection.
Science
1996
;
274
:
998
–1001.
22
Tanabe C, Aoyagi K, Sakiyama T, et al. Evaluation of a whole–genome amplification method based on adaptor–ligation PCR of randomly sheared genomic DNA.
Genes Chromosomes Cancer
2003
;
38
:
168
–76.
23
Inazawa J, Inoue J, Imoto I. Comparative genomic hybridization (CGH)–arrays pave the way for identification of novel cancer-related genes.
Cancer Sci
2004
;
95
:
559
–63.
24
Katoh H, Shibata T, Kokubu A, et al. Genetic profile of hepatocellular carcinoma revealed by array–based comparative genomic hybridization: Identification of genetic indicators to predict patient outcome. J Hepatol. In press 2005.
25
Schouten JP, McElgunn CJ, Waaijer R, Zwijnenburg D, Diepvens F, Pals G. Relative quantification of 40 nucleic acid sequences by multiplex ligation–dependent probe amplification.
Nucleic Acids Res
2002
;
30
:
57
–62.
26
Yokoo H, Kondo T, Fujii K, Yamada T, Todo S, Hirohashi S. Proteomic signature corresponding to alpha fetoprotein expression in liver cancer cells.
Hepatology
2004
;
40
:
609
–17.
27
Seike M, Kondo T, Fujii K, et al. Proteomic signature of human cancer cells.
Proteomics
2004
;
4
:
2776
–88.
28
Reissmann PT, Koga H, Figlin RA, Holmes EC, Slamon DJ. Amplification and overexpression of the cyclin D1 and epidermal growth factor receptor genes in non–small–cell lung cancer. Lung Cancer Study Group.
J Cancer Res Clin Oncol
1999
;
125
:
61
–70.
29
Pellegrini C, Falleni M, Marchetti A, et al. HER–2/Neu alterations in non‐small cell lung cancer: a comprehensive evaluation by real time reverse transcription–PCR, fluorescence in situ hybridization, and immunohistochemistry.
Clin Cancer Res
2003
;
9
:
3645
–52.
30
Girard L, Zochbauer-Muller S, Virmani AK, Gazdar AF, Minna JD. Genome–wide allelotyping of lung cancer identifies new regions of allelic loss, differences between small cell lung cancer and non–small cell lung cancer, and loci clustering.
Cancer Res
2000
;
60
:
4894
–906.
31
Shigematsu H, Lin L, Takahashi T, et al. Clinical and biological features associated with epidermal growth factor receptor gene mutations in lung cancers.
J Natl Cancer Inst
2005
;
97
:
339
–46.
32
Tokumo M, Toyooka S, Kiura K, et al. The relationship between epidermal growth factor receptor mutations and clinicopathologic features in non–small cell lung cancers.
Clin Cancer Res
2005
;
11
:
1167
–73.
33
Shigematsu H, Takahashi T, Nomura M, et al. Somatic mutations of the HER2 kinase domain in lung adenocarcinomas.
Cancer Res
2005
;
65
:
1642
–6.
34
Schmidt L, Duh FM, Chen F, et al. Germline and somatic mutations in the tyrosine kinase domain of the MET proto–oncogene in papillary renal carcinomas.
Nat Genet
1997
;
16
:
68
–73.
35
Park WS, Dong SM, Kim SY, et al. Somatic mutations in the kinase domain of the Met/hepatocyte growth factor receptor gene in chilhood hapatocellular carcinomas.
Cancer Res
1999
;
59
:
307
–10.
36
Ma PC, Kijima T, Maulik G, et al. c–Met mutational analysis in small cell lung cancer: novel juxtamembrane domain mutations regulating cytoskeletal functions.
Cancer Res
2003
;
63
:
6272
–81.
37
Dang TP, Gazdar AF, Virmani AK, et al. Chromosome 19 translocation, overexpression of Notch3, and human lung cancer.
J Natl Cancer Inst
2000
;
92
:
1355
–7.
38
Forgacs E, Wren JD, Kamibayashi C, et al. Searching for microsatellite mutations in coding regions in lung, breast, ovarian and colorectal cancers.
Oncogene
2001
;
20
:
1005
–9.
39
Eguchi K, Kanai Y, Kobayashi K, Hirohashi S. DNA hypermethylation at the D17S5 locus in non-small cell lung cancers: its association with smoking history.
Cancer Res
1997
;
57
:
4913
–5.
40
Kosaka T, Yatabe Y, Endoh H, Kuwano H, Takahashi T, Mitsudomi T. Mutations of the epidermal growth factor receptor gene in lung cancer: biological and clinical implications.
Cancer Res
2004
;
64
:
8919
–23.
41
Yatabe Y, Koga T, Mitsudomi T, Takahashi T. CK20 expression, CDX2 expression, K-ras mutation, and goblet cell morphology in a subset of lung adenocarcinomas.
J Pathol
2004 Jun
;
203
:
645
–52.
42
Noda N, Matsuzoe D, Konno T, Kawahara K, Yamashita Y, Shirakusa T. K-ras gene mutations in non-small cell lung cancer in Japanese.
Oncol Rep
2001
;
8
:
889
–92.
43
Sordella R, Bell DW, Haber DA, Settleman J. Gefitinib-sensitizing EGFR mutations in lung cancer activate anti-apoptotic pathways.
Science
2004
;
305
:
1163
–7.
44
Webb CP, Taylor GA, Jeffers M, et al. Evidence for a role of Met-HGF/SF during Ras-mediated tumorigenesis/metastasis.
Oncogene
1998 Oct 22
;
17
:
2019
–25.
45
Furge KA, Kiewlich D, Le P, et al. Suppression of Ras-mediated tumorigenicity and metastasi through inhibition of the Met receptor tyrosine kinase.
Proc Natl Acad Sci U S A
2001
;
98
:
10722
–7.
46
Ma PC, Jagadeeswaran R, Jagadeesh S, et al. Functional expression and mutations of c-Met and it's therapeutic inhibition with SU11274 and small interfering RNA in non-small cell lung cancer.
Cancer Res
2005
;
65
:
1479
–88.
47
Sattler M, Pride YB, Ma P, et al. A novel small molecule met inhibitor induces apoptosis in cells transformed by the oncogenic TPR-MET tyrosine kinase.
Cancer Res
2003
;
63
:
5462
–9.
48
Michieli P, Mazzone M, Basilico C, et al. Targeting the tumor and it's microenvironment by a dual-function decoy Met receptor.
Cancer Cell
2004
;
6
:
61
–73.
49
Lynch TJ, Adjei AA, Bunn PA Jr, et al. Novel against in the treatment of lung cancer: conference summary statement.
Clin Cancer Res
2004
;
10
:
4199
–204.