To identify the major tumor suppressor gene (TSG) loci involved in the pathogenesis of lung cancer, we have conducted a high-resolution (10 cM), genome-wide search of loss of heterozygosity (LOH). Thirty-six lung cancer cell lines [14 small cell lung cancers (SCLCs) and 22 non-SCLCs (NSCLCs)] and their matched control DNAs were analyzed using 399 fluorescent microsatellite markers from the ABI Prism linkage mapping set v.2 on an ABI 377 sequencer/genotyper. Overall, 22 different regions with more than 60% LOH were identified:(a) 13 regions with a preference for SCLC;(b) 7 regions with a preference for NSCLC; and(c) 2 regions affecting both SCLC and NSCLC. The chromosomal arms with the most frequent LOH were 1p, 3p, 4p, 4q, 5q,8p, 9p (p16), 9q, 10p, 10q, 13q(Rb), 15q, 17p (p53), 18q, 19p, Xp, Xq. In addition, new homozygous deletions were found at 2p23, 8q24, 18q11,and Xq22. On average, 34% (SCLC) to 36% (NSCLC) of markers showed allele loss in individual tumors, with an average size of subchromosomal region of loss of five to six markers (50–60 cM). Whereas SCLC and NSCLC had different regions of frequent LOH (hot spots), and NSCLC had more of these regions (n = 22) than SCLC (n = 17), in all other parameters (fractional allelic loss, number of breakpoints, and number of microsatellite alterations), SCLC and NSCLC were not significantly different. Clustering analysis revealed correlations between LOH on different chromosomes that suggest previously unknown genetic interactions for lung cancer development. We conclude that(a) in lung cancer cell lines, at least 17–22 chromosomal regions with frequent allele loss are involved, suggesting that the same number of putative TSGs are inactivated;(b) SCLC and NSCLC frequently undergo different specific genetic alterations; and (c) clusters of TSGs are likely to be inactivated together. Overall, these data provide global estimates of the extent of genetic changes leading to lung cancer and will be useful for the positional cloning of new TSGs and for the identification of multiple new biomarkers for translational research.

Lung cancer is one of the leading cause of cancer-related deaths in industrialized countries. Its major etiological factor is tobacco smoke, a complex mixture containing many carcinogens that play a role in the genetic and epigenetic changes occurring in the respiratory epithelium during the multistage process of lung carcinogenesis. It is currently believed that 10 or more such events may have occurred before lung cancers become clinically evident (1). These genetic changes include activation of proto-oncogenes such as myc, Kras, EGFR, Her-2/neu, and BCL-2 as well as the inactivation of TSGs3like p53, Rb, FHIT, and p16(2, 3). Genetic changes affecting TSGs usually involve two events: (a) the loss of large chromosomal DNA regions of one parental DNA; and (b) a smaller mutational event(e.g., point mutation, smaller deletion, or promoter hypermethylation) affecting the second allele. The loss of chromosomal DNA can be detected by several cytogenetic techniques or by screening the tumors for LOH by allelotyping (reviewed in Refs. 2and 3). Several areas of frequent allelic loss [1p, 3p(including FHIT), 4p, 4q, 5q, 6q, 8p, 9p21(p16INK4), 10q, 11p, 13q14 (RB1),17p13 (p53), 18q, 19p, 22q, and Xq (2, 3)]have been found previously in lung cancers. In a few cases, the underlying TSGs have been identified, whereas the search for the other putative TSGs is under way in several laboratories.

Whereas several regions (such as 3p) have received detailed analysis with many markers, to our knowledge, there has not been a comprehensive, genome-wide analysis of allelic loss with a high density of markers. In particular, there are little or no data on analysis of individual lung cancers with a battery of allele loss tests. To further refine the previous mapping of regions of allele loss and to find other potential LOH sites, we carried out a genome-wide allelotyping in 36 lung cancer cell lines (14 SCLCs and 22 NSCLCs) using 399 fluorescent microsatellite markers and an ABI 377 sequencer/genotyper. We have previously reported preliminary allelotyping of multiple chromosomal regions using 85 polymorphic microsatellite markers (4),as well as studies of selected regions such as chromosomes 3p, 4p, 4q,and 8p for analysis with high densities of markers (5, 6, 7). As a prelude to performing the present genome-wide analysis on lung cancer cell lines, we tested a subpanel of 12 NSCLC lines and their archival specimen for morphology, aneuploidy, immunohistochemical expression of HER2/neu and p53 proteins, and LOH at 13 chromosomal regions using 37 of these 85 microsatellite markers (8). We found an excellent concordance (>95%) in these parameters between the lung tumor cell lines and their corresponding uncultured tumor tissues. Thus, although tumor cell lines may have acquired additional abnormalities in culture, in the large majority of cases, they faithfully represent the uncultured tumors. The 399 markers used in the current study are widely used, commercially available, and evenly spaced across the genome. They are different from those used previously and, together with our prior tests, give us LOH data on well over 500 different microsatellite markers. In this report, several regions with LOH have been identified in lung cancer cell lines, extending previous results, and four new HDs have been detected. In addition, clustering analysis uncovered previously unknown interactions between sites of allelic loss. The lung tumor cell lines we have genotyped in this report have been deposited by us in the American Type Culture Collection and have been widely distributed to many investigators. They provide an extremely useful renewable resource for verification of results, for positional cloning and characterization of new TSGs and oncogenes, and for expression and functional studies in lung cancer. The data generated here will provide new markers that can be used for early lung cancer detection, monitoring of chemoprevention trials, and positional cloning of new TSGs.

Tumor Cell Lines.

The 36 lung cancer cell lines used in this study (14 SCLCs and 22 NSCLCs) and their normal matched BL cell line controls were established by us at the National Cancer Institute (NCI-H series) and at the Hamon Center for Therapeutic Oncology Research, University of Texas Southwestern Medical Center (HCC series; Table 1). They were grown in cell culture using previously published conditions(RPMI 1640 supplemented with 5% or 10% fetal bovine serum), and DNA was prepared by standard methods (4). These cell lines have been deposited for distribution in the American Type Culture Collection.4

Markers and Optimization of Multiplex PCR.

The 400 microsatellite markers used in this study were obtained from the ABI Prism linkage mapping set v.2(Perkin-Elmer)5and were originally selected from the Généthon human linkage map (9). They consist of fluorescent primer pairs end-labeled with fluorochromes FAM, HEX, or NED that amplify dinucleotide-repeat fragments of 70–400 bp. Chromosomal localization of each marker was estimated by combining data from the Généthon genetic map and from the Genome Database.6We optimized the vendor’s conditions to perform multiplex PCR by varying the following parameters: (a) the combination of microsatellite marker primers from the various ABI Prism sets that can be coamplified and loaded together (only markers with the same fluorochrome and different size products were combined); (b)the relative concentration of marker primers within each combination;and (c) the MgCl2 concentration. We were unable to use one of the markers (D1S2890) from the ABI Prism set,and it was discarded. The list of multiplex sets and the specific optimization conditions we used are available on request.

PCR Reactions.

Optimized multiplex PCR reactions were performed in 10-μl total volumes with three to four ABI markers (0.1–0.65 μl for each individual undiluted primer pair or 0.5–3.25 pmol for each primer),1× PCR buffer, 50 ng of cell line DNA, 1.5–4.5 mmMgCl2 (depending on the multiplex set of primers), 0.25 mm each deoxynucleotide triphosphate, and 0.4 unit of AmpliTaq Gold DNA polymerase (Perkin-Elmer). The following PCR run conditions were used: (a) 95°C for 10 min (1 cycle); (b) 94°C for 30 s, 55°C for 30 s, and 72°C for 30 s (10 cycles); (c) 89°C for 30 s,55°C for 30 s, and 72°C for 30 s (20 cycles); and(d) 72°C for 20 min (1 cycle).

Electrophoresis Using the ABI Prism 377 Automatic Sequencer/Genotyper.

The loading mix was prepared with 1.5 μl of deionized formamide, 0.3μl of dye (50 mg/ml dextran blue and 25 mm EDTA), and 0.25 μl of ABI internal size standard. Each lane contained this loading mix plus 0.3 μl each of FAM-, HEX-, and NED-labeled PCR product (the FAM-labeled product was diluted 1:2 before loading). The gels were prepared with 5% Long Ranger gel solution (Perkin-Elmer) and run at 3000 V for 2 h and 15 min. The fluorescent bands were laser scanned, and the data were stored electronically by the ABI 377 sequencer/genotyper. Representative output results are shown in Fig. 1. Complete or nearly complete loss of one allele was scored as LOH. The loss of both alleles was scored as HD. Retention of both alleles was scored as HET. A unique peak in both the tumor cell line and the matched constitutional BL cell line was scored as NI. A MA was scored when one of the tumor cell allele peaks was different from the BL peaks. Specifically, when the tumor cell had two peaks, the MA was scored as MA/HET; when the tumor cell had one peak, it was scored as either MA/LOH (when the BL cell had two peaks) or MA/NI (when the BL cell had one peak). MA/NI was interpreted as LOH because it is usually unlikely that the two BL alleles (represented by a single peak) would be shifted to the exact same position in the tumor cell.

Computer Analysis.

The data were analyzed using ABI Prism Genescan software. In addition,a series of Microsoft Visual Basic programs were written specifically for various computations or repetitive tasks in Microsoft Excel,including color-coded formatting of LOH patterns, LOH characteristics(LOH frequencies, sizes of LOH, breakpoint frequencies, and so forth),and cluster analysis (see below). The program codes are in a user-ready format and are available on request.

Cluster Analysis.

For the purpose of this analysis, each allelotyping datapoint was assigned a number and a color code as follows: (a) +1(green), HET and MA/HET (i.e., clear examples of retention of heterozygosity); (b) +0.5 (light green), NI or a group of consecutive NIs that is bordered on either side (for a given tumor cell line) by two regions of heterozygosity (+1; such NIs are considered likely to have both copies retained); (c) 0 (gray), NI or a group of consecutive NIs that is bordered by a region of heterozygosity (+1) on one side and by a region of loss (−1) on the other (i.e.,no way of interpolating for allele loss or retention); (d)−0.5 (light blue), NI or a group of consecutive NIs that is bordered by two regions of loss (−1; such NIs are considered likely to have undergone allele loss); and (e) 1(blue), LOH, MA/LOH, MA/NI, and HD (i.e., clear examples of allele loss).

This number coding allows NI data to be assigned a “partially informative” value in a conservative manner based on their context of loss, i.e., NIs that are surrounded by LOHs for a given tumor are statistically more likely to be LOHs than HETs, and,conversely, NIs that are in a region of HETs are more likely to be HETs. Next, a concordance value is assigned for any two data points by multiplying their corresponding numerical values as defined above. Data points with the same sign thus have a positive concordance value that may vary between 0 (no concordance) and 1 (full concordance), and data points with different signs have a negative concordance value that ranges from −1 (fully discordant) to 0 (no discordance).

The concordance value of two different markers for a group of tumor samples (such as SCLC or NSCLC) is defined as the average concordance value of all pairs of corresponding data points for the two markers. In other words, two markers are concordant for a group of tumor samples if the allelotyping data for these two markers are similar for each of the samples, and, conversely, they are discordant if the allelotyping data are opposite for each tumor, with different degrees of concordance(from −1 to 1; from totally discordant to totally concordant).

The clustering algorithm is based in part on a strategy described by Eisen et al.(10) and was designed as follows. In the first round, all pairs of 399 markers (there are 79,800 such pairs) are scanned for the pair that has the highest concordance value. One of the markers from this pair is discarded, and the other is replaced by an array containing these two markers and whose LOH data are the average of the two sets of data for each marker (in their number format). The second round now has 398 elements that are either markers or arrays of markers associated with LOH data. The same procedure can be repeated on these elements, resulting in the replacement of the pair with the higher concordance by an array consisting of two or more markers made from the joining of the two elements (the average of LOH data are weighted by the number of markers in each element). This process is repeated until only one element remains that consists of an array of 399 markers listed in an order that clusters together markers with similar LOH patterns. This clustering is arranged so that markers with high LOH frequencies are preferentially positioned at the beginning of the clustered array (this is done in each round by joining the marker with the lower LOH frequency to the right of the marker with the higher LOH frequency) and chromosomal distances between adjacent markers in the cluster are minimized (this is done in each round by selecting the pair with the highest concordance, and, if there is more than one, selecting the pair with the shortest chromosomal distance). Finally, the whole clustered array is divided into nonoverlapping subgroups so that markers within each group have at least 75% concordance (i.e., have a concordance value of at least 0.75) between each other (Fig. 4).

Statistical Analysis.

For association between LOH frequencies and lung cancer types (SCLC versus NSCLC; Fig. 2), a one-sided binomial distribution test was performed (comparing the lower proportion against the higher one). Comparison of the averages of FAL, heterozygosity rates, numbers of breakpoints, numbers of MAs,numbers of allelic loss hot spots, average sizes of LOH (number of contiguous markers involved in LOH), and numbers of whole chromosome losses were done by using Student’s two-sided t test. Correlation between the presence of MAs and the frequency of allelic loss was assessed using both Pearson correlation coefficient and covariance.

LOH Analysis Reveals Regions of Frequent Allelic Loss in Both SCLC and NSCLC.

In this report, 399 fluorescent polymorphic microsatellite markers(from ABI Prism set v.2) were used that span the entire human genome with an average intermarker distance of ∼10 cM. To increase the efficiency of the allelotyping protocol described in the ABI Prism manuals, we first carried out optimization of multiplex PCR conditions(see “Materials and Methods”). The 399 markers were combined into 110 panels containing three to four markers each that can amplify PCR fragments of different sizes. Because the markers within each panel are labeled with a common fluorescent dye (FAM, HEX, or NED), the PCR products generated from the 110 panels can be further combined and loaded into 36 lanes so that each lane contains products of different sizes and fluorochromes. One gel (36 lanes) is therefore sufficient to analyze each DNA sample for all 399 markers.

Thirty-six lung cancer cell line DNAs and their normal matched BL cell line DNA controls were allelotyped with all 399 markers (Fig. 2). Fig. 1 shows representative output data from the ABI genotyper/sequencer. Overall, the average marker heterozygosity rate for the BL DNAs was 79%, with no significant difference between SCLC and NSCLC cases(Table 1). This heterozygosity rate is an exact match with the average ABI/Généthon heterozygosity rates (79%) for these markers. Thus it appeared that the BL lines from these lung cancer patients were reasonable controls. The percentage of LOH per informative cases for both SCLC and NSCLC ranged from 0% (marker D7S519) to 92% (marker D17S938). The average percentage of LOH was 36%, with no significant differences between SCLC and NSCLC (Table 1). Previously, we and others have made the working assumption that LOH rates < 30%were probably due to random genetic changes (4). To examine this question, we plotted the number of markers showing various amounts of LOH (Fig. 3). It appears that a trough in the range 30–35% separates two regions,one (Fig. 3, left, with a peak at 20–25%) of which we assume contains mostly random losses and another (Fig. 3, right) that represents losses presumably associated with the cancer-specific phenotype. In addition, 42 markers show very frequent LOH (≥60%) for both SCLC and NSCLC and are located in regions 3p, 4q, 8p, 9q, 13q, 17p, Xp, and Xq (Fig. 2, markers in red). The 17p region loss most likely represents LOH for p53, whereas the 13q region loss predominantly represents LOH for Rb.

SCLC and NSCLC Share Some Common Features of Allelic Loss.

The FAL is a measure of the extent of allele loss in a given tumor sample and is defined as the number of LOH events in a sample divided by the total informative (heterozygous) markers in the corresponding normal DNA (11). The FAL in lung cancers ranged from 1%(HCC78) to 63% (H2882). The average FAL was 34% for SCLCs and 36%for NSCLCs, a difference that is not significant (Table 1).

At least one example of MA was found for 34% (137 of 399) of the markers, most of which occurred in only one tumor DNA sample (Table 1). No significant differences in the number of MAs between SCLC and NSCLC were observed. Two markers had four tumors with MAs (D16S3136 and D20S178) but were not located in regions of frequent allelic loss. Each lung cancer cell line had an average of five MAs in the 399 markers (Table 1). No significant correlation was found between the presence of MA and the frequency of allelic loss for individual markers; the average frequency of allelic loss for markers showing MAs were, in fact, slightly lower than the overall average (33%versus 36%).

The sizes of regional allelic loss in individual tumors were determined. The size of each of these region losses was calculated as the number of contiguous markers showing LOH, including intervening NI markers between flanking regions of LOH. The average size of contiguous LOH was calculated for each lung tumor cell line (excluding clear cases of whole chromosomal losses; Table 1). The average number of contiguous markers involved in LOH in SCLC was 5.9 ± 1.3 markers,whereas in NSCLC, the average size was 5.5 ± 1.6 markers (not a significant difference; Table 1). Assuming an average intermarker distance of 10 cM, this means that the average length of regions of allele loss in lung cancer was 50–60 cM. The number of whole chromosome losses in lung cancer ranged from 0 (H1963 and HCC78)to 13 (HCC827; average, 3–4 chromosomes lost/tumor; Table 1). HCC827 lost 13 chromosomes and had only six regions of subchromosomal allelic loss. Thus, in this cell line, chromosome nondisjunction appeared to be the predominant mechanism of allele loss. In contrast, HCC78 had no loss of whole chromosomes and only two regions of LOH. Different mechanisms of allelic loss can therefore occur for different tumors.

To determine the location of regions with very frequent breakpoints in the tumors, we calculated the percentages of breakpoints for each pair of contiguous markers (SCLC and NSCLC combined). Here, breakpoints are defined as the junction between a marker showing LOH and an adjacent marker retaining heterozygosity (HET) in a given tumor sample DNA. If a LOH and a HET are separated by NI markers, the intervening junctions are considered part of the same breakpoint. The highest percentages of breakpoints (≥20%) were found in 1q12, 1q23, 3p11–13, 3p21.3–22,5p11–13, 8p12–q11, 9p13–21.1, 10p11–q11, 16p11, and 17p11 (Fig. 2, double bars between markers). Several of these were located near a hot spot of allelic loss. The average percentage of breakpoints for all pairs of consecutive markers was 4.8% and ranged from 0–47%with a SD of 6.2%. Hence, contiguous markers showing ≥20%breakpoints are significantly different (P = 0.01) from the average markers. For individual tumors, the number of breakpoints was 18 ± 6 in SCLC and 20 ± 9 in NSCLC (not a significant difference; Table 1).

Hot Spots of Frequent Allelic Loss Have Distinct Patterns in SCLC and NSCLC.

To define more clearly the regions that undergo allelic loss, areas of common allele loss (also called minimal areas of loss or hot spots)were determined by identifying the smallest common regions of frequent LOH in SCLC and in NSCLC (Fig. 2). For individual samples, these regions were scored as: (a) HET when at least one HET (or MA/HET) occurred in the region for this sample (the rest,if any, were NI); (b) NI when only NIs were observed in the region or when both HET and LOH were present; and (c) LOH when at least one LOH (or HD, MA/LOH, or MA/NI) occurred in the region for this sample (and the rest were NI). Frequencies of allele loss for these common areas of loss are calculated as usual as(LOH)/(HET + LOH). Regions of minimal loss are listed in Table 2 along with potential candidate genes.

Several such regions of allelic loss were found, the most prominent of which (>60% loss) were 1p36, 3p (multiple sites), 4p14–q13,4q21–28, 4q34–ter, 5p13–q13, 5q32–ter, 8p21–23, 9p21–22(p16 and p15), 9q22–32, 10p15, 10q22–23, 13q11,13q12–14 (RB1), 13q34, 17p12–13 (p53), 19p13,Xp–q21, and Xq22, consistent with previous results. Several loci were novel in lung cancer (Table 2). Thirty-two regions of loss were confined to one marker or located between two markers and, as such, are useful candidates for further allelotyping with higher densities of markers (Fig. 2). Note that SCLCs and NSCLCs have very distinct patterns of allele loss, as observed previously (4). In fact, one-sixth of all markers (65 of 399 markers) showed a statistically significant difference in LOH frequencies between SCLC and NSCLC (Fig. 2). On average, each SCLC has undergone loss of 17 ± 4 loci (hot spots), whereas each NSCLC has lost 22 ± 8 loci, results that were significantly different(Table 1).

HDs.

Because of the importance of HDs in the positional cloning of TSGs, we searched for clear examples where both alleles were lost. Nine HDs were detected (Table 3), two of which appeared to be contiguous in the same cell line (H2052). They are located in chromosome regions 2p23.3, 3p14.2, 8q24.3, 9p21,18q11.2, and Xq22.1. Four of these occurred in LOH hot spot regions where putative TSGs are probably located: (a) 3p14(FHIT) and 9p21 (p16INK4A), which have been described previously; and (b) 18q11.2 and Xq22,which are novel sites. Interestingly, in this series of markers, HDs were found only in NSCLCs. Three of these (H1648, HCC95, and H2052) had two HDs each (Table 3). However, HDs do occur in SCLCs as well, as has been shown previously (3).

The percentage of HDs that were detected with all 399 markers and in all 36 tumor samples was 0.06% [9/(399*36)]. This represents an estimate of the likelihood of finding a HD. It also means that one of four tumors can be expected to harbor a HD with our assay. However,because of the relatively small size of most reported HDs (typically less than 1 Mb) and the current intermarker distance of 10 cM (10 Mb),a larger number of markers would likely uncover proportionally more HDs.

Cluster Analysis Identifies LOH Correlations Suggestive of Gene Interaction.

This data set represents one of the first opportunities to see whether alterations (allele loss) in one chromosomal region are linked to changes in another region. We approached this by cluster analysis. The clustering algorithm we designed is based on a strategy similar to that described by Eisen et al.(10) and was implemented in Visual Basic for Excel in a user-ready format (available on request; see “Materials and Methods”).

The clustering analysis reordered all 399 markers in such a way that markers with similar LOH patterns tend to be grouped (clustered)together. It also identified discrete, nonoverlapping regions within the clustered array where LOH patterns are more than 75% concordant to one another as defined in “Materials and Methods” (vertical bars in Fig. 4, A and B). In most cases, these regions consist of markers that map to the same chromosome because their physical linkage tends to correlate with similar patterns of allelic loss. Obviously, these provide little new information. In contrast,biologically interesting results are obtained from those regions that contain clustered markers located on different chromosomes. A preliminary clustering analysis of all lung cancer samples did not reveal any concordance between markers from different chromosomes,presumably because of the different patterns of allelic loss between SCLC and NSCLC. Similarly, very little concordances were found when NSCLCs were analyzed as a group. However, analysis of SCLC and the adenocarcinoma subgroup of NSCLC (15 of 22 NSCLCs) revealed new correlations.

Fig. 4, A and B, shows the clustering analysis of all SCLCs and all adenocarcinomas, respectively. In each case, several regions of high concordance were revealed (vertical bars),and the magnifications in Fig. 4, C and D, show the most significant ones (consisting of markers from different chromosomes with high LOH frequencies). In SCLCs, concordances were observed between markers in the 3p14 locus (FHIT) and the 13q14 locus (RB), markers in the 13q14 locus (RB)and the 17p13 locus (p53), and 4q12 and 5q. In adenocarcinomas, concordances between 1q and 3p22, 3p14(FHIT) and 20p12–13, and 3q23 and 15q12 were noted. Overall, these data suggest that genetic collaboration occurs between the loss of corresponding TSGs during tumor development and that these concordant losses have a significant degree of histological specificity.

The present study describes a high-resolution, genome-wide allelotyping of 36 lung cancer cell lines using a large number(n = 399) of fluorescent microsatellite markers with a coverage of ∼10 cM on average. Previous allelotyping analyses of lung cancer by our group and others were restricted to particular chromosomal regions or arms or used relatively low densities of markers (4, 5, 6, 12). Very few reports have presented allelotyping data on multiple sites in the same tumors. Our results thus represent, to date, the highest resolution of lung cancer allelotyping with genome-wide coverage and allow for more comprehensive studies, including clustering analysis for the determination of loci interactions. To increase throughput, we have optimized the conditions for PCR multiplexing to allow analysis of each DNA sample with only one 36-lane ABI sequencer gel. In addition, several Visual Basic software programs were designed to facilitate data processing.

Several sites of frequent allelic loss have been detected in this study. The location of these sites was based on the determination of the minimal regions of loss that are defined by the occurrence of breakpoints (transitions between markers showing LOH and those retaining heterozygosity) surrounding regions of LOHs. These sites are likely to represent the major TSG regions that are affected in lung cancer and are therefore good indicators of critical regions of loss that warrant further investigation.

It was shown previously that chromosome arms 3p (several sites), 13q(Rb), and 17p (p53) have the most frequent allelic loss in both SCLC (>90%) and NSCLC (>70%; Refs.4 and 13, 14, 15). LOHs were also found in >60%of cases in 4q, 5q, 10q, and 22q for SCLC and on 6q, 8p, 9p, and 19p for NSCLC, and additional sites with moderate frequencies were also detected throughout the genome (4, 5, 6, 16). Our results agree with and extend these findings. In particular, of 38 chromosomal arms previously analyzed by us, 25 from SCLC or NSCLC had an average frequency of LOH that agrees with the present findings to within 10%,although the markers used were different (4, 5, 6, 7).

Overall, we found more than 50 different sites of frequent allelic loss(hot spots) in SCLC and NSCLC; about half of these sites have not been described previously in lung cancer. The most prominent of these novel sites (>60% LOH) occurred on 1q23, 9q22.33–32 (PTCHlocus), 10p15.3, and 13q34 in SCLC and on 13q11 and Xq22.1 in NSCLC. To our knowledge, except for PTCH on 9q, none of these novel sites with high-frequency LOH have been shown to harbor candidate TSGs. Future investigations with an even higher resolution of microsatellite markers will therefore be crucial to narrow down the sites of frequent allelic loss. Such studies have been performed previously by us in 3p (54 markers), 4p and 4q (16 markers), and 8p (26 markers), and we identified several regions of loss in addition to those reported here (5, 6, 7).

On average, 17 regions of high LOH frequency (hot spots) were found in each SCLC, and 22 regions of high LOH frequency were found in each NSCLC, giving an indication of the possible number of TSGs mutated in each lung cancer. Of course, some of the allelic losses may be caused by genomic instability or the presence of fragile sites(16). In fact, some breakpoints were identified that were recurrent in a significant fraction (>20%) of tumors. They are located on 1q, 3p, 5p11–13, 8p12–q11, 9p13–q21, 10p11–q11, 16p11,and 17p11. Most are located near a hot spot of frequent allelic loss,and some of these losses may therefore be the consequence of recurrent alterations at common, potentially fragile sites. Whether or not losses associated with these fragile sites participate in the tumor growth selection process is a question that will need to be addressed through other experimental approaches such as functional assays. Whereas we use the term breakpoint to describe these LOH-HET junctions, we stress that these could occur by physical chromosomal deletion or by other processes such as mitotic recombination (17, 18).

The statistically significant but relatively small difference in the number of hot spots in SCLC and NSCLC (17 versus 22) is possibly a consequence of the greater number of NSCLC samples that were analyzed because this tends to refine the minimal areas of loss and hence increase their number. This would be supported by the otherwise overall similarities in the characteristics of SCLC and NSCLC (FAL,average sizes of LOH, number of breakpoints, and microsatellite abnormalities) shown in Table 1. However, these two histological types do show significantly different LOH patterns for specific chromosomal regions.

Interestingly, one cell line (HCC78) appears to have a very stable genome with a low percentage of allele loss (FAL of 1%). In this cell line, only two markers (D3S1300/FHIT and D6S292)demonstrated a LOH, and both occur in a chromosomal hot spot for allelic loss [3p14.2 (FHIT locus) and 6q22.3–23],suggesting that only a few TSG mutations or losses may have been sufficient (together with some oncogene activations) for the transformation of this lung cancer. This tumor, although unusual, thus represents an important case for future detailed study.

Nine HDs were found in six regions, and all may be of potential use in identifying the resident TSGs. Of those six regions, four are localized to hot spot areas of allele loss: (a) 3p14; (b)9p21; (c) 18q11; and (d) Xq22. Two of these, 3p14 and 9p21, harbor known TSGs, namely FHIT and p16,respectively. Several HDs have been reported previously in 18q21–22 in colorectal and pancreatic cancers where DCC, SMAD2, and SMAD4 are located(19, 20, 21), but none to date were found at 18q11, and no known TSG appears to be present at this locus. Because of its high frequency of allelic loss (59%) and its specificity to NSCLC (0% in SCLC), this region appears particularly interesting and, along with another HD in 8q24, is currently under investigation. The deletion we found in Xq22 is not, strictly speaking, a HD because it was detected in a cell line from a male patient, but the absence of a wild-type allelic copy suggests that the loss of this site may also be involved in the tumorigenic process. No such Xq22 deletions have been reported previously.

MAs have been reported in 35% of SCLCs and 22% of NSCLCs using a relatively small number of markers and are associated with advanced tumor stage (2). Our results show that nearly all tumors have at least one MA, and half of them have more than five MAs. Five of the 36 tumors (14%) had more than 10 MAs and might be good candidates for screening of gene mutation in the DNA mismatch repair pathway. However, none of the MAs had the appearance of “laddering”characteristic of the replication error phenotype seen in colon cancers with a mutation in one of the mismatch repair enzymes(22).

Cluster analysis was performed using a Visual Basic program that searches for correlations between the loss or retention of different markers in lung cancer. The algorithm we designed is similar to one that has been used for cluster analysis of microarray data(10) and is well adapted to allelotyping studies. A number of correlated groups of markers that appeared to be lost or retained in a coordinate fashion were thus identified. Concordance groups might represent different TSGs whose synergistic loss would provide a growth advantage during cancer development. Because of the unavoidable presence of NI data, the groups of concordant markers listed by our clustering program may constitute only a subset of what would be detected if all markers were completely informative.

In SCLC, correlations were found between markers on 3p14–25 and 13q12–14 and between 13q14 and 17p13 that suggest synergistic interactions between the loss of putative TSGs in these loci(e.g.,FHIT and RB, and RBand p53, respectively). Alternatively, these correlations may be the consequence of sequential molecular genetic changes that occur in the development of lung cancer(23, 24, 25, 26). A more novel finding is the concordance found between allelic loss on 4q12 and 5q, with both regions showing a high rate of LOH. Whereas no candidate TSGs have been described in the first region, several have been reported in 5q, including MCC, APC,DOC2, and IRF1(27, 28, 29, 30). Further investigation of the 4q12 region may thus reveal a gene collaborating with one of the putative TSGs on 5q.

An initial clustering analysis in NSCLCs did not reveal significant concordances, presumably because of the histopathological heterogeneity of these NSCLCs. However, in the subset of 15 adenocarcinomas, two concordance groups were observed that had frequent allelic loss, one associating 1q and 3p22, and the other associating 3p14 and 20p12–13. Two novel hot spots of allelic loss were uncovered in 1q, and one novel hot spot of allelic loss was uncovered in 20p12,and neither currently harbors candidate TSGs deposited in GenBank. These regions are thus interesting candidates as collaborating TSGs in adenocarcinomas.

Future experiments will include a genome-wide mRNA expression analysis using cDNA microarrays and will relate those results with the present findings. This will allow us to find several candidate target genes that may be affected by the various chromosomal losses described here and will further refine the genomic study of lung cancer-related changes. Together with the present study, it will provide us with a comprehensive genomic characterization of several tumor cell lines that will be very useful for future functional analyses and gene discovery experiments.

Fig. 1.

Representative allelotyping output data. Top, normal B lymphocyte cell line; bottom, HCC366 lung cancer cell line. Each PCR product of a given size is represented by a major peak with minor peaks(stutter bands) on the left. The left scale (in arbitrary units) represents the intensity of the peaks. The size (in bp) is shown above the figure.

Fig. 1.

Representative allelotyping output data. Top, normal B lymphocyte cell line; bottom, HCC366 lung cancer cell line. Each PCR product of a given size is represented by a major peak with minor peaks(stutter bands) on the left. The left scale (in arbitrary units) represents the intensity of the peaks. The size (in bp) is shown above the figure.

Close modal
Fig. 2.

Complete allelotyping results of 14 SCLCs and 22 NSCLCs. Microsatellite markers are shown on the left; markers in red have >60% LOH in SCLC and NSCLC. Double bars, breakpoints. Cell lines are indicated above the figure. The percentages of LOH/informative cases are shown on the right of each panel. Percentages >35% are color-coded, with darker blue corresponding to higher percentages. Minimal areas of allele loss (hot spots) are depicted by vertical bars,together with their chromosomal locations and percentage of regional loss. P (right) indicates statistically significant differences in LOH frequencies between SCLC and NSCLC. Legend for allelotyping data: green box, HET; hatched green box, MA with both alleles retained; blue box, LOH; hatched blue box, MA with loss of one allele; blue box with a white H, HD; gray box, NI.

Fig. 2.

Complete allelotyping results of 14 SCLCs and 22 NSCLCs. Microsatellite markers are shown on the left; markers in red have >60% LOH in SCLC and NSCLC. Double bars, breakpoints. Cell lines are indicated above the figure. The percentages of LOH/informative cases are shown on the right of each panel. Percentages >35% are color-coded, with darker blue corresponding to higher percentages. Minimal areas of allele loss (hot spots) are depicted by vertical bars,together with their chromosomal locations and percentage of regional loss. P (right) indicates statistically significant differences in LOH frequencies between SCLC and NSCLC. Legend for allelotyping data: green box, HET; hatched green box, MA with both alleles retained; blue box, LOH; hatched blue box, MA with loss of one allele; blue box with a white H, HD; gray box, NI.

Close modal
Fig. 3.

Number of markers as a function of LOH percentage in combined SCLCs and NSCLCs. Two modes, which are separated by the vertical line, can be seen. LOH percentages in the first mode (i.e., <35% LOH) are considered random or semirandom, whereas those in the second mode (>35%) are considered nonrandom and significant for the pathogenic process.

Fig. 3.

Number of markers as a function of LOH percentage in combined SCLCs and NSCLCs. Two modes, which are separated by the vertical line, can be seen. LOH percentages in the first mode (i.e., <35% LOH) are considered random or semirandom, whereas those in the second mode (>35%) are considered nonrandom and significant for the pathogenic process.

Close modal
Fig. 4.

Cluster analysis of microsatellite markers in SCLCs(A and C) and adenocarcinomas(B and D). A and B, complete clustering results for all markers. Vertical bars, regions with >75% concordance. C and D, magnification of the most significant regions showing concordances between markers mapping to different chromosomes. Top, cell lines; left, markers; right, chromosomal locations. Color codes: dark blue box, LOH; dark green box, HET; gray box, NI; light blue box, partially informative (LOH); light green box, partially informative (HET); hatched boxes,MA.

Fig. 4.

Cluster analysis of microsatellite markers in SCLCs(A and C) and adenocarcinomas(B and D). A and B, complete clustering results for all markers. Vertical bars, regions with >75% concordance. C and D, magnification of the most significant regions showing concordances between markers mapping to different chromosomes. Top, cell lines; left, markers; right, chromosomal locations. Color codes: dark blue box, LOH; dark green box, HET; gray box, NI; light blue box, partially informative (LOH); light green box, partially informative (HET); hatched boxes,MA.

Close modal

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1

Supported by National Cancer Institute Grant CA71618, Specialized Program of Research Excellence Grant CA70907(National Cancer Institute, Bethesda, MD), and the G. Harold and Leila Y. Mathers Charitable Foundation. S. Z-M. was supported by the Austrian Science Foundation (J1658-MED, J1860-MED).

3

The abbreviations used are: TSG, tumor suppressor gene; SCLC, small cell lung cancer; NSCLC, non-SCLC; LOH,loss of heterozygosity; MA, microsatellite alteration; HET,heterozygote; NI, noninformative; HD, homozygous deletion; FAL,fractional allelic loss; BL, B lymphoblastoid..

4

World Wide Web address: www.atcc.org.

5

World Wide Web address:www.Perkin-Elmer.com/ab.

6

World Wide Web address: http://www.gdb.org/.

Table 1

Characteristics of SCLC and NSCLC cell linesa

SCLCSexMatched BL heterozygosityFALAverage size of LOH (no. of markers)No. of whole chromosome lossesNo. of breakpointsNo. of MAsbNo. of LOH hot spotsNSCLCSubtypecSexMatched BL heterozygosityFALAverage size of LOH (no. of markers)No. of whole chromosome lossesNo. of breakpointsNo. of MAsbNo. of LOH hot spots
H128 85% 44% 5.6 24 24 16 H1395 AD 78% 36% 7.4 19 20 
H209 81% 22% 6.1 16 13 H1648 AD 83% 40% 5.9 21 27 
H289 77% 43% 7.7 11 18 H1819 AD 81% 45% 4.9 42 34 
H1184 77% 33% 5.9 23 17 H1993 AD 76% 34% 5.3 28 21 
H1450 75% 28% 5.4 21 12 H2009 AD 77% 41% 4.9 37 14 29 
H1607 78% 45% 5.2 33 22 H2087 AD 77% 40% 6.1 20 10 22 
H1672 80% 36% 9.2 12 16 H2122 AD 78% 29% 6.9 12 11 16 
H1963 84% 12% 4.3 13 10 H2347 AD 79% 17% 7.3 13 
H2107 83% 25% 4.4 21 17 H2882 NS 82% 63% 5.3 21 36 
H2141 76% 28% 5.9 11 16 H2887 NS 77% 35% 3.6 22 22 
H2171 77% 48% 6.4 17 22 HCC44 AD 80% 40% 6.3 18 22 
H2195 78% 38% 5.8 20 17 HCC78 AD 78% 1% 1.0 
HCC33 83% 25% 5.3 19 15 HCC193 AD 77% 23% 7.2 17 
HCC970 77% 42% 6.0 11 23 HCC515 AD 77% 34% 5.7 21 20 
         HCC827 AD 81% 52% 4.7 13 30 
Average  79.4% 34% 5.9 18 17 HCC366 AS 75% 29% 3.8 25 11 17 
SD  3.1% 11% 1.3 H1770 NE 81% 31% 7.8 11 12 
         H2052 ME 79% 32% 5.7 20 20 
         H2126 LC 77% 49% 5.5 27 30 
         HCC15 SQ 84% 40% 4.6 18 23 
         HCC95 SQ 77% 43% 7.4 18 29 
         HCC1171 LC 79% 35% 4.8 20 19 
                   
         Average   78.7% 36% 5.5 20 22 
         SD   2.4% 12% 1.6 
Pd  0.5 0.6 0.4 0.4 0.6 0.9 0.012           
SCLCSexMatched BL heterozygosityFALAverage size of LOH (no. of markers)No. of whole chromosome lossesNo. of breakpointsNo. of MAsbNo. of LOH hot spotsNSCLCSubtypecSexMatched BL heterozygosityFALAverage size of LOH (no. of markers)No. of whole chromosome lossesNo. of breakpointsNo. of MAsbNo. of LOH hot spots
H128 85% 44% 5.6 24 24 16 H1395 AD 78% 36% 7.4 19 20 
H209 81% 22% 6.1 16 13 H1648 AD 83% 40% 5.9 21 27 
H289 77% 43% 7.7 11 18 H1819 AD 81% 45% 4.9 42 34 
H1184 77% 33% 5.9 23 17 H1993 AD 76% 34% 5.3 28 21 
H1450 75% 28% 5.4 21 12 H2009 AD 77% 41% 4.9 37 14 29 
H1607 78% 45% 5.2 33 22 H2087 AD 77% 40% 6.1 20 10 22 
H1672 80% 36% 9.2 12 16 H2122 AD 78% 29% 6.9 12 11 16 
H1963 84% 12% 4.3 13 10 H2347 AD 79% 17% 7.3 13 
H2107 83% 25% 4.4 21 17 H2882 NS 82% 63% 5.3 21 36 
H2141 76% 28% 5.9 11 16 H2887 NS 77% 35% 3.6 22 22 
H2171 77% 48% 6.4 17 22 HCC44 AD 80% 40% 6.3 18 22 
H2195 78% 38% 5.8 20 17 HCC78 AD 78% 1% 1.0 
HCC33 83% 25% 5.3 19 15 HCC193 AD 77% 23% 7.2 17 
HCC970 77% 42% 6.0 11 23 HCC515 AD 77% 34% 5.7 21 20 
         HCC827 AD 81% 52% 4.7 13 30 
Average  79.4% 34% 5.9 18 17 HCC366 AS 75% 29% 3.8 25 11 17 
SD  3.1% 11% 1.3 H1770 NE 81% 31% 7.8 11 12 
         H2052 ME 79% 32% 5.7 20 20 
         H2126 LC 77% 49% 5.5 27 30 
         HCC15 SQ 84% 40% 4.6 18 23 
         HCC95 SQ 77% 43% 7.4 18 29 
         HCC1171 LC 79% 35% 4.8 20 19 
                   
         Average   78.7% 36% 5.5 20 22 
         SD   2.4% 12% 1.6 
Pd  0.5 0.6 0.4 0.4 0.6 0.9 0.012           
a

Sizes of LOH regions for individual tumors are measured as the number of contiguous markers with LOH, including intervening NI markers between flanking regions of LOH. Whole chromosome losses are not included in these calculations. Breakpoints are defined as junctions between LOH and HET (with or without intervening NIs) for individual samples. Hot spots are minimal regions of frequent (>35%) LOH.

b

105 markers have 1 MA; 23 markers have 2 MAs; 7 markers have 3 MAs (D1S450, D6S1574, D6S309, D6S470,D17S1868, D20S178, and D22S283); and 2 markers have 4 MAs (D16S3136 and D20S178).

c

AD, adenocarcinoma; AS,adenosquamous cell; LC, large cell; ME, mesothelioma; NE,neuroendocrine; SQ, squamous cell; NS, not specified.

d

P for differences between SCLC and NSCLC as measured by Student’s t test.

Table 2

Markers defining the minimal areas of allele loss (hot spots)

SCLCNSCLC
MarkersaLocibFrequency of regional lossCandidate TSGscMarkersLociFrequency of regional lossCandidate TSGs
D1S468 1p36.3 40% p73,TNFR2 D1S468-D1S2697 1p36.12-pter 64% p73, NBL1/DAN, TNFR2 
D1S196 1q23              d 82%  D1S207 1p31.1 53%  
D3S1297-D3S1263 3p25-26 100% VHL D1S206-D1S252 1p22-q12d 52%  
D3S1266 3p22.3-24 100% TGFBR2, DLC1 D1S196/D1S218 1q23-24d 55%  
D3S1289-D3S1566 3p13-p22 100% TGFBR2, MLH1/HNPCC2 D1S2800-D1S2842 1q41-43d 40%  
   DLC1, PTPG, FHIT, BAP1 D3S1297-D3S1304 3p25.2-pter 50% VHL 
D4S412-D4S419 4p15-16 54%  D3S2338 3p23-25.1 58%  
D4S405-D4S392 4p14-q13 67%  D3S1277/D3S1289 3p21.2-22 59% TGFBR2, DLC1, 
D4S414-D4S406 4q21-28 79%     MLH1/HNPCC2, BAP1 
D4S1535-D4S426 4q34-ter 77%  D3S1300 3p14.2 53% FHIT, PTPG 
D5S407-D5S424 5p13-q13 79% DOC2/DAB2 D3S3681-D3S1271 3p13-q13 50%  
D5S410-D5S408 5q32-ter 79%  D4S414 4q21-23d 43%  
D8S277-D8S258 8p21.3-23.1 50%  D4S1535 4q34-35 50%  
D9S286 9p23d 38%  D5S471-D5S2115 5q21.3-31 35% MCC, APC, IRF1 
D9S1677 9q22.33-32              d 64% PTCH D6S1574-D6S309 6p24-terd 41%  
D10S249 10p15.3              d 89%  D6S292 6q22.3-23d 55%  
D10S548/D10S197 10p12.3d 57%  D6S264-D6S281 6q26-27 48% IGF2R 
D10S1686 10q22-23.1 85%  D8S277-D8S258 8p21.3-23.1 73% DR5 
D11S4046-D11S1338 11p15.3-15.5 50% p57KIP2, BE2, HTS1/ST5 D9S171 9p21.3-22 79% p16INK4A, p15INK4B, p14              ARF 
D13S171-D13S263 13q12-14 93% RB1, BRCA2 D9S1677-D9S290 9q22.33-34.1d 55% TSC1 
D13S285 13q34              d 62% ING1 D9S1826-D9S158 9q34d 50% TSC1 
D15S128-D15S165 15q11-13d 57%  D10S591-D10S189 10p15d 38%  
D15S117 15q21-22.2d 58%  D10S208/D10S196 10p11-q11d 41%  
D16S3136-D16S520 16q 36% RB2, CDH1 D10S212 10q26.13-terd 36%  
D17S849-D17S921 17p12-ter 93% p53 D11S935-D11S905 11p11-14 35% WT1, KA11 
D18S462-D18S70 18q23 36%  D11S4175 11q14.3-21 45%  
D20S117 20p13d 40%  D12S352-D12S364 12p12.3-ter 45%  
    D13S175 13q11              d 67%  
    D13S171-D13S263 13q12-14 62% RB1, BRCA2 
    D15S128 15q11d 50%  
    D15S1007-D15S994 15q13-21.1d 48%  
    D15S127-D15S120 15q25-terd 36%  
    D16S423-D16S404 16p13.3d 45% TSC2 
    D16S515-D16S520 16q22-ter 45% CDH1 
    D17S849-D17S1852 17p12-13 86% p53 
    D18S478 18q11.2d 59%  
    D18S462-D18S70 18q23 57%  
    D19S209-D19S884 19p13.3 77%  
    D19S220-D19S420 19q12-13.2 50%  
    D19S210 19q13.33-ter 52%  
    D20S186 20p12d 50%  
    D21S1256/D21S1914 21q21 38%  
    D22S420 22q11 40% SNF5/INI1, CLTD 
    D22S423 22q13 59%  
    DXS1060-DXS991 Xp-q21 67%  
    DXS990 Xq22.1              d 67%  
SCLCNSCLC
MarkersaLocibFrequency of regional lossCandidate TSGscMarkersLociFrequency of regional lossCandidate TSGs
D1S468 1p36.3 40% p73,TNFR2 D1S468-D1S2697 1p36.12-pter 64% p73, NBL1/DAN, TNFR2 
D1S196 1q23              d 82%  D1S207 1p31.1 53%  
D3S1297-D3S1263 3p25-26 100% VHL D1S206-D1S252 1p22-q12d 52%  
D3S1266 3p22.3-24 100% TGFBR2, DLC1 D1S196/D1S218 1q23-24d 55%  
D3S1289-D3S1566 3p13-p22 100% TGFBR2, MLH1/HNPCC2 D1S2800-D1S2842 1q41-43d 40%  
   DLC1, PTPG, FHIT, BAP1 D3S1297-D3S1304 3p25.2-pter 50% VHL 
D4S412-D4S419 4p15-16 54%  D3S2338 3p23-25.1 58%  
D4S405-D4S392 4p14-q13 67%  D3S1277/D3S1289 3p21.2-22 59% TGFBR2, DLC1, 
D4S414-D4S406 4q21-28 79%     MLH1/HNPCC2, BAP1 
D4S1535-D4S426 4q34-ter 77%  D3S1300 3p14.2 53% FHIT, PTPG 
D5S407-D5S424 5p13-q13 79% DOC2/DAB2 D3S3681-D3S1271 3p13-q13 50%  
D5S410-D5S408 5q32-ter 79%  D4S414 4q21-23d 43%  
D8S277-D8S258 8p21.3-23.1 50%  D4S1535 4q34-35 50%  
D9S286 9p23d 38%  D5S471-D5S2115 5q21.3-31 35% MCC, APC, IRF1 
D9S1677 9q22.33-32              d 64% PTCH D6S1574-D6S309 6p24-terd 41%  
D10S249 10p15.3              d 89%  D6S292 6q22.3-23d 55%  
D10S548/D10S197 10p12.3d 57%  D6S264-D6S281 6q26-27 48% IGF2R 
D10S1686 10q22-23.1 85%  D8S277-D8S258 8p21.3-23.1 73% DR5 
D11S4046-D11S1338 11p15.3-15.5 50% p57KIP2, BE2, HTS1/ST5 D9S171 9p21.3-22 79% p16INK4A, p15INK4B, p14              ARF 
D13S171-D13S263 13q12-14 93% RB1, BRCA2 D9S1677-D9S290 9q22.33-34.1d 55% TSC1 
D13S285 13q34              d 62% ING1 D9S1826-D9S158 9q34d 50% TSC1 
D15S128-D15S165 15q11-13d 57%  D10S591-D10S189 10p15d 38%  
D15S117 15q21-22.2d 58%  D10S208/D10S196 10p11-q11d 41%  
D16S3136-D16S520 16q 36% RB2, CDH1 D10S212 10q26.13-terd 36%  
D17S849-D17S921 17p12-ter 93% p53 D11S935-D11S905 11p11-14 35% WT1, KA11 
D18S462-D18S70 18q23 36%  D11S4175 11q14.3-21 45%  
D20S117 20p13d 40%  D12S352-D12S364 12p12.3-ter 45%  
    D13S175 13q11              d 67%  
    D13S171-D13S263 13q12-14 62% RB1, BRCA2 
    D15S128 15q11d 50%  
    D15S1007-D15S994 15q13-21.1d 48%  
    D15S127-D15S120 15q25-terd 36%  
    D16S423-D16S404 16p13.3d 45% TSC2 
    D16S515-D16S520 16q22-ter 45% CDH1 
    D17S849-D17S1852 17p12-13 86% p53 
    D18S478 18q11.2d 59%  
    D18S462-D18S70 18q23 57%  
    D19S209-D19S884 19p13.3 77%  
    D19S220-D19S420 19q12-13.2 50%  
    D19S210 19q13.33-ter 52%  
    D20S186 20p12d 50%  
    D21S1256/D21S1914 21q21 38%  
    D22S420 22q11 40% SNF5/INI1, CLTD 
    D22S423 22q13 59%  
    DXS1060-DXS991 Xp-q21 67%  
    DXS990 Xq22.1              d 67%  
a

Slashes indicate that the minimal region of loss is between the two markers but does not include these markers.

b

Loci with more than 60% loss are underlined.

c

Genes in bold represent those that were definitely shown to be abnormal in lung cancer.

d

Novel sites of frequent allelic loss in lung cancer.

Table 3

HDs

MarkerLocusCell lineCandidate genes
D2S305 2p23.3 H2882 
D3S1300 3p14.2 H1648 FHIT 
D3S1300 3p14.2 HCC95 FHIT 
D8S272 8q24.3 H2122 
D9S161 9p21.1 H2052 p16INK4A, p15INK4B 
D9S171 9p21.3 H2052 p16INK4A, p15INK4B 
D9S171 9p21.3 HCC95 p16INK4A, p15INK4B 
D18S478 18q11.2 H2887 
DXS990 Xq22.1 H1648 
MarkerLocusCell lineCandidate genes
D2S305 2p23.3 H2882 
D3S1300 3p14.2 H1648 FHIT 
D3S1300 3p14.2 HCC95 FHIT 
D8S272 8q24.3 H2122 
D9S161 9p21.1 H2052 p16INK4A, p15INK4B 
D9S171 9p21.3 H2052 p16INK4A, p15INK4B 
D9S171 9p21.3 HCC95 p16INK4A, p15INK4B 
D18S478 18q11.2 H2887 
DXS990 Xq22.1 H1648 
1
Minna J. D. Molecular biology overview Pass H. I. Mitchell J. B. Johnson D. H. Turrisi A. T. eds. .
Lung Cancer: Principles and Practice
,
:
143
-148, Lippincott-Raven Publishers Philadelphia  
1996
.
2
Fong K. M., Sekido Y., Minna J. D. Molecular pathogenesis of lung cancer.
J. Thorac. Cardiovasc. Surg.
,
118
:
1136
-1152,  
1999
.
3
Sekido Y., Fong K. M., Minna J. D. Progress in understanding the molecular pathogenesis of human lung cancer.
Biochim. Biophys. Acta
,
1378
:
F21
-F59,  
1998
.
4
Virmani A. K., Fong K. M., Kodagoda D., McIntire D., Hung J., Tonk V., Minna J. D., Gazdar A. F. Allelotyping demonstrates common and distinct patterns of chromosomal loss in human lung cancer types.
Genes Chromosomes Cancer
,
21
:
308
-319,  
1998
.
5
Wistuba I. I., Behrens C., Virmani A. K., Milchgrub S., Syed S., Lam S., Mackay B., Minna J. D., Gazdar A. F. Allelic losses at chromosome 8p21–23 are early and frequent events in the pathogenesis of lung cancer.
Cancer Res.
,
59
:
1973
-1979,  
1999
.
6
Shivapurkar N., Virmani A. K., Wistuba I. I., Milchgrub S., Mackay B., Minna J. D., Gazdar A. F. Deletions of chromosome 4 at multiple sites are frequent in malignant mesothelioma and small cell lung carcinoma.
Clin. Cancer Res.
,
5
:
17
-23,  
1999
.
7
Wistuba, I. I., Behrens, C., Virmani, A. K., Mele, G., Milchgrub, S., Girard, L., Fondon, J. W., III, Garner, H. R., Mackay, B., Latif, F., Lerman, M. I., Lam, S., Gazdar, A. F., and Minna, J. D. High resolution chromosome 3p allelotyping of human lung cancer and preneoplastic bronchial epithelium reveals multiple, discontinuous sites of 3p allele loss and three regions of frequent breakpoints. Cancer Res., 60: 1949–1960, 2000.
8
Wistuba I. I., Bryant D., Behrens C., Milchgrub S., Virmani A. K., Ashfaq R., Minna J. D., Gazdar A. F. Comparison of features of human lung cancer cell lines and their corresponding tumors.
Clin. Cancer Res.
,
5
:
991
-1000,  
1999
.
9
Dib C., Faure S., Fizames C., Samson D., Drouot N., Vignal A., Millasseau P., Marc S., Hazan J., Seboun E., Lathrop M., Gyapay G., Morissette J., Weissenbach J. A comprehensive genetic map of the human genome based on 5,264 microsatellites.
Nature (Lond.)
,
380
:
152
-154,  
1996
.
10
Eisen M. B., Spellman P. T., Brown P. O., Botstein D. Cluster analysis and display of genome-wide expression patterns.
Proc. Natl. Acad. Sci. USA
,
95
:
14863
-14868,  
1998
.
11
Vogelstein B., Fearon E. R., Kern S. E., Hamilton S. R., Preisinger A. C., Nakamura Y., White R. Allelotype of colorectal carcinomas.
Science (Washington DC)
,
244
:
207
-211,  
1989
.
12
Wang S. S., Virmani A., Gazdar A. F., Minna J. D., Evans G. A. Refined mapping of two regions of loss of heterozygosity on chromosome band 11q23 in lung cancer.
Genes Chromosomes Cancer
,
25
:
154
-159,  
1999
.
13
Kawanishi M., Kohno T., Otsuka T., Adachi J., Sone S., Noguchi M., Hirohashi S., Yokota J. Allelotype and replication error phenotype of small cell lung carcinoma.
Carcinogenesis (Lond.)
,
18
:
2057
-2062,  
1997
.
14
Shiseki M., Kohno T., Adachi J., Okazaki T., Otsuka T., Mizoguchi H., Noguchi M., Hirohashi S., Yokota J. Comparative allelotype of early and advanced stage non-small cell lung carcinomas.
Genes Chromosomes Cancer
,
17
:
71
-77,  
1996
.
15
Yokota J., Wada M., Shimosato Y., Terada M., Sugimura T. Loss of heterozygosity on chromosomes 3, 13, and 17 in small-cell carcinoma and on chromosome 3 in adenocarcinoma of the lung.
Proc. Natl. Acad. Sci. USA
,
84
:
9252
-9256,  
1987
.
16
Kohno T., Yokota J. How many tumor suppressor genes are involved in human lung carcinogenesis?.
Carcinogenesis (Lond.)
,
20
:
1403
-1410,  
1999
.
17
Hagstrom S. A., Dryja T. P. Mitotic recombination map of 13cen-13q14 derived from an investigation of loss of heterozygosity in retinoblastomas.
Proc. Natl. Acad. Sci. USA
,
96
:
2952
-2957,  
1999
.
18
Turker M. S., Gage B. M., Rose J. A., Elroy D., Ponomareva O. N., Stambrook P. J., Tischfield J. A. A novel signature mutation for oxidative damage resembles a mutational pattern found commonly in human cancers.
Cancer Res.
,
59
:
1837
-1839,  
1999
.
19
Hilgers, W., Su, G. H., Groot Koerkamp, B., Tang, D. J., Shekher, M. C., Sugar, A. Y., Yeo, C. J., Hruban, R. H., and Kern, S. E. Novel homozygous deletions of chromosomal band 18q22 in pancreatic adenocarcinoma identified by STS marker scanning. Genes Chromosomes Cancer, 25: 370–375, 1999.
20
Hahn S. A., Schutte M., Hoque A. T., Moskaluk C. A., da Costa L. T., Rozenblum E., Weinstein C. L., Fischer A., Yeo C. J., Hruban R. H., Kern S. E. DPC4, a candidate tumor suppressor gene at human chromosome 18q21.
1. Science (Washington DC)
,
271
:
350
-353,  
1996
.
21
Fearon E. R., Cho K. R., Nigro J. M., Kern S. E., Simons J. W., Ruppert J. M., Hamilton S. R., Preisinger A. C., Thomas G., Kinzler K. W., Vogelstein B. Identification of a chromosome 18q gene that is altered in colorectal cancers.
Science (Washington DC)
,
247
:
49
-56,  
1990
.
22
Lengauer C., Kinzler K. W., Vogelstein B. Genetic instabilities in human cancers.
Nature (Lond.)
,
396
:
643
-649,  
1998
.
23
Sundaresan V., Ganly P., Hasleton P., Rudd R., Sinha G., Bleehen N. M., Rabbitts P. p53 and chromosome 3 abnormalities, characteristic of malignant lung tumours, are detectable in preinvasive lesions of the bronchus.
Oncogene
,
7
:
1989
-1997,  
1992
.
24
Sundaresan V., Heppell-Parton A., Coleman N., Miozzo M., Sozzi G., Ball R., Cary N., Hasleton P., Fowler W., Rabbitts P. Somatic genetic changes in lung cancer and precancerous lesions.
Ann. Oncol.
,
6(Suppl.1)
:
27
-31,  
1995
.
25
Chung G. T., Sundaresan V., Hasleton P., Rudd R., Taylor R., Rabbitts P. H. Sequential molecular genetic changes in lung cancer development.
Oncogene
,
11
:
2591
-259,  
1995
.
26
Hung J., Kishimoto Y., Sugio K., Virmani A., McIntire D. D., Minna J. D., Gazdar A. F. Allele-specific chromosome 3p deletions occur at an early stage in the pathogenesis of lung carcinoma.
J. Am. Med. Assoc.
,
273
:
558
-563,  
1995
.
27
Cooper C. A., Bubb V. J., Smithson N., Carter R. L., Gledhill S., Lamb D., Wyllie A. H., Carey F. A. Loss of heterozygosity at 5q21 in non-small cell lung cancer: a frequent event but without evidence of apc mutation.
J. Pathol.
,
180
:
33
-37,  
1996
.
28
Mok S. C., Chan W. Y., Wong K. K., Cheung K. K., Lau C. C., Ng S. W., Baldini A., Colitti C. V., Rock C. O., Berkowitz R. S. DOC-2, a candidate tumor suppressor gene in human epithelial ovarian cancer.
Oncogene
,
16
:
2381
-2387,  
1998
.
29
Willman C. L., Sever C. E., Pallavicini M. G., Harada H., Tanaka N., Slovak M. L., Yamamoto H., Harada K., Meeker T. C., List A. F., Taniguchi T. Deletion of IRF-1, mapping to chromosome 5q31.
1
,
inhumanleukemiaandpreleukemicmyelodysplasia.Science(WashingtonDC),259
:
968
-971,  
1993
.
30
Kinzler K. W., Nilbert M. C., Vogelstein B., Bryan T. M., Levy D. B., Smith K. J., Preisinger A. C., Hamilton S. R., Hedge P., Markham A., Carlson M., Joslyn G., Groden J., White R., Miki Y., Miyoshi Y., Nishisho I., Nakamura Y. Identification of a gene located at chromosome 5q21 that is mutated in colorectal cancers.
Science (Washington DC)
,
251
:
1366
-1370,  
1991
.