To identify the genetic determinants of colon tumorigenesis, 268 male mice from 33 inbred strains derived from different genealogies were treated with azoxymethane (AOM; 10 mg/kg) once a week for six weeks to induce colon tumors. Tumors were localized exclusively within the distal colon in each of the strains examined. Inbred mouse strains exhibit a large variability in genetic susceptibility to AOM-induced colon tumorigenesis. The mean colon tumor multiplicity ranged from 0 to 38.6 (mean = 6.5 ± 8.6) and tumor volume ranged from 0 to 706.5 mm3 (mean = 87.4 ± 181.9) at 24 weeks after the first dose of AOM. AOM-induced colon tumor phenotypes are highly heritable in inbred mice, and 68.8% and 71.3% of total phenotypic variation in colon tumor multiplicity and tumor volume, respectively, are attributable to strain-dependent genetic background. Using 97,854 single-nucleotide polymorphisms, we carried out a genome-wide association study (GWAS) of AOM-induced colon tumorigenesis and identified a novel susceptibility locus on chromosome 15 (rs32359607, P = 6.31 × 10–6). Subsequent fine mapping confirmed five (Scc3, Scc2, Scc12, Scc8, and Ccs1) of 16 linkage regions previously found to be associated with colon tumor susceptibility. These five loci were refined to less than 1 Mb genomic regions of interest. Major candidates in these loci are Sema5a, Fmn2, Grem2, Fap, Gsg1l, Xpo6, Rabep2, Eif3c, Unc5d, and Gpr65. In particular, the refined Scc3 locus shows high concordance with the human GWAS locus that underlies hereditary mixed polyposis syndrome. These findings increase our understanding of the complex genetics of colon tumorigenesis, and provide important insights into the pathways of colorectal cancer development and might ultimately lead to more effective individually targeted cancer prevention strategies. Mol Cancer Res; 10(1); 66–74. ©2011 AACR.

Colorectal cancer is the second leading cause of cancer-related mortality in the United States. In 2010, there will be an estimated 50,000 deaths associated with this disease (1). Genetic factors play important roles in colorectal cancer development and account for approximately 35% of colorectal cancer risk (2). Highly penetrant germline mutations in APC, SMAD4, BMPR1A, MUTYH, STK11, and DNA mismatch repair genes are estimated to account for approximately 6% of all colorectal cancer cases (3). In addition to these rare variants, much of colorectal cancer inherited susceptibility is likely attributable to multiple low penetrance common variants. Direct evidence for common variants for colorectal cancer is highlighted by recent genome-wide association studies (GWAS). These GWAS on colorectal cancer have identified 10 independent loci that confer risk of colorectal cancer, including those on chromosomes 8q24.21, 11q23, 18q21.1, 8q23.1 15q, 19q13.1, 20q12.3, 14q22.2, 16q22.1, and 10p14 (4–10). However, risk associated with these common variants is modest and only a small proportion of colorectal cancer risk can be explained by currently identified loci. Other genetic factors underlying colorectal cancer remain unidentified and this strongly supports the continued search for novel colorectal cancer susceptibility genes.

Inbred strains of laboratory mice have been valuable in the identification of tumor susceptibility genes because they display a wide range of spontaneous and chemically induced tumor incidence. A number of linkage mapping studies with cross-breeding experiments were carried out for chemically induced colon tumors in mice (11–16). As a result, 16 quantitative trait loci (QTL) responsible for chemically induced colon tumors have been mapped on the mouse genome, implying a very complex picture of inherited susceptibility of colon tumorigenesis exists. However, a major obstacle of identifying QTL genes is the difficulty of resolving these chromosomal regions (10 ∼ 20 cM) into sufficiently small intervals to make positional cloning possible. These colon tumor QTLs span a total of 178 cM that corresponds to 351 Mb, covering 13% of the mouse genome. Recent advances in genomic sequence analysis and single-nucleotide polymorphism (SNP) discovery have provided researchers with the necessary resources to explore a wide range of genetic variation in laboratory inbred mice (17). The use of dense SNP maps in laboratory inbred mice has proven successful in the refinement of previous QTL regions and the identification of new genetic determinants of complex traits (18, 19).

Here, we carried out a GWAS to map novel susceptibility loci and refine previous QTL regions for colon tumorigenesis in inbred mice. A total of 268 male mice from 33 inbred strains were treated with azoxymethane (AOM), an organotropic colon carcinogen, to induce colon tumors. We identified a novel genetic susceptibility locus on mouse chromosome 15 and narrowed 5 of 16 previous linkage regions into less than 1 Mb genomic regions of interest in which candidate genes were identified.

Inbred mouse strains and SNP data

Thirty-three inbred mouse strains were used in our colon cancer study. The chosen inbred strains were derived from different genealogies and include 16 Castle's mice (129S1/SvImJ, 129S4/SvJae, 129 × 1/SvJ, A/J, AKR/J, BALB/cByJ, C3H/HeJ, CBA/J, DBA/1J, DBA/2J, I/LnJ, LP/J, NZB/BlNJ, NZW/LacJ, SEA/GnJ, and SM/J), 4 C57-related strains (C57BL/6J, C57BLKS/J, C57L/J, and C58/J), 4 Swiss mice (FVB/NJ, NON/ShiLtJ, SJL/J, and SWR/J), 2 wild-derived strains (CAST/EiJ and PERA/EiJ), 1 strain derived from colonies from China and Japan (KK/HlJ), and 6 other inbred strains (BTBR T+ tf/J, BUB/BnJ, CE/J, LG/J, PL/J and RIIIS/J). This brought a wide range of variation in colon tumorigenesis between inbred mouse strains. The SNP data were obtained from the Mouse Phenome Database (MPD; http://phenome.jax.org/), which contains 190,903 SNPs on commonly used mouse inbred strains. These SNP data were further filtered by removing SNPs with less than 26 strains typed or without genetic mapping information. To be included, each SNP allele also had to be present in 5 or more inbred strains. The resulting data consisted of 97,854 SNPs, spanning the mouse genome at an average density of approximately 28 kb per SNP.

Colon tumorigenesis assays

The inbred mouse strains were purchased from The Jackson Laboratory. Animals were housed in plastic cages with hardwood bedding and dust covers, in a high-efficiency particulate air (HEPA) filtered, environmentally controlled room (24°C ± 1°C, 12/12-hour light/dark cycle). Animals were given Rodent Lab Chow, #5001 (Purina) and water ad libitum. Animals received administration of AOM (Sigma) with a dose of 10 mg/kg in 0.1 mL of PBS at 5 weeks of age. All mice were injected intraperitoneally with AOM once a week for 6 weeks. Animals were killed 24 weeks after the first dose of AOM by CO2 asphyxiation. Immediately after sacrifice, the colons (proximal and distal) were flushed with ice-cold PBS to remove fecal material, opened longitudinally, and placed flat on a filter paper. In general, 5 to 10 mice per strain were treated and phenotyped. The flushed colons were fixed in Tellyesniczky's solution (20) overnight followed by 70% ethanol. The fixed colons were evaluated by at least 2 investigators under a dissecting microscope to obtain the fixed surface tumor counts (i.e., tumor multiplicity), and individual tumor volume was measured based on the following formula: V = 4/3πr3, where r is radius of tumor. The total tumor volume was obtained by adding the individual tumor volume per mouse. Tumor multiplicity and volume are widely used phenotypes in cancer studies of animal models, which are more informative than the binary phenotype (case vs. disease-free control).

Genome-wide SNP association analysis

To correct for population structure and genetic relatedness among inbred strains, we used a recently developed method, efficient mixed model association (EMMA) approach, to assess association of colon tumor susceptibility with SNPs (21). Specifically, the mixed model in the EMMA method can be represented by:

where y is an n × 1 vector of observed phenotypes (i.e., colon tumor multiplicity and volume), and X is an n × q matrix of fixed effects including mean, SNPs, and other covariate variables. β is a q × 1 vector representing coefficients of the fixed effects. Z is an n × t incidence matrix mapping each observed phenotype to one of t inbred strains. u is the random effect (i.e., strain effects) of the mixed model with Var(u) = |$2{\rm K}\sigma _g^2$| where K is the t × t kinship matrix inferred from genotypes, and e is an n × n matrix of residual effect such that Var(e) = |${\rm I}\sigma _e^2$|⁠. The overall phenotypic variance–covariance matrix can be represented as |${\rm V} = 2{\rm ZKZ^\prime}\sigma _g^2 + {\rm I}\sigma _e^2$|⁠. The kinship matrix K based on genetic similarity was inferred from SNP genotype data.

An R package implementation of the EMMA method is publicly available (http://mouse.cs.ucla.edu/emma/). A 2-sided P value from the EMMA for each SNP was obtained for testing hypothesis of no association between the SNP and colon tumor phenotypes. Prior to the statistical analysis, colon tumor multiplicity and volume were converted into normal data by Box-Cox Transformation.

Two hundred permutations were used to establish a genome-wide threshold (a global P = 0.05) for declaring significant associations in the association analysis, which take into account linkage disequilibrium among SNPs on the genome (22). Specifically, colon tumor phenotypes were randomly reshuffled among subjects while fixing the genotypes. For each permutation, the EMMA approach described above was implemented, and the most significant −log10(P) was recorded. Sorting the maximum −log10(P) from large to small, the 5% quantile of the empirical distribution was taken as the genome-wide threshold (a global P = 0.05 and 0.10). We also used the above simulation procedure to determine a region-wide threshold for each colon susceptibility locus from previous linkage studies. The only difference here is to carry out association tests in that linkage region instead of the whole genome. Briefly, we first identified one logarithm of the odds (LOD) supporting interval for each of the previous linkage regions. Then, we conducted permutation analysis as described above in the identified one LOD supporting interval.

Detection of differential expression in colon tumor models and human colon cancer studies

Mouse cDNA array data were downloaded from the Gene Expression Omnibus database (GSE5261). We analyzed 14 tumor samples from the AOM-induced mouse colon model (23), 9 tumor samples from the ApcMin/+ mouse model (24), and 3 adult normal mouse colon. The expression data were normalized by Lowess intensity–dependent normalization as implemented in GeneSpring 7.2. Then, the samples were referenced to expression levels of normal control colon samples (25). Two-sample t tests were used to detect differential expression between tumor and control samples.

Human colon cancer microarray data were downloaded from the Gene Expression Omnibus database (GSE10950). The microarray hybridization of 24 colon normal and tumor pairs was carried out by the Illumina Gene Expression SentrixBeadChip HumanRef-8_V2, and the expression data were normalized by cubic spline normalization in GeneSpring 7.2. Pair-wise t tests were used to detect differential expression between tumor and normal tissues. Fold changes of gene expression in tumor versus normal tissues were also recorded. Flow charts of the study design and data analysis and fine mapping of previous linkage regions were described in Supplementary Fig. S1.

Colon tumorigenesis in inbred strains

Two hundred sixty-eight male mice from 33 inbred mouse strains were measured for colon tumor multiplicity and volume 24 weeks after injection with AOM. In each of the strains examined, the tumors were localized exclusively within the distal colon and not detected grossly or histologically within the proximal colon. Inbred mouse strains had a large variability in AOM-induced colon tumor multiplicity and volume (Fig. 1). Mean colon tumor multiplicity ranged from 0 to 38.6 (mean ± SD, 6.5 ± 8.6) and tumor volume ranged from 0 to 706.5 mm3 (mean ± SD, 87.4 ± 181.9) at 24 weeks after the first dose of AOM. Several strains show marked sensitivity to AOM-induced colon tumorigenesis and developed more than 10 tumors per mouse with diameter of 2 mm or more including C57L/J, FVB/NJ, BTBR T+ tf/J, A/J, NON/ShiLtJ, SM/J, KK/HlJ, and I/LnJ. In contrast, strains AKR/J, PERA/EiJ, RIIIS/J, DBA/1J, C57BL/6J, and DBA/2J had less than 1 tumor per mouse after treatment with AOM. Tumor multiplicity is highly correlated with tumor volume in colon tumorigenesis in inbred mice (r2 = 0.74), suggesting some degree of common genetic components between these 2 tumor phenotypes. The between-strain variance accounts for 68.8% and 71.3% of total phenotypic variation in colon tumor multiplicity and tumor volume, respectively, implying that most of the variations we observed in colon tumorigenesis are heritable.

Figure 1.

AOM-induced colon tumorigenesis in inbred strains. A, tumor multiplicity. B, tumor volume. A total of 278 mice from 33 inbred mouse strains were measured for tumor multiplicity and volume at 24 weeks after the first doze of AOM.

Figure 1.

AOM-induced colon tumorigenesis in inbred strains. A, tumor multiplicity. B, tumor volume. A total of 278 mice from 33 inbred mouse strains were measured for tumor multiplicity and volume at 24 weeks after the first doze of AOM.

Close modal

Genome-wide association analysis

To identify the genetic basis of colon tumorigenesis in the AOM model, we conducted a genome-wide association analysis on 33 strains of inbred mice with colon tumor multiplicity and volume as the phenotypes (Fig. 2). To correct for population structure and genetic relatedness in inbred mouse strains (18), we used the EMMA approach to assess association of colon tumor susceptibility with SNPs. The distribution of observed P values was similar to the expected distribution, indicating no inflation of test statistics from population structure or any other form of bias (Supplementary Fig. S2). Therefore, any bias due to population structure and genetic relatedness among inbred strains has been largely eliminated by the EMMA approach. Using the permutation analysis, we established genome-wide significance levels of 0.05 and 0.10, which correspond to a point-wise P = 3.39 × 10–6 and 7.29 × 10–6 for the analysis of colon tumor multiplicity and corresponds to P = 2.53 × 10–6 and 6.04 × 10–6 for tumor volume (Fig. 2).

Figure 2.

Results from genome-wide association analysis of AOM-induced colon tumor multiplicity (A) and tumor volume (B) in inbred mice. Scatter plot of P values in −log scale for 97,854 SNPs. The 2 dash lines are genome-wide thresholds of P = 0.05 and 0.10 for colon tumor phenotypes.

Figure 2.

Results from genome-wide association analysis of AOM-induced colon tumor multiplicity (A) and tumor volume (B) in inbred mice. Scatter plot of P values in −log scale for 97,854 SNPs. The 2 dash lines are genome-wide thresholds of P = 0.05 and 0.10 for colon tumor phenotypes.

Close modal

The GWAS identified 20 SNPs (P < 10–4) potentially associated with colon tumor susceptibility (Table 1). In particular, 2 SNPs on chromosome 15, rs32359607, and rs32137981, are strongly associated with colon tumor multiplicity (P = 6.31 × 10–6 and 7.80 × 10–6, respectively). These 2 SNPs achieved genome-wide significance level of 0.10. Candidate genes nearby these 2 SNPs include Tas2r119, Snord123, and Sema5a (Fig. 3). This is a novel susceptibility locus on the proximal mouse chromosome 15 and has not been reported in previous linkage studies (26).

Figure 3.

A genetic locus on chromosome 15 affecting colon tumorigenesis in inbred mice. Blue and red dots represent association results from AOM-induced colon tumor multiplicity and volume, respectively. Genes shown on the upper side of the chromosome (turquoise lines) are transcribed in the − orientation (from right to left), and those on the lower side (pink lines) in the + orientation (from left to right).

Figure 3.

A genetic locus on chromosome 15 affecting colon tumorigenesis in inbred mice. Blue and red dots represent association results from AOM-induced colon tumor multiplicity and volume, respectively. Genes shown on the upper side of the chromosome (turquoise lines) are transcribed in the − orientation (from right to left), and those on the lower side (pink lines) in the + orientation (from left to right).

Close modal
Table 1.

Top SNPs associated with colon tumorigenesis in inbred mice

dbSNPChrPosaAllelesbMafcPd
Tumor multiplicity 
 rs32359607 15 32327123 A/G 0.424 6.31 × 10–6 
 rs32137981 15 32369170 A/G 0.364 7.80 × 10–6 
 rs31836718 77495091 G/A 0.469 2.89 × 10–5 
 rs31119421 133221746 A/G 0.438 4.22 × 10–5 
 rs33125466 17 57041453 C/T 0.400 4.83 × 10–5 
 rs31940065 82034606 G/A 0.424 5.17 × 10–5 
 rs13482505 15 30566973 A/C 0.375 5.29 × 10–5 
 rs3677347 133844466 T/A 0.414 6.48 × 10–5 
 rs30919908 19 21656425 G/C 0.485 6.63 × 10–5 
Tumor volume 
 rs6182695 107061698 G/A 0.267 2.33 × 10–5 
 rs31836718 77495091 G/A 0.469 6.14 × 10–5 
 rs33204842 17 38672775 C/A 0.406 6.81 × 10–5 
 rs32137981 15 32369170 A/G 0.364 7.51 × 10–5 
 rs28127134 107084372 G/A 0.25 8.28 × 10–5 
 rs28127052 107097861 A/G 0.25 8.28 × 10–5 
 rs32014282 107076284 G/T 0.296 8.30 × 10–5 
 rs28127136 107084304 A/G 0.273 8.34 × 10–5 
 rs32359607 15 32327123 A/G 0.424 8.81 × 10–5 
 rs28127240 107055865 C/T 0.242 9.64 × 10–5 
 rs32695065 107066262 C/T 0.242 9.64 × 10–5 
dbSNPChrPosaAllelesbMafcPd
Tumor multiplicity 
 rs32359607 15 32327123 A/G 0.424 6.31 × 10–6 
 rs32137981 15 32369170 A/G 0.364 7.80 × 10–6 
 rs31836718 77495091 G/A 0.469 2.89 × 10–5 
 rs31119421 133221746 A/G 0.438 4.22 × 10–5 
 rs33125466 17 57041453 C/T 0.400 4.83 × 10–5 
 rs31940065 82034606 G/A 0.424 5.17 × 10–5 
 rs13482505 15 30566973 A/C 0.375 5.29 × 10–5 
 rs3677347 133844466 T/A 0.414 6.48 × 10–5 
 rs30919908 19 21656425 G/C 0.485 6.63 × 10–5 
Tumor volume 
 rs6182695 107061698 G/A 0.267 2.33 × 10–5 
 rs31836718 77495091 G/A 0.469 6.14 × 10–5 
 rs33204842 17 38672775 C/A 0.406 6.81 × 10–5 
 rs32137981 15 32369170 A/G 0.364 7.51 × 10–5 
 rs28127134 107084372 G/A 0.25 8.28 × 10–5 
 rs28127052 107097861 A/G 0.25 8.28 × 10–5 
 rs32014282 107076284 G/T 0.296 8.30 × 10–5 
 rs28127136 107084304 A/G 0.273 8.34 × 10–5 
 rs32359607 15 32327123 A/G 0.424 8.81 × 10–5 
 rs28127240 107055865 C/T 0.242 9.64 × 10–5 
 rs32695065 107066262 C/T 0.242 9.64 × 10–5 

Abbreviations: Maf, minor allele frequency; Pos, SNP positions.

aThe SNP positions in base pairs were based on the National Center for Biotechnology Information mouse genome build 37.1.

bAllele in bold is minor allele of SNPs.

cMinor allele frequency in the sample of inbred mice.

dP = 3.39 × 10–6 and 7.29 × 10–6 correspond to 5% and 10% genome-wide thresholds in tumor multiplicity, respectively, and P = 2.53 × 10–6 and 6.04 × 10–6 correspond to 5% and 10% genome-wide thresholds in tumor volume.

Fine mapping of previous linkage regions

A total of 16 QTLs responsible for chemically induced colon tumors have been previously mapped on the genome by linkage analysis of intercross and/or backcross of inbred mouse strains (Supplementary Fig. S3; ref. 26). Here, we systematically investigated genetic association signals in these colon tumor susceptibility loci. We first identified microsatellite markers flanking 1 LOD supporting interval in each susceptibility locus based on previous linkage mapping studies and thus determined genomic locations of these loci. Using permutation analysis, we then established region-wide thresholds for declaring significant associations in each locus (Supplementary Table S1). The region-wide association analysis identified a number of SNPs showing significant associations with colon tumor susceptibility in 5 of 16 susceptibility loci, including susceptibility to colon cancer locus 3 (Scc3), Scc2, Scc12, Scc8, and colon-cancer susceptibility locus 1 (Ccs1; Supplementary Fig. S4). These linkage regions were generally refined into less than 1 Mb genomic regions by association mapping. They are located on chromosomes 1, 2, 7, 8, and 12, respectively.

Due to high linkage disequilibrium within the mouse genome, we also checked candidate genes in 500 kb flanking regions on either side of these susceptibility loci (Fig. 4). Candidate genes in these refined linkage regions were prioritized based on location of the most significant SNPs, mRNA expression in tumors, and their functional relevance in the literature. The most significant association in the Scc3 was located at 176.7 Mb on chromosome 1 (rs32121685, P = 3.47 × 10–4; Table 2). The refined Scc3 locus includes 10 annotated genes, of which Fmn2 and Grem2 are major candidates. The Scc2 locus on chromosome 2 (rs28039498, P = 2.09 × 10–4) covers 7 genes, of which Fap, Ifih1, and Gca are strong candidates. The Scc12 locus was narrowed to approximately 1 Mb on chromosome 7 (rs31119421, P = 4.22 × 10–5), which encompassing 18 genes of which Xpo6 is a strong candidate. Only 2 candidates, Gm3920 and Unc5d, were identified in the Scc8 locus on chromosome 8 (rs33399853, P = 3.33 × 10–4). The Ccs1 locus on chromosome 12 also had significant region-wide associations with AOM-induced colon tumorigenesis (rs29164033, P = 2.62 × 10–4). Eight candidate genes were identified in this locus: Gm2417, Galc, Gpr65, Kcnk10, Spata7, Ptpn21, Zc3h14, Zc3h14, and Eml5.

Figure 4.

Fine mapping of previous linkage regions. Associations achieved region-wide significance in 5 previous linkage regions, including Scc3, Scc2, Scc12, Scc8, and Ccs1. Blue and red dots represent association results from the analysis of colon tumor multiplicity and volume, respectively. The x-axis is physical distance of mouse genome (Mb; National Center for Biotechnology Information mouse build 37.1).

Figure 4.

Fine mapping of previous linkage regions. Associations achieved region-wide significance in 5 previous linkage regions, including Scc3, Scc2, Scc12, Scc8, and Ccs1. Blue and red dots represent association results from the analysis of colon tumor multiplicity and volume, respectively. The x-axis is physical distance of mouse genome (Mb; National Center for Biotechnology Information mouse build 37.1).

Close modal
Table 2.

Candidate genes and SNPs in the refined previous linkage regions

QTLChrRegion (Mb)dbSNPaPCandidates
Scc3 174.9–191.1 rs32121685 3.47 × 10–4 Fmn2, Grem2 
Scc2 34.8–65.3 rs28039498 2.09 × 10–4 Fap, Ifih1, Gca 
Scc12 126.8–149.6 rs31119421 4.22 × 10–5 Gsg1l, Spns1, Rabep2, Sh2b1, Eif3c 
Scc8 7.2–35.7 rs33399853 3.33 × 10–4 Unc5d 
Ccs1 12 80.1–101.3 rs29164033 2.62 × 10–4 Gpr65, Ptpn21 
QTLChrRegion (Mb)dbSNPaPCandidates
Scc3 174.9–191.1 rs32121685 3.47 × 10–4 Fmn2, Grem2 
Scc2 34.8–65.3 rs28039498 2.09 × 10–4 Fap, Ifih1, Gca 
Scc12 126.8–149.6 rs31119421 4.22 × 10–5 Gsg1l, Spns1, Rabep2, Sh2b1, Eif3c 
Scc8 7.2–35.7 rs33399853 3.33 × 10–4 Unc5d 
Ccs1 12 80.1–101.3 rs29164033 2.62 × 10–4 Gpr65, Ptpn21 

aThe most significant SNP in the linkage region was presented.

Gene expression of candidates in the identified loci

To identify whether candidate genes were significantly activated or repressed in colon tumorigenesis, we analyzed gene expression profiles of colon tumors from ApcMin/+ (24) and AOM-induced (23) mouse colon tumor models compared with normal colon. Ten candidate genes had significant differential expression in both models of colon tumorigenesis compared with normal colon controls (P < 0.05; Supplementary Table S2). Sema5a, Grem2, and Rabep2 were the most significantly decreased transcripts in both ApcMin/+ and AOM mouse models; whereas Eif3c and Tufm were the most increased transcripts (Fig. 5). When comparing human colon tumors with matched normal tissues, GREM2, SEMA5A and TUFM were altered in the consistent trends with the above mouse models (Supplementary Table S2).

Figure 5.

Expression of Sema5a, Grem2, Rabep2, and Eif3c in colon tumors. All the samples were referenced to expression levels of normal control colon samples.

Figure 5.

Expression of Sema5a, Grem2, Rabep2, and Eif3c in colon tumors. All the samples were referenced to expression levels of normal control colon samples.

Close modal

In this study, we carried out large-scale phenotyping to systematically evaluate the sensitivity of 33 inbred mouse strains to AOM-induced colon carcinogenesis (Fig. 1). The strains used in the study were derived from genetically diverse genealogies and most of them are on the priority list of the publicly accessible MPD (http://phenome.jax.org/pub-cgi/phenome/mpdcgi). Many of these strains have not been characterized previously for AOM colon carcinogenesis. The relative AOM sensitivities for a large set of inbred mouse strains established in the study will facilitate future studies of colon tumorigenesis. AOM is an organotropic colon carcinogen that is commonly used to induce colon tumors in rodents (27). In inbred mice, the high frequency of AOM-induced tumors was observed exclusively within the distal colon; the majority of mouse colon tumors arise as aberrant crypt foci, progress to adenomas, and ultimately result in adenocarcinomas, paralleling the progression of colorectal cancer in humans (27). Notably, a recent study showed that APC protein is aberrant in AOM-induced mouse colon adenomas and carcinomas (28). These support the use of mouse models for studying the genetics and pathogenesis of colon cancer. In general, AOM-induced colon cancer in rodents can recapitulate in a highly reliable way the phases of initiation and progression of tumor that occur in humans. However, it should be noted that p53 mutations are rarely observed and the tendency to metastasize is low in AOM-induced colon cancer mouse model (29). Other mouse models such as genetically modified animals are also very useful for studying diverse human colorectal cancer

Using these AOM-induced colon tumor data, we conducted a genome-wide association analysis in 268 mice from 33 inbred strains, and subsequently fine mapping of 16 previous linkage regions with the association data. The GWAS identified a novel susceptibility locus on chromosome 15 in which Sema5a is a major candidate (Fig. 3). Sema5a belongs to a large family of proteins involved in the patterning of both the vascular and the nervous systems. Interestingly, Sema5a expression was significantly repressed in both ApcMin/+ and AOM-induced colon tumors (Fig. 5). Fine mapping association analysis narrowed 5 of 16 QTLs for colon tumorigenesis into less than 1 Mb genomic regions of interest. This makes positional identification of candidate genes in these susceptibility loci more feasible (Fig. 4, Table 2).

The Scc3 locus on mouse chromosome 1 was initially mapped in the linkage analysis of 192 (BALB/c × CcS-19) F2 mice (16) and was further narrowed to less than 500 kb by our association mapping. The most significant SNP in the refined Scc3, rs32121685, is located near 2 genes, Fmn2 and Grem2. Grem2 mRNA expression was significantly repressed in both ApcMin/+ and AOM-induced colon tumors, although we did not observe significant changes in Fmn2 expression. A recent GWAS identified common genetic variants at CRAC1 (HMPS) locus on human chromosome 15q13.3 that confers colorectal cancer risk in the Ashkenazi population (9). The CRAC1 locus was initially characterized in the classical linkage analysis of families with hereditary mixed polyposis syndrome (HMPS). The HMPS is a Mendelian condition characterized by multiple colorectal polyps and colorectal cancer (30). Interestingly, 2 candidate genes near the strongest SNP association were FMN1 and GREM1. Syntenic regions containing FMN1/GREM1 and FMN2/GREM2 are highly conserved between mice and humans. This has been suggested to arise from an ancient gene duplication event (Supplementary Fig. S5; ref. 31). Genes around the Scc3 locus in mice are present in the same order and orientation in humans. These data suggest FMN1/GREM1 and FMN2/GREM2 may be lineage-specific susceptibility genes for colon cancer.

Three major candidates were identified in the refined Scc2 locus: Fap, Ifih1, and Gca. Fap encodes a homodimeric integral membrane gelatinase belonging to the serine protease family. It is selectively expressed in reactive stromal fibroblasts of epithelial cancers, granulation tissue of healing wounds, and malignant cells of bone and soft tissue sarcomas (32, 33). This gene is involved in the control of fibroblast growth or epithelial–mesenchymal interactions during development, tissue repair, and epithelial carcinogenesis. Abrogation of Fap enzymatic activity attenuates tumor growth in HEK293 cells (34). Increased expression of Fap is associated with lymph node metastasis in colorectal, esophageal, ovarian, and pancreatic cancers (35–38).

The refined Scc12 locus is a gene-rich region in which Gsg1l, Spns1, Rabep2, and Sh2b1 are interesting candidates. Of them, Spns1 play roles in programmed cell death in Drosophila melanogaster and has orthologs in nematode, mouse, and human (39). Rabep2 is a RAB GTPase binding effector protein which encodes a member of AP-1 family of transcription factors involved in cell proliferation, differentiation, apoptosis, and other biological processes. Rebep2 was upregulated in hyperplastic and neoplastic breast disorders (40). Sh2b1 mediates activation of various kinases and may function in cytokine and growth factor receptor signaling and cellular. A recent study found that SH2-B β specifically activates JAK2 and functions as an adapter protein that cross-links actin filaments, leading to modulation of cellular responses in response to JAK2 activation (41, 42). Another study shows an essential role of SH2-B β in the activation of the Src kinase and the resulting mitogenic response, causing phenotypic cell transformation involving the Src substrate STAT3 (43). The refined Scc8 locus only contained one predicted gene Gm3920 and one known gene Unc5d (i.e., Unc5h4). UNC5H4 is a netrin-1 receptor UNC5H family member and is a direct transcriptional target of p53 that is induced during DNA damage–mediated apoptosis (44).

Gpr65, Ptpn21, and Eml5 are major candidates in the refined Ccs1 locus. Gpr65 encodes a proapoptotic G protein–coupled receptor that promotes glucocorticoid-induced apoptosis. Activation of Gpr65 by its agonist psychosine markedly enhanced dexamethasone-induced apoptosis in a Gpr65-dependent manner (45). Upregulation of Gpr65 in human tumors is involved in driving or maintaining tumor formation (46). Ptpn21 is a member of the protein tyrosine phosphatase (PTP) family that is known to regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. Frameshift mutations in coding repeats of PTP genes were frequently observed in colorectal tumors with microsatellite instability (47).

In summary, we identified a novel susceptibility locus on mouse chromosome 15 for AOM-induced colon tumorigenesis through a GWAS in inbred mice. Subsequent fine mapping analysis further identified 5 of 16 previous linkage regions to be associated with colon tumor susceptibility. These susceptibility loci were narrowed to less than 1 Mb regions of interest in which candidate genes were identified. These findings will provide important insights into the pathways of colorectal cancer development and may ultimately lead to more effective individually targeted cancer prevention strategies.

No potential conflicts of interest were disclosed.

The authors thank the Broad Institute of Harvard and MIT, the Wellcome Trust Center for Human Genetics, Perlegen Sciences, The Jackson Laboratory for releasing/collecting inbred laboratory mouse SNP data, and Nancy Saccone and Weimin Duan for their assistance in the data analysis.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Jemal
A
,
Siegel
R
,
Ward
E
,
Hao
Y
,
Xu
J
,
Thun
MJ
. 
Cancer statistics, 2009
.
CA Cancer J Clin
2009
;
59
:
225
49
.
2.
Lichtenstein
P
,
Holm
NV
,
Verkasalo
PK
,
Iliadou
A
,
Kaprio
J
,
Koskenvuo
M
, et al
Environmental and heritable factors in the causation of cancer–analyses of cohorts of twins from Sweden, Denmark, and Finland
.
N Engl J Med
2000
;
343
:
78
85
.
3.
Aaltonen
L
,
Johns
L
,
Jarvinen
H
,
Mecklin
JP
,
Houlston
R
. 
Explaining the familial colorectal cancer risk associated with mismatch repair (MMR)-deficient and MMR-stable tumors
.
Clin Cancer Res
2007
;
13
:
356
61
.
4.
Broderick
P
,
Carvajal-Carmona
L
,
Pittman
AM
,
Webb
E
,
Howarth
K
,
Rowan
A
, et al
A genome-wide association study shows that common alleles of SMAD7 influence colorectal cancer risk
.
Nat Genet
2007
;
39
:
1315
7
.
5.
Houlston
RS
,
Webb
E
,
Broderick
P
,
Pittman
AM
,
Di Bernardo
MC
,
Lubbe
S
, et al
Meta-analysis of genome-wide association data identifies four new susceptibility loci for colorectal cancer
.
Nat Genet
2008
;
40
:
1426
35
.
6.
Tenesa
A
,
Farrington
SM
,
Prendergast
JG
,
Porteous
ME
,
Walker
M
,
Haq
N
, et al
Genome-wide association scan identifies a colorectal cancer susceptibility locus on 11q23 and replicates risk loci at 8q24 and 18q21
.
Nat Genet
2008
;
40
:
631
7
.
7.
Tomlinson
I
,
Webb
E
,
Carvajal-Carmona
L
,
Broderick
P
,
Kemp
Z
,
Spain
S
, et al
A genome-wide association scan of tag SNPs identifies a susceptibility variant for colorectal cancer at 8q24.21
.
Nat Genet
2007
;
39
:
984
8
.
8.
Tomlinson
IP
,
Webb
E
,
Carvajal-Carmona
L
,
Broderick
P
,
Howarth
K
,
Pittman
AM
, et al
A genome-wide association study identifies colorectal cancer susceptibility loci on chromosomes 10p14 and 8q23.3
.
Nat Genet
2008
;
40
:
623
30
.
9.
Jaeger
E
,
Webb
E
,
Howarth
K
,
Carvajal-Carmona
L
,
Rowan
A
,
Broderick
P
, et al
Common genetic variants at the CRAC1 (HMPS) locus on chromosome 15q13.3 influence colorectal cancer risk
.
Nat Genet
2008
;
40
:
26
8
.
10.
Zanke
BW
,
Greenwood
CM
,
Rangrej
J
,
Kustra
R
,
Tenesa
A
,
Farrington
SM
, et al
Genome-wide association scan identifies a colorectal cancer susceptibility locus on chromosome 8q24
.
Nat Genet
2007
;
39
:
989
94
.
11.
Jacoby
RF
,
Hohman
C
,
Marshall
DJ
,
Frick
TJ
,
Schlack
S
,
Broda
M
, et al
Genetic analysis of colon cancer susceptibility in mice
.
Genomics
1994
;
22
:
381
7
.
12.
Moen
CJ
,
Groot
PC
,
Hart
AA
,
Snoek
M
,
Demant
P
. 
Fine mapping of colon tumor susceptibility (Scc) genes in the mouse, different from the genes known to be somatically mutated in colon cancer
.
Proc Natl Acad Sci U S A
1996
;
93
:
1082
6
.
13.
Moen
CJ
,
Snoek
M
,
Hart
AA
,
Demant
P
. 
Scc-1, a novel colon cancer susceptibility gene in the mouse: linkage to CD44 (Ly-24, Pgp-1) on chromosome 2
.
Oncogene
1992
;
7
:
563
6
.
14.
Ruivenkamp
CA
,
van Wezel
T
,
Zanon
C
,
Stassen
AP
,
Vlcek
C
,
Csikos
T
, et al
Ptprj is a candidate for the mouse colon-cancer susceptibility locus Scc1 and is frequently deleted in human cancers
.
Nat Genet
2002
;
31
:
295
300
.
15.
van Wezel
T
,
Ruivenkamp
CA
,
Stassen
AP
,
Moen
CJ
,
Demant
P
. 
Four new colon cancer susceptibility loci, Scc6 to Scc9 in the mouse
.
Cancer Res
1999
;
59
:
4216
8
.
16.
van Wezel
T
,
Stassen
AP
,
Moen
CJ
,
Hart
AA
,
van der Valk
MA
,
Demant
P
. 
Gene interaction and single gene effects in colon tumour susceptibility in mice
.
Nat Genet
1996
;
14
:
468
70
.
17.
Wade
CM
,
Daly
MJ
. 
Genetic variation in laboratory mice
.
Nat Genet
2005
;
37
:
1175
80
.
18.
Liu
P
,
Vikis
H
,
Lu
Y
,
Wang
D
,
You
M
. 
Large-scale in silico mapping of complex quantitative traits in inbred mice
.
PLoS One
2007
;
2
:
e651
.
19.
Liu
P
,
Wang
Y
,
Vikis
H
,
Maciag
A
,
Wang
D
,
Lu
Y
, et al
Candidate lung tumor susceptibility genes identified through whole-genome association analyses in inbred mice
.
Nat Genet
2006
;
38
:
888
95
.
20.
Zhang
Z
,
Wang
Y
,
Yao
R
,
Li
J
,
Yan
Y
,
La Regina
M
, et al
Cancer chemopreventive activity of a mixture of Chinese herbs (antitumor B) in mouse lung tumor models
.
Oncogene
2004
;
23
:
3841
50
.
21.
Kang
HM
,
Zaitlen
NA
,
Wade
CM
,
Kirby
A
,
Heckerman
D
,
Daly
MJ
, et al
Efficient control of population structure in model organism association mapping
.
Genetics
2008
;
178
:
1709
23
.
22.
Churchill
GA
,
Doerge
RW
. 
Empirical threshold values for quantitative trait mapping
.
Genetics
1994
;
138
:
963
71
.
23.
Bissahoyo
A
,
Pearsall
RS
,
Hanlon
K
,
Amann
V
,
Hicks
D
,
Godfrey
VL
, et al
Azoxymethane is a genetic background-dependent colorectal tumor initiator and promoter in mice: effects of dose, route, and diet
.
Toxicol Sci
2005
;
88
:
340
5
.
24.
Haigis
KM
,
Hoff
PD
,
White
A
,
Shoemaker
AR
,
Halberg
RB
,
Dove
WF
. 
Tumor regionality in the mouse intestine reflects the mechanism of loss of Apc function
.
Proc Natl Acad Sci U S A
2004
;
101
:
9769
73
.
25.
Kaiser
S
,
Park
YK
,
Franklin
JL
,
Halberg
RB
,
Yu
M
,
Jessen
WJ
, et al
Transcriptional recapitulation and subversion of embryonic colon development by mouse colon tumor models and human colon cancer
.
Genome Biol
2007
;
8
:
R131
.
26.
Demant
P
. 
Cancer susceptibility in the mouse: genetics, biology and implications for human cancer
.
Nat Rev Genet
2003
;
4
:
721
34
.
27.
Nambiar
PR
,
Girnun
G
,
Lillo
NA
,
Guda
K
,
Whiteley
HE
,
Rosenberg
DW
. 
Preliminary analysis of azoxymethane induced colon tumors in inbred mice commonly used as transgenic/knockout progenitors
.
Int J Oncol
2003
;
22
:
145
50
.
28.
Maltzman
T
,
Whittington
J
,
Driggers
L
,
Stephens
J
,
Ahnen
D
. 
AOM-induced mouse colon tumors do not express full-length APC protein
.
Carcinogenesis
1997
;
18
:
2435
9
.
29.
Kobaek-Larsen
M
,
Thorup
I
,
Diederichsen
A
,
Fenger
C
,
Hoitinga
MR
. 
Review of colorectal cancer and its metastases in rodent models: comparative aspects with those in humans
Comp Med
2000
;
50
:
16
26
.
30.
Jaeger
EE
,
Woodford-Richens
KL
,
Lockett
M
,
Rowan
AJ
,
Sawyer
EJ
,
Heinimann
K
, et al
An ancestral Ashkenazi haplotype at the HMPS/CRAC1 locus on 15q13-q14 is associated with hereditary mixed polyposis syndrome
.
Am J Hum Genet
2003
;
72
:
1261
7
.
31.
Waterston
RH
,
Lindblad-Toh
K
,
Birney
E
,
Rogers
J
,
Abril
JF
,
Agarwal
P
, et al
Initial sequencing and comparative analysis of the mouse genome
.
Nature
2002
;
420
:
520
62
.
32.
Abbas
O
,
Richards
JE
,
Mahalingam
M
. 
Fibroblast-activation protein: a single marker that confidently differentiates morpheaform/infiltrative basal cell carcinoma from desmoplastic trichoepithelioma
.
Mod Pathol
2010
;
23
:
1535
43
.
33.
Scanlan
MJ
,
Raj
BK
,
Calvo
B
,
Garin-Chesa
P
,
Sanz-Moncasi
MP
,
Healey
JH
, et al
Molecular cloning of fibroblast activation protein alpha, a member of the serine protease family selectively expressed in stromal fibroblasts of epithelial cancers
.
Proc Natl Acad Sci U S A
1994
;
91
:
5657
61
.
34.
Cheng
JD
,
Valianou
M
,
Canutescu
AA
,
Jaffe
EK
,
Lee
HO
,
Wang
H
, et al
Abrogation of fibroblast activation protein enzymatic activity attenuates tumor growth
.
Mol Cancer Ther
2005
;
4
:
351
60
.
35.
Goscinski
MA
,
Suo
Z
,
Florenes
VA
,
Vlatkovic
L
,
Nesland
JM
,
Giercksky
KE
. 
FAP-alpha and uPA show different expression patterns in premalignant and malignant esophageal lesions
.
Ultrastruct Pathol
2008
;
32
:
89
96
.
36.
Cohen
SJ
,
Alpaugh
RK
,
Palazzo
I
,
Meropol
NJ
,
Rogatko
A
,
Xu
Z
, et al
Fibroblast activation protein and its relationship to clinical outcome in pancreatic adenocarcinoma
.
Pancreas
2008
;
37
:
154
8
.
37.
Iwasa
S
,
Jin
X
,
Okada
K
,
Mitsumata
M
,
Ooi
A
. 
Increased expression of seprase, a membrane-type serine protease, is associated with lymph node metastasis in human colorectal cancer
.
Cancer Lett
2003
;
199
:
91
8
.
38.
Chen
H
,
Yang
WW
,
Wen
QT
,
Xu
L
,
Chen
M
. 
TGF-beta induces fibroblast activation protein expression; fibroblast activation protein expression increases the proliferation, adhesion, and migration of HO-8910PM [corrected]
.
Exp Mol Pathol
2009
;
87
:
189
94
.
39.
Nakano
Y
,
Fujitani
K
,
Kurihara
J
,
Ragan
J
,
Usui-Aoki
K
,
Shimoda
L
, et al
Mutations in the novel membrane protein spinster interfere with programmed cell death and cause neural degeneration in Drosophila melanogaster
.
Mol Cell Biol
2001
;
21
:
3775
88
.
40.
Chiappetta
G
,
Ferraro
A
,
Botti
G
,
Monaco
M
,
Pasquinelli
R
,
Vuttariello
E
, et al
FRA-1 protein overexpression is a feature of hyperplastic and neoplastic breast disorders
.
BMC Cancer
2007
;
7
:
17
.
41.
O'Brien
KB
,
O'Shea
JJ
,
Carter-Su
C
. 
SH2-B family members differentially regulate JAK family tyrosine kinases
.
J Biol Chem
2002
;
277
:
8673
81
.
42.
Rider
L
,
Tao
J
,
Snyder
S
,
Brinley
B
,
Lu
J
,
Diakonova
M
. 
Adapter protein SH2B1beta cross-links actin filaments and regulates actin cytoskeleton
.
Mol Endocrinol
2009
;
23
:
1065
76
.
43.
Zhang
M
,
Deng
Y
,
Riedel
H
. 
PSM/SH2B1 splice variants: critical role in src catalytic activation and the resulting STAT3s-mediated mitogenic response
.
J Cell Biochem
2008
;
104
:
105
18
.
44.
Wang
H
,
Ozaki
T
,
Shamim
Hossain M
,
Nakamura
Y
,
Kamijo
T
,
Xue
X
, et al
A newly identified dependence receptor UNC5H4 is induced during DNA damage-mediated apoptosis and transcriptional target of tumor suppressor p53
.
Biochem Biophys Res Commun
2008
;
370
:
594
8
.
45.
Malone
MH
,
Wang
Z
,
Distelhorst
CW
. 
The glucocorticoid-induced gene tdag8 encodes a pro-apoptotic G protein-coupled receptor whose activation promotes glucocorticoid-induced apoptosis
.
J Biol Chem
2004
;
279
:
52850
9
.
46.
Sin
WC
,
Zhang
Y
,
Zhong
W
,
Adhikarakunnathu
S
,
Powers
S
,
Hoey
T
, et al
G protein-coupled receptors GPR4 and TDAG8 are oncogenic and overexpressed in human cancers
.
Oncogene
2004
;
23
:
6299
303
.
47.
Korff
S
,
Woerner
SM
,
Yuan
YP
,
Bork
P
,
von Knebel Doeberitz
M
,
Gebert
J
. 
Frameshift mutations in coding repeats of protein tyrosine phosphatase genes in colorectal tumors with microsatellite instability
.
BMC Cancer
2008
;
8
:
329
.