Thirty-five percent of colorectal cancer (CRC) susceptibility is thought to be attributable to genetics, but only a small proportion of the cases (<6%) can be explained by highly penetrant mutations. The rest of the susceptibility could be explained by a number of low-penetrance variants following a polygenic model of inheritance. Genetic modeling in rodents has been a successful tool for the unraveling of the genetic basis of diseases. The investigation of mouse quantitative trait loci led to the discovery of 15 “susceptibility to colorectal cancer” (Scc) loci. Thus, we aimed to analyze the human-mouse syntenic regions defined by these Scc loci and select human candidate genes within. Twenty-one genes were chosen and their single-nucleotide polymorphisms were tested as possible low-penetrance variants predisposing to CRC risk. Our most strongly associated single-nucleotide polymorphism, rs954353, seems to be in the 5′ region of the CYR61 gene, which could implicate it in terms of the cis-regulation of the gene. CYR61 has been proposed as a connection point among signaling pathways and a probable marker for early CRC detection. However, we could not replicate the association. Despite our negative results, we believe that our candidate gene selection strategy could be quite useful in the future determination of variants predisposing to disease. Cancer Epidemiol Biomarkers Prev; 19(2); 619–23

Colorectal cancer (CRC) is the second most frequent neoplasm and one of the most important morbidity causes in the developed world (1). Despite the fact that 35% of CRC susceptibility could be attributable to genetics, only a small proportion of the cases (<6%) can be explained by highly penetrant mutations, suggesting that the rest of the susceptibility should exist in the form of low-penetrance variants following a polygenic model of inheritance (2).

Genetic modeling in rodents has been proved to be an important tool in the unraveling of the genetic basis of diseases. The investigation of mouse quantitative trait loci (QTL) to identify chromosomal regions harboring genetic variants that affect susceptibility successfully led to the discovery of 15 “susceptibility to colorectal cancer” (Scc) loci (3, 4). Because there is increasing evidence that causal genes underlying disease QTLs are conserved between rodents and humans (5), a sensible approach to identify these genes would be to map them in mice and, subsequently, investigate the role of their human homologues.

Hence, our aim is to analyze the human-mouse syntenic regions defined by these Scc loci and select human candidate genes to screen their single-nucleotide polymorphisms (SNP) and test them as possible low-penetrance variants predisposing to CRC risk in a two-stage case-control study.

Study Populations

Subjects on stage I were 515 CRC cases and 515 controls from EPICOLON I, a prospective, multicenter, population-based epidemiology study (6). Subjects on stage II (933 cases and 955 controls) belonged to EPICOLON II, an extension of EPICOLON I. Cases and controls were matched for sex and age. All samples were obtained with informed consent reviewed by the ethical board of the corresponding hospital.

Candidate Gene Selection

QTLs were defined by their flanking markers by revision of the author's data and the MGI (7). Genes within each human-mouse syntenic region showing enriched expression in primary affected tissues in mice were selected with ExQuest (8). Finally, 21 human genes were chosen from the 15 Scc (Table 1; ref. 9).

Table 1.

Description of the 15 Scc loci and the selected genes within the human-mouse QTL syntenic regions

QTLMouse chrHuman geneHuman mappingGene descriptionGene ontologySNPs analyzed
Scc1 PTPRJ 11p11.2 Protein tyrosine-phosphatase receptor type J Regulation of cellular growth, differentiation and oncogenic transformation rs10742827; rs100838801; rs10838810; rs11039519; rs1503185; rs1566734; rs2270992; rs2270993; rs4752904; rs7117386; rs7123436; rs7947811 
Scc2 CRB2 9q33.2 Crumbs homolog 2 Polarized cell morphogenesis rs10818812; rs1105223; rs1891632; rs1891638; rs33984675; rs4838051; rs7033144; rs884320 
Scc3 TGFB2 1q41 Transforming growth factor β2 Suppressive effects on interleukin-2–dependent T-cell growth rs10863396; rs1539399; rs17558745; rs1890994; rs1891467; rs2000220; rs2796821; rs4846476; rs4846479 
Scc4 17 PRKD3 2p22-p21 Protein kinase D3 Receptor of phorbol esters: a class of tumor promoters rs10177176; rs10460527; rs1056021; rs11124575; rs11887618; rs2300880; rs2300771; rs2300894; rs2302650; rs3770761 
MSH2 2p21 MutS homolog 2 DNA mismatch repair rs13019654; rs17036614; rs458314; rs7607076 
Scc5 18 TNFAIP8 5q23.1 Tumor necrosis factor α–induced protein 8 Negative mediator of apoptosis with a role in tumor progression rs10077888; rs1045241; rs1045242; rs11064; rs17385413; rs3203922; rs32658; rs3797339; rs3797345 
Scc6 11 EGFR 7p12 Epidermal growth factor receptor Cell growth and differentiation control rs1015793; rs1050171; rs1140475; rs11487218; rs11971997; rs12538489; rs12671550; rs17172446; rs17290169; rs17337023; rs2072454; rs2293347; rs3800827; rs4947492; rs4947971; rs6593205; rs6972246; rs759170; rs759171; rs7796139; rs7809394; rs88425 
Scc7 CYR61 1p31-p32 Cysteine-rich 61 Promotes cell proliferation, chemotaxis, angiogenesis, and cell adhesion rs12086058; rs12239954; rs1576424; rs3753793; rs721471; rs954353; rs9658584 
Scc8 TFDP1 13q34 Transcription factor Dp-1 Regulation of the expresion of cellular promoters rs2316121; rs6577058; rs9577286 
CDC16 13q34 Cell division cycle 16 homolog Ubiquitin ligase with role in cell cycle control rs3211416; rs7318644; rs7994151; rs8002514; rs9590408; rs9590409 
Scc9 10 MDM2 12q14.3-q15 Transformed 3T3 cell double minute 2 p53 inhibitor rs1470383; rs1795481; rs769412 
LGR5 12q22-q23 Leucine-rich repeat–containing G-protein–coupled receptor 5 Overexpressed in human colon tumors rs10748178; rs10784923; rs11178798; rs11178832; rs11178845; rs1148985; rs12422259; rs12829521; rs17109799; rs17109924; rs17109926; rs1880892; rs3803033; rs389150; rs3923863; rs7298504; rs941197 
Scc11 HEYL 1p34.3 Hairy/enhancer-of-split related with YRPW motif-like Downstream effector of Notch signaling that networks together with Wnt rs1180320; rs4660892; rs784622 
MYCL1 1p34.2 V-myc myelocytomatosis viral oncogene homolog 1 Loss of heterozygosity at MYCL1 is a marker for poor prognosis in CRC rs3117088; rs3134614; rs3134615 
Scc12 DMBT1 10q25.3-26 Deleted in malignant brain tumors 1 Role in the interaction of tumor cells and the immune system rs1051715; rs2981783; rs3013236 
Scc13 TRAF2 9q34 TNF receptor–associated factor 2 Regulates TNF-induced apoptosis rs10870140; rs2784078; rs2784075; rs908831 
Scc14 10 LATS1 6q24-q25.1 Large tumor suppressor homolog 1 (DrosophilaMaintenance of ploidy and tumor supressor activity through regulation of p53 rs3798761; rs3924871 
VIP 6q25 Vasoactive intestinal peptide Proangiogenic factor rs12212849; rs3823082; rs637572; rs671330; rs680314; rs688136 
Scc15 11 LLGL1 17p11.2 Lethal giant larvae homolog 1 (DrosophilaReduced expression related to progression of colon cancer; similar to a tumor supressor in Drosophila rs11869582; rs2245430; rs2245737; rs2290505; rs2746027; rs8821 
Ccs1 12 FOS 14q24.3 v-fos FBJ murine osteosarcoma viral oncogene homolog Signal transduction protein implicated in cell proliferation and differentiation rs1046117; rs1569328; rs3742769; rs7101 
JDP2 14q24.3 Jun dimerization protein 2 Mediator in UV-induced apoptosis, cell differentiation, tumorigenesis, and angiogenesis rs10057; rs10873278; rs1474503; rs175644; rs4899566; rs84044 
QTLMouse chrHuman geneHuman mappingGene descriptionGene ontologySNPs analyzed
Scc1 PTPRJ 11p11.2 Protein tyrosine-phosphatase receptor type J Regulation of cellular growth, differentiation and oncogenic transformation rs10742827; rs100838801; rs10838810; rs11039519; rs1503185; rs1566734; rs2270992; rs2270993; rs4752904; rs7117386; rs7123436; rs7947811 
Scc2 CRB2 9q33.2 Crumbs homolog 2 Polarized cell morphogenesis rs10818812; rs1105223; rs1891632; rs1891638; rs33984675; rs4838051; rs7033144; rs884320 
Scc3 TGFB2 1q41 Transforming growth factor β2 Suppressive effects on interleukin-2–dependent T-cell growth rs10863396; rs1539399; rs17558745; rs1890994; rs1891467; rs2000220; rs2796821; rs4846476; rs4846479 
Scc4 17 PRKD3 2p22-p21 Protein kinase D3 Receptor of phorbol esters: a class of tumor promoters rs10177176; rs10460527; rs1056021; rs11124575; rs11887618; rs2300880; rs2300771; rs2300894; rs2302650; rs3770761 
MSH2 2p21 MutS homolog 2 DNA mismatch repair rs13019654; rs17036614; rs458314; rs7607076 
Scc5 18 TNFAIP8 5q23.1 Tumor necrosis factor α–induced protein 8 Negative mediator of apoptosis with a role in tumor progression rs10077888; rs1045241; rs1045242; rs11064; rs17385413; rs3203922; rs32658; rs3797339; rs3797345 
Scc6 11 EGFR 7p12 Epidermal growth factor receptor Cell growth and differentiation control rs1015793; rs1050171; rs1140475; rs11487218; rs11971997; rs12538489; rs12671550; rs17172446; rs17290169; rs17337023; rs2072454; rs2293347; rs3800827; rs4947492; rs4947971; rs6593205; rs6972246; rs759170; rs759171; rs7796139; rs7809394; rs88425 
Scc7 CYR61 1p31-p32 Cysteine-rich 61 Promotes cell proliferation, chemotaxis, angiogenesis, and cell adhesion rs12086058; rs12239954; rs1576424; rs3753793; rs721471; rs954353; rs9658584 
Scc8 TFDP1 13q34 Transcription factor Dp-1 Regulation of the expresion of cellular promoters rs2316121; rs6577058; rs9577286 
CDC16 13q34 Cell division cycle 16 homolog Ubiquitin ligase with role in cell cycle control rs3211416; rs7318644; rs7994151; rs8002514; rs9590408; rs9590409 
Scc9 10 MDM2 12q14.3-q15 Transformed 3T3 cell double minute 2 p53 inhibitor rs1470383; rs1795481; rs769412 
LGR5 12q22-q23 Leucine-rich repeat–containing G-protein–coupled receptor 5 Overexpressed in human colon tumors rs10748178; rs10784923; rs11178798; rs11178832; rs11178845; rs1148985; rs12422259; rs12829521; rs17109799; rs17109924; rs17109926; rs1880892; rs3803033; rs389150; rs3923863; rs7298504; rs941197 
Scc11 HEYL 1p34.3 Hairy/enhancer-of-split related with YRPW motif-like Downstream effector of Notch signaling that networks together with Wnt rs1180320; rs4660892; rs784622 
MYCL1 1p34.2 V-myc myelocytomatosis viral oncogene homolog 1 Loss of heterozygosity at MYCL1 is a marker for poor prognosis in CRC rs3117088; rs3134614; rs3134615 
Scc12 DMBT1 10q25.3-26 Deleted in malignant brain tumors 1 Role in the interaction of tumor cells and the immune system rs1051715; rs2981783; rs3013236 
Scc13 TRAF2 9q34 TNF receptor–associated factor 2 Regulates TNF-induced apoptosis rs10870140; rs2784078; rs2784075; rs908831 
Scc14 10 LATS1 6q24-q25.1 Large tumor suppressor homolog 1 (DrosophilaMaintenance of ploidy and tumor supressor activity through regulation of p53 rs3798761; rs3924871 
VIP 6q25 Vasoactive intestinal peptide Proangiogenic factor rs12212849; rs3823082; rs637572; rs671330; rs680314; rs688136 
Scc15 11 LLGL1 17p11.2 Lethal giant larvae homolog 1 (DrosophilaReduced expression related to progression of colon cancer; similar to a tumor supressor in Drosophila rs11869582; rs2245430; rs2245737; rs2290505; rs2746027; rs8821 
Ccs1 12 FOS 14q24.3 v-fos FBJ murine osteosarcoma viral oncogene homolog Signal transduction protein implicated in cell proliferation and differentiation rs1046117; rs1569328; rs3742769; rs7101 
JDP2 14q24.3 Jun dimerization protein 2 Mediator in UV-induced apoptosis, cell differentiation, tumorigenesis, and angiogenesis rs10057; rs10873278; rs1474503; rs175644; rs4899566; rs84044 

NOTE: For some of the Scc loci, more than one gene was selected because of their possible functional implications.

SNP Selection and Genotyping

One hundred forty-seven SNPs were selected from the 21 genes with PupaSuite (10), FESD (11), dbSNP (12), and HapMap Phase II (genome build 36; ref. 13). SNPs with unadjusted P values <0.01 were replicated in an independent case-control series. Genotyping was done in the SNPlex (Applied Biosystems), MassARRAY (Sequenom, Inc.), and TaqMan (Applied Biosystems) platforms at the Santiago de Compostela node of the Spanish Genotyping Center.

Statistical Analyses

Quality control was assessed with the Genotyping Data Filter (14) and Structure v2.2 (15). Genotypic distributions in controls followed Hardy-Weinberg equilibrium, and there was no sign of underlying population stratification. Association was evaluated for every single SNP and all possible haplotypes in each gene with Haploview v4.0 (16) and Unphased (17). Permutation tests and Bonferroni were used for multiple-testing corrections. Odds ratio (OR) and 95% confidence intervals were calculated with PLINK v1.03 (18). Descriptive information and association data for all the SNPs that passed quality control are shown in Supplementary Table S1.

Allelic association tests revealed only one significant SNP after multiple-testing correction: rs12086058, lying in an intergenic region 6.4 kb upstream the CYR61 gene (1p31-p22). The OR value for this SNP showed a protective effect of the minor allele (Table 2). Haplotype analysis and comparisons between sporadic and familial groups did not yield any significant associations (data not shown).

Table 2.

Association analyses for the three SNPs selected for replication on stage II

SNP_IDGeneRelevanceAllelesObserved MAFOR (95% CI)χ2 1dfPStage I permutations PBonferroni PStage II χ2 1dfP
rs12086058 CYR61 5′UTR A/G 0.428 0.71 (0.59-0.86) 0.0005 0.0326 0.0405 0.4099 
rs954353 CYR61 5′UTR A/G 0.434 0.70 (0.59-0.84) 0.0002 0.0246 0.027 0.3917 
rs10077888 TNFAIP8 Intronic C/G 0.302 0.75 (0.61-0.92) 0.0019 0.2058 0.2565 0.8188 
SNP_IDGeneRelevanceAllelesObserved MAFOR (95% CI)χ2 1dfPStage I permutations PBonferroni PStage II χ2 1dfP
rs12086058 CYR61 5′UTR A/G 0.428 0.71 (0.59-0.86) 0.0005 0.0326 0.0405 0.4099 
rs954353 CYR61 5′UTR A/G 0.434 0.70 (0.59-0.84) 0.0002 0.0246 0.027 0.3917 
rs10077888 TNFAIP8 Intronic C/G 0.302 0.75 (0.61-0.92) 0.0019 0.2058 0.2565 0.8188 

Abbreviations: MAF, minor allele frequency; 95% CI, 95% confidence interval; UTR, untranslated region.

Linkage disequilibrium analysis in the CYR61 region showed rs12086058 to be in high correlation with rs954353 (r2 = 1). This SNP was located 1.8 kb upstream CYR61, which suggested a possible implication in the cis-regulation of the gene. Genotyping of rs954353 yielded a better association value than rs12086058 (2 × 10−4). OR also showed a protective effect of the minor allele (Table 2).

To verify the results, SNPs with nominal P < 0.01 (rs12086058, rs954353, and rs10077888) were further replicated on an independent sample. Nevertheless, none of the associations could be replicated (Table 2).

Our study combines the advances in CRC genetics in animal models with the investigation of the variation underlying the disease in humans. We selected 21 genes identified from syntenic regions defined by mouse QTLs to screen their SNP variability in a two-stage case-control association study. However, we did not find any replicable association. Our study had enough power to detect OR ≥1.3, assuming allelic association and α = 0.05 (19). Results in stage I were therefore simply due to chance or to type I error.

Nevertheless, our most strongly associated SNP, rs954353, seems to be in the 5′ region of the CYR61 gene, which could still implicate it in terms of cis-regulation. We analyzed the region harboring rs954353 and found it to be lying very close to two transcription factor binding site sequences. The direct sequencing of these failed to find any common variants within the consensus target that could explain the association signal found in stage I. However, we did find a 6-bp insertion polymorphism 38 bp upstream the first transcription factor binding site. This variant showed significant differences in frequencies between cases and controls (P = 0.0236), although no further implications could be stated about its relationship with CRC susceptibility (data not shown).

CYR61 has been proposed as a connection point among signaling pathways and a probable marker for early CRC detection (20). Besides, it has been extensively implicated in carcinogenesis-related events such as angiogenesis (21), tissue invasion (22), cell migration, and metastasis (23), although no association studies have been published thus far that analyze its relationship with CRC.

Despite our negative results, we believe that our candidate gene selection, through the identification of genes or regions conferring susceptibility to other species, could be quite useful in the future determination of variants predisposing to disease. Our QTLs analyses proved to be very helpful as a starting point in the search for candidate genes affecting CRC susceptibility because all the genes identified were somehow related to carcinogenetic events. In fact, although this approach has not been successful thus far for CRC, it positively identified a haplotype in PTPRJ as a breast cancer genetic susceptibility low-penetrance allele (24). Hence, we encourage future efforts in this field and believe that the relationship between CYR61 and CRC should be studied in other populations to fully discard a putative genetic association.

No potential conflicts of interest were disclosed.

We thank all the patients that participated in this study, who were recruited in 25 Spanish hospitals as part of the EPICOLON project. S. Castellví-Bel is supported by a contract from the Fondo de Investigación Sanitaria (CP 03-0070). C. Fernández-Rozadilla has obtained a FPU Fellowship from the Ministerio de Educacion; CIBERER and CIBEREHD are funded by Instituto de Salud Carlos III. We thank Maria Magdalena-Castro, Olga Lortes, and Eva Fernández for their excellent technical assistance. M. Magdalena-Castro and E. Fernández are supported by Isabel Barreto's program from Xunta de Galicia, and O. Lortes by a contract from the CIBERER.

Grant Support: Fondo de Investigación Sanitaria/FEDER (06/1384, 08/0024, 08/1276), Fundación Mutua Madrileña (C. Ruiz-Ponte and S. Castellví-Bel), Ministerio de Educación y Ciencia (SAF 07-64873), Asociación Española contra el Cáncer, Fundación Olga Torres (S. Castellví-Bel), Acción en Cáncer (Instituto de Salud Carlos III), and Xunta de Galicia (RHI07/04).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1
Ferlay
J
,
Bray
F
,
Pisani
P
,
Parkin
DM
.
GLOBOCAN 2002: cancer incidence, mortality and prevalence worldwide. IARC Cancer Base No. 5. Version 2.0
.
IARC Press
:
Lyon
2004
.
2
Houlston
RS
,
Peto
J
. 
The search for low-penetrance cancer susceptibility alleles
.
Oncogene
2004
;
23
:
6471
6
.
3
Demant
P
. 
Cancer susceptibility in the mouse: genetics, biology and implications for human cancer
.
Nat Rev Genet
2003
;
4
:
721
34
.
4
Ruivenkamp
CA
,
van Wezel
T
,
Zanon
C
, et al
. 
Ptprj is a candidate for the mouse colon-cancer susceptibility locus Scc1 and is frequently deleted in human cancers
.
Nat Genet
2002
;
31
:
295
300
.
5
Peters
LL
,
Robledo
RF
,
Bult
CJ
,
Churchill
GA
,
Paigen
BJ
,
Svenson
KL
. 
The mouse as a model for human biology: a resource guide for complex trait analysis
.
Nat Rev Genet
2007
;
8
:
58
69
.
6
Piñol
V
,
Castells
A
,
Andreu
A
, et al
. 
Accuracy of revised Bethesda guidelines, microsatellite instability, and immunohistochemistry for the identification of patients with hereditary nonpolyposis colorectal cancer
.
JAMA
2005
;
293
:
1986
94
.
7
Mouse Genomic Informatics. http://www.informatics.jax.org/.
9
The Gene Ontology. http://www.geneontology.org/.
10
Conde
L
,
Vaquerizas
JM
,
Dopazo
H
, et al
. 
PupaSuite: finding functional single nucleotide polymorphisms for large-scale genotyping purposes
.
Nucleic Acids Res
2006
;
34
:
W621
5
.
11
Kang
HJ
,
Choi
KO
,
Kim
BD
,
Kim
S
,
Kim
YJ
. 
FESD: a Functional Element SNPs Database in human
.
Nucleic Acids Res
2005
;
33
:
D518
22
.
13
The International HapMap Consortium. The International HapMap Project
.
Nature
2003
;
426
:
789
96
.
14
Genotyping Data Filter Software. Available at: http://bioinformatics.cesga.es/gdf/nav_input.php.
15
Pritchard
JK
,
Stephens
M
,
Donnelly
P
. 
Inference of population structure using multilocus genotype data
.
Genetics
2000
;
155
:
945
59
.
16
Barrett
JC
,
Fry
B
,
Maller
J
,
Daly
MJ
. 
Haploview: analysis and visualization of LD and haplotype maps
.
Bioinformatics
2005
;
21
:
263
5
.
17
Dudbridge
F
. 
Pedigree disequilibrium tests for multilocus haplotypes
.
Genet Epidemiol
2003
;
25
:
115
21
.
18
Purcell S, Neale B, Todd-Brown K, et al. PLINK: a toolset for whole-genome association and population-based linkage-analysis. Am J Hum Genet, 81. Available at: http://pngu.mgh.harvard.edu/~purcell/plink/.
19
Genetic Power Calculator. Available at: http://pngu.mgh.harvard.edu/~purcell/gpc/.
20
Hong
Y
,
Ho
KS
,
Eu
KW
,
Cheah
PY
. 
A susceptibility gene set for early onset colorectal cancer that integrates diverse signaling pathways: implication for tumorigenesis
.
Clin Cancer Res
2007
;
13
:
1107
14
.
21
Gashaw
I
,
Stiller
S
,
Boing
C
,
Kimmig
R
,
Winterhager
E
. 
Premenstrual regulation of the pro-angiogenic factor CYR61 in human endometrium
.
Endocrinology
2008
;
149
:
2261
9
.
22
Monnier
Y
,
Farmer
P
,
Bieler
G
, et al
. 
CYR61 and αVβ5 integrin cooperate to promote invasion and metastasis of tumors growing in preirradiated stroma
.
Cancer Res
2008
;
68
:
7323
31
.
23
Sun
ZJ
,
Wang
Y
,
Cai
Z
,
Chen
PP
,
Tong
XJ
,
Xie
D
. 
Involvement of CYR61 in growth, migration, and metastasis of prostate cancer cells
.
Br J Cancer
2008
;
99
:
1656
67
.
24
Lesueur
F
,
Pharoah
PD
,
Laing
S
, et al
. 
Allelic association of the human homologue of the mouse modifier Ptprj with breast cancer
.
Hum Mol Genet
2005
;
14
:
2349
56
.