Abstract
Background: Genome-wide association studies of colorectal cancer (CRC) have identified genetic variants that reproducibly associate with CRC. Associations of 12 single nucleotide polymorphisms at 8q24, 9p24, and 18q21 (SMAD7) and CRC were investigated in a three-center collaborative study including two U.K. case-control cohorts (Sheffield and Leeds) and a U.S. case-control study of CRC cases from high-risk Utah pedigrees.
Methods: Our combined resource included 1,092 CRC case subjects and 1,060 age- and sex-matched controls. Meta statistics and Monte Carlo significance testing using Genie software provided a valid combined analysis of our mixed independent and related case-control resource. We also evaluated whether these associations differed by sex, age at diagnosis, family history, or tumor site.
Results: At 8q24, we observed two independent significant associations at single nucleotide polymorphisms located in two different risk regions of 8q24: rs6983267 in region 3 [Ptrend = 0.01; per allele odds ratio (OR), 1.17; 95% confidence intervals (95% CI), 1.03-1.32] and rs10090154 in region 5 (Ptrend = 0.05; per allele OR, 1.24; 95% CI, 1.01-1.51). At 18q21, associations were observed in distal colon tumors but not in proximal or rectal cancers: rs4939827 (Ptrend = 0.007; per allele OR, 0.77; 95% CI, 0.64-0.93; case-case pdiff = 0.03) and rs12953717 (Ptrend = 0.01; per allele OR, 1.27; 95% CI, 1.06-1.52). We were unable to detect any associations at 9p24 with CRC.
Conclusions: Our investigation confirms that variants across multiple risk regions of 8q24 are associated with CRC, and that associations at 18q21 differ by tumor site. (Cancer Epidemiol Biomarkers Prev 2009;18(2):616–21)
Introduction
Genome-wide association studies have recently identified common variants at chromosomes 8q24, 9p24, and 18q21 that are associated with colorectal cancer (CRC; refs. 1-5). At the 8q24 locus, multiple different independent regions have been reported to be associated with several common cancers including prostate, breast, colorectal, and ovarian cancers (6-8). Most recently, five separate 8q24 cancer risk regions have been suggested (8). Of these five regions, the CRC genome-wide association studies consistently indicate variants in region 3 as associated with CRC (1, 2, 5), and these have been followed by replication in candidate single nucleotide polymorphism (SNP) studies of variants rs6983267 and rs10505477 (6, 8-12). One study included a broader set of 8q24 variants in their replication efforts and additionally suggested association of a region 5 variant, rs10090154, with CRC (6). A single genome-wide association study proposed a risk locus at 9p24 (1), which has been replicated in a single candidate SNP study (9). Two genome-wide association studies identified a risk locus for CRC at 18q21 (which contains the gene SMAD7, also a functional candidate gene for CRC; refs. 3, 5). It has also been observed that variants at 18q21 may differ in frequency by CRC tumor site (5).
In this three-center collaborative investigation, we studied 12 previously identified SNPs at chromosomes 8q24, 9p24, and 18q21 for association with CRC and evaluated if these differed by sex, age at diagnosis, family history, or tumor site. Our total resource included two U.K. case-control cohorts (Sheffield and Leeds) and a case-control study of cases from high-risk Utah cancer pedigrees. Single SNP and multilocus associations were carried out. Meta-association statistics and Monte Carlo significance testing were used to provide a valid combined analysis of the mixture of independent and related individuals.
Materials and Methods
Study Population
In Sheffield, CRC cases were identified from subjects resident in Sheffield, U.K. and undergoing surgery for a primary colorectal tumor at the Royal Hallamshire or Northern General Hospitals, Sheffield between March 2001 and June 2005. Control subjects were identified from Sheffield General Practice registers and recruited between October 2001 and December 2005. In Leeds, CRC cases were identified from examination of pathology records at the Leeds Teaching Hospitals NHS Trust, and age- and sex-matched controls were identified from the records of general practitioners of cases as described previously (13-15). In Utah, CRC cases were selected from 244 high-risk cancer pedigrees; one case per pedigree from 156 pedigrees (156 independent CRC cases) and two or more cases from 88 pedigrees (282 related CRC cases). A high-risk pedigree was defined as one containing a statistical excess of individuals with cancer, as assessed using the Utah Population DataBase. The Utah Population DataBase is a genealogic resource of ∼2.3 million individuals that is record-linked to data from Utah Cancer Registry. Utah controls, a convenience sample not specifically ascertained for this study, were selected to be cancer-free and were matched by sex and 5-y birth cohort to the prevalent cases. Because age of Utah controls represents their age at ascertainment for prior studies, age at diagnosis for cases and age at selection for controls do not necessarily correspond; however, cases and controls were well matched for age based on birth cohort (see footnote † of Table 1). Study subjects in all three centers were of North European descent. The total resource included 1,092 cases and 1,060 controls that were genotyped for 12 variants at 8q24, 9p24, and 18q21.
. | All centers . | . | Utah . | . | Sheffield . | . | Leeds . | . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
. | Cases . | Controls . | Cases* . | Controls . | Cases . | Controls . | Cases . | Controls . | |||||||
. | n (%) . | . | . | . | . | . | . | . | |||||||
Total subjects | 1,092 (100.0) | 1,060 (100.0) | 438 (100.0) | 439 (100.0) | 405 (100.0) | 403 (100.0) | 249 (100.0) | 218 (100.0) | |||||||
Independent | 810 (74.2) | 1,060 (100.0) | 156 (35.6) | 439 (100.0) | 405 (100.0) | 403 (100.0) | 249 (100.0) | 218 (100.0) | |||||||
Related | 282 (25.8) | 0 (0.0) | 282 (64.4) | 0 (0.0) | 0 (0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | |||||||
Men | 600 (54.9) | 569 (53.7) | 241 (55.0) | 245 (55.8) | 221 (54.6) | 198 (49.1) | 138 (55.4) | 126 (57.8) | |||||||
Women | 491 (45.0) | 485 (45.8) | 197 (45.0) | 194 (44.2) | 184 (45.4) | 199 (49.4) | 110 (44.2) | 92 (42.2) | |||||||
Unknown | 1 (0.1) | 6 (0.5) | 0 (0) | 0 (0.0) | 0 (0) | 6 (1.5) | 1 (0.4) | 0 (0.0) | |||||||
Family history of CRC | 1,083 (100.0) | 1,060 (100.0) | 438 (100.0) | 439 (100.0) | 405 (100.0) | 403 (100.0) | 240 (100.0) | 218 (100.0) | |||||||
None | 591 (54.6) | 967 (91.2) | 56 (12.8) | 410 (93.4) | 333 (82.2) | 364 (90.3) | 202 (84.2) | 193 (88.5) | |||||||
1 relative | 351 (32.4) | 82 (7.7) | 258 (58.9) | 25 (5.7) | 60 (14.8) | 33 (8.2) | 33 (13.8) | 24 (11.0) | |||||||
≥2 relatives | 136 (12.6) | 10 (0.9) | 124 (28.3) | 4 (0.9) | 12 (3.0) | 6 (1.5) | 5 (2.1) | 1 (0.5) | |||||||
Age at onset or selection (y)† | 1,079 (100.0) | 621 (100.0) | 426 (100.0) | — | 405 (100.0) | 403 (100.0) | 248 (100.0) | 218 (100.0) | |||||||
≤50 | 82 (7.6) | 20 (3.2) | 53 (12.4) | — | 20 (4.9) | 11 (2.7) | 9 (3.6) | 9 (4.1) | |||||||
50-59 | 186 (17.2) | 126 (20.3) | 67 (15.7) | — | 80 (19.8) | 97 (24.1) | 39 (15.7) | 29 (13.3) | |||||||
60-69 | 314 (29.1) | 236 (38.0) | 128 (30.0) | — | 104 (25.7) | 154 (38.2) | 82 (33.1) | 82 (37.6) | |||||||
70 or older | 497 (46.1) | 239 (38.5) | 178 (41.8) | — | 201 (49.6) | 141 (35.0) | 118 (47.6) | 98 (45.0) | |||||||
Site‡ | 1,120 (100.0) | — | 456 (100.0) | — | 415 (100.0) | — | 249 (100.0) | — | |||||||
Proximal colon | 318 (28.4) | — | 140 (30.7) | — | 106 (25.5) | — | 72 (28.9) | — | |||||||
Distal colon | 340 (30.4) | — | 147 (32.2) | — | 113 (27.2) | — | 80 (32.1) | — | |||||||
Colon, NOS | 19 (1.7) | — | 14 (3.1) | — | 0 (0.0) | — | 5 (2.0) | — | |||||||
Rectal | 369 (32.9) | — | 111 (24.3) | — | 167 (40.2) | — | 91 (36.6) | — | |||||||
Colorectal, unknown | 74 (6.6) | — | 44 (9.6) | — | 29 (7.0) | — | 1 (0.4) | — |
. | All centers . | . | Utah . | . | Sheffield . | . | Leeds . | . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
. | Cases . | Controls . | Cases* . | Controls . | Cases . | Controls . | Cases . | Controls . | |||||||
. | n (%) . | . | . | . | . | . | . | . | |||||||
Total subjects | 1,092 (100.0) | 1,060 (100.0) | 438 (100.0) | 439 (100.0) | 405 (100.0) | 403 (100.0) | 249 (100.0) | 218 (100.0) | |||||||
Independent | 810 (74.2) | 1,060 (100.0) | 156 (35.6) | 439 (100.0) | 405 (100.0) | 403 (100.0) | 249 (100.0) | 218 (100.0) | |||||||
Related | 282 (25.8) | 0 (0.0) | 282 (64.4) | 0 (0.0) | 0 (0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | |||||||
Men | 600 (54.9) | 569 (53.7) | 241 (55.0) | 245 (55.8) | 221 (54.6) | 198 (49.1) | 138 (55.4) | 126 (57.8) | |||||||
Women | 491 (45.0) | 485 (45.8) | 197 (45.0) | 194 (44.2) | 184 (45.4) | 199 (49.4) | 110 (44.2) | 92 (42.2) | |||||||
Unknown | 1 (0.1) | 6 (0.5) | 0 (0) | 0 (0.0) | 0 (0) | 6 (1.5) | 1 (0.4) | 0 (0.0) | |||||||
Family history of CRC | 1,083 (100.0) | 1,060 (100.0) | 438 (100.0) | 439 (100.0) | 405 (100.0) | 403 (100.0) | 240 (100.0) | 218 (100.0) | |||||||
None | 591 (54.6) | 967 (91.2) | 56 (12.8) | 410 (93.4) | 333 (82.2) | 364 (90.3) | 202 (84.2) | 193 (88.5) | |||||||
1 relative | 351 (32.4) | 82 (7.7) | 258 (58.9) | 25 (5.7) | 60 (14.8) | 33 (8.2) | 33 (13.8) | 24 (11.0) | |||||||
≥2 relatives | 136 (12.6) | 10 (0.9) | 124 (28.3) | 4 (0.9) | 12 (3.0) | 6 (1.5) | 5 (2.1) | 1 (0.5) | |||||||
Age at onset or selection (y)† | 1,079 (100.0) | 621 (100.0) | 426 (100.0) | — | 405 (100.0) | 403 (100.0) | 248 (100.0) | 218 (100.0) | |||||||
≤50 | 82 (7.6) | 20 (3.2) | 53 (12.4) | — | 20 (4.9) | 11 (2.7) | 9 (3.6) | 9 (4.1) | |||||||
50-59 | 186 (17.2) | 126 (20.3) | 67 (15.7) | — | 80 (19.8) | 97 (24.1) | 39 (15.7) | 29 (13.3) | |||||||
60-69 | 314 (29.1) | 236 (38.0) | 128 (30.0) | — | 104 (25.7) | 154 (38.2) | 82 (33.1) | 82 (37.6) | |||||||
70 or older | 497 (46.1) | 239 (38.5) | 178 (41.8) | — | 201 (49.6) | 141 (35.0) | 118 (47.6) | 98 (45.0) | |||||||
Site‡ | 1,120 (100.0) | — | 456 (100.0) | — | 415 (100.0) | — | 249 (100.0) | — | |||||||
Proximal colon | 318 (28.4) | — | 140 (30.7) | — | 106 (25.5) | — | 72 (28.9) | — | |||||||
Distal colon | 340 (30.4) | — | 147 (32.2) | — | 113 (27.2) | — | 80 (32.1) | — | |||||||
Colon, NOS | 19 (1.7) | — | 14 (3.1) | — | 0 (0.0) | — | 5 (2.0) | — | |||||||
Rectal | 369 (32.9) | — | 111 (24.3) | — | 167 (40.2) | — | 91 (36.6) | — | |||||||
Colorectal, unknown | 74 (6.6) | — | 44 (9.6) | — | 29 (7.0) | — | 1 (0.4) | — |
Two hundred forty-four high-risk cancer families; 1 case from 156 pedigrees (156 CRC cases) and ≥2 cases from 88 pedigrees (288 CRC cases).
Controls in Utah were matched on birth cohort to prevalent cases (see Materials and Methods); mean (±SE) birth cohort: cases, 1,922 (±0.52); controls, 1,923 (±0.53).
Includes multiple primary CRC cancers: Utah, 31; Sheffield, 6; Leeds, none.
Proximal colon site was defined as tumors of the cecum through transverse colon. Distal colon was defined as tumors of the splenic flexure, descending, and sigmoid colon. Rectal cancer was defined as tumors of the rectosigmoid junction and rectum.
Genotyping
Genotyping was carried out in 384-well plates using the Applied Biosystems SNPlex system, which allows multiplex analysis of up to 48 SNPs.4
At least 5% of samples were duplicated in the plates to assess the reproducibility of the genotype calls. For each SNP, duplicate concordance, call rate, and test for compliance with Hardy-Weinberg equilibrium in controls are shown in Supplementary Table S1.Statistical Analysis
The analyses were conducted using Genie 2.6.2, a freely available software package.5
Genie provides valid genetic association testing in cases and controls that include related individuals using Monte Carlo significance testing (16, 17). Genotypes for all SNP markers were tested for deviation from Hardy-Weinberg equilibrium in controls. Pairwise linkage disequilibrium was assessed between SNP markers within each region using r2 and D′. Meta statistics [χ2 test of trend, odds ratios (OR), and 95% confidence intervals (95% CI)] for SNP markers and CRC risk controlling for study were calculated using Cochran-Mantel Haenszel techniques available in Genie (17). We repeated our analyses also controlling for sex and early or late age of diagnosis. These secondary results did not differ substantively and are therefore not shown. Stratified analyses by sex, age at diagnosis, family history, combined age at diagnosis and family history, and tumor site were done. Cochran's Q test was conducted to assess homogeneity of effect size across studies. Statistical heterogeneity was considered present if P < 0.05. All P values were empirically derived based on 10,000 simulations in the Genie null distribution as described (16, 17). Haplotypes were estimated based on an expectation-maximization algorithm, and the hapConstructor module of Genie v.2.6.2 was used to comprehensively analyze multilocus haplotypes within the 8q24, 9p24, and 18q21 regions (18).Results
Descriptions of the study populations in the combined resource are shown in Table 1. Cases from Utah had a higher proportion of first-degree relatives with CRC than cases in the U.K. cohorts (PANOVA < 0.001), as well as a higher proportion of early-onset cases (age 59 and younger), as would be expected for CRC cases selected from high-risk cancer pedigrees (PANOVA = 0.002). Utah also had a lower proportion of rectal cancer (PANOVA < 0.001). This is also as expected because the CRC high-risk pedigrees were ascertained primarily for excess of colon cancers. Table 2 describes the 12 SNPs studied. All SNPs were in Hardy-Weinberg equilibrium. The number of cases and controls, of each SNP genotype, by individual study center is provided in Supplementary Table S2.
Locus . | SNP . | bp . | Region* . | Major/minor allele . | Cases . | . | Controls . | . | ||
---|---|---|---|---|---|---|---|---|---|---|
. | . | . | . | . | n . | MAF . | n . | MAF . | ||
8q24 | rs10505477 | 128476625 | 3 | T/C | 1,071 | 0.46 | 1,040 | 0.49 | ||
rs10808556 | 128482329 | 3 | T/C | 925 | 0.45 | 934 | 0.42 | |||
rs6983267 | 128482487 | 3 | G/T | 1,069 | 0.44 | 1,040 | 0.48 | |||
rs7013278 | 128484074 | 3 | C/T | 1,063 | 0.40 | 1,040 | 0.39 | |||
rs7000448 | 128510352 | 4 | C/T | 1,064 | 0.37 | 1,040 | 0.37 | |||
rs1447295 | 128554220 | 5 | C/A | 1,072 | 0.12 | 1,045 | 0.10 | |||
rs10090154 | 128601319 | 5 | C/T | 1,084 | 0.11 | 1,050 | 0.09 | |||
9p24 | rs719725 | 6355683 | — | A/C | 1,069 | 0.37 | 1,047 | 0.38 | ||
rs7857826 | 17833594 | — | T/A | 1,072 | 0.41 | 1,047 | 0.41 | |||
18q21 | rs4939827 | 44707461 | — | T/C | 1,065 | 0.45 | 1,041 | 0.48 | ||
rs12953717 | 44707927 | — | C/T | 1,070 | 0.46 | 1,041 | 0.43 | |||
rs4464148 | 44713030 | — | T/C | 1,070 | 0.31 | 1,047 | 0.29 |
Locus . | SNP . | bp . | Region* . | Major/minor allele . | Cases . | . | Controls . | . | ||
---|---|---|---|---|---|---|---|---|---|---|
. | . | . | . | . | n . | MAF . | n . | MAF . | ||
8q24 | rs10505477 | 128476625 | 3 | T/C | 1,071 | 0.46 | 1,040 | 0.49 | ||
rs10808556 | 128482329 | 3 | T/C | 925 | 0.45 | 934 | 0.42 | |||
rs6983267 | 128482487 | 3 | G/T | 1,069 | 0.44 | 1,040 | 0.48 | |||
rs7013278 | 128484074 | 3 | C/T | 1,063 | 0.40 | 1,040 | 0.39 | |||
rs7000448 | 128510352 | 4 | C/T | 1,064 | 0.37 | 1,040 | 0.37 | |||
rs1447295 | 128554220 | 5 | C/A | 1,072 | 0.12 | 1,045 | 0.10 | |||
rs10090154 | 128601319 | 5 | C/T | 1,084 | 0.11 | 1,050 | 0.09 | |||
9p24 | rs719725 | 6355683 | — | A/C | 1,069 | 0.37 | 1,047 | 0.38 | ||
rs7857826 | 17833594 | — | T/A | 1,072 | 0.41 | 1,047 | 0.41 | |||
18q21 | rs4939827 | 44707461 | — | T/C | 1,065 | 0.45 | 1,041 | 0.48 | ||
rs12953717 | 44707927 | — | C/T | 1,070 | 0.46 | 1,041 | 0.43 | |||
rs4464148 | 44713030 | — | T/C | 1,070 | 0.31 | 1,047 | 0.29 |
Abbreviation: MAF, minor allele frequency.
Regions as defined by Ghoussaini et al. (8).
The meta association results for each SNP with CRC are shown in Table 3. Based on Cochran's Q test, there were no results that exhibited significant heterogeneity across studies. In 8q24, we observed two nominally significant results for SNPs rs6983267 (Ptrend = 0.01) and rs10090154 (Ptrend = 0.05), associated with modest per allele increased risks. For rs6983267, the high-risk G allele was the major allele in our study population; however, allele T was considered the reference allele to be consistent with previous studies. SNP rs6983267 resides in region 3 of the 8q24 locus (8). In our controls, the linkage disequilibrium between rs6983267 and the other region 3 SNPs studied, rs10505477, rs10808556, and rs7013278, was r2 of 0.94, 0.65, and 0.57 (|D′ | of 0.99, 0.99, and 1.00), respectively. However, none of the other region 3 SNPs were significant. SNP rs10090154 resides in region 5 of 8q24 (8). One other region 5 SNP with an r2 of 0.89 (|D′| of 0.98) with rs10090154 was studied (rs1447295) but was not significantly associated with CRC (P = 0.10). We inspected the study-specific results for rs6983267 and rs10090154. Although there was no statistically significant evidence for heterogeneity across the three studies for these two SNPs (both Phom > 0.30), we observed that the association signal for both was primarily driven by the two U.K. cohorts (Supplementary Table S3). Stratification by family history (Supplementary Table S4), age at diagnosis, a combination of family history and age at diagnosis (Supplementary Table S5), or tumor site, variables that differed between the U.K. and U.S. sites, did not explain this observation. Potential sources of heterogeneity between the two U.K. sites and the U.S. site include environmental factors, particularly alcohol consumption and smoking, and phenotypic heterogeneity, particularly that undiscovered high-risk alleles could exist in the Utah pedigrees. However, these tests for heterogeneity could not be directly assessed because the data were not available for this study.
Locus . | SNP . | Genotype . | Total . | . | Meta OR (95% CI) . | Ptrend* . | Per allele OR (95% CI) . | Phom† . | |
---|---|---|---|---|---|---|---|---|---|
. | . | . | Controls . | Cases . | . | . | . | . | |
8q24 | rs10505477 | TT | 273 | 304 | 1 (Reference) | ||||
TC | 519 | 544 | 0.94 (0.76-1.16) | ||||||
CC | 248 | 223 | 0.81 (0.63-1.03) | 0.10 | 0.90 (0.79-1.02) | 0.11 | |||
rs10808556 | TT | 304 | 273 | 1 (Reference) | |||||
TC | 469 | 477 | 1.14 (0.92-1.42) | ||||||
CC | 161 | 175 | 1.22 (0.93-1.61) | 0.16 | 1.10 (0.96-1.26) | 0.08 | |||
rs6983267‡ | TT | 234 | 202 | 1 (Reference) | |||||
TG | 520 | 534 | 1.20 (0.95-1.50) | ||||||
GG | 286 | 333 | 1.36 (1.06-1.76) | 0.01 | 1.17 (1.03-1.32) | 0.32 | |||
rs7013278 | CC | 389 | 379 | 1 (Reference) | |||||
CT | 500 | 516 | 1.06 (0.87-1.28) | ||||||
TT | 151 | 168 | 1.14 (0.86-1.50) | 0.34 | 1.06 (0.93-1.21) | 0.38 | |||
rs7000448 | CC | 411 | 437 | 1 (Reference) | |||||
CT | 484 | 470 | 0.91 (0.75-1.10) | ||||||
TT | 145 | 157 | 1.03 (0.78-1.38) | 0.88 | 0.99 (0.87-1.13) | 0.97 | |||
rs1447295 | CC | 844 | 837 | 1 (Reference) | |||||
CA | 188 | 215 | 1.16 (0.93-1.44) | ||||||
AA | 13 | 20 | 1.59 (0.70-3.61) | 0.10 | 1.19 (0.98-1.45) | 0.60 | |||
rs10090154 | CC | 862 | 856 | 1 (Reference) | |||||
CT | 180 | 210 | 1.17 (0.94-1.46) | ||||||
TT | 8 | 18 | 2.24 (0.91-5.50) | 0.05 | 1.24 (1.01-1.51) | 0.49 | |||
9p24 | rs719725 | AA | 405 | 427 | 1 (Reference) | ||||
AC | 490 | 494 | 0.97 (0.80-1.17) | ||||||
CC | 152 | 148 | 0.93 (0.70-1.22) | 0.55 | 0.96 (0.84-1.10) | 0.37 | |||
rs7857826 | TT | 352 | 368 | 1 (Reference) | |||||
TA | 522 | 533 | 0.99 (0.81-1.20) | ||||||
AA | 173 | 171 | 0.95 (0.73-1.24) | 0.73 | 0.98 (0.86-1.11) | 0.88 | |||
18q21 | rs4939827 | TT | 274 | 324 | 1 (Reference) | ||||
TC | 538 | 520 | 0.83 (0.67-1.02) | ||||||
CC | 229 | 221 | 0.84 (0.65-1.08) | 0.12 | 0.91 (0.80-1.03) | 0.17 | |||
rs12953717 | CC | 332 | 314 | 1 (Reference) | |||||
CT | 521 | 530 | 1.07 (0.87-1.31) | ||||||
TT | 188 | 226 | 1.26 (0.98-1.63) | 0.09 | 1.12 (0.98-1.27) | 0.38 | |||
rs4464148 | TT | 535 | 503 | 1 (Reference) | |||||
TC | 423 | 472 | 1.18 (0.99-1.42) | ||||||
CC | 89 | 95 | 1.13 (0.81-1.57) | 0.14 | 1.11 (0.97-1.27) | 0.24 |
Locus . | SNP . | Genotype . | Total . | . | Meta OR (95% CI) . | Ptrend* . | Per allele OR (95% CI) . | Phom† . | |
---|---|---|---|---|---|---|---|---|---|
. | . | . | Controls . | Cases . | . | . | . | . | |
8q24 | rs10505477 | TT | 273 | 304 | 1 (Reference) | ||||
TC | 519 | 544 | 0.94 (0.76-1.16) | ||||||
CC | 248 | 223 | 0.81 (0.63-1.03) | 0.10 | 0.90 (0.79-1.02) | 0.11 | |||
rs10808556 | TT | 304 | 273 | 1 (Reference) | |||||
TC | 469 | 477 | 1.14 (0.92-1.42) | ||||||
CC | 161 | 175 | 1.22 (0.93-1.61) | 0.16 | 1.10 (0.96-1.26) | 0.08 | |||
rs6983267‡ | TT | 234 | 202 | 1 (Reference) | |||||
TG | 520 | 534 | 1.20 (0.95-1.50) | ||||||
GG | 286 | 333 | 1.36 (1.06-1.76) | 0.01 | 1.17 (1.03-1.32) | 0.32 | |||
rs7013278 | CC | 389 | 379 | 1 (Reference) | |||||
CT | 500 | 516 | 1.06 (0.87-1.28) | ||||||
TT | 151 | 168 | 1.14 (0.86-1.50) | 0.34 | 1.06 (0.93-1.21) | 0.38 | |||
rs7000448 | CC | 411 | 437 | 1 (Reference) | |||||
CT | 484 | 470 | 0.91 (0.75-1.10) | ||||||
TT | 145 | 157 | 1.03 (0.78-1.38) | 0.88 | 0.99 (0.87-1.13) | 0.97 | |||
rs1447295 | CC | 844 | 837 | 1 (Reference) | |||||
CA | 188 | 215 | 1.16 (0.93-1.44) | ||||||
AA | 13 | 20 | 1.59 (0.70-3.61) | 0.10 | 1.19 (0.98-1.45) | 0.60 | |||
rs10090154 | CC | 862 | 856 | 1 (Reference) | |||||
CT | 180 | 210 | 1.17 (0.94-1.46) | ||||||
TT | 8 | 18 | 2.24 (0.91-5.50) | 0.05 | 1.24 (1.01-1.51) | 0.49 | |||
9p24 | rs719725 | AA | 405 | 427 | 1 (Reference) | ||||
AC | 490 | 494 | 0.97 (0.80-1.17) | ||||||
CC | 152 | 148 | 0.93 (0.70-1.22) | 0.55 | 0.96 (0.84-1.10) | 0.37 | |||
rs7857826 | TT | 352 | 368 | 1 (Reference) | |||||
TA | 522 | 533 | 0.99 (0.81-1.20) | ||||||
AA | 173 | 171 | 0.95 (0.73-1.24) | 0.73 | 0.98 (0.86-1.11) | 0.88 | |||
18q21 | rs4939827 | TT | 274 | 324 | 1 (Reference) | ||||
TC | 538 | 520 | 0.83 (0.67-1.02) | ||||||
CC | 229 | 221 | 0.84 (0.65-1.08) | 0.12 | 0.91 (0.80-1.03) | 0.17 | |||
rs12953717 | CC | 332 | 314 | 1 (Reference) | |||||
CT | 521 | 530 | 1.07 (0.87-1.31) | ||||||
TT | 188 | 226 | 1.26 (0.98-1.63) | 0.09 | 1.12 (0.98-1.27) | 0.38 | |||
rs4464148 | TT | 535 | 503 | 1 (Reference) | |||||
TC | 423 | 472 | 1.18 (0.99-1.42) | ||||||
CC | 89 | 95 | 1.13 (0.81-1.57) | 0.14 | 1.11 (0.97-1.27) | 0.24 |
NOTE: Case-control comparison; reference genotype is homozygous for major allele.
Empirical Cochran-Mantel Haenszel Ptrend for additive model based on 10,000 simulations.
Empirical Q test to assess homogeneity, Phom, based on 10,000 simulations.
Allele G is the major allele in our resource; however, reference genotype is homozygous for minor T allele for consistency with other published studies.
To investigate multilocus associations at 8q24, we applied a haplotype mining method (18). Using data for all seven 8q24 SNPs, the method identified a two-SNP haplotype (G-T) across rs6983267 and rs10090154 variants as the most significantly associated with CRC when compared with a referent haplotype of T-C (P = 0.0004). It is also of interest that, in contrast to single-SNP analyses of rs6983267 and rs10090154, the two-locus model (carriage of G-T) association signal (ORmeta = 2.04; 95% CI, 1.42-3.34) was more similar across the U.S. and U.K. cohorts [ORSheffield, 2.65 (1.34-5.24); ORLeeds, 2.55 (0.98-6.63); and ORUtah, 1.62 (0.80-3.31)]. These two markers are not in linkage disequilibrium (r2 < 0.01 and |D′| = 0.13) and are considered to reside in two separate 8q24 regions (8). We simultaneously analyzed both SNPs, each modeled as log-additive. The significance for both SNPs, when included in the same model, was almost unchanged from their single SNP tests (Table 3; rs6983267: P = 0.02, compared with P = 0.01 in the single SNP test; rs10090154: P = 0.065, compared with P = 0.05 in the single SNP test). We further found no significant evidence for multiplicative interaction (P = 0.14), although our power to detect interaction was lower than for main effects. Hence, although rs10090154 did not reach significance at the 0.05 level in the simultaneous analysis (P = 0.065), the largely unchanged effects when rs6983267 and rs10090154 were considered together or separately and the distinctly greater risk and significance of the two-locus model are consistent with the conclusions of others that these two SNPs may be independent risk loci for CRC (6). Independent roles for SNPs in regions 3 and 5 have also been suggested for prostate cancer associations (8).
There were no significant associations in 9p24 and 18q21 SNPs and overall risk of CRC. Associations at 9p24 did not differ when the resource was stratified by sex, age, family history, age and family history, or tumor site. Associations at 18q21 and overall risk of CRC were improved in the stratification by early-onset and family history (Supplementary Table S5) and were best observed in the stratification by tumor site (Table 4). Significant associations were observed for distal colon tumors compared with controls for two SNPs but not for proximal colon and rectal tumors (Table 4). In a distal CRC-control comparison, the minor C allele at rs4939827 was associated with a decreased risk of distal colon tumors (Ptrend = 0.007) and the minor T allele at rs12953717 with increased risk (Ptrend = 0.01). The linkage disequilibrium between rs4939827 and rs12953717 is reasonably high (r2 = 0.59, |D′| = 0.92), suggesting redundancy of information between the SNPs that was confirmed in a simultaneous analysis of both SNPs. For rs4939827, there was a statistically significant difference between distal and other CRC sites in a case-case comparison (P = 0.03). The finding for distal CRC was consistently observed across the U.K. and Utah study sites (Supplementary Table S3). Findings were less significant in each study center considered alone, thus illustrating the benefit of increased power to detect associations from larger sample sizes inherent in a multistudy collaboration. Haplotype mining did not identify any interesting two-locus associations or a three-locus association with distal CRC.
SNP . | Genotype . | Proximal colon . | . | Distal colon . | . | Rectal . | . | |||
---|---|---|---|---|---|---|---|---|---|---|
. | . | Meta OR (95% CI) . | Ptrend* . | Meta OR (95% CI) . | Ptrend* . | Meta OR (95% CI) . | Ptrend* . | |||
rs4939827 | TC | 0.81 (0.60-1.09) | 0.73 | 0.67 (0.50-0.90) | 0.007 | 0.99 (0.75-1.30) | 0.91 | |||
CC | 0.94 (0.64-1.37) | 0.63 (0.43-0.90) | 1.00 (0.70-1.42) | |||||||
Per allele | 0.97 (0.80-1.17) | 0.77 (0.64-0.93) | 0.99 (0.83-1.18) | |||||||
rs12953717 | CT | 1.02 (0.76-1.37) | 0.46 | 1.15 (0.85-1.55) | 0.01 | 1.13 (0.85-1.49) | 0.45 | |||
TT | 1.18 (0.81-1.71) | 1.61 (1.11-2.34) | 1.12 (0.80-1.59) | |||||||
Per allele | 1.07 (0.89-1.29) | 1.27 (1.06-1.52) | 1.07 (0.90-1.27) | |||||||
rs4464148 | TC | 1.14 (0.87-1.50) | 0.46 | 1.32 (1.01-1.73) | 0.11 | 1.13 (0.88-1.46) | 0.44 | |||
CC | 1.07 (0.68-1.69) | 1.19 (0.77-1.86) | 1.08 (0.70-1.66) | |||||||
Per allele | 1.08 (0.88-1.31) | 1.17 (0.97-1.43) | 1.07 (0.89-1.30) |
SNP . | Genotype . | Proximal colon . | . | Distal colon . | . | Rectal . | . | |||
---|---|---|---|---|---|---|---|---|---|---|
. | . | Meta OR (95% CI) . | Ptrend* . | Meta OR (95% CI) . | Ptrend* . | Meta OR (95% CI) . | Ptrend* . | |||
rs4939827 | TC | 0.81 (0.60-1.09) | 0.73 | 0.67 (0.50-0.90) | 0.007 | 0.99 (0.75-1.30) | 0.91 | |||
CC | 0.94 (0.64-1.37) | 0.63 (0.43-0.90) | 1.00 (0.70-1.42) | |||||||
Per allele | 0.97 (0.80-1.17) | 0.77 (0.64-0.93) | 0.99 (0.83-1.18) | |||||||
rs12953717 | CT | 1.02 (0.76-1.37) | 0.46 | 1.15 (0.85-1.55) | 0.01 | 1.13 (0.85-1.49) | 0.45 | |||
TT | 1.18 (0.81-1.71) | 1.61 (1.11-2.34) | 1.12 (0.80-1.59) | |||||||
Per allele | 1.07 (0.89-1.29) | 1.27 (1.06-1.52) | 1.07 (0.90-1.27) | |||||||
rs4464148 | TC | 1.14 (0.87-1.50) | 0.46 | 1.32 (1.01-1.73) | 0.11 | 1.13 (0.88-1.46) | 0.44 | |||
CC | 1.07 (0.68-1.69) | 1.19 (0.77-1.86) | 1.08 (0.70-1.66) | |||||||
Per allele | 1.08 (0.88-1.31) | 1.17 (0.97-1.43) | 1.07 (0.89-1.30) |
NOTE: Case-control comparison; reference genotype is homozygous for major allele in study subjects.
Empirical Cochran-Mantel Haenszel Ptrend for additive model based on 10,000 simulations.
Discussion
Recent genome-wide association studies have suggested small effect susceptibility loci for CRC at 8q24, 9p24, and 18q21 (1-5). In our three-center study, we were able to replicate associations at 8q24 and 18q21 (SMAD7), but not 9p24. For 8q24, previous replications have been most abundant for SNPs in region 3 with CRC (1, 2, 5); however, region 5 has also been implicated (6). In region 3, variants rs6983267 and rs10505477 have predominantly been studied, with estimated per allele ORs ranging from 1.17 to 1.27 (1, 2, 5, 6, 8, 11). The magnitude of our estimate of effect size for rs6983267 is consistent with these previous values (OR, 1.17). In region 5, SNP rs10090154 has previously been suggested (6) with an estimated OR of 1.14. We were also able to replicate this finding with similar estimate for effect size (OR, 1.24). Multilocus analyses were consistent with two independent risk loci at 8q24 for CRC. Of note in our replications of both 8q24 SNPs is that, although no statistically significant heterogeneity was observed for either SNP, we did observe notable differences in effect size and association significance between the results for the two U.K. sites (Sheffield and Leeds) and the U.S. site (Utah; Supplementary Table S3). Estimates of the ORs in a meta analysis consisting of only the two U.K. cohorts yielded substantially larger per allele risks (1.33 and 1.43 for rs6983267 and rs10505477, respectively; Supplementary Table S3). Differences in familial and early onset disease, as observed by Poynter et al. (9), did not explain this observation (Supplementary Table S5). Potential differences in phenotype origin or environmental exposure are plausible differences between the U.K. and U.S. studies. Utah cases from high-risk pedigrees could be influenced by yet undiscovered high-risk alleles and therefore less influenced by low-risk 8q24 alleles, although it is pertinent to note that CRC cases in the pedigrees were screened for HNPCC variants and Amsterdam-type criteria, and none were found to be responsible for the clustering. Another plausible explanation is that there may be an important gene-environment interaction. It is certainly likely that the U.K. subjects differ in their exposure to cigarette smoke and alcohol consumption from individuals in Utah, many of whom abstain from smoking and drinking alcoholic beverages. This observation is anecdotal here but provides a hypothesis worthy of further exploration.
We were unable to confirm associations of variants rs719725 and rs7857826 at 9p24 and CRC. Results from our meta analysis and all stratification analyses indicated no evidence for association. SNP rs719725 was originally identified in a genome-wide association (1) and subsequently replicated in a candidate gene study (9); however, in the final internal replication of the genome-wide association, rs719725 was unsuccessfully replicated (P = 0.61). Although the overall seven-site meta was significant (P = 0.023), four of the seven individual sites were not significant, with three sites indicating ORs in the opposite direction. For small ORs, we cannot definitively refute the 9p24 locus, but it is clear that the signal at this locus is less robust.
Our investigation of 18q21 variants is the first candidate SNP study to follow up the genome-wide association findings for this locus, including both colon and rectal samples (3, 5). We were not able to replicate findings in a general CRC case-control comparison, although when subset by tumor site, we were able to find significant association for distal colon cancer compared with controls, but not proximal colon or rectal cancers, in tumor site–specific analyses. Allele C at rs4939827 indicated decreased risk for distal colon cancer compared with other sites (P = 0.03). Heterogeneity by tumor site has been reported elsewhere (5), although Broderick et al. (3) observed no difference by site. Whereas our results for distal colon cancer are in the same direction and of similar magnitude to those found by Tenesa et al. (5) for colon cancer, in a case-case comparison of rectal versus colon, the current study findings are opposite those indicated in the Tenesa study (5). However, the discrepancies are in large part due to our lack of evidence in rectal cases and may also be due to Tenesa et al. not stratifying by proximal and distal colon. Therefore, additional site-specific investigations are warranted for this locus. We investigated multilocus associations; similar to previous haplotype analyses findings (3), our results are consistent with a single risk allele at the 18q21 locus.
It should be noted that the increased power gained to detect association using familial cases (19) is accompanied by an overestimate of the effect size as measured by the OR for the general population. Tests of the null hypothesis (effect size or independence) remain valid with the combined populations; the Utah site contains predominantly familial cases and as such, whereas our significance values are valid, our meta OR estimates may be inflated.
In conclusion, our results from a multicenter, combined independent and related case-control resource provide replication of association of two regions at 8q24 and the locus at 18q21 with CRC. At 8q24, two regions of association were evident, with a possible suggestion for gene-environment interaction. At 18q21, our results provide confirmatory evidence of association with CRC that differs by tumor site.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Grant support: The genotype data collection and analysis was supported by a NIH grants CA123550 and CA98364 (N.J. Camp). Research was supported by the Utah Cancer Registry, which is funded by contract N01-PC-35141 from the National Cancer Institute Surveillance, Epidemiology, and End Results program with additional support from the Utah State Department of Health and the University of Utah. Partial support for all data sets within the Utah Population Database was provided by the University of Utah Huntsman Cancer Institute. Recruitment, data collection, and genotyping in Sheffield was supported by Yorkshire Cancer Research grants to A. Cox and Prof. Mark Meuth. Data collection in Leeds was supported by Cancer Research UK Programme Award C588/A4994 (D.T. Bishop).
Note: Supplementary data for this article are available at Cancer Epidemiology Biomarkers and Prevention Online (http://cebp.aacrjournals.org/).
A. Cox and N.J. Camp contributed equally to this work.
Acknowledgments
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
We thank the Study Coordinators, Laboratory Specialist Kim Nguyen, and Computer Specialist Jathine Wong.