Background:

The genetic basis for most individuals with high cumulative lifetime colonic adenomas is unknown. We investigated associations between known colorectal cancer–risk single-nucleotide polymorphisms (SNP) and increasing cumulative adenoma counts.

Methods:

The Cooperative Studies Program #380 screening colonoscopy cohort includes 612 selected participants age 50 to 75 with genotyped blood samples and 10 years of clinical follow-up. We evaluated 41 published “colorectal cancer–risk SNPs” for associations with individual cumulative adenoma counts or having ≥10 cumulative adenomas. SNPs were analyzed singly or combined in a polygenic risk score (PRS). The PRS was constructed from eight published SNPs associated with multiple adenomas, termed “adenoma-risk SNPs.”

Results:

Four colorectal cancer–risk SNPs were associated with increasing cumulative adenoma counts (P < 0.05): rs12241008 (gene: VTI1A), rs2423279 (BMP2/HAO1), rs3184504 (SH2B3), and rs961253 (FERMT1/BMP2), with risk allele risk ratios of 1.31, 1.29, 1.24, and 1.23, respectively. Three colorectal cancer–risk SNPs were associated with ≥10 cumulative adenomas (P < 0.05), with risk allele odds ratios of 2.09 (rs3184504), 2.30 (rs961253), and 1.94 (rs3217901). A weighted PRS comprised of adenoma-risk SNPs was associated with higher cumulative adenomas (weighted rate ratio = 1.57; P = 0.03).

Conclusions:

In this mostly male veteran colorectal cancer screening cohort, several known colorectal cancer–risk SNPs were associated with increasing cumulative adenoma counts and the finding of ≥10 cumulative adenomas. In addition, an increasing burden of adenoma-risk SNPs, measured by a weighted PRS, was associated with higher cumulative adenomas.

Impact:

Future work will seek to validate these findings in different populations and then augment current colorectal cancer risk prediction tools with precancerous, adenoma genetic data.

This article is featured in Highlights of This Issue, p. 2105

Studies suggest that individuals with high cumulative adenoma counts may be at significantly increased risk for poor clinical outcomes, including colorectal cancer, need for colectomy, and extra-intestinal malignancies (1, 2). As such, colorectal cancer screening guidelines now suggest that individuals who are found to have ≥10 cumulative lifetime adenomas be considered for genetic counseling and testing, as they are thought to have an increased risk for actionable hereditary colorectal cancer syndromes (3–5).

However, current genetic testing recommendations are based on limited data from referral populations. These studies found that highly-penetrant gene mutations associated with hereditary colonic adenomatous polyposis syndromes were actually present in only a small fraction of individuals who would be typically encountered during routine colorectal cancer screening and surveillance (i.e., those with 10–99 cumulative adenomas; refs. 6, 7). Therefore, the genetic basis for most individuals with high cumulative adenoma counts remains unexplained, and referring everyone with ≥10 cumulative adenomas for genetic testing will likely be costly and uninformative.

Although ongoing work continues to identify additional genes associated with this high-risk colonic adenomatous polyposis phenotype (8–11), these mutations remain rare in the general population. Rather, it is likely that many genetic changes responsible for this phenotype are low-penetrant single-nucleotide polymorphisms (SNP) that also have an association with colorectal cancer development (12, 13). Large meta-analyses of numerous genome-wide association studies have identified over 100 SNPs associated with the development of colorectal cancer (i.e., “colorectal cancer–risk SNPs”; refs. 13–17), but few studies have evaluated whether these colorectal cancer–risk SNPs are also associated with the development of multiple precancerous adenomas (18–23). Furthermore, these prior studies investigating SNPs associated with multiple adenomas (“adenoma-risk SNPs”) were performed in heterogeneous populations mostly with prior adenomas or those referred for genetic testing, and so there is a high likelihood of selection bias which limits generalizability. Additional investigations are needed into the low-penetrant SNPs that may be associated with the colonic adenomatous polyposis phenotype, either singly or in combination, to help further inform colorectal cancer clinical and genetic screening guidelines.

The goal of our analysis was to evaluate associations between previously identified colorectal cancer–risk SNPs and cumulative adenoma counts in a colorectal cancer screening population. We also investigated whether a polygenic risk score (PRS), comprised of multiple prespecified adenoma-risk SNPs, would be associated with higher cumulative adenoma counts.

In 1993, the U.S. Department of Veterans Affairs (VA) Cooperative Studies Program (CSP) initiated CSP #380, a prospective screening study of asymptomatic individuals in which the primary aim was to characterize the distribution of adenomas on baseline colonoscopy (24). The design and outcomes of CSP #380 have been described previously (25). In brief, this cohort is comprised of asymptomatic veterans age 50 to 75 from 13 VA Medical Centers who underwent a baseline screening colonoscopy from 1994 to 1997. Although participants were oversampled for a family history of colorectal cancer in first-degree relatives in the initial protocol, there was no association found between a first-degree family history of colorectal cancer and the development of ≥10 cumulative adenomas in the overall CSP #380 cohort. Following the completion of a randomized component comparing different surveillance intervals within the first 5 years based on index findings, a longitudinal study now with at least 10 years of follow-up was conducted to identify risk factors, including polyp number and histology, associated with the development of advanced neoplasia. Colonoscopy and pathology records were ascertained through the VA electronic medical record, which included procedures and pathology performed outside of the VA. The protocol was approved by Institutional Review Boards at each participating center.

In this analysis, our outcome of cumulative adenoma counts was characterized in two clinically relevant ways. The first outcome is the number of cumulative adenomas per individual, including advanced adenomas and invasive adenocarcinoma, summed across all available colonoscopies and associated pathology reports over at least 10 years of follow-up. The second outcome is the binary finding of ≥10 cumulative adenomas compared to those with zero cumulative adenomas, to allow for as much discrimination as possible between cases–controls. Importantly, no instances of sessile serrated lesions (SSL) were identified, as these exams were performed prior to an awareness of SSLs. However, there were no serrated polyps with adenomatous change or dysplasia in this cohort, therefore the finding of “no adenomas” includes both hyperplastic polyps or no polyps.

As part of the initial CSP #380 study, a biorepository was developed that included selected individuals with large (≥10 mm) polyps or colorectal cancer identified at baseline colonoscopy as well as age and gender-matched individuals with small (<10 mm) polyps or without neoplasia. The biorepository is comprised of 815 out of 3,121 (26.1%) total CSP #380 participants. A total of 612 individuals had sufficient DNA to allow genetic analysis and passed genome-wide association study (GWAS) quality control measures, including 176 (28.8%) individuals with large polyps or colorectal cancer and 436 (71.2%) with small polyps or no polyps. A CONSORT diagram is included as Fig. 1. A comparison of individuals with analyzable DNA from the biorepository and those not represented is included as Supplementary Table S1. This biorepository includes 612 individuals without known genetic syndromes.

Figure 1.

CONSORT flow diagram. Of 3,121 participants in the CSP #380 prospective screening cohort, 815 were selected for inclusion in the biorepository, of whom 612 had sufficient DNA and are included in this analysis. CRC, colorectal cancer.

Figure 1.

CONSORT flow diagram. Of 3,121 participants in the CSP #380 prospective screening cohort, 815 were selected for inclusion in the biorepository, of whom 612 had sufficient DNA and are included in this analysis. CRC, colorectal cancer.

Close modal

Genotyping was completed on participants' blood using the Illumina Infinium Omni2.5–8 v1.3 GWAS Beadchip; data analysis showed excellent genotyping quality. For this analysis, we evaluated a set of 43 prespecified colorectal cancer–risk SNPs (Supplementary Table S2), which were obtained from prior literature in similar populations (13, 16, 17). Two SNPs with a minor allele frequency in CSP #380 of less than 0.05 were excluded (rs12603526 and rs17879961), resulting in a total of 41 colorectal cancer–risk SNPs included for evaluation. These 41 colorectal cancer–risk SNPs include a subset of eight SNPs previously shown to be associated with multiple adenomas, so these eight specific SNPs are also termed “adenoma-risk SNPs” (Table 1; ref. 18). Otherwise, age at last colonoscopy was recorded since participants had different amounts of follow-up and at different ages. A principal component (PC) analysis on a set of independent SNPs was performed to develop genotype-based PCs to allow for adjustment based on genetic ancestry.

Table 1.

Adenoma-risk SNPs and ORs for cumulative adenomas.

SNP (n = 8)CHRCSP #380 risk alleleCSP #380 risk allele frequencyCSP #380 risk allele RR for multiple adenomasP valueCSP #380 risk allele OR for ≥10 cumulative adenomasP valuePublished adenoma risk allele ORPublished P valueClosest gene or putative target
rs10795668 10 0.71 1.02 0.80 1.20 0.59 1.13 0.004 None 
rs10936599 0.76 1.01 0.93 1.20 0.64 1.13 0.007 TERC 
rs1957636 14 0.56 1.10 0.27 1.04 0.92 1.10 0.012 BMP4 
rs3802842 11 0.30 1.08 0.39 1.41 0.29 1.22 1.74 × 10−6 POU2AF1 
rs4444235 14 0.54 1.06 0.46 1.22 0.53 1.15 0.001 BMP4 
rs4939827 18 0.50 1.12 0.17 1.09 0.79 1.19 1.17 × 10−5 SMAD7 
rs6983267 0.54 1.09 0.30 1.42 0.32 1.20 2.68 × 10−6 MYC 
rs961253 20 0.33 1.23 0.01 2.30 0.02 1.18 4.42 × 10−5 BMP2 
SNP (n = 8)CHRCSP #380 risk alleleCSP #380 risk allele frequencyCSP #380 risk allele RR for multiple adenomasP valueCSP #380 risk allele OR for ≥10 cumulative adenomasP valuePublished adenoma risk allele ORPublished P valueClosest gene or putative target
rs10795668 10 0.71 1.02 0.80 1.20 0.59 1.13 0.004 None 
rs10936599 0.76 1.01 0.93 1.20 0.64 1.13 0.007 TERC 
rs1957636 14 0.56 1.10 0.27 1.04 0.92 1.10 0.012 BMP4 
rs3802842 11 0.30 1.08 0.39 1.41 0.29 1.22 1.74 × 10−6 POU2AF1 
rs4444235 14 0.54 1.06 0.46 1.22 0.53 1.15 0.001 BMP4 
rs4939827 18 0.50 1.12 0.17 1.09 0.79 1.19 1.17 × 10−5 SMAD7 
rs6983267 0.54 1.09 0.30 1.42 0.32 1.20 2.68 × 10−6 MYC 
rs961253 20 0.33 1.23 0.01 2.30 0.02 1.18 4.42 × 10−5 BMP2 

We constructed a PRS similar to prior studies (16, 17, 20, 21) using the eight prespecified adenoma-risk SNPs for this analysis (Table 1). A PRS was assigned for each individual based on the total number of risk alleles present per adenoma-risk SNP (regardless if the allele is major or minor), with a range per SNP between 0 and 2 points. A specific adenoma-risk SNP received a score of 0 if the risk allele was not present, 1 if heterozygous, and 2 if homozygous. The score was summed across all adenoma-risk alleles present in each individual for the unweighted PRS. The weighted PRS was calculated by adjusting each SNP's effect size by the published risk allele log odds ratio (OR) for multiple adenomas (18), and summing together all effect size adjusted adenoma-risk alleles present in each individual.

Descriptive statistics of the dataset are reported using proportions and means with standard deviations (SD), and included in Table 2 and Supplementary Table S1. We fit a Poisson regression model to the cumulative adenoma count. Descriptive histograms of our data (Fig. 2) helped determine these corrections, which included use of a zero-inflated, negative binomial model. Individual colorectal cancer–risk SNP associations with increasing cumulative adenoma counts were modeled as rate ratios [RR with 95% confidence intervals (CI)] by corrected Poisson regression, adjusting for age at last colonoscopy (since participants have varying degrees of follow-up), gender, and PCs. The same corrected Poisson regression model was used to analyze the association between the PRS and higher cumulative adenoma counts, while controlling for the aforementioned covariates. Logistic regression was used to generate ORs for associations between the colorectal cancer–risk SNPs and the binary outcome of ≥10 cumulative adenomas or zero cumulative adenomas, again controlling for age at last colonoscopy, gender, and PCs. Wilcoxon rank sum tests were used to compare the mean adenoma PRS (both unweighted and weighted by log OR for effect size) between individuals with ≥10 cumulative adenomas versus those with no adenomas. Finally, stratified analyses were performed by sex (male vs. female), race (European vs. non-European), and age at last colonoscopy (age 50–64 vs. age 65+). All statistical analyses were performed using R (version 3.6.1). Given that we tested only prespecified colorectal cancer or adenoma-risk SNPs identified in independent data sets, all tests of significance were performed at an α level of 0.05.

Table 2.

Demographic and clinical characteristics of CSP #380 biorepository participants.

CharacteristicNumber of participants (n = 612)
Age, mean (SD) 64.1 (6.7) 
Male sex, N (%) 594 (97.1) 
Race, N (%) 
 White, non-Hispanic 500 (81.7) 
 Black, non-Hispanic 57 (9.3) 
 Hispanic 32 (5.2) 
 American Indian/Alaskan Native 14 (2.3) 
 Asian 7 (1.1) 
Daily smoker, N (%) 136 (22.2) 
Past smoker, N (%) 328 (53.6) 
First-degree relatives with colorectal cancer, N (%) 136 (22.2) 
Baseline cancer histology, N (%) 20 (3.3) 
Baseline advanced neoplasia, N (%) 172 (28.1) 
Number of adenomas, N (%) 
 0 246 (40.2) 
 1–2 192 (31.4) 
 3–9 151 (24.7) 
 10–22 23 (3.8) 
CharacteristicNumber of participants (n = 612)
Age, mean (SD) 64.1 (6.7) 
Male sex, N (%) 594 (97.1) 
Race, N (%) 
 White, non-Hispanic 500 (81.7) 
 Black, non-Hispanic 57 (9.3) 
 Hispanic 32 (5.2) 
 American Indian/Alaskan Native 14 (2.3) 
 Asian 7 (1.1) 
Daily smoker, N (%) 136 (22.2) 
Past smoker, N (%) 328 (53.6) 
First-degree relatives with colorectal cancer, N (%) 136 (22.2) 
Baseline cancer histology, N (%) 20 (3.3) 
Baseline advanced neoplasia, N (%) 172 (28.1) 
Number of adenomas, N (%) 
 0 246 (40.2) 
 1–2 192 (31.4) 
 3–9 151 (24.7) 
 10–22 23 (3.8) 
Figure 2.

Histogram illustrating the distribution of total cumulative adenoma counts from CSP #380 participants through 10 years.

Figure 2.

Histogram illustrating the distribution of total cumulative adenoma counts from CSP #380 participants through 10 years.

Close modal

The overall CSP #380 cohort includes 3,121 participants, of whom 612 (19.6%) had genotyped blood samples in the biorepository, and thus are included in this analysis. Table 2 describes the baseline demographic and clinical characteristics of all included individuals, including the distribution of cumulative adenoma counts. The mean age was 64.1 (SD 6.7) years, 97.1% were male, 81.7% were white, and 22.2% had a family history of colorectal cancer in first-degree relatives. At baseline, 174 (28.4%) were found to have advanced neoplasia, including 20 (3.2%) with colorectal cancer. Over 10 years, most individuals were found to have zero cumulative adenomas (40.2%), whereas 31.4% had one or two cumulative adenomas, 24.7% had three to nine cumulative adenomas, and 3.8% had ≥10 cumulative adenomas (Fig. 2). The maximum individual cumulative adenoma count was 22.

Supplementary Table S1 compares baseline characteristics between participants included and not included in the CSP #380 biorepository. When compared with the CSP #380 participants not included in the biorepository, the individuals represented in the biorepository are slightly older (64.1 vs. 62.6, P < 0.0001), more likely to have a first-degree family history of colorectal cancer (22.2% vs. 11.9%, P < 0.0001), and had a higher prevalence of advanced neoplasia (including colorectal cancer) on baseline screening colonoscopy (28.1% vs. 6.3%, P < 0.0001). All of these factors are expected and related to the initial design of the biorepository.

Supplementary Table S3 includes results for all 41 prespecified colorectal cancer–risk SNPs and their association with cumulative adenoma counts. Four of these colorectal cancer–risk SNPs were statistically significantly associated with increasing cumulative adenoma counts: rs12241008 (risk allele RR = 1.31; 95% CI, 1.00–1.71), rs2423279 (RR = 1.29; 95% CI, 1.07–1.55), rs3184504 (RR = 1.24; 95% CI, 1.04–1.46), and rs961253 (RR = 1.23; 95% CI, 1.04–1.47). Thus, for these four colorectal cancer–risk SNPs, mean cumulative adenoma counts were 1.2 to 1.3× higher for each additional risk allele present. These four SNPs are associated with the genes VTI1A (13, 17), HAO1/BMP2 (4, 19), SH2B3 (15–18), and BMP2 (6, 8, 9, 16, 17), respectively. Furthermore, these risk alleles are the same as reported in prior studies for colorectal cancer, with published risk allele RRs for colorectal cancer of 1.12, 1.07, 1.09, 1.12, respectively (13). Of the subset of eight colorectal cancer–risk SNPs previously found to be associated with multiple adenomas (i.e., “adenoma-risk SNPs”), only one (rs961253) was significant in our dataset, with the same literature risk allele for multiple adenomas (published RR of 1.18, Table 1; ref. 18). When aggregating all eight adenoma-risk SNPs into the PRS, the median unweighted PRS was 8 (7–9 IQR) and the median weighted PRS was 1.16 (0.97–1.38 IQR) for all included participants. Only the weighted PRS was associated with higher cumulative adenoma counts (unweighted RR = 1.03, P = 0.39; weighted RR = 1.57, P = 0.03) in our dataset. In other words, for each additional weighted PRS point, mean cumulative adenoma counts were increased by approximately 1.6x per individual. Figure 3 shows the average unweighted and weighted PRS by cumulative adenoma count. The results from sex, race, and age stratified analyses are available in Supplementary Table S4. In the age stratified analysis, both the unweighted and weighted PRS were significantly associated with higher cumulative counts in those age 50 to 64 at the time of last colonoscopy (unweighted RR = 1.17, P = 0.05; weighted RR = 2.48, P = 0.04), but not those age 65+ (unweighted RR = 1.04, P = 0.25; weighted RR = 21.39, P = 0.14).

Three of the prespecified colorectal cancer–risk SNPs were found to be associated with the finding of ≥10 cumulative adenomas, when compared with those with no cumulative adenomas: rs3184504 (risk allele OR = 2.09; 95% CI, 1.05–4.17), rs961253 (OR = 2.30; 95% CI, 1.17–4.49), and rs3217901 (OR = 1.94; 95% CI, 1.02–3.67, gene CCND2; ref. 16). Thus, for each additional risk allele of any of these three SNPs, the odds of developing ≥10 cumulative adenomas were about two times higher. These are the same risk alleles found in the literature, and compare to published risk allele RRs for colorectal cancer of 1.09, 1.12, and 1.10, respectively (13, 16). The mean adenoma PRS was slightly higher in those with ≥10 cumulative adenomas compared with those without, however these differences were not statistically significant: unweighted PRS of 8.43 versus 7.82, respectively (P = 0.13), and weighted PRS of 1.24 versus 1.13, respectively (P = 0.10). Results were similar in the sex, race, and age stratified analysis (Supplementary Table S5).

Figure 3.

Average unweighted (A) and weighted (B) PRS by cumulative adenoma count. The PRS was comprised of only eight SNPs known to be associated with multiple adenomas in published literature (i.e., “adenoma-risk SNPs”). Only the weighted PRS was associated with higher cumulative adenoma counts (unweighted RR = 1.03, P = 0.39; weighted RR = 1.57, P = 0.03) in our dataset.

Figure 3.

Average unweighted (A) and weighted (B) PRS by cumulative adenoma count. The PRS was comprised of only eight SNPs known to be associated with multiple adenomas in published literature (i.e., “adenoma-risk SNPs”). Only the weighted PRS was associated with higher cumulative adenoma counts (unweighted RR = 1.03, P = 0.39; weighted RR = 1.57, P = 0.03) in our dataset.

Close modal

Using the DNA from blood in selected participants from the CSP #380 screening cohort, we identified four known colorectal cancer–risk SNPs to also be associated with increasing cumulative adenoma counts and three known colorectal cancer–risk SNPs to be associated with the finding of ≥10 cumulative adenomas. Of the subset of colorectal cancer–risk SNPs previously shown to be associated with adenoma-risk (i.e., “adenoma-risk SNPs”), one was also statistically significant in our dataset (rs961253). When combining these eight known adenoma-risk SNPs into an aggregate PRS, we found that an increased burden of these adenoma-risk SNPs per individual was associated with higher cumulative adenoma counts when taking into account the adenoma-risk SNPs' published effect sizes (RR = 1.57; P = 0.03). This work suggests a role for developing precancerous “adenoma PRSs” (as opposed to colorectal cancer PRSs) using larger genomic datasets to provide enhanced risk stratification for the prevention of colorectal cancer.

Prior studies have also found similar germline genetic variants between individuals with colorectal cancer and those with multiple adenomas. A study by Carvajal-Carmona and colleagues found that eight of 18 previously identified colorectal cancer–risk SNPs were overrepresented in colorectal cancer–free individuals with adenomas when compared with controls (18). Another study by Hes and colleagues showed that two of 16 colorectal cancer–risk SNPs were associated with the finding of >10 adenomas, with ORs of 1.6 and 1.5 (19). These findings, along with three additional studies, are summarized in Supplementary Table S6. Our findings extend previous research, as we additionally found four known colorectal cancer–risk SNPs to be also associated with increasing cumulative adenoma counts, as well as three colorectal cancer–risk SNPs associated with ≥10 cumulative adenomas. Furthermore, two of our significant colorectal cancer–risk SNPs (rs961253, rs2423279) have been previously shown to be associated with adenoma development (18, 21, 26). We were unable to support growing evidence implicating an association between the development of multiple adenomas and the SNPs rs3802842, rs6983267, rs10505477, rs10795668, and rs4779584 (18–20, 22, 23, 27). However, the overall consistency of these results suggest common underlying risk genes between colorectal cancer and adenomatous polyposis.

Although many genetic risk scores have been tested to predict colorectal cancer risk in various study populations (28), few tools have been developed to predict precancerous adenoma risk. A recent study by Weigl and colleagues did not find an association between a PRS of 48 colorectal cancer–risk SNPs and the finding of nonadvanced adenomas (16). However, a study by Abuli and colleagues found that an increased burden of 14 colorectal cancer–risk SNPs was associated with a higher risk of advanced adenomas and/or ≥3 adenomas (21). However, this PRS incorporated only the SNPs found to be individually associated with this outcome in their population, thus strongly biasing the results. Finally, three other studies have suggested that a higher weighted PRS comprised of different subsets of published colorectal cancer–risk SNPs were associated with varying degrees of increased adenoma counts (20, 23, 27) in differing populations. Our study is unique and builds on prior literature, in that it utilized a PRS created solely from unbiased precancerous adenoma-risk SNPs and found that an increasing burden of adenoma-risk SNPs present per individual are associated with higher cumulative adenoma counts. Thus, a precancerous PRS comprised of germline variants may have the potential to earlier identify individuals at risk for colonic adenomatous polyposis, particularly those with a young-onset phenotype. Such information could inform genetic testing guidelines (7) or recommendations regarding age to start colorectal cancer screening (17).

The colorectal cancer–risk SNPs in this analysis found to be associated with increasing cumulative adenoma counts have been mapped to nearby genes of VTI1A (rs12241008), BMP2 (rs2423279, rs961253), SH2B3 (rs3184504), and CCND2 (rs3217901). The functional relevance of these genes on the outcome of adenoma or colorectal cancer formation have been theorized to disrupt proper tissue architecture through varying effects on cell adhesion, differentiation, and migration via dysregulation of the Wnt (VTI1A), TGF-β/BMP (BMP2), or MAPK (SH2B3, CCND2) signaling pathways (13). Only SH2B3 is identified as a missense mutation, which increases the likelihood that this is a candidate driver gene in the adenoma–carcinoma sequence. However, although several of these SNPs are located in intergenic regions (rs12241008, rs961253), it is possible that their impact on splicing may affect gene function through disruption of key regulatory elements or other potential mechanisms (29). Additional genetic and functional studies will be needed to validate the impact of these genetic variants on colonic adenomatous polyposis and/or colorectal cancer risk. Clarifying potential novel tumorigenesis or protein pathways may identify new therapeutic targets, which may be ideal for developing vaccines other chemopreventative agents to prevent colorectal cancer (30, 31).

Our study has important strengths. We investigated a novel precancerous adenoma PRS to assess risk for specific adenoma counts in individuals from a well characterized colonoscopy cohort with associated genetic data and 10 years of clinical follow-up. These findings are preliminary analyses on a high-risk clinical phenotype for whom there are limited data to inform colorectal cancer surveillance and genetic testing guidelines. Germline variants that predispose to high adenoma counts may predispose to early onset colorectal cancer, increase risk for missed lesions, or promote progression to colorectal cancer between colonoscopies (32–34). Thus, it will likely be important to identify these variants early and develop risk stratification protocols for colorectal cancer prevention. Our work should help conceive similar analyses in larger genomic datasets which seek to risk stratify screening participants using a precancer, adenoma genetic risk score to prevent colorectal cancer development (35). Importantly, we feel our tool represents an unbiased risk score, which should allow comparison to other risk scores.

This study has several limitations. The CSP #380 biorepository is comprised predominantly of Caucasian male veterans, and so possible differences in sex- or race-specific outcomes may be missed. The generalizability of our results may also be limited due to selection of CSP #380 biorepository participants based on the presence of advanced neoplasia or not at baseline colonoscopy. As such, only 612 of the initial 3,121 CSP #380 participants are included in this analysis. The baseline characteristics of participants selected for the biorepository differed from the overall CSP #380 cohort in several important ways that may have introduced potential bias due to incomplete genomic data (Supplementary Table S1). The individuals in the biorepository are slightly older, are more likely to have a first-degree family history of colorectal cancer, and have a higher incidence of colorectal cancer. In addition, follow-up over 10 years was variable, and so adenomas may have been present but undetected in some participants due to varying intensity of follow-up colonoscopies. Furthermore, significant false-positive associations are a possibility, as we did not adjust for numbers of comparisons. To address this concern, we used prespecified SNPs chosen based on prior knowledge of association patterns with colorectal cancer and the multiple adenoma phenotype to justify the conservative significance threshold. Finally, although the PRS in our analysis only included eight adenoma-risk SNPs based on a single prior study with a similar population, other studies have shown additional SNPs to be associated with multiple adenomas in different populations (Supplementary Table S6). Overall, while the consistency of our results with the literature seem to support our findings, our analyses should be considered exploratory given the small sample size and few cases of individuals with ≥10 cumulative adenomas. Future work in larger genomic datasets with diverse populations will be important for developing a standardized precancerous adenoma PRS based on an optimal number of specific adenoma-risk SNPs with reliable performance in independent samples across various populations (28).

In summary, we found specific germline genetic variants known to be associated with colorectal cancer development are also associated with increasing cumulative lifetime adenoma counts. In addition, an increasing burden of adenoma-risk SNPs, as measured by a weighted PRS, was also associated with higher cumulative adenoma counts. This work will serve as the basis for planned investigations to test precancerous risk stratification tools that could more accurately identify individuals who would benefit from genetic testing, earlier colorectal cancer screening, or enhanced colorectal cancer surveillance. Future work investigating the genetic underpinnings of the colonic adenomatous polyposis phenotype may also help prevent colorectal cancer by recognizing and addressing important colorectal cancer gene–environment interactions, clarifying therapeutic targets in cancer progression pathways, and developing novel colorectal cancer screening modalities using precancer genomic biomarkers (36–38).

D. Lieberman reports grants from the U.S. Department of Veterans Affairs during the conduct of the study. No potential conflicts of interest were disclosed by the other authors.

The views expressed in this article are those of the authors and do not necessarily represent the views of the U.S. Department of Veterans Affairs or the government of the United States.

B.A. Sullivan: Conceptualization, methodology, writing–original draft, writing–review and editing. X. Qin: Data curation, formal analysis, methodology. T.S. Redding IV: Data curation, formal analysis. Z.F. Gellad: Writing–review and editing. A. Stone: Data curation. D. Weiss: Writing–review and editing. A.N. Madison: Project administration, writing–review and editing. K.J. Sims: Data curation, writing–review and editing. C.D. Williams: Writing–review and editing. D. Lieberman: Supervision, writing–review and editing. E.R. Hauser: Supervision, methodology, writing–review and editing. D. Provenzale: Supervision, writing–review and editing.

This work was supported by the U.S. Department of Veterans Affairs Cooperative Studies Program.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Kallenberg
FGJ
,
Latchford
A
,
Lips
NC
,
Aalfs
CM
,
Bastiaansen
BAJ
,
Clark
SK
, et al
Duodenal adenomas in patients with multiple colorectal adenomas without germline APC or MUTYH mutations
.
Dis Colon Rectum
2018
;
61
:
58
66
.
2.
Tieu
AH
,
Edelstein
D
,
Axilbund
J
,
Romans
KE
,
Brosens
LA
,
Wiley
E
, et al
Clinical characteristics of multiple colorectal adenoma patients without germline APC or MYH mutations
.
J Clin Gastroenterol
2016
;
50
:
584
8
.
3.
National Comprehensive Cancer Network
.
Colorectal cancer screening (version 1.2017)
.
Plymouth Meeting (PA)
:
NCCN
[cited 2017 Sep 27]. Available from
: http://www.nccn.org/professionals/physician_gls/pdf/colorectal_screening.pdf.
4.
National Comprehensive Cancer Network
.
Genetic/familial high-risk assessment: colorectal (version 2.2017)
.
Plymouth Meeting (PA)
:
NCCN
[cited 2017 Sep 27]. Available from
: http://www.nccn.org/professionals/physician_gls/pdf/genetics_colon.pdf.
5.
Syngal
S
,
Brand
RE
,
Church
JM
,
Giardiello
FM
,
Hampel
HL
,
Burt
RW
, et al
ACG clinical guideline: genetic testing and management of hereditary gastrointestinal cancer syndromes
.
Am J Gastroenterol
2015
;
110
:
223
62
.
6.
Grover
S
,
Kastrinos
F
,
Steyerberg
EW
,
Cook
EF
,
Dewanwala
A
,
Burbidge
LA
, et al
Prevalence and phenotypes of APC and MUTYH mutations in participants with multiple colorectal adenomas
.
JAMA
2012
;
308
:
485
92
.
7.
Stanich
PP
,
Pearlman
R
,
Hinton
A
,
Gutierrez
S
,
LaDuca
H
,
Hampel
H
, et al
Prevalence of germline mutations in polyposis and colorectal cancer–associated genes in patients with multiple colorectal polyps
.
Clin Gastroenterol Hepatol
2019
;
17
:
2008
15
.
8.
Belhadj
S
,
Mur
P
,
Navarro
M
,
Gonzalez
S
,
Morena
V
,
Capella
G
, et al
Delineating the phenotypic spectrum of the NTHL1-associated polyposis
.
Clin Gastrol Hep
2017
;
15
:
461
2
.
9.
Bellido
F
,
Pineda
M
,
Aiza
G
,
Valdes-Mas
R
,
Navarro
M
,
Puente
DA
, et al
POLE and POLD1 mutations in 529 kindred with familial colorectal cancer and/or polyposis: review of reported cases and recommendations for genetic testing and surveillance
.
Genet Med
2016
;
18
:
325
32
.
10.
Spier
I
,
Holzapfel
S
,
Altmüller
J
,
Zhao
B
,
Horpaopan
S
,
Vogt
S
, et al
Frequency and phenotypic spectrum of germline mutations in POLE and seven other polymerase genes in 266 patients with colorectal adenomas and carcinomas
.
Int J Cancer
2015
;
137
:
320
31
.
11.
Weren
RD
,
Ligtenberg
MJ
,
Geurts van Kessel
A
,
De Voer
RM
,
Hoogerbrugge
N
,
Kuiper
RP
. 
NTHL1 and MUTYH polyposis syndromes: two sides of the same coin?
J Pathol
2018
;
244
:
135
42
.
12.
Wood
LD
,
Parsons
DW
,
Jones
S
,
Lin
J
,
Sjoblom
T
,
Leary
RJ
, et al
The genomic landscapes of human breast and colorectal cancers
.
Science
2007
;
318
:
1108
13
.
13.
Peters
U
,
Bien
S
,
Zubair
N
. 
Genetic architecture of colorectal cancer
.
Gut
2015
;
64
:
1623
36
.
14.
Peters
U
,
Jiao
S
,
Schumacher
FR
,
Hutter
CM
,
Aragaki
AK
,
Baron
JA
, et al
Identification of genetic susceptibility loci for colorectal tumors in a genome-wide meta-analysis
.
Gastroenterology
2013
;
144
:
799
807
.
15.
Huyghe
JR
,
Bien
SA
,
Harrison
TA
,
Kang
HM
,
Chen
S
,
Schmit
SL
, et al
Discovery of common and rare genetic risk variants for colorectal cancer
.
Nat Genet
2019
;
51
:
76
87
.
16.
Weigl
K
,
Thomsen
H
,
Balavarca
Y
,
Hellwege
JN
,
Shrubsole
MJ
,
Brenner
Het
. 
Genetic risk score is associated with prevalence of advanced neoplasms in a colorectal cancer screening population
.
Gastroenterology
2018
;
155
:
88
98
.
17.
Jeon
J
,
Du
M
,
Schoen
RE
,
Hoffmeister
M
,
Newcomb
PA
,
Berndt
SI
, et al
Determining risk of colorectal cancer and starting age of screening based on lifestyle, environmental, and genetic factors
.
Gastroenterology
2018
;
154
:
2152
4
.
18.
Carvajal-Carmona
LG
,
Zauber
AG
,
Jones
AM
,
Howarth
K
,
Wang
J
,
APC Trial Collaborators
et al
Much of the population genetic risk of colorectal cancer is likely to be mediated through susceptibility to adenomas
.
Gastroenterology
2013
;
144
:
53
5
.
19.
Hes
FJ
,
Ruano
D
,
Nieuwenhuis
M
,
Tops
CM
,
Schrumpf
M
,
Nielsen
M
, et al
Colorectal cancer risk variants on 11q23 and 15q13 are associated with unexplained adenomatous polyposis
.
J Med Genet
2014
;
51
:
55
60
.
20.
Cheng
TH
,
Gorman
M
,
Martin
L
,
Barclay
E
,
Casey
G
,
Colon Cancer Family Registry
et al
Common colorectal cancer risk alleles contribute to the multiple colorectal adenoma phenotype, but do not influence colonic polyposis in FAP
.
Eur J Hum Genet
2015
;
23
:
260
3
.
21.
Abulí
A
,
Castells
A
,
Bujanda
L
,
Lozano
JJ
,
Bessa
X
,
Hernandez
C
, et al
Genetic variants associated with colorectal adenoma susceptibility
.
PLoS One
2016
;
11
:
e0153084
.
22.
Berndt
SI
,
Potter
JD
,
Hazra
A
,
Yeager
M
,
Thomas
G
,
Makar
KW
, et al
Pooled analysis of genetic variation at chromosome 8q24 and colorectal neoplasia risk
.
Hum Mol Genet
2008
;
17
:
2665
72
.
23.
Zhang
B
,
Shrubsole
MJ
,
Li
G
,
Cai
Q
,
Edwards
T
,
Smalley
WE
, et al
Association of genetic variants for colorectal cancer differs by subtypes of polyps in the colorectum
.
Carcinogenesis
2012
;
33
:
2417
23
.
24.
Lieberman
DA
,
Weiss
DG
,
Bond
JH
,
Ahnen
DJ
,
Garewal
H
,
Chejfec
G
. 
Use of colonoscopy to screen asymptomatic adults for colorectal cancer
.
N Engl J Med
2000
;
343
:
162
8
.
25.
Lieberman
D
,
Sullivan
BA
,
Hauser
ER
,
Qin
X
,
Musselwhite
LW
,
O'Leary
MC
, et al
Baseline colonoscopy findings associated with 10-year outcomes in a screening cohort undergoing colonoscopy surveillance
.
Gastroenterology
2020
;
158
:
862
74
.
26.
Burnett-Hartman
AN
,
Newcomb
PA
,
Hutter
CM
,
Peters
U
,
Passarelli
MN
,
Schwartz
MR
, et al
Variation in the association between colorectal cancer susceptibility loci and colorectal polyps by polyp type
.
Am J Epidemiol
2014
;
180
:
223
32
.
27.
Hang
D
,
Joshi
AD
,
He
X
,
Chan
AT
,
Jovani
M
,
Gala
MK
, et al
Colorectal cancer susceptibility variants and risk of conventional adenomas and serrated polyps: results from three cohort studies
.
Int J Epidemiol
2020
;
49
:
259
69
.
28.
McGeoch
L
,
Saunders
CL
,
Griffin
SJ
,
Emery
JD
,
Walter
FM
,
Thompson
DJ
, et al
Risk prediction models for colorectal cancer incorporating common genetic variants: a systematic review
.
Cancer Epidemiol Biomarkers Prev
2019
;
28
:
1580
93
.
29.
Bass
AJ
,
Lawrence
MS
,
Brace
LE
,
Ramos
AH
,
Drier
Y
,
Cibulskis
K
, et al
Genomic sequencing of colorectal adenocarcinomas identifies a recurrent VTI1A-TCF7L2 fusion
.
Nat Genet
2011
;
43
:
964
8
.
30.
Spira
A
,
Yurgelun
MB
,
Alexandrov
L
,
Rao
A
,
Bejar
R
,
Polyak
K
, et al
Precancer atlas to drive precision prevention trials
.
Cancer Res
2017
;
77
:
1510
41
.
31.
Kimura
T
,
McKolanis
JR
,
Dzubinski
LA
,
Islam
K
,
Potter
DM
,
Salazar
AM
, et al
MUC1 vaccine for individuals with advanced adenoma of the colon: a cancer immunoprevention feasibility study
.
Cancer Prev Res
2013
;
6
:
18
26
.
32.
Stoffel
EM
,
Koeppe
E
,
Everett
J
,
Ulintz
P
,
Kiel
M
,
Osborne
J
, et al
Germline genetic features of young individuals with colorectal cancer
.
Gastroenterology
2018
;
154
:
897
905
.
33.
Zhao
S
,
Wang
S
,
Pan
P
,
Xia
T
,
Chang
X
,
Yang
X
, et al
Magnitude, risk factors, and factors associated with adenoma miss rate of tandem colonoscopy: a systematic review and meta-analysis
.
Gastroenterology
2019
;
156
1661
74
.
34.
Nieuwenhuis
MH
,
Vogt
S
,
Jones
N
,
Nielsen
M
,
Hes
FJ
,
Sampson
JR
, et al
Evidence for accelerated colorectal adenoma–carcinoma progression in MUTYH-associated polyposis?
Gut
2012
;
61
:
734
8
.
35.
Shaukat
A
,
Church
TR
,
Shanley
R
,
Kauff
ND
,
O'Brien
MJ
,
Mills
GM
, et al
Development and validation of a clinical score for predicting risk of adenoma at screening colonoscopy
.
Cancer Epidemiol Biomarkers Prev
2015
;
24
:
913
20
.
36.
Imperiale
TF
,
Ransohoff
DF
,
Itzkowitz
SH
,
Levin
TR
,
Lavin
P
,
Lidgard
GP
, et al
Multitarget stool DNA testing for colorectal-cancer screening
.
N Engl J Med
2014
;
370
:
1287
97
.
37.
Myint
NNM
,
Verma
AM
,
Fernandez-Garcia
D
,
Sarmah
P
,
Tarpey
PS
,
Al-Aqbi
SS
, et al
Circulating tumor DNA in patients with colorectal adenomas: assessment of detectability and genetic heterogeneity
.
Cell Death Dis
2018
;
9
:
894
.
38.
Cohen
JD
,
Li
L
,
Wang
Y
,
Thoburn
C
,
Afsari
B
,
Danilova
L
, et al
Detection and localization of surgically resectable cancers with a multi-analyte blood test
.
Science
2018
;
359
:
926
30
.