Background:

Human gut microbiome has complex relationships with the host, contributing to metabolism, immunity, and carcinogenesis.

Methods:

Summary-level data for gut microbiota and metabolites were obtained from MiBioGen, FINRISK and human metabolome consortia. Summary-level data for colorectal cancer were derived from a genome-wide association study meta-analysis. In forward Mendelian randomization (MR), we employed genetic instrumental variables (IV) for 24 gut microbiota taxa and six bacterial metabolites to examine their causal relationship with colorectal cancer. We also used a lenient threshold for nine apriori gut microbiota taxa as secondary analyses. In reverse MR, we explored association between genetic liability to colorectal neoplasia and abundance of microbiota studied above using 95, 19, and 7 IVs for colorectal cancer, adenoma, and polyps, respectively.

Results:

Forward MR did not find evidence indicating causal relationship between any of the gut microbiota taxa or six bacterial metabolites tested and colorectal cancer risk. However, reverse MR supported genetic liability to colorectal adenomas was causally related with increased abundance of two taxa: Gammaproteobacteria (β = 0.027, which represents a 0.027 increase in log-transformed relative abundance values of Gammaproteobacteria for per one-unit increase in log OR of adenoma risk; P = 7.06×10−8), Enterobacteriaceae (β = 0.023, P = 1.29×10−5).

Conclusions:

We find genetic liability to colorectal neoplasia may be associated with abundance of certain microbiota taxa. It is more likely that subset of colorectal cancer genetic liability variants changes gut biology by influencing both gut microbiota and colorectal cancer risk.

Impact:

This study highlights the need of future complementary studies to explore causal mechanisms linking both host genetic variation with gut microbiome and colorectal cancer susceptibility.

Colorectal cancer is globally recognized as the third most prevalent cancer and ranks as the fourth major contributor to cancer-related mortality (1, 2). The colon is the most heavily colonized part of the gastrointestinal tract (3) by microbiota, which is notable given that it is also the region of the gastrointestinal most commonly affected by cancer in economically developed countries (1).

The human gut microbiome has a complex relationship with the host, contributing to the absorption of nutrients, metabolism, immunity, and carcinogenesis (4, 5). Human gut microbiota in healthy individuals consists primarily of two predominant anaerobic phyla: Firmicutes and Bacteroidetes. However, their proportions and associated species are in a dynamic state and change over time, even within the same individual (6). Several studies have proposed a potential causal relationship between changes in the gut microbiome and the development of colorectal cancer, possibly via chronic metabolic and inflammatory changes (7–9). Meanwhile, gut microbial metabolites such as trimethylamine N-oxide (TMAO) have been also found to likely promote persistent inflammation and weaken host immunity (10), which might increase the risk of colorectal cancer.

Observational studies have linked multiple fecal microbiota (i.e., Enterococcus, Porphyromonas, Salmonella, Pseudomonas, Peptostreptococcus, Actinomyces, Fusobacterium, Bifidobacterium, and Roseburia) to colorectal neoplasia (11). The former seven taxa are consistently reported to be enriched in patients with colorectal cancer compared with healthy individuals, whilst the latter two consistently reported to be more abundant in colorectal cancer–free individuals (11). However, observational studies are unlikely to infer a causal relationship given that the studies conducted to date have been cross-sectional or case–control in design, and so are prone to confounding, reverse causality, and bias.

Mendelian randomization (MR) is an analytic approach which employs germline genetic variants as instrumental variables (IV) for exposures (12). The genetic variants, being randomly distributed during conception, are immune to reverse causality and confounding factors, thus providing estimates of disease risk with minimal interference from extraneous variables. By doing so, MR overcomes the limitations of conventional epidemiologic studies and produces reliable results, provided that pleiotropy (i.e., the phenomenon where genetic variants affect disease outcomes through alternative pathways) is absent (12). Here, we have conducted two-sample MR analyses to determine the causal relationship of gut microbiota and their metabolites with colorectal neoplasia. Using a reverse MR approach, we also tested whether SNPs associated with colorectal cancer and with adenomas are causally associated with specific microbiota to test whether tumor propensity and/or presence influences the gut microbiota.

Study design

We adopted a bidirectional two-sample MR strategy to evaluate the causal relationship between gut microbiota and risk of colorectal neoplasia. The forward MR analysis was designed to investigate the causal effects of gut microbiota and related metabolites on colorectal cancer risk, while the reverse MR to examine whether the genetic liability to colorectal neoplasia (i.e., colorectal cancer, adenomas, and polyps) influenced the abundance of the gut microbiota. The study design is shown in Fig. 1. Details of the data sources are summarized in Table 1.

Figure 1.

Study design. Schematic representation of the two-sample bi-directional MR analysis. MR was used to evaluate the causal relationships between gut microbiome as well as metabolites and colorectal neoplasia. Three key assumptions of MR: (1) genetic variants must be associated with exposures; (2) there should be no confounders between the genetic variants and the outcomes; (3) genetic variants must affect outcomes only through exposures, not through other pathways.

Figure 1.

Study design. Schematic representation of the two-sample bi-directional MR analysis. MR was used to evaluate the causal relationships between gut microbiome as well as metabolites and colorectal neoplasia. Three key assumptions of MR: (1) genetic variants must be associated with exposures; (2) there should be no confounders between the genetic variants and the outcomes; (3) genetic variants must affect outcomes only through exposures, not through other pathways.

Close modal
Table 1.

Detailed information on data sources.

Exposure or outcomeParticipants included in analysisAdjustmentsNumber of genetic instrumentsPubMed ID and/or URL
Gut microbiota taxa 18,340 European-descent individuals Age and any necessary study-specific covariates 24 gut microbiota taxa P < 5 × 10−8 27 33462485 https://mibiogen.gcc.rug.nl/ 
   Gammaproteobacteria P < 1 × 10−5  
   Lactobacillales  19  
   Enterobacteriaceae  11  
   Porphyromonadaceae  10  
   Actinomyces  10  
   Bifidobacterium  18  
   Roseburia  18  
   Peptostreptococcaceae  16  
 5,959 European-descent individuals Age, sex, genotyping batch and the top ten genetic principal components Fusobacteriaceae  28 35115689 
Gut microbial metabolites 2,076 European-descent individuals Age, sex Choline 23823483 
   Betaine 13  
   Carnitine 12  
   TMAO  
   GABA 10  
   Propionic acid  
Colorectal cancera 16,871 colorectal cancer cases and 26,328 controls of European ancestry Age, sex and any necessary study-specific covariates — 32638365 
Colorectal cancerb 23,262 colorectal cancer cases and 38,296 controls of European ancestry Age, sex and any necessary study-specific covariates 52 30510241 
 34,627 Colorectal cancer cases and 71,379 controls of European ancestry Age, sex and any necessary study-specific covariates 43 31089142 
Colorectal adenomas Discovery phase: 139 advanced adenoma cases and 1,267 controls of European ancestry; Validation phase: 4,175 colorectal adenoma cases and 5,036 controls of European ancestry Age, sex, family history, prior aspirin use, and treatment. 19 24084763 
Colorectal polyps 2,473 cases (1,831 with adenomas and 642 with hyperplastic polyps only) and 4,019 controls of European ancestry Age, sex, body mass index, cigarette smoking and alcohol drinking. 23027627 
Exposure or outcomeParticipants included in analysisAdjustmentsNumber of genetic instrumentsPubMed ID and/or URL
Gut microbiota taxa 18,340 European-descent individuals Age and any necessary study-specific covariates 24 gut microbiota taxa P < 5 × 10−8 27 33462485 https://mibiogen.gcc.rug.nl/ 
   Gammaproteobacteria P < 1 × 10−5  
   Lactobacillales  19  
   Enterobacteriaceae  11  
   Porphyromonadaceae  10  
   Actinomyces  10  
   Bifidobacterium  18  
   Roseburia  18  
   Peptostreptococcaceae  16  
 5,959 European-descent individuals Age, sex, genotyping batch and the top ten genetic principal components Fusobacteriaceae  28 35115689 
Gut microbial metabolites 2,076 European-descent individuals Age, sex Choline 23823483 
   Betaine 13  
   Carnitine 12  
   TMAO  
   GABA 10  
   Propionic acid  
Colorectal cancera 16,871 colorectal cancer cases and 26,328 controls of European ancestry Age, sex and any necessary study-specific covariates — 32638365 
Colorectal cancerb 23,262 colorectal cancer cases and 38,296 controls of European ancestry Age, sex and any necessary study-specific covariates 52 30510241 
 34,627 Colorectal cancer cases and 71,379 controls of European ancestry Age, sex and any necessary study-specific covariates 43 31089142 
Colorectal adenomas Discovery phase: 139 advanced adenoma cases and 1,267 controls of European ancestry; Validation phase: 4,175 colorectal adenoma cases and 5,036 controls of European ancestry Age, sex, family history, prior aspirin use, and treatment. 19 24084763 
Colorectal polyps 2,473 cases (1,831 with adenomas and 642 with hyperplastic polyps only) and 4,019 controls of European ancestry Age, sex, body mass index, cigarette smoking and alcohol drinking. 23027627 

aSummary-level data for colorectal cancer.

bGenetic instruments for colorectal cancer were extracted from two different GWAS studies. For replicated SNPs, the ones from GWAS with larger case number were chosen.

Data sources for gut microbiome and metabolites

Summary statistics of the genetic contributions to gut microbiota taxa were obtained from the MiBioGen consortium (13) and FINRISK study (14). As we know, the MiBioGen consortium is the most extensive attempt to scrutinize the correlations between host-genetics and microbiome on a populace level, encompassing 18,340 participants from 24 European collectives, and exhibiting a vast geographic reach, in conjunction with a substantial sample size. Details on the MiBioGen consortium, including geographic regions, recruitment processes and other characteristics have been described in the published genome-wide association study (GWAS) previously (13). Each cohort obtained ethical approval and consent to participate in accordance with the local regulations and institutional requirements. The FINRISK 2002 cohort (FR02) study comprised of 5,959 European individuals and has been previously described in detail (14). The FINRISK population surveys, which aim to track the prevalence of cardiovascular and other noncommunicable disease risk factors among the Finnish population, have been conducted at 5-year intervals since 1972. The current investigation uses data from the FR02 study, which included men and women between the ages of 25 and 74 years from six distinct geographic regions in Finland. The sample was stratified by sex, region, and 10-year age group, with each stratum consisting of 250 participants. The research protocol for FR02 was approved by the Coordinating Ethical Committee of the Helsinki and Uusimaa Hospital District (Ref. 558/E3/2001), and all participants provided informed consent. The study was conducted in compliance with the ethical principles outlined in the World Medical Association's Declaration of Helsinki.

To complement with gut microbiome-derived metabolites, we selected six plasma metabolites with available GWAS data for analyses. Genetic predictors for choline, betaine, carnitine, TMAO, gamma-aminobutyric acid (GABA), and propionic acid were derived from a GWAS on human metabolome (15), which included 2,076 Europeans from the Framingham Heart Study (FHS) Offspring Cohort. The family-based FHS Offspring Cohort involved 2,076 participants belonging to 873 sibships, who underwent metabolic profiling and genome-wide genotyping. The study protocol was granted ethical clearance by the Boston University Medical Center Institutional Review Board, and all participants provided their informed consent.

Selection of genetic instruments for MR analyses

For the forward MR analyses, SNPs associated with gut microbiota within the MiBioGen consortium at the GWAS significance level (P < 5×10−8) were selected as IVs. In total, 211 taxa (131 genera, 35 families, 20 orders, 16 classes, and 9 phyla) were eligible for the mbQTL (microbial quantitative trait locus) mapping analysis and of these 24 gut microbiota taxa met the P value threshold of 5×10−8 which were included in our forward MR analysis. On the basis of our previous systematic review of epidemiologic observational studies, nine gut microbiota taxa (i.e., Enterococcus, Porphyromonas, Salmonella, Pseudomonas, Peptostreptococcus, Actinomyces, Fusobacterium, Bifidobacterium, and Roseburia) were found to be consistently associated with the risk of colorectal neoplasia (11). Given that there were only three available SNPs (i.e., rs7322849, rs998451 for genus Bifidobacterium and rs61841503 for genus Peptostreptococcaceae) using the strict 5×10−8 for the nine gut microbiota taxa of interest which present with supporting evidence in their associations with colorectal neoplasm, we chose to use a lenient P value threshold of 1×10−5 as indicated by the MiBioGen as secondary analyses. Overall, we obtained SNPs associated with the abundance of the family Fusobacteriaceae from FR02 cohort (14) and SNPs associated with the other eight gut microbiota taxa at the genus or upper level (i.e., family, order, class) from the MiBioGen consortium (13). Our selection process aligns with a previous study which demonstrated that microbial features exhibit the greatest explained variance with associated SNPs falling below a significance threshold of P < 1×10−5 (16). To avoid issues with co-linearity between SNPs and each trait, we removed SNPs that exhibited linkage disequilibrium (LD, r2 > 0.01) and only retained SNPs with the smallest P values for their respective traits. After LD pruning, 27 SNPs associated with 24 gut microbiota taxa (at a stringent P value threshold of 5×10−8) and nine, 19, 11, 10, 28, 10, 18, 18, and 16 genetic IVs were included to proxy the abundance of Gammaproteobacteria, Lactobacillales, Enterobacteriaceae, Porphyromonadaceae, Fusobacteriaceae, Actinomyces, Bifidobacterium, Roseburia, and Peptostreptococcaceae (at a lenient P value threshold of 1×10−5; Supplementary Table S1–2). Similarly, 6, 13, 12, 8, 10, and 2 IVs were selected as proxies for plasma choline, betaine, carnitine, TMAO, GABA, and propionic acid, respectively (Supplementary Table S3). Regarding the reverse MR analysis, because there is no open available summary data for metabolites, we only used summary-level data of the microbiota taxa from the MiBioGen consortium and the FR02 cohort (13, 14).

Data sources for colorectal neoplasia

Summary-level data on colorectal cancer used in forward MR analyses were obtained from our previous meta-analysis of 12 GWASs, namely CCRR1, CCFR2, COIN, CORSA, Croatia, DACHS, FIN, NSCCG-OncoArray, SCOT, UK1, and VQ58, which comprise 16,871 cases and 26,328 controls (17). Each dataset underwent application of standard quality control measures. For reverse MR, to increase the proportion of variance explained, we extracted all SNPs associated with colorectal cancer at the genome-wide significance level (P < 5 × 10−8) from two latest and largest colorectal cancer GWASs. Huyghe and colleagues performed a genome-wide association analysis of 23,262 cases and 38,296 controls (18), and Law and colleagues reported a genome-wide association analysis of 34,627 colorectal cancer cases and 71,379 controls of European ancestry (19). We used this comprehensive list of SNPs for colorectal cancer but only adopted the estimates from study with larger sample sizes (i.e., Law and colleagues; ref. 19). After excluding SNPs in LD, 95 SNPs were used as IVs for colorectal cancer in the reverse MR analyses (Supplementary Table S4). SNPs associated with colorectal adenomas were selected from a GWAS including 1,406 Caucasian individuals participating in the Adenoma Prevention with Celecoxib trial, 4,175 cases of familial colorectal adenoma and 5,036 control subjects of European ancestry (20). SNPs associated with polyps were selected from the Tennessee Colorectal Polyp Study, which included 2,473 cases and 4,019 controls (21). After removing those in LD, we obtained 19 SNPs for colorectal adenomas and 7 SNPs for colorectal polyps as IVs (Supplementary Table S5).

Statistical analysis

We excluded SNPs with missing values and without proxy SNPs (r² > 0.8) in the outcome data source. We endeavored to deduce positive strand alleles by using allele frequencies for palindromes, with the default and conservative approaches and summary statistics were harmonized using the effect allele frequency such that all outcome effect estimates reflected the exposure-increasing effect allele. For each genetic instrument, βGP represents the estimated association between the genetic variant and the exposure, while βGD represents the estimated association between the same genetic variant and the outcome. By using these estimates, the causal estimates can be derived using the Wald ratio formula (βGDGP). Consequently, the estimates in the forward MR should be interpreted as changes in colorectal cancer risk for per one-unit increase in log-transformed relative abundance values of gut microbiota and for per one-SD increase in plasma metabolites levels, while the estimates in the reverse MR should be interpreted as changes in the abundance of gut microbiota for per one-unit increase in the log OR of colorectal neoplasia. For microbiota taxa with only one SNP, we employed the Wald ratio method for the MR analysis. For exposures with at least 3 IVs, MR estimates were computed using the inverse variance weighted random-effects (IVW-RE) model and complemented by five sensitivity analyses, namely the weighted median (22), weighted mode (23), MR-Egger regression (24), MR-PRESSO (25), and leave-one-out analysis. When applying the weighted median method, it is essential that at least 50% of the weight is derived from valid IVs to ensure consistency (22). In cases where the largest cluster SNPs are valid, the weight mode method can provide an unbiased causal effect estimate (23). MR-Egger correction method can be employed to address directional horizontal pleiotropy with decreased statistical power (24). MR-PRESSO can detect and remove potential outliers among IVs and provide causal estimates after the removal of the identified outliers (25). The leave-one-out analysis can be used to assess whether the association is driven by a single SNP. The heterogeneity among estimates of genetic instruments can be evaluated using Cochran's Q value and the P value for intercept in MR-Egger can be used to detect directional horizontal pleiotropy (24). The strength of instruments was evaluated by the F-statistic (Supplementary Methods) and the F-statistic being less than 10 implied weak instrument bias (26). To address the issue of multiple testing correction, FDR was employed. 39 and 124 independent tests were considered in the forward and reverse MR analyses, respectively. FDR < 0.05 was used as the threshold to indicate the strength of evidence against the null hypothesis. All analyses were performed using the TwoSampleMR (23) and MR-PRESSO (25) R packages in R software 4.1.1.

Ethics approval

Because it relied solely on summary statistics from published GWAS and no utilization of individual-level data, this study did not require ethical approval.

Data availability

All data examined in this research is comprehensively presented in this published article and its supplementary materials.

Effects of gut microbiota taxa and related metabolites on colorectal cancer risk

In the forward MR analyses, we observed little associations between the abundance of gut microbiota taxa and colorectal cancer risk in the Wald ratio, IVW-RE method or in any of the sensitivity analysis (Supplementary Tables S6–S7). Moderate-to-high heterogeneity was observed in the association between abundance of Roseburia and colorectal cancer risk (P for heterogeneity = 0.001), and suggestive pleiotropy was observed in the MR analysis of Lactobacillales (P for pleiotropy = 0.017). For the analyses on gut microbiota taxa, there was a very low risk of weak instruments bias, given the F-statistics of all the genetic IVs were above 10. No association between the six gut microbial metabolites and colorectal cancer risk was observed in either the main IVW-RE or in any of the sensitivity analysis (Supplementary Table S8).

Effects of liability to colorectal neoplasia on gut microbiota

In the reverse MR analyses, we found some evidence suggesting that genetic liability to colorectal neoplasia was associated with the abundance of gut microbiota.

For per one-unit increase in the log OR of genetic liability to colorectal cancer, the abundance of Tyzzerella3 would have 0.066 [95% confidence intervals (CI), −0.123–0.009; P = 0.024] lower log-transformed relative abundance, based on the IVW-RE model. Although it did not survive the multiple test correction, the result was consistent across all the sensitivity analyses.

For per one-unit increase in the log OR of genetic liability to colorectal adenomas, the abundance of Gammaproteobacteria, Enterobacteriaceae, Fusobacteriaceae, Allisonella, RuminococcaceaeUCG013 and Oxalobacteraceae, RuminococcaceaeUCG009 would have 0.027 (95% CI, 0.017–0.037; P = 7.06×10−8), 0.023 (95% CI, 0.013–0.034; P = 1.29×10−5), 0.006 (95% CI, 0.002–0.011; P = 0.006), 0.040(95% CI, 0.007–0.073; P = 0.021), 0.014(95% CI, 0.002–0.026; P = 0.018) higher log-transformed relative abundance and 0.018(95% CI, −0.036–0.001; P = 0.043), 0.025(95% CI, −0.039–0.011; P = 0.001) lower log-transformed relative abundance respectively, based on the IVW-RE model. The results for Gammaproteobacteria, Enterobacteriaceae survived the multiple test correction and were consistent across all the sensitivity analyses (Figs. 2 and 3). The MR-Egger results for Fusobacteriaceae and RuminococcaceaeUCG009 were inconsistent (β = −0.001 and β = 0.012, respectively), which might indicate the existence of potential heterogeneity and directional horizontal pleiotropy, however, the P value for the MR-Egger intercept were 0.917 and 0.660 for these tests. The results also indicated consistency in the β coefficients of the associations between colorectal cancer, adenomas, and polyps with these specific bacterial taxa, implying a uniform direction of effect between benign and malignant lesions.

Figure 2.

Effects of genetic liability to adenoma on Gammaproteobacteria from MiBioGen consortium. A and C, Forest plot for summarizing the results of all MR methods and heterogeneity tests of IVW and MR-PRESSO. B, Scatter plot for comparison of MR methods. Abbreviations: Q-test, Cochran's Q statistic heterogeneity test. MR-PRESSO, MR pleiotropy residual sum and outlier test.

Figure 2.

Effects of genetic liability to adenoma on Gammaproteobacteria from MiBioGen consortium. A and C, Forest plot for summarizing the results of all MR methods and heterogeneity tests of IVW and MR-PRESSO. B, Scatter plot for comparison of MR methods. Abbreviations: Q-test, Cochran's Q statistic heterogeneity test. MR-PRESSO, MR pleiotropy residual sum and outlier test.

Close modal
Figure 3.

Effects of genetic liability to adenoma on Enterobacteriaceae from MiBioGen consortium. A and C, Forest plot for summarizing the results of all MR methods and heterogeneity tests of IVW and MR-PRESSO. B, Scatter plot for comparison of MR methods. Abbreviations: Q-test, Cochran's Q statistic heterogeneity test. MR-PRESSO, MR pleiotropy residual sum and outlier test.

Figure 3.

Effects of genetic liability to adenoma on Enterobacteriaceae from MiBioGen consortium. A and C, Forest plot for summarizing the results of all MR methods and heterogeneity tests of IVW and MR-PRESSO. B, Scatter plot for comparison of MR methods. Abbreviations: Q-test, Cochran's Q statistic heterogeneity test. MR-PRESSO, MR pleiotropy residual sum and outlier test.

Close modal

We observed little associations of genetic liability to colorectal polyps with other gut microbiota in the reverse MR (Supplementary Table S9). We also derived genetic IVs for colorectal cancer and adenomas combined (colorectal neoplasia), and found that per one-unit increase in the log OR of genetic liability to colorectal neoplasia was suggestively associated with 0.017 higher log-transformed relative abundance for Gammaproteobacteria (95% CI, 0.002–0.032; P = 0.023), and the result was consistent with other approaches but did not survive FDR correction (Supplementary Table S9).

In this research, we examined the bidirectional relationship between gut microbiome and colorectal cancer within the framework of two-sample MR. Forward MR analyses provided little support for the relationship between levels of gut microbiota taxa or metabolites and colorectal cancer risk. These findings are at odds with evidence from various sources suggesting that bacterial taxa within the microbiome cause colorectal cancer (27–29). Whilst it is not feasible to completely exclude the potential existence of a causal relationship for all taxa, we find little evidence using genetic IVs for those bacterial taxa commonly associated in previous papers. In contrast to the forward analysis, our study revealed that genetic liability to colorectal adenomas was associated with the increased abundance of the class Gammaproteobacteria and the family Enterobacteriaceae in the reverse MR conducted on MiBioGen consortium datasets. Using the summary-level data from the FR02 cohort, suggestive evidence suggested that genetic liability to colorectal adenomas was associated with the abundance of the family Fusobacteriaceae. According to Brenner and colleagues, the adenoma–carcinoma sequence refers that majority of colorectal cancer cases are typically preceded by dysplastic adenomas, which may advance to malignant forms (30). Piciocchi and colleagues (31) showed that cycle-inhibiting factor toxin was linked to polyps or adenomas, whereas the presence of pks+ appeared to be a predisposing factor for colorectal cancer. Bacterial toxins are capable of promoting tumorigenesis through several mechanisms, including causing DNA damage and inducing genomic instability in host cells, resistance to cell death, stimulation of signaling pathways involved in cell proliferation, and inflammation (32, 33), thus creating a favorable host cell environment for the development of colorectal cancer. Nevertheless, the association and direction of effect for benign and malignant lesions are consistent, albeit that the do not survive multiple testing for colorectal cancer. This might be due to qualitatively different proportions of the variance explained in adenomas by the genetic IVs used in these analyses compared with cancer.

Gammaproteobacteria is an important class of the phylum Proteobacteria, and Enterobacteriaceae is one of the major families of Gammaproteobacteria. Although a previous study conducted in Japanese presented that Proteobacteria would increase the risk of colorectal cancer (34), our forward MR analyses did not find any causal relationship of Gammaproteobacteria and Enterobacteriaceae on colorectal cancer risk in this European population, which might be attributed to the heterogeneity of different ethnic lines. Nevertheless, the reverse MR analyses revealed that genetic liability to colorectal adenomas was linked to the increased abundance of Gammaproteobacteria and Enterobacteriaceae, suggesting that the abundance of these two species are more likely to be the consequences of altered pathological environment during the etiology of a colorectal adenoma. These findings are accord with previous observational studies. Analysis of mucosal samples revealed that patients with adenoma displayed an enrichment of eight bacterial within the class Gammaproteobacteria (35). Moreover, compared with adjacent normal colonic mucosa, Gammaproteobacteria was more predominant in the adenoma mucosa (36). On the basis of stool samples, a case–control study of 144 adenoma cases, 73 serrated polyps cases, and 323 polyp-free controls also found that adenoma cases had an increased abundance of Gammaproteobacteria (37). For the Enterobacteriaceae family, the most representative genera Escherichia and Shigella had been reported more enriched in the colonic mucosa of patients with adenoma than healthy controls (38, 39). Studies revealed that strong antimicrobial bile acid activities in early-stage tumor microenvironment led to marked alterations in the composition of gut microbiota, which included a proportional upsurge in certain species of Gammaproteobacteria and Bacteroidetes species (40). Gammaproteobacteria was also observed to form co-exclusive network with fungal classes like Ascomycota phylum and participate in new interkingdom interactions in colorectal cancer (41). Because the precise mechanism underlying the augmented prevalence of Enterobacteriaceae among patients with adenomas remains indeterminate, it necessitates further researches to explore the interplay between Gammaproteobacteria and other microbiota in the development of colorectal neoplasm.

There are several strengths in our study. Firstly, we conducted a bidirectional MR study which comprehensively estimates the causal relationship between gut microbiota and colorectal cancer in both directions. This design technique minimizes confounding by known and unknown risk factors and avoids reverse causality. Moreover, we included three different stages of colorectal carcinogenesis as outcomes, which could partly reflect the effects of studied gut microbiota as well as their metabolites on the sequencing progression of colorectal neoplasia. In addition, we performed several sensitivity analyses and consistency in our results between adenomas with both Gammaproteobacteria and Enterobacteriaceae indicated that our results were robust to violations of MR assumptions.

Despite the strengths, several limitations of this study should be considered. Firstly, GWASs for gut microbiota are still in their infancy and we cannot identify enough SNPs robustly associated with the nine gut microbiota taxa of interest at the widely used genome-wide significance level (P < 5×10−8), thus we used a lenient threshold (P < 1×10−5) for the selection of genetic IVs as a less stringent analysis. There exists a strong correlation between the significance of heritability and the number of independent loci under a relaxed threshold of 1×10−5, which also has the greatest explanatory power with regard to microbial features (13, 16). However, using a lenient P value threshold of 1×10−5 to select IVs may result in weak instrument bias and horizontal pleiotropy. Also, given many bacteria were only represented by one or two SNPs using 5×10−8 threshold, the genetic explanation (R2) was low and may yielded inadequate statistical power for the detection of modest or minor correlations. Therefore, the causal relationships between microbiota and colorectal cancer could not be completely ruled out by the negative results of this study. Secondly, it had been acknowledged that the genetic factors involved in proximal and distal colorectal cancer are different (42), and likewise, the composition of the microbiota involved in left and right colon cancer are also distinct (43–45). We cannot exclude the potential that the null findings for colorectal cancer might be due to the mixture of left and right cases on colorectal cancer summary statistics. Thirdly, the composition of gut microbiota is subject to multifactorial influences, including lifestyle factors such as dietary patterns, medication usage, and health status. It results in the diminishing the variance explained by genetic instruments, particularly in individuals who consume a westernized diet characterized by high levels of saturated fat and red meat, and low fiber (46). We cannot exclude the possible diet–gene, gene–environment interactions, as well as nongenetic effects on outcomes, which may influence the observed results, either.

Meanwhile, confining our analysis to the individuals of European descent reduces the extent to which our findings can be extrapolated to other populations. In addition, we extracted SNPs for polyps from a candidate study (21) instead of a GWAS, so the statistical power for the analysis is constrained, underscoring the necessity for larger GWASs of colorectal neoplasia to ensure sufficient statistical power. Likewise, due to the absence of available GWAS summary data for adenoma and polyps, we have solely employed colorectal cancer as the outcome in the forward MR analysis. It should be noted that when deriving a combined set of genetic IVs from adenoma and colorectal cancer to explore the combined effect of colorectal neoplasia, the directions of associations were consistent with the effect of solely colorectal cancer but their effect estimates had wider 95% CIs. One possible reason of the wide CIs is the different power in the contributing studies. Another possible reason may be the phenotypic heterogeneities between adenomas and colorectal cancer in their relations to these bacteria. Ideally, we would incorporate large-scale microbiome and genetic data from subjects who harbour colorectal adenomas or cancer at the time of fecal sampling. However, such data are not currently available and so this is a potential shortcoming of the current analysis. Furthermore, analysis of genetic sequencing has identified genetic variations (e.g., in LINC01605, PROKR2, and CCSER1 genes) linked to constituents of the metabolome or microbiome, as well as the risk of adenoma or colorectal cancer. In addition, the examination also discovered associations between genes responsible for cholesterol metabolism and the levels of high-density lipoprotein cholesterol would affect adenoma and colorectal cancer risk (47). Integration of metabolomics (e.g., fecal levels of cholesteryl esters and sphingolipids in colorectal cancer) and microbiome data (e.g., Fusobacterium, Parvimonas, and Staphylococcus) also indicated close interplay between bacteria and host (48). Finally, our study design is based on the assumption that the associations between gut microbiome with their metabolites and colorectal cancer risk are linear, which might veil the nonlinear effects.

In conclusion, there is not enough evidence of causal relationship between levels of gut microbiota taxa, or bacterial metabolites, and the risk of colorectal cancer based on current data. In contrast, the reverse MR provides evidence for positive associations between risk of colorectal adenomas with the abundance of the class Gammaproteobacteria, and the families Enterobacteriaceae. These findings suggest that changes in the abundance of Gammaproteobacteria and Enterobacteriaceae are potential microbial signatures during the adenoma–carcinoma sequence of colorectal cancer, while that attempts to modify the gut microbiota using measures such as antibiotic therapy and probiotics are unlikely to be successful in reducing colorectal cancer risk. This study yields insights regarding the causal relationship between colorectal carcinogenesis and gut microbiota and thus offering some reference and directions for the future study of gut microbiota.

M.G. Dunlop reports grants from University of Edinburgh during the conduct of the study. No disclosures were reported by the other authors.

W. Li: Writing–original draft. X. Zhou: Writing–original draft. S. Yuan: Writing–original draft, writing–review and editing. L. Wang: Writing–review and editing. L. Yu: Writing–review and editing. J. Sun: Writing–review and editing. J. Chen: Writing–review and editing. Q. Xiao: Writing–review and editing. Z. Wan: Writing–review and editing. J.S. Zheng: Writing–review and editing. C.X. Zhang: Writing–review and editing. S.C. Larsson: Writing–review and editing. S.M. Farrington: Writing–review and editing. P. Law: Writing–review and editing. R.S. Houlston: Writing–review and editing. I. Tomlinson: Writing–review and editing. K.F. Ding: Writing–review and editing. M.G. Dunlop: Conceptualization, writing–review and editing. E. Theodoratou: Conceptualization, writing–review and editing. X. Li: Conceptualization, writing–review and editing.

This work was supported by the Natural Science Fund for Distinguished Young Scholars of Zhejiang Province (LR22H260001; to X. Li) and the National Nature Science Foundation of China (82204019); CRUK Career Development Fellowship (C31250/A22804; to E. Theodoratou); the Swedish Heart Lung Foundation (Hjärt-Lungfonden, 20210351), the Swedish Research Council (Vetenskapsrådet, 2019–00977), and the Swedish Cancer Society (Cancerfonden; to S.C. Larsson); Natural Science Foundation of Zhejiang Province (LBY20H160002; to Q. Xiao); Project of the regional diagnosis and treatment center of the Health Planning Committee (No. JBZX-201903; to K.F. Ding). This work was also supported by grants from Cancer Research UK (C1298/A25514, C348/A12076, C6199/A16459, C348/A18927). Support was also provided by the DJ Fielding Medical Research Trust.

We thank the participants and investigators who participated in the MiBioGen consortium and FINRISK study for providing data.

The publication costs of this article were defrayed in part by the payment of publication fees. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.

Note: Supplementary data for this article are available at Cancer Epidemiology, Biomarkers & Prevention Online (http://cebp.aacrjournals.org/).

1.
Sung
H
,
Ferlay
J
,
Siegel
RL
,
Laversanne
M
,
Soerjomataram
I
,
Jemal
A
, et al
.
Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries
.
CA Cancer J Clin
2021
;
71
:
209
49
.
2.
Bray
F
,
Ferlay
J
,
Laversanne
M
,
Brewster
DH
,
Gombe Mbalawa
C
,
Kohler
B
, et al
.
Cancer incidence in five continents: inclusion criteria, highlights from volume X, and the global status of cancer registration
.
Int J Cancer
2015
;
137
:
2060
71
.
3.
Lawley
TD
,
Walker
AW
.
Intestinal colonization resistance
.
Immunology
2013
;
138
:
1
11
.
4.
Visconti
A
,
Le Roy
CI
,
Rosa
F
,
Rossi
N
,
Martin
TC
,
Mohney
RP
, et al
.
Interplay between the human gut microbiome and host metabolism
.
Nat Commun
2019
;
10
:
4505
.
5.
Zheng
D
,
Liwinski
T
,
Elinav
E
.
Interaction between microbiota and immunity in health and disease
.
Cell Res
2020
;
30
:
492
506
.
6.
Rinninella
E
,
Raoul
P
,
Cintoni
M
,
Franceschi
F
,
Miggiano
GAD
,
Gasbarrini
A
, et al
.
What is the healthy gut microbiota composition? a changing ecosystem across age, environment, diet, and diseases
.
Microorganisms
2019
;
7
:
14
.
7.
Castellarin
M
,
Warren
RL
,
Freeman
JD
,
Dreolini
L
,
Krzywinski
M
,
Strauss
J
, et al
.
Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma
.
Genome Res
2012
;
22
:
299
306
.
8.
Nakatsu
G
,
Li
X
,
Zhou
H
,
Sheng
J
,
Wong
SH
,
Wu
WK
, et al
.
Gut mucosal microbiome across stages of colorectal carcinogenesis
.
Nat Commun
2015
;
6
:
8727
.
9.
Narayanan
V
,
Peppelenbosch
MP
,
Konstantinov
SR
.
Human fecal microbiome-based biomarkers for colorectal cancer
.
Cancer Prev Res
2014
;
7
:
1108
11
.
10.
Yu
LC
.
Microbiota dysbiosis and barrier dysfunction in inflammatory bowel disease and colorectal cancers: exploring a common ground hypothesis
.
J Biomed Sci
2018
;
25
:
79
.
11.
Yu
L
,
Zhao
G
,
Wang
L
,
Zhou
X
,
Sun
J
,
Li
X
, et al
.
A systematic review of microbial markers for risk prediction of colorectal neoplasia
.
Br J Cancer
2022
;
126
:
1318
28
.
12.
Smith
GD
,
Ebrahim
S
. '
Mendelian randomization': can genetic epidemiology contribute to understanding environmental determinants of disease?
Int J Epidemiol
2003
;
32
:
1
22
.
13.
Kurilshikov
A
,
Medina-Gomez
C
,
Bacigalupe
R
,
Radjabzadeh
D
,
Wang
J
,
Demirkan
A
, et al
.
Large-scale association analyses identify host factors influencing human gut microbiome composition
.
Nat Genet
2021
;
53
:
156
65
.
14.
Qin
Y
,
Havulinna
AS
,
Liu
Y
,
Jousilahti
P
,
Ritchie
SC
,
Tokolyi
A
, et al
.
Combined effects of host genetics and diet on human gut microbiota and incident disease in a single population cohort
.
Nat Genet
2022
;
54
:
134
42
.
15.
Rhee
EP
,
Ho
JE
,
Chen
MH
,
Shen
D
,
Cheng
S
,
Larson
MG
, et al
.
A genome-wide association study of the human metabolome in a community-based cohort
.
Cell Metab
2013
;
18
:
130
43
.
16.
Sanna
S
,
van Zuydam
NR
,
Mahajan
A
,
Kurilshikov
A
,
Vich Vila
A
,
Võsa
U
, et al
.
Causal relationships among the gut microbiome, short-chain fatty acids, and metabolic diseases
.
Nat Genet
2019
;
51
:
600
5
.
17.
Li
X
,
Timofeeva
M
,
Spiliopoulou
A
,
McKeigue
P
,
He
Y
,
Zhang
X
, et al
.
Prediction of colorectal cancer risk based on profiling with common genetic variants
.
Int J Cancer
2020
;
147
:
3431
7
.
18.
Huyghe
JR
,
Bien
SA
,
Harrison
TA
,
Kang
HM
,
Chen
S
,
Schmit
SL
, et al
.
Discovery of common and rare genetic risk variants for colorectal cancer
.
Nat Genet
2019
;
51
:
76
87
.
19.
Law
PJ
,
Timofeeva
M
,
Fernandez-Rozadilla
C
,
Broderick
P
,
Studd
J
,
Fernandez-Tajes
J
, et al
.
Association analyses identify 31 new risk loci for colorectal cancer susceptibility
.
Nat Commun
2019
;
10
:
2154
.
20.
Wang
J
,
Carvajal-Carmona
LG
,
Chu
JH
,
Zauber
AG
,
Kubo
M
,
Matsuda
K
, et al
.
Germline variants and advanced colorectal adenomas: adenoma prevention with celecoxib trial genome-wide association study
.
Clin Cancer Res
2013
;
19
:
6430
7
.
21.
Zhang
B
,
Shrubsole
MJ
,
Li
G
,
Cai
Q
,
Edwards
T
,
Smalley
WE
, et al
.
Association of genetic variants for colorectal cancer differs by subtypes of polyps in the colorectum
.
Carcinogenesis
2012
;
33
:
2417
23
.
22.
Bowden
J
,
Davey Smith
G
,
Haycock
PC
,
Burgess
S
.
Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator
.
Genet Epidemiol
2016
;
40
:
304
14
.
23.
Hemani
G
,
Zheng
J
,
Elsworth
B
,
Wade
KH
,
Haberland
V
,
Baird
D
, et al
.
The MR-Base platform supports systematic causal inference across the human phenome
.
Elife
2018
;
7
:
e34408
.
24.
Bowden
J
,
Davey Smith
G
,
Burgess
S
.
Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression
.
Int J Epidemiol
2015
;
44
:
512
25
.
25.
Verbanck
M
,
Chen
CY
,
Neale
B
,
Do
R
.
Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases
.
Nat Genet
2018
;
50
:
693
8
.
26.
Burgess
S
,
Thompson
SG
.
Bias in causal estimates from Mendelian randomization studies with weak instruments
.
Stat Med
2011
;
30
:
1312
23
.
27.
Rebersek
M
.
Gut microbiome and its role in colorectal cancer
.
BMC Cancer
2021
;
21
:
1325
.
28.
Chattopadhyay
I
,
Dhar
R
,
Pethusamy
K
,
Seethy
A
,
Srivastava
T
,
Sah
R
, et al
.
Exploring the role of gut microbiome in colon cancer
.
Appl Biochem Biotechnol
2021
;
193
:
1780
99
.
29.
Allen
J
,
Sears
CL
.
Impact of the gut microbiome on the genome and epigenome of colon epithelial cells: contributions to colorectal cancer development
.
Genome Medicine
2019
;
11
:
11
.
30.
Brenner
H
,
Kloor
M
,
Pox
CP
.
Colorectal cancer
.
Lancet (London, England)
2014
;
383
:
1490
502
.
31.
Piciocchi
A
,
Germinario
EAP
,
Garcia Etxebarria
K
,
Rossi
S
,
Sanchez-Mete
L
,
Porowska
B
, et al
.
Association of polygenic risk score and bacterial toxins at screening colonoscopy with colorectal cancer progression: a multicenter case–control study
.
Toxins
2021
;
13
:
569
.
32.
van Elsland
D
,
Neefjes
J
.
Bacterial infections and cancer
.
EMBO Rep
2018
;
19
:
e46632
.
33.
Fiorentini
C
,
Carlini
F
,
Germinario
EAP
,
Maroccia
Z
,
Travaglione
S
,
Fabbri
A
.
Gut microbiota and colon cancer: a role for bacterial protein toxins?
Int J Mol Sci
2020
;
21
:
6201
.
34.
Mizutani
S
,
Yamada
T
,
SJCs
Y
.
Significance of the gut microbiome in multistep colorectal carcinogenesis
.
2020
;
111
:
766
73
.
35.
Alomair
AO
,
Masoodi
I
,
Alyamani
EJ
,
Allehibi
AA
,
Qutub
AN
,
Alsayari
KN
, et al
.
Colonic mucosal microbiota in colorectal cancer: a single-center metagenomic study in Saudi Arabia
.
Gastroenterol Res Pract
2018
;
2018
:
5284754
.
36.
Lu
Y
,
Chen
J
,
Zheng
J
,
Hu
G
,
Wang
J
,
Huang
C
, et al
.
Mucosal adherent bacterial dysbiosis in patients with colorectal adenomas
.
Sci Rep
2016
;
6
:
26337
.
37.
Peters
BA
,
Dominianni
C
,
Shapiro
JA
,
Church
TR
,
Wu
J
,
Miller
G
, et al
.
The gut microbiota in conventional and serrated precursors of colorectal cancer
.
Microbiome
2016
;
4
:
69
.
38.
Xu
K
,
Jiang
B
.
Analysis of mucosa-associated microbiota in colorectal cancer
.
Med Sci Monit
2017
;
23
:
4422
30
.
39.
Shen
XJ
,
Rawls
JF
,
Randall
T
,
Burcal
L
,
Mpande
CN
,
Jenkins
N
, et al
.
Molecular characterization of mucosal adherent bacteria and associations with colorectal adenomas
.
Gut Microbes
2010
;
1
:
138
47
.
40.
Yoshimoto
S
,
Loo
TM
,
Atarashi
K
,
Kanda
H
,
Sato
S
,
Oyadomari
S
, et al
.
Obesity-induced gut microbial metabolite promotes liver cancer through senescence secretome
.
Nature
2013
;
499
:
97
101
.
41.
Coker
OO
,
Nakatsu
G
,
Dai
RZ
,
Wu
WKK
,
Wong
SH
,
Ng
SC
, et al
.
Enteric fungal microbiota dysbiosis and ecological alterations in colorectal cancer
.
Gut
2019
;
68
:
654
62
.
42.
Huyghe
JR
,
Harrison
TA
,
Bien
SA
,
Hampel
H
,
Figueiredo
JC
,
Schmit
SL
, et al
.
Genetic architectures of proximal and distal colorectal cancer are partly distinct
.
Gut
2021
;
70
:
1325
.
43.
Miyake
T
,
Mori
H
,
Yasukawa
D
,
Hexun
Z
,
Maehira
H
,
Ueki
T
, et al
.
The comparison of fecal microbiota in left-side and right-side human colorectal cancer
.
Eur Surg Res
2021
;
62
:
121
7
.<./bib>
44.
Phipps
O
,
Quraishi
MN
,
Kumar
A
,
Dickson
EA
,
Ng
O
,
Acheson
AG
, et al
.
P317 Differences in normal mucosa and colorectal tumor microbiota between right and left colon
.
Gut
2021
;
70
:
A204
A
.
45.
Xi
Y
,
Yuefen
P
,
Wei
W
,
Quan
Q
,
Jing
Z
,
Jiamin
X
, et al
.
Analysis of prognosis, genome, microbiome, and microbial metabolome in different sites of colorectal cancer
.
J Transl Med
2019
;
17
:
1
22
.
46.
Mehta
RS
,
Song
M
,
Nishihara
R
,
Drew
DA
,
Wu
K
,
Qian
ZR
, et al
.
Dietary patterns and risk of colorectal cancer: analysis by tumor location and molecular subtypes
.
Gastroenterology
2017
;
152
:
1944
53
.
47.
Garcia-Etxebarria
K
,
Clos-Garcia
M
,
Telleria
O
,
Nafría
B
,
Alonso
C
,
Iruarrizaga-Lejarreta
M
, et al
.
Interplay between genome, metabolome, and microbiome in colorectal cancer
.
Cancers
2021
;
13
:
6216
.
48.
Clos-Garcia
M
,
Garcia
K
,
Alonso
C
,
Iruarrizaga-Lejarreta
M
,
D'Amato
M
,
Crespo
A
, et al
.
Integrative analysis of fecal metagenomics and metabolomics in colorectal cancer
.
Cancers
2020
;
12
:
1142
.
This open access article is distributed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.