Abstract
Colorectal adenomas are responsible for the origin of most colorectal cancers. Early detection together with active intervention of colorectal adenomas plays a crucial role in the prevention of colorectal cancer. This study aimed to construct and validate a new nomogram for the forecasting of the risk of colorectal adenomas based on lifestyle risk factors that could offer potential benefits for colorectal cancer prevention. Colonoscopy reports, pathology reports, physical factors, family history, personal history of disease, diet, and lifestyle habits were collected from 1,133 subjects who underwent complete colonoscopy. All subjects were divided into the training cohort (n = 792) and the validation cohort (n = 341). A nomogram predicting the risk of colorectal adenoma development was constructed using the training cohort, and the C-index was calculated. The predictive accuracy and clinical applicability of the nomogram were verified in the validation cohort. The nomogram was constructed by six statistically significant variables selected from 18 health factors, including advanced age, male, smoking, drinking, pickles, and irregular defecation. The C-index of the training cohort was 0.778, and the C-index of the validation cohort was 0.754. The calibration curve and decision curve analysis also confirmed that the model has good predictive ability and high profit. The nomogram constructed in this study was validated and can be applied to predicting the occurrence risk of colorectal adenoma. The model can guide the identification of patients with nonsymptomatic colorectal adenomas and the recognition of high-risk individuals for whom colonoscopy is advisable.
Prevention Relevance: Colorectal adenomas are the origin of most colorectal cancers. In this research, we explored the risk factors of colorectal adenomas and constructed a colorectal adenoma risk prediction nomogram in the expectation of early detection of patients with nonsymptomatic colorectal adenoma and advocated for their aggressive treatment to achieve colorectal cancer prevention.
Introduction
Colorectal cancer ranks as the third most prevalent cancer and the second leading cause of cancer-related death worldwide (1). In recent years, the incidence and mortality rates of colorectal cancer in China have increased owing to economic development and lifestyle changes (2). In 2020, newly diagnosed colorectal cancer cases in China accounted for 28.8% of all new cases globally (555,477/1,931,590), and colorectal cancer–related deaths accounted for 30.6% of all colorectal cancer–related deaths worldwide (286,162/935,173; ref. 3). With the increasing understanding of the biological development of colorectal cancer, targeted therapies directed against key oncogenes and pathways have emerged, but these treatments are only effective in a small proportion of patients with colorectal cancer (4).
Approximately 60% of colorectal cancers result from the adenomatous pathway, which progresses from nonadvanced tubular adenoma to villous tubular adenoma and potentially to invasive carcinoma (5). Colonoscopy is an effective tool to prevent colorectal cancer; however, a previous study conducted in China revealed that only 14% (25,593/182,927) of high-risk participants with colorectal cancer underwent colonoscopy, suggesting that colonoscopy is not yet well-accepted in the general population (6). Current colonoscopy resources in China do not accommodate comprehensive colonoscopy screening for all people of appropriate age, and colonoscopy may miss approximately one-quarter of colorectal adenomas, which greatly increases the potential cancerous risk of colorectal adenomas (7).
Previous studies have identified the following as hypothesized factors to influence the risk of colorectal adenomas occurrence: sex (8), age (8), body mass index (BMI; ref. 9), obesity (8–10), smoking (11, 12), drinking (12, 13), red meat consumption (11), dietary fiber (14), physical activity (8), and diabetes mellitus (8).
With a deeper understanding of colorectal cancer and precancerous lesions, early screening of high-risk groups has become the emphasis of colorectal cancer prevention and treatment. Although research on colorectal adenoma risk factors is extensive, there are fewer studies on risk probability prediction based on risk factors. In this study, the colorectal adenoma risk nomogram constructed based on lifestyle factors is easier to be extended for early screening of colorectal adenoma, and patients with high-risk colorectal adenomas can receive timely colonoscopies for the prevention of colorectal cancer.
Materials and Methods
Study population
The study cases were obtained from participants who underwent complete colonoscopy at the Second Hospital of Jilin University from 2021 to 2022. These subjects underwent a complete colonoscopy for diarrhea, abdominal pain, dark stool, and other digestive symptoms or physical examination. The pathologic examination of polyps was completed by experienced pathologists. All included participants underwent colonoscopy for the first time. The exclusion criteria for cases were as follows: (i) age less than 16 years old; (ii) the presence or a history of any type of malignancy; (iii) previous colorectal surgery; (iv) a family history of colorectal cancer; (v) previous or current detection of inflammatory bowel disease; (vi) the presence or a history of severe cardiovascular disease; and (vii) pathologic type of polyps other than colorectal adenomas. A total of 1,133 participants were eligible for analysis. The study was conducted in accordance with the Declaration of Helsinki. The study was approved by the Ethics Committee of the Second Hospital of Jilin University. We obtained written informed consent from each participant. Only study-related data were used in this study, and identifiable personal information was removed by the researchers.
Data collection
Data on physical factors, family history, personal disease history, diet, and lifestyle habits were collected by the investigators from the participants via telephone callback. Physical factors included age, sex, height, weight, and BMI (calculated from height and weight). Comorbidities included diabetes mellitus and hypertension. Family history mainly included a family history of colorectal cancer. Personal disease history was used to exclude participants who had any kind of malignancy or inflammatory bowel disease or who underwent colorectal surgery. Diet and lifestyle habits included smoking status, smoking duration, and quantity of cigarettes smoked; alcohol consumption status, alcohol consumption duration, and amount of consumed alcohol; spicy diet; high-fat diet; consumption of pickled products, vegetables, and fruits; regular aspirin use; regular NSAID use; water intake; physical activity; sleep duration; and defecation.
Smoking status was categorized into two types: smoking (continuous smoking for more than 2 years with an average of at least two cigarettes per day, excluding those who had quit smoking) and nonsmoking (never smoked). Alcohol consumption status was categorized into two types: drinking (continuous drinking for more than 2 years with an average daily intake of more than 20 g of ethanol, excluding those who had quit) and nondrinking (never drinking or with an average daily intake of less than 1 g of ethanol). The frequency of each dietary intake indicated the subject's dietary habits, categorizing spicy diets, high-fat diets, pickle consumption, and vegetable and fruit consumption into three separate statuses: negligible, occasionally (no more than 2 days per week), and frequently (at least 3 days or more per week). Regular use of aspirin or NSAIDs meant that the subject had taken them on average at least once a week for at least the past 5 years or continuously for at least 2 months each year for at least the past 5 years. The prevalence status of hyperglycemia and hypertension was divided into two groups: present and absent. Water intake status was divided into three groups: less than 500 mL/day, 500 to 1,500 mL/day, and more than 1,500 mL/day. Physical activity status was divided into three groups: negligible, occasional, and frequent (moderate intensity exercise at least once a week for at least 30 minutes). Sleeping time was divided into three groups: less than 6 hours/day, 6 to 8 hours/day, and more than 8 hours/day. Defecation was divided into two groups: regular (almost daily bowel movements at a fixed time) and irregular (daily but irregular or nondaily bowel movements). The collected information was collated, and all subjects were grouped after screening based on inclusion and exclusion criteria.
Establishment and validation of the predictive model
The predictive model was built by logistic regression analysis. We assessed health factors influencing the occurrence of colorectal adenomas in the training cohort by univariate stepwise analysis. The independent variables in the multiple logistic regression analysis were health factors that were significantly associated with the risk of colorectal adenomas in the univariate analysis. Based on the results of multivariate logistic regression analysis, we established a nomogram for predicting the occurrence risk of colorectal adenoma. We evaluated the probability of consistency between the predicted results of the model and the real situation through the plotting of the ROC curve and the C-index. Generally, we consider that the model has no predictive ability if the C-index is less than 0.5, and the model has weak or strong predictive ability if the C-index is between 0.5 and 0.7 or greater than 0.7.
We evaluated the consistency of the training cohort and validation cohort predicted probabilities with the actual results by calibration curves. The clinical benefit of the model at different decision thresholds was predicted by the decision curve analysis (DCA) curve.
Statistical analysis
We used SPSS, RRID:SCR_002865, and R4.2.2 software for statistical analysis. Quantitative variables that followed a normal distribution were expressed as (x̄ ± s) and analyzed using independent Student t tests. Categorical variables were expressed as frequencies and percentages (n%) and were analyzed using the χ2 test. The reported levels of statistical significance were two-sided, with significance less than 0.05 considered statistically significant.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.
Results
Participant screening
A total of 1,518 subjects participated in this study, of which 385 were excluded by exclusion criteria. After the screening, 792 participants were included in the training cohort and 341 in the validation cohort (Fig. 1). The baseline clinical characteristics of subjects in the colorectal adenoma and no adenoma groups are shown in Supplementary Table S1. Baseline clinical characteristics of the training and validation cohorts are shown in Supplementary Table S2. There was no statistical difference in the characteristics of the patients in the training and validation cohorts (P > 0.05).
Screening for independent influencing factors and constructing a nomogram
In the training cohort, 792 participants were divided into colorectal adenoma group (433) and no adenoma group (359). All variables were analyzed by univariate analysis, and statistically significant variables were included in the multivariate analysis. The following six variables were finally identified as independent factors influencing the occurrence of colorectal adenomas: advanced age, male, smoking, drinking, frequent consumption of pickles, and irregular defecation (Table 1). We constructed a nomogram forecasting the occurrence risk of colorectal adenoma from these six variables (Fig. 2). Each factor had a different score depending on the variable, and the score for each participant was the sum of the scores for all factors. The predicted probability of the occurrence risk of colorectal adenoma is obtained by plotting a vertical line from the total score line to the risk line. The greater the total score, the higher the risk of colorectal adenoma. The ROC curve analysis of the subjects showed that the AUC of the model was 0.778 (95% confidence interval, 0.746–0.810). Meanwhile, the efficacy of the nomogram at the cutoff point showed 74.4% specificity, 68.4% sensitivity, 76.3% positive predictive value, and 66.1% negative predictive value. This indicates that the model has good discrimination ability (Fig. 3).
Healthy factor . | Univariate analysis . | Multivariate analysis . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (years) | 1.065 (1.052, 1.078) | <0.001 | 1.067 (1.053, 1.081) | <0.001 |
BMI (kg/m2) | 1.023 (0.977, 1.071) | 0.339 | ||
Sex | ||||
Female | 1.00 (ref) | 1.00 (ref) | ||
Male | 1.973 (1.486, 2.621) | <0.001 | 1.489 (1.017, 2.180) | 0.041 |
Smoking status | ||||
Never | 1.00 (ref) | 1.00 (ref) | ||
Current | 3.255 (2.318, 4.570) | <0.001 | 1.901 (1.216, 2.972) | 0.005 |
Drinking status | ||||
Never | 1.00 (ref) | 1.00 (ref) | ||
Current | 2.872 (2.069, 3.987) | <0.001 | 1.669 (1.071, 2.600) | 0.024 |
Spicy food | ||||
Negligible | 1.00 (ref) | |||
Occasionally | 0.741 (0.536, 1.026) | 0.071 | ||
Frequently | 0.775 (0.539, 1.114) | 0.168 | ||
High-fat diet | ||||
Negligible | 1.00 (ref) | |||
Occasionally | 1.015 (0.727, 1.416) | 0.932 | ||
Frequently | 1.086 (0.763, 1.546) | 0.645 | ||
Pickled products | ||||
Negligible | 1.00 (ref) | 1.00 (ref) | ||
Occasionally | 2.227 (1.539, 3.223) | <0.001 | 1.903 (1.254, 2.866) | 0.002 |
Frequently | 2.969 (2.101, 4.196) | <0.001 | 1.976 (1.320, 2.958) | 0.001 |
Vegetable consuming | ||||
Frequently | 1.00 (ref) | |||
Occasionally | 1.048 (0.701, 1.565) | 0.821 | ||
Negligible | 2.298 (1.052, 5.020) | 0.037 | ||
Fruit consuming | ||||
Frequently | 1.00 (ref) | 1.00 (ref) | ||
Occasionally | 1.391 (1.007, 1.920) | 0.045 | 1.163 (0.796, 1.700) | 0.435 |
Negligible | 1.934 (1.314, 2.848) | 0.001 | 0.941 (0.588, 1.508) | 0.802 |
Water intake | ||||
>1,500 mL | 1.00 (ref) | |||
500∼1,500 mL | 1.065 (0.726, 1.562) | 0.747 | ||
<500 mL | 1.457 (0.937, 2.267) | 0.095 | ||
Exercise | ||||
Frequent | 1.00 (ref) | |||
Occasional | 1.138 (0.665, 1.948) | 0.638 | ||
Negligible | 1.489 (0.914, 2.425) | 0.110 | ||
Regular aspirin use | ||||
No | 1.00 (ref) | |||
Yes | 1.559 (0.945, 2.574) | 0.082 | ||
Regular NSAID use | ||||
No | 1.00 (ref) | 1.00 (ref) | ||
Yes | 1.498 (1.065, 2.106) | 0.020 | 1.473 (0.989, 2.195) | 0.057 |
Hypertension | ||||
No | 1.00 (ref) | 1.00 (ref) | ||
Yes | 1.736 (1.136, 2.652) | 0.011 | 1.089 (0.672, 1.765) | 0.729 |
Diabetes | ||||
No | 1.00 (ref) | |||
Yes | 1.193 (0.594, 2.398) | 0.620 | ||
Sleep duration | ||||
>8 hours | 1.00 (ref) | |||
6∼8 hours | 0.619 (0.431, 0.889) | 0.009 | 0.845 (0.553, 1.290) | 0.435 |
<6 hours | 0.838 (0.571, 0.230) | 0.367 | 1.042 (0.664, 1.034) | 0.859 |
Defecation | ||||
Regular | 1.00 (ref) | 1.00 (ref) | ||
Irregular | 1.660 (1.252, 2.202) | <0.001 | 1.476 (1.056, 2.064) | 0.023 |
Healthy factor . | Univariate analysis . | Multivariate analysis . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (years) | 1.065 (1.052, 1.078) | <0.001 | 1.067 (1.053, 1.081) | <0.001 |
BMI (kg/m2) | 1.023 (0.977, 1.071) | 0.339 | ||
Sex | ||||
Female | 1.00 (ref) | 1.00 (ref) | ||
Male | 1.973 (1.486, 2.621) | <0.001 | 1.489 (1.017, 2.180) | 0.041 |
Smoking status | ||||
Never | 1.00 (ref) | 1.00 (ref) | ||
Current | 3.255 (2.318, 4.570) | <0.001 | 1.901 (1.216, 2.972) | 0.005 |
Drinking status | ||||
Never | 1.00 (ref) | 1.00 (ref) | ||
Current | 2.872 (2.069, 3.987) | <0.001 | 1.669 (1.071, 2.600) | 0.024 |
Spicy food | ||||
Negligible | 1.00 (ref) | |||
Occasionally | 0.741 (0.536, 1.026) | 0.071 | ||
Frequently | 0.775 (0.539, 1.114) | 0.168 | ||
High-fat diet | ||||
Negligible | 1.00 (ref) | |||
Occasionally | 1.015 (0.727, 1.416) | 0.932 | ||
Frequently | 1.086 (0.763, 1.546) | 0.645 | ||
Pickled products | ||||
Negligible | 1.00 (ref) | 1.00 (ref) | ||
Occasionally | 2.227 (1.539, 3.223) | <0.001 | 1.903 (1.254, 2.866) | 0.002 |
Frequently | 2.969 (2.101, 4.196) | <0.001 | 1.976 (1.320, 2.958) | 0.001 |
Vegetable consuming | ||||
Frequently | 1.00 (ref) | |||
Occasionally | 1.048 (0.701, 1.565) | 0.821 | ||
Negligible | 2.298 (1.052, 5.020) | 0.037 | ||
Fruit consuming | ||||
Frequently | 1.00 (ref) | 1.00 (ref) | ||
Occasionally | 1.391 (1.007, 1.920) | 0.045 | 1.163 (0.796, 1.700) | 0.435 |
Negligible | 1.934 (1.314, 2.848) | 0.001 | 0.941 (0.588, 1.508) | 0.802 |
Water intake | ||||
>1,500 mL | 1.00 (ref) | |||
500∼1,500 mL | 1.065 (0.726, 1.562) | 0.747 | ||
<500 mL | 1.457 (0.937, 2.267) | 0.095 | ||
Exercise | ||||
Frequent | 1.00 (ref) | |||
Occasional | 1.138 (0.665, 1.948) | 0.638 | ||
Negligible | 1.489 (0.914, 2.425) | 0.110 | ||
Regular aspirin use | ||||
No | 1.00 (ref) | |||
Yes | 1.559 (0.945, 2.574) | 0.082 | ||
Regular NSAID use | ||||
No | 1.00 (ref) | 1.00 (ref) | ||
Yes | 1.498 (1.065, 2.106) | 0.020 | 1.473 (0.989, 2.195) | 0.057 |
Hypertension | ||||
No | 1.00 (ref) | 1.00 (ref) | ||
Yes | 1.736 (1.136, 2.652) | 0.011 | 1.089 (0.672, 1.765) | 0.729 |
Diabetes | ||||
No | 1.00 (ref) | |||
Yes | 1.193 (0.594, 2.398) | 0.620 | ||
Sleep duration | ||||
>8 hours | 1.00 (ref) | |||
6∼8 hours | 0.619 (0.431, 0.889) | 0.009 | 0.845 (0.553, 1.290) | 0.435 |
<6 hours | 0.838 (0.571, 0.230) | 0.367 | 1.042 (0.664, 1.034) | 0.859 |
Defecation | ||||
Regular | 1.00 (ref) | 1.00 (ref) | ||
Irregular | 1.660 (1.252, 2.202) | <0.001 | 1.476 (1.056, 2.064) | 0.023 |
Abbreviation: CI, confidence interval.
Validation of the nomogram
To validate the applicability of the nomogram to other data sets, we performed a validation using data from 341 subjects in the validation cohort. The validation cohort included 169 subjects without adenomas and 172 subjects with colorectal adenomas. The C-index for the validation cohort was 0.754 (95% confidence interval, 0.703–0.805). Meanwhile, the efficacy of the validation cohort at the cutoff point showed 66.3% specificity, 69.9% sensitivity, 66.9% positive predictive value, and 66.3% negative predictive value, suggesting that the nomogram also showed good predictability in the validation cohort. The calibration curves for the training and validation cohorts illustrated that the nomogram also showed a high degree of agreement between the predicted probability of colorectal adenoma and the true situation (Fig. 4). DCA demonstrated that the net benefit of intervention by the model was significantly better than if all subjects had either the intervention or no intervention, suggesting that there is a higher benefit to using the model for the prediction of the risk of colorectal adenoma occurrence (Fig. 5).
Discussion
Adenomatous polyps are thought to be the precursor lesions for most colorectal cancers and mainly develop via the adenoma–carcinoma sequence (5, 15). Therefore, early detection and endoscopic removal of newly occurring adenomas is an important strategy to prevent colorectal cancer. In our study, we combined contemporary lifestyle habits and geographic dietary characteristics to evaluate possible risk factors for colorectal adenomas by classifying risk factors into physical factors, lifestyle factors, comorbidities, and medications.
We found gender and age to be nonmodifiable factors in the development of colorectal adenomas in our study, and this association remained significant after multivariate analysis. This may be because men are more prone to unhealthy lifestyle behaviors such as smoking, drinking alcohol, or consuming more fat than women, which are risk factors for colorectal adenomas (12). Estrogen reduces the risk of colorectal adenomas in women to some extent. Female hormones inhibit the formation of colorectal adenomas by binding to the estrogen receptor to lower secondary bile acid levels and by downregulating the expression of insulin-like growth factor-1 (16).
Several studies have explained the association between tobacco and colorectal polyps, given that tobacco contains numerous carcinogens that can damage the intestinal mucosa, leading to polyp development (17). Smoking is a well-known risk factor for colorectal polyps and colorectal cancer (18). Relative to nonsmokers, smoking significantly increased the risk of colorectal polyps. Some studies suggest that the duration and quantity of smoking showed a rough dose–response relationship with the risk of disease. The risk of colorectal polyps increased with increasing duration and quantity of smoking (19, 20). A previous study reported that the prevalence of polyps was 2.01 times higher in drinkers than that in nondrinkers (21). Some researchers believe that alcohol disrupts the stable state of the flora in the gut when the body is healthy, resulting in intestinal damage and the depletion of bacteria with anti-inflammatory activity (22). Our study also demonstrated that the occurrence risk of colorectal adenomas is associated with alcohol consumption.
In many areas of the world, due to the climatic conditions and geographical location, people generally prefer a high-fat, high-salt diet. Vegetables are stored for a longer time by salting them. We also investigated the effect of pickled product consumption on the occurrence risk of colorectal adenomas. The process of pickling vegetables requires the addition of a large amount of salt or sauce, which produces nitrites. Nitrites react with amines, amides, and other nitroso precursors to form N-nitroso compounds, which are associated with a high risk of colorectal and gastrointestinal cancers (23). Some studies have revealed that the risk of colorectal adenomas was 1.487 times higher with frequent pickle consumption than that without pickle consumption (24). Our findings are consistent with it.
Some studies suggest that dietary fiber prevents polyp formation by swelling the stool and enhancing bowel movements, thereby possibly reducing the colonic surface area exposed to cancer-causing toxins and bile acids in the stool (25). Our study explored the relationship between regular daily defecation or moderate water intake and the occurrence risk of colorectal adenomas. The researchers suggest that regular daily defecation or moderate water intake may exert an effect similar to that of dietary fiber, decreasing the residence time of fecal toxins in the colorectum and reducing their absorption. The results also confirm that regular defecation reduces the occurrence risk of colorectal adenomas. However, moderate water intake was not significantly associated with the risk of colorectal adenoma.
BMI is an international measure of obesity and health status, and many studies have discovered a positive correlation between BMI and the risk of developing colorectal adenoma (9). Abdominal obesity has a stronger association with the risk of colorectal adenoma than BMI (26). However, in our study, although the mean BMI and the frequency of high-fat diet were higher in the colorectal adenoma group than those in the no adenoma group, there was no statistical significance between the two groups. The relationship between red meat consumption and colorectal cancer has been investigated in several studies, and there is consensus that processed red meat consumption increases the risk of colorectal cancer and colorectal adenomas. However, there is a lack of consensus on the risk associated with unprocessed red meat consumption. The association between red meat and colorectal adenomas was not investigated in this study. This is because the distinction between processed and unprocessed red meat is difficult to define, and the health effects of each type of red meat may differ significantly. Therefore, without strict categorization, it may not be reasonable to group processed and unprocessed red meat (27).
Diabetes and hypertension increase the risk of colorectal adenoma (28). The use of aspirin or NSAIDs has a protective effect on the risk of colorectal adenoma (29, 30). However, the results confirmed that these potential risk factors did not show a significant correlation, probably because fewer participants had these characteristics.
Our nomogram is designed to screen people for high-risk colorectal adenomas and to guide people to healthy living, so they should be limited in the number of variables included in the nomogram. Excessive variables can make calculating predictive scores cumbersome and incomprehensible to people. Our nomogram showed good predictive power in the training cohort. The calibration curves showed similar distributions with the ideal reference line, and the DCA showed that relying on this model for prediction yielded better results at any threshold. Applying this model to the validation queue also shows good predictive power in its calibration curve and decision analysis curve. Thus, our nomogram can be applied to predicting the occurrence risk of colorectal adenomas.
Compared with other colorectal adenoma risk prediction models, this model has well predictive accuracy as well as excellent calibration and clinical utility. In an American study, a clinical scoring system was constructed to predict the risk of colorectal adenoma based on age, sex, family history of colorectal cancer, and smoking history, which had a C-index of 0.64 (31). Spanish scholars examined 88 SNPs associated with the risk of colorectal cancer and finally identified five SNPs associated with the occurrence of colorectal adenomas for calculating genetic risk scores. Combining sex and age with genetic risk scores to construct a colorectal adenoma risk prediction model was useful for colorectal adenomas risk stratification and screening program adjustment, and the C-index of this model was 0.655 (32). Researchers in China proposed a tool to predict the risk of colorectal adenomas based on advanced age, male, hyperlipidemia, smoking, high red meat intake, high salt intake, high dietary fiber intake, Helicobacter pylori infection, nonalcoholic fatty liver disease, and chronic diarrhea, which has a favorable discriminatory power, with a C-index of 0.775 (33).
However, there are some limitations to this study. First, collecting information on each participant’s daily diet or energy intake in a retrospective case–control study was restrictive, and participants were limited in providing realistic descriptions of health factors, which is prone to recall bias. Second, our study did not include related diseases such as nonalcoholic fatty liver disease and H. pylori infection as study factors. Because the subjects were all outpatients of our hospital, it was not possible to judge the authenticity of the illnesses, which may cause inclusion bias. The judgment of whether one has hyperglycemia or hypertension is easier and can reduce the risk of bias. Finally, the sample size of this study was narrow and we did not investigate the economic income and education level of the participants. Better-off and better-educated people are generally more health-conscious. Therefore, the risk of colorectal adenomas may be relatively lower. We can reduce model prediction bias by increasing the sample size to cover more population types (34).
In conclusion, we exploited a simple and reliable model for predicting the occurrence risk of colorectal adenoma based on dietary and lifestyle factors. The model has good predictive accuracy and clinical applicability, which is relevant for identifying patients with asymptomatic colorectal adenomas and selecting high-risk groups for colonoscopy screening.
Authors’ Disclosures
No disclosures were reported.
Authors’ Contributions
M. Li: Conceptualization, data curation, formal analysis, investigation, visualization, methodology, writing–original draft, project administration. M. Cui: Data curation, formal analysis, supervision. X. Zhou: Data curation, formal analysis. Y. Song: Conceptualization, supervision, project administration, writing–review and editing.
Acknowledgments
The authors would like to thank all the staff of the Gastroenteric Medicine and Digestive Endoscopy Centre of the Second Hospital of Jilin University for their help in the data collection of this study.
Note: Supplementary data for this article are available at Cancer Prevention Research Online (http://cancerprevres.aacrjournals.org/).