Abstract
The CanRisk Tool (https://canrisk.org) is the next-generation web interface for the latest version of the BOADICEA (Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm) state-of-the-art risk model and a forthcoming ovarian cancer risk model.
The tool captures information on family history, rare pathogenic variants in cancer susceptibility genes, polygenic risk scores, lifestyle/hormonal/clinical features, and imaging risk factors to predict breast and ovarian cancer risks and estimate the probabilities of carrying pathogenic variants in certain genes. It was implemented using modern web frameworks, technologies, and web services to make it extensible and increase accessibility to researchers and third-party applications. The design of the graphical user interface was informed by feedback from health care professionals and a formal evaluation.
This freely accessible tool was designed to be user friendly for clinicians and to boost acceptability in clinical settings. The tool incorporates a novel graphical pedigree builder to facilitate collection of the family history data required by risk calculations.
The CanRisk Tool provides health care professionals and researchers with a user-friendly interface to carry out multifactorial breast and ovarian cancer risk predictions. It is the first freely accessible cancer risk prediction program to carry the CE marking.
There have been over 3,100 account registrations, and 98,000 breast and ovarian cancer risk calculations have been run within the first 9 months of the CanRisk Tool launch.
Introduction
Cancer risk prediction is widely used to identify individuals at high risk who may benefit from increased surveillance or risk-reducing interventions. The BOADICEA model, first developed in 2002, models the risks of breast and ovarian cancer based on family history and genotypes for variants in BRCA1 and BRCA2 (1). The model has since been extensively updated to incorporate the effects of pathogenic variants in other genes, common genetic variants (summarized as polygenic risk scores, PRS), lifestyle, hormonal and clinical features, breast density, and disease pathology (2). The model has been extensively validated, both for the prediction of carrier probabilities and prediction of subsequent cancer risk (3, 4).
BOADICEA was first made available online in the BOADICEA Web Application (BWA) in 2007 (5). There have been several releases since then to accommodate extensions to the BOADICEA model and to implement improvements to the web interface (6, 7). The BWA uses a series of web forms to capture cancer family history, BRCA1/2 genetic test results, and tumor pathology data. The BWA has been widely used in clinician settings, particularly clinical genetic centres, with more than 15,000 user registrations from over 100 countries.
BOADICEA is currently recommended by several national bodies and organizations for determining eligibility for high-risk breast cancer screening, eligibility for BRCA1/2 mutation screening, and to inform breast cancer risk management. These include the UK NICE guidelines (https://www.nice.org.uk/guidance/cg164), the American Cancer Society (8), the Ontario Breast Screening program (https://www.cancercareontario.ca/en/guidelines-advice/cancer-continuum/screening/breast-cancer-high-risk-women), and the eviQ Australian guidelines for health professionals (https://www.eviq.org.au/cancer-genetics/adult/risk-management).
The motivation to develop a new web tool arose firstly from the need to accommodate advances in the underlying risk prediction algorithms. In addition, the new interface has been designed to enhance usage in multiple levels of clinical care, and to conform to more recent regulations on software for clinical application, specifically European Union (EU) medical device regulations (9). In addition, there was a need to resolve limitations resulting from older web technologies. For example, the BWA used multiple HTML forms (each form capturing details for a single pedigree member) to capture pedigrees of arbitrary size and structure because there were significant barriers to the implementation of graphical pedigree building tools using open standard web technologies (e.g., Scalable Vector Graphics) and Dynamic HTML when it was first developed. This was a novel and powerful approach at the time, but multiple form-based pedigree data entry can be complicated (requiring navigation backward/forward through multiple pages) and time consuming to complete in practice. Finally, there was a need to present the outputs from the risk calculations in a wider range of formats.
To address these issues and enhance usability, increase accessibility, and enable rapid software development and maintainability, the web tool has now been wholly redeveloped from the ground-up using state-of-the-art software engineering techniques and frameworks. This article describes the resulting CanRisk Tool, which provides access to the latest version of the BOADICEA model (2) and related ovarian cancer prediction model (10).
Materials and Methods
The design of the CanRisk Tool was motivated by the desire to maximize usability for the health care professional (HCP) and to facilitate the patient consultation process. This was done by providing a user-friendly interface and by presenting results in a range of popular output formats to help HCPs convey their meanings to the patient. The initial design and subsequent development of the tool was informed by frequent input from the target HCPs including clinical geneticists, genetic counsellors, surgeons, oncologists, practice nurses, and general practitioners, and included a formal evaluation (11). This study demonstrated that: “the prototype CanRisk tool was generally acceptable to most participants due to its intuitive design.”
User-friendly data entry was implemented using standard HTML form elements (text/number input fields) on a single web page that reacts dynamically and intelligently to HCP inputs. HCPs are presented with a simple and intuitive user interface. PRSs can also be incorporated in risk calculations either by uploading a variant call format (VCF) file containing the genotypes of common variants included in the PRSs (12), or by entering the PRSs directly.
A novel web-based graphical pedigree builder was developed that allows HCPs to graphically build the family structure (13) and capture this information rapidly in a visual and intuitive manner.
A web services architecture has been employed to provide programmatic access to the risk models across the Internet. This means that the risk models can be accessed by both HCPs using the CanRisk Tool, and by researchers using their own bespoke research software. Web services are used to calculate PRSs from a VCF file for use in the risk calculations.
Data security was also of paramount importance in the design of the CanRisk Tool. The tool uses in-built security features provided by the Django web development framework, https://www.djangoproject.com (e.g., cross-site request forgery protection), as well as data encryption (via HTTPS) during transmission between the HCP's web browser and the CanRisk server. To increase the security of the web pages, Open Web Application Security Project (https://owasp.org) security headers are implemented. The tool also removes patient identifiable information from the input dataset before it is submitted to the server for processing. In addition, once the calculations are complete, the input data are deleted immediately from the server before the results are sent back to the web browser.
The tool was developed using state-of-the-art software engineering practices for example, automated continuous integration, testing, version control, web application security, coding standards, and web development frameworks (e.g., Django). These techniques were adopted to make the tool easier to maintain, shorten software release cycles, and facilitate extensibility. The software architecture (Fig. 1) is based on the distributed system client–server model (14). Software development was carried out iteratively and prototypes were evaluated by HCPs in the United Kingdom, France, and Germany (11). The web interface was refined using feedback from successive rounds of evaluation.
Test suites have been developed for the web services and web tool. These make the tool more robust, verify performance, and increase development productivity by automatically testing the software at each stage of the development cycle. Web browser automation tests, using Selenium (https://www.selenium.dev/), validate the CanRisk Tool functionality and check compatibility with current web browser versions.
Results
The CanRisk Tool provides HCPs and researchers with a user-friendly interface to carry out multifactorial breast and ovarian cancer risk predictions. The tool can be customized by setting default preferences that are saved to the HCP's browser, such as the units of measurement and country of residence. To provide more specific risk predictions, HCPs are encouraged to input as much accurate information as possible and are led through the data input process via a series of sections in a vertical accordion style interface (Supplementary Video S1). Completed section headers are then marked as green (Fig. 2). A novel feature is the inclusion of PRS input to the risk predictions. On uploading a VCF file or directly entering the PRSs, graphical representations of the PRSs are displayed.
The family history section uses a graphical pedigree builder (13), Supplementary Video S2. It uses standard pedigree nomenclature to represent a family tree (15). To enable rapid building of family structures in various clinical care settings, an initial family structure (e.g., nuclear, primary, or secondary degree relatives) or the number of each type of relative (e.g., number of brothers and daughters) can be selected. Clicking on family members in the graphic reveals widgets (i.e., clickable icons) that are used to add family members (e.g., parents, siblings, and partners). This approach provides the flexibility to allow for arbitrary family sizes and structures of varying complexity (including loops). On selecting an individual, their details (e.g., age, year of birth, cancer history, pathology, and genetic test results) can be set.
While the CanRisk Tool implements the standard version of the risk models, several parameters in the models can be tailored as necessary by the HCP: these include the cancer incidences in the population, the frequencies of pathogenic variants, mutation testing sensitivity, and the PRS distribution in the population. These enable HCPs across the world to tailor risk modelling parameters to their region and setting.
The computed breast and ovarian cancer risks are presented in different output formats to help HCPs convey complex risk information (Supplementary Video S3). The remaining lifetime breast and ovarian cancer risks for the next 5 years, 10 years, and to the age of 80 are presented both in text and icon array formats. In addition, cancer risks are presented in a graph from the current age to the age of 80. For each representation, the personal cancer risks are given in the context of the general population risks. The breast cancer risk classification (i.e., near population, moderate or high risk), as defined by UK NICE clinical management guidelines, is presented in text, table, and graphical formats. The probabilities of carrying pathogenic variants in the cancer susceptibility genes used in the models are also presented.
The CanRisk Tool can also generate a customized report that summarizes the input dataset and results. Reports can be printed or downloaded in PDF format. For data protection reasons, the reports are generated within the browser so that patient identifiable data are not transmitted to the server. The input dataset can be saved to a CanRisk format data file that can later be loaded into the tool to recreate the session. Reports and data files can be attached to patient records in the HCP's patient management system.
The web services are used by the CanRisk Tool to run the risk calculations and calculate PRSs. However, the main advantage of using the web services layer is that it provides programmatic access to the underlying risk models over the Internet, thus widening access to third-party software. For example, researchers have developed software to submit pedigrees to the CanRisk web service to obtain risk estimates in a nongraphical manner. This approach is particularly useful when researchers wish to run BOADICEA model risk calculations within their own software tools as part of an epidemiologic analysis.
It is important for HCPs to be able to compute risks quickly. Calculation times are dependent on the size and complexity of pedigrees. The cancer risk models have been greatly optimized so that the CanRisk Tool is very responsive, with 50% of breast/ovarian cancer risks being completed within approximately 10 seconds and 98% within approximately 60 seconds.
Online documentation has been provided (https://canrisk.atlassian.net/wiki/spaces/FAQS/overview) and will be expanded to create a CanRisk knowledge base for users of the tool. In addition, the new tool has a dedicated helpdesk (https://canrisk.atlassian.net/servicedesk) to provide support. As a result of requests to the helpdesk for features and improvements, several updates to the tool have already been implemented (https://canrisk.org/releases/).
Discussion
The CanRisk Tool was developed to broaden access to the underlying cancer risk models and to facilitate data entry by HCPs. Within the first 9 months of its release, it has had over 3,100 account registrations, and 98,000 breast and ovarian cancer risk calculations have been run. In addition, the CanRisk Tool implementation of BOADICEA has now been incorporated into the National Comprehensive Cancer Network guidelines for familial breast/ovarian cancer (16).
The CanRisk Tool provides a user-friendly web interface for HCPs across clinical settings. The breast and ovarian cancer risk predictions and pathogenic variant carrier probabilities can be invaluable to HCPs and patients in deciding the patient's best clinical management. The interface brings together multifactorial elements including, for the first time, PRSs and a novel graphical pedigree builder that enables HCPs to build and visualize large and complex pedigrees as they are building them. This approach makes data entry much quicker and easier than the form-based approach previously used by the BWA. Data sent to and from the web services are encrypted and pedigree data are anonymized. The CanRisk web services are a powerful addition, providing programmatic access to the cancer risk prediction models. They enable researchers, clinical users, and third-party clinical applications to submit single/batch data files and compute risks without human interaction, that is, data entry via the CanRisk Tool web interface.
The CanRisk Tool is classified as a medical device under EU regulations. The tool carries the CE marking to indicate compliance with applicable safety and performance requirements for use by HCPs within the European Economic Area (EEA; ref. 9). In particular, the CE marking: (i) shows that the manufacturer has checked that the product meets EU safety, health, or environmental requirements; (ii) is an indicator of the product's compliance with EU legislation; and (iii) allows the free movement of products within the EEA. The CE marking process involves activities such as risk management, software verification, and performance evaluation. These activities help to ensure that the software is implemented in accordance with software engineering best practice and that it fulfils stated user requirements. The CanRisk Tool obtained CE marking in November 2019: to our knowledge, this is the first breast and ovarian cancer risk prediction tool to achieve this.
Future development of the CanRisk Tool will accommodate further risk model extensions. Additional evaluations of the tool will determine how it can be further enhanced to meet the requirements of specific clinical settings, for example in GP consultations by tailoring the risk communication and management components. A patient facing interface for data capture is also planned. Future studies aim to assess the feasibility and acceptability among HCPs and patients of using the CanRisk Tool across different clinical settings.
Authors' Disclosures
T. Carver reports grants from Cancer Research UK, European Union's Horizon 2020 research and innovation programme, Wellcome Trust Collaborative Award, Genome Canada and the Canadian Institutes of Health Research, and Genome Québec, and the Québec Breast Cancer Foundation during the conduct of the study, and other from Cambridge Enterprise outside the submitted work. A. Lee reports personal fees from FamHis Inc, Myriad Genetics Inc, and Gene42 Inc outside the submitted work. A.P. Cunningham reports grants from Cancer Research UK, European Union Horizon 2020 Research and Innovation Programme, Genome Canada and the Canadian Institutes of Health Research, and Genome Quebec and the Quebec Breast Cancer Foundation during the conduct of the study, and personal fees from Cambridge Enterprise outside the submitted work. D.F. Easton reports grants from European Commission and Wellcome Trust during the conduct of the study. A.C. Antoniou reports personal fees and other from Cambridge Enterprise during the conduct of the study. No disclosures were disclosed by the other authors.
Authors' Contributions
T. Carver: Resources, software, investigation, visualization, methodology, writing-original draft. S. Hartley: Software, investigation. A. Lee: Software. A.P. Cunningham: Conceptualization, resources, software, supervision, visualization, methodology, writing-original draft. S. Archer: Validation, visualization. C. Babb de Villiers: Validation, visualization. J. Roberts: Validation. R. Ruston: Validation. F.M. Walter: Conceptualization, supervision, funding acquisition. M. Tischkowitz: Conceptualization, supervision, funding acquisition. D.F. Easton: Conceptualization, supervision, funding acquisition, validation, project administration. A.C. Antoniou: Conceptualization, resources, supervision, funding acquisition, validation, investigation, project administration.
Acknowledgments
This work was supported by Cancer Research UK grants (C12292/A20861, to A.C. Antoniou and C1287/A16563, to D.F. Easton); the European Union's Horizon 2020 research and innovation programme under grant agreement numbers 633784 (B-CAST to M. Schmidt) and 634935 (BRIDGES, to P. Devilee); a Wellcome Trust Collaborative Award (203477/B/16/Z, to S.H. Teo); and the PERSPECTIVE programme: The Government of Canada through Genome Canada and the Canadian Institutes of Health Research (grant GPH- 129344), the Ministère de l'Économie, de la Science et de l'Innovation du Québec through Genome Québec, and the Québec Breast Cancer Foundation to J. Simard. F.M. Walter is Director of the multi-institutional CanTest Collaborative, which is funded by Cancer Research UK (C8640/A23385).
The authors acknowledge the clinicians, researchers, and patients who provided feedback during the design and evaluation process.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.