Notes
Article history
The research reported in this issue of the journal was funded by the HTA programme as project number 04/33/01. The contractual start date was in September 2006. The draft report began editorial review in March 2015 and was accepted for publication in December 2015. The authors have been wholly responsible for all data collection, analysis and interpretation, and for writing up their work. The HTA editors and publisher have tried to ensure the accuracy of the authors’ report and would like to thank the reviewers for their constructive comments on the draft document. However, they do not accept liability for damages or losses arising from material published in this report.
Declared competing interests of authors
Wendy Atkin receives funds from Cancer Research UK (Population Research Committee – Programme Award C8171/A16894). Jonathan Myles also receives funds from Cancer Research UK and the National Institute for Health Research (NIHR) Health Technology Assessment (HTA). Jonathan Myles was part funded by the following NIHR HTA awards: 11/136/120 and 09/22/192. Andrew Veitch has received expenses-only sponsorship from Boston Scientific and Norgine to attend Digestive Diseases Week 2015, Washington, DC, USA, and Digestive Diseases Federation 2015, London, UK.
Permissions
Copyright statement
© Queen’s Printer and Controller of HMSO 2017. This work was produced by Atkin et al. under the terms of a commissioning contract issued by the Secretary of State for Health. This issue may be freely reproduced for the purposes of private research and study and extracts (or indeed, the full report) may be included in professional journals provided that suitable acknowledgement is made and the reproduction is not associated with any form of advertising. Applications for commercial reproduction should be addressed to: NIHR Journals Library, National Institute for Health Research, Evaluation, Trials and Studies Coordinating Centre, Alpha House, University of Southampton Science Park, Southampton SO16 7NS, UK.
Chapter 1 Introduction
The call for proposal
This project was undertaken in response to a call for proposals by the National Institute for Health Research (NIHR) Health Technology Assessment (HTA) programme in anticipation of an unsustainable increase in requirements for surveillance colonoscopy with the impending introduction of the national Bowel Cancer Screening Programme (BCSP) in 2006. There was real concern that an increase in adenoma detection from the BCSP would diagnose many people as intermediate risk (IR), with a consequent impact on endoscopy resources. Therefore, a call was issued to determine the optimum frequency of colonoscopic follow-up in patients who were identified with intermediate-grade adenomas.
The current UK surveillance guideline was developed in 2002 and defines three risk groups (low, intermediate and high risk) with different surveillance recommendations. 1 From existing evidence it was suggested that, for the low-risk group, colonoscopy surveillance might not be necessary, whereas for the high-risk group surveillance was definitely indicated with an additional clearing examination 12 months after initial diagnosis (but this group constitutes only around 10% of people with adenomas2). The IR group, representing around 40% of patients with adenomas, was recommended to have a 3-yearly surveillance colonoscopy. However, this recommendation was based on limited evidence to indicate the optimum surveillance interval and the need for repeated surveillance. 1
Available evidence suggested that it might be safe to stop surveillance in the IR group after one or two negative examinations, depending on the age of the patient and the quality of the examination. Importantly, it was also proposed that patients with intermediate adenomas (IAs) may vary in their risk of developing colorectal cancer (CRC) and that there might be subgroups with different surveillance requirements. 3 The need to determine the optimum frequency of colonoscopic follow-up in IR patients was identified as a priority by the Department of Health (DH).
Rationale
Colonoscopy is the most widely used procedure for investigating colonic symptoms, and for surveillance of people at increased risk of CRC because of a personal or family history of CRC or adenomas. It is widely accepted that most CRCs develop from adenomatous polyps,4–7 and that the detection and removal (polypectomy) of these precursors through screening or surveillance reduces the risk of CRC. 8–13 Adenomas are very common and tend to recur. As such, the future risk of CRC after polypectomy is thought to depend on findings during baseline colonoscopy, particularly the number, size and histological grade of removed adenomas,3,14,15 as well as the completeness of examination and clearance of prevalent adenomas. This evidence was used to stratify patients into risk groups, each with different colonoscopic surveillance recommendations. 1,16–18
Since Atkin et al. 3 first suggested a variability in risk of CRC after adenoma removal in 1992, many countries have developed adenoma surveillance guidelines, most of which are based on either the UK or US guidelines. The indication for surveillance depends primarily on the presumed risk of recurrence of advanced adenomas (AAs),15,19–23 and development of CRC, and also by age, comorbidity and patient compliance. The current UK surveillance guideline was first commissioned and developed by the British Society of Gastroenterology (BSG) in 2002 and has since been adopted by the National Institute for Health and Care Excellence (NICE) and the European Union (EU) (Figure 1). 24 Both UK and US guidelines identify three risk groups, but the definitions and surveillance recommendations differ slightly. 1,2 Both guidelines identify a low-risk group, for which no surveillance or 5-yearly surveillance is recommended; an intermediate-(UK)/higher-risk (US) group, for which 3-yearly surveillance is recommended; and a high-risk group, for which additional colonoscopy is recommended. In the UK, the guideline specifies a single clearing colonoscopy at 12 months before continuing on 3-yearly surveillance. 1,2,16,17,25
The fact that guidelines vary – particularly in defining the IR group and their surveillance recommendation18,26 – is indicative of the uncertainty about the optimum adenoma surveillance regime. After adenoma removal, some patients have a risk of CRC similar to, or lower than, that of the general population,3,27,28 implying that not all patients are at sufficient risk to warrant surveillance. 3,14,27–31 The IR 3-yearly surveillance regime is based on results of the National Polyp Study,20 which compared two follow-up colonoscopies with one follow-up colonoscopy within 3 years and found no difference in the detection of adenomas with advanced pathology. Two other studies32,33 also found the incidence of adenomas with advanced pathology to be similar regardless of interval length. However, another trial found a non-significantly higher risk of CRC in patients who were examined at 4 years than in those examined at 2 years. 29,34
As colonoscopy is both costly and invasive, surveillance should be undertaken only in those who are at increased risk and at the minimum frequency required to provide adequate protection against the development of cancer. 25 There is evidence of both over- and underutilisation of colonoscopy, and a potential for more efficient allocation of endoscopy resources. 35 The IR group comprises nearly 20% of those subjects participating in the BCSP who undergo colonoscopy for a positive test,36 and nearly half of adenoma patients,2 yet no study has yet systematically examined whether or not there is heterogeneity in risk among patients who are currently offered 3-yearly surveillance. We sought to address the unanswered questions surrounding the current IR group surveillance strategy, that is:
-
What is the effect of interval length on detection rates of AA and CRC at follow-up examinations in IR patients?
-
Are there subgroups of IR patients who do not require surveillance, or who require only one follow-up? Similarly, are there are subgroups that might benefit from shorter or longer surveillance intervals?
-
Does the risk of AA or CRC at first and second follow-ups vary by patient, procedural and polyp characteristics, and surveillance interval length?
-
Can we define factors that affect the risk of CRC after baseline in IR patients, for example number of surveillance visits, patient/procedural/polyp characteristics?
Background to the design
As a randomised controlled trial (RCT) or prospective observational study would take many years to complete, the use of pre-existing hospital patient data in a retrospective cohort study was the recommended design. It was thought that this method would be quicker, cheaper and more convenient. In addition, the use of such hospital patient data ensured that there would be sufficient variation in adenoma surveillance intervals to enable comparison between them. This may not have been possible with data collected prospectively because of the widespread adoption of UK surveillance guidelines. Furthermore, longer patient follow-up times could also be obtained in this retrospective study design.
We also requested access to data from researchers of a number of screening studies on findings at surveillance colonoscopy. Eight screening data sets were identified; however, only three provided adequate data for our analyses (see Chapter 4, Screening data set, Background).
At the time there was no systematic call or recall of patients in adenoma surveillance, so the principal investigator (PI) also wrote to the manufacturers of the patient management systems that were used to manage patient data in hospitals in the UK NHS. The manufacturers were able to identify hospitals that had used their software for a sufficiently long period of time. These hospitals were contacted and were provided with a questionnaire to complete in order to determine their suitability for the study.
Aim and objectives
The overall aim was to examine the optimum frequency of surveillance in patients who were found to have IR adenomas and assess the risks and benefits with respect to prevention of cancer/AA; anxiety, morbidity and mortality; costs and cost-effectiveness; and implications for the NHS.
The primary objective was to assess whether or not there was substantial heterogeneity in the detection of AA or CRC according to baseline characteristics and interval to first follow-up colonoscopy. The study planned to determine if there was a subgroup of IR patients who do not require surveillance and whether or not the size of this group is clinically significant. Finally, the study examined whether subgroups could be identified for which the currently recommended 3-year interval is too long, or for which the interval can be safely extended, or if there is a group that requires a second examination but no further follow-up.
An economic analysis aimed to estimate the incremental cost-effectiveness of alternative adenoma follow-up strategies, including a policy of no follow-up for individuals who have intermediate-grade adenomas. It also planned to estimate the impact of alternative adenoma follow-up strategies on colonoscopy services, and the total cost impact of alternative adenoma follow-up strategies in England and Wales.
A psychological impact analysis aimed to examine the anxiety-inducing effects of colonoscopic surveillance or being informed that colonoscopy surveillance is required.
Study design and setting
This was a retrospective cohort study using data from two sources. A cohort of patients attending UK NHS hospitals for diagnostic or surveillance endoscopy formed the largest data set – termed the ‘hospital data set’. Three smaller data sets were obtained from a research or screening setting, and involved average-risk individuals undergoing screening: one from a UK screening trial, a second from a UK pilot screening programme and a third from a US health surveillance programme – termed the ‘screening data sets’. The core results were derived from the hospital data set, as there were difficulties in obtaining additional screening data sets, and limited data completeness in the screening data sets that we were able to obtain. A health-economics evaluation and psychological study were also conducted.
Structure of this report
The findings of the hospital and screening data sets are reported and discussed in Chapters 3 and 4, respectively. The methods, which were largely the same for the two data sets, are described in Chapter 2, with any additional methods unique to the screening data set described in Chapter 4. The health-economic evaluation is reported in Chapter 5 and the psychological study is reported in Chapter 6. Finally, Chapter 7 presents a synthesis of results from the preceding chapters, as well as strengths/limitations and future work.
Chapter 2 Methods
Hospital selection
The hospital data set comprised routine gastrointestinal endoscopy and pathology data for patients having diagnostic and surveillance procedures. Participating hospitals were required to have recorded endoscopy and pathology data electronically for at least 6 years prior to the study start in 2006. After contacting endoscopy and pathology database manufacturers, 28 NHS hospitals were identified as meeting these criteria, and their participation in the study was requested. A number of hospitals were excluded because of difficulties with data extraction and data quality issues (see Data collection from hospitals, below). In total, 18 hospitals were included in the study. Two of these merged into the Imperial College Healthcare Trust (Charing Cross and Hammersmith hospitals) and thus there were 17 hospital sites included; these are listed in Table 1.
Trust | Hospital | Study name | Study code | Collection dates |
---|---|---|---|---|
Brighton & Sussex University Hospitals NHS Trust | Royal Sussex County Hospital | Brighton | BRI | May 2001 to April 2008 |
North Cumbria Acute Hospitals Trust | Cumberland Infirmary | Cumberland | CI | August 1998 to September 2009 |
Imperial College Healthcare NHS Trust | Charing Cross Hospital/Hammersmith Hospital | Charing Cross/Hammersmith | CX/HH | October 1997 to November 2007 |
Greater Glasgow and Clyde NHS Trust | Glasgow Royal Infirmary | Glasgow | GRI | May 1996 to August 2009 |
University Hospitals of Leicester NHS Trust | Leicester General Hospital | Leicester | LGH | April 1998 to March 2008 |
Royal Liverpool and Broadgreen University Hospitals Trust | Royal Liverpool University Hospital | Liverpool | RLUH | January 2000 to October 2009 |
Royal Wolverhampton Hospitals NHS Trust | New Cross Hospital | New Cross | NC | January 1993 to November 2007 |
University Hospital of North Tees Trust | University Hospital of North Tees | North Tees | NT | June 1986 to December 2006 |
Queen Elizabeth Hospital NHS Trust | Queen Elizabeth Hospital | Queen Elizabeth | QEW | March 1999 to May 2006 |
Queen Mary’s Sidcup NHS Trust | Queen Mary’s Hospital | Queen Mary’s | QMH | October 1998 to July 2009 |
Shrewsbury and Telford Hospitals NHS Trust | Royal Shrewsbury Hospital | Shrewsbury | SH | January 2002 to September 2009 |
St George’s Healthcare NHS Trust | St George’s Hospital | St George’s | SGH | February 1992 to July 2009 |
London North West Healthcare NHS Trust | St Mark’s Hospital | St Mark’s | SMH | January 1985 to July 2007 |
Imperial College Healthcare NHS Trust | St Mary’s Hospital | St Mary’s | ICMS | December 1984 to July 2010 |
Royal Surrey County Hospital NHS Trust | Royal Surrey County Hospital | Surrey | SCH | September 1997 to May 2010 |
South Devon Healthcare NHS Foundation Trust | Torbay District General Hospital | Torbay | TDG | November 2000 to August 2007 |
Yeovil District Hospital Foundation Trust | Yeovil District Hospital | Yeovil | YDH | February 1997 to May 2008 |
Patient eligibility
Inclusion criteria
Patients with IR adenoma(s) and a baseline colonoscopy were eligible for inclusion in the study. Following the UK guideline, IR patients were defined as those with three or four small adenomas (of < 10 mm) or one or two adenomas, at least one of which was large (≥ 10 mm).
Exclusion criteria
Patients were excluded for having certain conditions if the condition increased their risk of CRC or could have led to an abnormal pattern of surveillance. Some diagnoses resulted in exclusion regardless of when they occurred, for example hereditary non-polyposis colorectal cancer (HNPCC), a genetic condition that confers an increased risk of cancer throughout an individual’s lifetime. Other conditions resulted in exclusion only if they were diagnosed at, or prior to, baseline, or, in other cases, patients were censored after diagnosis of a particular condition rather than excluded altogether.
Patients were excluded if they had any of the following diagnoses at, or prior to, baseline:
-
CRC or inflammatory bowel disease (IBD)
-
resection/anastomosis
-
volvulus.
Patients were excluded if they had the any of the following diagnoses at any time:
-
family history of familial adenomatous polyposis (FAP)
-
HNPCC
-
Cowden syndrome
-
juvenile or hamartomatous polyps.
Patients with polyposis could be excluded depending on polyposis type and time of diagnosis. Details of time-dependent exclusions for polyposis and colitis can be found later in the report (see Appendix 5).
Patients were also excluded if they had no baseline colonoscopy, or had one or more procedures without a date, or had more than 40 endoscopic procedures recorded.
Research governance
Data were collected from hospitals in England and Scotland. The following research governance approvals were obtained to permit data collection and follow-up via external agencies:
-
Approval was granted from the Royal Free Research Ethics Committee (REC) for the study throughout the UK (REC reference 06/Q0501/45). The REC agreed that all sites should be exempt from site-specific assessment. Further approval was granted for substantial amendments to allow changes to database hosting arrangements and logistical arrangements for data collection and follow-up.
-
Approval to access patient identifiable information without consent in England was granted from the Patient Information Advisory Group (PIAG) [later the National Information Governance Board (NIGB) and currently the Ethics and Confidentiality Committee at the Health Research Authority] in accordance with Section 60 of the Health and Social Care Act 200137 (re-enacted by Section 251 of the NHS Act 200638). Further approval was granted for substantial amendments to allow data to be extracted and anonymised, and to link identifiable information obtained from multiple sources including hospital endoscopy and pathology databases, the Hospital Episode Statistics (HES) database, and databases held by the Office for National Statistics (ONS), National Health Service Information Centre (NHSIC) [subsequently the Health and Social Care Information Centre (HSCIC)] and National Health Service Cancer Registries (NHSCR) [reference PIAG 1–05(e)/2006]. This was necessary because of the retrospective nature of the study and the large number of patients involved. Support was favourable based on the study’s System Level Security Policy and compliance with Imperial College’s policy on data handling and storage, and a recommendation from the Caldicott Guardian for the North West London Hospitals NHS Trust, who approved the arrangements to ensure patient confidentiality and anonymity. In Scotland, similar approvals were obtained in 2013 from the Community Health Index Advisory Group. Permission was granted to use the Community Health Index (CHI) to enable the Information Services Division to clean the patient information within the study data set, and to match identifiable information to data from the cancer and death registries.
-
Research approval for the study was obtained from all relevant NHS care organisations for the study sites, which were provided with the ethical approval documentation and the study protocol. As none of the members of the study team had a contractual relationship with the NHS, honorary contracts/letters of access were applied for and obtained for staff who were required to carry out work at the various study sites, in agreement with the Research Governance Framework.
-
Where necessary, applications were made to the custodians of external data sets to enable specific researchers to access information controlled by external sources and to allow the study data set to be linked to external data sets. In England, researcher status was approved and obtained from the ONS and NHSIC for individual researchers, and applications to use individual records for medical research were made to the NHSIC and the UK Association of Cancer Registries (UKACR). In Scotland, the Privacy Advisory Committee (PAC) granted approval for patient record linkage with NHS National Services Scotland (NSS) using CHI numbers, so that patients in Glasgow could be followed up to obtain details of cancers and deaths (reference PAC Application 66/11).
To ensure that patient confidentiality was maintained throughout the study, no patient-identifiable information – except date of birth – was stored on the study database. All patient identifiers were left at the individual study sites in secure locations, and all information kept at the trial office was in a pseudo-anonymised format. In addition, access to the Oracle database was controlled by username and password security, as well as a firewall that restricted access to the database server to a limited number of IP addresses. The majority of computers in the trial office were given access to the database, whereas specific access to the data via the Oracle Application Express (APEX) version 3.2.1.00.10 (Oracle Corporation, Redwood City, CA, USA) coding application was controlled via APEX’s built-in user management facility.
Data collection from hospitals
The data were extracted from hospital endoscopy and pathology databases by the study programmer. A minority of databases had an interface that permitted bulk extraction of the data according to specific criteria and in these cases data were extracted with assistance from hospital staff who were familiar with the systems. However, for most endoscopy and pathology databases, the application interface was not designed for bulk data extraction, so data extraction and processing was complex, with a number of problems encountered, for example:
-
When the maintenance and support of the endoscopy and pathology databases had been outsourced to the database manufacturers, often only they could help with extracting the data or by writing software enabling the study programmer to do so.
-
Specialist support was required when data were held on legacy systems.
-
Information technology (IT) staff at the hospitals sometimes had to restore archived data temporarily so that they could be extracted.
-
Most hospitals had replaced databases over the years, and therefore some data overlapped or were duplicated (e.g. the same patient had records on more than one system).
-
Sometimes several hospital visits were necessary to extract data from multiple databases, at the convenience of the local IT experts.
-
The data outputs from these databases were in a combination of structured and unstructured formats. Structured data could be easily cleaned and converted into a standardised format for uploading. Unstructured data (usually large text fields) needed bespoke programs written to extract, clean and convert the data into a suitable format.
Owing to various technical difficulties with data extraction, inability to access databases, partial availability of electronic data, unreliable Systematized Nomenclature of Medicine (SNOMED) coding and logistical difficulties due to local staff availability, nine hospitals were excluded from the study (Table 2). From the 17 hospitals that were included in the study, data were extracted from 27 endoscopy and 29 pathology systems. (A summary of data collection at each hospital can be seen in Appendix 1.)
Hospital | Reason for exclusion | Summary | Details |
---|---|---|---|
Blackpool Victoria | Software/data collection issue | Incompatibility of old vs. new software systems for importing endoscopy data Difficulties in bulk data transfer from pathology reports |
Following the creation of extraction programs and test runs, the statistics program was unable to extract all of the necessary endoscopy data. The main endoscopy reports could not be extracted from the older EndoScribe data imported into the newer ADAM system The pathology system did not allow the uploading of pathology reports in bulk |
Bradford Royal Infirmary | Software/data collection issue | Old software systems used to record data Difficulties in bulk data transfer |
Pre-2005 SNOMED coding for pathology data was unreliable The Co-Path system proved to be problematic and complex to extract multiple records – it would have taken too long and would have slowed down the system for the hospital |
City Hospital, Birmingham | Difficulties in obtaining R&D approval | Delays and limited resources | The study encountered long delays in R&D approval, owing to staff shortages in the R&D department and time-consuming internal procedures |
George Eliot, Nuneaton | Software/data collection issue | Old software systems used to record data Difficulties in bulk data transfer |
The same difficulties with the Co-Path system as encountered with Bradford Royal Infirmary |
King George, Ilford | Data collection issue | Impractical to extract the data Difficulties in bulk data transfer |
The majority of older pathology data were initially inaccessible because of software licensing issues. When access was achieved, the reports could be accessed only one at a time, making it impractical to extract the data |
Norfolk and Norwich University Hospital | Technical issues Missing data |
Raw endoscopy and pathology data were extracted between November 2009 and August 2010 from the Micromed, EndoScribe and Scribe databases and the pathology database over several visits to the hospital, and partly cleaned and anonymised. However, on subsequent visits to complete the task the data had been misplaced Re-extraction would have been costly and time-consuming |
|
Pinderfields, Yorkshire | Software/data collection issue | Incomplete data | The data provided by a new HICSS system (Ascribe Ltd, now EMIS Group plc, Leeds, UK) were lacking any endoscopy procedures other than colonoscopy (e.g. sigmoidoscopy and rigid sigmoidoscopy) |
Queen Alexandra, Portsmouth | Missing data | Missing pathology data 2006–8 | |
University Hospitals of North Staffordshire NHS Trust | Software/data collection issue | Difficulties in bulk data transfer | Not possible to do a bulk data extraction at this hospital |
Data extraction
Endoscopy data
Endoscopy databases were searched first in order to identify patients undergoing colonic examinations, as the pathology databases contained a wide range of extracolonic samples. Before removal from the hospital, the extracted data were split into patient identifiers and endoscopic data. Patient identifiers included surname and forename(s), hospital number(s), NHS number, gender, postcode and date of birth. Endoscopic data included date of procedure, type of procedure, indications, endoscopist name, endoscopist comments, polyp information (such as size, shape, location, information on any biopsies taken), segment reached, quality of bowel preparation, complications encountered, diagnosis and any other information. The list of patient identifiers was cleaned to remove errors, inconsistencies and duplicates, and a unique study number was assigned to every patient. Study numbers were made up of a three-letter code, representing the hospital, followed by a six-digit number.
Pathology data
Pathology databases were searched for reports on colorectal lesions. The preferred search method used in most hospitals was SNOMED (College of American Pathologists), which defined the site and type of colonic lesions present. When this was not possible, Systematized Nomenclature of Pathology (SNOP) (College of American Pathologists) codes (four-digit versions of SNOMED codes), keywords or SNOMED International 3.0 codes (College of American Pathologists) were used (see Appendix 1 for details of the methods used to collect pathology data at each centre). Initial validation checks were performed to ensure that the pathology extract included the date of report, unique report number, type of procedure where specimens were taken, number of specimens and histological details.
Linking endoscopy and pathology data
Patients identified from endoscopy records were matched to their pathology records using a combination of hospital number, name and date of birth. Patient study numbers were then assigned to the matched pathology records. Manual inspection of the data and preliminary analyses were performed at the hospital to check that a sufficient number of pathology records were linked to a patient and that they occurred on, or near, the date of an endoscopy. When there was cause for concern (e.g. very few endoscopies linked to pathology reports; or endoscopies at which a biopsy was taken did not have an associated pathology report; or a large number of pathology reports could not be linked to an endoscopy, suggesting that the endoscopy extract did not retrieve all records), further investigations were undertaken and the data were re-extracted from the endoscopy and pathology systems where necessary.
Pseudo-anonymising data
In order to maintain patient confidentiality in accordance with the EU Directive on Good Clinical Practice (Directive 2005/28/EC), the Data Protection Act 199839 and the NHS Caldicott Principles, all patient identifiers except date of birth were removed from the pathology and endoscopy data, and the anonymised data were encrypted before being removed from the hospital.
A ‘patient-linking-file’ in Microsoft’s .xls or .xlsx format (Microsoft Corporation, Redmond, WA, USA), storing each patient’s identifiable information and study number was created, encrypted, and left at each hospital site. The raw endoscopy and pathology data and patient-linking-file were copied on to CDs and stored in secure locations at the hospital under supervision of the local PI.
Development of the master database
A master database was created to store the data in a standardised, structured format. To facilitate the statistical analysis, the data had to be classified into quantitative and qualitative variables, ensuring that data from different hospitals were classified in the same way, as there was wide variation in the raw data (e.g. field names were different; some data were coded or semi-coded, whereas other data were in free-text fields; and data types varied). The PI, study researchers, statistician and study programmer defined the data requirements for the study and designed the structure of the master database (see Appendix 2). The master database was designed to store the following:
-
the original source data (to safeguard against data loss during coding)
-
fields to store structured data that had been automatically extracted, cleaned and standardised using bespoke programs
-
fields to store the structured data which were manually coded.
Reference data (sometimes referred to as look-up tables) were used to categorise and define permissible values for data fields on the database. This method restricted the values to be recorded in a data field, thereby preventing coding errors and also ensuring uniformity of data from different hospitals.
It was necessary to transform the variety of data received from different hospitals into a standardised data set. As the volume of records was very large, it was necessary to code and categorise the data as much as possible using automatic coding without compromising the integrity of the data. Programs were developed to transform, clean and automatically code the data where possible. This involved several steps:
-
identifying the fields containing information required for the study, taking into account varying field names, data types and value representations
-
extracting information from free-text fields using programming techniques such as ‘regular expressions’ and ‘fuzzy matching’, and translating them into the codes used on the master database
-
translating values in the raw data into the codes used on the master database if the information was already in a coded structured format
-
identifying and consolidating overlapping data, and removing any redundancies (e.g. the same endoscopy or pathology reports extracted from two different systems)
-
identifying errors in the data and validating and correcting them (e.g. misspellings, different date formats, accounting for false-positive matches)
-
transforming polyp data to fit the structure of the polyp table in the master database. Some raw data sets had structured data on polyps (i.e. each polyp was represented as a separate table record); for other data sets, the study programmer had to separate the data into individual records.
After data transformation and cleaning, the raw data and structured data were uploaded to the master database, ensuring that the data were linked correctly across tables. Exclusions of ineligible patients were made automatically where possible, using programming techniques such as ‘regular expressions’ and ‘fuzzy matching’ to identify relevant keywords or phrases in the reports. Approximately 17% of patients were excluded using automatic exclusions.
Manual data coding
The records of the remaining 83% patients who had not been auto-excluded were manually interpreted and coded; this also involved checking the automatic coding on these records.
A web-based coding application with a graphical user interface was developed using APEX, allowing the study researchers to read, interpret and code the information in the database efficiently. This was called the Endoscopy and Pathology Reports Application (EPRA). The development of the EPRA evolved over time as new data items were encountered at different hospitals, and as processes for coding and analysing the data were developed. A change log of new features was maintained on the study database and updated when a new version of the EPRA was released. (Details and screenshots of the EPRA are provided in Appendix 3.)
Documents detailing standard operating procedures (SOPs) were produced to ensure standardised coding methods between study researchers. SOPs covered all basic coding methods, rules for coding individual fields within the database, and more complex processes used for tasks such as polyp numbering (see below). All SOPs can be found in Appendix 4.
A specific study researcher was allocated a patient’s complete set of records to ensure that the study researcher had access to all available information. The study researchers were responsible for:
-
checking and correcting data that had been automatically coded
-
checking that endoscopy and pathology records were properly linked
-
coding the raw endoscopy and pathology data into structured data
-
creating individual polyp records from the data provided in endoscopy reports. In some cases, the study researchers found that polyps were described as groups rather than as individual polyps. This is discussed further below
-
raising queries on records that could not be fully coded due to incomplete or insufficient information
-
creating a blank ‘pathology-based procedure report’ in cases for which the pathology record had no linked procedure report. Clinical information available in the pathology report was used to deduce details about the procedure from which the histological sample was obtained.
Coding accuracy and data interpretation were monitored to maintain consistency, using the following methods:
-
Study researchers systematically reviewed a blinded random sample of records that had been coded by other study researchers.
-
Regular meetings and continuous discussion/feedback were used to ensure uniformity of coding.
Records that had not been coded because of incomplete or insufficient information were reviewed by the study researchers, and further data were obtained from hospitals, where possible, in order to complete the coding.
Polyp numbering
A polyp found at one endoscopy examination that was not removed or only partially removed could be seen again at a later examination. To ensure that there was no double-counting of polyps, each polyp was assigned a unique polyp number that could be used to link sightings of the same polyp at different examinations. This process was called ‘polyp numbering’.
In approximately 17,000 patients, polyps were found on at least two occasions and were reviewed for manual polyp numbering. All sightings of an individual polyp were assigned the same unique polyp number. Each polyp was also assigned a match probability (to the nearest 10%), to indicate the degree of certainty that two polyps were the same lesion. The polyp that appeared to have the greatest number of matches with a high degree of certainty was chosen as a reference polyp, and all possible matches were considered in relation to this polyp. Polyp numbering guidelines were used to match polyps accurately and methodically, using all of the available information from the endoscopy and pathology reports. Particular attention was given to the following factors, listed in order of importance. Sightings at different examinations were considered more likely to be the same polyp if:
-
they occurred in the same segment of the colon, or in adjacent segments
-
there was an indication that the polyp at the earlier examination was not removed or was only partially removed
-
the quality of bowel preparation at the first examination was poor, making it less likely that a lesion would be removed
-
the lesions had similar grades of dysplasia
-
the lesions were the same histological type
-
the lesions had similar degrees of villousness.
Quality checks were carried out by the study researchers, who manually reviewed and checked a random sample of records for which polyps had been numbered by other study researchers.
Polyps matched with an arbitrary probability of ≥ 70%, using the above criteria, were considered the same lesion. More details on polyp numbering can be found in Appendix 6.
Polyp groups
Sometimes endoscopy reports described groups of polyps using terms such as ‘several’, ‘many’ and ‘multiple’, rather than individual polyps. During manual coding, specific fields were used to record this information. Each group of polyps was recorded as a single record and populated with information such as site, shape and histology, where this was common to all polyps within the group. Descriptions of the size and number of polyps in the group (e.g. ‘tiny’, ‘multiple’) were recorded. Where information was given for an individual polyp within the group, a polyp record was created and linked to the group record. The whole group (and the individual polyps linked to it) was allocated a unique group number.
Patients with multiple polyps could have groups of polyps seen at more than one examination, and a group of polyps seen at a later examination could include some or all of the polyps seen at a previous examination. In order to link groups of polyps seen at more than one examination, a separate group linking number was assigned to each group of polyps. This task was completed after all polyp groups had been recorded for a patient. Groups of polyps seen at more than one examination were matched and a probability was assigned, indicating the study researcher’s certainty that groups of polyps seen at separate examinations were of the same group.
The records for groups of polyps were then expanded into individual polyp records so that they could be analysed. An estimate of the number of polyps in each group was deduced from a value coded for the approximate number where available; otherwise the average of the minimum and maximum number of polyps recorded by the endoscopist was used. Alternatively, a numeric value was estimated for each vague number description (e.g. ‘some’, ‘several’, ‘few’), taking the average value for all groups in which both the specific descriptor and a numeric value was reported; these values used to define the number of polyps in the such groups are shown in Table 3.
Description | Estimated no. of polyps |
---|---|
A few | 3 |
Some | 3 |
A number of | 3 |
Several | 3 |
Many | 5 |
Multiple | 5 |
Additional information on the number of individual polyps seen at previous or subsequent examinations which were considered to be part of the same group was used to refine the estimate of the number of polyps in that group (see Appendix 6).
Once a final estimate had been derived for the total number of polyps in each group at each examination, a program was written to create individual polyp records. Where a polyp record was created based on the presence of a polyp at a previous or subsequent examination, the program assigned the same polyp number to the new polyp record, to show they were the same lesion.
Creating summary values for polyp characteristics
Most polyps were seen and removed at a single examination, and information about a polyp’s features was available from a single endoscopy and pathology report. Alternatively, a polyp might be seen at more than one examination with descriptive information contained in numerous endoscopy and pathology reports. In both of these scenarios, a single polyp characteristic might be coded for in multiple data fields. It was therefore necessary to create summary values for each lesion, taking into account information provided in reports on polyp characteristics at individual examinations and across examinations. However, the following issues had to be resolved first.
Missing polyp information
When information on a polyp characteristic was missing from the endoscopy report, it was sometimes possible to obtain supplementary information from the pathology report, or from other examinations at which the same polyp was detected.
Inconsistent polyp information
Polyp information reported in an endoscopy and pathology report for a specific examination could be inconsistent. Similarly, information reported across multiple sightings of a single polyp could be inconsistent. Sometimes it was clear from available information that an inconsistency was due to a coding or transcriptional error in one or more hospital reports. Rules were identified to determine which data items were likely to be errors; these records were manually reviewed and errors corrected where possible. Inconsistencies that could not be explained by error were resolved using hierarchies of rules (see Appendices 7 and 8).
Vague polyp information
Wherever possible, information on polyp characteristics was recorded on the database exactly as reported in endoscopy or pathology reports, and usually precise values were provided. However, in rare cases the observations recorded in the endoscopy report about size and location could be vague; for example size could be merely described as tiny or < 10 mm, and location could be described using a range of values; this was particularly problematic when there were multiple lesions seen at an examination. These vague descriptions of size, and ranges of values for size and location, were recorded in specific fields on the database. Rules were defined to derive a summary value for size and location of each individual polyp at an examination by combining all the available information. These rules are described in greater detail in this section and in Appendices 7 and 8.
Summary values were determined for polyp size, histological features, location and shape, and were derived separately for each visit. Summary values for polyp characteristics were derived using hierarchies of rules. In general, three stages were involved:
-
data cleaning to identify, review and resolve any errors in the polyp data
-
assessment of polyp characteristics at a single examination
-
assessment of polyp characteristics across examinations within a visit, if a polyp had multiple sightings.
Size
Polyp size information was recorded in several fields on the database, as shown in Table 4.
Size field | Variable namea | Description | Derived valuesa |
---|---|---|---|
Endoscopy size | ENDO_SIZE | Field used to record the exact size of a polyp (in millimetres), when described precisely in the endoscopy report | Derived endoscopy sizeENDO_SIZE, ENDO_SIZE_MIN and ENDO_SIZE_MAX were combined using a hierarchy of rules to give a single derived endoscopy size for each sighting of a polyp |
Minimum endoscopy size | ENDO_SIZE_MIN | Field used to record the minimum size of a polyp when a size range was described in the endoscopy report (e.g. 8–10 mm) | |
Maximum endoscopy size | ENDO_SIZE_MAX | Field used to record the maximum size of a polyp when a size range was described in the endoscopy report | |
Endoscopy size descriptor | ENDO_SIZE_OTHER | Field used to record the size of a polyp when it was described in vague terms in the endoscopy report (e.g. tiny, > 10 mm, < 5 mm, etc.) | Derived endoscopy size descriptorA numeric value was derived from a description (see Table 5) |
Pathology size | PATH_SIZE | Field used to record the exact size of a polyp or biopsy specimen as described by the pathologist in the pathology report | Derived pathology sizeThe precise size given in most pathology reports is used |
Exact sizes (endoscopy or pathology size) were available for 65% of polyps (of all types, including adenomas) but 8% of polyps had a numeric size with a minor discrepancy, or a size range (minimum and maximum endoscopy size); both of these issues were resolved to give an accurate size. In only 6% of polyps was the size estimated based on a qualitative size description (endoscopy size descriptor). This does not account for other sightings of the same polyp, so the proportion without a precise, numerical size is likely to have been even smaller than this. These proportions relate to all patients for whom we had data, rather than just IR patients, as adenoma risk groups could not be discerned until summary values for size had been defined.
The ‘endoscopy size descriptor field’ was used in cases for which a ‘vague’, qualitative or approximate size description was given in the endoscopy report. A numerical value was derived for each size description by analysing reports in which both qualitative size descriptions and a precise numerical size were given. The median and interquartile range (IQR) were calculated for each numeric size field and cross-tabulated against associated categories of the endoscopy size descriptor field, as shown in Table 5.
Endoscopy size descriptor category | Endoscopy size (mm) | Derived value size | Rationale for derived value size | |
---|---|---|---|---|
Median (IQR) | n | |||
Tiny | 3 (2–3) | 660 | 3 mm | Used the median |
Small | 3 (3–5) | 1574 | 5 mm | Used the larger value of 5 mm to draw a distinction between ‘Small’ and ‘Tiny’ |
< 5 mm | 3 (2–3) | 35 | 3 mm | Used the median |
5–9 mm | n/a | 0 | 7 mm | No examples so took the halfway point |
< 10 | 8 (8–8) | 3 | 8 mm | Used the median of available examples |
≥ 10 mm | 15 (13–15) | 79 | 15 mm | Used the median |
Large | 20 (12–30) | 2701 | 20 | Used the median |
The endoscopy size and minimum and maximum endoscopy sizes were combined using a hierarchy of rules to give a derived endoscopy size for each sighting of a polyp (see Appendices 7 and 8 for details). The numerical values assigned to the endoscopy size descriptor field were used as the derived endoscopy size descriptor. Most pathology reports provided a precise size, which was coded for each individual polyp biopsied or resected at an examination – this was taken as the derived pathology size. Derived endoscopy and pathology sizes were automatically assigned when possible. Study researchers manually reviewed polyps for which derived endoscopy and pathology sizes could not be assigned automatically.
Finally, the three derived polyp sizes – derived endoscopy size, derived endoscopy size descriptor and derived pathology size – were compared across examinations within a visit, and the largest of each derived size was identified. The largest derived sizes were compared and the largest of these was used as the summary polyp size. The only time the largest size was not used was if it was the derived endoscopy size descriptor, and the derived endoscopy and derived pathology size was also available, which were considered more accurate. Full details of these methods are provided in Appendices 7 and 8.
Histopathology
In all patients for whom data were collected, including low-, intermediate- and high-risk patients, histological data were available for 66% of polyps (all types of polyps, including adenomas). In some cases (34%), data on histological features of a polyp were missing because no biopsies were taken, a pathology report could not be identified at the hospital, or the polyp in question was not retrieved at endoscopy or not mentioned in the pathology report. This value does not account for other sightings of polyps – they may have been removed at another examination – and we would expect some of these polyps to have been insignificant and therefore not excised. Rules were applied to the data to resolve such issues and derive polyp histology, where possible. First, if a polyp had any degree of villousness or dysplasia coded then the polyp was assumed to be an adenoma. If the polyp was ≥ 10 mm in size and no histology was recorded at any sighting of the polyp, the histology was set to ‘specimen not seen’ or ‘not able to diagnose’. If the polyp without histology was ≥ 10 mm in size and the patient had at least one adenoma recorded then the polyp was then assumed to be an adenoma.
A polyp seen at more than one examination may not have been diagnosed as an adenoma until a later sighting. As baseline started from first sighting of an adenoma, it was necessary to apply adenomatous histology back to earlier sightings, provided that the sighting without histology occurred no more than 3 years prior to the adenoma diagnosis. Adenomatous histology was applied to earlier sightings of a polyp only if the histology for the earlier sighting was unknown or recorded as hyperplastic, granulation tissue, previous polypectomy site, normal mucosa, not possible to diagnose, or specimen not seen; this ensured that histology of greater severity than an adenoma was not overwritten.
A single polyp seen at more than one examination could have different histological features recorded at each sighting. To resolve these inconsistencies, histological types encountered in the study were split into two groups: group 1 consisted of the outcomes of interest (CRCs and adenomas), along with all histological types that could potentially occur in such lesions over time (Table 6); group 2 consisted of all other histological types (data not presented). Group 1 histological types were listed from most to least severe within the following groups – CRC, possible CRC, benign lesion, no polyp features/not possible to diagnose – as shown in Table 6. When there was no clear-cut order in terms of malignant potential, histological types were arbitrarily ordered by the specificity of the description. Initially, polyps with histology from groups 1 and 2 recorded at different sightings were reviewed to check whether or not there was a reporting or coding error. Then, for remaining polyps with histology from both groups, group 1 histology took precedence for the purpose of this study, except when the group 1 histology was uncertain or unimportant (e.g. ‘normal mucosa’, ‘granulation tissue’, ‘previous polypectomy site’, ‘not possible to diagnose’ or ‘specimen not seen’), in which case the group 2 histology took precedence.
Category | POLYP_TYPE |
---|---|
CRC | Cancer with remnant of sessile serrated lesion |
Cancer with remnant of mixed/serrated adenoma | |
Cancer with remnant of mixed adenoma | |
Cancer with remnant of serrated adenoma | |
Cancer with remnant of adenoma | |
Cancer | |
Possible CRC | Cancer or adenoma with HGD? (cancer in dispute) |
Cancer with unknown primary | |
Possible cancer (suspicious features but may be non-adenomatous) | |
Benign lesion | Sessile serrated lesion |
Mixed polyp (adenomatous and metaplastic features) | |
Serrated adenoma | |
Adenoma/assumed adenoma | |
Unicryptal adenoma | |
Metaplastic/hyperplastic polyp | |
No polyp features/not possible to diagnose | Previous polypectomy site |
Granulation tissue | |
Normal mucosa | |
Not possible to diagnose | |
Specimen not seen |
The histology of adenomas was further defined using their greatest degree of villousness and worst dysplasia recorded within a visit.
Polyp location
The following rules were used to define a value for polyp location across all visits:
-
Where the segment was recorded at a surgical procedure, this took precedence over any segment recorded at other types of procedure.
-
If there was no surgical procedure, the most frequently described segment was taken.
-
If no segment was mentioned more frequently than another, the most distal segment was taken.
-
In cases where a segment range was given, the following rules were applied:
-
If only one range was described, the most proximal and distal segments were recorded on the database as the true range (proximal defined as descending colon to terminal ileum; distal defined as anus to sigmoid colon).
-
If several site ranges were described, the smallest segment range was used as the true range, provided that the difference in the position of the most proximal and distal segments in the range was ≤ 2. Table 7 was used to allocate a position to each segment in order to calculate this difference. If the segment range differed by more than two, the records were manually reviewed to reach a decision.
-
Position | Segment |
---|---|
1 | Ileum |
2 | Caecum |
3 | Ascending colon |
4 | Hepatic flexure |
5 | Transverse colon |
6 | Splenic flexure |
7 | Descending colon |
8 | Sigmoid colon |
9 | Rectosigmoid |
10 | Rectum |
11 | Anus |
Polyp shape
It was unclear if the most appropriate method for assigning the true shape of a polyp would be to use an order of precedence, as with other polyp characteristics, or if the first description might be the most accurate, as the shape of a polyp may have been altered once it was biopsied/resected. Shape values included flat, sessile, pedunculated or subpedunculated. It was decided that the first recorded shape of a lesion would be used.
Procedure information
Procedure date and order
In most cases, the procedure date was simply the date of the endoscopy. However, if the endoscopy report was not available, the pathology report was used to derive the examination date. Up to three dates might be specified on the pathology report. The examination date was derived using the following order of precedence:
-
date that the biopsy specimen was taken
-
date that the biopsy specimen was received at the laboratory
-
date of the pathologist’s report.
In the rare cases when it was not possible to derive a procedure date, the patient was excluded from the study. Where procedures occurred on the same day, the reasons were specified and the examinations were numbered to assign an order, otherwise it was specified why it was not possible to do so.
Procedure type
The master database contained two types of procedural report: endoscopy reports extracted directly from endoscopy databases and pathology-based procedure reports generated using clinical and procedural information from the pathology report. The latter were created by study researchers in cases when no endoscopy report was available.
In cases where the procedure type was not reported or not specified (e.g. ‘endoscopy’), procedure type was derived by applying a hierarchy of rules based on available information. For example, when a procedure type was unknown, yet there was evidence that the transverse colon or beyond was reached, the procedure type was probably a colonoscopy. When information such as bowel preparation and depth of insertion was given, the procedure was probably a colonoscopy or flexible sigmoidoscopy (FS). Likewise, if a lesion with a size of ≥ 10 mm was removed, or multiple adenomas were removed, the procedure was also probably a colonoscopy or FS. Full details of rules for deriving procedure type are given in Appendix 7.
For some patients, it remained uncertain whether or not they had a baseline colonoscopy even after procedure type was derived (i.e. derived procedure type was ‘colonoscopy or FS’ at baseline). As patients had to have a baseline colonoscopy for inclusion in the study (a baseline colonoscopy was necessary to accurately stratify patients into risk groups), procedures that were derived as ‘colonoscopy or FS’ were reclassified as colonoscopies, based on adenoma risk group and type/timing of follow-up examinations. For example, patients with a derived procedure type of ‘colonoscopy or FS’ at baseline who were classified as IR or high risk (see Defining adenoma surveillance risk groups, below) were assumed to have had a colonoscopy at some point during baseline, and so the derived procedure was relabelled as such (see Appendix 7). Unlike baseline examinations, no derived procedure types were relabelled at follow-up examinations. Instead, for each follow-up visit, the most complete whole-colon examination available was defined using a hierarchy from ‘complete colonoscopy’ to ‘unknown procedure type’.
For patients without a baseline colonoscopy who had a colonoscopy at follow-up visit 1 (FUV1), the baseline visit was shifted so that FUV1 became the baseline visit and the original baseline visit became a ‘prior’ visit. To ensure that risk was not underestimated as a result of shifting baseline in this way, any adenomas found at prior examinations (original baseline) were used to determine risk as well as those found during the baseline visit.
Colonoscopy quality
Where there were several colonoscopies within a visit, the most complete examination and the best bowel preparation achieved at any colonoscopy was taken as the highest quality examination achieved at that visit. The quality and completeness of a colonoscopy was assessed, based on the segment of the colon reached, the most proximal polyp site, the quality of bowel cleansing prior to the examination and whether or not the examination was marked as incomplete.
The quality of the colonoscopy was important for defining visits (see Defining baseline and surveillance visits, below) as well as being a potential risk factor in the final data analysis.
Defining baseline and surveillance visits
For the purposes of this study, a ‘visit’ (baseline or follow-up) was defined as one or more examinations, performed in close succession, with the aim of completing a full examination of the colon and removing all detected lesions. This is based on the assumption that a single endoscopy is not always sufficient to visualise the entire colon (e.g. owing to poor bowel preparation) or to remove large, numerous or residual lesions.
Lesions found during the baseline visit were used to classify baseline risk of CRC and to stratify patients into adenoma surveillance risk groups (see Defining adenoma surveillance risk groups, below). In addition, certain diagnoses during baseline rendered the patient ineligible for the study, including CRC (see Patient eligibility, above). Follow-up visits were then defined around the baseline visit, with the length of time between visits being used to determine surveillance intervals.
Baseline visit
The baseline visit included the examination with the first adenoma sighting and any completion examinations that occurred within the subsequent 11 months. For high-risk adenomas, surveillance examinations are scheduled 1 year after the initial examination, in accordance with UK surveillance guidelines, so 11 months was chosen as the most appropriate time frame to capture any completion examinations into the baseline visit, without including high-risk follow-up examinations. After including all examinations within 11 months, a small proportion of patients had additional procedures that occurred shortly after the ‘latest’ baseline examination and thus needed to be included into the baseline visit. Baseline was therefore extended a second time, to include any examinations within 6 months of the latest baseline examination. Finally, in a handful of special scenarios, a third repeated extension was performed to capture examinations within 6 or 9 months of the latest baseline examination (6 months if the latest baseline examination was a colonoscopy and 9 months if it was a sigmoidoscopy). These rare cases included scenarios for which:
-
the latest baseline examination was incomplete
-
quality of bowel preparation at the latest baseline examination was poor
-
a large polyp (≥ 15 mm) was seen at the latest baseline examination
-
the same polyp was seen at the latest baseline examination and the next examination, which occurred within 6/9 months
-
the latest baseline examination was followed directly by a surgical examination.
After the extension of baseline, the length of baseline was assessed; only 2% of patients with IR adenomas had a baseline that exceeded 11 months in length.
Surveillance visits
A surveillance or follow-up visit was defined using similar rules for baseline. A follow-up visit comprised the first examination after baseline (or after a follow-up visit) and any further examinations within the subsequent 11 months. As with the baseline visit, the final examination in a follow-up visit was identified, and the follow-up visit was extended as necessary, using the same criteria as for the extension of baseline. This procedure was repeated until all examinations had been grouped into a follow-up visit. Visits following a diagnosis of CRC, volvulus or resection/anastomosis were censored, as patient follow-up would be affected by such diagnoses.
Surveillance interval
Surveillance intervals were timed from the last most complete examination of one visit to the first examination of the next visit, as defined in the NHS BCSP. 40
Defining adenoma surveillance risk groups
Once the baseline visit and true polyp values were defined, patients could then be stratified into adenoma surveillance risk groups. The risk groups were defined using the criteria for stratification of patients as low risk, intermediate risk or high risk, as described in the current UK Guideline (adopted by NICE). These definitions were applied based on all adenomas found within the baseline visit, and are given below. In addition, patients who could not be classified into a specific adenoma risk group were grouped into broader categories.
-
Low risk One or two small (< 10 mm) adenomas [no large (≥ 10 mm) adenomas or adenomas of unspecified size].
-
Intermediate risk Three or four small adenomas (no large adenomas or adenomas of unspecified size) or one or two adenomas, of which at least one is large.
-
High risk Five or more adenomas (any or unknown size) or three or more adenomas, of which at least one is large.
-
Low/intermediate risk One adenoma of unknown size or two adenomas, of which none is large but one or more has an unknown size.
-
Intermediate/high risk Three or four adenomas, of which none is large but one or more has an unknown size.
Patient follow-up
We matched our study patient data with records from external repositories of national patient data: HSCIC, NHSCR Scotland and NSS in order to achieve the following:
-
List clean patient records obtained from hospitals To correct patient information that had been entered incorrectly into hospital databases.
-
Identify duplicate records across hospitals Patients who had procedures at more than one hospital, and would have been allocated a different study number for each hospital where their data were collected.
-
Identify duplicate records in the same hospital Some patients were seen at the same hospital but, as a result of variations in patient identifiers, they had not been identified as the same patient.
-
Obtain cancers and deaths data Necessary to determine the incidence of CRC and the mortality status of patients in the cohort. The HSCIC provided the cancers and mortality data for patients residing in England or patients who resided in Scotland and had moved to England; the NHSCR provided the cancers and mortality data on patients who resided in England but had now moved to Scotland (the NHSCR and HSCIC work in partnership); and the NSS provided us cancers and mortality data on patients who resided in Scotland.
List cleaning
The patient-linking-file that was left at each hospital by the study programmer inevitably contained some patient identifiers that had been entered incorrectly into the hospital databases. It was therefore necessary to use the HSCIC/NHSCR’s list cleaning and tracing service and NSS’s linking service to validate and link the patient records to their database.
When the data sets were sent to HSCIC/NHSCR, no match was found for 5% of the patients. This revealed a limitation of having missing information, in that there was a higher chance of the supplied information matching more than one patient on the HSCIC database, resulting in rejection of a match. The study programmer worked closely with HSCIC to create bespoke matching algorithms that accounted for minor differences in dates of birth, names and NHS numbers in order to get the correct match, and, in some cases, additional data were collected from hospital to resolve the differences. Ultimately, matches were found for 99.65% of all 253,798 patients by the HSCIC/NHSCR and NSS.
Duplicate patient records
The national data repositories provided a list of duplicate patients found across the cohort (including patients from England/Wales and Scotland). When all of the duplicates had been identified, each set of duplicate records on our master database were merged into one record and an audit log was kept to show which records had been merged.
Cancer matching
Cancer and mortality (deaths) data for patients in our cohort were obtained from national patient data repositories (HSCIC, NHSCR Scotland and NSS). These data had to be added to the master database, taking into account the patient and cancer data already present, to ensure that there was no duplicated or missing data. This process was termed ‘cancer matching’.
To identify duplicate records of cancers, a program was written to identify CRCs in the national repositories’ data set and link them with the procedures and individual polyp records (including cancers) on the master database. Data quality checks were carried out, and samples of records that had been linked automatically were manually reviewed.
Cancers were linked to individual polyp records based on a hierarchy involving the cancer diagnosis date from the external source, the date the polyp or cancer was identified in the hospital data, the location of lesions, the polyp number, the time between the date of cancer diagnosis, and the date of the procedure during which the lesion was identified. For cancers reported in hospital pathology reports but not in national repositories, the hospital data were accepted as conclusive evidence of cancer, except when the histology was recorded by the study researcher as ‘cancer in dispute’ or ‘cancer query’.
The histology for cancers recorded on the study database as ‘in situ’ cancers and ‘cancers in dispute’ was compared with data from the national registries and automatically reclassified if necessary, using a hierarchy of rules. For example, polyps mapped to an in situ cancer from external sources were reclassified as ‘assume adenoma’ with high-grade dysplasia (HGD) if they were not already coded as such. Similarly, polyps in the database not mapped to a cancer from external sources, but with a histology recorded as ‘cancer in dispute’, were reclassified as ‘assume adenoma’ with HGD. A full list of the rules is given in Appendix 7 (see rule 13).
Values for the cancer diagnosis date, and site of the cancer, were assigned by comparing the data recorded on the study database with data from the national registries and applying a hierarchy of rules to arrive at the true value. For example, if the external cancer date preceded the mapped endoscopy date then the external date was used. Likewise for site, if no site was given in the mapped endoscopy data (or the site was non-specific), the site used in the external cancer data was used. A full list of the rules is given in Appendix 7 (see rule 11).
Variables
Outcomes
The primary outcome measures were adenoma, AA and CRC detected at the first and second follow-up visits, and CRC incidence after baseline, and after first follow-up. Previously seen lesions were excluded from some analyses, as they were thought to be a proxy measure for patients undergoing polypectomy site surveillance, and confounded the analysis. Outcomes that had not been seen at a previous visit were termed ‘new’ outcomes (see Chapter 3, New and previously seen lesions at first follow-up).
Advanced adenoma was defined as an adenoma of ≥ 10 mm, or with villous or tubulovillous histology, or HGD. CRCs were ascertained using pathological data recorded on the study database and International Statistical Classification of Diseases and Related Health Problems (ICD) codes in data from national repositories. To determine which cancers from the national repositories were outcomes of interest, they were grouped according to site and morphology (details are given in Appendix 7, rules 11 and 13). Only cancers from national repositories that fell into the site groups ‘malignant lesions of the colon/rectum’ and certain ‘in situ neoplasm – colon’ were selected for the study. Specifically, outcomes included adenocarcinomas of the colorectum and carcinomas with unspecified morphology located between the rectum and caecum that were assumed to be adenocarcinomas. Cancers with unspecified morphology located at sites related to the anus were likely to be squamous cell carcinomas and were therefore not classed as outcomes unless they were linked to a rectal lesion, in which case they were assumed to be adenocarcinomas. CRCs reported as a cause of death in national repositories were classed as outcomes if the patient did not have cancer recorded in the cancer registry or hospital data.
Colorectal cancer sites were defined by the World Health Organization (WHO) ICD versions ICD-8, ICD-9 and ICD-10, and included site codes C18–C20 (www.who.int/classifications/icd/en/). Morphology of colorectal neoplasia was coded with the Manual of Tumor Nomenclature and Coding codes,41 the WHO International Classification of Diseases for Oncology (ICD-O) ICD-O-1 codes42 and ICD-O-2 codes. 43
Exposures
The main exposures of interest were the length of the surveillance interval between baseline and first follow-up, and between the first and second follow-ups. The surveillance interval was defined as the period of time from the last most complete colonic examination at one visit to the first examination of the next visit (as in the NHS BCSP). 40 In order to define interval, a patient’s examinations were split into baseline and follow-up visits (see Defining baseline and surveillance visits, above). Interval length was then calculated and converted into a categorical variable with seven groups: > 18 months, 2, 3, 4, 5 and 6 years (all ± 6 months) and ≥ 6.5 years. Patients with the shortest interval were used as a reference group to compare with those who were exposed to a longer interval.
The other exposure of interest was the effect of adenoma surveillance on risk of CRC after baseline. Patients who attended at least one follow-up visit at which cancer was not diagnosed were considered to be exposed to surveillance.
Risk factors and potential confounders
Patient, procedural and polyp characteristics at baseline and follow-up were assessed as a priori risk factors and confounders; these included age and gender, examination quality (based on completeness of examination, quality of bowel preparation and difficulties encountered), calendar year of examination and hospital attended, and the number, size, location and histology of polyps and adenomas, villousness and dysplasia. All potential risk factors and confounders examined are listed and defined in Table 8.
Factor | Definition |
---|---|
Number of adenomas | Total number of adenomas seen during a visit |
Size of adenoma | Size of the largest adenoma seen during visit |
Villousness of adenoma | Worst degree of villousness of an adenoma seen during visit |
Dysplasia of adenoma | Worst degree of dysplasia of an adenoma seen during visit |
Distal or proximal adenomas | Detection of distal or proximal adenoma(s) at a visit. Proximal defined as descending colon to terminal ileum; distal defined as anus to sigmoid colon |
Distal or proximal polyps | Detection of distal or proximal polyp(s) of any type, including adenomas, at a visit. Proximal defined as descending colon to terminal ileum; distal defined as anus to sigmoid colon |
Age, years | Age of patient at time of visit |
Gender | Gender of patient |
Length of visit | Total length of a visit (in days, months or years) |
Number of examinations | Total number of examinations that make up a visit |
Most complete examination | Most complete procedure during visit (at baseline this was based on colonoscopy). Completeness was determined from segment reached by scope or location of polyp(s). A complete colonoscopy was one during which the scope reached, or polyps were found in, the caecum or beyond. If no colonoscopy was performed during the visit then the next most complete procedure type was used |
Best bowel preparation at colonoscopy | Best bowel preparation at a colonoscopy during a visit. If there was no colonoscopy then this was classified as ‘no known colonoscopy’ |
Difficult examination | Composite variable of examination quality. Ascertained from endoscopy report information. Coded ‘yes’ if there was poor bowel preparation, the maximum segment was not reached (i.e. caecum for colonoscopy, sigmoid colon for sigmoidoscopy) and another indicator of poor examination quality was provided, such as patient discomfort, looping, technical difficulty, equipment failure, etc. |
Number of sightings of a unique adenoma | The greatest number of times an adenoma was seen during a visit |
Number of hyperplastic polyps | Total number of hyperplastic polyps in a visit |
Number of large hyperplastic polyps | Total number of large (≥ 10 mm) hyperplastic polyps in a visit |
Number of polyps with unknown histology | Total number of polyps for which there is no histology available |
Calendar year | Year during which the visit took place |
Hospital | Hospital at which the visit took place |
Family history of cancer/CRC reported | Patient has a family history of cancer or CRC indicated at an examination during or prior to a visit |
Grouping of variables
All of the aforementioned risk factors were considered separately for the baseline visit and FUV1. In some instances it was necessary to add an additional level to a variable; for example, at FUV1 some patients did not have any adenomas or a colonoscopy, whereas at baseline every individual had an adenoma and a colonoscopy. The quantitative variables interval length and calendar year were grouped into categorical variables for some analyses and used in their continuous form in other circumstances. The remaining quantitative variables (visit length, age, adenoma size, number of examinations, number of sightings of a unique adenoma and numbers of specific polyp types) were grouped into categorical variables. Standard categorisations were created for all categorical variables and these were used in the presentation of univariable results. When appropriate, the process of selecting risk factors for inclusion in multivariable models involved the investigation of the categorisation of some variables, and the final categorisation was selected by evaluating the difference in effect between levels of the variable. When data were missing for a particular variable, an ‘unknown’ category was created in order to avoid losing patients from the models, particularly those models adjusting for several confounders. Models were tested with, and without including, the ‘unknown’ category to assess the difference it made.
Study size
Sample size requirements were based on the comparison of the rates of detection of AA or CRC at first follow-up at two different intervals, using heterogeneity in practice with respect to follow-up intervals. It was deemed plausible that 5% of subjects would have an intermediate- or high-risk lesion at first follow-up at 4–6 years and 3% at 2–4 years. 44,45 For 90% power to detect this rate in the sample at the 5% significance level in a two-sided test, it was estimated that 4400 subjects with at least one follow-up examination were required. For second or subsequent follow-up, the more relaxed criterion to estimate the detection rate within 1% in either direction was applied. It was anticipated that 3% of subjects would have intermediate- or high-risk lesions at second or subsequent follow-up. This required 1200 subjects with at least two follow-up endoscopies.
Consideration was given to the fact that the sample size was required to provide relatively low coefficients of variation of the test sensitivity (S) and λ2, the rate of progression to clinical CRC, so as to enable the comparison of different intervals between follow-up with respect to rates of cancers that accrued. In order to use these with confidence to predict effects of different follow-up policies, a high degree of precision in estimation of S and λ2 was required. It was therefore stipulated that both have coefficients of variation of no more than 30% [i.e. the standard error (SE) of each estimate has magnitude no larger than 30% of the value of the estimate]. Closed-form estimation was not possible for these quantities and it was difficult to predict the variability of the estimates. Work by Chen et al. 46 and Wong et al. 47 suggests that, with around 30 events, coefficients of variation of ≤ 30% may be achieved if the rate of progression is small (≤ 0.2 per annum). Stratification or the introduction of covariates would reduce the precision and therefore the aim was to recruit a cohort with a total of 60 CRCs.
Stryker et al. 48 found rates of progression in untreated adenomas suggestive of a λ2 of around 0.01 for progression to CRC. Atkin et al. 3 studied a wide case mix of treated polyps at entry (corresponding to the situation in this project), and suggested a rate of around 2 per 1000 per year after colonoscopy overall and around 4.5 per 1000 per year for the high-risk subgroup. Thus, in the literature at the time of the call for proposal, the rate ranged from 2 to 10 per 1000 per year.
It was assumed that the underlying risk of CRC in the cohort would be considerably higher than the population risk, but that the relative risk might be brought down by the protection of endoscopic examination to between one and two times the population risk in males aged ≥ 50 years. This meant that there would be between 2.5 and 5 end points per 1000 per year. In total, therefore, between 12,000 and 24,000 person-years (pys) of follow-up after endoscopy episodes would be required. Assuming an average of 4 years’ observation, this required recruiting cohorts to a total of 6000 subjects. A failsafe strategy to recruit 10,000 was proposed.
Statistical methods
The statistical analysis strategy was split into three main stages: analysis of (1) first follow-up findings in relation to baseline findings; (2) second follow-up findings in relation to baseline and first follow-up findings; and (3) incidence of CRC after baseline in relation to risk factors and exposure to surveillance. The analysis of findings at follow-up aimed to ascertain whether or not there was substantial heterogeneity of results at subsequent examination, in terms of detection rates of AA or CRC, according to risk factors and confounders, and interval to follow-up colonoscopy. The analysis of the incidence of CRCs after baseline aimed to determine the effect of surveillance on long-term CRC risk and to identify independent risk factors for incident cancer. All tests were two-tailed with significance assigned at 5%. In all instances, adjusted effect estimates from multivariable analyses should be considered superior to unadjusted effect estimates reported in univariable analyses. Analyses were performed with Stata/IC version 13.1 (StataCorp LP, College Station, TX, USA).
The distribution of baseline characteristics among patients with and without follow-up visits was compared using chi-squared tests.
Follow-up visit 1 findings in relation to baseline findings
Initially, findings at FUV1 were investigated, considering any adenomas, AAs or CRCs, with a focus on AA and CRC outcomes. The relationship between baseline risk factors and findings at FUV1 was modelled using univariable logistic regression to estimate unadjusted odds ratios (OR). The association between interval length and baseline risk factors was evaluated using chi-squared tests. The relationship between interval from baseline to FUV1 and outcomes at FUV1 was explored both with and without adjustment for baseline risk factors using logistic regression models. Many risk factors for AA and CRC that were potential confounders were known already, based on the substantial body of evidence in the literature. 15,22,23 Owing to the large number of potential confounders, backwards stepwise logistic regression models and likelihood ratio tests (LRTs) were used to identify important confounders to be included in the models, with the significance level for inclusion set at 5%. Interval, our main variable of interest, was constrained to be included in all models. Models were also constructed to consider only ‘new’ outcomes, meaning that those lesions had not been previously seen before FUV1. Separate models for all outcomes were constructed for interval considered as a continuous variable and as a categorical variable. Effect modification of the association between interval and new findings at FUV1 by age and gender were investigated by fitting models with interaction parameters and performing a test for interaction; effect modification was investigated for only interval as a continuous variable, as this enabled examination of potential trends, and it was unlikely to be of any practical use to know whether or not the effect of interval was significantly different in a particular age group if there was no trend in the effect.
Risk factors associated with an interval of < 2 years were identified using logistic regression, and a backwards stepwise logistic regression model was used to identify independent predictors of an interval of < 2 years. All p-values from models were calculated using LRTs.
Follow-up visit 2 findings in relation to baseline and follow-up visit 1 findings
A similar approach to the analysis of outcomes at FUV1 was adopted for outcomes at follow-up visit 2 (FUV2). The relationships between FUV1 risk factors and AA and CRC at FUV2, and between baseline risk factors and AA and CRC at FUV2, were modelled using univariable logistic regression for each confounder separately. Owing to the small number of CRCs detected at FUV2, AA and CRC were grouped together and the outcome of interest was advanced neoplasia (AN). The relationship between interval from FUV1 to FUV2 and detection of AN at FUV2 was explored both with and without adjustment for FUV1 risk factors, baseline risk factors (including interval from baseline to FUV1) and cumulative baseline and FUV1 risk factors using logistic regression models. Backwards stepwise logistic regression models and LRTs were used to identify important confounders to be included in the models, with the significance level for inclusion set at 5%. The chosen confounders from each of these models were then added to a stepwise model to identify the most important factors. To compare the model fit of each of the constructed logistic regression models, pseudo R-squared values and the Akaike information criterion (AIC) were calculated. As before, our main variable of interest (interval to FUV2) was constrained to be included in all models. The complete model selection process was performed separately for interval considered as a continuous variable and as a categorical variable. All models considered only ‘new’ outcomes, that is lesions that had not been previously seen before FUV2. Effect modification of the association between continuous interval and new AN at FUV2 by age and gender was investigated by fitting models with interaction parameters and performing a test for interaction.
Colorectal cancer incidence after baseline
In the analysis of CRC incidence after baseline, for patients matched to national sources the cut-off for follow-up was either 31 December 2011 or 30 June 2012 (depending on the data source), and for unmatched patients it was the date of the patient’s last recorded procedure. All time-to-event data were censored at first CRC diagnosis, death, emigration or end of follow-up.
For the analysis of incidence following baseline, time at risk started from the latest most complete colonoscopy in baseline, and for the analysis of incidence following FUV1, time at risk started on the date of the first procedure in FUV1. If CRC was diagnosed at a follow-up visit, the follow-up visit was not included as a visit, as it did not offer any protection against CRC. Incident CRC outcomes included ‘new’ CRCs only, that is cancers arising in lesions that had not been seen at baseline.
‘One minus the Kaplan–Meier estimator of the survival function’ was used to illustrate the time to cancer diagnosis and to estimate the cumulative risk of cancer with 95% confidence intervals (CIs) at 3, 5 and 10 years. The effects of surveillance and patient, procedural and polyp characteristics at baseline and follow-up on long-term CRC incidence were examined using Cox proportional hazards models. Univariable models were used to estimate unadjusted hazard ratios (HRs). Independent predictors of cancer incidence were identified in a multivariable Cox proportional hazards model, using backward stepwise selection with a p-value of < 0.05 in the LRT to determine the retention of variables in the final model. The number of follow-up visits was included as a time-varying covariate and, as our main variable of interest, was constrained to be included in all adjusted models. Effect modification of the association between surveillance and long-term CRC risk by age and gender was investigated by fitting models with interaction parameters and performing a test for interaction.
For the analysis of risk after baseline, only baseline risk factors were considered. For the analysis of risk following FUV1, separate models were built, considering baseline factors only, FUV1 factors only and cumulative factors only. The risk factors identified from these models were then considered together and a final model selected. All p-values from models were calculated using LRTs.
The incidence of CRC was compared with that expected in the general population. Observed pys at risk were calculated by gender and 5-year age group. Expected numbers of CRC cases were calculated by multiplying the observed gender- and age-specific number of pys by the gender- and age-specific incidence in the general population of England in 2007. The ratio of observed to expected cases was reported as a standardised incidence ratio (SIR), and 95% CIs were computed assuming an exact Poisson distribution.
Sensitivity analyses and internal validation
We conducted several sensitivity analyses to investigate whether or not our methods were robust and did not introduce bias into the results. To assess the methods we used to define baseline, follow-up visits and interval, we restricted analyses of the effect of interval on the finding of new AA and CRC at FUV1 and the effect of surveillance on long-term CRC incidence after baseline: first, to patients with only one colonoscopy in baseline, and, second, to patients who had at least one complete colonoscopy at FUV1. To examine whether or not the definition of AA that was used had an impact on results, we performed a sensitivity analysis for the outcome of new AA detection at FUV1 with a definition of AA that excluded villous or tubulovillous histology, that is with AA defined as an adenoma with HGD or with a size of ≥ 10 mm. Finally, we conducted sensitivity analyses of the effect of surveillance on long-term CRC incidence when the cohort was restricted to patients who had at least 5 years and at least 7 years of time in which hospital data had been collected; this was to examine the possible effect of misclassification of attendance at follow-up visits on the estimated effect of surveillance.
To assess the predictive ability of the multivariable logistic models for the outcomes of new findings at follow-up and the multivariable Cox regression models for the analysis of long-term cancer incidence, we performed internal validation using k-fold cross-validation with k = 10. 49 For each model, the linear predictors were used to construct receiver operating characteristic (ROC) curves for each of the 10 validation sets, and the area under the ROC curve and its SE were calculated for each; the inverse variance weighted mean ROC curve and area below the curve were then calculated from these.
Chapter 3 Hospital data set: results and discussion
Routine endoscopy and pathology records for 253,798 patients were assessed; 174,978 were excluded as no adenomas were reported: 45,716 were found to be ineligible as a result of colonic conditions, 2752 had no colonoscopy at baseline 92 had missing procedure dates and one had > 40 examinations, leaving 30,259 eligible patients with a histologically confirmed adenoma at baseline. A total of 11,995 (40%) eligible patients were classified as having IR adenomas, of whom 51 IR patients were lost to follow-up (could not be matched with national cancer registry data or embarked before the end of baseline), leaving 11,944 patients for the analysis (Figure 2).
Baseline characteristics of all intermediate-risk patients and those with follow-up
We examined demographic, procedural, adenoma and polyp characteristics at baseline, and date and place of the baseline visit, for all 11,944 eligible IR patients. A total of 4608 (39%) patients had at least one follow-up visit and all patients were followed using NHS data and national cancer registries and deaths data (see Long-term cancer risk, below). We first assessed whether or not patients with and without follow-up visits after baseline differed in order to determine the risk of selection bias in analysis of findings at, and subsequent to, follow-up visits.
Table 9 describes demographic and procedural characteristics at baseline, and date of the baseline visit. The median age of the whole cohort of IR patients was 66.7 years (IQR 58.4–74.0 years) and 55% were male. Those who attended a follow-up were younger, on average, than those who did not (mean = 63.3 vs. 67.3 years; p < 0.001), but there was no difference by gender (p = 0.852). Most baseline examinations occurred between 2000 and 2010 (84%), but patients attending follow-up had their baseline visits and, consequently, their adenomas diagnosed significantly earlier than those without follow-up. The absolute differences between hospitals in the proportion having follow-up were small, but because of large numbers the results were significant (results not presented: p < 0.001).
Baseline factor | All IR patients (N = 11,944) | Patients with one or more follow-up visits (N = 4608) | Patients with no follow-up visits (N = 7336) | p-value (chi-squared test) | ||||
---|---|---|---|---|---|---|---|---|
n | % | n | % | n | % | |||
Age (years) | < 55 | 2122 | 17.77 | 1025 | 22.24 | 1097 | 14.95 | < 0.001 |
≥ 55 and < 60 | 1321 | 11.06 | 622 | 13.50 | 699 | 9.53 | ||
≥ 60 and < 65 | 1858 | 15.56 | 788 | 17.10 | 1070 | 14.59 | ||
≥ 65 and < 70 | 2171 | 18.18 | 813 | 17.64 | 1358 | 18.51 | ||
≥ 70 and < 75 | 1786 | 14.95 | 714 | 15.49 | 1072 | 14.61 | ||
≥ 75 and < 80 | 1416 | 11.86 | 413 | 8.96 | 1003 | 13.67 | ||
≥ 80 | 1270 | 10.63 | 233 | 5.06 | 1037 | 14.14 | ||
Gender | Male | 6625 | 55.47 | 2551 | 55.36 | 4074 | 55.53 | 0.852 |
Female | 5319 | 44.53 | 2057 | 44.64 | 3262 | 44.47 | ||
Family history of cancer | No | 11,445 | 95.82 | 4368 | 94.79 | 7077 | 96.47 | < 0.001 |
Yes | 499 | 4.18 | 240 | 5.21 | 259 | 3.53 | ||
Year of baseline | 1985–9 | 112 | 0.94 | 98 | 2.13 | 14 | 0.19 | < 0.001 |
1990–4 | 327 | 2.74 | 241 | 5.23 | 86 | 1.17 | ||
1995–9 | 1430 | 11.97 | 1030 | 22.35 | 400 | 5.45 | ||
2000–4 | 4251 | 35.59 | 2317 | 50.28 | 1934 | 26.36 | ||
2005–10 | 5824 | 48.76 | 922 | 20.01 | 4902 | 66.82 | ||
Length of baseline visit | 1 day | 6836 | 57.23 | 2496 | 54.17 | 4340 | 59.16 | < 0.001 |
2–30 days | 734 | 6.15 | 246 | 5.34 | 488 | 6.65 | ||
1–3 months | 1643 | 13.76 | 664 | 14.41 | 979 | 13.35 | ||
3–6 months | 1382 | 11.57 | 595 | 12.91 | 787 | 10.73 | ||
6–12 months | 1177 | 9.85 | 508 | 11.02 | 669 | 9.12 | ||
1–2 years | 160 | 1.34 | 91 | 1.97 | 69 | 0.94 | ||
2–3 years | 8 | 0.07 | 5 | 0.11 | 3 | 0.04 | ||
3–4 years | 4 | 0.03 | 3 | 0.07 | 1 | 0.01 | ||
Number of examinations in baseline visit | 1 | 6826 | 57.15 | 2489 | 54.01 | 4337 | 59.12 | < 0.001 |
2 | 3788 | 31.71 | 1518 | 32.94 | 2270 | 30.94 | ||
3 | 908 | 7.60 | 392 | 8.51 | 516 | 7.03 | ||
4+ | 422 | 3.53 | 209 | 4.54 | 213 | 2.90 | ||
Most complete colonoscopy | Complete | 9016 | 75.49 | 2973 | 64.52 | 6043 | 82.37 | < 0.001 |
Incomplete | 1601 | 13.40 | 1157 | 25.11 | 444 | 6.05 | ||
Unknown | 1327 | 11.11 | 478 | 10.37 | 849 | 11.57 | ||
Best bowel preparation at colonoscopy | Excellent | 246 | 2.06 | 92 | 2.00 | 154 | 2.10 | < 0.001 |
Good | 3710 | 31.06 | 1309 | 28.41 | 2401 | 32.73 | ||
Satisfactory | 1922 | 16.09 | 487 | 10.57 | 1435 | 19.56 | ||
Poor | 671 | 5.62 | 194 | 4.21 | 477 | 6.50 | ||
Unknown | 5395 | 45.17 | 2526 | 54.82 | 2869 | 39.11 | ||
Difficult examinationa | No | 11,229 | 94.01 | 4387 | 95.20 | 6842 | 93.27 | < 0.001 |
Yes | 715 | 5.99 | 221 | 4.80 | 494 | 6.73 |
More than half of patients had a 1-day baseline visit consisting of a single colonoscopy; however, 39% of patients required two or three examinations during their baseline visit, and 12 patients had a long baseline visit of ≥ 2 years, mainly to treat a large, recurring lesion (which was distally located in most cases). Patients attending follow-up tended to have more baseline examinations and a longer duration of the baseline visit, although absolute differences were small.
All patients had at least one baseline colonoscopy and 75% were reported to have had a complete colonoscopy. In around 50% of patients, the ‘best’ bowel preparation at a baseline colonoscopy (some individuals had more than one) was deemed to be satisfactory or better, and was described as poor in only 6% of cases; however, the quality of the bowel preparation was unknown for 45% of patients. In addition, 6% of patients were reported to have had a difficult examination at baseline: a composite measure of examination quality that indicated an incomplete examination with poor preparation and additional difficulties encountered. Patients who attended follow-up were more likely to have missing data on bowel preparation (p < 0.001) and less likely to have had a complete colonoscopy (p < 0.001) at baseline than those without follow-up.
Table 10 describes the characteristics of the adenomas and polyps diagnosed during the baseline visit. Patients defined as IR according to the UK Adenoma Surveillance guideline16 could not have had more than four adenomas at baseline otherwise they would have been classified as high risk. Owing to the use of adenoma size and number in the definition of IR, these characteristics were associated, and most patients had one large adenoma as opposed to three or four small ones (66% vs. 9%). In 37% of patients, the largest baseline adenoma was between 10 and 14 mm, whereas 34% had an adenoma of > 20 mm in size. In addition, 17% of patients had a baseline adenoma with HGD, whereas 10% had an adenoma with villous histology; 80% had an adenoma in the distal colon or rectum and 31% had a proximal adenoma, whereas 14% had adenomas in both regions. In most patients, adenomas were seen just once during baseline (74%); however, in some patients, a single adenoma was seen multiple times. The distribution of adenoma characteristics was significantly different for those with and without follow-up, but the absolute differences were small.
Baseline factor | All IR patients (N = 11,944) | Patients with one or more follow-up visits (N = 4608) | Patients with no follow-up visits (N = 7336) | p-value (chi-squared test) | ||||
---|---|---|---|---|---|---|---|---|
n | % | n | % | n | % | |||
Adenoma characteristics | ||||||||
Number | 1 | 7842 | 65.66 | 3107 | 67.43 | 4735 | 64.54 | < 0.001 |
2 | 3073 | 25.73 | 1151 | 24.98 | 1,922 | 26.20 | ||
3 | 748 | 6.26 | 240 | 5.21 | 508 | 6.92 | ||
4 | 281 | 2.35 | 110 | 2.39 | 171 | 2.33 | ||
Largest size (mm) | < 10 | 1029 | 8.62 | 350 | 7.60 | 679 | 9.26 | < 0.001 |
10–14 | 4417 | 36.98 | 1577 | 34.22 | 2840 | 38.71 | ||
15–19 | 2440 | 20.43 | 953 | 20.68 | 1487 | 20.27 | ||
≥ 20 | 4058 | 33.98 | 1728 | 37.50 | 2330 | 31.76 | ||
Worst histology | Tubular | 4742 | 39.70 | 1723 | 37.39 | 3019 | 41.15 | < 0.001 |
Tubulovillous | 5576 | 46.68 | 2136 | 46.35 | 3440 | 46.89 | ||
Villous | 1142 | 9.56 | 459 | 9.96 | 683 | 9.31 | ||
Unknown | 484 | 4.05 | 290 | 6.29 | 194 | 2.64 | ||
Worst dysplasia | Low grade | 9476 | 79.34 | 3427 | 74.37 | 6049 | 82.46 | < 0.001 |
High grade | 1994 | 16.69 | 850 | 18.45 | 1144 | 15.59 | ||
Unknown | 474 | 3.97 | 331 | 7.18 | 143 | 1.95 | ||
Location | Distal only | 7831 | 65.56 | 3070 | 66.62 | 4761 | 64.90 | < 0.001 |
Proximal only | 1985 | 16.62 | 681 | 14.78 | 1304 | 17.78 | ||
Distal and proximal | 1665 | 13.94 | 601 | 13.04 | 1064 | 14.50 | ||
Unknown | 463 | 3.88 | 256 | 5.56 | 207 | 2.82 | ||
Distal | No | 2448 | 20.50 | 937 | 20.33 | 1511 | 20.60 | 0.729 |
Yes | 9496 | 79.50 | 3671 | 79.67 | 5825 | 79.40 | ||
Proximal | No | 8294 | 69.44 | 3326 | 72.18 | 4968 | 67.72 | < 0.001 |
Yes | 3650 | 30.56 | 1282 | 27.82 | 2368 | 32.28 | ||
Number of sightings of a single adenoma | 1 | 8807 | 73.74 | 3311 | 71.85 | 5496 | 74.92 | < 0.001 |
2 | 2548 | 21.33 | 1005 | 21.81 | 1543 | 21.03 | ||
3 | 390 | 3.27 | 182 | 3.95 | 208 | 2.84 | ||
4 | 108 | 0.90 | 63 | 1.37 | 45 | 0.61 | ||
5+ | 91 | 0.76 | 47 | 1.02 | 44 | 0.60 | ||
Polyp characteristics (all types) | ||||||||
Number of hyperplastic polyps | 0 | 9874 | 82.67 | 3743 | 81.23 | 6131 | 83.57 | 0.005 |
1 | 1307 | 10.94 | 541 | 11.74 | 766 | 10.44 | ||
2 | 405 | 3.39 | 159 | 3.45 | 246 | 3.35 | ||
3 | 152 | 1.27 | 64 | 1.39 | 88 | 1.20 | ||
4 | 76 | 0.64 | 38 | 0.82 | 38 | 0.52 | ||
5+ | 130 | 1.09 | 63 | 1.37 | 67 | 0.91 | ||
Number of large hyperplastic polyps | 0 | 11,761 | 98.47 | 4525 | 98.20 | 7236 | 98.64 | 0.232 |
1 | 168 | 1.41 | 75 | 1.63 | 93 | 1.27 | ||
2 | 10 | 0.08 | 6 | 0.13 | 4 | 0.05 | ||
3 | 3 | 0.03 | 1 | 0.02 | 2 | 0.03 | ||
4 | 1 | 0.01 | 1 | 0.02 | 0 | 0.00 | ||
5 | 1 | 0.01 | 0 | 0.00 | 1 | 0.01 | ||
Number of polyps with unknown histology | 0 | 9322 | 78.05 | 3593 | 77.97 | 5729 | 78.09 | 0.004 |
1 | 1510 | 12.64 | 556 | 12.07 | 954 | 13.00 | ||
2 | 517 | 4.33 | 187 | 4.06 | 330 | 4.50 | ||
3 | 249 | 2.08 | 108 | 2.34 | 141 | 1.92 | ||
4 | 129 | 1.08 | 63 | 1.37 | 66 | 0.90 | ||
5+ | 217 | 1.82 | 101 | 2.19 | 116 | 1.58 | ||
Distal polyp | No | 1980 | 16.58 | 739 | 16.04 | 1241 | 16.92 | 0.208 |
Yes | 9964 | 83.42 | 3869 | 83.96 | 6095 | 83.08 | ||
Proximal polyp | No | 7369 | 61.70 | 2940 | 63.80 | 4429 | 60.37 | < 0.001 |
Yes | 4575 | 38.30 | 1668 | 36.20 | 2907 | 39.63 |
Baseline polyp characteristics (including number, location and type) were considered as potential risk factors for findings at follow-up. In addition to their IR adenoma(s), 17% had hyperplastic polyps and 2% had large (≥ 10 mm) hyperplastic polyps found at baseline. In total, 83% of patients had a distal polyp, 38% had a proximal polyp and 25% had polyps in both regions. Polyp characteristics in those with and without follow-up were generally similar; however, a greater proportion of patients without follow-up had proximal polyps (p < 0.001).
Hospitals data set: patients attending follow-up visits
Table 11 describes the amount of follow-up in the hospital cohort. A total of 4608 patients had at least one follow-up visit and 1635 had two. Only 555 patients had three or more follow-up visits, so analyses of findings at follow-up were restricted to the first and second follow-up visits in which there were sufficient numbers of outcomes (see Table 11).
Number of follow-up visits | Number of patients | Cumulative number of patients who had at least 1, 2, 3 . . . x examinations | |
---|---|---|---|
n | % | ||
1 | 2973 | 64.52 | 4608 |
2 | 1080 | 23.44 | 1635 |
3 | 354 | 7.68 | 555 |
4 | 135 | 2.93 | 201 |
5 | 45 | 0.98 | 66 |
6 | 14 | 0.30 | 21 |
7 | 2 | 0.04 | 7 |
8 | 2 | 0.04 | 5 |
9 | 2 | 0.04 | 3 |
10 | 1 | 0.02 | 1 |
Total | 4608 | 100.00 | 4608 |
Table 12 shows the intervals to visits in patients having follow-up. Almost 60% of patients returned for their FUV1 earlier than the 3-year interval currently recommended for people with IR adenomas. The interval between baseline and first follow-up was < 3 years in 59% of patients, 3–4 years in 31% of patients and ≥ 5 years in 10% of patients. With regard to the interval between the first and second follow-up visits, once again, most patients (47%) had an interval of < 3 years but a greater proportion (41%) of patients had an interval of 3–4 years. Excluding the outliers with six or more follow-ups, the proportion of patients with a short interval of < 18 months tended to decrease with increasing number of follow-up visits.
Follow-up visit number | Number of patients with varying interval lengthsa | Total | ||||||
---|---|---|---|---|---|---|---|---|
< 18 months | 2 yearsb | 3 yearsb | 4 yearsb | 5 yearsb | 6 yearsb | ≥ 6.5 years | ||
1 | 1760 (38.19) | 976 (21.18) | 1057 (22.94) | 355 (7.70) | 217 (4.71) | 123 (2.67) | 120 (2.60) | 4608 (100) |
2 | 397 (24.28) | 376 (23.00) | 518 (31.68) | 152 (9.30) | 131 (8.01) | 31 (1.90) | 30 (1.83) | 1635 (100) |
3 | 131 (23.60) | 110 (19.82) | 191 (34.41) | 51 (9.19) | 42 (7.57) | 17 (3.06) | 13 (2.34) | 555 (100) |
4 | 48 (23.88) | 45 (22.39) | 65 (32.34) | 22 (10.95) | 20 (9.95) | 1 (0.50) | 0 (0) | 201 (100) |
5 | 22 (33.33) | 12 (18.18) | 23 (34.85) | 4 (6.06) | 5 (7.58) | 0 (0) | 0 (0) | 66 (100) |
6 | 2 (9.52) | 7 (33.33) | 7 (33.33) | 3 (14.29) | 2 (9.52) | 0 (0) | 0 (0) | 21 (100) |
7 | 1 (14.29) | 3 (42.86) | 2 (28.57) | 1 (14.29) | 0 (0) | 0 (0) | 0 (0) | 7 (100) |
8 | 2 (40.00) | 1 (20.00) | 1 (20.00) | 0 (0) | 0 (0) | 0 (0) | 1 (20.00) | 5 (100) |
9 | 1 (33.33) | 1 (33.33) | 1 (33.33) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 3 (100) |
10 | 0 (0) | 1 (100.00) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 1 (100) |
Total | 2364 (33.29) | 1532 (21.57) | 1865 (26.26) | 588 (8.28) | 417 (5.87) | 172 (2.42) | 164 (2.31) | 7102 (100) |
First follow-up visit
Examinations and findings
Table 13 shows the proportion of patients found to have adenomas (all types), AA and CRC at FUV1 according to interval from baseline. Overall, 1605 (35%) patients had adenomas, 723 (16%) had AA and 84 (2%) had CRC detected at FUV1. The proportion of patients with adenomas was relatively constant across different intervals and ranged from 34% to 40%, whereas the proportion of patients with AA showed more variation, ranging from 14% to 26%, and the proportion with CRC ranged from 0.5% to 5%. The proportion of patients with CRC detected at FUV1 tended to increase with increasing interval to FUV1.
Interval baseline to first follow-up | IR patients | Findings at FUV1 | ||||||
---|---|---|---|---|---|---|---|---|
Adenoma | AA | CRC | ||||||
N a | % | n | % | n | % | n | % | |
< 18 months | 1760 | 38.19 | 595 | 33.81 | 268 | 15.23 | 29 | 1.65 |
2 yearsb | 976 | 21.18 | 349 | 35.76 | 165 | 16.91 | 25 | 2.56 |
3 yearsb | 1057 | 22.94 | 360 | 34.06 | 151 | 14.29 | 6 | 0.57 |
4 yearsb | 355 | 7.70 | 123 | 34.65 | 56 | 15.77 | 9 | 2.54 |
5 yearsb | 217 | 4.71 | 85 | 39.17 | 34 | 15.67 | 4 | 1.84 |
6 yearsb | 123 | 2.67 | 49 | 39.84 | 18 | 14.63 | 5 | 4.07 |
≥ 6.5 years | 120 | 2.60 | 44 | 36.67 | 31 | 25.83 | 6 | 5.00 |
Total | 4608 | 100.00 | 1605 | 34.83 | 723 | 15.69 | 84 | 1.82 |
Table 14 describes examinations undertaken during FUV1. For most patients, FUV1 comprised a single examination (88%) and in 72% of patients the most complete examination was a complete colonoscopy.
Examinations during FUV1 | IR patients (N = 4608) | ||
---|---|---|---|
n | % | ||
Length of FUV1 | 1 day | 4068 | 88.00 |
2–30 days | 64 | 1.39 | |
1–3 months | 126 | 2.73 | |
3–6 months | 149 | 3.23 | |
6–12 months | 162 | 3.52 | |
1–2 years | 38 | 0.82 | |
2–3 years | 1 | 0.02 | |
Number of examinations during FUV1 | 1 | 4060 | 88.00 |
2 | 394 | 8.55 | |
3 | 101 | 2.19 | |
4+ | 53 | 1.15 | |
Most complete examination during FUV1 | Complete colonoscopy | 3299 | 72.00 |
Colonoscopy of unknown completeness | 259 | 6.00 | |
Incomplete colonoscopy | 404 | 8.77 | |
Colonoscopy or FS | 192 | 4.17 | |
FS | 326 | 7.07 | |
Colonoscopy, FS or rigid sigmoidoscopy | 103 | 2.24 | |
Surgery | 16 | 0.35 | |
Unknown | 9 | 0.20 |
Baseline risk factors for findings at first follow-up
Using univariable analyses, we investigated the crude associations of baseline demographic, procedural, adenoma and polyp characteristics with findings at FUV1 in order to identify risk factors for adenomas, AA and CRC and to assess potential confounders of the association between interval and outcomes.
Demographic and procedural characteristics
Table 15 details the crude effect of baseline demographic and procedural characteristics on the odds of having adenomas (all types), AA or CRC found at FUV1.
Baseline factors | All IR patients | IR patients with findings at FUV1 | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
N = 4608 | Adenoma(s) (N = 1605) | AA(s) (N = 723) | CRC(s) (N = 84) | |||||||||||
n | % (n/N) | Unadjusted OR (95% CI) | p-value (LRT) | n | % (n/N) | Unadjusted OR (95% CI) | p-value (LRT) | n | % (n/N) | Unadjusted OR (95% CI) | p-value (LRT) | |||
Age (years) | < 55 | 1025 | 296 | 28.88 | 1 | < 0.0001 | 107 | 10.44 | 1 | < 0.0001 | 8 | 0.78 | 1 | < 0.0001 |
≥ 55 and < 60 | 622 | 240 | 38.59 | 1.55 (1.25 to 1.91) | 105 | 16.88 | 1.74 (1.30 to 2.33) | 2 | 0.32 | 0.41 (0.09 to 1.94) | ||||
≥ 60 and < 65 | 788 | 278 | 35.28 | 1.34 (1.10 to 1.64) | 116 | 14.72 | 1.48 (1.12 to 1.96) | 9 | 1.14 | 1.47 (0.56 to 3.82) | ||||
≥ 65 and < 70 | 813 | 270 | 33.21 | 1.22 (1.00 to 1.49) | 118 | 14.51 | 1.46 (1.10 to 1.93) | 15 | 1.85 | 2.39 (1.01 to 5.66) | ||||
≥ 70 and < 75 | 714 | 288 | 40.34 | 1.67 (1.36 to 2.04) | 137 | 19.19 | 2.04 (1.55 to 2.68) | 16 | 2.24 | 2.91 (1.24 to 6.85) | ||||
≥ 75 and < 80 | 413 | 148 | 35.84 | 1.38 (1.08 to 1.75) | 87 | 21.07 | 2.29 (1.68 to 3.12) | 21 | 5.08 | 6.81 (2.99 to 15.5) | ||||
≥ 80 | 233 | 85 | 36.48 | 1.41 (1.05 to 1.91) | 53 | 22.75 | 2.53 (1.75 to 3.64) | 13 | 5.58 | 7.51 (3.08 to 18.34) | ||||
Gender | Male | 2551 | 960 | 37.63 | 1 | < 0.0001 | 398 | 15.60 | 1.00 | 0.8543 | 53 | 2.08 | 1 | 0.147 |
Female | 2057 | 645 | 31.36 | 0.76 (0.67 to 0.86) | 325 | 15.80 | 1.02 (0.87 to 1.19) | 31 | 1.51 | 0.72 (0.46 to 1.13) | ||||
Family history of cancer | No | 4368 | 1535 | 35.14 | 1 | 0.0552 | 695 | 15.91 | 1 | 0.0678 | 81 | 1.85 | 1 | 0.4716 |
Yes | 240 | 70 | 29.17 | 0.76 (0.57 to 1.01) | 28 | 11.67 | 0.70 (0.47 to 1.04) | 3 | 1.25 | 0.67 (0.21 to 2.14) | ||||
Year of baseline | 1985–9 | 98 | 25 | 25.51 | 0.63 (0.40 to 1.00) | 0.0004 | 16 | 16.33 | 1.06 (0.61 to 1.83) | 0.9580 | 0 | 0.00 | n/a | 0.1283 |
1990–4 | 241 | 67 | 27.80 | 0.71 (0.53 to 0.95) | 42 | 17.43 | 1.14 (0.80 to 1.62) | 8 | 3.32 | 2.24 (1.03 to 4.88) | ||||
1995–9 | 1030 | 333 | 32.33 | 0.88 (0.75 to 1.02) | 162 | 15.73 | 1.01 (0.83 to 1.24) | 25 | 2.43 | 1.62 (0.97 to 2.72) | ||||
2000–4 | 2317 | 818 | 35.30 | 1 | 361 | 15.58 | 1 | 35 | 1.51 | 1 | ||||
2005–10 | 922 | 362 | 39.26 | 1.18 (1.01 to 1.39) | 142 | 15.40 | 0.99 (0.80 to 1.22) | 16 | 1.74 | 1.15 (0.63 to 2.09) | ||||
Length of baseline visit | 1 day | 2496 | 877 | 35.14 | 1 | 0.0037 | 342 | 13.70 | 1 | < 0.0001 | 37 | 1.48 | 1 | 0.3993 |
2–30 days | 246 | 68 | 27.64 | 0.71 (0.53 to 0.94) | 33 | 13.41 | 0.98 (0.66 to 1.43) | 7 | 2.85 | 1.95 (0.86 to 4.41) | ||||
1–3 months | 664 | 245 | 36.90 | 1.08 (0.90 to 1.29) | 111 | 16.72 | 1.26 (1.00 to 1.60) | 12 | 1.81 | 1.22 (0.63 to 2.36) | ||||
3–6 months | 595 | 184 | 30.92 | 0.83 (0.68 to 1.00) | 99 | 16.64 | 1.26 (0.98 to 1.60) | 15 | 2.52 | 1.72 (0.94 to 3.15) | ||||
6 to 12 months | 508 | 185 | 36.42 | 1.06 (0.87 to 1.29) | 107 | 21.06 | 1.68 (1.32 to 2.14) | 10 | 1.97 | 1.33 (0.66 to 2.7) | ||||
≥ 12 months | 99 | 46 | 46.46 | 1.60 (1.07 to 2.40) | 31 | 31.31 | 2.87 (1.85 to 4.46) | 3 | 3.03 | 2.08 (0.63 to 6.85) | ||||
Number of examinations in baseline visit | 1 | 2489 | 877 | 35.24 | 1 | 0.0228 | 342 | 13.74 | 1 | < 0.0001 | 37 | 1.49 | 1 | 0.1245 |
2 | 1518 | 499 | 32.87 | 0.90 (0.79 to 1.03) | 231 | 15.22 | 1.13 (0.94 to 1.35) | 29 | 1.91 | 1.29 (0.79 to 2.11) | ||||
3 | 392 | 138 | 35.20 | 1.00 (0.80 to 1.25) | 84 | 21.43 | 1.71 (1.31 to 2.24) | 11 | 2.81 | 1.91 (0.97 to 3.78) | ||||
4+ | 209 | 91 | 43.54 | 1.42 (1.07 to 1.89) | 66 | 31.58 | 2.90 (2.12 to 3.96) | 7 | 3.35 | 2.30 (1.01 to 5.22) | ||||
Most complete colonoscopy | Complete | 2973 | 966 | 32.49 | 1 | < 0.0001 | 388 | 13.05 | 1 | < 0.0001 | 41 | 1.38 | 1 | < 0.0001 |
Unknown | 1157 | 481 | 41.57 | 1.48 (1.29 to 1.70) | 244 | 21.09 | 1.78 (1.49 to 2.13) | 16 | 1.38 | 1.00 (0.56 to 1.79) | ||||
Incomplete | 478 | 158 | 33.05 | 1.03 (0.84 to 1.26) | 91 | 19.04 | 1.57 (1.22 to 2.02) | 27 | 5.65 | 4.28 (2.61 to 7.03) | ||||
Best bowel preparation at colonoscopy | Excellent/good | 1401 | 483 | 34.48 | 1 | 0.4337 | 181 | 12.92 | 1 | 0.0032 | 26 | 1.86 | 1 | 0.0229 |
Satisfactory | 487 | 179 | 36.76 | 1.10 (0.89 to 1.37) | 73 | 14.99 | 1.19 (0.89 to 1.59) | 6 | 1.23 | 0.66 (0.27 to 1.61) | ||||
Poor | 194 | 76 | 39.18 | 1.22 (0.90 to 1.67) | 36 | 18.56 | 1.54 (1.04 to 2.28) | 10 | 5.15 | 2.87 (1.36 to 6.06) | ||||
Unknown | 2526 | 867 | 34.32 | 0.99 (0.87 to 1.14) | 433 | 17.14 | 1.39 (1.16 to 1.68) | 42 | 1.66 | 0.89 (0.55 to 1.46) | ||||
Difficult examination | No | 4387 | 1551 | 35.35 | 1 | 0.0006 | 690 | 15.73 | 1 | 0.7493 | 73 | 1.66 | 1 | 0.0027 |
Yes | 221 | 54 | 24.43 | 0.59 (0.43 to 0.81) | 33 | 14.93 | 0.94 (0.64 to 1.37) | 11 | 4.98 | 3.10 (1.62 to 5.92) |
Adenomas (all types)
Patients aged ≥ 55 years were more likely to have an adenoma found at FUV1 than those aged < 55 years; however, no clear trend was seen after the age of 55 years. Women were 24% less likely to have an adenoma detected (OR 0.76, 95% CI 0.67 to 0.86). Patients with a family history of cancer had a non-significant 24% lower risk of adenoma (OR 0.76, 95% CI 0.57 to 1.01). The odds of detecting adenomas at FUV1 were greater in those with later baseline visits (p = 0.0004).
Patients with a baseline visit of longer than 12 months or with four or more baseline examinations were significantly more likely to have an adenoma detected. The association between the completeness of colonoscopy and risk of detection of one or more adenomas was difficult to interpret when no evidence was found of an association between adenoma detection and quality of bowel preparation. However, having a difficult examination at baseline – a composite measure of different aspects of examination quality including completeness and preparation – was associated with a significantly lower odds of having an adenoma detected at FUV1 (OR 0.59, 95% CI 0.43 to 0.81).
Advanced adenomas
The odds of detecting AA at FUV1 significantly increased with increasing age (p < 0.0001). There was no association with gender, or with year of the baseline visit.
There was a tendency for the AA detection rate to increase with increasing number of baseline examinations or a longer duration of the baseline visit, with patients whose baseline visit was 12 months or longer or who had four or more examinations having an almost threefold increased odds (OR 2.87, 95% CI 1.85 to 4.46, and OR 2.9, 95% CI 2.12 to 3.96, respectively). The odds of detecting AA were 57% greater among those with only an incomplete baseline colonoscopy (OR 1.57, 95% CI 1.22 to 2.02) and 78% greater in patients with a colonoscopy of unknown completeness (OR 1.78, 95% CI 1.49 to 2.13). Bowel preparation quality was also predictive of having AA at FUV1.
Colorectal cancers
Only 84 CRCs were detected at FUV1; therefore, although significant associations with baseline risk factors were seen, estimates were imprecise and CIs were wide. There was a strong relationship between increasing age and CRC at FUV1, with a more than sixfold greater odds in patients aged ≥ 75 years (OR 6.81, 95% CI 2.99 to 15.50, for those aged 75–80 years and OR 7.51, 95% CI 3.08 to 18.34, for those aged ≥ 80 years). No significant associations were found between gender, family history of cancer, year of baseline, length of baseline or number of examinations in baseline.
There was strong evidence of an association between having an incomplete colonoscopy or poor bowel preparation or a difficult examination at baseline and increased odds of detecting CRC at FUV1, with odds increased by three- to fourfold.
Adenoma and polyp characteristics
Table 16 describes the crude relationship between characteristics of adenomas and polyps detected at baseline and adenomas, AA and CRC at FUV1.
Baseline factors | Number of IR patients (N = 4608) | IR patients with findings at first follow-up | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Adenoma(s) | AA(s) | CRC(s) | ||||||||||||
n | % (n/N) | Unadjusted OR (95% CI) | p-value (LRT) | n | % (n/N) | Unadjusted OR (95% CI) | p-value (LRT) | n | % (n/N) | Unadjusted OR (95% CI) | p-value (LRT) | |||
Adenoma characteristics | ||||||||||||||
Number | 1 | 3107 | 972 | 31.28 | 1 | < 0.0001 | 470 | 15.13 | 1 | 0.0068 | 62 | 2.00 | 1 | 0.1166 |
2 | 1151 | 475 | 41.27 | 1.54 (1.34 to 1.77) | 211 | 18.33 | 1.26 (1.05 to 1.51) | 21 | 1.82 | 0.91 (0.55 to 1.50) | ||||
3 | 240 | 104 | 43.33 | 1.68 (1.29 to 2.19) | 25 | 10.42 | 0.65 (0.43 to 1.00) | 1 | 0.42 | 0.21 (0.03 to 1.49) | ||||
4 | 110 | 54 | 49.09 | 2.12 (1.45 to 3.10) | 17 | 15.45 | 1.03 (0.61 to 1.74) | 0 | 0.00 | n/a | ||||
Largest size (mm) | < 10 | 350 | 158 | 45.14 | 1 | < 0.0001 | 42 | 12.00 | 1 | < 0.0001 | 1 | 0.29 | 1 | 0.0361 |
10–14 | 1577 | 507 | 32.15 | 0.58 (0.45 to 0.73) | 190 | 12.05 | 1.00 (0.70 to 1.43) | 29 | 1.84 | 6.54 (0.89 to 48.16) | ||||
15–19 | 953 | 300 | 31.48 | 0.56 (0.43 to 0.72) | 122 | 12.80 | 1.08 (0.74 to 1.57) | 16 | 1.68 | 5.96 (0.79 to 45.10) | ||||
≥ 20 | 1728 | 640 | 37.04 | 0.71 (0.57 to 0.90) | 369 | 21.35 | 1.99 (1.41 to 2.80) | 38 | 2.20 | 7.85 (1.07 to 57.34) | ||||
Worst histology | Tubular | 1723 | 548 | 31.80 | 1 | < 0.0001 | 171 | 9.92 | 1 | < 0.0001 | 18 | 1.04 | 1 | 0.0004 |
Tubulovillous | 2136 | 738 | 34.55 | 1.13 (0.99 to 1.30) | 374 | 17.51 | 1.93 (1.59 to 2.34) | 39 | 1.83 | 1.76 (1.00 to 3.09) | ||||
Villous | 459 | 173 | 37.69 | 1.30 (1.05 to 1.61) | 115 | 25.05 | 3.03 (2.33 to 3.95) | 19 | 4.14 | 4.09 (2.13 to 7.86) | ||||
Unknown | 290 | 146 | 50.34 | 2.17 (1.69 to 2.80) | 63 | 21.72 | 2.52 (1.83 to 3.47) | 8 | 2.76 | 2.69 (1.16 to 6.24) | ||||
Worst dysplasia | Low grade | 3427 | 1144 | 33.38 | 1 | < 0.0001 | 483 | 14.09 | 1 | < 0.0001 | 51 | 1.49 | 1 | 0.0144 |
High grade | 850 | 297 | 34.94 | 1.07 (0.92 to 1.26) | 162 | 19.06 | 1.44 (1.18 to 1.75) | 26 | 3.06 | 2.09 (1.29 to 3.37) | ||||
Unknown | 331 | 164 | 49.55 | 1.96 (1.56 to 2.46) | 78 | 23.56 | 1.88 (1.43 to 2.47) | 7 | 2.11 | 1.43 (0.64 to 3.18) | ||||
Location | Distal only | 3070 | 1005 | 32.74 | 1 | < 0.0001 | 486 | 15.83 | 1 | 0.6382 | 60 | 1.95 | 1 | 0.9786 |
Proximal only | 681 | 256 | 37.59 | 1.24 (1.04 to 1.47) | 108 | 15.86 | 1.00 (0.80 to 1.26) | 13 | 1.91 | 0.98 (0.53 to 1.79) | ||||
Distal and proximal | 601 | 262 | 43.59 | 1.59 (1.33 to 1.90) | 96 | 15.97 | 1.01 (0.80 to 1.28) | 11 | 1.83 | 0.94 (0.49 to 1.79) | ||||
Unknown | 256 | 82 | 32.03 | 0.97 (0.74 to 1.27) | 33 | 12.89 | 0.79 (0.54 to 1.15) | 0 | 0.00 | n/a | ||||
Distal | No | 937 | 338 | 36.07 | 1 | 0.3723 | 141 | 15.05 | 1 | 0.5432 | 13 | 1.39 | 1 | 0.2488 |
Yes | 3671 | 1267 | 34.51 | 0.93 (0.80 to 1.08) | 582 | 15.85 | 1.06 (0.87 to 1.3) | 71 | 1.93 | 1.40 (0.77 to 2.54) | ||||
Proximal | No | 3326 | 1087 | 32.68 | 1 | < 0.0001 | 519 | 15.60 | 1 | 0.7968 | 60 | 1.80 | 1 | 0.8773 |
Yes | 1282 | 518 | 40.41 | 1.40 (1.22 to 1.6) | 204 | 15.91 | 1.02 (0.86 to 1.22) | 24 | 1.87 | 1.04 (0.64 to 1.67) | ||||
Number of sightings of a single adenoma | 1 | 3311 | 1102 | 33.28 | 1 | 0.0001 | 433 | 13.08 | 1 | < 0.0001 | 49 | 1.48 | 1 | 0.0412 |
2 | 1005 | 381 | 37.91 | 1.22 (1.06 to 1.42) | 196 | 19.50 | 1.61 (1.34 to 1.94) | 23 | 2.29 | 1.56 (0.95 to 2.57) | ||||
3 | 182 | 65 | 35.71 | 1.11 (0.82 to 1.52) | 50 | 27.47 | 2.52 (1.79 to 3.54) | 8 | 4.40 | 3.06 (1.43 to 6.56) | ||||
4 | 63 | 29 | 46.03 | 1.71 (1.04 to 2.82) | 21 | 33.33 | 3.32 (1.95 to 5.67) | 2 | 3.17 | 2.18 (0.52 to 9.18) | ||||
5+ | 47 | 28 | 59.57 | 2.95 (1.64 to 5.31) | 23 | 48.94 | 6.37 (3.56 to 11.39) | 2 | 4.26 | 2.96 (0.7 to 12.54) | ||||
Polyp characteristics | ||||||||||||||
Number of hyperplastic polyps | 0 | 3743 | 1297 | 34.65 | 1 | 0.0823 | 609 | 16.27 | 1 | 0.1341 | 68 | 1.82 | 1 | 0.9038 |
1 | 541 | 190 | 35.12 | 1.02 (0.85 to 1.23) | 69 | 12.75 | 0.75 (0.58 to 0.98) | 11 | 2.03 | 1.12 (0.59 to 2.13) | ||||
2 | 159 | 62 | 38.99 | 1.21 (0.87 to 1.67) | 25 | 15.72 | 0.96 (0.62 to 1.48) | 4 | 2.52 | 1.39 (0.50 to 3.87) | ||||
3 | 64 | 16 | 25.00 | 0.63 (0.36 to 1.11) | 5 | 7.81 | 0.44 (0.17 to 1.09) | 0 | 0.00 | n/a | ||||
4 | 38 | 10 | 26.32 | 0.67 (0.33 to 1.39) | 5 | 13.16 | 0.78 (0.30 to 2.01) | 1 | 2.63 | 1.46 (0.20 to 10.8) | ||||
5+ | 63 | 30 | 47.62 | 1.71 (1.04 to 2.82) | 10 | 15.87 | 0.97 (0.49 to 1.92) | 0 | 0.00 | n/a | ||||
Any large hyperplastic polyps? | No | 4525 | 1576 | 34.83 | 1 | 0.9832 | 705 | 15.58 | 1 | 0.1471 | 83 | 1.83 | 1 | 0.6513 |
Yes | 83 | 29 | 34.94 | 1.00 (0.64 to 1.58) | 18 | 21.69 | 1.50 (0.88 to 2.54) | 1 | 1.20 | 0.65 (0.09 to 4.75) | ||||
Number of polyps with unknown histology | 0 | 3593 | 1189 | 33.09 | 1 | < 0.0001 | 567 | 15.78 | 1 | 0.1511 | 66 | 1.84 | 1 | 0.6976 |
1 | 556 | 212 | 38.13 | 1.25 (1.04 to 1.50) | 74 | 13.31 | 0.82 (0.63 to 1.06) | 11 | 1.98 | 1.08 (0.57 to 2.06) | ||||
2 | 187 | 87 | 46.52 | 1.76 (1.31 to 2.36) | 36 | 19.25 | 1.27 (0.87 to 1.85) | 1 | 0.53 | 0.29 (0.04 to 2.08) | ||||
3 | 108 | 48 | 44.44 | 1.62 (1.10 to 2.38) | 20 | 18.52 | 1.21 (0.74 to 1.99) | 2 | 1.85 | 1.01 (0.24 to 4.17) | ||||
4 | 63 | 20 | 31.75 | 0.94 (0.55 to 1.61) | 6 | 9.52 | 0.56 (0.24 to 1.31) | 2 | 3.17 | 1.75 (0.42 to 7.32) | ||||
5+ | 101 | 49 | 48.51 | 1.91 (1.28 to 2.83) | 20 | 19.80 | 1.32 (0.80 to 2.17) | 2 | 1.98 | 1.08 (0.26 to 4.47) | ||||
Location of polyps | Distal only | 2716 | 873 | 32.14 | 1 | < 0.0001 | 431 | 15.87 | 1 | 0.5608 | 50 | 1.84 | 1 | 0.8835 |
Proximal only | 515 | 199 | 38.64 | 1.33 (1.09 to 1.62) | 88 | 17.09 | 1.09 (0.85 to 1.41) | 10 | 1.94 | 1.06 (0.53 to 2.10) | ||||
Distal and proximal | 1153 | 461 | 39.98 | 1.41 (1.22 to 1.62) | 174 | 15.09 | 0.94 (0.78 to 1.14) | 24 | 2.08 | 1.13 (0.69 to 1.85) | ||||
Unknown | 224 | 72 | 32.14 | 1.00 (0.75 to 1.34) | 30 | 13.39 | 0.82 (0.55 to 1.22) | 0 | 0.00 | n/a | ||||
Distal polyp | No | 739 | 271 | 36.67 | 1 | 0.2533 | 118 | 15.97 | 1 | 0.8213 | 10 | 1.35 | 1 | 0.2792 |
Yes | 3869 | 1334 | 34.48 | 0.91 (0.77 to 1.07) | 605 | 15.64 | 0.98 (0.79 to 1.21) | 74 | 1.91 | 1.42 (0.73 to 2.76) | ||||
Proximal polyp | No | 2940 | 945 | 32.14 | 1 | < 0.0001 | 461 | 15.68 | 1 | 0.9806 | 50 | 1.70 | 1 | 0.4138 |
Yes | 1668 | 660 | 39.57 | 1.38 (1.22 to 1.57) | 262 | 15.71 | 1.00 (0.85 to 1.18) | 34 | 2.04 | 1.20 (0.77 to 1.87) |
Adenomas (all types)
Associations between adenoma detection at FUV1 and the number, size, histology and dysplasia of adenomas detected at baseline were all highly significant (p < 0.0001). Increasing number of adenomas, villous histology and small size (< 10 mm), as opposed to larger size, were associated with a greater odds of having adenomas at FUV1, whereas the association with dysplasia was difficult to interpret. Patients with both a distal and proximal adenoma at baseline had a significant 59% increased odds of having an adenoma detected at FUV1 (OR 1.59, 95% CI 1.33 to 1.90); however, this relationship was probably confounded by the number of adenomas and, when considering proximal location separately, patients with any proximal adenoma at baseline had a 40% greater odds of having an adenoma at FUV1 (OR 1.40, 95% CI 1.22 to 1.60). There was also evidence that patients who had multiple sightings of an individual adenoma during baseline were more likely to have an adenoma detected at FUV1, with a large effect size and highly significant p-value (p = 0.0001). Detection of a proximal polyp at baseline conferred a significant 38% increased odds (OR 1.38, 95% CI 1.22 to 1.57).
Advanced adenomas
There was strong evidence that detection of an adenoma of ≥ 20 mm, with villous or tubulovillous histology or with HGD at baseline, was associated with an increased odds of AA at FUV1 – villous histology had a particularly strong effect (OR 3.03, 95% CI 2.33 to 3.95). The number of adenomas was significantly associated with detection of AA (p = 0.0068), but no clear trend was discernible. Multiple sightings of an adenoma at different examinations during baseline was highly predictive and five or more sightings conferred a more than sixfold increased odds (OR 6.37, 95% CI 3.56 to 11.39). Adenoma location had no effect on the likelihood of having AA at FUV1. There was no relationship between AA and any polyp-related variables.
Colorectal cancer
With only 84 CRCs detected at FUV1, CIs for associations between CRC and baseline adenoma and polyp characteristics were wide; nevertheless, several significant associations were found. Villous histology and HGD at baseline were significantly associated with increased odds of CRC at FUV1: patients with a villous adenoma were four times more likely to have CRC at FUV1 than those with a tubular adenoma (OR 4.09, 95% CI 2.13 to 7.86), whereas HGD at baseline doubled the odds of CRC (OR 2.09, 95% CI 1.29 to 3.37). Larger adenoma size appeared to confer an increased odds of CRC but, despite reaching statistical significance (p = 0.0361), the imprecision of the measures of effect prevented firm conclusions from being drawn. Multiple sightings of an adenoma during baseline was significantly associated with increased odds of CRC (p = 0.0412) but adenoma location had no effect. No polyp characteristics were associated with finding CRC at FUV1.
Baseline risk factors and interval
We explored the relationship between baseline risk factors and length of the interval between baseline and FUV1 to assess whether or not any factors could be acting as confounders of the association between findings at FUV1 and interval (Tables 17 and 18).
Baseline risk factor | Category | Interval between baseline and first follow-up | Total | p-value (chi squared) | ||||||
---|---|---|---|---|---|---|---|---|---|---|
< 18 months | 2 yearsa | 3 yearsa | 4 yearsa | 5 yearsa | 6 yearsa | ≥ 6.5 years | ||||
n (%) | n (%) | n (%) | n (%) | n (%) | n (%) | n (%) | n (%) | |||
Total | 1760 (38.19) | 976 (21.18) | 1057 (22.94) | 355 (7.7) | 217 (4.71) | 123 (2.67) | 120 (2.6) | 4608 (100) | n/a | |
Age (years) | < 55 | 350 (34.15) | 202 (19.71) | 257 (25.07) | 95 (9.27) | 56 (5.46) | 32 (3.12) | 33 (3.22) | 1025 (100) | < 0.001 |
≥ 55 and < 60 | 211 (33.92) | 117 (18.81) | 164 (26.37) | 56 (9) | 36 (5.79) | 15 (2.41) | 23 (3.7) | 622 (100) | ||
≥ 60 and < 65 | 297 (37.69) | 170 (21.57) | 187 (23.73) | 56 (7.11) | 40 (5.08) | 24 (3.05) | 14 (1.78) | 788 (100) | ||
≥ 65 and < 70 | 308 (37.88) | 159 (19.56) | 196 (24.11) | 64 (7.87) | 39 (4.8) | 25 (3.08) | 22 (2.71) | 813 (100) | ||
≥ 70 and < 75 | 306 (42.86) | 165 (23.11) | 145 (20.31) | 41 (5.74) | 26 (3.64) | 13 (1.82) | 18 (2.52) | 714 (100) | ||
≥ 75 and < 80 | 185 (44.79) | 94 (22.76) | 74 (17.92) | 27 (6.54) | 16 (3.87) | 10 (2.42) | 7 (1.69) | 413 (100) | ||
≥ 80 | 103 (44.21) | 69 (29.61) | 34 (14.59) | 16 (6.87) | 4 (1.72) | 4 (1.72) | 3 (1.29) | 233 (100) | ||
Year of baseline | 1985–9 | 44 (44.90) | 13 (13.27) | 19 (19.39) | 9 (9.18) | 6 (6.12) | 3 (3.06) | 4 (4.08) | 98 (100) | < 0.001 |
1990–4 | 73 (30.29) | 55 (22.82) | 30 (12.45) | 27 (11.2) | 15 (6.22) | 10 (4.15) | 31 (12.86) | 241 (100) | ||
1995–9 | 387 (37.57) | 218 (21.17) | 192 (18.64) | 96 (9.32) | 41 (3.98) | 40 (3.88) | 56 (5.44) | 1030 (100) | ||
2000–4 | 714 (30.82) | 484 (20.89) | 673 (29.05) | 194 (8.37) | 153 (6.6) | 70 (3.02) | 29 (1.25) | 2317 (100) | ||
2005–10 | 542 (58.79) | 206 (22.34) | 143 (15.51) | 29 (3.15) | 2 (0.22) | 0 (0) | 0 (0) | 922 (100) | ||
Length of baseline visit | 1 day | 936 (37.5) | 468 (18.75) | 609 (24.4) | 205 (8.21) | 124 (4.97) | 73 (2.92) | 81 (3.25) | 2496 (100) | < 0.001 |
2–30 days | 112 (45.53) | 47 (19.11) | 51 (20.73) | 14 (5.69) | 8 (3.25) | 5 (2.03) | 9 (3.66) | 246 (100) | ||
1–3 months | 285 (42.92) | 113 (17.02) | 156 (23.49) | 40 (6.02) | 42 (6.33) | 18 (2.71) | 10 (1.51) | 664 (100) | ||
3–6 months | 251 (42.18) | 136 (22.86) | 124 (20.84) | 43 (7.23) | 20 (3.36) | 14 (2.35) | 7 (1.18) | 595 (100) | ||
6–12 months | 155 (30.51) | 174 (34.25) | 94 (18.5) | 44 (8.66) | 18 (3.54) | 11 (2.17) | 12 (2.36) | 508 (100) | ||
≥ 12 months | 21 (21.21) | 38 (38.38) | 23 (23.23) | 9 (9.09) | 5 (5.05) | 2 (2.02) | 1 (1.01) | 99 (100) | ||
Number of baseline examinations | 1 | 932 (37.44) | 467 (18.76) | 609 (24.47) | 203 (8.16) | 124 (4.98) | 73 (2.93) | 81 (3.25) | 2489 (100) | < 0.001 |
2 | 592 (39) | 313 (20.62) | 347 (22.86) | 114 (7.51) | 79 (5.2) | 41 (2.7) | 32 (2.11) | 1518 (100) | ||
3 | 167 (42.6) | 114 (29.08) | 67 (17.09) | 21 (5.36) | 9 (2.3) | 7 (1.79) | 7 (1.79) | 392 (100) | ||
4+ | 69 (33.01) | 82 (39.23) | 34 (16.27) | 17 (8.13) | 5 (2.39) | 2 (0.96) | 0 (0) | 209 (100) | ||
Most complete colonoscopy | Complete | 1254 (42.18) | 618 (20.79) | 715 (24.05) | 180 (6.05) | 118 (3.97) | 55 (1.85) | 33 (1.11) | 2973 (100) | < 0.001 |
Unknown | 318 (27.48) | 237 (20.48) | 257 (22.21) | 148 (12.79) | 76 (6.57) | 50 (4.32) | 71 (6.14) | 1157 (100) | ||
Incomplete | 188 (39.33) | 121 (25.31) | 85 (17.78) | 27 (5.65) | 23 (4.81) | 18 (3.77) | 16 (3.35) | 478 (100) | ||
Best bowel preparation at baseline colonoscopy | Excellent/good | 551 (39.33) | 292 (20.84) | 317 (22.63) | 122 (8.71) | 67 (4.78) | 43 (3.07) | 9 (0.64) | 1401 (100) | < 0.001 |
Satisfactory | 206 (42.3) | 114 (23.41) | 105 (21.56) | 26 (5.34) | 23 (4.72) | 9 (1.85) | 4 (0.82) | 487 (100) | ||
Poor | 85 (43.81) | 52 (26.8) | 38 (19.59) | 10 (5.15) | 6 (3.09) | 1 (0.52) | 2 (1.03) | 194 (100) | ||
Unknown | 918 (36.34) | 518 (20.51) | 597 (23.63) | 197 (7.8) | 121 (4.79) | 70 (2.77) | 105 (4.16) | 2526 (100) |
Baseline risk factor | Category | Interval between baseline and first follow-up | Total | p-value (chi squared) | ||||||
---|---|---|---|---|---|---|---|---|---|---|
< 18 months | 2 yearsa | 3 yearsa | 4 yearsa | 5 yearsa | 6 yearsa | ≥ 6.5 years | ||||
n (%) | n (%) | n (%) | n (%) | n (%) | n (%) | n (%) | n (%) | |||
Total | 1760 (38.19) | 976 (21.18) | 1057 (22.94) | 355 (7.7) | 217 (4.71) | 123 (2.67) | 120 (2.6) | 4608 (100) | n/a | |
Adenoma characteristics | ||||||||||
Number | 1 | 1158 (37.27) | 663 (21.34) | 698 (22.47) | 251 (8.08) | 164 (5.28) | 84 (2.7) | 89 (2.86) | 3107 (100) | 0.006 |
2 | 488 (42.4) | 241 (20.94) | 256 (22.24) | 70 (6.08) | 37 (3.21) | 33 (2.87) | 26 (2.26) | 1151 (100) | ||
3 | 77 (32.08) | 52 (21.67) | 67 (27.92) | 24 (10) | 12 (5) | 4 (1.67) | 4 (1.67) | 240 (100) | ||
4 | 37 (33.64) | 20 (18.18) | 36 (32.73) | 10 (9.09) | 4 (3.64) | 2 (1.82) | 1 (0.91) | 110 (100) | ||
Largest size (mm) | < 10 | 114 (32.57) | 72 (20.57) | 103 (29.43) | 34 (9.71) | 16 (4.57) | 6 (1.71) | 5 (1.43) | 350 (100) | < 0.001 |
10–14 | 548 (34.75) | 305 (19.34) | 404 (25.62) | 127 (8.05) | 90 (5.71) | 50 (3.17) | 53 (3.36) | 1577 (100) | ||
15–19 | 348 (36.52) | 191 (20.04) | 223 (23.4) | 74 (7.76) | 57 (5.98) | 31 (3.25) | 29 (3.04) | 953 (100) | ||
≥ 20 | 750 (43.4) | 408 (23.61) | 327 (18.92) | 120 (6.94) | 54 (3.13) | 36 (2.08) | 33 (1.91) | 1728 (100) | ||
Worst histology | Tubular | 621 (36.04) | 350 (20.31) | 451 (26.18) | 132 (7.66) | 91 (5.28) | 39 (2.26) | 39 (2.26) | 1723 (100) | < 0.001 |
Tubulovillous | 828 (38.76) | 452 (21.16) | 477 (22.33) | 171 (8.01) | 95 (4.45) | 59 (2.76) | 54 (2.53) | 2136 (100) | ||
Villous | 192 (41.83) | 124 (27.02) | 64 (13.94) | 36 (7.84) | 15 (3.27) | 15 (3.27) | 13 (2.83) | 459 (100) | ||
Unknown | 119 (41.03) | 50 (17.24) | 65 (22.41) | 16 (5.52) | 16 (5.52) | 10 (3.45) | 14 (4.83) | 290 (100) | ||
Worst dysplasia | Low grade | 1252 (36.53) | 717 (20.92) | 842 (24.57) | 269 (7.85) | 168 (4.9) | 100 (2.92) | 79 (2.31) | 3427 (100) | < 0.001 |
High grade | 394 (46.35) | 192 (22.59) | 161 (18.94) | 46 (5.41) | 25 (2.94) | 14 (1.65) | 18 (2.12) | 850 (100) | ||
Unknown | 114 (34.44) | 67 (20.24) | 54 (16.31) | 40 (12.08) | 24 (7.25) | 9 (2.72) | 23 (6.95) | 331 (100) | ||
Proximal | No | 1213 (36.47) | 715 (21.5) | 766 (23.03) | 271 (8.15) | 168 (5.05) | 95 (2.86) | 98 (2.95) | 3326 (100) | 0.001 |
Yes | 547 (42.67) | 261 (20.36) | 291 (22.7) | 84 (6.55) | 49 (3.82) | 28 (2.18) | 22 (1.72) | 1282 (100) | ||
Number of sightings of a unique adenoma | 1 | 1216 (36.73) | 652 (19.69) | 807 (24.37) | 267 (8.06) | 176 (5.32) | 94 (2.84) | 99 (2.99) | 3311 (100) | < 0.001 |
2 | 428 (42.59) | 222 (22.09) | 211 (21) | 68 (6.77) | 33 (3.28) | 24 (2.39) | 19 (1.89) | 1005 (100) | ||
3 | 87 (47.8) | 50 (27.47) | 22 (12.09) | 11 (6.04) | 7 (3.85) | 3 (1.65) | 2 (1.1) | 182 (100) | ||
4 | 19 (30.16) | 31 (49.21) | 8 (12.7) | 4 (6.35) | 0 (0) | 1 (1.59) | 0 (0) | 63 (100) | ||
5+ | 10 (21.28) | 21 (44.68) | 9 (19.15) | 5 (10.64) | 1 (2.13) | 1 (2.13) | 0 (0) | 47 (100) | ||
Polyp characteristics | ||||||||||
Number of hyperplastic polyps | 0 | 1431 (38.23) | 782 (20.89) | 869 (23.22) | 283 (7.56) | 177 (4.73) | 93 (2.48) | 108 (2.89) | 3743 (100) | 0.010 |
1 | 192 (35.49) | 118 (21.81) | 139 (25.69) | 39 (7.21) | 19 (3.51) | 23 (4.25) | 11 (2.03) | 541 (100) | ||
2 | 64 (40.25) | 41 (25.79) | 22 (13.84) | 17 (10.69) | 9 (5.66) | 5 (3.14) | 1 (0.63) | 159 (100) | ||
3 | 24 (37.5) | 19 (29.69) | 9 (14.06) | 4 (6.25) | 6 (9.38) | 2 (3.13) | 0 (0) | 64 (100) | ||
4 | 17 (44.74) | 5 (13.16) | 7 (18.42) | 7 (18.42) | 2 (5.26) | 0 (0) | 0 (0) | 38 (100) | ||
5+ | 32 (50.79) | 11 (17.46) | 11 (17.46) | 5 (7.94) | 4 (6.35) | 0 (0) | 0 (0) | 63 (100) | ||
Location of polyps | Distal only | 1006 (37.04) | 591 (21.76) | 616 (22.68) | 215 (7.92) | 132 (4.86) | 79 (2.91) | 77 (2.84) | 2716 (100) | < 0.001 |
Proximal only | 226 (43.88) | 93 (18.06) | 119 (23.11) | 37 (7.18) | 22 (4.27) | 12 (2.33) | 6 (1.17) | 515 (100) | ||
Distal and proximal | 483 (41.89) | 247 (21.42) | 269 (23.33) | 72 (6.24) | 40 (3.47) | 22 (1.91) | 20 (1.73) | 1153 (100) | ||
Unknown | 45 (20.09) | 45 (20.09) | 53 (23.66) | 31 (13.84) | 23 (10.27) | 10 (4.46) | 17 (7.59) | 224 (100) | ||
Proximal polyp | No | 1051 (35.75) | 636 (21.63) | 669 (22.76) | 246 (8.37) | 155 (5.27) | 89 (3.03) | 94 (3.2) | 2940 (100) | < 0.001 |
Yes | 709 (42.51) | 340 (20.38) | 388 (23.26) | 109 (6.53) | 62 (3.72) | 34 (2.04) | 26 (1.56) | 1668 (100) |
All factors were highly significantly associated with interval at the 1% level except for gender (p = 0.462), family history of cancer (p = 0.067), a difficult examination (p = 0.150), large hyperplastic polyps (p = 0.645), number of polyps with unknown histology (p = 0.586), distal adenomas (p = 0.353) and distal polyps (p = 0.105). Results for non-significant factors are not presented here.
Patients of an older age, with an incomplete colonoscopy, poor bowel preparation, a large adenoma (≥ 20 mm), an adenoma with villous histology or HGD, a proximal adenoma or polyp, or multiple sightings of a unique adenoma at baseline tended to have a shorter interval. As all of these features were also associated with increased odds of finding an adenoma, AA or CRC at FUV1, they could potentially be confounding the association between findings at FUV1 and interval.
Effects of interval on findings at follow-up visit 1
Univariable analysis
The effect of interval on findings at the first follow-up was examined using univariable and multivariable analyses. Tables 19–21 show the crude and adjusted associations between interval and adenomas (advanced and non-advanced), AA and CRC at the first follow-up. The univariable analysis provided no evidence of an association between adenomas and interval, with large p-values, small effect estimates close to 1, and 95% CIs that included 1. Similarly, no relationship was observed between AA and interval. For CRC, there was evidence of an association with interval: with interval modelled as a categorical variable, there was evidence of a dose–response effect with a more than threefold increased odds of CRC with an interval of 6.5 years or longer (OR 3.14, 95% CI 1.28 to 7.72), and with interval modelled as a continuous variable, a 13% increased odds of CRC for every year increase in interval (OR 1.13, 95% CI 1.03 to 1.25).
Baseline risk factor | Category | Univariable analysis: adenoma (all types) | Multivariable analyses: adenoma (all types) | ||||
---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 4608) | Model 2 – interval as continuous (n = 4608) | ||||||
Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | ||
Interval | < 18 months | 1.00 | 0.5768 | 1.00 | 0.1848 | n/a | |
2 yearsa | 1.09 (0.93 to 1.28) | 1.05 (0.88 to 1.25) | |||||
3 yearsa | 1.01 (0.86 to 1.19) | 1.06 (0.89 to 1.25) | |||||
4 yearsa | 1.04 (0.82 to 1.32) | 1.08 (0.84 to 1.40) | |||||
5 yearsa | 1.26 (0.94 to 1.68) | 1.40 (1.03 to 1.90) | |||||
6 yearsa | 1.30 (0.89 to 1.88) | 1.42 (0.95 to 2.10) | |||||
≥ 6.5 years | 1.13 (0.77 to 1.66) | 1.47 (0.98 to 2.21) | |||||
Per year increase | 1.03 (0.99 to 1.07) | 0.11 | n/a | 1.06 (1.02 to 1.11) | 0.0024 | ||
Age (years) | < 55 | 1.00 | < 0.0001 | 1.00 | 0.0005 | 1.00 | 0.0004 |
≥ 55 and < 60 | 1.55 (1.25 to 1.91) | 1.46 (1.17 to 1.81) | 1.46 (1.17 to 1.81) | ||||
≥ 60 and < 65 | 1.34 (1.10 to 1.64) | 1.27 (1.04 to 1.57) | 1.28 (1.04 to 1.57) | ||||
≥ 65 and < 70 | 1.22 (1.00 to 1.49) | 1.10 (0.89 to 1.35) | 1.10 (0.89 to 1.35) | ||||
≥ 70 and < 75 | 1.67 (1.36 to 2.04) | 1.56 (1.27 to 1.93) | 1.57 (1.27 to 1.94) | ||||
≥ 75 and < 80 | 1.38 (1.08 to 1.75) | 1.33 (1.03 to 1.71) | 1.34 (1.04 to 1.72) | ||||
≥ 80 | 1.41 (1.05 to 1.91) | 1.25 (0.91 to 1.71) | 1.26 (0.92 to 1.72) | ||||
Gender | Male | 1.00 | < 0.0001 | 1.00 | 0.0005 | 1.00 | 0.0004 |
Female | 0.76 (0.67 to 0.86) | 0.79 (0.70 to 0.90) | 0.79 (0.70 to 0.90) | ||||
Year of baseline | Per year increase | 1.03 (1.02 to 1.05) | < 0.0001 | 1.05 (1.03 to 1.07) | < 0.0001 | 1.05 (1.03 to 1.07) | < 0.0001 |
Length of baseline visit | 1 day | 1.00 | 0.0037 | 1.00 | 0.0001 | 1.00 | 0.0001 |
2–30 days | 0.71 (0.53 to 0.94) | 0.47 (0.33 to 0.66) | 0.47 (0.33 to 0.66) | ||||
1–3 months | 1.08 (0.90 to 1.29) | 0.84 (0.67 to 1.06) | 0.84 (0.67 to 1.06) | ||||
3–6 months | 0.83 (0.68 to 1.00) | 0.67 (0.53 to 0.84) | 0.67 (0.53 to 0.84) | ||||
6–12 months | 1.06 (0.87 to 1.29) | 0.83 (0.65 to 1.06) | 0.83 (0.65 to 1.06) | ||||
≥ 12 months | 1.60 (1.07 to 2.40) | 0.78 (0.46 to 1.33) | 0.78 (0.46 to 1.32) | ||||
Most complete colonoscopy | Complete | 1.00 | < 0.0001 | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
Incomplete/unknown | 1.33 (1.18 to 1.51) | 1.64 (1.41 to 1.92) | 1.64 (1.40 to 1.91) | ||||
Difficult examination | No | 1.00 | 0.0006 | 1.00 | 0.0001 | 1.00 | 0.0001 |
Yes | 0.59 (0.43 to 0.81) | 0.53 (0.38 to 0.74) | 0.54 (0.39 to 0.75) | ||||
Number of adenomas | 1 | 1.00 | < 0.0001 | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
2 | 1.54 (1.34 to 1.77) | 1.43 (1.23 to 1.67) | 1.43 (1.23 to 1.66) | ||||
3 | 1.68 (1.29 to 2.19) | 1.57 (1.18 to 2.11) | 1.57 (1.17 to 2.10) | ||||
4 | 2.12 (1.45 to 3.10) | 1.94 (1.29 to 2.91) | 1.92 (1.28 to 2.89) | ||||
Largest adenoma (mm) | < 20 | 1.00 | 0.0151 | 1.00 | 0.0067 | 1.00 | 0.0062 |
≥ 20 | 1.17 (1.03 to 1.32) | 1.22 (1.06 to 1.4) | 1.22 (1.06 to 1.41) | ||||
Number of sightings of a unique adenoma | 1 | 1.00 | 0.0001 | 1.00 | 0.0002 | 1.00 | 0.0002 |
2 | 1.22 (1.06 to 1.42) | 1.5 (1.22 to 1.85) | 1.50 (1.22 to 1.85) | ||||
3 | 1.11 (0.82 to 1.52) | 1.29 (0.90 to 1.85) | 1.30 (0.90 to 1.86) | ||||
4 | 1.71 (1.04 to 2.82) | 1.88 (1.07 to 3.32) | 1.89 (1.07 to 3.34) | ||||
5+ | 2.95 (1.64 to 5.31) | 3.13 (1.53 to 6.39) | 3.13 (1.53 to 6.41) | ||||
Number of unknown histology polyps | 0 | 1.00 | < 0.0001 | 1.00 | 0.0065 | 1.00 | 0.0056 |
1 | 1.25 (1.04 to 1.50) | 1.17 (0.96 to 1.43) | 1.17 (0.96 to 1.43) | ||||
2 | 1.76 (1.31 to 2.36) | 1.54 (1.13 to 2.11) | 1.56 (1.14 to 2.13) | ||||
3 | 1.62 (1.10 to 2.38) | 1.35 (0.89 to 2.03) | 1.35 (0.89 to 2.03) | ||||
4 | 0.94 (0.55 to 1.61) | 0.79 (0.46 to 1.39) | 0.80 (0.46 to 1.39) | ||||
5+ | 1.91 (1.28 to 2.83) | 1.73 (1.14 to 2.63) | 1.74 (1.15 to 2.65) | ||||
Proximal polyp | No | 1.00 | < 0.0001 | 1.00 | 0.0057 | 1.00 | 0.0053 |
Yes | 1.38 (1.22 to 1.57) | 1.24 (1.06 to 1.44) | 1.24 (1.07 to 1.44) |
Baseline risk factor | Category | Univariable analysis: AA | Multivariable analyses: AA | ||||
---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 4318) | Model 2 – interval as continuous (n = 4318) | ||||||
Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | ||
Interval | < 18 months | 1.00 | 0.083 | 1.00 | 0.7658 | n/a | |
2 yearsa | 1.13 (0.92 to 1.40) | 1.05 (0.83 to 1.33) | |||||
3 yearsa | 0.93 (0.75 to 1.15) | 1.06 (0.83 to 1.34) | |||||
4 yearsa | 1.04 (0.76 to 1.43) | 1.10 (0.78 to 1.55) | |||||
5 yearsa | 1.03 (0.70 to 1.53) | 1.24 (0.81 to 1.91) | |||||
6 yearsa | 0.95 (0.57 to 1.60) | 0.90 (0.51 to 1.61) | |||||
≥ 6.5 years | 1.94 (1.26 to 2.98) | 1.52 (0.90 to 2.57) | |||||
Per year increase | 1.04 (0.99 to 1.09) | 0.1103 | n/a | 1.03 (0.98 to 1.09) | 0.2513 | ||
Age (years) | < 55 | 1.00 | < 0.0001 | 1.00 | 0.0014 | 1.00 | 0.0013 |
≥ 55 and < 60 | 1.74 (1.30 to 2.33) | 1.56 (1.14 to 2.13) | 1.57 (1.15 to 2.15) | ||||
≥ 60 and < 65 | 1.48 (1.12 to 1.96) | 1.37 (1.01 to 1.85) | 1.37 (1.01 to 1.86) | ||||
≥ 65 and < 70 | 1.46 (1.10 to 1.93) | 1.24 (0.91 to 1.68) | 1.24 (0.92 to 1.68) | ||||
≥ 70 and < 75 | 2.04 (1.55 to 2.68) | 1.74 (1.30 to 2.35) | 1.75 (1.30 to 2.35) | ||||
≥ 75 and < 80 | 2.29 (1.68 to 3.12) | 1.81 (1.28 to 2.54) | 1.81 (1.28 to 2.54) | ||||
≥ 80 | 2.53 (1.75 to 3.64) | 1.81 (1.21 to 2.70) | 1.81 (1.21 to 2.71) | ||||
Most complete colonoscopy | Complete | 1.00 | < 0.0001 | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
Incomplete/unknown | 1.72 (1.46 to 2.02) | 1.91 (1.57 to 2.32) | 1.92 (1.58 to 2.33) | ||||
Number of adenomas | 1 | 1.00 | 0.0068 | 1.00 | 0.0358 | 1.00 | 0.0377 |
2 | 1.26 (1.05 to 1.51) | 1.25 (1.02 to 1.54) | 1.25 (1.02 to 1.53) | ||||
3 | 0.65 (0.43 to 1.00) | 1.22 (0.76 to 1.96) | 1.22 (0.76 to 1.96) | ||||
4 | 1.03 (0.61 to 1.74) | 2.05 (1.14 to 3.67) | 2.05 (1.14 to 3.67) | ||||
Largest adenoma (mm) | < 20 | 1.00 | < 0.0001 | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
≥ 20 | 1.94 (1.65 to 2.27) | 1.69 (1.39 to 2.06) | 1.69 (1.39 to 2.06) | ||||
Worst adenoma histology | Tubular | 1.00 | < 0.0001 | 1.00 | 0.0016 | 1.00 | 0.0016 |
Tubulovillous | 1.93 (1.59 to 2.34) | 1.42 (1.15 to 1.76) | 1.42 (1.15 to 1.75) | ||||
Villous | 3.03 (2.33 to 3.95) | 1.59 (1.17 to 2.18) | 1.59 (1.17 to 2.17) | ||||
Proximal polyp | No | 1.00 | 0.98 | 1.00 | 0.0182 | 1.00 | 0.0188 |
Yes | 1.00 (0.85 to 1.18) | 1.28 (1.04 to 1.58) | 1.28 (1.04 to 1.58) | ||||
Maximum number of sightings of a unique adenoma | 1 | 1.00 | < 0.0001 | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
2 | 1.61 (1.34 to 1.94) | 1.37 (1.11 to 1.69) | 1.37 (1.11 to 1.69) | ||||
3 | 2.52 (1.79 to 3.54) | 1.76 (1.20 to 2.56) | 1.76 (1.20 to 2.56) | ||||
4 | 3.32 (1.95 to 5.67) | 2.49 (1.40 to 4.44) | 2.49 (1.40 to 4.43) | ||||
5+ | 6.37 (3.56 to 11.39) | 3.79 (1.97 to 7.27) | 3.79 (1.98 to 7.25) | ||||
Number of hyperplastic polyps | 0 | 1.00 | 0.1341 | 1.00 | 0.0153 | 1.00 | 0.0147 |
1 | 0.75 (0.58 to 0.98) | 0.77 (0.57 to 1.03) | 0.76 (0.57 to 1.03) | ||||
2 | 0.96 (0.62 to 1.48) | 0.99 (0.62 to 1.58) | 0.99 (0.62 to 1.59) | ||||
3 | 0.44 (0.17 to 1.09) | 0.15 (0.04 to 0.65) | 0.15 (0.04 to 0.65) | ||||
4 | 0.78 (0.30 to 2.01) | 0.61 (0.20 to 1.83) | 0.61 (0.20 to 1.83) | ||||
5 | 0.97 (0.49 to 1.92) | 1.00 (0.48 to 2.08) | 1.00 (0.48 to 2.09) | ||||
Large hyperplastic polyp | No | 1.00 | 0.1471 | 1.00 | 0.0037 | 1.00 | 0.004 |
Yes | 1.50 (0.88 to 2.54) | 2.58 (1.40 to 4.73) | 2.56 (1.39 to 4.69) |
Baseline risk factor | Category | Univariable analysis: CRC | Multivariable analyses: CRC | ||||
---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 4186) | Model 2 – interval as continuous (n = 4186) | ||||||
Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | ||
Interval | < 18 months | 1.00 | 0.0006 | 1.00 | 0.0049 | n/a | |
2 yearsa | 1.57 (0.91 to 2.69) | 1.69 (0.94 to 3.03) | |||||
3 yearsa | 0.34 (0.14 to 0.82) | 0.53 (0.22 to 1.31) | |||||
4 yearsa | 1.55 (0.73 to 3.31) | 2.46 (1.12 to 5.44) | |||||
5 yearsa | 1.12 (0.39 to 3.22) | 2.08 (0.70 to 6.18) | |||||
6 yearsa | 2.53 (0.96 to 6.65) | 3.02 (1.00 to 9.10) | |||||
≥ 6.5 years | 3.14 (1.28 to 7.72) | 4.12 (1.37 to 12.41) | |||||
Per year increase | 1.13 (1.03 to 1.25) | 0.0232 | n/a | 1.21 (1.08 to 1.37) | 0.0040 | ||
Age (years) | < 55 | 1.00 | < 0.0001 | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
≥ 55 and < 60 | 0.41 (0.09 to 1.94) | 0.20 (0.02 to 1.61) | 0.20 (0.02 to 1.57) | ||||
≥ 60 and < 65 | 1.47 (0.56 to 3.82) | 1.18 (0.44 to 3.18) | 1.16 (0.43 to 3.12) | ||||
≥ 65 and < 70 | 2.39 (1.01 to 5.66) | 1.86 (0.76 to 4.55) | 1.86 (0.76 to 4.56) | ||||
≥ 70 and < 75 | 2.91 (1.24 to 6.85) | 2.11 (0.86 to 5.19) | 2.18 (0.89 to 5.35) | ||||
≥ 75 and < 80 | 6.81 (2.99 to 15.5) | 5.92 (2.56 to 13.71) | 6.21 (2.69 to 14.35) | ||||
≥ 80 | 7.51 (3.08 to 18.34) | 4.68 (1.81 to 12.07) | 5.03 (1.96 to 12.94) | ||||
Best bowel preparation | Excellent/good/satisfactory/unknown | 1.00 | 0.0033 | 1.00 | 0.0022 | 1.00 | 0.006 |
Poor | 3.19 (1.62 to 6.27) | 3.80 (1.79 to 8.05) | 3.30 (1.54 to 7.07) | ||||
Difficult examination | No | 1.00 | 0.0027 | n/a | 1.00 | 0.0425 | |
Yes | 3.10 (1.62 to 5.92) | 2.32 (1.10 to 4.89) | |||||
Worst adenoma histology | Tubular | 1.00 | 0.0002 | 1.00 | 0.0302 | 1.00 | 0.0098 |
Tubulovillous | 1.76 (1.00 to 3.09) | 1.46 (0.82 to 2.60) | 1.52 (0.85 to 2.71) | ||||
Villous | 4.09 (2.13 to 7.86) | 2.55 (1.28 to 5.08) | 2.94 (1.48 to 5.82) | ||||
Worst adenoma dysplasia | Low grade | 1.00 | 0.0039 | 1.00 | 0.0258 | 1.00 | 0.0379 |
High grade | 2.09 (1.29 to 3.37) | 1.81 (1.09 to 3.02) | 1.74 (1.04 to 2.90) |
Multivariable analysis
To identify independent risk factors for having adenomas, AA or CRC at FUV1, and to adjust the effect of interval for potential confounding factors, multivariable logistic regression was used. Interval was first modelled as a categorical variable (model 1) and then as a continuous variable (model 2). Results of the models for adenomas (all types), AA and CRC are shown in Tables 19–21, respectively.
Adenomas (all types)
Comparison of crude and adjusted estimates for the effect of interval on adenoma findings at FUV1 showed evidence of weak negative confounding, with the effect masked slightly by that of covariates. After adjustment for covariates, the association between interval and detection of adenomas was strengthened, but there was considerable overlap between the 95% CIs (with interval as a categorical variable), most of which included 1, and statistical significance was reached only when interval was modelled as a continuous variable. The latter model showed 6% greater odds of adenomas at FUV1 per year increase in interval (OR 1.06, 95% CI 1.02 to 1.11; p = 0.0024).
A number of baseline characteristics were found to be independent risk factors for having an adenoma detected at FUV1. These included older age, male gender, later year of baseline, no complete colonoscopy and presence of multiple adenomas at baseline (all p < 0.001). Effect estimates for specific age categories should be interpreted with caution owing to their imprecision. Other risk factors included the presence of an adenoma of ≥ 20 mm or a proximal polyp, whereas odds were lower in patients with a difficult baseline examination (composite variable for an incomplete examination with poor bowel preparation and additional difficulties). Patients with a baseline visit of more than 1 day were significantly less likely to have an adenoma detected at FUV1 (p < 0.001); however, there was considerable overlap between 95% CIs, some of which included 1, which made interpretation difficult. Multiple sightings of an adenoma was a strong risk factor for the detection of adenomas, and having the same adenoma seen five or more times increased odds more than threefold (OR 3.13, 95% CI 1.53 to 6.39).
Models 1 and 2, which used interval as a categorical variable and continuous variable, respectively, were very similar and selected the same variables. Crude and adjusted estimates of effect were similar for all variables except length of baseline visit and most complete colonoscopy.
Advanced adenomas
There was little evidence of a relationship between interval and AA at FUV1, both before and after adjusting for other factors; the test statistics were non-significant and all but one 95% CI included 1, although there was a tendency towards increasing odds with increasing interval.
After adjusting for the effects of covariates, older age, no complete colonoscopy and the presence of an adenoma of ≥ 20 mm at baseline were highly predictive of AA detection at FUV1 (all p < 0.001). Other risk factors included the presence of a proximal polyp, an adenoma with villous or tubulovillous histology, a large (≥ 10 mm) hyperplastic polyp or multiple adenomas at baseline. Multiple sightings of a unique adenoma at baseline was a strong risk factor for AA at FUV1; a dose–response effect was demonstrated and five or more sightings was associated with an almost fourfold greater odds of AA (OR 3.79, 95% CI 2.0 to 7.3). The two models, examining interval as a categorical and continuous variable, were very similar and selected the same variables.
When comparing crude and adjusted estimates, the effects of age, adenoma size and histology, and number of sightings of an adenoma were exaggerated before adjustment, suggesting positive confounding by covariates in the models. There was also evidence of negative confounding, with no effect of proximal polyps, and a smaller effect of completeness of colonoscopy, number of adenomas and large hyperplastic polyps before adjustment for other factors. Number of hyperplastic polyps and presence of a large hyperplastic polyp or a proximal polyp at baseline were significantly associated with AA only after adjustment.
Colorectal cancer
A longer interval was significantly associated with increased odds of CRC detection at FUV1, both before and after adjustment, regardless of whether interval was modelled as a continuous or categorical variable. After adjustment for covariates there was 21% greater odds of finding CRC per year increase in interval (OR 1.21, 95% CI 1.08 to 1.37; p = 0.0040). There was evidence of weak negative confounding as the effect of interval became stronger after adjusting for other factors.
Independent baseline risk factors for CRC at FUV1 included older age, the detection of an adenoma with villous or tubulovillous histology or with HGD, poor bowel preparation and a difficult examination (all p < 0.05), the last of which was significant only in model 2, with interval as a continuous variable. There was evidence of positive and negative confounding; the effects of histology, dysplasia and a difficult examination on CRC were attenuated after adjustment, whereas the effect of bowel preparation was strengthened slightly.
Baseline risk factors for a short interval
Unexpectedly, little evidence of an association was found between interval and detection of adenomas or AA at FUV1, even after adjusting for a number of covariates. As a large proportion of patients returned sooner than expected for their first follow-up, crude and adjusted estimates of the effect of baseline characteristics on interval length were calculated to allow a more detailed examination of baseline predictors of a short interval. An arbitrary cut-off of 2 years from baseline was used to classify patients as having a short interval, as this was the median interval length to FUV1 in the hospital cohort. A logistic regression model was used and factors that were not significant in the model at the 95% level were not included in the final model, and were therefore not adjusted for. Table 22 shows baseline risk factors for a short interval.
Baseline predictors | Interval from baseline to first follow-up of ≤ 2 years | ||||
---|---|---|---|---|---|
Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | ||
Age (years) | < 55 | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
≥ 55 and < 60 | 1.04 (0.85 to 1.27) | 1.02 (0.82 to 1.27) | |||
≥ 60 and < 65 | 1.31 (1.09 to 1.58) | 1.19 (0.97 to 1.46) | |||
≥ 65 and < 70 | 1.18 (0.98 to 1.42) | 1.11 (0.91 to 1.36) | |||
≥ 70 and < 75 | 1.59 (1.31 to 1.93) | 1.47 (1.20 to 1.82) | |||
≥ 75 and < 80 | 1.66 (1.32 to 2.09) | 1.45 (1.13 to 1.87) | |||
≥ 80 | 2.32 (1.72 to 3.11) | 1.94 (1.40 to 2.67) | |||
Calendar year of baseline | 1-year increase | 1.06 (1.05 to 1.08) | < 0.0001 | 1.06 (1.04 to 1.08) | < 0.0001 |
Length of baseline visit | 1 day | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
2–30 days | 1.57 (1.21 to 2.05) | 1.13 (0.82 to 1.57) | |||
1–3 months | 1.30 (1.10 to 1.55) | 0.99 (0.79 to 1.23) | |||
3–6 months | 1.53 (1.28 to 1.84) | 1.05 (0.84 to 1.31) | |||
6–12 months | 1.30 (1.07 to 1.57) | 0.85 (0.67 to 1.08) | |||
≥ 12 months | 0.65 (0.43 to 0.98) | 0.26 (0.15 to 0.45) | |||
Most complete colonoscopy | Complete | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
Incomplete/unknown | 0.63 (0.55 to 0.71) | 0.71 (0.61 to 0.82) | |||
Largest adenoma (mm) | < 10 | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
10–14 | 1.09 (0.86 to 1.38) | 1.29 (0.99 to 1.67) | |||
15–19 | 1.22 (0.95 to 1.56) | 1.44 (1.09 to 1.90) | |||
≥ 20 | 1.78 (1.42 to 2.25) | 1.80 (1.37 to 2.35) | |||
Worst adenoma dysplasia | Low grade | 1.00 | < 0.0001 | 1.00 | < 0.0001 |
High grade | 1.62 (1.39 to 1.89) | 1.42 (1.20 to 1.68) | |||
Number of sightings of a unique adenoma | 1 | 1.00 | < 0.0001 | 1.00 | 0.0003 |
2 | 1.45 (1.26 to 1.67) | 1.34 (1.09 to 1.64) | |||
3 | 2.10 (1.53 to 2.87) | 1.93 (1.33 to 2.80) | |||
4 | 1.93 (1.15 to 3.24) | 2.55 (1.38 to 4.72) | |||
5+ | 0.82 (0.46 to 1.47) | 1.61 (0.77 to 3.34) | |||
Proximal polyp | No | 1.00 | < 0.0001 | 1.00 | 0.0005 |
Yes | 1.31 (1.16 to 1.48) | 1.29 (1.12 to 1.49) |
Age was significantly associated with a short interval (p < 0.0001), before and after adjustment for confounding, with a tendency towards an increasing odds of a short interval with increasing age. After adjustment, there was a 6% greater odds of a short interval per year increase in the calendar year of the baseline visit (OR 1.06, 95% CI 1.04 to 1.08) and odds also increased for patients with multiple sightings of a single adenoma (p < 0.0003). Conversely, patients without a complete colonoscopy at baseline were significantly less likely to return early, possibly as a result of the experience of a difficult examination (OR 0.71, 95% CI 0.61 to 0.82). Patients with a longer baseline visit were less likely to have a short surveillance interval; however, most 95% CIs included 1 so it was not possible to discern a real effect (p < 0.0001); these results may be affected by adjustment for multiple sightings of an adenoma, as before adjustment there was a positive association between length of baseline and a short interval. Having a large adenoma (≥ 10 mm), an adenoma with HGD or a proximal polyp were also risk factors for a short interval.
The independent predictors of a short interval were also identified as risk factors for finding an adenoma, AA or CRC at FUV1. However, adjustment for these factors made little difference to the effect estimates for interval, and did not reveal an association between interval and adenoma (only associated when interval was modelled as a continuous variable) or AA at FUV1. One possibility is that an unmeasured confounder closely linked to a factor(s) associated with the outcome and exposure may have increased the risk of a short interval and of having adenomas, AA or CRC at FUV1. This would cause an exaggerated effect of a short interval on risk, resulting in a diminished effect of interval length overall. Multiple sightings of a single adenoma at baseline was identified as a strong risk factor for a short interval and for finding an adenoma or AA at FUV1, so it is possible that this factor was acting as a proxy measure for an important, unmeasured confounder. This possibility is explored in detail in the next section.
New and previously seen lesions at first follow-up
As described in the previous section, we hypothesised that an unmeasured confounder was masking the association between interval and the detection of adenoma, AA or CRC at FUV1. It was possible that a proportion of individuals had a short interval because they were undergoing polypectomy site surveillance. Such patients would have a large adenoma, probably seen multiple times during baseline for repeated treatment, and possibly with advanced features such as HGD. Polypectomy site surveillance would be carried out to check the site of a large lesion that might not have been completely removed at baseline, rather than to check for the occurrence of newly developed lesions or lesions missed at baseline (possibly because of a poor-quality examination). The UK Adenoma Surveillance guideline16 assumes that all detected lesions are removed at baseline before surveillance begins, and includes recommendations for the treatment and surveillance of incompletely removed lesions. Such patients may require repeated treatment over a number of examinations in order to achieve complete removal and are then expected to return for a further examination(s) to check the polyp site.
We hypothesised that patients undergoing polypectomy site surveillance would be more likely not only to return for follow-up sooner, but also to have a finding detected at FUV1 – the lesion under polypectomy site surveillance. This could potentially confound the relationship between interval and detection of an adenoma or AA at FUV1. Although difficult to recognise such cases from a retrospective series, it was thought that lesions detected at FUV1 which were previously seen at baseline (i.e. the same lesion) were more likely to have been found as a result of polypectomy site surveillance. The distribution of new and previously seen outcomes by interval was examined to determine whether or not this was likely to be the case.
Tables 23–25 show a breakdown of IR patients by interval length and outcome status. Patients were stratified into four groups: (1) those with no findings at FUV1; (2) those who have only a previously seen finding; (3) those with both previously seen and new findings; and (4) those who have only a new finding. The number of patients within each stratum was then assessed to determine whether or not it was appropriate to exclude previously seen findings from the analyses.
Adenoma status | Interval from baseline to first follow-up, n (%) | Total, n (%) | ||||||
---|---|---|---|---|---|---|---|---|
< 18 months | 2 yearsa | 3 yearsa | 4 yearsa | 5 yearsa | 6 yearsa | ≥ 6.5 years | ||
None | 1165 (66.19) | 627 (64.24) | 697 (65.94) | 232 (65.35) | 132 (60.83) | 74 (60.16) | 76 (63.33) | 3003 (65.17) |
Previously seen only | 184 (10.45) | 85 (8.71) | 45 (4.26) | 13 (3.66) | 4 (1.84) | 0 (0) | 3 (2.50) | 334 (7.25) |
New and previously seen | 48 (2.73) | 25 (2.56) | 15 (1.42) | 2 (0.56) | 2 (0.92) | 2 (1.63) | 0 (0) | 94 (2.04) |
New only | 363 (20.63) | 239 (24.49) | 300 (28.38) | 108 (30.42) | 79 (36.41) | 47 (38.21) | 41 (34.17) | 1177 (25.54) |
Total | 1760 (100) | 976 (100) | 1057 (100) | 355 (100) | 217 (100) | 123 (100) | 120 (100) | 4608 (100) |
AA status | Interval from baseline to first follow-up, n (%) | Total, n (%) | ||||||
---|---|---|---|---|---|---|---|---|
< 18 months | 2 yearsa | 3 yearsa | 4 yearsa | 5 yearsa | 6 yearsa | ≥ 6.5 years | ||
None | 1492 (84.77) | 811 (83.09) | 906 (85.71) | 299 (84.23) | 183 (84.33) | 105 (85.37) | 89 (74.17) | 3885 (84.31) |
Previously seen only | 132 (7.50) | 75 (7.68) | 46 (4.35) | 11 (3.10) | 5 (2.30) | 0 (0) | 3 (2.50) | 272 (5.90) |
New and previously seen | 21 (1.19) | 9 (0.92) | 2 (0.19) | 1 (0.28) | 1 (0.46) | 2 (1.63) | 0 (0) | 36 (0.78) |
New only | 115 (6.53) | 81 (8.30) | 103 (9.74) | 44 (12.39) | 28 (12.9) | 16 (13.01) | 28 (23.33) | 415 (9.01) |
Total | 1760 (100) | 976 (100) | 1057 (100) | 355 (100) | 217 (100) | 123 (100) | 120 (100) | 4608 (100) |
CRC status | Interval from baseline to first follow-up, n (%) | Total, n (%) | ||||||
---|---|---|---|---|---|---|---|---|
< 18 months | 2 yearsa | 3 yearsa | 4 yearsa | 5 yearsa | 6 yearsa | ≥ 6.5 years | ||
None | 1731 (98.35) | 951 (97.44) | 1051 (99.43) | 346 (97.46) | 213 (98.16) | 118 (95.93) | 114 (95.00) | 4524 (98.18) |
Previously seen only | 14 (0.80) | 16 (1.64) | 1 (0.09) | 0 (0) | 0 (0) | 1 (0.81) | 0 (0) | 32 (0.69) |
New and previously seen | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 1 (0.81) | 0 (0) | 1 (0.02) |
New only | 15 (0.85) | 9 (0.92) | 5 (0.47) | 9 (2.54) | 4 (1.84) | 3 (2.44) | 6 (5) | 51 (1.11) |
Total | 1760 (100) | 976 (100) | 1057 (100) | 355 (100) | 217 (100) | 123 (100) | 120 (100) | 4608 (100) |
After stratifying patients by interval and outcome status, increasing interval length was associated with increased detection of new findings. Patients with a shorter interval had a greater proportion of previously seen lesions detected than those with a longer interval. No such trend was seen among the ‘new and previously seen’ findings group, although there were only a small number of patients in this group: 2% with adenomas, 1% with AA and < 1% with CRC.
Previously seen lesions detected at the first follow-up were most likely to represent lesions undergoing polypectomy site surveillance when found in patients with a short interval between baseline and follow-up. As interval length increased, it became less certain whether or not this was the case. Logistic regression was performed using any findings (see Tables 19–21) and then using only new findings at FUV1 (Tables 26–30), having removed all previously seen findings.
All previously seen lesions were removed regardless of the interval length, rather than just those detected in patients with a short interval to FUV1, in order to avoid the introduction of bias into the data set. If only previously seen lesions in patients with short interval were removed from the analysis then this could artificially increase the odds of an outcome among patients with a longer surveillance interval, which would overestimate the effect of interval.
Effect of interval on new findings at first follow-up
After removal of previously seen lesions to adjust for the confounding effect of polypectomy site surveillance, the association of interval with new findings at the first follow-up was examined using univariable and multivariable analyses.
Univariable analysis
Table 26 shows the crude association between interval and new findings (adenomas, AA and CRC) at the first follow-up – the effect of interval length was stronger than in the univariable analysis of all findings (new and previously seen lesions) (compare Table 26 and Table 27). There was strong evidence of an association between interval length and adenomas, AA and CRC at FUV1 (p < 0.0005), with an apparent dose–response effect on all outcomes (see Table 26). When all findings (both new and previously seen) at FUV1 were analysed, interval was associated with CRC only at FUV1. This suggests that, as predicted, lesions undergoing polypectomy site surveillance were masking the association between interval and findings at FUV1. Although CIs for new outcomes overlap somewhat, they rarely include 1, suggesting that there is a true association in the population.
Interval from baseline to first follow-up | New findings at first follow-up | |||||
---|---|---|---|---|---|---|
Adenoma | AA | CRC | ||||
Unadjusted OR (95% CI) | p-value (LRT) | Unadjusted OR (95% CI) | p-value (LRT) | Unadjusted OR (95% CI) | p-value (LRT) | |
< 18 months | 1 | < 0.0001 | 1 | < 0.0001 | 1 | 0.0004 |
2 yearsa | 1.22 (1.02 to 1.46) | 1.21 (0.92 to 1.60) | 1.08 (0.47 to 2.48) | |||
3 yearsa | 1.39 (1.17 to 1.65) | 1.32 (1.01 to 1.72) | 0.55 (0.20 to 1.53) | |||
4 yearsa | 1.47 (1.15 to 1.89) | 1.73 (1.21 to 2.48) | 3.03 (1.31 to 6.97) | |||
5 yearsa | 1.95 (1.45 to 2.63) | 1.84 (1.20 to 2.83) | 2.18 (0.72 to 6.64) | |||
6 yearsa | 2.17 (1.49 to 3.17) | 2.05 (1.21 to 3.48) | 3.91 (1.28 to 11.97) | |||
≥ 6.5 years | 1.70 (1.15 to 2.52) | 3.63 (2.30 to 5.74) | 6.12 (2.33 to 16.08) | |||
Interval (per year increase) | 1.12 (1.08 to 1.16) | < 0.0001 | 1.16 (1.11 to 1.22) | < 0.0001 | 1.27 (1.16 to 1.40) | < 0.0001 |
Interval from baseline to first follow-up | Findings at FUV1 | |||||
---|---|---|---|---|---|---|
Adenoma | AA | CRC | ||||
Unadjusted OR (95% CI) | p-value (LRT) | Unadjusted OR (95% CI) | p-value (LRT) | Unadjusted OR (95% CI) | p-value (LRT) | |
< 18 months | 1 | 0.5768 | 1 | 0.0830 | 1 | 0.0006 |
2 yearsa | 1.09 (0.93 to 1.28) | 1.13 (0.92 to 1.40) | 1.57 (0.91 to 2.69) | |||
3 yearsa | 1.01 (0.86 to 1.19) | 0.93 (0.75 to 1.15) | 0.34 (0.14 to 0.82) | |||
4 yearsa | 1.04 (0.82 to 1.32) | 1.04 (0.76 to 1.43) | 1.55 (0.73 to 3.31) | |||
5 yearsa | 1.26 (0.94 to 1.68) | 1.03 (0.70 to 1.53) | 1.12 (0.39 to 3.22) | |||
6 yearsa | 1.30 (0.89 to 1.88) | 0.95 (0.57 to 1.60) | 2.53 (0.96 to 6.65) | |||
≥ 6.5 years | 1.13 (0.77 to 1.66) | 1.94 (1.26 to 2.98) | 3.14 (1.28 to 7.72) | |||
Interval (per year increase) | 1.03 (0.99 to 1.07) | 0.1117 | 1.04 (0.99 to 1.09) | 0.1103 | 1.13 (1.03 to 1.25) | 0.0232 |
Multivariable analysis
Logistic regression was used to identify independent risk factors for having new adenomas, AA or CRC at FUV1, and to adjust the effect of interval for potential confounding factors. Interval was first modelled as a categorical variable (model 1) and then as a continuous variable (model 2). Results of the models for new adenomas (advanced and non-advanced), AA and CRC are shown in Tables 28–30, respectively.
Baseline risk factor | Category | Number of IR patients (N = 4608) | Univariable analysis: new adenoma (all types) | Multivariable analyses: new adenoma (all types) | |||||
---|---|---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 4608) | Model 2 – interval as continuous (n = 4608) | ||||||||
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | |||
Interval | < 18 months | 1760 | 411 (23.35) | 1 | < 0.0001 | 1 | < 0.0001 | n/a | |
2 yearsa | 976 | 264 (27.05) | 1.22 (1.02 to 1.46) | 1.26 (1.04 to 1.52) | |||||
3 yearsa | 1057 | 315 (29.80) | 1.39 (1.17 to 1.65) | 1.43 (1.19 to 1.71) | |||||
4 yearsa | 355 | 110 (30.99) | 1.47 (1.15 to 1.89) | 1.58 (1.21 to 2.06) | |||||
5 yearsa | 217 | 81 (37.33) | 1.95 (1.45 to 2.63) | 2.12 (1.54 to 2.91) | |||||
6 yearsa | 123 | 49 (39.84) | 2.17 (1.49 to 3.17) | 2.46 (1.65 to 3.68) | |||||
≥ 6.5 years | 120 | 41 (34.17) | 1.70 (1.15 to 2.52) | 2.27 (1.49 to 3.46) | |||||
Per year increase | n/a | n/a | 1.12 (1.08 to 1.16) | < 0.0001 | n/a | 1.16 (1.11 to 1.21) | < 0.0001 | ||
Age (years) | < 55 | 1025 | 243 (23.71) | 1 | 0.0002 | 1 | 0.0009 | 1 | 0.0009 |
≥ 55 and < 60 | 622 | 205 (32.96) | 1.58 (1.27 to 1.97) | 1.51 (1.20 to 1.90) | 1.51 (1.20 to 1.90) | ||||
≥ 60 and < 65 | 788 | 230 (29.19) | 1.33 (1.07 to 1.64) | 1.30 (1.05 to 1.63) | 1.31 (1.05 to 1.63) | ||||
≥ 65 and < 70 | 813 | 212 (26.08) | 1.14 (0.92 to 1.40) | 1.06 (0.85 to 1.33) | 1.06 (0.85 to 1.32) | ||||
≥ 70 and < 75 | 714 | 219 (30.67) | 1.42 (1.15 to 1.76) | 1.47 (1.17 to 1.84) | 1.46 (1.16 to 1.83) | ||||
≥ 75 and < 80 | 413 | 111 (26.88) | 1.18 (0.91 to 1.54) | 1.27 (0.97 to 1.68) | 1.27 (0.97 to 1.67) | ||||
≥ 80 | 233 | 51 (21.89) | 0.90 (0.64 to 1.27) | 0.97 (0.68 to 1.39) | 0.97 (0.68 to 1.39) | ||||
Gender | Male | 2551 | 794 (31.13) | 1 | < 0.0001 | 1 | < 0.0001 | 1 | < 0.0001 |
Female | 2057 | 477 (23.19) | 0.67 (0.59 to 0.76) | 0.71 (0.61 to 0.81) | 0.70 (0.61 to 0.81) | ||||
Year of baseline | Per year increase | n/a | n/a | 1.03 (1.02 to 1.05) | 0.0001 | 1.06 (1.04 to 1.08) | < 0.0001 | 1.06 (1.04 to 1.08) | < 0.0001 |
Length of baseline visit | 1 day | 2496 | 745 (29.85) | 1 | 0.0002 | 1 | 0.0029 | 1 | 0.0055 |
2–30 days | 246 | 55 (22.36) | 0.68 (0.50 to 0.92) | 0.56 (0.39 to 0.81) | 0.55 (0.38 to 0.80) | ||||
1–3 months | 664 | 190 (28.61) | 0.94 (0.78 to 1.14) | 0.89 (0.70 to 1.14) | 0.90 (0.70 to 1.14) | ||||
3–6 months | 595 | 132 (22.18) | 0.67 (0.54 to 0.83) | 0.68 (0.53 to 0.88) | 0.69 (0.54 to 0.88) | ||||
6–12 months | 508 | 130 (25.59) | 0.81 (0.65 to 1.00) | 0.82 (0.63 to 1.07) | 0.83 (0.64 to 1.08) | ||||
≥ 12 months | 99 | 19 (19.19) | 0.56 (0.34 to 0.93) | 0.62 (0.33 to 1.19) | 0.64 (0.34 to 1.21) | ||||
Most complete colonoscopy | Complete | 2973 | 771 (25.93) | 1 | 0.0008 | 1 | < 0.0001 | 1 | < 0.0001 |
Incomplete/unknown | 1635 | 500 (30.58) | 1.26 (1.10 to 1.44) | 1.53 (1.30 to 1.80) | 1.52 (1.29 to 1.80) | ||||
Difficult examination | No | 4387 | 1232 (28.08) | 1 | 0.0004 | 1 | 0.0013 | 1 | 0.0016 |
Yes | 221 | 39 (17.65) | 0.55 (0.39 to 0.78) | 0.56 (0.39 to 0.81) | 0.57 (0.39 to 0.82) | ||||
Number of adenomas | 1 | 3107 | 756 (24.33) | 1 | < 0.0001 | 1 | 0.0002 | 1 | 0.0002 |
2 | 1151 | 375 (32.58) | 1.50 (1.30 to 1.74) | 1.38 (1.18 to 1.63) | 1.38 (1.17 to 1.62) | ||||
3 | 240 | 95 (39.58) | 2.04 (1.55 to 2.67) | 1.50 (1.11 to 2.01) | 1.51 (1.12 to 2.02) | ||||
4 | 110 | 45 (40.91) | 2.15 (1.46 to 3.18) | 1.50 (0.99 to 2.27) | 1.51 (1.00 to 2.28) | ||||
Number of sightings of a unique adenoma | 1 | 3311 | 939 (28.36) | 1 | 0.0008 | 1 | 0.0301 | 1 | 0.03027 |
2 | 1005 | 282 (28.06) | 0.99 (0.84 to 1.15) | 1.22 (0.98 to 1.53) | 1.22 (0.98 to 1.52) | ||||
3 | 182 | 30 (16.48) | 0.50 (0.33 to 0.74) | 0.66 (0.42 to 1.03) | 0.65 (0.42 to 1.02) | ||||
4 | 63 | 11 (17.46) | 0.53 (0.28 to 1.03) | 0.70 (0.34 to 1.44) | 0.70 (0.34 to 1.44) | ||||
5+ | 47 | 9 (19.15) | 0.60 (0.29 to 1.24) | 0.81 (0.34 to 1.94) | 0.82 (0.34 to 1.95) | ||||
Number of polyps with unknown histology | 0 | 3593 | 915 (25.47) | 1 | < 0.0001 | 1 | < 0.0001 | 1 | < 0.0001 |
1 | 556 | 179 (32.19) | 1.39 (1.15 to 1.69) | 1.28 (1.04 to 1.58) | 1.28 (1.04 to 1.58) | ||||
2 | 187 | 78 (41.71) | 2.09 (1.55 to 2.83) | 1.87 (1.36 to 2.59) | 1.87 (1.36 to 2.59) | ||||
3 | 108 | 39 (36.11) | 1.65 (1.11 to 2.47) | 1.44 (0.94 to 2.21) | 1.41 (0.92 to 2.15) | ||||
4 | 63 | 14 (22.22) | 0.84 (0.46 to 1.52) | 0.68 (0.36 to 1.26) | 0.69 (0.37 to 1.28) | ||||
5+ | 101 | 46 (45.54) | 2.45 (1.64 to 3.65) | 2.27 (1.48 to 3.47) | 2.26 (1.48 to 3.46) | ||||
Proximal polyps | No | 2940 | 718 (24.42) | 1 | < 0.0001 | 1 | 0.0032 | 1 | 0.0035 |
Yes | 1668 | 553 (33.15) | 1.53 (1.34 to 1.75) | 1.27 (1.08 to 1.49) | 1.27 (1.08 to 1.49) |
Baseline risk factor | Category | Number of patients (N = 4608) | Univariable analysis: new AA | Multivariable analyses: new AA | |||||
---|---|---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 4608) | Model 2 – interval as continuous (n = 4608) | ||||||||
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | |||
Interval | < 18 months | 1760 | 136 (7.73) | 1 | < 0.0001 | 1 | < 0.0001 | n/a | |
2 yearsa | 976 | 90 (9.22) | 1.21 (0.92 to 1.60) | 1.18 (0.89 to 1.57) | |||||
3 yearsa | 1057 | 105 (9.93) | 1.32 (1.01 to 1.72) | 1.42 (1.08 to 1.88) | |||||
4 yearsa | 355 | 45 (12.68) | 1.73 (1.21 to 2.48) | 1.80 (1.24 to 2.61) | |||||
5 yearsa | 217 | 29 (13.36) | 1.84 (1.20 to 2.83) | 1.88 (1.20 to 2.94) | |||||
6 yearsa | 123 | 18 (14.63) | 2.05 (1.21 to 3.48) | 2.09 (1.21 to 3.62) | |||||
≥ 6.5 years | 120 | 28 (23.33) | 3.63 (2.30 to 5.74) | 3.79 (2.33 to 6.16) | |||||
Per year increase | n/a | n/a | 1.16 (1.11 to 1.22) | < 0.0001 | n/a | 1.18 (1.12 to 1.24) | < 0.0001 | ||
Age (years) | < 55 | 1025 | 66 (6.44) | 1 | < 0.0001 | 1 | < 0.0001 | 1 | < 0.0001 |
≥ 55 and < 60 | 622 | 79 (12.70) | 2.11 (1.50 to 2.98) | 2.11 (1.49 to 3.00) | 2.13 (1.50 to 3.02) | ||||
≥ 60 and < 65 | 788 | 76 (9.64) | 1.55 (1.10 to 2.19) | 1.66 (1.17 to 2.36) | 1.65 (1.16 to 2.34) | ||||
≥ 65 and < 70 | 813 | 72 (8.86) | 1.41 (1.00 to 2.00) | 1.40 (0.98 to 2.00) | 1.40 (0.98 to 2.00) | ||||
≥ 70 and < 75 | 714 | 81 (11.34) | 1.86 (1.32 to 2.61) | 1.95 (1.38 to 2.77) | 1.95 (1.38 to 2.76) | ||||
≥ 75 and < 80 | 413 | 57 (13.80) | 2.33 (1.60 to 3.38) | 2.44 (1.66 to 3.59) | 2.44 (1.66 to 3.59) | ||||
≥ 80 | 233 | 20 (8.58) | 1.36 (0.81 to 2.30) | 1.41 (0.82 to 2.41) | 1.42 (0.83 to 2.42) | ||||
Most complete colonoscopy | Complete | 2973 | 244 (8.21) | 1 | < 0.0001 | 1 | < 0.0001 | 1 | < 0.0001 |
Incomplete/unknown | 1635 | 207 (12.66) | 1.62 (1.33 to 1.97) | 1.69 (1.34 to 2.11) | 1.69 (1.35 to 2.12) | ||||
Largest adenoma (mm) | < 20 | 2880 | 265 (9.20) | 1 | 0.0857 | 1 | 0.0101 | 1 | 0.012 |
≥ 20 | 1728 | 186 (10.76) | 1.19 (0.98 to 1.45) | 1.31 (1.07 to 1.62) | 1.30 (1.06 to 1.60) | ||||
Large hyperplastic polyp | No | 4525 | 436 (9.64) | 1 | 0.0199 | 1 | 0.0211 | 1 | 0.0198 |
Yes | 83 | 15 (18.07) | 2.07 (1.17 to 3.65) | 2.10 (1.17 to 3.78) | 2.11 (1.17 to 3.80) | ||||
Proximal polyp | No | 2940 | 265 (9.01) | 1 | 0.0199 | 1 | < 0.0001 | 1 | < 0.0001 |
Yes | 1668 | 186 (11.15) | 1.27 (1.04 to 1.54) | 1.61 (1.30 to 2.00) | 1.61 (1.30 to 2.00) |
Baseline risk factor | Category | Number of patients (N = 4608) | Univariate analysis: new CRC | Multivariable analyses: new CRC | |||||
---|---|---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 4608) | Model 2 – interval as continuous (n = 4608) | ||||||||
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | |||
Interval | < 18 months | 1760 | 15 (0.85) | 1 | 0.0004 | 1 | < 0.0001 | n/a | |
2 yearsa | 976 | 9 (0.92) | 1.08 (0.47 to 2.48) | 1.08 (0.47 to 2.5) | |||||
3 yearsa | 1057 | 5 (0.47) | 0.55 (0.20 to 1.53) | 0.69 (0.25 to 1.91) | |||||
4 yearsa | 355 | 9 (2.54) | 3.03 (1.31 to 6.97) | 4.02 (1.72 to 9.42) | |||||
5 yearsa | 217 | 4 (1.84) | 2.18 (0.72 to 6.64) | 3.11 (1 to 9.64) | |||||
6 yearsa | 123 | 4 (3.25) | 3.91 (1.28 to 11.97) | 5.56 (1.77 to 17.44) | |||||
≥ 6.5 years | 120 | 6 (5.00) | 6.12 (2.33 to 16.08) | 9.08 (3.36 to 24.59) | |||||
Per year increase | n/a | n/a | 1.27 (1.16 to 1.40) | < 0.0001 | n/a | 1.32 (1.20 to 1.46) | < 0.0001 | ||
Age (years) | < 60 | 1647 | 7 (0.43) | 1 | 0.0002 | 1 | < 0.0001 | 1 | < 0.0001 |
≥ 60 and < 65 | 788 | 5 (0.63) | 1.50 (0.47 to 4.73) | 1.54 (0.48 to 4.91) | 1.39 (0.43 to 4.47) | ||||
≥ 65 and < 70 | 813 | 10 (1.23) | 2.92 (1.11 to 7.69) | 2.86 (1.07 to 7.60) | 2.79 (1.05 to 7.44) | ||||
≥ 70 and < 75 | 714 | 12 (1.68) | 4.00 (1.57 to 10.21) | 4.54 (1.76 to 11.69) | 4.44 (1.72 to 11.43) | ||||
≥ 75 and < 80 | 413 | 12 (2.91) | 7.01 (2.74 to 17.92) | 8.37 (3.23 to 21.67) | 8.30 (3.21 to 21.49) | ||||
≥ 80 | 233 | 6 (2.58) | 6.19 (2.06 to 18.59) | 7.03 (2.29 to 21.56) | 7.00 (2.29 to 21.41) | ||||
Most complete colonoscopy | Complete | 2973 | 25 (0.84) | 1 | 0.0149 | n/a | 1 | 0.0384 | |
Incomplete/unknown | 1635 | 27 (1.65) | 1.98 (1.15 to 3.42) | 1.89 (1.04 to 3.44) | |||||
Best bowel preparation | Excellent/good/satisfactory/unknown | 4414 | 44 (1) | 1 | 0.0016 | 1 | 0.0005 | 1 | 0.0008 |
Poor | 194 | 8 (4.12) | 4.27 (1.98 to 9.20) | 5.31 (2.39 to 11.81) | 4.93 (2.22 to 10.93) | ||||
Proximal polyp | No | 2940 | 29 (0.99) | 1 | 0.2316 | n/a | 1 | 0.0462 | |
Yes | 1668 | 23 (1.38) | 1.40 (0.81 to 2.43) | 1.84 (1.02 to 3.33) |
Adenomas (all types)
After adjusting for the effects of covariates, the independent association between interval and new adenomas at FUV1 remained highly significant (p < 0.0001), with 16% increased odds of having a new adenoma per year increase in interval length (OR 1.16, 95% CI 1.11 to 1.21). The effect estimates were precise, and there was an apparent dose–response effect, providing strong evidence of an association. There was also evidence of weak negative confounding as the effect was strengthened slightly after adjustment.
Independent risk factors for new adenomas at FUV1 included older age, male gender, later date of baseline, no complete colonoscopy, and the presence of hyperplastic polyps, proximal polyps or multiple adenomas at baseline (all p < 0.004). A difficult examination or a baseline visit of longer than 1 day both appeared to confer a lower chance of having a new adenoma; however, the association with length of visit was irregular. The effect of number of sightings of an adenoma at baseline was somewhat difficult to interpret, as two sightings conferred 22% greater odds, whereas three or more sightings were associated with lower odds of having a new adenoma at FUV1.
Models 1 and 2, with interval as categorical and continuous, were very similar and used the same covariates. All covariates were significant before and after adjustment, with little evidence of confounding. After removal of previously seen findings, larger size of baseline adenoma was no longer predictive of finding new adenomas at FUV1.
Advanced adenomas
There was strong evidence of a significant association between interval and new AA at FUV1: effect estimates were precise and demonstrated a dose–response effect, with an 18% increased odds of new AA at FUV1 per year increase in interval (OR 1.18, 95% CI 1.12 to 1.24; p < 0.0001). Adjustment for covariates had little impact on the effect of interval.
Baseline risk factors for new AA at FUV1 included no complete colonoscopy (OR 1.69, 95% CI 1.34 to 2.11), and the presence of an adenoma of ≥ 20 mm (OR 1.30, 95% CI 1.06 to 1.60), a proximal polyp (OR 1.61, 95% CI 1.30 to 2.00) or a large hyperplastic polyp (OR 2.11, 95% CI 1.71 to 3.80). Older age also conferred greater odds of new AA, although the association with increasing age was irregular.
Models 1 and 2 were quite similar. All risk factors were significantly associated with new AA before and after adjustment except for largest adenoma, the effect of which was strengthened after adjustment. After excluding previously seen lesions, histology, number of sightings of a single adenoma, and number of adenomas or of hyperplastic polyps were no longer predictive of new AA at FUV1.
Colorectal cancer
After adjusting for the effects of covariates, the effect of interval length on new CRC at FUV1 was strengthened, with a more than fivefold greater odds of CRC among those with an interval of ≥ 6 years, and a 32% increase in odds per year increase in interval (OR 1.32, 95% CI 1.20 to 1.46). Owing to the small number of new CRC outcomes, measures of effect for some strata of interval were imprecise; however, the large effect sizes, tendency towards a dose–response effect and highly significant p-value provide strong evidence of an association.
Older age and poor bowel preparation (p = 0.0005) were highly significant risk factors for new CRC at FUV1. Although effect estimates were imprecise for individual categories, risk tended to increase with age and the estimated increase in odds was sevenfold or greater for those aged ≥ 75 years and was fivefold greater for poor bowel preparation. In model 2 (with interval as continuous), the absence of a complete colonoscopy and the presence of proximal polyps at baseline were weakly associated with new CRC at FUV1.
There was some evidence of negative confounding as the effects of age, best bowel preparation and proximal polyp were strengthened after adjusting for covariates. Histology and dysplasia of baseline adenomas were no longer significantly associated with CRC at FUV1 after removal of previously seen lesions from the analysis.
Effect modification of the association between interval and new findings at follow-up visit 1
We proposed that there might be an interaction between interval and age or gender. We investigated interactions with interval to follow-up only as a continuous variable, as these results were more intuitive and enabled the examination of potential trends.
There was no evidence of effect modification by age group or gender on new adenomas (Figure 3). There was some evidence of effect modification for the finding of new AA at FUV1 (Figure 4). By age group, the test for interaction was highly significant (p = 0.0100), although there was no clear trend in the ORs. Increasing the interval had the greatest effect in the < 55 years age group but the decrease in effect was not monotonic (Figure 4). To test for a trend in the ORs, an interaction was fitted between interval and continuous age group; the p-value was 0.8987. Thus, the effect of interval differed between the categorical age groups, but there was no trend in the effect with increasing age group. By gender, the ORs suggested that increasing interval had a stronger effect in men than in women, but this difference was not statistically significant (p = 0.0663). There was no evidence of effect modification on new CRC (Figure 5).
Although we detected a significant interaction between interval and age group, we did not model the interaction parameter in previously presented results, as it is likely to be impractical to offer different surveillance strategies based on age or gender in a clinical setting.
Second follow-up visit
Characteristics and findings
Of the 4608 patients who attended FUV1, 1635 (36%) patients returned for FUV2 during our data collection period and were not censored for cancer diagnosed at first follow-up.
Table 31 details the findings (new and previously seen) at FUV2 according to the interval between FUV1 and FUV2. Adenomas were detected in 527 (32%) patients, AA in 232 (14%) patients and CRC in 17 (1%) patients. The adenoma detection rate was high regardless of interval, varying from 30% to 43%, whereas the proportion of patients with AA varied from 8% to 20%, and the proportion with CRC varied from none to 3%. There was little evidence of a trend in findings with increasing interval for any of the outcomes.
Interval FUV1 to FUV2 | Number of IR patients (N = 1635) | Patients with findingsa at FUV2 | |||||
---|---|---|---|---|---|---|---|
Adenoma(s) | AA(s) | CRC(s) | |||||
n | % (n/N) | n | % (n/N) | n | % (n/N) | ||
< 18 months | 397 | 144 | 36.27 | 79 | 19.90 | 4 | 1.01 |
2 yearsb | 376 | 116 | 30.85 | 55 | 14.63 | 5 | 1.33 |
3 yearsb | 518 | 153 | 29.54 | 63 | 12.16 | 3 | 0.58 |
4 yearsb | 152 | 46 | 30.26 | 14 | 9.21 | 2 | 1.32 |
5 yearsb | 131 | 44 | 33.59 | 11 | 8.40 | 2 | 1.53 |
6 yearsb | 31 | 11 | 35.48 | 4 | 12.90 | 0 | 0 |
≥ 6.5 years | 30 | 13 | 43.33 | 6 | 20.00 | 1 | 3.33 |
Total | 1635 | 527 | 32.23 | 232 | 14.19 | 17 | 1.04 |
Table 32 shows the examinations undertaken at FUV2. In 89% of patients, FUV2 comprised a single procedure and was completed in 1 day. The most complete examination that we were able to glean from information provided on procedure type and polyp location was a complete colonoscopy in 74% of cases, and an incomplete colonoscopy in 15%. A further 5% are likely to have had a colonoscopy, as this is the most common procedure to offer patients undergoing surveillance in the UK, but 6% had only a FS.
Characteristic | Category | IR patients (N = 1635) | |
---|---|---|---|
n | % | ||
Number of examinations | 1 | 1460 | 89.30 |
2 | 125 | 7.65 | |
3 | 27 | 1.65 | |
4+ | 23 | 1.41 | |
Length of FUV2 | 1 day | 1461 | 89.36 |
2–30 days | 14 | 0.86 | |
1–3 months | 36 | 2.20 | |
3–6 months | 44 | 2.69 | |
6–12 months | 61 | 3.73 | |
1–2 years | 16 | 0.98 | |
3–4 years | 3 | 0.18 | |
Most complete examination at FUV2 | Complete colonoscopy | 1206 | 73.76 |
Colonoscopy not known to be complete | 241 | 14.74 | |
Colonoscopy or FS | 47 | 2.87 | |
FS | 106 | 6.48 | |
Colonoscopy, FS or rigid sigmoidoscopy | 28 | 1.71 | |
Surgery | 6 | 0.37 | |
Unknown procedure type | 1 | 0.06 |
New and previously seen lesions at second follow-up
Tables 33–35 show the status of findings at FUV2 – whether or not a lesion had been seen at a previous visit – stratified by the interval from FUV1 to FUV2. Similar to findings at FUV1, there was a trend towards an increasing proportion of new findings in patients with a longer interval, and a greater proportion of previously seen lesions in those with a shorter interval.
Adenoma status | Interval to from first to second follow-up, n (%) | Total, n (%) | ||||||
---|---|---|---|---|---|---|---|---|
< 18 months | 2 yearsa | 3 yearsa | 4 yearsa | 5 yearsa | 6 yearsa | ≥ 6.5 years | ||
None | 253 (63.73) | 260 (69.15) | 365 (70.46) | 106 (69.74) | 87 (66.41) | 20 (64.52) | 17 (56.67) | 1108 (67.77) |
Previously seen only | 56 (14.11) | 23 (6.12) | 16 (3.09) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 95 (5.81) |
New and previously seen | 13 (3.27) | 9 (2.39) | 6 (1.16) | 2 (1.32) | 0 (0) | 1 (3.23) | 0 (0) | 31(1.90) |
New only | 75 (18.89) | 84 (22.34) | 131 (25.29) | 44 (28.95) | 44 (33.59) | 10 (32.26) | 13 (43.33) | 401 (24.53) |
Total | 397 (100) | 376 (100) | 518 (100) | 152 (100) | 131 (100) | 31 (100) | 30 (100) | 1635 (100) |
AA status | Interval to from first to second follow-up, n (%) | Total, n (%) | ||||||
---|---|---|---|---|---|---|---|---|
< 18 months | 2 yearsa | 3 yearsa | 4 yearsa | 5 yearsa | 6 yearsa | ≥ 6.5 years | ||
None | 318 (80.10) | 321 (85.37) | 455 (87.84) | 138 (90.79) | 120 (91.6) | 27 (87.10) | 24 (80) | 1403 (85.81) |
Previously seen only | 51 (12.85) | 23 (6.12) | 12 (2.32) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 86 (5.26) |
New and previously seen | 4 (1.01) | 5 (1.33) | 5 (0.97) | 0 (0) | 0 (0) | 1 (3.23) | 0 (0) | 15 (0.92) |
New only | 24 (6.05) | 27 (7.18) | 46 (8.88) | 14 (9.21) | 11 (8.40) | 3 (9.68) | 6 (20) | 131 (8.01) |
Total | 397 (100) | 376 (100) | 518 (100) | 152 (100) | 131 (100) | 31 (100) | 30 (100) | 1635 (100) |
CRC status | Interval to from first to second follow-up, n (%) | Total, n (%) | ||||||
---|---|---|---|---|---|---|---|---|
< 18 months | 2 yearsa | 3 yearsa | 4 yearsa | 5 yearsa | 6 yearsa | ≥ 6.5 years | ||
None | 393 (98.99) | 371 (98.67) | 515 (99.42) | 150 (98.68) | 129 (98.47) | 31 (100) | 29 (96.67) | 1618 (98.96) |
Previously seen only | 2 (0.50) | 1 (0.27) | 2 (0.39) | 1 (0.66) | 2 (1.53) | 0 (0) | 0 (0) | 8 (0.49) |
New and previously seen | 2 (0.50) | 4 (1.06) | 1 (0.19) | 1 (0.66) | 0 (0) | 0 (0) | 1 (3.33) | 9 (0.55) |
Total | 397 (100) | 376 (100) | 518 (100) | 152 (100) | 131 (100) | 31 (100) | 30 (100) | 1635 (100) |
Based on these observations, all subsequent analyses of findings at FUV2 included only new findings, so as to allow the association between interval to FUV2 and finding at FUV2 to be examined without any confounding effects of polypectomy site surveillance, as was done in the analysis of new findings at FUV1.
The proportion of patients with new adenomas was high at both the first and second follow-ups, regardless of interval length. The high detection rate of adenomas meant that this outcome was not informative in terms of identifying an optimum surveillance strategy. For this reason, adenomas were not considered as an end point in subsequent analyses for FUV2, and only AA or CRC were used as outcomes.
Follow-up visit 1 risk factors for new advanced adenomas and colorectal cancer at follow-up visit 2
Univariable analyses were performed to assess the relationship between FUV1 characteristics and detection of new AA and CRC at FUV2. Table 36 describes new AA and CRC incidence at FUV2 according to patient characteristics and examinations at FUV1. Most patient or procedural characteristics were not significantly predictive. There was weak evidence that suboptimal bowel preparation increased the odds of new AA (p = 0.0178), but again 95% CIs included 1. There was some evidence of an association between new CRC at FUV2 and a difficult examination at FUV1 (OR 5.99, 95% CI 1.22 to 29.35; p = 0.0636).
First follow-up characteristics | Number of patients (N = 1635) | New AA at second follow-up | New CRC at second follow-up | |||||
---|---|---|---|---|---|---|---|---|
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | n (%) | Unadjusted OR (95% CI) | p-value (LRT) | |||
Age (years) at first follow-up | < 55 | 329 | 25 (7.6) | 1 | 0.7718 | 2 (0.61) | 1 | 0.7998 |
≥ 55 and < 60 | 256 | 21 (8.2) | 1.09 (0.59 to 1.99) | 0 (0) | n/a | |||
≥ 60 and < 65 | 279 | 30 (10.75) | 1.47 (0.84 to 2.56) | 2 (0.72) | 1.18 (0.17 to 8.44) | |||
≥ 65 and < 70 | 305 | 31 (10.16) | 1.38 (0.79 to 2.39) | 1 (0.33) | 0.54 (0.05 to 5.96) | |||
≥ 70 and < 75 | 253 | 19 (7.51) | 0.99 (0.53 to 1.84) | 1 (0.4) | 0.65 (0.06 to 7.20) | |||
≥ 75 and < 80 | 142 | 13 (9.15) | 1.23 (0.61 to 2.47) | 2 (1.41) | 2.34 (0.33 to 16.75) | |||
≥ 80 | 71 | 7 (9.86) | 1.33 (0.55 to 3.21) | 1 (1.41) | 2.34 (0.21 to 26.12) | |||
Gender | Male | 956 | 90 (9.41) | 1 | 0.4132 | 5 (0.52) | 1 | 0.8592 |
Female | 679 | 56 (8.25) | 0.86 (0.61 to 1.23) | 4 (0.59) | 1.13 (0.30 to 4.21) | |||
Family history of cancer or CRC | No | 1523 | 137 (9) | 1 | 0.7273 | 8 (0.53) | 1 | 0.6393 |
Yes | 112 | 9 (8.04) | 0.88 (0.44 to 1.79) | 1 (0.89) | 1.71 (0.21 to 13.76) | |||
Year of first follow-up | 1985–94 | 125 | 12 (9.6) | 1 | 0.6721 | 1 (0.8) | 1 | 0.6372 |
1995–9 | 355 | 29 (8.17) | 0.84 (0.41 to 1.70) | 1 (0.28) | 0.35 (0.02 to 5.64) | |||
2000–4 | 855 | 73 (8.54) | 0.88 (0.46 to 1.67) | 4 (0.47) | 0.58 (0.06 to 5.26) | |||
2005–9 | 300 | 32 (10.67) | 1.12 (0.56 to 2.26) | 3 (1) | 1.25 (0.13 to 12.16) | |||
Length of visit | 1 day | 1422 | 119 (8.37) | 1 | 0.3713 | 8 (0.56) | 1 | 0.3574 |
2–30 days | 18 | 2 (11.11) | 1.37 (0.31 to 6.02) | 0 (0) | n/a | |||
1–3 months | 37 | 5 (13.51) | 1.71 (0.65 to 4.47) | 0 (0) | n/a | |||
3–6 months | 58 | 5 (8.62) | 1.03 (0.41 to 2.63) | 1 (1.72) | 3.10 (0.38 to 25.21) | |||
6–12 months | 78 | 12 (15.38) | 1.99 (1.05 to 3.79) | 0 (0) | n/a | |||
≥ 12 months | 22 | 3 (13.64) | 1.73 (0.50 to 5.93) | 0 (0) | n/a | |||
Number of examinations in visit | 1 | 1420 | 119 (8.38) | 1 | 0.147 | 8 (0.56) | 1 | 0.8762 |
2 | 150 | 16 (10.67) | 1.31 (0.75 to 2.27) | 1 (0.67) | 1.18 (0.15 to 9.54) | |||
3 | 44 | 8 (18.18) | 2.43 (1.10 to 5.35) | 0 (0) | n/a | |||
4+ | 21 | 3 (14.29) | 1.82 (0.53 to 6.28) | 0 (0) | n/a | |||
Most complete examination | Complete colonoscopy | 1087 | 101 (9.29) | 1 | 0.7689 | 4 (0.37) | 1 | 0.2073 |
Colonoscopy of unknown completeness | 130 | 8 (6.15) | 0.64 (0.30 to 1.35) | 1 (0.77) | 2.10 (0.23 to 18.92) | |||
Incomplete colonoscopy | 145 | 12 (8.28) | 0.88 (0.47 to 1.65) | 3 (2.07) | 5.72 (1.27 to 25.82) | |||
Colonoscopy or FS | 106 | 11 (10.38) | 1.13 (0.59 to 2.18) | 0 (0) | n/a | |||
FS | 108 | 8 (7.41) | 0.78 (0.37 to 1.65) | 1 (0.93) | 2.53 (0.28 to 22.84) | |||
Colonoscopy or flexible or rigid sigmoidoscopy | 53 | 6 (11.32) | 1.25 (0.52 to 2.99) | 0 (0) | n/a | |||
Surgery | 2 | 0 (0) | n/a | 0 (0) | n/a | |||
Unknown | 4 | 0 (0) | n/a | 0 (0) | n/a | |||
Best bowel preparation at colonoscopy | Excellent/good | 464 | 31 (6.68) | 1 | 0.0178 | 2 (0.43) | 1 | 0.8971 |
Satisfactory | 169 | 27 (15.98) | 2.66 (1.53 to 4.60) | 1 (0.59) | 1.38 (0.12 to 15.26) | |||
Poor | 68 | 5 (7.35) | 1.11 (0.42 to 2.96) | 1 (1.47) | 3.45 (0.31 to 38.54) | |||
Unknown | 661 | 58 (8.77) | 1.34 (0.85 to 2.11) | 4 (0.61) | 1.41 (0.26 to 7.71) | |||
No known colonoscopy | 273 | 25 (9.16) | 1.41 (0.81 to 2.44) | 1 (0.37) | 0.85 (0.08 to 9.41) | |||
Difficult examination | No | 1559 | 136 (8.72) | 1 | 0.2117 | 7 (0.45) | 1 | 0.0636 |
Yes | 76 | 10 (13.16) | 1.59 (0.80 to 3.15) | 2 (2.63) | 5.99 (1.22 to 29.35) |
Table 37 describes new AA and new CRC incidence at FUV2 according to characteristics of adenomas and polyps detected at FUV1. There was a tendency towards increasing odds of new AA at FUV2 with increasing number and size of adenomas, severity of histology and proximal location of adenomas at FUV1, as well as in the presence of proximal polyps or polyps of unknown histology. Odds of new AA tended to increase with repeated sightings of an adenoma during FUV1 but the association was not significant (p = 0.0609). No significant relationship was found between new CRC at FUV2 and characteristics of adenomas or polyps seen at FUV1, a finding that was most likely due to the very small number of CRC outcomes at FUV2.
First follow-up characteristics | Number of patients (N = 1635) | New AA at second follow-up | New CRC at second follow-up | |||||
---|---|---|---|---|---|---|---|---|
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | n (%) | Unadjusted OR (95% CI) | p-value (LRT) | |||
Adenoma characteristics | ||||||||
Number | 0 | 1010 | 75 (7.43) | 1 | 0.0104 | 6 (0.59) | 1 | 0.8868 |
1 | 429 | 39 (9.09) | 1.25 (0.83 to 1.87) | 2 (0.47) | 0.78 (0.16 to 3.90) | |||
2 | 117 | 20 (17.09) | 2.57 (1.50 to 4.39) | 1 (0.85) | 1.44 (0.17 to 12.09) | |||
3 | 37 | 4 (10.81) | 1.51 (0.52 to 4.38) | 0 | n/a | |||
4 | 21 | 4 (19.05) | 2.93 (0.96 to 8.94) | 0 | n/a | |||
5+ | 21 | 4 (19.05) | 2.93 (0.96 to 8.94) | 0 | n/a | |||
Largest size (mm) | No adenomas | 1010 | 75 (7.43) | 1 | 0.0066 | 6 (0.59) | 1 | 0.6889 |
< 10 | 379 | 36 (9.50) | 1.31 (0.86 to 1.98) | 3 (0.79) | 1.34 (0.33 to 5.37) | |||
10–14 | 95 | 8 (8.42) | 1.15 (0.54 to 2.45) | 0 (0) | n/a | |||
15–19 | 52 | 7 (13.46) | 1.94 (0.85 to 4.45) | 0 (0) | n/a | |||
≥ 20 | 75 | 15 (20.00) | 3.12 (1.69 to 5.75) | 0 (0) | n/a | |||
Unknown | 24 | 5 (20.83) | 3.28 (1.19 to 9.03) | 0 (0) | n/a | |||
Worst histology | No adenomas | 1010 | 75 (7.43) | 1 | 0.0476 | 6 (0.59) | 1 | 0.7327 |
Tubular | 340 | 33 (9.71) | 1.34 (0.87 to 2.06) | 2 (0.59) | 0.99 (0.20 to 4.93) | |||
Tubulovillous | 156 | 19 (12.18) | 1.73 (1.01 to 2.95) | 0 (0) | n/a | |||
Villous | 66 | 10 (15.15) | 2.23 (1.09 to 4.54) | 1 (1.52) | 2.57 (0.31 to 21.70) | |||
Unknown | 63 | 9 (14.29) | 2.08 (0.99 to 4.37) | 0 (0) | n/a | |||
Worst dysplasia | No adenomas | 1010 | 75 (7.43) | 1 | 0.0387 | 6 (0.59) | 1 | 0.9933 |
Low grade | 508 | 61 (12.01) | 1.70 (1.19 to 2.43) | 3 (0.59) | 0.99 (0.25 to 3.99) | |||
High grade | 46 | 4 (8.7) | 1.19 (0.41 to 3.40) | 0 (0) | n/a | |||
Unknown | 71 | 6 (8.45) | 1.15 (0.48 to 2.74) | 0 (0) | n/a | |||
Distal adenomas | No adenomas | 1010 | 75 (7.43) | 1 | 0.0274 | 6 (0.59) | 1 | 0.5904 |
No | 242 | 27 (11.16) | 1.57 (0.98 to 2.49) | 2 (0.83) | 1.39 (0.28 to 6.95) | |||
Yes | 383 | 44 (11.49) | 1.62 (1.09 to 2.40) | 1 (0.26) | 0.44 (0.05 to 3.65) | |||
Proximal adenomas | No adenomas | 1010 | 75 (7.43) | 1 | 0.0115 | 6 (0.59) | 1 | 0.4909 |
No | 319 | 31 (9.72) | 1.34 (0.87 to 2.08) | 0 (0) | n/a | |||
Yes | 306 | 40 (13.07) | 1.87 (1.25 to 2.82) | 3 (0.98) | 1.66 (0.41 to 6.66) | |||
Number of sightings of a single adenoma | No adenomas | 1010 | 75 (7.43) | 1 | 0.0609 | 6 (0.59) | 1 | 0.9431 |
1 | 531 | 60 (11.3) | 1.59 (1.11 to 2.27) | 3 (0.56) | 0.95 (0.24 to 3.82) | |||
2 | 60 | 6 (10) | 1.39 (0.58 to 3.32) | 0 (0) | n/a | |||
3 | 20 | 3 (15) | 2.20 (0.63 to 7.68) | 0 (0) | n/a | |||
4 | 8 | 2 (25) | 4.16 (0.82 to 20.95) | 0 (0) | n/a | |||
5+ | 6 | 0 (0) | n/a | 0 (0) | n/a | |||
Polyp characteristics | ||||||||
Number of hyperplastic polyps | 0 | 1357 | 124 (9.14) | 1 | 0.6929 | 9 (0.66) | 1 | n/a |
1 | 173 | 13 (7.51) | 0.81 (0.45 to 1.46) | 0 (0) | n/a | |||
2 | 63 | 4 (6.35) | 0.67 (0.24 to 1.89) | 0 (0) | n/a | |||
3 | 17 | 1 (5.88) | 0.62 (0.08 to 4.73) | 0 (0) | n/a | |||
4 | 15 | 3 (20) | 2.49 (0.69 to 8.93) | 0 (0) | n/a | |||
5+ | 10 | 1 (10) | 1.10 (0.14 to 8.79) | 0 (0) | n/a | |||
Large hyperplastic polyp | No | 1618 | 142 (8.78) | 1 | 0.0712 | 9 (0.56) | 1 | n/a |
Yes | 17 | 4 (23.53) | 3.20 (1.03 to 9.94) | 0 (0) | n/a | |||
Number of polyps with unknown histology | 0 | 1330 | 105 (7.89) | 1 | 0.0004 | 7 (0.53) | 1 | 0.6705 |
1 | 180 | 23 (12.78) | 1.71 (1.06 to 2.76) | 1 (0.56) | 1.06 (0.13 to 8.63) | |||
2 | 64 | 6 (9.38) | 1.21 (0.51 to 2.86) | 1 (1.56) | 3.00 (0.36 to 24.76) | |||
3 | 25 | 3 (12) | 1.59 (0.47 to 5.40) | 0 (0) | n/a | |||
4 | 13 | 0 (0) | n/a | 0 (0) | n/a | |||
5+ | 23 | 9 (39.13) | 7.50 (3.17 to 17.74) | 0 (0) | n/a | |||
Distal polyps | No polyps | 667 | 44 (6.6) | 1 | 0.0046 | 5 (0.75) | 1 | 0.643 |
No | 306 | 40 (13.07) | 2.13 (1.36 to 3.34) | 1 (0.33) | 0.43 (0.05 to 3.73) | |||
Yes | 662 | 62 (9.37) | 1.46 (0.98 to 2.19) | 3 (0.45) | 0.60 (0.14 to 2.53) | |||
Proximal polyps | No polyps | 667 | 44 (6.6) | 1 | 0.0006 | 5 (0.75) | 1 | 0.372 |
No | 499 | 40 (8.02) | 1.23 (0.79 to 1.93) | 1 (0.2) | 0.27 (0.03 to 2.28) | |||
Yes | 469 | 62 (13.22) | 2.16 (1.44 to 3.24) | 3 (0.64) | 0.85 (0.20 to 3.58) |
Baseline risk factors for new advanced adenomas and colorectal cancer at the second follow-up visit
The crude association of baseline characteristics with new findings at FUV2 was investigated.
Table 38 describes crude associations of patient and procedural characteristics at baseline with new AA or CRC at FUV2. Patients with an incomplete baseline colonoscopy had a twofold increased odds of AA at FUV2 (OR 2.03, 95% CI 1.24 to 3.33). There was a tendency for increased odds of AA at FUV2 with increasing interval length between baseline and FUV1, although effect estimates were imprecise and most 95% CIs included 1 (p = 0.0212). No other factors appeared to be associated with new AA at FUV2. There was little evidence of an association between any patient or procedural characteristics at baseline and detection of new CRC at FUV2, as estimates were extremely imprecise with wide CIs and non-significant test statistics.
Baseline factors | Number of patients (N = 1635) | New AA at the second follow-up | New CRC at the second follow-up | |||||
---|---|---|---|---|---|---|---|---|
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | n (%) | Unadjusted OR (95% CI) | p-value (LRT) | |||
Family history of cancer | No | 1540 | 140 (9.09) | 1.00 | 0.3349 | 8 (0.52) | 1.00 | 0.5417 |
Yes | 95 | 6 (6.32) | 0.67 (0.29 to 1.57) | 1 (1.05) | 2.04 (0.25 to 16.46) | |||
Calendar year of baseline visit | 1985–94 | 218 | 23 (10.55) | 1.00 | 0.0644 | 1 (0.46) | 1.00 | 0.8971 |
1995–9 | 607 | 40 (6.59) | 0.60 (0.35 to 1.02) | 3 (0.49) | 1.08 (0.11 to 10.42) | |||
2000–4 | 730 | 73 (10) | 0.94 (0.57 to 1.55) | 4 (0.55) | 1.20 (0.13 to 10.75) | |||
2005–10 | 80 | 10 (12.5) | 1.21 (0.55 to 2.67) | 1 (1.25) | 2.75 (0.17 to 44.44) | |||
Length of visit | 1 day | 904 | 77 (8.52) | 1.00 | 0.9364 | 4 (0.44) | 1.00 | 0.7511 |
2–30 days | 75 | 7 (9.33) | 1.11 (0.49 to 2.49) | 0 (0) | n/a | |||
1–3 months | 230 | 23 (10) | 1.19 (0.73 to 1.95) | 2 (0.87) | 1.97 (0.36 to 10.84) | |||
3–6 months | 195 | 16 (8.21) | 0.96 (0.55 to 1.68) | 1 (0.51) | 1.16 (0.13 to 10.43) | |||
6–12 months | 191 | 20 (10.47) | 1.26 (0.75 to 2.11) | 2 (1.05) | 2.38 (0.43 to 13.09) | |||
≥ 12 months | 40 | 3 (7.5) | 0.87 (0.26 to 2.89) | 0 (0) | n/a | |||
Number of examinations in visit | 1 | 900 | 76 (8.44) | 1.00 | 0.2478 | 4 (0.44) | 1.00 | 0.7108 |
2 | 512 | 45 (8.79) | 1.04 (0.71 to 1.54) | 4 (0.78) | 1.76 (0.44 to 7.08) | |||
3 | 138 | 19 (13.77) | 1.73 (1.01 to 2.97) | 1 (0.72) | 1.64 (0.18 to 14.74) | |||
4+ | 85 | 6 (7.06) | 0.82 (0.35 to 1.95) | 0 (0) | 1 (0 to 0) | |||
Most complete colonoscopy | Complete | 944 | 75 (7.94) | 1.00 | 0.0278 | 5 (0.53) | 1.00 | 0.0896 |
Unknown completeness | 530 | 47 (8.87) | 1.13 (0.77 to 1.65) | 1 (0.19) | 0.36 (0.04 to 3.05) | |||
Incomplete | 161 | 24 (14.91) | 2.03 (1.24 to 3.33) | 3 (1.86) | 3.57 (0.84 to 15.07) | |||
Best bowel preparation at colonoscopy | Excellent/good | 465 | 36 (7.74) | 1.00 | 0.6065 | 2 (0.43) | 1.00 | 0.8677 |
Satisfactory | 127 | 10 (7.87) | 1.02 (0.49 to 2.11) | 1 (0.79) | 1.84 (0.17 to 20.43) | |||
Poor | 41 | 3 (7.32) | 0.94 (0.28 to 3.20) | 0 (0) | n/a | |||
Unknown | 1002 | 97 (9.68) | 1.28 (0.86 to 1.90) | 6 (0.6) | 1.39 (0.28 to 6.94) | |||
Difficult examination | No | 1575 | 138 (8.76) | 1.00 | 0.2516 | 9 (0.57) | n/a | n/a |
Yes | 60 | 8 (13.33) | 1.6 (0.75 to 3.44) | 0 (0) | ||||
Interval from baseline to first follow-up | < 18 months | 783 | 73 (9.32) | 1.00 | 0.0212 | 4 (0.51) | 1.00 | 0.4096 |
2 yearsa | 379 | 34 (8.97) | 0.96 (0.63 to 1.47) | 1 (0.26) | 0.52 (0.06 to 4.63) | |||
3 yearsa | 272 | 22 (8.09) | 0.86 (0.52 to 1.41) | 2 (0.74) | 1.44 (0.26 to 7.92) | |||
4 yearsa | 109 | 3 (2.75) | 0.28 (0.09 to 0.89) | 2 (1.83) | 3.64 (0.66 to 20.11) | |||
5 yearsa | 45 | 7 (15.56) | 1.79 (0.77 to 4.16) | 0 (0) | n/a | |||
6 yearsa | 22 | 1 (4.55) | 0.46 (0.06 to 3.49) | 0 (0) | n/a | |||
≥ 6.5 years | 25 | 6 (24) | 3.07 (1.19 to 7.93) | 0 (0) | n/a |
Table 39 describes the characteristics of polyps and adenomas detected at baseline, by whether patients had new AA or CRC found at FUV2. There was no association between any baseline adenoma or polyp characteristic and detection of new AA at FUV2. Similarly, no baseline polyp characteristic was a significant predictor of new CRC at FUV2, although this was affected by the small number of CRCs found at FUV2.
Baseline factors | Number of patients (N = 1635) | New AA at the second follow-up | New CRC at the second follow-up | |||||
---|---|---|---|---|---|---|---|---|
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | n (%) | Unadjusted OR (95% CI) | p-value (LRT) | |||
Adenoma characteristics | ||||||||
Number | 1 | 1138 | 92 (8.08) | 1.00 | 0.3000 | 7 (0.62) | 1.00 | 0.8249 |
2 | 387 | 44 (11.37) | 1.46 (1.00 to 2.13) | 2 (0.52) | 0.84 (0.17 to 4.06) | |||
3 | 67 | 6 (8.96) | 1.12 (0.47 to 2.66) | 0 (0) | n/a | |||
4 | 43 | 4 (9.3) | 1.17 (0.41 to 3.34) | 0 (0) | n/a | |||
Largest size (mm) | < 10 | 110 | 10 (9.09) | 1.00 | 0.9819 | 0 (0) | n/a | 0.3931 |
10–14 | 523 | 45 (8.6) | 0.94 (0.46 to 1.93) | 5 (0.96) | 1.00 | |||
15–19 | 343 | 30 (8.75) | 0.96 (0.45 to 2.03) | 1 (0.29) | 0.30 (0.04 to 2.60) | |||
≥ 20 | 659 | 61 (9.26) | 1.02 (0.51 to 2.06) | 3 (0.46) | 0.47 (0.11 to 1.99) | |||
Worst histology | Tubular | 562 | 42 (7.47) | 1.00 | 0.1129 | 3 (0.53) | 1.00 | 0.7065 |
Tubulovillous | 740 | 63 (8.51) | 1.15 (0.77 to 1.73) | 3 (0.41) | 0.76 (0.15 to 3.77) | |||
Villous | 175 | 21 (12) | 1.69 (0.97 to 2.94) | 1 (0.57) | 1.07 (0.11 to 10.36) | |||
Unknown | 158 | 20 (12.66) | 1.79 (1.02 to 3.16) | 2 (1.27) | 2.39 (0.40 to 14.42) | |||
Worst dysplasia | Low grade | 1152 | 102 (8.85) | 1.00 | 0.0907 | 7 (0.61) | 1.00 | 0.4685 |
High grade | 305 | 21 (6.89) | 0.76 (0.47 to 1.24) | 0 (0) | n/a | |||
Unknown | 178 | 23 (12.92) | 1.53 (0.94 to 2.48) | 2 (1.12) | 1.86 (0.38 to 9.02) | |||
Distal | No | 339 | 27 (7.96) | 1.00 | 0.4785 | 2 (0.59) | 1.00 | 0.9128 |
Yes | 1296 | 119 (9.18) | 1.17 (0.76 to 1.81) | 7 (0.54) | 0.92 (0.19 to 4.42) | |||
Proximal | No | 1227 | 108 (8.8) | 1.00 | 0.7545 | 7 (0.57) | 1.00 | 0.8475 |
Yes | 408 | 38 (9.31) | 1.06 (0.72 to 1.57) | 2 (0.49) | 0.86 (0.18 to 4.15) | |||
Number of sightings of a unique adenoma | 1 | 1171 | 99 (8.45) | 1.00 | 0.7397 | 5 (0.43) | 1.00 | 0.1454 |
2 | 341 | 35 (10.26) | 1.24 (0.83 to 1.86) | 4 (1.17) | 2.77 (0.74 to 10.37) | |||
3 | 75 | 7 (9.33) | 1.11 (0.50 to 2.49) | 0 (0) | n/a | |||
4 | 30 | 4 (13.33) | 1.67 (0.57 to 4.87) | 0 (0) | n/a | |||
5+ | 18 | 1 (5.56) | 0.64 (0.08 to 4.84) | 0 (0) | n/a | |||
Polyp characteristics | ||||||||
Number of hyperplastic polyps | 0 | 1325 | 110 (8.3) | 1.00 | 0.3793 | 8 (0.6) | 1.00 | 0.8990 |
1 | 189 | 22 (11.64) | 1.46 (0.90 to 2.36) | 1 (0.53) | 0.88 (0.11 to 7.04) | |||
2 | 55 | 4 (7.27) | 0.87 (0.31 to 2.44) | 0 (0) | n/a | |||
3 | 24 | 3 (12.5) | 1.58 (0.46 to 5.37) | 0 (0) | n/a | |||
4 | 11 | 2 (18.18) | 2.45 (0.52 to 11.5) | 0 (0) | n/a | |||
5+ | 31 | 5 (16.13) | 2.12 (0.80 to 5.64) | 0 (0) | n/a | |||
Any large hyperplastic polyps | No | 1603 | 141 (8.8) | 1.00 | 0.2196 | 9 (0.56) | 1.00 | n/a |
Yes | 32 | 5 (15.63) | 1.92 (0.73 to 5.06) | 0 (0) | n/a | |||
Number of polyps with unknown histology | 0 | 1297 | 104 (8.02) | 1.00 | 0.1923 | 7 (0.54) | 1.00 | 0.4981 |
1 | 174 | 20 (11.49) | 1.49 (0.90 to 2.47) | 1 (0.57) | 1.07 (0.13 to 8.71) | |||
2 | 56 | 8 (14.29) | 1.91 (0.88 to 4.15) | 0 (0) | n/a | |||
3 | 41 | 5 (12.2) | 1.59 (0.61 to 4.15) | 1 (2.44) | 4.61 (0.55 to 38.34) | |||
4 | 25 | 2 (8) | 1.00 (0.23 to 4.29) | 0 (0) | n/a | |||
5+ | 42 | 7 (16.67) | 2.29 (0.99 to 5.29) | 0 (0) | n/a | |||
Distal polyps | No | 268 | 20 (7.46) | 1.00 | 0.3468 | 2 (0.75) | 1.00 | 0.6495 |
Yes | 1367 | 126 (9.22) | 1.26 (0.77 to 2.06) | 7 (0.51) | 0.68 (0.14 to 3.31) | |||
Proximal polyps | No | 1087 | 92 (8.46) | 1.00 | 0.3555 | 5 (0.46) | 1.00 | 0.4955 |
Yes | 548 | 54 (9.85) | 1.18 (0.83 to 1.68) | 4 (0.73) | 1.59 (0.43 to 5.95) |
Follow-up visit 1 risk factors and interval
The association between FUV1 risk factors and interval between FUV1 and FUV2 was examined to identify potential confounders of the association between interval and new AA or CRC at FUV2 (see Appendix 9 for tables of results). Most FUV1 characteristics were significantly associated with interval at the 1% level. A greater proportion of patients of an older age – or with a FS, poor bowel preparation, a difficult examination at FUV1 or a long visit comprising multiple examinations – had a shorter interval. Additionally, a greater proportion of patients with multiple adenomas, multiple sightings of a single adenoma, detection of an adenoma of a larger size or with villous histology or severe dysplasia had a shorter interval.
Effect of interval on new findings at second follow-up
The effect of interval to second follow-up on new findings at FUV2 was examined using univariable and multivariable analyses. As so few CRCs were found at FUV2, new AA and CRC were combined and new AN was treated as the outcome measure instead.
Table 40 shows the association between interval from FUV1 to FUV2 and new AN at the second follow-up. In the crude analysis, there was a tendency towards increasing odds of new AN with increasing interval to FUV2; however, the relationship was not statistically significant (p = 0.2313) and most 95% CIs included 1. When interval was modelled as a continuous variable, there was a borderline significant 11% increased odds for every year increase in interval (OR 1.11, 95% CI 1.00 to 1.24; p = 0.0501).
Baseline and FUV1 risk factors | Category | Number of patients (N = 1635) | Univariable analysis: new AN | Multivariable analyses: new AN | |||||
---|---|---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 1635) | Model 2 – interval as continuous (n = 1635) | ||||||||
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | |||
Interval FUV1 to FUV2 | < 18 months | 397 | 29 (7.30) | 1 | 0.2313 | 1 | 0.0164 | n/a | |
2 yearsa | 376 | 35 (9.31) | 1.30 (0.78 to 2.18) | 1.62 (0.93 to 2.81) | |||||
3 yearsa | 518 | 52 (10.04) | 1.42 (0.88 to 2.28) | 2.02 (1.19 to 3.42) | |||||
4 yearsa | 152 | 15 (9.87) | 1.39 (0.72 to 2.67) | 2.45 (1.20 to 5.00) | |||||
5 yearsa | 131 | 11 (8.40) | 1.16 (0.56 to 2.40) | 2.01 (0.91 to 4.44) | |||||
6 yearsa | 31 | 4 (12.90) | 1.88 (0.62 to 5.74) | 2.76 (0.84 to 9.12) | |||||
≥ 6.5 years | 30 | 7 (23.33) | 3.86 (1.53 to 9.76) | 5.95 (2.15 to 16.46) | |||||
Per year increase | 1.11 (1 to 1.24) | 0.0501 | n/a | 1.22 (1.09 to 1.36) | 0.0010 | ||||
Most complete baseline colonoscopy | Complete | 944 | 78 (8.26) | 1 | 0.0065 | 1 | 0.0099 | 1 | 0.0082 |
Unknown completeness | 530 | 48 (9.06) | 1.11 (0.76 to 1.61) | 1.06 (0.64 to 1.73) | 1.05 (0.64 to 1.71) | ||||
Incomplete | 161 | 27 (16.77) | 2.24 (1.39 to 3.59) | 2.33 (1.37 to 3.97) | 2.36 (1.39 to 4.00) | ||||
Largest adenoma at FUV1 (mm) | No adenomas | 1010 | 79 (7.82) | 1 | 0.0028 | 1 | 0.0112 | 1 | 0.0159 |
< 20 | 526 | 54 (10.27) | 1.35 (0.94 to 1.94) | 0.98 (0.54 to 1.76) | 0.93 (0.52 to 1.67) | ||||
≥ 20 | 75 | 15 (20.00) | 2.95 (1.60 to 5.43) | 3.20 (1.42 to 7.21) | 2.93 (1.32 to 6.51) | ||||
Unknown | 24 | 5 (20.83) | 3.10 (1.13 to 8.53) | 2.31 (0.71 to 7.50) | 2.14 (0.67 to 6.91) | ||||
Proximal polyp at FUV1 | No polyps | 667 | 47 (7.05) | 1 | 0.0005 | 1 | 0.0286 | 1 | 0.0293 |
No | 499 | 41 (8.22) | 1.18 (0.76 to 1.83) | 0.75 (0.41 to 1.36) | 0.76 (0.42 to 1.39) | ||||
Yes | 469 | 65 (13.86) | 2.12 (1.43 to 3.15) | 1.39 (0.72 to 2.67) | 1.42 (0.74 to 2.71) | ||||
Number of polyps with unknown histology at FUV1 | 0 | 1330 | 110 (8.27) | 1 | 0.0001 | 1 | 0.001 | 1 | 0.0012 |
1–4 | 282 | 34 (12.06) | 1.52 (1.01 to 2.29) | 1.39 (0.83 to 2.33) | 1.34 (0.80 to 2.23) | ||||
5+ | 23 | 9 (39.13) | 7.13 (3.02 to 16.85) | 7.16 (2.69 to 19.06) | 6.94 (2.63 to 18.26) | ||||
Number of adenomas at baseline and FUV1 | 1 | 844 | 58 (6.87) | 1 | 0.0003 | 1 | 0.0183 | 1 | 0.0164 |
2+ | 791 | 95 (12.01) | 1.85 (1.31 to 2.60) | 1.71 (1.10 to 2.66) | 1.72 (1.11 to 2.67) | ||||
Number of polyps with unknown histology at baseline | 0 | 1297 | 109 (8.40) | 1 | 0.0124 | 1 | 0.0256 | 1 | 0.0286 |
1+ | 338 | 44 (13.02) | 1.63 (1.12 to 2.37) | 1.65 (1.07 to 2.55) | 1.63 (1.06 to 2.51) |
Logistic regression was used to identify independent risk factors for having new AN at FUV2, and to adjust the effect of interval for covariates. Interval was modelled as a categorical variable (model 1) and as a continuous variable (model 2). Appendix 9 contains details of the models fitted. Baseline and FUV1 risk factors were adjusted for in turn (models A and B) and in combination (model C), for interval as a categorical and continuous variable. The cumulative effect of factors across baseline and FUV1 were also adjusted for (model D), as well as a combination of individual and cumulative baseline and FUV1 factors (model E), with interval as a categorical and continuous variable. When the fits of models A–E were compared, with interval as categorical or continuous, model E was found to be the best in terms of its fit to the data. Measures of fit used to assess the models were the AIC and the Bayesian information criterion (BIC) (see Appendix 9 for additional results from other models and measures of fit).
After adjusting for covariates (model E, with interval as categorical and continuous), the effect of interval was strengthened and the association with new AN at FUV2 became statistically significant, with evidence of negative confounding. When interval was modelled as a categorical variable (model 1), there was an increased odds of new AN with increasing interval length (p = 0.0164) and a 22% increased odds per year increase in interval was seen when interval was modelled as a continuous variable (OR 1.22, 95% CI 1.09 to 1.36; p = 0.001); although some effect estimates were imprecise, the small p-values, large effect sizes and tendency towards a dose–response relationship provided strong evidence of an association.
Other risk factors for AN at FUV2 included the detection of a ≥ 20 mm adenoma, proximal polyp or multiple polyps with unknown histology at FUV1, or an incomplete colonoscopy or one or more polyps with unknown histology at baseline. The detection of two or more adenomas across baseline and FUV1 (cumulative) was also associated with an increased odds of AN at FUV2 (p < 0.02). Models 1 and 2 were very similar, with the same risk factors identified in each.
Effect modification of the association between interval and new findings at follow-up visit 2
A priori, we had proposed that there might be a difference in the effect of interval length on findings by age or gender at FUV2 and, to investigate this, we fitted an interaction between continuous interval and age group or gender for the outcome of new AN at FUV2. Results are presented in Figure 6; there were no significant differences between age groups or between males and females in the effect of increasing interval length.
Long-term cancer risk
A survival analysis was used to assess the incidence of CRC after both baseline (see Colorectal cancer risk after baseline, below) and FUV1 (see Colorectal cancer risk after the first follow-up visit, below) to determine the combined effects on future CRC risk of surveillance visits and baseline findings for the former and surveillance visits and both baseline and first follow-up findings for the latter.
The entire IA cohort comprised 11,944 patients for the analysis of CRC incidence after baseline and 4517 patients with at least one follow-up – who remained free of CRC at FUV1 – for the analysis of CRC incidence after FUV1.
The cohort was analysed using all observation time after baseline to assess whether or not surveillance had a protective effect against CRC. If CRC was diagnosed at a follow-up visit, that follow-up visit was not counted, as it could not have offered any protection against CRC.
Colorectal cancer risk after baseline
Overall, 168 CRCs developed during 81,442 pys of observation time after baseline (median 6.0 years, IQR 3.8–9.2 years), giving an incidence rate of 206 (95% CI 177 to 240) per 100,000 pys at risk.
Univariable analysis
The relationship between patient, procedural and polyp characteristics and long-term CRC incidence was first investigated by determining incidence rates of CRC after baseline and crude HRs.
Table 41 shows CRC incidence stratified by baseline demographic and procedural characteristics. Older age was a strong predictor of CRC (p < 0.0001), with a more than fourfold increased rate among those aged 75–80 years (HR 4.79, 95% CI 2.71 to 8.84). Patients whose best baseline colonoscopy was incomplete were at an almost threefold increased risk (HR 2.78, 95% CI 1.94 to 3.98), and those with only poor preparation at baseline had a more than twofold increased risk of CRC (HR 2.40, 95% CI 1.32 to 4.39), although the overall effect of bowel preparation was not significant (p = 0.0597). Similarly, patients with a difficult examination had twice the rate of CRC (HR 2.06, 95% CI 1.25 to 3.41). No association was found between CRC and gender, family history of cancer, year of baseline, length of baseline, number of examinations in the baseline visit or hospital attended (results for hospital not presented).
Baseline factors | Number of patients (N = 11,944) | IR patients with long-term follow-up | |||||
---|---|---|---|---|---|---|---|
CRC(s) | |||||||
pys | n = 168 | Rate (per 100,000 pys) | Unadjusted HR (95% CI) | p-value (LRT) | |||
Age (years) at baseline | < 55 | 2122 | 17,900.05 | 19 | 106.14 | 1 | < 0.0001 |
≥ 55 and < 60 | 1321 | 10,475.16 | 10 | 95.46 | 0.95 (0.44 to 2.04) | ||
≥ 60 and < 65 | 1858 | 13,308.84 | 20 | 150.28 | 1.53 (0.81 to 2.87) | ||
≥ 65 and < 70 | 2171 | 14,190.37 | 39 | 274.83 | 2.95 (1.70 to 5.14) | ||
≥ 70 and < 75 | 1786 | 11,579.17 | 27 | 233.18 | 2.54 (1.40 to 4.59) | ||
≥ 75 and < 80 | 1416 | 8108.39 | 34 | 419.32 | 4.79 (2.71 to 8.48) | ||
≥ 80 | 1270 | 5879.72 | 19 | 323.14 | 4.00 (2.09 to 7.66) | ||
Gender | Male | 6625 | 44,061.76 | 95 | 215.61 | 1 | 0.4955 |
Female | 5319 | 37,379.96 | 73 | 195.29 | 0.90 (0.66 to 1.22) | ||
Family history of cancer | No | 11,445 | 77,544.37 | 160 | 206.33 | 1 | 0.9365 |
Yes | 499 | 3897.34 | 8 | 205.27 | 0.97 (0.48 to 1.98) | ||
Calendar year of baseline | 1985–94 | 439 | 6400.35 | 22 | 343.73 | 1 | 0.2389 |
1995–9 | 1430 | 15,648.51 | 43 | 274.79 | 0.85 (0.49 to 1.48) | ||
2000–4 | 4251 | 33,510.37 | 64 | 190.99 | 0.66 (0.38 to 1.15) | ||
2005–10 | 5824 | 25,882.49 | 39 | 150.68 | 0.57 (0.31 to 1.04) | ||
Length of baseline visit | 1 day | 6836 | 46,087.39 | 83 | 180.09 | 1 | 0.5751 |
2–30 days | 734 | 4481.26 | 11 | 245.47 | 1.41 (0.75 to 2.64) | ||
1–3 months | 1643 | 11,217.87 | 26 | 231.77 | 1.32 (0.85 to 2.05) | ||
3–6 months | 1382 | 9815.99 | 24 | 244.50 | 1.37 (0.87 to 2.16) | ||
6–12 months | 1177 | 8560.31 | 21 | 245.32 | 1.35 (0.84 to 2.19) | ||
≥ 12 months | 172 | 1278.89 | 3 | 234.58 | 1.28 (0.41 to 4.06) | ||
Number of examinations in baseline visit | 1 | 6826 | 45,984.04 | 83 | 180.50 | 1 | 0.1909 |
2 | 3788 | 26,357.29 | 64 | 242.82 | 1.36 (0.98 to 1.88) | ||
3 | 908 | 6200.41 | 12 | 193.54 | 1.09 (0.60 to 2.00) | ||
4+ | 422 | 2899.97 | 9 | 310.35 | 1.74 (0.87 to 3.46) | ||
Completeness of colonoscopy | Complete | 9016 | 56,749.44 | 95 | 167.40 | 1 | < 0.0001 |
Unknown | 1601 | 15,605.39 | 29 | 185.83 | 0.97 (0.63 to 1.48) | ||
Incomplete | 1327 | 9086.89 | 44 | 484.21 | 2.78 (1.94 to 3.98) | ||
Best bowel preparation at colonoscopy | Excellent/good | 3956 | 26,442.16 | 44 | 166.40 | 1 | 0.0597 |
Satisfactory | 1922 | 10,317.55 | 22 | 213.23 | 1.39 (0.83 to 2.33) | ||
Poor | 671 | 3660.88 | 14 | 382.42 | 2.40 (1.32 to 4.39) | ||
Unknown | 5395 | 41,021.13 | 88 | 214.52 | 1.22 (0.85 to 1.75) | ||
Difficult examination | No | 11,229 | 77,084.73 | 151 | 195.89 | 1 | 0.0101 |
Yes | 715 | 4356.99 | 17 | 390.18 | 2.06 (1.25 to 3.41) |
Table 42 shows CRC incidence stratified by adenoma or polyp characteristics at baseline. Detection of an adenoma with HGD (HR 1.76, 95% CI 1.23–2.53) or a proximally located polyp (HR 1.53, 95% CI 1.13 to 2.08; p = 0.0066) or adenoma (HR 1.55, 95% CI 1.13 to 2.12; p = 0.0082) were significant predictors. Tubulovillous or villous histology and unknown histology were significantly associated with increased CRC risk. There was weak evidence that a large adenoma increased risk of CRC with a tendency towards increasing risk of CRC with increasing size.
Baseline factors | Number of patients (N = 11,944) | IR patients with long-term follow-up | |||||
---|---|---|---|---|---|---|---|
CRC(s) | |||||||
pys | n = 168 | Rate (per 100,000 pys) | Unadjusted HR (95% CI) | p-value (LRT) | |||
Adenoma characteristics | |||||||
Number | 1 | 7842 | 54,992.05 | 115 | 209.12 | 1 | 0.0816 |
2 | 3073 | 19,841.53 | 47 | 236.88 | 1.18 (0.84 to 1.65) | ||
3 | 748 | 4701.77 | 5 | 106.34 | 0.53 (0.22 to 1.30) | ||
4 | 281 | 1906.36 | 1 | 52.46 | 0.26 (0.04 to 1.85) | ||
Largest size (mm) | < 10 | 1029 | 6608.13 | 6 | 90.80 | 1 | 0.0760 |
10–14 | 4417 | 29,913.84 | 61 | 203.92 | 2.19 (0.95 to 5.07) | ||
15–19 | 2440 | 16,965.60 | 33 | 194.51 | 2.07 (0.87 to 4.94) | ||
≥ 20 | 4058 | 27,954.14 | 68 | 243.26 | 2.60 (1.13 to 6.00) | ||
Worst histology | Tubular | 4742 | 32,214.79 | 48 | 149.00 | 1 | 0.0098 |
Tubulovillous | 5576 | 37,064.54 | 83 | 223.93 | 1.51 (1.06 to 2.16) | ||
Villous | 1142 | 7611.79 | 18 | 236.48 | 1.59 (0.92 to 2.73) | ||
Unknown | 484 | 4550.59 | 19 | 417.53 | 2.44 (1.41 to 4.21) | ||
Worst dysplasia | Low grade | 9476 | 63,137.92 | 111 | 175.81 | 1 | 0.0077 |
High grade | 1994 | 12,964.32 | 40 | 308.54 | 1.76 (1.23 to 2.53) | ||
Unknown | 474 | 5339.47 | 17 | 318.38 | 1.51 (0.89 to 2.57) | ||
Distal | No | 2448 | 16,295.31 | 37 | 227.06 | 1 | 0.4977 |
Yes | 9496 | 65,146.41 | 131 | 201.09 | 0.88 (0.61 to 1.27) | ||
Proximal | No | 8294 | 58,758.82 | 107 | 182.10 | 1 | 0.0082 |
Yes | 3650 | 22,682.89 | 61 | 268.93 | 1.55 (1.13 to 2.12) | ||
Number of sightings of a unique adenoma | 1 | 8807 | 60,393.43 | 118 | 195.39 | 1 | 0.4385 |
2 | 2548 | 16,965.28 | 38 | 223.99 | 1.17 (0.81 to 1.69) | ||
3 | 390 | 2696.76 | 6 | 222.49 | 1.15 (0.51 to 2.61) | ||
4 | 108 | 764.42 | 4 | 523.27 | 2.70 (1.00 to 7.31) | ||
5+ | 91 | 621.82 | 2 | 321.64 | 1.64 (0.41 to 6.65) | ||
Polyp characteristics | |||||||
Number of hyperplastic polyps | 0 | 9874 | 67,518.89 | 143 | 211.79 | 1 | 0.7086 |
1 | 1307 | 8862.23 | 18 | 203.11 | 0.98 (0.60 to 1.60) | ||
2 | 405 | 2656.32 | 3 | 112.94 | 0.55 (0.18 to 1.72) | ||
3 | 152 | 1002.09 | 1 | 99.79 | 0.49 (0.07 to 3.53) | ||
4 | 76 | 520.54 | 2 | 384.21 | 1.83 (0.45 to 7.40) | ||
5+ | 130 | 881.65 | 1 | 113.42 | 0.55 (0.08 to 3.92) | ||
Any large hyperplastic polyps? | No | 11,761 | 80,155.72 | 166 | 207.10 | 1 | 0.6897 |
Yes | 183 | 1286.00 | 2 | 155.52 | 0.76 (0.19 to 3.07) | ||
Number of polyps with unknown histology | 0 | 9322 | 64,395.55 | 135 | 209.64 | 1 | 0.6239 |
1 | 1510 | 9781.43 | 15 | 153.35 | 0.75 (0.44 to 1.27) | ||
2 | 517 | 3266.29 | 8 | 244.93 | 1.21 (0.59 to 2.46) | ||
3 | 249 | 1650.11 | 5 | 303.01 | 1.45 (0.59 to 3.55) | ||
4 | 129 | 849.83 | 3 | 353.01 | 1.68 (0.53 to 5.26) | ||
5+ | 217 | 1498.49 | 2 | 133.47 | 0.62 (0.15 to 2.50) | ||
Distal polyp | No | 1980 | 13,188.16 | 32 | 242.64 | 1 | 0.3053 |
Yes | 9964 | 68,253.55 | 136 | 199.26 | 0.81 (0.55 to 1.20) | ||
Proximal polyp | No | 7369 | 52,583.80 | 93 | 176.86 | 1 | 0.0066 |
Yes | 4575 | 28,857.92 | 75 | 259.89 | 1.53 (1.13 to 2.08) |
The unadjusted effect of surveillance on CRC incidence after baseline is presented in Table 43. Surveillance was found to have a significant protective effect on future CRC risk, with a 46% reduction in risk with one follow-up visit (HR 0.54, 95% CI 0.37 to 0.80) and a 61% reduction with two or more visits (HR 0.39, 95% CI 0.22 to 0.66), both in comparison with no follow-up visits.
Number of follow-up visits after baselinea | Number of patients (N = 11,944) | IR patients with long-term follow-up | ||||
---|---|---|---|---|---|---|
CRC(s) | ||||||
pys | n = 168 | Rate (per 100,000 pys) | Unadjusted HR (95% CI) | p-value (LRT) | ||
0 | 7427 | 48,891.70 | 108 | 220.90 | 1 | 0.0002 |
1 | 2901 | 21,030.19 | 38 | 180.69 | 0.54 (0.37 to 0.80) | |
2+ | 1616 | 11,519.83 | 22 | 190.97 | 0.39 (0.22 to 0.66) |
Multivariable analysis
Cox proportional hazards regression modelling was used to examine the effect of surveillance on CRC risk, controlling for the confounding effects of baseline factors.
Table 44 presents the results of the Cox regression using the full cohort and all available follow-up time from the baseline visit. The model provided strong evidence of the beneficial effect of surveillance (p = 0.0001), with a significant 49% lower CRC incidence with one follow-up visit compared with no surveillance (HR 0.51, 95% CI 0.34 to 0.77). Having more than one surveillance examination offered additional protection against CRC, with a 68% lower incidence after attendance at two or more follow-ups (HR 0.32, 95% CI 0.17 to 0.61). As there was only a further 19% reduction in incidence associated with two or more follow-ups, much of the protective effect appeared to be contributed by the initial follow-up examination.
Baseline risk factor | Category | Adjusted HR (95% CI) | p-value (LRT) |
---|---|---|---|
Number of follow-up visits after baselineb | 0 | 1 | 0.0001 |
1 | 0.51 (0.34 to 0.77) | ||
2+ | 0.32 (0.17 to 0.61) | ||
Largest adenoma (mm) | < 10 | 1 | 0.0177 |
10–19 | 2.93 (1.18 to 7.31) | ||
≥ 20 | 3.16 (1.24 to 8.02) | ||
Worst adenoma dysplasia | Low grade | 1 | 0.0107 |
High grade | 1.66 (1.14 to 2.41) | ||
Completeness of colonoscopy | Complete | 1 | 0.0002 |
Incomplete/unknown | 1.92 (1.37 to 2.69) | ||
Proximal polyps | No | 1 | 0.0002 |
Yes | 1.91 (1.37 to 2.68) | ||
Age (years) at baseline | < 55 | 1 | < 0.0001 |
≥ 55 and < 60 | 0.96 (0.42 to 2.17) | ||
≥ 60 and < 65 | 1.42 (0.72 to 2.82) | ||
≥ 65 and < 70 | 2.50 (1.37 to 4.58) | ||
≥ 70 and < 75 | 2.47 (1.32 to 4.63) | ||
≥ 75 and < 80 | 3.92 (2.13 to 7.22) | ||
≥ 80 | 3.23 (1.64 to 6.38) |
An increased rate of CRC was independently associated with older age, as well as with having an incomplete colonoscopy or proximal polyps at baseline; both of the latter were estimated to confer an almost twofold increase in risk (see Table 44; p < 0.0001). HGD and large adenoma size were also independently predictive.
Colorectal cancer risk after the first follow-up visit
To assess the effect of additional surveillance on CRC risk after FUV1 accounting for findings at both baseline and FUV1, an analysis was performed using 4517 patients who had at least one follow-up visit and were free of CRC at their first follow-up. In these patients, 60 CRCs were diagnosed during 32,550 pys of follow-up time (184 per 100,000 pys); 38 CRCs were diagnosed after the occurrence of just one follow-up and 22 after two or more.
Univariable analysis
We first examined the effect of factors found at FUV1 on CRC risk after FUV1 and then examined whether or not any baseline factors could have affected risk.
Effect of follow-up visit 1 factors on future risk of colorectal cancer
Colorectal cancer incidence after FUV1 was stratified by demographic and procedural characteristics at FUV1 (Table 45). Older age was strongly associated with increased CRC risk (p = 0.0067), as was having a difficult examination (HR 3.98, 95% CI 2.02 to 7.88). There was some evidence of an association with number of examinations at FUV1 (p = 0.0197), although the effect estimates were imprecise. No other FUV1 risk factors were significant.
First follow-up factors | Number of patients (N = 4517) | IR patients with long-term follow-up after first follow-up | |||||
---|---|---|---|---|---|---|---|
CRC(s) after first follow-up | |||||||
pys | n = 60 | Rate (per 100,000 pys) | Unadjusted HR (95% CI) | p-value (LRT) | |||
Age (years) at first follow-up | < 55 | 722 | 6221.02 | 6 | 96.45 | 1 | 0.0067 |
≥ 55 and < 60 | 577 | 4714.19 | 6 | 127.28 | 1.39 (0.45 to 4.30) | ||
≥ 60 and < 65 | 720 | 5486.11 | 5 | 91.14 | 1.04 (0.32 to 3.43) | ||
≥ 65 and < 70 | 773 | 5746.65 | 14 | 243.62 | 2.93 (1.12 to 7.70) | ||
≥ 70 and < 75 | 805 | 5312.84 | 15 | 282.33 | 3.66 (1.40 to 9.59) | ||
≥ 75 and < 80 | 530 | 3193.95 | 8 | 250.47 | 3.45 (1.17 to 10.14) | ||
≥ 80 | 390 | 1875.26 | 6 | 319.96 | 4.97 (1.56 to 15.84) | ||
Gender | Male | 2493 | 17,845.15 | 32 | 179.32 | 1 | 0.7762 |
Female | 2024 | 14,704.87 | 28 | 190.41 | 1.08 (0.65 to 1.79) | ||
Family history of cancer | No | 4222 | 30,273.11 | 56 | 184.98 | 1 | 0.9207 |
Yes | 295 | 2276.91 | 4 | 175.68 | 0.95 (0.34 to 2.62) | ||
Year of first follow-up | 1985–94 | 159 | 2461 | 9 | 365.7 | 1 | 0.7501 |
1995–9 | 544 | 6206.05 | 13 | 209.47 | 0.69 (0.26 to 1.79) | ||
2000–4 | 1601 | 13,311.8 | 21 | 157.75 | 0.63 (0.25 to 1.64) | ||
2005–9 | 2213 | 10,571.17 | 17 | 160.81 | 0.83 (0.29 to 2.32) | ||
Length of visit | 1 day | 4041 | 29,240.39 | 51 | 174.42 | 1 | 0.0512 |
2–30 days | 54 | 314.53 | 0 | 0 | n/a | ||
1–3 months | 99 | 669.45 | 0 | 0 | n/a | ||
3–6 months | 137 | 943.26 | 5 | 530.08 | 3.21 (1.28 to 8.05) | ||
6–12 months | 152 | 1133.83 | 4 | 352.79 | 2.05 (0.74 to 5.68) | ||
≥ 12 months | 34 | 248.55 | 0 | 0 | n/a | ||
Number of examinations in visit | 1 | 4033 | 29,184.02 | 51 | 174.75 | 1 | 0.0197 |
2 | 355 | 2477.71 | 5 | 201.8 | 1.16 (0.46 to 2.92) | ||
3 | 87 | 576.87 | 4 | 693.39 | 4.27 (1.54 to 11.83) | ||
4+ | 42 | 311.41 | 0 | 0 | n/a | ||
Most complete examination | Complete colonoscopy | 3258 | 22,886.03 | 36 | 157.3 | 1 | 0.1061 |
Colonoscopy of unknown completeness | 250 | 2266.09 | 3 | 132.39 | 0.75 (0.23 to 2.45) | ||
Incomplete colonoscopy | 390 | 3077.3 | 12 | 389.95 | 2.27 (1.18 to 4.38) | ||
Colonoscopy or FS | 181 | 1519.34 | 2 | 131.64 | 0.71 (0.17 to 2.98) | ||
FS | 317 | 1896.99 | 3 | 158.15 | 1.04 (0.32 to 3.39) | ||
Colonoscopy or flexible or rigid sigmoidoscopy | 100 | 774 | 2 | 258.4 | 1.49 (0.36 to 6.21) | ||
Surgery | 12 | 44.91 | 1 | 2226.86 | 17.27 (2.35 to 126.74) | ||
Unknown | 9 | 85.36 | 1 | 1171.46 | 6.26 (0.85 to 46.25) | ||
Best bowel preparation at colonoscopy | Excellent/good | 1274 | 9241.46 | 13 | 140.67 | 1 | 0.5331 |
Satisfactory | 617 | 3758.6 | 4 | 106.42 | 0.82 (0.27 to 2.53) | ||
Poor | 240 | 1519.47 | 2 | 131.62 | 0.98 (0.22 to 4.33) | ||
Unknown | 1767 | 13,709.89 | 32 | 233.41 | 1.57 (0.82 to 3.00) | ||
No known colonoscopy | 619 | 4320.61 | 9 | 208.3 | 1.40 (0.60 to 3.30) | ||
Difficult examination | No | 4285 | 30,947.19 | 50 | 161.57 | 1 | 0.0007 |
Yes | 232 | 1602.83 | 10 | 623.9 | 3.98 (2.02 to 7.88) |
Colorectal cancer incidence after FUV1 was also stratified by characteristics of adenomas and polyps detected at FUV1 (Table 46). The only feature that was significantly associated with increased CRC incidence was the detection of a proximal polyp at FUV1 (HR 1.90, 95% CI 1.08 to 3.35). The detection of an adenoma with tubulovillous (but not villous) histology was associated with a borderline significant increased risk of CRC after FUV1 (HR 2.83, 95% CI 1.40 to 5.76; overall p = 0.0693). Although non-significant, there was a tendency towards an increased risk of CRC after FUV1 with the detection of multiple adenomas, a large adenoma or an adenoma with HGD at FUV1. Imprecision of effect estimates precluded meaningful interpretation for most factors.
First follow-up factors | Number of patients (N = 4517) | IR patients with long-term follow-up after first follow-up | |||||
---|---|---|---|---|---|---|---|
CRC(s) after first follow-up | |||||||
pys | n = 60 | Rate (per 100,000 pys) | Unadjusted HR (95% CI) | p-value (LRT) | |||
Adenoma characteristics | |||||||
Number | 0 | 2940 | 21,922.01 | 35 | 159.66 | 1 | 0.5336 |
1 | 1082 | 7450.29 | 18 | 241.6 | 1.61 (0.91 to 2.86) | ||
2 | 315 | 2039.62 | 4 | 196.11 | 1.38 (0.49 to 3.89) | ||
3 | 106 | 681.63 | 1 | 146.71 | 0.97 (0.13 to 7.08) | ||
4 | 34 | 218.7 | 1 | 457.24 | 3.20 (0.44 to 23.44) | ||
5+ | 40 | 237.76 | 1 | 420.6 | 3.11 (0.42 to 22.80) | ||
Largest size (mm) | No adenomas | 2940 | 21,922.01 | 35 | 159.66 | 1 | 0.3690 |
< 10 | 1013 | 6804.36 | 13 | 191.05 | 1.29 (0.68 to 2.45) | ||
10–14 | 213 | 1461.97 | 5 | 342 | 2.32 (0.91 to 5.94) | ||
15–19 | 115 | 749.75 | 1 | 133.38 | 0.94 (0.13 to 6.87) | ||
≥ 20 | 182 | 1188.45 | 4 | 336.57 | 2.26 (0.80 to 6.37) | ||
Unknown | 54 | 423.48 | 2 | 472.28 | 2.65 (0.63 to 11.11) | ||
Worst histology | No adenomas | 2940 | 21,922.01 | 35 | 159.66 | 1 | 0.0693 |
Tubular | 946 | 6291.36 | 10 | 158.95 | 1.08 (0.53 to 2.19) | ||
Tubulovillous | 372 | 2452.04 | 10 | 407.82 | 2.83 (1.40 to 5.76) | ||
Villous | 122 | 839.62 | 1 | 119.1 | 0.78 (0.11 to 5.70) | ||
Unknown | 137 | 1044.98 | 4 | 382.78 | 2.39 (0.85 to 6.73) | ||
Worst dysplasia | No adenomas | 2940 | 21,922.01 | 35 | 159.66 | 1 | 0.3742 |
Low grade | 1353 | 8802.29 | 20 | 227.21 | 1.57 (0.90 to 2.74) | ||
High grade | 101 | 669.8 | 2 | 298.6 | 2.05 (0.49 to 8.56) | ||
Unknown | 123 | 1155.92 | 3 | 259.53 | 1.46 (0.45 to 4.77) | ||
Distal | No adenomas | 2940 | 21,922.01 | 35 | 159.66 | 1 | 0.0607 |
No | 718 | 4714.59 | 15 | 318.16 | 2.18 (1.18 to 4.00) | ||
Yes | 859 | 5913.42 | 10 | 169.11 | 1.13 (0.56 to 2.29) | ||
Proximal | No adenomas | 2940 | 21,922.01 | 35 | 159.66 | 1 | 0.1472 |
No | 718 | 5111.38 | 10 | 195.64 | 1.29 (0.64 to 2.61) | ||
Yes | 859 | 5516.63 | 15 | 271.91 | 1.88 (1.02 to 3.45) | ||
Number of sightings of a single adenoma | No adenomas | 2940 | 21,922.01 | 35 | 159.66 | 1 | 0.4693 |
1 | 1381 | 9312.05 | 21 | 225.51 | 1.52 (0.88 to 2.62) | ||
2 | 138 | 895.1 | 3 | 335.16 | 2.32 (0.71 to 7.58) | ||
3 | 35 | 253.76 | 1 | 394.08 | 2.55 (0.35 to 18.65) | ||
4 | 16 | 114.29 | 0 | 0 | n/a | ||
5+ | 7 | 52.82 | 0 | 0 | n/a | ||
Polyp characteristics | |||||||
Number of hyperplastic polyps | 0 | 3743 | 27,365.8 | 51 | 186.36 | 1 | 0.6502 |
1 | 496 | 3364.3 | 5 | 148.62 | 0.85 (0.34 to 2.15) | ||
2 | 160 | 1067.64 | 3 | 280.99 | 1.66 (0.52 to 5.35) | ||
3 | 58 | 363.43 | 0 | 0 | n/a | ||
4 | 24 | 185.01 | 0 | 0 | n/a | ||
5+ | 36 | 203.85 | 1 | 490.56 | 3.14 (0.43 to 22.86) | ||
Any large hyperplastic polyps? | No | 4477 | 32,294.35 | 60 | 185.79 | n/a | n/a |
Yes | 40 | 255.67 | 0 | 0 | |||
Number of polyps with unknown histology | 0 | 3742 | 27,017.94 | 47 | 173.96 | 1 | 0.8852 |
1 | 478 | 3395.46 | 9 | 265.06 | 1.48 (0.73 to 3.03) | ||
2 | 142 | 1081.72 | 2 | 184.89 | 1.00 (0.24 to 4.13) | ||
3 | 70 | 467.01 | 1 | 214.13 | 1.23 (0.17 to 8.93) | ||
4 | 31 | 200.61 | 0 | 0 | n/a | ||
5+ | 54 | 387.28 | 1 | 258.21 | 1.5 (0.21 to 10.85) | ||
Distal polyp | No polyps | 2000 | 15,179.5 | 24 | 158.11 | 1 | 0.1295 |
No | 848 | 5883.19 | 17 | 288.96 | 1.90 (1.02 to 3.55) | ||
Yes | 1669 | 11,487.33 | 19 | 165.4 | 1.11 (0.61 to 2.03) | ||
Proximal polyp | No polyps | 2000 | 15,179.5 | 24 | 158.11 | 1 | 0.0042 |
No | 1230 | 8816.78 | 12 | 136.1 | 0.90 (0.45 to 1.80) | ||
Yes | 1287 | 8553.74 | 24 | 280.58 | 1.90 (1.08 to 3.35) |
Effect of baseline factors on future risk of colorectal cancer
Few baseline factors were associated with CRC risk after FUV1 in univariable analyses (Table 47). There was a tendency towards an increasing risk of CRC with increasing interval between baseline and FUV1; however, the effect estimates were imprecise, with most 95% CIs crossing 1. Results also indicated an increased risk of CRC in patients with unknown bowel preparation quality, no complete colonoscopy or a difficult examination at baseline, but the associations were non-significant and most 95% CIs included 1.
Baseline factors | Number of patients (N = 4517) | IR patients with long-term follow-up after first follow-up | |||||
---|---|---|---|---|---|---|---|
CRC(s) after first follow-up | |||||||
pys | n = 60 | Rate (per 100,000 pys) | Unadjusted HR (95% CI) | p-value (LRT) | |||
Family history of cancer | No | 4281 | 30,696.05 | 56 | 182.43 | 1 | 0.7605 |
Yes | 236 | 1853.97 | 4 | 215.75 | 1.18 (0.43 to 3.24) | ||
Calendar year of baseline visit | 1985–94 | 330 | 4473.85 | 16 | 357.63 | 1 | 0.1929 |
1995–9 | 1004 | 9703.32 | 20 | 206.12 | 0.64 (0.31 to 1.32) | ||
2000–4 | 2279 | 14,814.79 | 22 | 148.5 | 0.54 (0.26 to 1.15) | ||
2005–10 | 904 | 3558.05 | 2 | 56.21 | 0.23 (0.05 to 1.10) | ||
Length of visit | 1 day | 2455 | 18,050.92 | 30 | 166.2 | 1.00 | 0.9122 |
2–30 days | 239 | 1667.52 | 4 | 239.88 | 1.47 (0.52 to 4.18) | ||
1–3 months | 651 | 4483.72 | 8 | 178.42 | 1.11 (0.51 to 2.43) | ||
3–6 months | 579 | 4163.45 | 9 | 216.17 | 1.33 (0.63 to 2.81) | ||
6–12 months | 497 | 3552.47 | 7 | 197.05 | 1.2 (0.53 to 2.73) | ||
≥ 12 months | 96 | 631.95 | 2 | 316.48 | 1.98 (0.47 to 8.3) | ||
Number of examinations in visit | 1 | 2448 | 17,977.1 | 30 | 166.88 | 1 | 0.3596 |
2 | 1487 | 10,618.22 | 22 | 207.19 | 1.25 (0.72 to 2.17) | ||
3 | 381 | 2589.04 | 3 | 115.87 | 0.73 (0.22 to 2.41) | ||
4+ | 201 | 1365.66 | 5 | 366.12 | 2.24 (0.87 to 5.79) | ||
Most complete colonoscopy | Complete | 2926 | 19,853.68 | 32 | 161.18 | 1 | 0.5691 |
Unknown completeness | 1140 | 9260.95 | 19 | 205.16 | 1.16 (0.66 to 2.06) | ||
Incomplete | 451 | 3435.39 | 9 | 261.98 | 1.50 (0.71 to 3.15) | ||
Best bowel preparation at colonoscopy | Excellent/good | 1371 | 9689.83 | 11 | 113.52 | 1 | 0.0560 |
Satisfactory | 480 | 2695.92 | 2 | 74.19 | 0.73 (0.16 to 3.30) | ||
Poor | 184 | 1165.3 | 1 | 85.81 | 0.78 (0.10 to 6.07) | ||
Unknown | 2482 | 18,998.96 | 46 | 242.12 | 2.04 (1.06 to 3.95) | ||
Difficult examination | No | 4308 | 31,127 | 55 | 176.70 | 1 | 0.1674 |
Yes | 209 | 1423.02 | 5 | 351.37 | 2.03 (0.81 to 5.09) | ||
Interval from baseline to first follow-up | < 18 months | 1727 | 13,330.97 | 23 | 172.53 | 1 | 0.0089 |
2 yearsa | 949 | 7105.08 | 11 | 154.82 | 0.93 (0.45 to 1.90) | ||
3 yearsa | 1051 | 6997.94 | 15 | 214.35 | 1.38 (0.72 to 2.66) | ||
4 yearsa | 345 | 2449.78 | 5 | 204.1 | 1.27 (0.48 to 3.36) | ||
5 yearsa | 213 | 1358.88 | 1 | 73.59 | 0.48 (0.07 to 3.59) | ||
6 yearsa | 118 | 660.13 | 5 | 757.43 | 5.15 (1.94 to 13.65) | ||
≥ 6.5 years | 114 | 647.25 | 0 | 0 | n/a |
Table 48 shows the CRC incidence after FUV1 stratified by characteristics of adenomas and polyps detected at baseline. The only factor which reached statistical significance was the number of sightings of a unique adenoma (p = 0.0494); risk tended to increase with increased viewings but interpretation was difficult because of a lack of precision. Although no other baseline adenoma or polyp risk factors reached statistical significance and ORs were imprecise, there was a tendency towards an increased risk of CRC after FUV1 with the detection of a large (≥ 10 mm) adenoma, an adenoma with tubulovillous or villous histology, a proximal adenoma or polyp, or multiple polyps with unknown histology at baseline.
Baseline factors | Number of patients (N = 4517) | IR patients with long-term follow-up after first follow-up | |||||
---|---|---|---|---|---|---|---|
CRC(s) after first follow-up | |||||||
pys | n = 60 | Rate (per 100,000 pys) | Unadjusted HR (95% CI) | p-value (LRT) | |||
Adenoma characteristics | |||||||
Number | 1 | 3040 | 22,527.05 | 45 | 199.76 | 1 | 0.4002 |
2 | 1129 | 7717.5 | 14 | 181.41 | 0.95 (0.52 to 1.73) | ||
3 | 238 | 1494.08 | 0 | 0 | n/a | ||
4 | 110 | 811.39 | 1 | 123.24 | 0.63 (0.09 to 4.55) | ||
Largest size (mm) | < 10 | 2305.47 | 1 | 43.38 | 90.80 | 1 | 0.1103 |
10–14 | 11,142.76 | 17 | 152.57 | 203.92 | 3.41 (0.45 to 25.64) | ||
15–19 | 6862.11 | 12 | 174.87 | 194.51 | 3.81 (0.49 to 29.32) | ||
≥ 20 | 12,239.68 | 30 | 245.1 | 243.26 | 5.36 (0.73 to 39.36) | ||
Worst histology | Tubular | 1700 | 12,040.66 | 17 | 141.19 | 1 | 0.4971 |
Tubulovillous | 2096 | 14,676.49 | 27 | 183.97 | 1.31 (0.72 to 2.41) | ||
Villous | 440 | 3104.81 | 7 | 225.46 | 1.58 (0.65 to 3.80) | ||
Unknown | 281 | 2728.06 | 9 | 329.91 | 1.85 (0.79 to 4.30) | ||
Worst dysplasia | Low grade | 3372 | 23,754.64 | 39 | 164.18 | 1 | 0.1088 |
High grade | 822 | 5592.51 | 8 | 143.05 | 0.88 (0.41 to 1.89) | ||
Unknown | 323 | 3202.87 | 13 | 405.89 | 2.04 (1.05 to 3.94) | ||
Distal | No | 921 | 6512.4 | 11 | 168.91 | 1 | 0.7652 |
Yes | 3596 | 26,037.62 | 49 | 188.19 | 1.10 (0.57 to 2.12) | ||
Proximal | No | 3261 | 24,154.31 | 42 | 173.88 | 1.00 | 0.3847 |
Yes | 1256 | 8395.71 | 18 | 214.40 | 1.28 (0.74 to 2.23) | ||
Number of sightings of a unique adenoma | 1 | 3256 | 23,786.29 | 40 | 168.16 | 1 | 0.0494 |
2 | 981 | 6797.85 | 16 | 235.37 | 1.43 (0.80 to 2.55) | ||
3 | 174 | 1255.55 | 0 | 0 | n/a | ||
4 | 61 | 399.74 | 2 | 500.33 | 3.19 (0.77 to 13.22) | ||
5+ | 45 | 310.58 | 2 | 643.95 | 3.70 (0.89 to 15.33) | ||
Polyp characteristics | |||||||
Number of hyperplastic polyps | 0 | 3669 | 26,896.79 | 55 | 204.49 | 1 | 0.4610 |
1 | 529 | 3547.39 | 3 | 84.57 | 0.44 (0.14 to 1.40) | ||
2 | 155 | 1039.77 | 0 | 0 | n/a | ||
3 | 64 | 429.76 | 1 | 232.69 | 1.22 (0.17 to 8.82) | ||
4 | 37 | 227.95 | 0 | 0 | n/a | ||
5+ | 63 | 408.36 | 1 | 244.88 | 1.30 (0.18 to 9.40) | ||
Any large hyperplastic polyps? | No | 4435 | 31,994.36 | 59 | 184.41 | 1 | 0.9814 |
Yes | 82 | 555.66 | 1 | 179.97 | 1.02 (0.14 to 7.40) | ||
Number of polyps with unknown histology | 0 | 3523 | 25,765.33 | 49 | 190.18 | 1 | 0.1700 |
1 | 544 | 3662.15 | 3 | 81.92 | 0.44 (0.14 to 1.41) | ||
2 | 184 | 1230.67 | 4 | 325.03 | 1.78 (0.64 to 4.94) | ||
3 | 106 | 755.28 | 2 | 264.8 | 1.40 (0.34 to 5.78) | ||
4 | 61 | 400.04 | 2 | 499.95 | 2.81 (0.68 to 11.59) | ||
5+ | 99 | 736.56 | 0 | 0 | n/a | ||
Distal polyp | No | 727 | 5149.74 | 10 | 194.18 | 1 | 0.8231 |
Yes | 3790 | 27,400.28 | 50 | 182.48 | 0.92 (0.47 to 1.82) | ||
Proximal polyp | No | 2886 | 21,527.35 | 37 | 171.87 | 1 | 0.3835 |
Yes | 1631 | 11,022.66 | 23 | 208.66 | 1.26 (0.75 to 2.13) |
Effect of surveillance on future risk of colorectal cancer
In a univariable analysis (Table 49), additional surveillance was estimated to reduce the risk of CRC after FUV1 by 46% compared with no additional surveillance (HR 0.54, 95% CI 0.30 to 1.00); the p-value from the LRT indicated significance (p = 0.0462), although the 95% CI included 1.
Number of follow-up visits after baseline (including first follow-up)a | Number of patients (N = 4517) | IR patients with long-term follow-up | ||||
---|---|---|---|---|---|---|
CRC(s) | ||||||
pys | n = 60 | Rate (per 100,000 pys) | Unadjusted HR (95% CI) | p-value (LRT) | ||
1 | 2901 | 21,030.19 | 38 | 180.69 | 1 | 0.0462 |
2+ | 1616 | 11,519.83 | 22 | 190.97 | 0.54 (0.30 to 1.00) |
Multivariable analysis
In order to build a final multivariable model for the analysis of risk following FUV1, separate models were first built considering baseline factors only, FUV1 factors only and cumulative factors only. The significant risk factors identified from these models were then considered together and a final model was chosen.
Table 50 presents the results of the selected final Cox regression model of CRC incidence after the first follow-up. There was a 41% reduction in CRC incidence with two or more follow-up visits compared with only one follow-up, although this was not shown to be statistically significant (HR 0.59, 95% CI 0.32 to 1.10; p = 0.0923).
Risk factor | Category | Adjusted HR (95% CI) | p-value (LRT) |
---|---|---|---|
Number of follow-up visits after baseline (including first follow-up)a | 1 | 1 | 0.0923 |
2+ | 0.59 (0.32 to 1.10) | ||
Age (years) at first follow-up | < 55 | 1 | 0.0151 |
55–59 | 1.35 (0.43 to 4.18) | ||
60–64 | 1.01 (0.31 to 3.32) | ||
65–69 | 2.77 (1.05 to 7.30) | ||
70–74 | 3.42 (1.29 to 9.03) | ||
75–79 | 2.98 (1.01 to 8.84) | ||
≥ 80 | 4.66 (1.45 to 15.01) | ||
Difficult examination at baseline or first follow-upb | No | 1 | 0.0066 |
Yes | 2.67 (1.4 to 5.08) | ||
Proximal polyps at first follow-up | No polyps | 1 | 0.0111 |
No | 0.97 (0.48 to 1.96) | ||
Yes | 2.28 (1.28 to 4.07) | ||
Largest adenoma at baseline, mm | < 10 | 1 | 0.0418 |
10–19 | 3.96 (0.54 to 29.28) | ||
≥ 20 | 5.89 (0.80 to 43.55) |
Independent risk factors for CRC after FUV1 included older age at FUV1, the presence of proximal polyps at FUV1 or a difficult examination at either baseline or FUV1. There was weak evidence that a large adenoma (≥ 10 mm) at baseline also increased CRC incidence after FUV1 [although the HRs suggested a large effect, they had extremely wide CIs that included 1 and the LRT statistic was only borderline significant (p = 0.0418)].
Effect modification of the association between surveillance and colorectal cancer incidence
As for findings at follow-up, we hypothesised that there might be differences in the effect of surveillance by age and gender. However, models with interactions terms included demonstrate no evidence of any significant difference in the effect of surveillance by age or gender (Table 51).
Characteristic | Number of follow-up visits after baselinea | New CRC after baseline | New CRC after FUV1 | |||
---|---|---|---|---|---|---|
HR (95% CI)b | p-valuec | HR (95% CI)b | p-valuec | |||
Age (years)d | < 60 | 0 | 1 | 0.2116 | n/a | 0.8669 |
1 | 0.22 (0.06 to 0.77) | 1 | ||||
2+ | 0.45 (0.17 to 1.20) | 0.75 (0.22 to 2.55) | ||||
≥ 60 and < 75 | 0 | 1 | n/a | |||
1 | 0.69 (0.41 to 1.15) | 1 | ||||
2+ | 0.25 (0.10 to 0.61) | 0.52 (0.24 to 1.12) | ||||
≥ 75 | 0 | 1 | n/a | |||
1 | 0.38 (0.16 to 0.89) | 1 | ||||
2+ | 0.36 (0.09 to 1.52) | 0.56 (0.15 to 2.04) | ||||
Gender | Male | 0 | 1 | 0.1353 | n/a | 0.1285 |
1 | 0.45 (0.26 to 0.79) | 1 | ||||
2+ | 0.18 (0.07 to 0.46) | 0.40 (0.18 to 0.91) | ||||
Female | 0 | 1 | n/a | |||
1 | 0.58 (0.32 to 1.05) | 1 | ||||
2+ | 0.56 (0.26 to 1.21) | 0.91 (0.41 to 2.05) |
Absolute risk of colorectal cancer in intermediate-risk patients
The absolute risk of CRC was assessed using cumulative incidence rates calculated using different subsets of the cohort and varying periods of observation time, as presented in Table 52 and illustrated in Figure 7. It should be noted that the results presented in this section for observation time partitioned by the occurrence of surveillance may be contaminated, as some patients could have had surveillance that we do not know about; this would artificially reduce the estimate of pre-surveillance risk and possibly underestimate the effect of surveillance. Sensitivity analyses that assess the impact of this potential misclassification can be found below (see Sensitivity analyses and internal validation).
Observation time | Number of patients | Cumulative incidence at | Number of observed CRCs | Number of expected CRCs | SIRa (95% CI) | ||
---|---|---|---|---|---|---|---|
3 years, % (95% CI) | 5 years, % (95% CI) | 10 years, % (95% CI) | |||||
Observation time free of surveillance in all IR patients (censored at first follow-up) | |||||||
Total | 11,944 | 0.5 (0.4 to 0.7) | 1.1 (0.9 to 1.4) | 2.9 (2.2 to 3.9) | 108 | 102 | 1.06 (0.87 to 1.28) |
All observation time in all IR patients | |||||||
Total | 11,944 | 0.5 (0.3 to 0.6) | 0.9 (0.7 to 1.1) | 2.1 (1.7 to 2.5) | 168 | 172 | 0.98 (0.84 to 1.14) |
Observation time free of further surveillance after FUV1 in IR patients with one or more follow-up visits (censored at second follow-up) | |||||||
Total | 4517 | 0.4 (0.2 to 0.6) | 0.8 (0.5 to 1.2) | 2.3 (1.5 to 3.7) | 38 | 44 | 0.87 (0.61 to 1.19) |
All observation time after FUV1 in IR patients with one follow-up visit | |||||||
Total | 4517 | 0.3 (0.2 to 0.5) | 0.7 (0.5 to 1.0) | 1.9 (1.4 to 2.6) | 60 | 70 | 0.86 (0.65 to 1.10) |
Cumulative incidence in the absence of surveillance was assessed by censoring the cohort at FUV1 (see Figure 7a). The cumulative incidence of CRC at 3, 5 and 10 years, respectively, was 0.5%, 1.1% and 2.9%, and CRC incidence was slightly, but non-significantly, higher than that of the general population.
Cumulative incidence in the whole cohort allowing for the effect of any surveillance was 0.5%, 0.9% and 2.1% at 3, 5 and 10 years, respectively (see Figure 7b). The cumulative incidence of CRC at 10 years was reduced to 2.1%, and CRC incidence was the same as the general population level; these results must be interpreted in the context of the fact that only 39% of the cohort were known to have had at least one surveillance visit.
Cumulative incidence after a single surveillance was assessed by focusing on the cohort of IR patients who attended follow-up and censoring at FUV2 to remove effects of additional surveillance (see Figure 7c). Compared with pre-surveillance risk, the cumulative incidence of CRC at 10 years was lowered to 2.3% after a single visit, and the CRC incidence was slightly lower than the general population, although not significantly so.
Finally, we assessed absolute risk of CRC after one or more surveillance visits using all observation time after the first follow-up in patients who attended one follow-up or more (see Figure 7d). Compared with the analysis that censored the cohort at the second follow-up, by including the effect of additional surveillance after the first follow-up, the cumulative incidence of CRC at 10 years was lowered to 1.9%, and CRC incidence remained slightly lower than in the general population.
Lower- and higher-intermediate-risk subgroups
Lower- and higher-intermediate-risk subgroups after baseline
To assess whether or not heterogeneity exists within the IR group, and to see if all IR patients benefit from surveillance after baseline, the cohort was divided into lower IR (LIR) and higher IR (HIR) subgroups. These subgroups were defined using risk factors for CRC identified in the Cox regression model of CRC risk after baseline (see Table 44); a HIR subgroup was defined to include patients with any of the following baseline characteristics: an adenoma of ≥ 20 mm or with HGD, proximal polyps, no complete colonoscopy or poor bowel preparation. All other patients were assigned to the LIR subgroup.
Older age was not used to define risk subgroups despite it being identified as a risk factor for CRC, as age has practical implications for surveillance, with risks of complications increasing with age, and surveillance often ceasing in patients aged ≥ 75 years. Bowel preparation quality was used despite it not being predictive in the Cox model as it was a risk factor for finding CRC at FUV1 (see Table 30) and has a crucial effect on examination quality. Although patients with an adenoma of ≥ 10 mm displayed similar risk to those with an adenoma of ≥ 20 mm, only the latter size was used to define a higher-risk subgroup, as almost all patients (91%) had a lesion of ≥ 10 mm; this led to their classification as IR and, thus, if ‘≥ 10 mm’ was used it would not be discriminant.
Based on this definition, the baseline subgroups comprised 2679 (22.4%) LIR and 9265 (77.6%) HIR patients (Table 53). In the HIR subgroup, attendance at one or more follow-up visits was associated with a significantly lower CRC risk than with no follow-up, with a 50% reduction for one follow-up (HR 0.50, 95% CI 0.34 to 0.76) and a 64% reduction for two or more (HR 0.36, 95% CI 0.20 to 0.62). Among patients in the LIR subgroup, the benefit of surveillance was less clear and, because of the small number of CRC end points, statistical significance was not reached and effect estimates were imprecise, although the attendance at one or more follow-up visits was associated with a non-significant reduction in CRC risk [38% reduction for one (HR 0.62, 95% CI 0.16 to 2.43); 71% reduction for two or more follow-ups (HR 0.29, 95% CI 0.03 to 2.82)].
IR subgroup | Number of patients | % | pys | Patients with CRC (n) | Rate/100,000 pys (95% CI) | Effect of surveillance | ||
---|---|---|---|---|---|---|---|---|
Number of follow-up visits after baselinea | Unadjusted HR (95% CI) | p-value (LRT) | ||||||
LIR | 2679 | 22.4 | 17,615 | 13 | 74 (43 to 127) | 0 | 1 | 0.47 |
1 | 0.62 (0.16 to 2.43) | |||||||
2+ | 0.29 (0.03 to 2.82) | |||||||
HIRb | 9265 | 77.6 | 63,827 | 155 | 243 (208 to 284) | 0 | 1 | 0.0001 |
1 | 0.50 (0.34 to 0.76) | |||||||
2+ | 0.36 (0.20 to 0.62) |
Table 54 shows the differences between patients in the HIR and LIR subgroups. Patients in the HIR subgroup had significantly more follow-up visits than those in the LIR subgroup; however, the median follow-up time was similar (6.1 years in the HIR subgroup vs. 5.7 years in the LIR subgroup). The HIR subgroup was also older and had their baseline visit earlier on average; despite reaching statistical significance, these differences were small.
Factor | Number of patients | LIR subgroup | HIR subgroupa | p-value (chi-squared test) | ||
---|---|---|---|---|---|---|
n | % | n | % | |||
Total | 11,944 | 2679 | 9265 | |||
Number of follow-up visits | < 0.001 | |||||
0 | 7427 | 1909 | 71.3 | 5518 | 59.6 | |
1 | 2880 | 515 | 19.2 | 2365 | 25.5 | |
2 | 1074 | 184 | 6.9 | 890 | 9.6 | |
3+ | 563 | 71 | 2.7 | 492 | 5.3 | |
Age (years) at first adenoma detection | < 0.001 | |||||
< 55 | 2122 | 572 | 21.4 | 1550 | 16.7 | |
≥ 55 and < 60 | 1321 | 311 | 11.6 | 1010 | 10.9 | |
≥ 60 and < 65 | 1858 | 439 | 16.4 | 1419 | 15.3 | |
≥ 65 and < 70 | 2171 | 473 | 17.7 | 1698 | 18.3 | |
≥ 70 and < 75 | 1786 | 403 | 15.0 | 1383 | 14.9 | |
≥ 75 and < 80 | 1416 | 260 | 9.7 | 1156 | 12.5 | |
≥ 80 | 1270 | 221 | 8.2 | 1049 | 11.3 | |
Year of baseline | < 0.001 | |||||
1985–94 | 439 | 71 | 2.7 | 368 | 4.0 | |
1995–9 | 1430 | 283 | 10.6 | 1147 | 12.4 | |
2000–4 | 4251 | 825 | 30.8 | 3426 | 37.0 | |
2005–10 | 5824 | 1500 | 56.0 | 4324 | 46.7 |
Lower- and higher-intermediate-risk subgroups after the first follow-up visit
The HIR and LIR subgroups were redefined after FUV1 incorporating findings at both baseline and FUV1. Specifically, the HIR subgroup was classified as patients with any of the following: an adenoma of ≥ 10 mm or with HGD, proximal polyps, no complete colonoscopy or poor bowel preparation at FUV1, or an adenoma of ≥ 20 mm at baseline.
The risk subgroups after FUV1 comprised 1246 (27.6%) LIR patients and 3271 (72.4%) HIR patients (Table 55). In the LIR subgroup, attendance for additional surveillance after FUV1 was associated with a non-significant increased risk compared with only one follow-up visit (HR 1.20, 95% CI 0.14 to 10.31; p = 0.87), although the effect estimate was very imprecise. By comparison, in the HIR subgroup attendance for additional surveillance conferred a significant 53% reduction in CRC risk (HR 0.47, 95% CI 0.25 to 0.87; p = 0.0155).
IR subgroup | N | % | pys | Patients with CRC (n) | Rate/100,000 pys (95% CI) | Effect of surveillance | ||
---|---|---|---|---|---|---|---|---|
Number of follow-up visits after baseline (including first follow-up)a | Unadjusted hazard ratio (95% CI) | p-value (LRT) | ||||||
LIR | 1246 | 27.6 | 9268 | 6 | 65 (29 to 144) | 1 | 1 | 0.87 |
2+ | 1.20 (0.14 to 10.31) | |||||||
HIRb | 3271 | 72.4 | 23,282 | 54 | 232 (118 to 303) | 1 | 1 | 0.0155 |
2+ | 0.47 (0.25 to 0.87) |
Absolute risk of colorectal cancer in lower- and higher-intermediate-risk subgroups
The absolute risk of CRC was assessed using cumulative incidence rates and SIRs, calculated using different subsets of the cohort and varying periods of observation time, as presented in Table 56. The pre-surveillance standardised CRC incidence in the LIR group was 60% below that of the general population, whereas the HIR group had a 26% higher incidence. This large difference in cancer risk between subgroups was also reflected in the 10-year cumulative incidence of CRC, which was 3.6% in the HIR subgroup compared with 1.0% in the LIR subgroup (Figure 8a).
Observation time | Number of patients | Cumulative incidence at | Number of observed CRCs | Number of expected CRCs | SIRa (95% CI) | ||
---|---|---|---|---|---|---|---|
3 years, % (95% CI) | 5 years, % (95% CI) | 10 years, % (95% CI) | |||||
Observation time free of surveillance in all IR patients (censored at first follow-up) | |||||||
Lower-risk subgroup | 2679 | 0.1 (0 to 0.4) | 0.4 (0.2 to 1.0) | 1.0 (0.4 to 2.4) | 9 | 23 | 0.39 (0.18 to 0.75) |
Higher-risk subgroupb | 9265 | 0.7 (0.5 to 0.9) | 1.3 (1.0 to 1.7) | 3.6 (2.6 to 4.8) | 99 | 79 | 1.26 (1.02 to 1.53) |
All observation time in all IR patients | |||||||
Lower-risk subgroup | 2679 | 0.1 (0 to 0.4) | 0.4 (0.2 to 0.8) | 0.6 (0.3 to 1.2) | 13 | 34 | 0.38 (0.20 to 0.65) |
Higher-risk subgroupb | 9265 | 0.6 (0.4 to 0.7) | 1.0 (0.8 to 1.3) | 2.4 (2.0 to 2.9) | 155 | 138 | 1.13 (0.96 to 1.32) |
Observation time free of further surveillance after FUV1 in IR patients with one or more follow-up visits (censored at second follow-up) | |||||||
Lower-risk subgroup | 1246 | 0.2 (0 to 0.7) | 0.3 (0.1 to 1.0) | 0.5 (0.2 to 1.4) | 4 | 13 | 0.32 (0.09 to 0.81) |
Higher-risk subgroupc | 3271 | 0.4 (0.2 to 0.8) | 1 (0.6 to 1.6) | 3.3 (2.0 to 5.2) | 34 | 31 | 1.09 (0.75 to 1.52) |
All observation time after FUV1 in IR patients with one or more follow-up visits | |||||||
Lower-risk subgroup | 1246 | 0.2 (0 to 0.7) | 0.4 (0.1 to 0.9) | 0.5 (0.2 to 1.1) | 6 | 19 | 0.32 (0.12 to 0.70) |
Higher-risk subgroupc | 3271 | 0.4 (0.2 to 0.7) | 0.8 (0.5 to 1.2) | 2.5 (1.8 to 3.5) | 54 | 51 | 1.05 (0.79 to 1.37) |
Including the effect of any surveillance, the CRC incidence in the LIR group remained around 60% lower than that of the general population, whereas the CRC incidence in the HIR subgroup was reduced to a level 13% higher than the general population. The 10-year cumulative incidence of CRC in the HIR subgroup was 2.4%, compared with 0.6% in the LIR subgroup (see Figure 8b).
We assessed the effect of just one follow-up visit on CRC risk by focusing on the cohort of IR patients who attended follow-up and censoring at FUV2 to remove effects of additional surveillance (see Figure 8c); risk groups were revised to incorporate findings at both baseline and FUV1. Compared with pre-surveillance risk, the 10-year cumulative incidence of CRC after a single surveillance visit was lower, at 0.5% and 3.3% in the LIR and HIR subgroups, respectively. In the low-risk subgroup, the standardised CRC incidence after one follow-up visit was slightly lower than the pre-surveillance level at 68% below the general population level. Similarly, for the HIR group, a single surveillance visit reduced the standardised, pre-surveillance CRC incidence closer to that of the general population.
When the effect of additional surveillance after FUV1 was included, the 10-year cumulative incidence of CRC in the HIR subgroup was 2.5% (see Figure 8d), compared with 3.3% when censoring at FUV2. The standardised CRC incidence in the LIR subgroup remained unchanged – it was significantly lower than the general population – whereas CRC incidence in the HIR group was further reduced to a level comparable with that of the general population.
Findings at follow-up examinations in lower- and higher-intermediate-risk subgroups
Lower- and higher-risk subgroups, derived from the Cox proportional hazards models for long-term CRC risk (see Tables 44 and 50), were applied to findings at FUV1 and FUV2 to determine if the criteria used to define the subgroups were discriminant in terms of risks of detecting AN at follow-up visits.
At FUV1, AN was detected in 6.2% of the LIR subgroup compared with 11.6% of the HIR subgroup (see Table 57; OR 1.99, 95% CI 1.46 to 2.71; p < 0.0001); this suggests that risk factors for CRC after baseline are also discriminant in terms of risk of AN at FUV1. At FUV2, new ANs was detected in 7.4% and 10.0% of the LIR and HIR subgroups, respectively (OR 1.38, 95% CI 0.91 to 2.10; p = 0.1245); thus, the risk groups were not discriminant for findings at FUV2, possibly owing to a lack of power because of the small number of end points detected.
The effect of interval in the LIR and HIR subgroups was examined (Table 57). At FUV1, there was a highly significant association between longer interval and new AN in the HIR subgroup (p < 0.0001); in the LIR subgroup, the trend was only borderline significant, possibly because of a paucity of end points (p = 0.0433). At FUV2, there was an association between interval and new AN in the HIR subgroup (p = 0.0191), but not the LIR subgroup (p = 0.4573).
Interval to first follow-up | FUV1 | FUV2 | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Lower-risk subgroup | Higher-risk subgroupa | Lower-risk subgroup | Higher-risk subgroupb | |||||||||
Number with FUV1 | Number of AN (CRC) | % | Number with FUV1 | Number of AN (CRC) | % | Number with FUV2 | Number of AN (CRC) | % | Number with FUV2 | Number of AN (CRC) | % | |
< 18 months | 264 | 12 | 4.55 | 1496 | 134 (15) | 8.96 | 43 | 1 (1) | 2.33 | 354 | 28 (1) | 7.91 |
2 yearsc | 147 | 12 | 8.16 | 829 | 87 (9) | 10.49 | 81 | 6 | 7.41 | 295 | 29 (4) | 9.83 |
3 yearsc | 227 | 9 | 3.96 | 830 | 101 (5) | 12.17 | 164 | 15 | 9.15 | 354 | 37 (1) | 10.45 |
4 yearsc | 50 | 7 (1) | 14.00 | 305 | 45 (8) | 14.75 | 40 | 3 | 7.50 | 112 | 12 (1) | 10.71 |
5 yearsc | 53 | 3 | 5.66 | 164 | 30 (4) | 18.29 | 59 | 3 | 5.08 | 72 | 8 | 11.11 |
6 yearsc | 19 | 2 (1) | 10.53 | 104 | 19 (3) | 18.27 | 9 | 0 | 0 | 22 | 4 | 18.18 |
≥ 6.5 years | 15 | 3 | 20.00 | 105 | 29 (6) | 27.62 | 8 | 2 | 25.00 | 22 | 5 (1) | 22.73 |
Total | 775 | 48 (2) | 6.19 | 3833 | 445 (50) | 11.61 | 404 | 30 (1) | 7.43 | 1231 | 123 (8) | 9.99 |
pd = 0.0433 | pd < 0.0001 | pd = 0.4573 | pd = 0.0191 | |||||||||
OR for higher-intermediate vs. LIR subgroup (95% CI), p-value | 1.99 (1.46 to 2.71), < 0.0001 | 1.38 (0.91 to 2.10), 0.1245 |
As interval had a strong effect on findings at both the first and second follow-ups in the HIR subgroup, the effect of interval on findings at FUV1 was assessed in patients with HIR polyp factors only, HIR procedure quality factors only, or both (Table 58). Interval had a significant effect in all subsets of the HIR group. Although the test for trend was not as significant in patients who were classified as high risk based on examination factors only, this is probably the result of the smaller size of this group.
Interval | FUV1 | ||||||||
---|---|---|---|---|---|---|---|---|---|
HIR subgroup | |||||||||
Polyp characteristics only | Poor examination only | Poor examination and polyp characteristics | |||||||
Number with FUV1 | Number of AN (CRC) | % | Number with FUV1 | Number of AN (CRC) | % | Number with FUV1 | Number of AN (CRC) | % | |
< 18 months | 926 | 71 (7) | 7.7 | 204 | 21 (3) | 10.3 | 366 | 42 (5) | 11.5 |
2 yearsa | 436 | 40 (3) | 9.2 | 149 | 14 (2) | 9.4 | 244 | 33 (4) | 13.5 |
3 yearsa | 465 | 59 (3) | 12.7 | 177 | 17 | 9.6 | 188 | 25 (2) | 13.3 |
4 yearsa | 125 | 18 (2) | 14.4 | 87 | 12 (3) | 13.8 | 93 | 15 (3) | 16.1 |
5 yearsa | 61 | 9 (2) | 14.8 | 52 | 10 (1) | 19.2 | 51 | 11 (1) | 21.6 |
6 yearsa | 36 | 6 | 16.7 | 34 | 4 | 11.8 | 34 | 9 (3) | 26.5 |
≥ 6.5 years | 18 | 5 (1) | 27.8 | 45 | 10 | 22.2 | 42 | 14 (5) | 33.3 |
Total | 2067 | 208 (18) | 10.1 | 748 | 88 (9) | 11.8 | 1018 | 149 (23) | 14.6 |
pb < 0.0001 | pb = 0.0137 | pb < 0.0001 |
Sensitivity analyses and internal validation
Owing to the complex nature of the hospital data set and the rules used to define baseline and follow-up visits and interval, a number of sensitivity analyses were carried out to examine whether or not our methods were robust and did not introduce bias into the study sample. Specifically, we restricted analyses of the effect of interval on AA and CRC detection rates at FUV1, and the effect of surveillance on long-term CRC risk after baseline to:
-
patients whose baseline visit comprised only a single colonoscopy – this verifies whether or not rules used to define and extend the baseline visit were adequate or if they introduced bias into the calculation of surveillance interval length
-
patients with a complete colonoscopy at FUV1 – this assesses whether or not results were biased by the inclusion of patients without a complete colonoscopy at FUV1 (arguably the lack of a complete colonoscopy at FUV1 may mean that the surveillance visit was not as effective, as the whole colon could not be examined).
To check whether or not the definition of AA used had affected our results, a sensitivity analysis of the effect of interval on AA detection rates at FUV1 was also performed, with the definition of AA changed to a large (≥ 10 mm) adenoma or an adenoma with HGD (i.e. excluding villous or tubulovillous histology).
We also undertook sensitivity analyses to investigate the potential for misclassification of surveillance attendance, and how this may have impacted on the apparent effect of surveillance on CRC risk after baseline. Owing to a gap between the end of hospital data collection and patient follow-up with cancer registries, around 50% of the hospital cohort had follow-up time after the end of data collection during which they may have attended surveillance visits for which we were not able to collect reports. We hypothesised that any such misclassification would be non-differential and would therefore result in underestimation of the effect of surveillance. In addition, we may have potentially underestimated pre-surveillance CRC risk after baseline owing to contamination of the no-surveillance group with patients who had in fact attended one or more follow-ups. To investigate this, we restricted analyses of CRC risk after baseline to patients with at least 5 years and at least 7 years of hospital data collection, among whom any misclassification of surveillance attendance is extremely unlikely.
Patients with a single baseline colonoscopy
When the cohort was restricted to patients whose baseline visit comprised a single colonoscopy only (n = 2489), the overall AA detection rate at FUV1 was 10.1%, compared with 9.8% in all patients who attended follow-up (n = 4608). In addition, the effects of interval and baseline risk factors on the odds of detecting AA at FUV1 were similar to the effects observed in the main analysis (compare Table 59 and Table 29).
Baseline risk factor | Category | Number of patients (N = 2489) | Univariable analysis: new AA | Multivariable analyses: new AA | |||||
---|---|---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 2489) | Model 2 – interval as continuous (n = 2489) | ||||||||
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | |||
Interval | < 18 months | 932 | 72 (7.73) | 1 | 0.0003 | 1 | 0.0005 | n/a | |
2 yearsa | 467 | 46 (9.85) | 1.31 (0.89 to 1.92) | 1.24 (0.83 to 1.86) | |||||
3 yearsa | 609 | 62 (10.18) | 1.35 (0.95 to 1.93) | 1.42 (0.98 to 2.07) | |||||
4 yearsa | 203 | 20 (9.85) | 1.31 (0.78 to 2.20) | 1.18 (0.68 to 2.05) | |||||
5 yearsa | 124 | 21 (16.94) | 2.44 (1.44 to 4.13) | 2.42 (1.38 to 4.24) | |||||
6 yearsa | 73 | 11 (15.07) | 2.12 (1.07 to 4.20) | 1.97 (0.96 to 4.07) | |||||
≥ 6.5 years | 81 | 19 (23.46) | 3.66 (2.08 to 6.46) | 3.94 (2.14 to 7.24) | |||||
Per year increase | n/a | n/a | 1.16 (1.09 to 1.23) | < 0.0001 | n/a | 1.17 (1.09 to 1.25) | < 0.0001 | ||
Age (years) | < 55 | 590 | 38 (6.44) | 1 | 0.0014 | 1 | 0.0028 | 1 | 0.0030 |
≥ 55 and < 60 | 355 | 49 (13.8) | 2.33 (1.49 to 3.63) | 2.34 (1.48 to 3.71) | 2.31 (1.46 to 3.66) | ||||
≥ 60 and < 65 | 431 | 38 (8.82) | 1.40 (0.88 to 2.24) | 1.39 (0.86 to 2.26) | 1.38 (0.85 to 2.23) | ||||
≥ 65 and < 70 | 447 | 44 (9.84) | 1.59 (1.01 to 2.49) | 1.42 (0.89 to 2.26) | 1.41 (0.88 to 2.25) | ||||
≥ 70 and < 75 | 353 | 42 (11.9) | 1.96 (1.24 to 3.11) | 1.82 (1.13 to 2.94) | 1.82 (1.13 to 2.92) | ||||
≥ 75 and < 80 | 208 | 31 (14.9) | 2.54 (1.54 to 4.21) | 2.51 (1.49 to 4.26) | 2.5 (1.48 to 4.22) | ||||
≥ 80 | 105 | 9 (8.57) | 1.36 (0.64 to 2.91) | 1.45 (0.66 to 3.18) | 1.42 (0.65 to 3.12) | ||||
Most complete colonoscopy | Complete | 1507 | 130 (8.63) | 1 | 0.0030 | 1 | 0.0018 | 1 | 0.0015 |
Incomplete/unknown | 982 | 121 (12.32) | 1.49 (1.15 to 1.93) | 1.65 (1.20 to 2.25) | 1.65 (1.21 to 2.26) | ||||
Largest adenoma (mm) | < 20 | 1827 | 178 (9.74) | 1 | 0.3511 | 1 | 0.1966 | 1 | 0.2275 |
≥ 20 | 662 | 73 (11.03) | 1.15 (0.86 to 1.53) | 1.22 (0.90 to 1.66) | 1.21 (0.89 to 1.63) | ||||
Large hyperplastic polyp | No | 2454 | 242 (9.86) | 1 | 0.0079 | 1 | 0.0139 | 1 | 0.0151 |
Yes | 35 | 9 (25.71) | 3.16 (1.47 to 6.83) | 3.06 (1.34 to 6.98) | 3.01 (1.32 to 6.85) | ||||
Proximal polyp | No | 1592 | 140 (8.79) | 1 | 0.0049 | 1 | < 0.0001 | 1 | < 0.0001 |
Yes | 897 | 111 (12.37) | 1.46 (1.13 to 1.91) | 1.92 (1.43 to 2.58) | 1.91 (1.42 to 2.56) |
When the cohort was restricted to patients with a single colonoscopy at baseline, the CRC detection rate at FUV1 was the same as in the main analysis, at 1.1%. The effect of interval on CRC detection at FUV1 differed somewhat in the sensitivity analysis; although some ORs differed to those estimated in the main analysis, the trends observed were comparable (compare Table 60 and Table 30). The smaller number of end points available in the sensitivity analysis meant that there was a greater degree of imprecision and the results were less statistically significant than those of the main analysis.
Baseline risk factor | Category | Number of patients (N = 2489) | Univariable analysis: new CRC | Multivariable analyses: new CRC | |||||
---|---|---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 2489) | Model 2 – interval as continuous (n = 2489) | ||||||||
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | |||
Interval | < 18 months | 932 | 10 (1.07) | 1 | 0.0031 | 1 | 0.0022 | n/a | |
2 yearsa | 467 | 4 (0.86) | 0.80 (0.25 to 2.55) | 0.82 (0.25 to 2.65) | |||||
3 yearsa | 609 | 1 (0.16) | 0.15 (0.02 to 1.19) | 0.20 (0.03 to 1.59) | |||||
4 yearsa | 203 | 4 (1.97) | 1.85 (0.58 to 5.97) | 2.56 (0.78 to 8.45) | |||||
5 yearsa | 124 | 1 (0.81) | 0.75 (0.01 to 5.91) | 1.00 (0.12 to 8.00) | |||||
6 yearsa | 73 | 2 (2.74) | 2.6 (0.56 to 12.08) | 3.41 (0.71 to 16.38) | |||||
≥ 6.5 years | 81 | 5 (6.17) | 6.07 (2.02 to 18.2) | 8.27 (2.63 to 26.01) | |||||
Per year increase | n/a | n/a | 1.27 (1.13 to 1.43) | 0.0011 | n/a | 1.31 (1.15 to 1.49) | 0.0004 | ||
Age (years) | < 60 | 945 | 2 (0.21) | 1 | 0.0001 | 1 | 0.0001 | 1 | < 0.0001 |
≥ 60 and < 65 | 431 | 2 (0.46) | 2.20 (0.31 to 15.66) | 2.44 (0.34 to 17.63) | 1.89 (0.26 to 13.89) | ||||
≥ 65 and < 70 | 447 | 4 (0.89) | 4.26 (0.78 to 23.33) | 3.76 (0.68 to 20.90) | 3.63 (0.65 to 20.23) | ||||
≥ 70 and < 75 | 353 | 8 (2.27) | 10.93 (2.31 to 51.74) | 11.62 (2.43 to 55.55) | 11.01 (2.3 to 52.63) | ||||
≥ 75 and < 80 | 208 | 7 (3.37) | 16.42 (3.39 to 79.63) | 17.91 (3.64 to 88.22) | 17.95 (3.67 to 87.7) | ||||
≥ 80 | 105 | 4 (3.81) | 18.67 (3.38 to 103.22) | 19.00 (3.36 to 107.49) | 20.04 (3.57 to 112.35) | ||||
Most complete colonoscopy | Complete | 1507 | 13 (0.86) | 1 | 0.1906 | n/a | 1 | 0.2402 | |
Incomplete/unknown | 982 | 14 (1.43) | 1.66 (0.78 to 3.55) | 1.65 (0.72 to 3.81) | |||||
Best bowel preparation | Excellent/good/satisfactory/unknown | 2396 | 24 (1) | 1 | 0.0968 | 1 | 0.0643 | 1 | 0.0627 |
Poor | 93 | 3 (3.23) | 3.29 (0.97 to 11.14) | 4.03 (1.12 to 14.45) | 4.05 (1.14 to 14.41) | ||||
Proximal polyp | No | 1592 | 14 (0.88) | 1 | 0.1957 | n/a | 1 | 0.0993 | |
Yes | 897 | 13 (1.45) | 1.66 (0.78 to 3.54) | 2.00 (0.88 to 4.54) |
In the sensitivity analysis of CRC risk after baseline restricted to patients with a single baseline colonoscopy (n = 6500), 72 CRCs were diagnosed, compared with 168 CRCs in 11,944 patients in the main analysis. The effect of a single surveillance visit remained the same, and the effect of two or more was similar; however, because of the reduction in the number of end points, the ORs were less precise and the effect of surveillance was only borderline significant (compare Table 61 and Table 44). Similar trends were seen among baseline risk factors and CRC risk after baseline.
Baseline risk factor | Category | Adjusted HR (95% CI) | p-value (LRT) |
---|---|---|---|
Number of follow-up visits after baselinea | 0 | 1 | 0.0484 |
1 | 0.51 (0.27 to 0.96) | ||
2+ | 0.44 (0.18 to 1.08) | ||
Largest adenoma (mm) | < 10 | 1 | 0.0050 |
10–19 | 9.56 (1.3 to 70.3) | ||
≥ 20 | 8.02 (1.05 to 61.41) | ||
Worst adenoma dysplasia | Low grade | 1 | 0.0023 |
High grade | 2.48 (1.44 to 4.26) | ||
Completeness of colonoscopy | Complete | 1 | 0.0008 |
Incomplete/unknown | 2.33 (1.44 to 3.79) | ||
Proximal polyps | No | 1 | 0.0165 |
Yes | 1.82 (1.13 to 2.95) | ||
Age (years) at baseline | < 55 | 1 | 0.0210 |
≥ 55 and < 60 | 1.11 (0.36 to 3.4) | ||
≥ 60 and < 65 | 1.57 (0.6 to 4.09) | ||
≥ 65 and < 70 | 2.35 (0.98 to 5.62) | ||
≥ 70 and < 75 | 2.86 (1.17 to 7.01) | ||
≥ 75 and < 80 | 3.84 (1.56 to 9.46) | ||
≥ 80 | 3.37 (1.26 to 9.01) |
The similarity between these sensitivity analyses and the main analyses suggests that the methods used to define and, in some cases, extend the baseline visit did not introduce bias into the data.
Patients with a complete colonoscopy at follow-up visit 1
Analyses of AA and CRC at FUV1 and CRC risk after baseline were repeated, this time restricting the cohort to patients with a complete colonoscopy at FUV1. The detection rate of AA at FUV1 in the restricted cohort (n = 3299) was 10.5%, which was marginally higher than the 9.8% detection rate in the full cohort. The effects of interval and baseline risk factors on AA detection were similar to the effects observed in the main analysis, apart from adenoma size, which was no longer statistically significant (compare Table 62 and Table 29).
Baseline risk factor | Category | Number of patients (N = 3299) | Univariable analysis: new AA | Multivariable analyses: new AA | |||||
---|---|---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 3299) | Model 2 – interval as continuous (n = 3299) | ||||||||
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | |||
Interval | < 18 months | 1225 | 99 (8.08) | 1 | 0.0001 | 1 | 0.0004 | n/a | |
2 yearsa | 628 | 65 (10.35) | 1.31 (0.95 to 1.82) | 1.28 (0.91 to 1.80) | |||||
3 yearsa | 831 | 86 (10.35) | 1.31 (0.97 to 1.78) | 1.46 (1.06 to 2.00) | |||||
4 yearsa | 254 | 34 (13.39) | 1.76 (1.16 to 2.66) | 1.74 (1.12 to 2.72) | |||||
5 yearsa | 174 | 25 (14.37) | 1.91 (1.19 to 3.06) | 1.84 (1.11 to 3.03) | |||||
6 yearsa | 100 | 15 (15) | 2.01 (1.12 to 3.61) | 1.99 (1.07 to 3.69) | |||||
≥ 6.5 years | 87 | 21 (24.14) | 3.62 (2.13 to 6.16) | 3.78 (2.13 to 6.74) | |||||
Per year increase | n/a | n/a | 1.15 (1.08 to 1.21) | < 0.0001 | n/a | 1.15 (1.08 to 1.23) | < 0.0001 | ||
Age (years) | < 55 | 772 | 53 (6.87) | 1 | 0.0001 | 1 | 0.0001 | 1 | 0.0001 |
≥ 55 and < 60 | 468 | 59 (12.61) | 1.96 (1.32 to 2.89) | 1.86 (1.24 to 2.78) | 1.87 (1.25 to 2.80) | ||||
≥ 60 and < 65 | 580 | 55 (9.48) | 1.42 (0.96 to 2.11) | 1.46 (0.98 to 2.19) | 1.44 (0.96 to 2.16) | ||||
≥ 65 and < 70 | 592 | 59 (9.97) | 1.50 (1.02 to 2.21) | 1.42 (0.95 to 2.12) | 1.42 (0.95 to 2.11) | ||||
≥ 70 and < 75 | 509 | 62 (12.18) | 1.88 (1.28 to 2.77) | 1.99 (1.33 to 2.96) | 1.98 (1.33 to 2.94) | ||||
≥ 75 and < 80 | 263 | 46 (17.49) | 2.88 (1.88 to 4.39) | 3.03 (1.94 to 4.72) | 2.99 (1.92 to 4.65) | ||||
≥ 80 | 115 | 11 (9.57) | 1.43 (0.73 to 2.84) | 1.67 (0.83 to 3.38) | 1.65 (0.82 to 3.34) | ||||
Most complete colonoscopy | Complete | 2333 | 206 (8.83) | 1 | < 0.0001 | 1 | < 0.0001 | 1 | < 0.0001 |
Incomplete/unknown | 966 | 139 (14.39) | 1.74 (1.38 to 2.18) | 1.78 (1.36 to 2.33) | 1.81 (1.38 to 2.37) | ||||
Largest adenoma (mm) | < 20 | 2122 | 212 (9.99) | 1 | 0.2414 | 1 | 0.1237 | 1 | 0.1408 |
≥ 20 | 1177 | 133 (11.3) | 1.15 (0.91 to 1.44) | 1.21 (0.95 to 1.54) | 1.20 (0.94 to 1.52) | ||||
Large hyperplastic polyp | No | 3231 | 330 (10.21) | 1 | 0.0050 | 1 | 0.0040 | 1 | 0.0039 |
Yes | 68 | 15 (22.06) | 2.49 (1.39 to 4.46) | 2.65 (1.43 to 4.89) | 2.65 (1.44 to 4.88) | ||||
Proximal polyp | No | 2039 | 192 (9.42) | 1 | 0.0136 | 1 | < 0.0001 | 1 | < 0.0001 |
Yes | 1260 | 153 (12.14) | 1.33 (1.06 to 1.67) | 1.70 (1.33 to 2.19) | 1.70 (1.33 to 2.18) |
When the analysis of CRC at FUV1 was carried out using only patients with a complete colonoscopy at FUV1, the CRC detection rate was slightly lower, at 0.7%, compared with 1.1% in the main analysis. The effect of surveillance interval was weaker and only borderline significant compared with the highly significant effect obtained when all 4608 patients were analysed (compare Table 63 and Table 30).
Baseline risk factor | Category | Number of patients (N = 3299) | Univariable analysis: new CRC | Multivariable analyses: new CRC | |||||
---|---|---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 3299) | Model 2 – interval as continuous (n = 3299) | ||||||||
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | |||
Interval | < 18 months | 1225 | 11 (0.9) | 1 | 0.0403 | 1 | 0.0491 | n/a | |
2 yearsa | 628 | 2 (0.32) | 0.35 (0.08 to 1.6) | 0.36 (0.08 to 1.65) | |||||
3 yearsa | 831 | 2 (0.24) | 0.27 (0.06 to 1.2) | 0.33 (0.07 to 1.49) | |||||
4 yearsa | 254 | 5 (1.97) | 2.22 (0.76 to 6.43) | 2.99 (1.01 to 8.92) | |||||
5 yearsa | 174 | 1 (0.57) | 0.64 (0.08 to 4.97) | 0.91 (0.11 to 7.20) | |||||
6 yearsa | 100 | 2 (2) | 2.25 (0.49 to 10.3) | 3.22 (0.68 to 15.2) | |||||
≥ 6.5 years | 87 | 1 (1.15) | 1.28 (0.16 to 10.06) | 1.84 (0.23 to 14.73) | |||||
Per year increase | n/a | n/a | 1.10 (0.91 to 1.34) | 0.3467 | n/a | 1.15 (0.95 to 1.4) | 0.1838 | ||
Age (years) | < 60 | 1240 | 4 (0.32) | 1 | 0.0551 | 1 | 0.0387 | 1 | 0.0435 |
≥ 60 and < 65 | 580 | 3 (0.52) | 1.61 (0.36 to 7.20) | 1.46 (0.32 to 6.64) | 1.52 (0.34 to 6.84) | ||||
≥ 65 and < 70 | 592 | 4 (0.68) | 2.10 (0.52 to 8.43) | 1.77 (0.44 to 7.21) | 1.86 (0.46 to 7.52) | ||||
≥ 70 and < 75 | 509 | 6 (1.18) | 3.69 (1.04 to 13.12) | 3.79 (1.05 to 13.67) | 3.68 (1.03 to 13.20) | ||||
≥ 75 and < 80 | 263 | 6 (2.28) | 7.21 (2.02 to 25.75) | 7.80 (2.15 to 28.27) | 7.72 (2.14 to 27.81) | ||||
≥ 80 | 115 | 1 (0.87) | 2.71 (0.30 to 24.45) | 2.47 (0.27 to 22.85) | 2.40 (0.26 to 22.11) | ||||
Most complete colonoscopy | Complete | 2333 | 15 (0.64) | 1 | 0.3857 | n/a | 1 | 0.3723 | |
Incomplete/unknown | 966 | 9 (0.93) | 1.45 (0.63 to 3.33) | 1.52 (0.62 to 3.73) | |||||
Best bowel preparation | Excellent/good/satisfactory/unknown | 3174 | 20 (0.63) | 1 | 0.0129 | 1 | 0.0075 | 1 | 0.0098 |
Poor | 125 | 4 (3.2) | 5.21 (1.75 to 15.49) | 6.34 (2.03 to 19.79) | 5.79 (1.89 to 17.69) | ||||
Proximal polyp | No | 2039 | 12 (0.59) | 1 | 0.2389 | n/a | 1 | 0.1963 | |
Yes | 1260 | 12 (0.95) | 1.62 (0.73 to 3.63) | 1.75 (0.75 to 4.1) |
In the analysis of CRC risk after baseline in patients with a complete colonoscopy at FUV1 (n = 10,685), there were 144 CRCs diagnosed, which was only slightly less than the 168 CRCs in the full cohort. Consequently, the effect of surveillance was almost identical in both analyses (compare Table 64 and Table 44). The effects of baseline risk factors on CRC risk were similar to the main analysis, except for the effect of a large adenoma, which was slightly weaker and only borderline significant in the sensitivity analysis.
Baseline risk factor | Category | Adjusted HR (95% CI) | p-value (LRT) |
---|---|---|---|
Number of follow-up visits after baselinea | 0 | 1 | 0.0003 |
1 | 0.51 (0.32 to 0.81) | ||
2+ | 0.28 (0.12 to 0.64) | ||
Largest adenoma (mm) | < 10 | 1 | 0.0455 |
10–19 | 2.50 (1.00 to 6.27) | ||
≥ 20 | 2.84 (1.11 to 7.26) | ||
Worst adenoma dysplasia | Low grade | 1 | 0.0027 |
High grade | 1.86 (1.26 to 2.76) | ||
Completeness of colonoscopy | Complete | 1 | < 0.0001 |
Incomplete/unknown | 2.16 (1.51 to 3.08) | ||
Proximal polyps | No | 1 | 0.0003 |
Yes | 1.95 (1.37 to 2.77) | ||
Age (years) at baseline | < 55 | 1 | < 0.0001 |
≥ 55 and < 60 | 0.92 (0.39 to 2.18) | ||
≥ 60 and < 65 | 1.07 (0.50 to 2.28) | ||
≥ 65 and < 70 | 2.41 (1.29 to 4.50) | ||
≥ 70 and < 75 | 2.35 (1.23 to 4.51) | ||
≥ 75 and < 80 | 3.37 (1.78 to 6.38) | ||
≥ 80 | 2.93 (1.45 to 5.89) |
The fact that these sensitivity analyses and the main analyses demonstrated comparable effects of interval and surveillance suggests that the inclusion of patients without a complete colonoscopy at FUV1 did not bias the results in any way.
Redefining advanced adenoma
When the definition of AA was altered to include only adenomas of ≥ 10 mm or with HGD, 324 patients had AA detected at FUV1 and the AA detection rate was 7.0%, compared with 415 and 9.8% when villous or tubulovillous histology was included in the definition. The effect of interval was slightly stronger when AA was redefined, with a trend of increasing odds of AA at FUV1 with increasing interval length as observed in the main analysis (compare Tables 65 and 29). The associations between baseline risk factors and AA at FUV1 were similar in both sets of analyses, except for adenoma size, which was only weakly associated with AA at FUV1 using the restricted definition. These findings suggest that the inclusion of villous or tubulovillous histology in the definition of AA is appropriate and did not bias our results for the effect of interval on AA at FUV1.
Baseline risk factor | Category | Number of patients (N = 4608) | Univariable analysis: new AA | Multivariable analyses: new AA | |||||
---|---|---|---|---|---|---|---|---|---|
Model 1 – interval as categorical (n = 4608) | Model 2 – interval as continuous (n = 4608) | ||||||||
n (%) | Unadjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | Adjusted OR (95% CI) | p-value (LRT) | |||
Interval | < 18 months | 1760 | 89 (5.06) | 1 | < 0.0001 | 1 | < 0.0001 | n/a | |
2 yearsa | 976 | 66 (6.76) | 1.36 (0.98 to 1.89) | 1.36 (0.98 to 1.91) | |||||
3 yearsa | 1057 | 74 (7.00) | 1.41 (1.03 to 1.94) | 1.53 (1.10 to 2.13) | |||||
4 yearsa | 355 | 34 (9.58) | 1.99 (1.32 to 3.00) | 2.13 (1.39 to 3.27) | |||||
5 yearsa | 217 | 22 (10.14) | 2.12 (1.30 to 3.46) | 2.31 (1.39 to 3.85) | |||||
6 yearsa | 123 | 15 (12.20) | 2.61 (1.46 to 4.66) | 2.66 (1.46 to 4.86) | |||||
≥ 6.5 years | 120 | 24 (20.00) | 4.69 (2.86 to 7.70) | 5.15 (3.04 to 8.73) | |||||
Per year increase | n/a | n/a | 1.20 (1.14 to 1.27) | < 0.0001 | n/a | 1.23 (1.16 to 1.3) | < 0.0001 | ||
Age (years) | < 55 | 1025 | 40 (3.90) | 1 | < 0.0001 | 1 | < 0.0001 | 1 | < 0.0001 |
≥ 55 and < 60 | 622 | 60 (9.65) | 2.63 (1.74 to 3.97) | 2.63 (1.73 to 4.01) | 2.66 (1.75 to 4.06) | ||||
≥ 60 and < 65 | 788 | 61 (7.74) | 2.07 (1.37 to 3.11) | 2.27 (1.50 to 3.45) | 2.26 (1.49 to 3.44) | ||||
≥ 65 and < 70 | 813 | 52 (6.40) | 1.68 (1.10 to 2.57) | 1.67 (1.09 to 2.57) | 1.69 (1.00.1 to 2.6) | ||||
≥ 70 and < 75 | 714 | 56 (7.84) | 2.10 (1.38 to 3.18) | 2.28 (1.49 to 3.49) | 2.29 (1.49 to 3.52) | ||||
≥ 75 and < 80 | 413 | 42 (10.17) | 2.79 (1.78 to 4.37) | 3.05 (1.92 to 4.84) | 3.08 (1.94 to 4.89) | ||||
≥ 80 | 233 | 13 (5.58) | 1.46 (0.77 to 2.77) | 1.63 (0.84 to 3.14) | 1.66 (0.86 to 3.21) | ||||
Most complete colonoscopy | Complete | 2973 | 177 (5.95) | 1 | 0.0001 | 1 | 0.0002 | 1 | 0.0002 |
Incomplete/unknown | 1635 | 147 (8.99) | 1.56 (1.24 to 1.96) | 1.64 (1.26 to 2.14) | 1.65 (1.27 to 2.15) | ||||
Largest adenoma (mm) | < 20 | 2880 | 194 (6.74) | 1 | 0.3137 | 1 | 0.0631 | 1 | 0.0678 |
≥ 20 | 1728 | 130 (7.52) | 1.13 (0.89 to 1.42) | 1.26 (0.99 to 1.6) | 1.25 (0.98 to 1.59) | ||||
Large hyperplastic polyp | No | 4525 | 311 (6.87) | 1 | 0.0067 | 1 | 0.0081 | 1 | 0.0074 |
Yes | 83 | 13 (15.66) | 2.52 (1.38 to 4.60) | 2.52 (1.35 to 4.72) | 2.55 (1.36 to 4.76) | ||||
Proximal polyp | No | 2940 | 187 (6.36) | 1 | 0.0192 | 1 | 0.0002 | 1 | 0.0001 |
Yes | 1668 | 137 (8.21) | 1.32 (1.05 to 1.66) | 1.63 (1.27 to 2.09) | 1.63 (1.27 to 2.09) |
Patients with specified minimum years of hospital data collected
To assess whether or not potential misclassification of surveillance attendance may have impacted on the apparent effect of surveillance on CRC risk after baseline, sensitivity analyses were restricted to patients with at least 5 years (n = 4854) and at least 7 years (n = 3055) of hospital data. We were able to look only at the effect of one or more surveillance visits compared with none because of the small numbers in the LIR subgroup when restricting the analysis to patients with at least 7 years of hospital data.
Compared with the analysis using all patients (n = 11,944), the effect of surveillance was slightly stronger when the cohort was restricted (Table 66). This underestimation of the effect of surveillance suggests non-differential misclassification of surveillance attendance between patients with and without CRC, with some contamination among patients classified as having no surveillance.
Number of surveillance visits after baselinea | Cohort | |||||
---|---|---|---|---|---|---|
Full | With ≥ 5 years of hospital data | With ≥ 7 years of hospital data | ||||
Univariable HR (95% CI) | p-value (LRT) | Univariable HR (95% CI) | p-value (LRT) | Univariable HR (95% CI) | p-value (LRT) | |
Total | ||||||
N = 11,944, cases = 168 | N = 4854, cases = 108 | N = 3055, cases = 83 | ||||
0.0001 | 0.0003 | |||||
0 | 1 | 1 | 1 | 0.0005 | ||
1+ | 0.49 (0.34 to 0.70) | 0.46 (0.30 to 0.70) | 0.43 (0.27 to 0.69) | |||
LIR subgroup | ||||||
N = 2679, cases = 13 | N = 936, cases = 7 | N = 578, cases = 4 | ||||
0.30 | 0.32 | |||||
0 | 1 | 1 | 1 | 0.1228 | ||
1+ | 0.51 (0.14 to 1.89) | 0.45 (0.09 to 2.20) | 0.18 (0.02 to 1.89) | |||
HIR subgroup | ||||||
N = 9265, cases = 155 | N = 3918, cases = 101 | N = 2477, cases = 79 | ||||
< 0.0001 | 0.0002 | |||||
0 | 1 | 1 | 1 | 0.0002 | ||
1+ | 0.45 (0.31 to 0.66) | 0.43 (0.28 to 0.67) | 0.41 (0.25 to 0.67) |
In the LIR and HIR subgroups, an underestimation of the effect of surveillance was also observed (see Table 66). When the analysis was restricted to patients with at least 5 years of hospital data, the effect of one or more surveillance visits was slightly underestimated in both risk subgroups. When the cohort was restricted to patients with at least 7 years of hospital data, the effect of one or more surveillance visits was only slightly stronger in the HIR subgroup but considerably so in the LIR subgroup, although the small number of end points and imprecision in the latter group precluded interpretation. However, if we have truly underestimated the effect substantially in our main analyses as a result of misclassification it may be that surveillance is of considerable benefit in the LIR subgroup.
When pre-surveillance CRC risk was examined using sensitivity analyses restricted to patients with at least 5 years or at least 7 years of hospital data, there was evidence to suggest that pre-surveillance risk had been underestimated in the main analyses (Table 67). Overall, patients with at least 7 years of follow-up were at 54% higher risk of CRC than the general population, as opposed to the 6% higher risk estimated previously.
Cohort | Full cohort, N = 11,944: observation time free of surveillance | Cohort with ≥ 5 years of hospital data, N = 4854: observation time free of surveillance | Cohort with ≥ 7 years of hospital data, N = 3055: observation time free of surveillance | ||||||
---|---|---|---|---|---|---|---|---|---|
Number of observed CRCs | Number of expected CRCs | SIR (95% CI) | Number of observed CRCs | Number of expected CRCs | SIR (95% CI) | Number of observed CRCs | Number of expected CRCs | SIR (95% CI) | |
Total | 108 | 102 | 1.06 (0.87 to 1.28) | 57 | 43 | 1.33 (1.01 to 1.72) | 40 | 26 | 1.54 (1.10 to 2.09) |
LIR subgroup | 9 | 23 | 0.39 (0.18 to 0.75) | 4 | 9 | 0.46 (0.13 to 1.18) | 3 | 5 | 0.57 (0.12 to 1.68) |
HIR subgroup | 99 | 79 | 1.26 (1.02 to 1.53) | 53 | 34 | 1.55 (1.16 to 2.02) | 37 | 21 | 1.78 (1.25 to 2.45) |
The misclassification of surveillance attendance had a great effect on the SIR in the HIR group, the pre-surveillance CRC risk of which became considerably greater than in the general population (see Table 67). In the LIR subgroup the pre-surveillance CRC risk also appeared to have been underestimated, with the SIR for LIR patients with at least 7 years of hospital data suggesting a 43% lower CRC incidence than in the general population. Owing to the small number of end points the SIRs for the LIR subgroup were imprecise, limiting interpretation.
Internal validation of models
To assess the performance of the multivariable logistic models for the outcomes of new findings at follow-up and the multivariable Cox regression models for the analysis of long-term cancer incidence, we performed internal validation using k-fold cross-validation with k = 10.
The 10-fold cross-validation results for the models for findings at FUV1 and FUV2 are presented in Figure 9 and Table 68. The presented results for new adenoma, AA and CRC at FUV1 and new AN at FUV2 correspond to models presented in Tables 28–30 and Table 40, respectively. For these models, the weighted mean area under the ROC curve ranged from 64.5% to 87.5%, with greater variation seen for the models for new CRC at FUV1 and new AN at FUV2. The predictions from the models for new CRC at FUV1 performed the best at discriminating between patients with and without new findings, but all models showed some ability to discriminate.
Validation set | New adenoma at FUV1 | New AA at FUV1 | New CRC at FUV1 | New AN at FUV2 | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Categorical interval | Continuous interval | Categorical interval | Continuous interval | Categorical interval | Continuous interval | Categorical interval | Continuous interval | |||||||||
Area under ROC curve | Area under ROC curve | Area under ROC curve | Area under ROC curve | Area under ROC curve | Area under ROC curve | Area under ROC curve | Area under ROC curve | |||||||||
SE | % | SE | % | SE | % | SE | % | SE | % | SE | % | SE | % | SE | % | |
1 | 65.4 | 2.7 | 66.2 | 2.7 | 57.4 | 4.5 | 62.4 | 4.1 | 59.5 | 11.1 | 83.2 | 7.5 | 71.5 | 8.5 | 67.2 | 6.9 |
2 | 63.7 | 2.8 | 61.0 | 3.0 | 67.4 | 4.3 | 63.9 | 3.9 | 73.6 | 12.6 | 94.0 | 4.1 | 68.8 | 8.6 | 74.2 | 7.9 |
3 | 63.0 | 3.0 | 64.9 | 2.8 | 61.4 | 4.6 | 66.4 | 4.1 | 78.1 | 9.4 | 60.8 | 12.8 | 67.9 | 9.2 | 60.9 | 8.7 |
4 | 67.3 | 2.7 | 64.5 | 2.8 | 68.3 | 3.9 | 64.7 | 4.2 | 75.9 | 10.3 | 82.1 | 6.7 | 73.3 | 5.5 | 73.4 | 5.7 |
5 | 67.7 | 2.7 | 64.5 | 2.9 | 65.4 | 4.2 | 70.8 | 4.4 | 80.0 | 7.0 | 69.8 | 13.6 | 73.3 | 6.4 | 68.5 | 7.9 |
6 | 68.1 | 2.8 | 67.2 | 2.8 | 67.3 | 4.0 | 67.6 | 4.5 | 70.2 | 9.2 | 59.7 | 9.6 | 48.8 | 8.4 | 65.6 | 8.1 |
7 | 61.9 | 3.0 | 62.4 | 3.0 | 71.3 | 4.9 | 58.2 | 4.6 | 94.5 | 2.2 | 71.7 | 10.9 | 51.4 | 8.4 | 66.6 | 8.4 |
8 | 64.1 | 2.9 | 64.7 | 2.9 | 63.2 | 4.3 | 60.0 | 4.8 | 77.7 | 9.9 | 83.0 | 6.8 | 66.0 | 7.2 | 61.8 | 9.8 |
9 | 60.3 | 3.1 | 63.0 | 2.8 | 66.2 | 4.1 | 61.7 | 4.3 | 59.1 | 9.6 | 61.6 | 12.8 | 70.6 | 7.4 | 66.4 | 7.9 |
10 | 62.1 | 2.8 | 68.6 | 2.7 | 63.0 | 4.1 | 69.0 | 3.7 | 76.5 | 8.5 | 88.5 | 4.2 | 71.2 | 7.7 | 69.4 | 7.2 |
Weighted mean | 64.5 | 0.9 | 64.8 | 0.9 | 65.1 | 1.3 | 64.7 | 1.3 | 87.5 | 1.8 | 84.2 | 2.2 | 67.5 | 2.4 | 68.3 | 2.4 |
The cross-validation results for the models for long-term CRC incidence are presented in Figure 10 and Table 69. The results for incidence after baseline correspond to the model in Table 44 and results for incidence after FUV1 correspond with the model in Table 50. The predictions from both models demonstrated some ability to discriminate between patients who were and were not diagnosed with CRC, with weighted mean areas under the ROC curve of 65–66%.
Validation set | CRC incidence | |||
---|---|---|---|---|
After baseline | After FUV1 | |||
Area under ROC curve | Area under ROC curve | |||
SE | % | SE | % | |
1 | 61.8 | 8.3 | 53.6 | 15.2 |
2 | 62.2 | 6.5 | 78.3 | 6.9 |
3 | 62.5 | 9.0 | 47.5 | 16.4 |
4 | 60.5 | 6.4 | 60.9 | 10.4 |
5 | 69.2 | 7.0 | 52.2 | 13.0 |
6 | 66.2 | 7.1 | 65.8 | 18.5 |
7 | 62.4 | 7.4 | 60.4 | 12.1 |
8 | 61.1 | 8.8 | 78.0 | 9.2 |
9 | 66.6 | 7.3 | 66.6 | 8.3 |
10 | 76.3 | 7.0 | 43.3 | 12.1 |
Weighted mean | 65.1 | 2.3 | 65.6 | 3.4 |
Discussion: hospital data set
Key findings
Does surveillance provide any benefit in terms of long-term cancer risk?
We found strong evidence that surveillance confers substantial benefit on IR patients by lowering their future risk of CRC. Overall, the first surveillance visit appeared to offer most protection, and the benefit of additional surveillance was not entirely clear. Having two or more surveillance visits was associated with an increased reduction in CRC risk, but the effect in all IR patients was not significant.
Is there a group that does not require a follow-up examination or for which a second follow-up examination might be omitted?
There appeared to be heterogeneity among IR patients in terms of long-term CRC risk and surveillance needs. When patients were subdivided into HIR and LIR subgroups based on baseline polyp and procedural predictors of CRC risk, the subgroups were discriminant in terms of future CRC risk. In the HIR subgroup, one follow-up was associated with a strong protective effect against risk of CRC after baseline, and additional surveillance provided significant further benefit compared with one follow-up alone. Before the first surveillance visit, HIR patients were at 26% increased risk of CRC compared with general population; after allowing for the effect of follow-up by including observation time after surveillance, the incidence rate of CRC was reduced but was still 13% greater than in the general population, suggesting that continued surveillance is beneficial for HIR patients.
By comparison, the effect of surveillance in the LIR subgroup was less clear, although there was a trend towards a reduction in CRC risk with a single follow-up, the results were not statistically significant. The pre-surveillance (post-baseline) incidence rate of CRC was significantly lower than that of the general population. When accounting for the effect of surveillance, there was a small reduction in the 10-year cumulative incidence of CRC, and the LIR subgroup remained at lower risk than the general population. Additional surveillance beyond one follow-up did not appear to provide any further reduction in risk in the LIR subgroup.
Is a 3-year surveillance interval appropriate for intermediate-risk patients?
Findings from the hospital data set suggest that the current surveillance interval of 3 years is appropriate for the majority of IR patients. At FUV1 the odds of detecting an AA or CRC increased significantly with increasing interval length. CRC detection rates were < 1% before the interval extended beyond 3 years, while AA detection rates were around 9% during this time. The proportion of patients with adenomas remained relatively constant across intervals (30–40%), making adenomas an uninformative outcome for specifying recommended follow-up intervals.
Based on these data we suggest that, in order to prevent delayed diagnosis of missed CRC or development of new CRC, the first surveillance examination should be performed no later than 3 years after baseline. In our data set, surveillance at 3 years would have been worthwhile, as there was an adequate yield of AA but the detection rate of CRC was low. Our results do not suggest that surveillance needs to be done earlier in most of this group of patients, as there was little increase in rates of AA in the first 3 years and a low rate of CRC.
Similarly, at FUV2 we found no evidence to suggest that the current 3-year interval between the first and second follow-ups is inappropriate. There was a significant increase in AN detection at FUV2 with increasing interval length, and more than twice the odds of AN with an interval of > 3 years.
Is there a group that needs a shorter interval to the first or second follow-up examination, or for which follow-up could be postponed?
The LIR and HIR subgroups derived from models for long-term CRC risk (see Lower- and higher-intermediate-risk subgroups, above) were discriminant when applied to findings at FUV1, but not FUV2. The detection of AN at FUV1 increased with increasing interval length in both risk subgroups, but more significantly so in the HIR subgroup. In the LIR subgroup, the AN detection rate was < 10% until the interval exceeded 3 years, providing further evidence in favour of a 3-year interval. In the HIR subgroup, the AN detection rate was 12% at 3 years, and a considerable proportion of CRCs occurred early; however, CRC incidence was only 1% before 3 years, so any gains from a shorter interval are likely to be small. An exception might be made for patients with incompletely removed lesions or a poor examination at baselines in whom a repeat examination soon after baseline might be appropriate. An association between increasing interval and AN at FUV2 was seen in the HIR subgroup but not the LIR subgroup, possibly because of a lack of power in the latter. A 3-year interval to FUV2 appeared to be the most appropriate for the HIR subgroup for the same reasons as for FUV1, but our data do not permit us to deduce the interval to FUV2 in the LIR subgroup if surveillance is offered at all.
Risk factors
Older age, poor bowel preparation quality, incomplete colonoscopy, a large adenoma (≥ 20 mm) and proximal polyps were identified as risk factors in a number of analyses of the hospital data set. Baseline risk factors for AA at FUV1 were similar to those for CRC risk after baseline, providing evidence of the validity of AA detection at follow-up as a short-term surrogate outcome for future risk of CRC. Risk factors common to both analyses included older age, large adenoma size (≥ 20 mm), proximal polyps and completeness of colonoscopy. HGD was associated with long-term CRC risk after baseline but not findings at FUV1, whereas having a large hyperplastic polyp was a risk factor for AA at FUV1 but not for CRC after baseline. Finding AA or CRC at FUV2, and future CRC risk after follow-up, were affected by both baseline and FUV1 risk factors.
Some of these risk factors have been previously reported in the literature on multiple occasions;3,23,32,51 however, a lesser known risk factor is the presence of polyps in the proximal colon, which was associated with a twofold increased risk of CRC in our cohort. An increased risk of AA at first surveillance in patients with proximal adenomas at baseline has been noted by some authors,32,52 but, to our knowledge, no study has identified this as a risk factor for CRC. The presence of proximal polyps may indicate a different biological pathway, such as the serrated pathway, whereby patients may produce lesions with rapid malignant transformation or be more likely to develop hard-to-find cancers in the proximal colon. 53
Examination quality
Colonoscopy completeness and quality of bowel preparation were important predictors of CRC in our cohort. Since the introduction of the national quality assessment tool in 2004 (UK National Endoscopy Training Programme and the Global Endoscopy Rating Scale), there has been a substantial improvement in colonoscopy quality, as assessed by ability to reach the caecum, and polyp detection rates, a measure of meticulousness in examining the colonic mucosa. 54,55 Furthermore, a study using data from the English National Cancer Repository56 found a significant 27% decline in cancers diagnosed within 3 years of colonoscopy between 2001 and 2008; such cancers are assumed to have arisen from missed or incompletely removed lesions.
An incomplete colonoscopy might be due to a number of factors, from endoscopist performance to patient characteristics such as older age or having prior abdominal or pelvic surgery. 57,58 Risk factors for poor bowel preparation quality include older age, overweight, diabetes and other comorbidities. 59,60 In those individuals for whom colonoscopy is difficult and therefore unsuccessful, it is probably inappropriate to recommend repeated colonoscopic surveillance; alternative surveillance strategies need to be explored.
The NICE colonoscopy guidelines advise that a repeat examination is performed in cases of suboptimal bowel preparation;24 however, paradoxically, this may result in reluctance on the part of an endoscopist to categorise bowel preparation quality as poor, particularly if the patient has been difficult to examine. There is only weak evidence on how to salvage a procedure when bowel preparation is found to be inadequate. 61 However, the importance of achieving good bowel preparation cannot be overstated; a systematic review62 found that adenoma detection rates were significantly higher in patients with adequate or good-quality preparation compared with poor-quality preparation.
Strengths and limitations
A major strength of the hospital data set was the wide variation in surveillance interval length. Following the adoption of national surveillance guidelines that prescribe set intervals,1,16,24 this feature is unlikely to be seen in future data sets examining adenoma follow-up.
Another achievement of the investigation was the creation of a high-quality data set despite the numerous difficulties encountered. The data used for the study were usually in a format that was not intended for research purposes, and required extensive cleaning. Thorough data collection and meticulous data coding enabled the ascertainment of detailed patient, procedural and polyp characteristics, the accuracy of which was often corroborated through the use of more than one source of information, for example endoscopic and pathological information, or multiple procedure reports. A major strength was that the raw source data have been retained for verification of our cleaning processes.
Extensive data cleaning was used to resolve transcriptional errors in, or discrepancies between, reports. Most data cleaning tasks were performed manually to ensure accuracy and avoid assumptions inherent to automation. Any inconsistencies were carefully scrutinised before being corrected. To account for changes in pathological classifications over the follow-up period, all lesions were classified using standardised, up-to-date terminology from the EU guideline for quality assurance in CRC screening and diagnosis. 63 Manual coding errors were minimised through comprehensive data consistency and validity checks, and standardised coding procedures, ensuring data accuracy.
In terms of missing data, as a result of extensive work on recorded data, the number and size of adenomas at baseline were complete, and only 4% of patients were missing data on histology or dysplasia. Completeness of baseline colonoscopy was unknown in only 14% of patients, but quality of bowel preparation was missing in 39% of patients and it is known that the reporting of the bowel preparation quality is subjective. 64 The ‘unknown’ category was retained in analyses to avoid the introduction of bias, with the assumption that it probably comprised a mixture of examinations with both suboptimal and good-quality bowel preparation.
Despite our rigorous coding and data cleaning methods, some measurement error and misclassification of exposures and confounding factors is to be expected given the nature of the routine data used; however, any misclassification should be non-differential and, thus, may have resulted in the underestimation of the effect of surveillance and interval. The potential effect of misdetermination of intervals was dealt with in the sensitivity analysis reported above (see Sensitivity analyses and internal validation). A number of studies have highlighted the potential for polyp type, size, histology, dysplasia and number to be measured inaccurately. Existing literature was used to estimate the potential effect of measurement error on our study findings. In terms of measurement error of pathological attributes of adenomas, a number of sources are available. 65–69 These indicate excellent histopathological measurement of adenoma size, with interobserver correlations among measures of the order of 0.98 and kappa values of the order of 0.85–0.90. 66,67 This would confer around a 2% bias in the estimates of logistic and Cox regression coefficients,70 and around the same proportionate increase in size of 95% CIs – see Spiegelman et al. 70 for mathematical details. Endoscopic determination of size is likely to be more subject to error. 66,68 Determination of grade of dysplasia is generally observed to be good, with kappa values of the order of 0.6 and interobserve agreement as high as 94%. 65,67 Most studies find that determination of villous status (and further classification of villous adenomas) is subject to a greater degree of measurement error, with kappa values of 0.40–0.60. 65,67,69 However, in our sensitivity analyses, we found that this was not crucial to our results. Thus, it is likely that measurement error of the pathological attributes of the adenomas, although not negligible, does not significantly alter the interpretation of our results.
Another limitation of our study is that, although some patients were censored before the end of hospital data collection, the majority had follow-up time after the end of data collection, during which they may have attended surveillance visits. Patients who were aged ≥ 73 years at the end of data collection and who had two or more follow-up visits, would not be affected in our analysis. Patients aged < 73 years with only one or no surveillance visit, recorded at the end of data collection (≈50% of our cohort) could have an underestimated number of surveillance visits resulting in the underestimation of the effect of surveillance on CRC risk. This issue was addressed in sensitivity analyses restricted to patients with at least 5 years and at least 7 years of hospital data; the sensitivity analyses suggested that, as expected, the effect of surveillance may have been underestimated.
In addition, we may have underestimated pre-surveillance risk after baseline if some patients classified as having no surveillance did, in fact, attend one or more follow-ups, which thus lowered their risk and contaminated the ‘pre-surveillance’ group. This was of particular concern in the LIR subgroup, as their pre-surveillance risk was very low compared with the general population (SIR = 0.39, 95% CI 0.18 to 0.75). When sensitivity analyses restricted to patients with at least 5 years or at least 7 years of hospital data were undertaken, the results suggested that, although the pre-surveillance CRC risk may have been underestimated as a result of misclassification of surveillance attendance, the LIR subgroup still appeared to be at substantially lower risk than the general population; however, SIRs were imprecise because of the small number of end points.
With regard to bias, we believe our methods to be relatively robust. National registries were consulted to accurately trace almost all patients’ mortality and cancer status. Together with extensive interrogation of follow-up data, this prevented loss to follow-up and limited the risk of selection bias and outcome misclassification, ensuring the quality of the data set. Similarly, selection bias due to non-response was not applicable because of the retrospective nature of the study – we extracted all available endoscopy and pathology data on every adenoma patient who underwent an endoscopy between specific dates at all study sites.
Exposed and unexposed groups (i.e. patients with and without follow-up, or with differing surveillance intervals) were both drawn from the same hospital databases of patients presenting during similar time periods, ensuring comparability. Potential selection bias could have resulted from the fact that those who attended surveillance differed from those who did not in terms of patient, procedural and polyp characteristics; however, the actual differences between patients with and without follow-up examinations were generally small. The most notable differences were in age, completeness of colonoscopy, bowel preparation and year of entry. Similarly, those with a short interval differed from those with a long interval. As factors associated with attendance at one or more follow-up visits, or with a short interval, were also risk factors for CRC after baseline and for AA and CRC at follow-up, then any selection bias should have resulted in an underestimate of the effect of interval length or surveillance on CRC risk.
A complex set of rules was generated to group examinations into visits and intervals using the NHS BCSP guideline. 71 In general, most patients had a visit that consisted of a single colonoscopy, and the baseline visit was extended beyond 11 months in only 2% of cases. With no clear surveillance recommendations or reasons for the examination provided in the reports, this was the best available method. Some error in the classification of interval length is to be expected; however, any misclassification is likely to be non-differential and so should result in only-ran underestimate of the effect of interval. We undertook sensitivity analyses using only those patients whose baseline visit comprised a single examination, or patients with a complete colonoscopy at follow-up, and we found no noteworthy differences between these and our main analyses, suggesting that the methods used to extend baseline and define visits and intervals were satisfactory and did not introduce any form of bias into the data.
All a priori confounders – for example adenoma size, dysplasia and age – were adjusted for in multivariable regression analyses in order to remove any potential confounding effects that may have obscured the associations of interest. In general, there was no confounding or evidence of only weak confounding in most multivariable analyses. However, a particularly strong confounding factor was identified when assessing the effect of interval on findings at follow-up. We found that patients at higher risk of AN at follow-up were also more likely to have had a short interval, which appeared to be due to recall for polypectomy site surveillance or continued treatment of a large lesion. This biased the effect of interval, making a shorter interval appear risky and diminishing the effect of interval overall. This confounding effect was adjusted for by removing all previously seen lesions from our analyses, as a prior sighting was the best proxy measure that could be identified in the data available. Despite this, there is some risk of residual confounding, which may have caused us to underestimate the effect of interval.
In cases for which only paper records were available (prior to the 1990s) a few early endoscopy examination(s) may have been missed; however, as most baseline data fell between 2000 and 2010 (84%) any missing data on prior examinations are likely to be negligible. We may have missed baseline or follow-up examinations for patients who were treated at a hospital that was not included in the study. This was an unavoidable problem inherent to the retrospective methods used, which could have resulted in the incorrect classification of baseline, surveillance visits or risk groups. Owing to the wide geographic coverage of the study, we believe that this is unlikely to affect many patients, if any. It should be noted that, although follow-up examinations were assumed to be for surveillance, some may have been for symptomatic purposes.
Research using pseudo-anonymised data
A number of problems were encountered as a result of the use of pseudo-anonymised data in the hospital data set. Our patient identifier lists, consisting of surname and forename(s), hospital number(s), NHS number, gender, postcode and date of birth, were created from data held on endoscopy and pathology systems. Inevitably, some of these patient identifiers were subject to data entry errors, such as spelling, incorrect recording, spaces in the NHS numbers or transpositions errors, and not all of the patient identifiers were available from the systems from which the data were extracted.
When carrying out patient follow-up, we found that a high percentage of our records could not be matched by the HSCIC, as their algorithms were designed for cleaned data sets. Not having the patient identifiers significantly limited our ability to correct the errors and complete the missing information. Moreover, the HSCIC algorithms took very few fields into account in order to find a match which, in around 7% of cases, resulted in either no match being found or multiple matches, which the HSCIC also classifies as no match.
We worked closely with the HSCIC over several months to develop new algorithms to look at multiple combinations of patient identifiers to find the match. Multiple checks were then done on the validity of the matches in order to achieve a high match rate while avoiding compromising data integrity.
In around 2.5% of cases it was necessary to ask staff at individual hospitals to use the patient-identifiable information they held (particularly the hospital number, which the HSCIC cannot use) to complete the missing information on any cases that could not be matched with certainty after all of the algorithms had been applied. The HSCIC then used the new information from the hospitals to re-match these cases. It was very difficult to get the already overstretched hospital staff to do this work for us. Finally, where it was not possible to find a match, the HSCIC performed manual ‘operator’ matches.
We also had to ensure that the individuals responsible for safeguarding the patient-linking-files at the hospital were still contactable and that in the event that they moved, the information was passed to another hospital staff member, who safeguarded the data.
The problem of following up patients to obtain information on cancers and deaths is an aspect of research that would greatly benefit from an improved ability to use non-anonymised data, or from better access to patient-identifiable information. For future follow-up, we would ideally like to hold all of the data collected, including patient identifiers, in one secure location, but current site-specific restrictions prevent the data from being held centrally; each hospital trust has its own regulations and requires the data to be held within the hospital environment. Currently, our latest cleaned data are held only by the HSCIC; if they were required to un-flag our patients, it would take many months to collate, clean and match again. The re-matching of some patients might not even be possible, as the original data collected were supplemented with information from hospital databases, some of which were quite old and may no longer be available if the databases are decommissioned. It is also possible that with tightening regulations, this process would not be permitted. Carrying out detailed research using pseudo-anonymised data is extremely challenging and has not proved to be the simple, time- and cost-saving exercise that may have been envisaged when the call for proposal using retrospective data was originally made.
Chapter 4 Screening data set: results and discussion
Background
The study team had knowledge of several large data sets collected within screening studies and programmes, which contained data on individuals who were under surveillance and were believed to have been followed up. Seven screening data sets were originally identified for inclusion; however, four were excluded for reasons given in Table 70, leaving only three which were deemed of sufficient size and quality for analysis: the UK Flexible Sigmoidoscopy Screening Trial (UKFSST), the English Bowel Cancer Screening Pilot (EP) and the Kaiser Permanente Colon Cancer Prevention Program (KP).
Research data set | Author | Exclusion reason |
---|---|---|
Veteran Affairs Study | Lieberman et al. 200072 | Permission to access the data was not granted owing to concerns over data security |
Nottingham Trial of Faecal Occult Blood Testing | Scholefield and Moss 200273 | Permission was denied for collection of data on follow-up examinations; many of the data were available only in paper records and would have been expensive and lengthy to retrieve |
Scottish Bowel Cancer Screening Pilot | Alexander 200374 | Endoscopy and pathology data were not linked and would have had to be linked manually. There were many repeat examinations with the same information but varying dates, and over 1000 pathology reports with no corresponding endoscopy report |
Italian FS Screening Trial (SCORE) | Segnan et al. 200275 | In this one-off FS screening trial, 17,148 men and women were invited and 9911 had FS screening in six centres. However, baseline information and cancer registry information were obtained for only one centre, which included only 194 subjects referred for colonoscopy following screening. This data set was deemed too small for inclusion |
UK Flexible Sigmoidoscopy Screening Trial
The UKFSST aimed to examine the efficacy of a single FS screening in reducing CRC incidence and mortality rates. The trial randomised 170,432 men and women aged between 55 and 64 years to either FS screening or usual care, which at the time meant no CRC screening. 10 A total of 40,674 participants was screened by FS in 14 UK centres. Individuals undergoing FS screening who were found to have a large (≥ 10 mm) lesion, three or more adenomas, villous or tubulovillous histology, severe dysplasia, malignant disease or ≥ 20 hyperplastic polyps above the rectum were offered colonoscopy surveillance. The cohort was followed up using records held by the ONS and cancer registries for incidence of CRC and deaths. Follow-up data were available until 31 December 2012.
English Bowel Cancer Screening Pilot
This study was commissioned by the DH in 1999 to determine the feasibility of CRC screening using a guaiac faecal occult blood test (gFOBT) in the UK. The pilot included two sites, one in Scotland and one in England, and ran from 2000 to 2002. 74 Men and women aged 50–69 years, registered with a NHS general practitioner (GP), were invited to complete a gFOBT, and, by 2003, 189,319 subjects in England and 297,036 in Scotland had been invited for screening, with an uptake rate of around 60%. In the EP, individuals who tested gFOBT positive were offered a meeting with a specialist screening practitioner at one of three pilot centres, who assessed their fitness for colonoscopy. In total, 82% of referred participants attended their colonoscopy. Pseudo-anonymised data on baseline and follow-up colonoscopies for those offered surveillance were available to 2012. Patient identifiers were sent directly to the ONS to obtain cancer and mortality data; data were available until 30 June 2012.
Both the UKFSST and the EP had data available for the significant risk factors for AA and CRC that were identified in the analysis of the hospital data set.
Kaiser Permanente Colon Cancer Prevention Program
The Northern California Kaiser Permanente Medical Care Program began its Colon Cancer Prevention Program in 1994, with the aim of offering sigmoidoscopy screening to all members aged ≥ 50 years once every 10 years. 76,77 The KP data set with which we were provided comprised all participants with a baseline sigmoidoscopy between January 1994 and December 1995 who then had a baseline colonoscopy within 6 months of sigmoidoscopy and at least 1 year of subsequent follow-up. Follow-up data on CRCs and deaths were available up to 31 December 2006 or until the date the participant left the program, if earlier.
Methods
Rules used to derive variables for the hospital data set (see Chapter 2, Creating summary values for polyp characteristics, Procedure information and Defining baseline and surveillance visits), including baseline and follow-up visits, and polyp and procedural characteristics, were applied to the screening data set. The KP data set did not contain information regarding quality of baseline colonoscopy; therefore, all subjects in the KP cohort were assumed to have had a complete colonoscopy with good bowel preparation at baseline. Analyses were performed with Stata/IC 13.1.
The baseline characteristics of IR subjects in the hospital and screening data sets, in the individual screening cohorts, and in screening participants with and without follow-up, were compared. The distribution of baseline characteristics among patients with and without follow-up visits in the screening data set was compared using chi-squared tests.
In the analysis of long-term CRC risk after baseline, the cut-off for follow-up was the end date of follow-up data availability in each of the individual screening cohorts. All time-to-event data were censored at first CRC diagnosis, death, emigration, end of program participation (KP data set only) or end of follow-up. Time at risk started from the latest most complete colonoscopy in baseline and, for the analysis of incidence following FUV1, time at risk started on the date of the first procedure in FUV1. If CRC was diagnosed at a follow-up visit, the follow-up visit was not included as a visit, as it did not offer any protection against CRC. ‘One minus the Kaplan–Meier estimator of the survival function’ was used to illustrate the time to cancer diagnosis and to estimate the cumulative risk of cancer with 95% CIs at 3, 5 and 10 years.
The effects of surveillance and baseline risk factors for CRC identified in the hospital data set on long-term CRC incidence in the screening data set were examined. Univariable Cox proportional hazards models were used to estimate unadjusted HRs. Independent predictors of cancer incidence identified in the hospital data set were fitted to the screening data set using a multivariable Cox proportional hazards model, with the number of follow-up visits included as a time-varying covariate.
Observed pys at risk were calculated by gender and 5-year age group. Expected numbers of CRC cases were calculated by multiplying the observed gender- and age-specific number of pys by the gender- and age-specific incidence in the general population of England in 2007. The ratio of observed to expected cases was reported as a SIR, and 95% CIs were computed assuming an exact Poisson distribution.
Findings at FUV1 were investigated. The relationship between interval from baseline to FUV1 and new AA and CRC at FUV1 was explored, both with and without adjustment for baseline risk factors identified in the hospital data set. Logistic regression models were fitted to the pooled screening data set in order to assess whether or not baseline risk factors identified in the hospital data set were predictive of advanced findings at first follow-up in screening populations. The predictive ability of these models for new AA and CRC in the screening data set was assessed using receiver operating characteristic (ROC) curves.
Screening data set: comparison of results with the hospital data set
Baseline characteristics of screening participants with intermediate-risk adenomas
In the pooled screening data set, there were 2352 subjects with IR adenomas: 796 in the UKFSST cohort, 407 in the EP cohort and 625 in the KP cohort. This compares with 11,944 in the hospital data set.
Table 71 compares the distribution of baseline demographic characteristics and risk factors for finding new AA or new cancers at FUV1 identified from the hospital data set models in the different data sets and cohorts. Participants in the pooled screening data set were younger: > 20% of patients in the hospital cohort were aged > 75 years, compared with negligible numbers in all of the screening cohorts. Individuals in the screening data set were also more likely to have had a better-quality baseline colonoscopy; this was despite almost 80% of baseline examinations being done prior to 2000, compared with only 16% in the hospital patients. Furthermore, the adenomas detected at baseline in the screening participants were less likely to be large (≥ 20 mm) or have HGD.
Baseline risk factor | Cohort | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Hospital (N = 11,944) | Pooled screening (N = 2352) | EP (N = 490) | UKFSST (N = 952) | KPa (N = 910) | |||||||
n | % | n | % | n | % | n | % | n | % | ||
Age (years) at start of baseline | < 55 | 2122 | 17.8 | 232 | 9.9 | 69 | 14.1 | 0 | 0 | 163 | 17.9 |
≥ 55 and < 60 | 1321 | 11.1 | 669 | 28.4 | 95 | 19.4 | 381 | 40.0 | 193 | 21.2 | |
≥ 60 and < 65 | 1858 | 15.6 | 855 | 36.4 | 159 | 32.4 | 500 | 52.5 | 196 | 21.5 | |
≥ 65 and < 70 | 2171 | 18.2 | 410 | 17.4 | 163 | 33.3 | 71 | 7.5 | 176 | 19.3 | |
≥ 70 and < 75 | 1786 | 15.0 | 125 | 5.3 | 3 | 0.6 | 0 | 0 | 122 | 13.4 | |
≥ 75 and < 80 | 1416 | 11.9 | 51 | 2.2 | 1 | 0.2 | 0 | 0 | 50 | 5.5 | |
≥ 80 | 1270 | 10.6 | 10 | 0.4 | 0 | 0 | 0 | 0 | 10 | 1.1 | |
Gender | Male | 6625 | 55.5 | 1595 | 67.8 | 327 | 66.7 | 655 | 68.8 | 613 | 67.4 |
Female | 5319 | 44.5 | 757 | 32.2 | 163 | 33.3 | 297 | 31.2 | 297 | 32.6 | |
Year of baseline | 1980–94 | 439 | 3.7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
1995–9 | 1430 | 12.0 | 1861 | 79.1 | 0 | 0 | 951 | 99.9 | 910 | 100 | |
2000–4 | 4251 | 35.6 | 395 | 16.8 | 394 | 80.4 | 1 | 0.1 | 0 | 0 | |
2005–10 | 5824 | 48.8 | 96 | 4.1 | 96 | 19.6 | 0 | 0 | 0 | 0 | |
Most complete colonoscopy | Complete | 9016 | 75.5 | 2261 | 96.1 | 475 | 96.9 | 876 | 92.0 | 910 | 100 |
Incomplete/unknown | 2928 | 24.5 | 91 | 3.9 | 15 | 3.1 | 76 | 8.0 | 0 | 0 | |
Best bowel preparation at colonoscopy | Excellent/good/satisfactory/unknown | 11,273 | 94.4 | 2303 | 97.9 | 482 | 98.4 | 911 | 95.7 | 910 | 100 |
Poor | 671 | 5.6 | 49 | 2.1 | 8 | 1.6 | 41 | 4.3 | 0 | 0 | |
Largest adenoma (mm) | < 10 | 1029 | 8.6 | 272 | 11.6 | 30 | 6.1 | 95 | 10.0 | 147 | 16.1 |
10–14 | 4417 | 37.0 | 1108 | 47.1 | 192 | 39.2 | 429 | 45.1 | 487 | 53.5 | |
15–19 | 2440 | 20.4 | 512 | 21.8 | 144 | 29.4 | 210 | 22.1 | 158 | 17.4 | |
≥ 20 | 4058 | 34.0 | 460 | 19.6 | 124 | 25.3 | 218 | 22.9 | 118 | 13.0 | |
Worst adenoma histology | Tubular | 4742 | 39.7 | 1146 | 48.7 | 112 | 22.9 | 468 | 49.2 | 566 | 62.2 |
Tubulovillous | 5576 | 46.7 | 1020 | 43.4 | 340 | 69.4 | 396 | 41.6 | 284 | 31.2 | |
Villous | 1142 | 9.6 | 153 | 6.5 | 30 | 6.1 | 63 | 6.6 | 60 | 6.6 | |
Unknown | 484 | 4.0 | 33 | 1.4 | 8 | 1.6 | 25 | 2.6 | 0 | 0 | |
Worst adenoma dysplasia | Low grade | 9476 | 79.3 | 2071 | 88.1 | 389 | 79.4 | 811 | 85.2 | 871 | 95.7 |
High grade | 1994 | 16.7 | 260 | 11.0 | 100 | 20.4 | 121 | 12.7 | 39 | 4.3 | |
Unknown | 474 | 4.0 | 21 | 0.9 | 1 | 0.2 | 20 | 2.1 | 0 | 0 | |
Distal polyps | No | 1980 | 16.6 | 98 | 4.2 | 31 | 6.3 | 33 | 3.5 | 34 | 3.7 |
Yes | 9964 | 83.4 | 2254 | 95.8 | 459 | 93.7 | 919 | 96.5 | 876 | 96.3 | |
Proximal polyps | No | 7369 | 61.7 | 1682 | 71.5 | 348 | 71.0 | 709 | 74.5 | 625 | 68.7 |
Yes | 4575 | 38.3 | 670 | 28.5 | 142 | 29.0 | 243 | 25.5 | 285 | 31.3 | |
Largest hyperplastic polyp (mm) | < 10 or none | 11,761 | 98.5 | 2284 | 97.1 | 484 | 98.8 | 913 | 95.9 | 887 | 97.5 |
≥ 10 | 183 | 1.5 | 68 | 2.9 | 6 | 1.2 | 39 | 4.1 | 23 | 2.5 |
The three screening cohorts also differed in several respects. There were differences in the age at which screening was offered in the different cohorts: age 55–65 years in the UKFSST, age 50–69 years in the EP, but a wider age range in the KP cohort. Thus, the UKSST participants tended to be younger than in the KP and EP cohorts. The EP participants had their baseline between 2000 and 2010, whereas in the KP and UKFSST cohorts almost all were between 1995 and 1999. Examination quality was slightly worse in the UKFSST than in the EP or KP; however, data on examination quality were missing in the KP cohort, so all participants were assumed to have had a complete colonoscopy with at least satisfactory bowel preparation at baseline. Adenomas detected in the EP tended to be larger, and a much higher proportion had tubulovillous histology (69.4% EP vs. 41.6% UKFSST and 31.2% KP) or HGD (20.4% EP vs. 12.7% UKFSST and 4.3% KP).
Patients with and without surveillance
We examined the distribution of the baseline characteristics among screening participants with and without surveillance after baseline to determine the risk of selection bias in analysis of findings at and subsequent to follow-up visits (Table 72). Three-quarters of screening (1828) participants attended at least one follow-up and the remaining 524 were followed using external cancer and deaths data only.
Baseline risk factor | Participants with one or more surveillance visits (N = 1828) | Participants with no surveillance visits (N = 524) | p-value (chi-squared test) | |||
---|---|---|---|---|---|---|
n | % | n | % | |||
Age (years) at start of baseline | < 55 | 56 | 10.7 | 176 | 9.6 | < 0.0001 |
≥ 55 and < 60 | 119 | 22.7 | 550 | 30.1 | ||
≥ 60 and < 65 | 163 | 31.1 | 692 | 37.9 | ||
≥ 65 and < 70 | 97 | 18.5 | 313 | 17.1 | ||
≥ 70 and < 75 | 52 | 9.9 | 73 | 4.0 | ||
≥ 75 and < 80 | 27 | 5.2 | 24 | 1.3 | ||
≥ 80 | 10 | 1.9 | 0 | 0 | ||
Gender | Male | 1232 | 67.4 | 363 | 69.3 | 0.4171 |
Female | 596 | 32.6 | 161 | 30.7 | ||
Most complete colonoscopy | Complete | 1760 | 96.3 | 501 | 95.6 | 0.4836 |
Incomplete/unknown | 68 | 3.7 | 23 | 4.4 | ||
Best bowel preparation at colonoscopy | Excellent/good/satisfactory/unknown | 1788 | 97.8 | 515 | 98.3 | 0.5061 |
Poor | 40 | 2.2 | 9 | 1.7 | ||
Largest adenoma (mm) | < 10 | 201 | 11.0 | 71 | 13.5 | 0.0187 |
10–14 | 841 | 46.0 | 267 | 51.0 | ||
15–19 | 415 | 22.7 | 97 | 18.5 | ||
≥ 20 | 371 | 20.3 | 89 | 17.0 | ||
Worst adenoma histology | Tubular | 843 | 46.1 | 303 | 57.8 | < 0.0001 |
Tubulovillous | 843 | 46.1 | 177 | 33.8 | ||
Villous | 116 | 6.4 | 37 | 7.1 | ||
Unknown | 26 | 1.4 | 7 | 1.3 | ||
Worst adenoma dysplasia | Low grade | 1590 | 87.0 | 481 | 91.8 | 0.0070 |
High grade | 222 | 12.1 | 38 | 7.2 | ||
Unknown | 16 | 0.9 | 5 | 1.0 | ||
Distal polyps | No | 70 | 3.8 | 28 | 5.3 | 0.1262 |
Yes | 1758 | 96.2 | 496 | 94.7 | ||
Proximal polyps | No | 1311 | 71.7 | 371 | 70.8 | 0.6821 |
Yes | 517 | 28.3 | 153 | 29.2 | ||
Largest hyperplastic polyp (mm) | < 10 or none | 1778 | 97.3 | 506 | 96.6 | 0.3993 |
≥ 10 | 50 | 2.7 | 18 | 3.4 |
Those attending surveillance were younger, on average, than those who did not attend [mean 61.5 years (SD 5.2) vs. mean 63.4 years (SD 7.1); p < 0.001), but there was no difference by gender or the quality of baseline colonoscopy. Attenders were more likely to have a large adenoma (p = 0.0187), an adenoma with tubulovillous histology (p < 0.0001) or an adenoma with HGD at baseline (p = 0.0070); however, the proportions with proximal polyps or large (≥ 10 mm) hyperplastic polyps were similar.
Table 73 describes the number of follow-up visits in the screening and hospital data sets. Almost 80% of screening participants had at least one follow-up examination, compared with < 40% of the hospital patients, and 43% had at least two follow-ups, compared with only 14% of the hospital patients. The amount of follow-up was relatively similar in the EP and UKFSST cohorts, whereas the KP cohort had a greater proportion of participants without any follow-up (31% vs. 16–17%).
Number of follow-up visits | Data set | Cohort | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Hospita |