| Literature DB >> 35738723 |
Andres Tamm1,2, Helen Js Jones1,3, William Perry1,3, Des Campbell4,5, Rachel Carten4,6, Jim Davies1,7, Algirdas Galdikas8,9, Louise English10, Alex Garbett11,12, Ben Glampson8,9, Steve Harris1,7, Khurum Khan10,13, Stephanie Little1,3, Lee Malcomson11,12, Sheila Matharu4,5, Erik Mayer9,14, Luca Mercuri8,9, Eva Ja Morris1,2, Rebecca Muirhead1,3, Ruth Norris11, Catherine O'Hara11,12, Dimitri Papadimitriou8,9, Niels Peek11,15, Andrew Renehan11,12, Gail Roadknight1,3, Naureen Starling4,5, Marion Teare4,5, Rachel Turner4,5, Kinga A Várnai1,3, Harpreet Wasan8,16, Kerrie Woods1,3, Chris Cunningham17,3.
Abstract
OBJECTIVE: Colorectal cancer is a common cause of death and morbidity. A significant amount of data are routinely collected during patient treatment, but they are not generally available for research. The National Institute for Health Research Health Informatics Collaborative in the UK is developing infrastructure to enable routinely collected data to be used for collaborative, cross-centre research. This paper presents an overview of the process for collating colorectal cancer data and explores the potential of using this data source.Entities:
Keywords: Database Management Systems; Electronic Health Records; Health Information Systems; Hospital Records; Informatics
Mesh:
Year: 2022 PMID: 35738723 PMCID: PMC9226931 DOI: 10.1136/bmjhci-2021-100535
Source DB: PubMed Journal: BMJ Health Care Inform ISSN: 2632-1009
Figure 1Hypothetical patient timelines that show specific treatment and surveillance patterns. Group A: Timelines of patients with colon cancer that follow the pattern ‘diagnosis, scan, surgery, scan’. Group B: Patients with rectal cancer with ‘diagnosis, scan, chemoradiotherapy, radical resection, chemo(radio)therapy, scan’. Group C: Patiens with colorectal cancer with ‘diagnosis, treatment, scan, recurrence, treatment, death’. Group D: Patients with rectal cancer with local excision. Timelines for 10 patients were created to illustrate each group. TNM, tumour, node, metastases.
Demographic baseline of patients captured with colorectal cancer in the NIHR HIC colorectal cancer research database form three pilot sites
| Characteristic | Value |
| Number of participants | 12 903 (100%) |
| Cancer site | |
| C18—colon | 4920 (38.1%) |
| C19—rectosigmoid | 1171 (9.1%) |
| C20—rectum | 2699 (20.9%) |
| Not known yet | 4997 (38.7%) |
| Age at diagnosis | |
| Age, years, median (IQR) | 68.4 (58.1–77.3) |
| Not known yet | 4997 (38.7%) |
| Sex | |
| Male | 7223 (56.0%) |
| Female | 5504 (42.7%) |
| Not known | 176 (1.4%) |
| Ethnicity | |
| White | 9222 (71.5%) |
| Not stated | 1575 (12.2%) |
| Other ethnic groups | 786 (6.1%) |
| Asian or Asian British | 554 (4.3%) |
| Black or Black British | 396 (3.1%) |
| Mixed | 77 (0.6%) |
| Not known | 293 (2.3%) |
| Smoking status | |
| Current or ex-smoker | 2912 (22.6%) |
| Non-smoker | 3793 (29.4%) |
| Not known | 6964 (54.0%) |
| Body mass index | |
| Median (IQR) | 25.8 (22.9–29.2) |
| Not known | 3220 (25.0%) |
| Treatment | |
| Neoadjuvant | 3708 (28.7%) |
| Adjuvant | 962 (7.5%) |
| Local excision | 291 (2.3%) |
| Radical resection | 3715 (28.8%) |
| Not known yet | 5749 (44.6%) |
| Mortality and follow-up | |
| Number of deaths | 5090 (39.4%) |
| Years from diagnosis to mortality, median (IQR) | 0.9 (0.3–2.0) |
| Years from diagnosis to last follow-up, median (IQR) | 2.4 (0.8–4.6) |
HIC, Health Informatics Collaborative; NIHR, National Institute for Health Research.
Number of records per selected data items in the NIHR HIC colorectal cancer research database
| Field | Number of records |
| Laboratory tests | 5071 605 |
| Inpatient episodes | 13 737 |
| Diagnosis codes | 443 762 |
| Procedure codes | 154 363 |
| Radiotherapy | 7726 |
| Chemotherapy | 17 452 |
| Histology reports | 15 311 |
| Relevant histology reports | 9226 |
| Imaging reports | 96 330 |
| Endoscopy reports | 13 737 |
| Relevant endoscopy reports | 11 352 |
Relevant histology reports contain references to colon or rectum; relevant imaging reports correspond to certain investigations, such as MRI of pelvis and rectum; and relevant endoscopy reports represent colonoscopies and flexible sigmoidoscopies.
HIC, Health Informatics Collaborative; NIHR, National Institute for Health Research.
T stage, recurrence and metastatic disease identified in the NIHR HIC colorectal cancer research database through NLP of imaging, endoscopy and/or histopathology reports for all patients who had surgical excision at one of the pilot sites
| Characteristic | Radical resection or local excision |
| Number of participants | 2124 (100%) |
| Maximum T stage | |
| 0 | 31 (1.5%) |
| is (in situ) | 0 (0%) |
| 1 | 195 (9.2%) |
| 2 | 369 (17.4%) |
| 3 | 954 (44.9%) |
| 4 | 460 (21.7%) |
| X | 2 (0.1%) |
| Not known | 113 (5.3%) |
| Recurrence or metastasis | |
| Recurrence or metastasis detected | 769 (36.2%) |
| Metastasis present around time of diagnosis | 286 (13.5%) |
| Not detected | 1355 (63.8%) |
HIC, Health Informatics Collaborative; NIHR, National Institute for Health Research; NLP, natural language processing.
Figure 2Presurgery and postsurgery T staging for patients with colon cancer (C18) who had a major resection, determined by natural language processing (NLP) of imaging reports (presurgery) and histopathology reports (postsurgery). Number of patients is given in brackets.
Figure 3Longitudinal pathway plot of a hypothetical patient with rectal cancer treated with neoadjuvant therapy then radical resection. After a colonoscopy and around the time of diagnosis the patient had neoadjuvant radiotherapy and chemotherapy as identified by the green and blue circles. They then proceeded to surgery, after which TNM staging was available (small pink circles). The next time point for this patient (light grey line) shows a scan done as part of the follow-up regime, with several further thereafter. Nearly 300 days since diagnosis a scan and colonoscopy led to the diagnosis of recurrence and further radiotherapy and chemotherapy. The final ‘X’ signifies death, although it does not show whether death was related to the cancer or not. TNM, tumour, node, metastases.
Figure 4Longitudinal pathway plot of a hypothetical patient with rectal cancer who underwent local excision. Rectal cancer was picked up on colonoscopy as indicated by the dark grey line, and treated by local excision as indicated by the orange circle. After a disease-free surveillance period of approximately 18 months, the patient had recurrence as shown by the first red arrow. This was followed by radiotherapy and chemotherapy prior to death. TNM, tumour, node, metastases.