| Literature DB >> 33214194 |
David Anthony Smith1,2, Tingyan Wang2,3, Oliver Freeman2, Charles Crichton2, Hizni Salih2, Philippa Clare Matthews3,4, Jim Davies2,5, Kinga Anna Várnai1,2, Kerrie Woods1,2, Christopher R Jones6, Ben Glampson7, Abdulrahim Mulla7, Luca Mercuri7, A Torm Shaw8, Lydia N Drumright9, Luis Romão10,11, David Ramlakan10,11, Finola Higgins12, Alistair Weir12, Eleni Nastouli10, Kosh Agarwal13, William Gelson9, Graham S Cooke6, Eleanor Barnes14,3.
Abstract
OBJECTIVE: The National Institute for Health Research (NIHR) Health Informatics Collaborative (HIC) is a programme of infrastructure development across NIHR Biomedical Research Centres. The aim of the NIHR HIC is to improve the quality and availability of routinely collected data for collaborative, cross-centre research. This is demonstrated through research collaborations in selected therapeutic areas, one of which is viral hepatitis.Entities:
Keywords: BMJ Health Informatics; computer methodologies; information systems; record systems
Year: 2020 PMID: 33214194 PMCID: PMC7678229 DOI: 10.1136/bmjhci-2020-100145
Source DB: PubMed Journal: BMJ Health Care Inform ISSN: 2632-1009
Overview of specific issues to be addressed in viral hepatitis theme using the National Institute for Health Research (NIHR) Health Informatics Collaborative (HIC) Viral Hepatitis Theme Dataset
| Virus | Example research questions to be addressed using the NIHR HIC Viral Hepatitis Theme Dataset |
| Hepatitis B virus (HBV) and hepatitis delta (HDV) coinfection | Characterising demographics and laboratory parameters for cases of chronic HBV infection. Improving use of biomarkers for monitoring and treatment stratification. Identifying individuals who control/clear HBV infection. Identifying subgroups that develop complications. Estimating incidence of HDV coinfection in the UK. Identifying determinants of HDV treatment success. |
| Hepatitis C virus (HCV) | Establishing sustained viral response rates for HCV therapy in the UK. Identifying factors determining treatment response in difficult to treat groups of patients. |
| Hepatitis E virus (HEV) | Estimating incidence of HEV infection in the UK. Identifying risk factors for HEV infection the UK. |
Figure 1Data flow for the National Institute for Health Research (NIHR) Health Informatics Collaborative (HIC) viral hepatitis clinical exemplar theme. For each participating site, data originating from clinical systems were used to populate a corresponding data warehouse that contains data fields in the agreed dataset for viral hepatitis research. Data in the data warehouse were transferred to the NIHR HIC Viral Hepatitis Central Data Repository. The data stored in the central data repository can be extracted and provided to internal and external research groups according to the governance process. NHS, National Health Service, EPR, Electronic Patient Record.
Figure 2Conceptual data flow for an individual site participating in the National Institute for Health Research (NIHR) Health Informatics Collaborative (HIC) Viral Hepatitis clinical exemplar theme. In an individual site, structured data in the hospital operational systems were directly transferred into the hospital data warehouse, and data stored in unstructured format was automatically or manually transformed to produce structured data either before or after transfer to the data warehouse. In addition, data from paper records or unconnected data sources were manually entered into a structured electronic data capture system and transferred into the data warehouse. Data was then anonymised prior to transfer to the central data repository for viral hepatitis research.
Figure 3Data model overview of the National Institute for Health Research Health Informatics Collaborative Viral Hepatitis Research Database. The defined dataset for viral hepatitis research includes 32 entities/tables, which can be grouped into different categories, marked in different colours. Each patient has a unique clinical data identifier that is stored in the table ‘clinical data IDs’. The relationship between two entities is shown beside their connection line. For example, 1:1 between table ‘clinical data IDs’ and table ‘study subject’ indicates that a patient can only have one study subject ID; while 1:N between table ‘clinical data IDs’ and table ‘hospital visit’ indicates that a patient could possibly have multiple hospital visits.
Characteristics of chronically infected hepatitis B virus (HBV) cohort
| Number of patients (total) | 960 |
| Age at the beginning of follow-up: median (range) | 36 (2–80) |
| Gender (%) | |
| Male | 502 (52.3) |
| Female | 389 (40.5) |
| Unknown | 69 (7.2) |
| Ethnicity (%) | |
| White | 106 (11.0) |
| Mixed | 11 (1.2) |
| Asian | 220 (22.9) |
| Black | 161 (16.8) |
| Other | 72 (7.5) |
| Unknown | 390 (40.6) |
| HDV coinfection | |
| Yes (%) | 17 (1.8) |
| No (%) | 468 (48.7) |
| Unknown (%) | 475 (49.5) |
| HCV coinfection | |
| Yes (%) | 75 (7.8) |
| No (%) | 347 (36.1) |
| Unknown (%) | 538 (56.1) |
| HIV coinfection | |
| Yes (%) | 7 (0.7) |
| No (%) | 427 (44.5) |
| Unknown (%) | 526 (54.8) |
| Documented comorbidities | |
| Diabetes (%) | 30 (3.1) |
| Depression (%) | 28 (2.9) |
| Chronic kidney disease (%) | 16 (1.7) |
| Coagulation disorder (%) | 22 (2.3) |
| Cryoglobulinaemia (%) | 3 (0.3) |
| Cancer (%) | 21 (2.2) |
| Liver cancer (%) | 1 (0.1) |
| Other cancer (%) | 20 (2.1) |
| Severity of liver disease | |
| Cirrhosis (%) | 27 (2.8) |
| Decompensated (%) | 4 (0.4) |
| Child–Pugh Score: median (IQR) | NA |
| Fibroscan stiffness: median (IQR) | 5.3 (2.4) |
| MELD Score: median (IQR) | NA |
| Patients treated (%) | 254 (26.5) |
| Lab tests at baseline | |
| Alanine aminotransferase (IU/L) | |
| Untreated group: median (IQR) | 29 (25) |
| Treated group: median (IQR) | 39 (37) |
| HBV DNA (log10 IU/mL) | |
| Untreated group: median (IQR) | 3.15 (1.58) |
| Treated group: median (IQR) | 4.26 (3.67) |
| Hepatitis B e antigen positive | |
| Untreated group (%) | 82/706 (11.6) |
| Treated group (%) | 84/254 (33.1) |
For a continuous variable, median and the IQR are calculated, for a categorical variable, the number and the percentage of patients is provided.
*Patients who had at least one episode of treatment recorded.
MELD, Model For End-Stage Liver Disease; NA, not available.
Characteristics of hepatitis C virus (HCV) cohort
| Number of patients (total) | 1404 |
| Age at the beginning of follow-up: median (range) | 47 (16–87) |
| Gender (%) | |
| Male | 604 (43.0) |
| Female | 281 (20.0) |
| Unknown | 519 (37.0) |
| Ethnicity (%) | |
| White | 359 (25.6) |
| Mixed | 11 (0.8) |
| Asian | 53 (3.8) |
| Black | 41 (2.9) |
| Other | 49 (3.5) |
| Unknown | 891 (63.4) |
| HCV genotype | |
| 1a (%) | 351 (25.0) |
| 1b (%) | 144 (10.3) |
| 1 other (%) | 48 (3.4) |
| 2 (%) | 58 (4.1) |
| 3a (%) | 225 (16.0) |
| 3 other (%) | 50 (3.6) |
| 4 (%) | 69 (4.9) |
| 5 (%) | 2 (0.1) |
| 6 (%) | 3 (0.2) |
| Unknown (%) | 454 (32.3) |
| HBV coinfection | |
| Yes (%) | 75 (5.4) |
| No (%) | 1097 (78.1) |
| Unknown (%) | 232 (16.5) |
| HIV coinfection | |
| Yes (%) | 60 (4.3) |
| No (%) | 1077 (76.7) |
| Unknown (%) | 267 (19.0) |
| Documented comorbidities | |
| Diabetes (%) | 111 (7.9) |
| Depression (%) | 253 (18.0) |
| Chronic kidney disease (%) | 18 (1.3) |
| Coagulation disorder (%) | 47 (3.3) |
| Cryoglobulinaemia (%) | 7 (0.5) |
| Cancer (%) | 77 (5.4) |
| Liver cancer (%) | 16 (1.1) |
| Other cancer (%) | 61 (4.3) |
| Severity of liver disease | |
| Cirrhosis (%) | 408 (29.1) |
| Decompensated (%) | 121 (8.6) |
| Child–Pugh Score: median (IQR) | 6.0 (2.0) |
| Fibroscan stiffness: median (IQR) | 8.4 (11.6) |
| MELD Score: median (IQR) | 9.0 (4.7) |
| Patients treated (%) | 914 (65.1) |
For a continuous variable, median and the IQR are calculated, for a categorical variable, the number and the percentage of patients is provided.
*Patients who had at least one episode of treatment recorded.
MELD, Model For End-Stage Liver Disease.