Literature DB >> 33907946

Identifying Novel Biomarkers Ready for Evaluation in Low-Prevalence Populations for the Early Detection of Lower Gastrointestinal Cancers: A Systematic Review and Meta-Analysis.

Paige Druce1, Natalia Calanzani2, Claudia Snudden2, Kristi Milley3, Rachel Boscott2, Dawnya Behiyat2, Javiera Martinez-Gutierrez4, Smiji Saji2, Jasmeen Oberoi3, Garth Funston2, Mike Messenger5, Fiona M Walter3,2, Jon Emery3,2.   

Abstract

INTRODUCTION: Lower gastrointestinal (GI) cancers are a major cause of cancer deaths worldwide. Prognosis improves with earlier diagnosis, and non-invasive biomarkers have the potential to aid with early detection. Substantial investment has been made into the development of biomarkers; however, studies are often carried out in specialist settings and few have been evaluated for low-prevalence populations.
METHODS: We aimed to identify novel biomarkers for the detection of lower GI cancers that have the potential to be evaluated for use in primary care. MEDLINE, Embase, Emcare and Web of Science were systematically searched for studies published in English from January 2000 to October 2019. Reference lists of included studies were also assessed. Studies had to report on measures of diagnostic performance for biomarkers (single or in panels) used to detect colorectal or anal cancers. We included all designs and excluded studies with fewer than 50 cases/controls. Data were extracted from published studies on types of biomarkers, populations and outcomes. Narrative synthesis was used, and measures of specificity and sensitivity were meta-analysed where possible.
RESULTS: We identified 142 studies reporting on biomarkers for lower GI cancers, for 24,844 cases and 45,374 controls. A total of 378 unique biomarkers were identified. Heterogeneity of study design, population type and sample source precluded meta-analysis for all markers except methylated septin 9 (mSEPT9) and pyruvate kinase type tumour M2 (TuM2-PK). The estimated sensitivity and specificity of mSEPT9 was 80.6% (95% CI 76.6-84.0%) and 88.0% (95% CI 79.1-93.4%) respectively; TuM2-PK had an estimated sensitivity of 81.6% (95% CI 75.2-86.6%) and specificity of 80.1% (95% CI 76.7-83.0%).
CONCLUSION: Two novel biomarkers (mSEPT9 and TuM2-PK) were identified from the literature with potential for use in lower-prevalence populations. Further research is needed to validate these biomarkers in primary care for screening and assessment of symptomatic patients.

Entities:  

Keywords:  Biomarkers; Clinical practice; Colorectal cancer; Early detection; Lower gastrointestinal cancers; Primary care

Year:  2021        PMID: 33907946      PMCID: PMC8078393          DOI: 10.1007/s12325-021-01645-6

Source DB:  PubMed          Journal:  Adv Ther        ISSN: 0741-238X            Impact factor:   3.845


Key Summary Points

Digital Features

This article is published with digital features, including a summary slide, to facilitate understanding of the article. To view digital features for this article go to 10.6084/m9.figshare.13664042.

Introduction

Gastrointestinal (GI) cancers account for over 25% of global cancer incidence and 35% of all cancer-related deaths [1]. Lower GI cancers, particularly colorectal cancer (CRC), contribute the most significant proportion with over 1.8 million new cases in 2018 [1]. CRC is the most commonly diagnosed GI cancer and constitutes 1 in 10 cancer cases and deaths [1]. Around 90% of patients with cancer first present with symptoms in primary care, highlighting a key role for primary care providers in the early detection of GI cancers [2, 3]. Diagnosis of GI cancers can prove challenging in the community setting: while gastrointestinal symptoms are commonly encountered, they are usually due to benign or self-limiting conditions and rarely to GI cancer [3]. Initial symptoms are often non-specific, and more specific symptoms usually represent more advanced disease [3]. Increased demand for diagnostic services for lower GI cancer and pressure on waiting times have been seen internationally in countries like Australia, the UK and Canada, where primary care plays a ‘gatekeeping’ role to specialist care. In many countries implementation of faecal occult blood tests (FOBTs) or faecal immunochemical tests (FITs) for CRC screening and diagnostic triage adds further pressure on colonoscopy services. In some healthcare systems, over-screening via colonoscopy is also an issue [4]. New diagnostic approaches are needed to help reduce the burden on specialist care, particularly in the current context of COVID-19 and associated delays in access to cancer diagnostic and treatment services [5]. There is considerable interest in the potential of biomarkers to detect GI cancers [3]. To date, carcinoembryonic antigen (CEA) and carbohydrate antigen 19-9 (CA19-9) have played an important role in clinical practice to detect recurrent disease, but their diagnostic performance is inadequate for the early detection of new disease [3, 6, 7]. Substantial investment has been made developing new biomarkers for early detection, but most studies of these tests have occurred in specialist clinical settings [8] where cancer prevalence is higher than in the community settings where they would eventually be applied [9, 10] The performance characteristics of a diagnostic test are strongly determined by the prevalence and severity of the target disease and of other diseases within the study population [9]. In populations where the prevalence of the target disease is low (e.g. GI cancer in primary care), the corresponding positive predictive values (PPV) are lower than in high-prevalence populations. Tests that are evaluated only in these high-prevalence populations tend to have lower sensitivity and higher specificity when translated to low-prevalence populations [10, 11]. This is known as the ‘spectrum effect’, and has crucial implications for comparing the performance of tests in different populations [9, 10]. In recognition of these issues, the CanTest Framework was developed (Fig. 1) [10]. This novel framework encompasses a translational pathway for diagnostic tests, from new test discovery to health system implementation in low-prevalence populations [11]. The framework highlights the importance of evaluating clinical performance, implementation, patient safety, quality of care, and cost-effectiveness in the intended setting. It is vital that these elements are investigated alongside test performance in order to ascertain clinical utility and improved outcomes for patients [8].
Fig. 1

The CanTest framework.

Source: Walter et al. 2019 [10]

The CanTest framework. Source: Walter et al. 2019 [10] This review aimed to systematically identify novel biomarkers for the early detection of lower GI cancers that have measures of diagnostic performance and show sufficient promise to warrant further evaluation in low-prevalence populations.

Methods

Search Strategy and Inclusion/Exclusion Criteria

These have been reported elsewhere [12]. The protocol for this review was registered on PROSPERO (registration ID CRD42020165005) and the Preferred Reporting Items for Systematic Reviews and Meta-Analysis of Diagnostic Test Accuracy Studies (PRISMA-DTA) statement was followed [13]. MEDLINE, Embase, Emcare and Web of Science were electronically searched for primary studies published in English between the 1 January 2000 and 31 October 2019. The search strategy was developed with the assistance of a medical librarian (Appendix 1 in the supplementary material). Studies eligible for inclusion were situated within phase 2 (i.e. providing measures of diagnostic accuracy beyond discovery, even if carried out in high-prevalence settings) and phase 3 (i.e. examining diagnostic accuracy in intended low-prevalence settings, and providing measures of clinical utility, including feasibility and acceptability) of the CanTest framework [10] (Fig. 1). We included studies which reported measures of diagnostic performance in an independent population (i.e. beyond measures from the initial discovery phase). Studies were excluded if no references to previous research evaluating biomarker performance were available, and if the study provided only one set of performance measures (reflecting discovery phase only). As studies beyond the discovery phase require larger sample sizes [14, 15], we included those which reported data on at least 50 cancer cases and at least one group of homogeneous non-cancer controls (n ≥ 50) with similar clinical characteristics (e.g. healthy, or with non-malignant conditions), as in previous reviews [15, 16]. We included studies on non-invasive biomarkers feasible for use in the community setting: blood (serum and plasma), urine, faecal, salivary or breath samples. Both observational (cohort or case–control, cross-sectional or longitudinal, prospective or retrospective) and experimental designs were eligible for inclusion. Studies undertaken in all recruitment settings were included. We included studies if they reported on at least one measure of diagnostic performance, namely sensitivity, specificity, PPV, negative predictive value (NPV), false positive, false negative or area under the curve (AUC) for biomarkers used to detect lower GI cancers, including colorectal (colon, rectum, caecum) or anal cancers, in adult populations (mean/median age 18 or older; studies including individuals aged less than 18 were accepted if these were outliers in large samples). Non-specified GI cancers, neuroendocrine cancers, and studies only reporting on familial populations at risk of hereditary cancers were excluded. Novel biomarkers were considered individually, in combination or as part of a panel. Studies reporting only on a single, established biomarker (CEA, CA19-9, or FIT or FOBT) were excluded [16, 17]. Studies providing measures of diagnostic performance for combinations of established and novel biomarkers were included. Covidence systematic review software [18] was used to facilitate article screening. Titles and abstracts were screened independently by two reviewers (any two of PD, NC, CS, KM, DB or RB). Full-text articles were also independently evaluated for inclusion by two reviewers (any two of the aforementioned). Reference lists of included studies were manually reviewed by one author to identify additional studies (NC). Full-text articles selected at this stage were also independently assessed by two reviewers (any two of PD, NC, RB or DB). Disagreements were resolved by consensus; when this could not be reached, a senior reviewer was consulted (JE or FMW).

Data Extraction and Analysis

Data extraction was piloted to ensure consistency and was performed independently (by SS, DB, RB, JMG, JO). Information on study characteristics, populations, biomarkers and measures of diagnostic performance were extracted. When studies reported on different phases of biomarker development, only data from the eligible phases were extracted. When studies had more than one eligible phase, data were extracted for all eligible phases. Extracted data were collated and checked for consistency and inaccuracies (PD). Biomarkers were categorised according to a modified version of the classifications reported by Uttley et al. [15]: microRNAs and other RNAs, autoantibodies and other immunological markers, other proteins (i.e. proteins that did not fit into other categories), metabolic markers, DNA-related markers (protein-coding genes, gene mutations), circulating tumour DNA, DNA methylation markers and other biomarkers. Controls were classified as normal/healthy, having non-malignant conditions or those with adenomas/polyps. Controls described as healthy were coded as such unless studies described underlying conditions. Full details of the control population classification are available in Appendix 2 in the supplementary material.

Quality Assessment and Risk of Bias

Considering the key issue of spectrum bias, studies were classified as either single-gate or two-gate designs. Single-gate studies recruit participants before disease status is known, with a single route of entry, and with the same inclusion criteria. Two-gate studies recruit participants through different routes and use different criteria for cases and controls. This can lead to over-inflated measures of diagnostic performance for example if there is an over-representation of individuals with advanced disease within the study population and comparison with the ‘fittest of the fit’ healthy controls [11]. One author (PD) classified all studies and another (NC) checked the classification. A full description of this classification and how it approaches issues covered by the QUADAS-2 critical appraisal tool [19] is available in Appendix 3 in the supplementary material. Studies included in the meta-analysis were assessed using the QUADAS-2 [19] tool by two reviewers (PD and NC).

Data Synthesis

As significant heterogeneity was anticipated, we used narrative synthesis to summarise the data [20]. An overview of the evidence was developed to describe the key characteristics of the included studies, their populations, biomarkers and outcome measures. Data were examined for similarities that would allow for subgroup analyses, namely the same biomarker, with similar study design and appropriate accuracy performance measures. For meta-analysis to occur, biomarkers had to be investigated in more than two studies, with individual outcome measures provided, similar populations included and a single-gate study design. We focused the analysis on single-gate studies, as this design reduces spectrum bias, and is more likely to provide results that translate for use in low-prevalence populations. Meta-analysis of diagnostic test accuracy was performed using MetaDTA (version 1.43) [21] and RevMan (5.3) [22] software. For meta-analysis, we used the random effects bivariate binomial model of Chu and Cole fitted as a generalised linear mixed effect model [21]. Sensitivity and specificity were jointly modelled and the estimates from each study were assumed to vary [21]. Hierarchical summary receiver operating characteristic (HSROC) parameters were estimated using the bivariate model parameters. Summary points of sensitivity and specificity were presented alongside forest plots and SROC curves. Heterogeneity and threshold effects were evaluated using the SROC plots and random effects correlation.

Compliance with Ethics Guidelines

This article is based on previously conducted studies and does not contain any studies with human participants or animals performed by any of the authors.

Results

A total of 16,597 records were identified in database searches; 9172 were retained after removing duplicates (Fig. 2). During title and abstract screening 8179 records were excluded. After assessing the full text of the remaining 993 records, 731 of them were excluded. Of the remaining studies, 142 are included in this review. The characteristics of included studies are described further in Table 1, and measures of diagnostic performance are described in supplementary Table 1.
Fig. 2

Study selection

Table 1

Characteristics of included studies: country, setting and population

Author (year)Country (population)SettingaCases and controlsBiomarkers
Cases (N)Controls (N)Type (N)
HospOtherCRCA/PAllHCNMA/PmiRNAAutoabDNAProteinMetabctDNAOtherb
Ahlquist et al. (2012) [23]USA, Canada, DenmarkX820960000114
Amiot et al. (2014) [24]FranceX6601231230343
Bagaria et al. (2013) [25]IndiaX5005050002
Broll et al. (2001) [26]GermanyUU12206565002
Bunger et al. (2012) [27]GermanyX836050500012
Calistri et al. (2009) [28]ItalyX1000100100002
Cao et al. (2019a) [29]ChinaXU6205959001
Cao et al. (2019b) [30]ChinaX1180858500121
Chan et al. (2010) [31]Taiwan, ChinaX94054540051
Chang et al. (2016) [32]Taiwan, ChinaUU2290368368002
Chang et al. (2011) [33]ChinaX606057570051
Chang et al. (2014) [34]Taiwan, ChinaUU560120120005
Chao et al. (2013) [35]Canada, USAXX3160328328007
Chen et al. (2019) [36]ChinaX1110114114003
Choi et al. (2018) [37]KoreaUU69074740011
Church et al. (2014) [38]USA, GermanyX530145793405231
Ciarloni et al. (2016) [39]Switzerland, South KoreaX520747400274
DeVos et al. (2009) [40]Germany, USAUU900155155001
Duran-Sanchon et al. (2019) [41]SpainX67035321704839
Duvillard et al. (2014) [42]FranceX22402520252011
Fan et al. (2017) [43]Taiwan, ChinaUU9201001000041
Fernandes et al. (2005) [44]BrazilX1691691001000011
Flamini et al. (2006) [45]ItalyX755775750011
Fouad et al. (2017) [46]EgyptUU5005005001
Fu et al. (2018) [47]ChinaX9803292537601
Fung et al. (2015) [48]AustraliaX9809909907
Gao et al. (2018) [49]ChinaX279104740003
Groblewska et al. (2010) [50]PolandXU9109191004
Grutzmann et al. (2008) [51]GermanyUU127036918414901
Guo et al. (2018) [52]ChinaX10702181200985
Han et al. (2019) [53]South KoreaX24519924524524501
Hao et al. (2018) [54]ChinaUU18609797002
Hata et al. (2017) [55]JapanXX2250916916001
Haug et al. (2007) [56]GermanyX650917917001
He et al. (2010) [57]ChinaX1820170170003
Herreros-Villanueva et al. (2019) [58]SpainX9678101100106
Imaoka et al. (2016) [59]JapanX21105757001
Imperiale et al. (2014) [60]USA, CanadaX650916762740289312
Jaberie et al. (2019) [61]IranXU1132425150002
Jensen et al. (2018) [62]DenmarkX14309191003
Jin et al. (2015) [63]ChinaX13503419102501
Johnson et al. (2014) [64]USAUU10101719407711
Jones et al. (2016) [65]USAUU680686800112
Karam et al. (2018) [66]EgyptXU6507070001
Karl et al. (2008) [67]GermanyX1860252252005
Kim et al. (2017) [68]KoreaX16660336120811352
Kim et al. (2015) [69]KoreaX13906060002
Koga et al. (2013) [70]JapanX11701071070011
Lee et al. (2013a) [71]KoreaX10109696001
Lee et al. (2013b) [72]KoreaXU132022812467373
Li et al. (2019a) [73]ChinaX1470147147001
Li et al. (2019b) [74]ChinaX6260155155001
Li et al. (2012) [75]ChinaX700141141001
Liu et al. (2013) [76]ChinaXU20008080002
Liu et al. (2016) [77]ChinaX1480320808016011
Liu et al. (2018a) [78]ChinaX1300140900502
Liu et al. (2018b) [79]ChinaX500100500502
Lumachi et al. (2012) [80]ItalyUU102099099014
Luo et al. (2019) [81]ChinaX57019219200121
Marcuello et al. (2019) [82]SpainX5908080006
Marshall et al. (2010) [83]Canada, USAX20202080007
Matsubara et al. (2011) [84]JapanX10101091090011
Matsushita et al. (2005) [85]JapanX1161698383001
Melotte et al. (2015) [86]GermanyX6657240240002
Meng et al. (2012) [87]ChinaX930158158002
Min et al. (2019) [88]ChinaX5807606974
Mizuno et al. (2003) [89]JapanX1000100100001
Mroczko et al. (2010) [90]PolandX751047070004
Mroczko et al. (2006) [91]PolandX7606565003
Mulder et al. (2007) [92]NetherlandsX5206363001
Murakoshi et al. (2011) [93]JapanX11502302300011
Murata et al. (2012) [94]JapanX2521998181009
Ng et al. (2017) [95]Hong KongX117090900021
Nielsen et al. (2011) [96]DenmarkX29404202217311768432
Ning et al. (2018) [97]ChinaXU51307575004
Nishiumi et al. (2012) [98]JapanX590636300227
Niu et al. (2012) [99]ChinaX119783973009701
Ørntoft et al. (2015) [100]DenmarkX1280150150001
Palmqvist et al. (2003) [101]SwedenX12402430002
Pedersen et al. (2013) [102]England, Wales, Northern IrelandX972429400018
Pedersen et al. (2015) [103]Australia, NetherlandsX1290128845054229611
Peng et al. (2017) [104]ChinaXU55905595590012
Pengjun et al. (2013) [105]ChinaX14906969004
Qian et al. (2018) [106]GermanyX2120106106005
Qu et al. (2019) [107]ChinaX1000100100005
Ren et al. (2016) [108]ChinaX42201019747272011
Rho et al. (2018) [109]JapanX51460386168591595
Ritchie et al. (2010) [110]USA, JapanX700700003
Ritchie et al. (2013) [111]CanadaXX980964096401
Ruffin et al. (2010) [112]USAUU69137937023011
Schneider et al. (2005) [113]GermanyUU2470530003
Shastri et al. (2008) [114]GermanyUU55695164981801
Shastri et al. (2006) [115]GermanyUU74012811711601
Shi et al. (2019) [116]ChinaX21101030010321
Sithambaram et al. (2015) [117]MalaysiaUU1000200200001
Song et al. (2017a) [118]ChinaX850324324001
Song et al. (2018a) [119]GermanyX465088261002721
Song et al. (2017b) [120]ChinaX388083759002471
Song et al. (2018b) [121]ChinaX7830794331046321
Stojkovic Lalosevic et al. (2019) [122]SerbiaX3000300300003
Sun et al. (2019a) [123]ChinaX1330587494118231
Sun et al. (2019b) [124]ChinaUU105010210200113
Swellam et al. (2016) [125]EgyptUU16205757005
Symonds et al. (2016) [126]AustraliaX661692462460011
Tagore et al. (2003) [127]USAX525721211309923
Taguchi et al. (2015) [128]USAUU60060600010
Tang et al. (2011) [129]ChinaX16901393001091
Tibble et al. (2001) [130]UKUU620233023301
Toiyama et al. (2013) [131]JapanX2001045353002
Tomasevic et al. (2016) [132]SerbiaX1810191019102
Toth et al. (2012) [133]HungaryUU92092920011
Uchiyama et al. (2018) [134]JapanX5606060005
Vychytilova-Faltejskova et al. (2016) [135]Czech RepublicX203199100100004
Vychytilova-Faltejskova et al. (2018) [136]Czech RepublicX1790100100004
Wang et al. (2018) [137]ChinaUU9606060001
Wang et al. (2017a) [138]ChinaX91091910010
Wang et al. (2014) [139]ChinaXU113059590062
Wang et al. (2007) [140]TaiwanXU157788080004
Wang et al. (2017b) [141]ChinaX60050500012
Wang et al. (2016) [142]ChinaX12001201200041
Warren et al. (2011) [143]USA and RussiaXX50242940001
Wilhelmsen et al. (2017) [144]DenmarkX51203320197813420161
Wu et al. (2014) [145]Hong Kong, ChinaX1040109109001
Wu et al. (2012) [146]Hong KongX880101101002
Wu et al. (2017) [147]ChinaX1350140140001
Wu et al. (2015) [148]ChinaX1000100100002
Wu et al. (2016) [149]ChinaX29102952950021
Xie et al. (2017) [150]ChinaX132601070002
Xie et al. (2018) [151]ChinaX123012519106031
Xu et al. (2013) [152]ChinaXX87073730021141
Yang et al. (2018) [153]ChinaUU5005000021
Yau et al. (2016) [154]Hong KongUU1980198198001
Yuan et al. (2016) [155]ChinaX18701091090051
Zhang et al. (2015a) [156]ChinaX13860111046652
Zhang et al. (2016) [157]ChinaX8001711165506
Zhang et al. (2015b) [158]JapanUU13005454004
Zhao et al. (2019a) [159]ChinaX1170166166002
Zhao et al. (2019b) [160]ChinaX3580286286001
Zheng et al. (2014) [161]ChinaX117017510207341
Zhou et al. (2017) [162]ChinaX2420262262001
Zhu et al. (2013) [163]ChinaXU2690110110005
Zhu et al. (2015) [164]ChinaX70070700011

autoab autoantibodies and other immunological markers, ctDNA circulating tumour DNA, HC healthy control, Hosp hospital, Ind individual, NM non-malignant, A/P adenomas/polyps, U unclear, UK United Kingdom, USA United States of America

aDue to wide variations in health systems across different countries, hospital setting is a broad definition than can encompass secondary and tertiary care. Other setting refers to biobanks, reference sets, databases or archived samples; general population cohorts or cohorts from population screening programmes; or cohorts from previous trials or observational studies

bOther biomarker type refers to methylation markers, platelets, white blood cells, red blood cells and colonocytes

Study selection Characteristics of included studies: country, setting and population autoab autoantibodies and other immunological markers, ctDNA circulating tumour DNA, HC healthy control, Hosp hospital, Ind individual, NM non-malignant, A/P adenomas/polyps, U unclear, UK United Kingdom, USA United States of America aDue to wide variations in health systems across different countries, hospital setting is a broad definition than can encompass secondary and tertiary care. Other setting refers to biobanks, reference sets, databases or archived samples; general population cohorts or cohorts from population screening programmes; or cohorts from previous trials or observational studies bOther biomarker type refers to methylation markers, platelets, white blood cells, red blood cells and colonocytes

Characteristics of Included Studies

Most papers (n = 124) recruited patients from a single country. China was the most common country (n = 62), followed by Japan and Germany (both n = 14), and the USA (n = 13). Most studies recruited from single settings with few studies recruiting from at least two different settings (n = 11). The most common recruitment settings were hospitals and other secondary care settings (n = 106). Only one study recruited controls from a primary care setting [57]. All included studies reported on CRC, with six studies specifically referring to colon cancer, and one study specifying rectal and caecum cancer cases separately to colon cancer. Some studies (n = 22) also referred to adenomas or polyps as cases, and five studies also included data on upper GI cancers (e.g. gastric, oesophageal and pancreatic cancers).

Characteristics of Cases and Controls

Overall, the included studies reported on 24,844 cases; the majority were diagnosed with CRC (80.2%) and a minority with adenomas/polyps (19.8%). Most cases had their age reported (79%), either as a range, mean or median. The overall mean age for CRC cases was 61.3 years, and 60.7 years for adenoma/polyp cases. The minimum age for cases was 18 years, while the oldest was 97 years old. The majority (59%) of CRC and adenoma/polyp cases were male. Most studies provided data on tumour staging, mainly using the TNM system (n = 101), though some studies used Dukes’ classification (n = 22), with one study providing data for both. When combining TNM and Dukes’ staging data, over half of the cancers (54%) were diagnosed at early stages (I–II/A + B). Adenomas included as cases were most frequently defined by size, dysplasia, villous component and/or number of adenomas. The included studies reported on a total of 45,374 controls (31,352 normal/healthy, 6414 with non-malignant conditions and 7608 with adenomas or polyps). A number of studies (n = 37) investigated more than one type of control population. The control populations of most studies (n = 108) were tested to rule out CRC, mainly using colonoscopy (n = 65). The majority of studies (n = 17) with adenomas or polyps as controls included those that were low risk (hyperplastic, non-neoplastic polyps or non-advanced adenomas), though some were high risk (advanced adenomas, those with villous histology or high-grade dysplasia). Age data were extractable for 47.1% of controls. The minimum age for a control was 16 years (healthy control), while the oldest was 99 years old. The majority of both healthy (50.6%) and non-malignant (58%) controls were male.

Types of Biomarkers

Most studies investigated more than one biomarker (79.6%), and these often reported on measures of performance for individual and combinations or panels of biomarkers (45.8%). The commonest sample source was blood (82.4%); these analysed serum (n = 62), plasma (n = 41) or whole blood (n = 14). Faeces was also a common sample source (24.6%); two studies analysed urine, and 13 studies analysed more than one type of sample. A total of 378 unique biomarkers were identified across the 142 included studies (Appendix 4 in the supplementary material). The commonest biomarkers were microRNAs and other RNAs, followed by proteins, DNA markers, autoantibodies and other immunological markers, and metabolic markers. Proteins were further classified into subcategories, with the most common being novel proteins (Table 2).
Table 2

Classification of identified biomarkers

Identified biomarkers (142 studies)N (%)
MicroRNAs and other RNAs126 (33.3%)
DNA markers (protein coding genes, mutations)45 (12.2%)
Proteins86 (22.8%)
 Adhesion and matrix proteins11 (2.9%)
 Classic tumour markers6 (1.6%)
 Coagulation and angiogenesis molecules6 (1.6%)
 Cytokines, chemokines and insulin-like growth factors15 (4.0%)
 Hormones1 (0.3%)
 Novel proteins39 (10.3%)
 Not otherwise specified8 (2.1%)
Autoantibodies and other immunological markers44 (11.6%)
Metabolic markers42 (11.1%)
Circulating tumour DNA4 (1.1%)
DNA methylation15 (4.0%)
Other biomarkersa16 (4.2%)

aOther biomarkers included platelets, white blood cells, red blood cells and colonocytes

Classification of identified biomarkers aOther biomarkers included platelets, white blood cells, red blood cells and colonocytes A total of 54 biomarkers were reported in more than one study (Appendix 5 in the supplementary material). Three biomarkers were investigated by more than 10 studies: CA19-9, CEA and mSEPT9 (methylated septin 9). Additionally, six other biomarkers were investigated in five or more studies: tumour pyruvate kinase isoenzyme type M2 (TuM2-PK), microRNA-21 (miR-21), FIT, microRNA-92a (miR-92a), cancer antigen 72-4 (CA72-4) and TIMP metallopeptidase inhibitor 1 (TIMP-1) (see Appendix 5 for references).

Measures of Diagnostic Performance

Individual measures of diagnostic performance (i.e. measures outside of combinations or panels) were available for 35 biomarkers evaluated more than once (Appendix 5 in the supplementary material). Heterogeneity of study design and included populations precluded meta-analysis for the majority of these biomarkers; however, three had individual measures from multiple studies adopting a classic single-gate design: CEA (n = 7 studies), mSEPT9 (n = 4 studies) and TuM2-PK (n = 3 studies). Differences in the sample sources and diagnostic performance measures provided across the studies precluded meta-analysis for any accuracy measures available for CEA, which was included as a comparator to the novel markers. Meta-analysis was performed for the markers mSEPT9 and TuM2-PK. The estimated sensitivity and specificity of mSEPT9 was 80.6% (95% CI 76.6–84.0%) and 88.0% (95% CI 79.1–93.4%), respectively, and the diagnostic odds ratio was 30.3 (95% CI 17.8–51.4). TuM2-PK had an estimated sensitivity of 81.6% (95% CI 75.2–86.6%) and a specificity of 80.1% (95% CI 76.7–83.0%), and a diagnostic odds ratio of 17.8 (95% CI 11.6–27.2). Paired forest plots of the sensitivity and specificity for both mSEPT9 and TuM2-PK are shown in Figs. 3 and 4.
Fig. 3

Forest plots of sensitivity and specificity for mSEPT9 in plasma

Fig. 4

Forest plots of sensitivity and specificity for TuM2-PK in stool

Forest plots of sensitivity and specificity for mSEPT9 in plasma Forest plots of sensitivity and specificity for TuM2-PK in stool The random effects correlation for mSEPT9 was − 1, indicating a significant threshold effect. Heterogeneity and threshold effect were harder to evaluate statistically for the meta-analysis of TuM2-PK as the low number of included studies impeded accurate fitting of the HSROC curve and generation of a random effects correlation. A cut-off value of 4 U/ml was used for the TuM2-PK assays across all studies. The studies included in the meta-analyses were at low risk of bias across most domains, except for the domains related to patient selection and the index test. Full appraisal data can be found in Appendix 6 in the supplementary material. Summary plots including risk of bias and applicability ratings from QUADAS-2 are shown in Figs. 5 and 6.
Fig. 5

HSROC curve for mSEPT9 (with risk of bias and applicability ratings)

Fig. 6

HSROC plot for TuM2-PK (with risk of bias and applicability ratings)

HSROC curve for mSEPT9 (with risk of bias and applicability ratings) HSROC plot for TuM2-PK (with risk of bias and applicability ratings)

Discussion

This systematic review identified 142 studies reporting on 378 different biomarkers for CRC. The included papers were very heterogeneous, with differences in study design, control populations, sample sources, types of biomarkers, test thresholds and reported performance measures. Meta-analysis of diagnostic accuracy data was only possible for two novel markers: mSEPT9 and TuM2-PK. Both demonstrated high sensitivity, specificity and diagnostic odds ratios in hospital populations. The most common biomarkers (both individually and in panels) were CEA, CA19-9, mSEPT9 and TuM2-PK. CEA and CA19-9 have a more established role in clinical practice for detecting recurrent disease [3, 7] so it is not surprising that these markers are prevalent throughout the literature. Most of the studies included CEA (42/53) and CA19-9 (20/21) in panels or used them as comparators for novel markers. Meta-analysis was not possible for these studies because of heterogeneity in sample sources and performance measures. Twenty studies reported on the performance of mSEPT9 for CRC detection, mostly as a blood-based biomarker sampled from plasma. While most measures of diagnostic performance were for mSEPT9 as an individual marker, it was also included in panels or combinations across seven studies. Fewer studies reported on the performance of TuM2-PK (nine overall, three included TuM2-PK in panels or combinations). Unlike mSEPT9, TuM2-PK was predominantly sampled from stool, though some studies also reported it as a blood-based biomarker. The studies that evaluated mSEPT9 and TuM2-PK included a number of two-gate studies or hybrid designs, and multiple instances where the study design was unclear. The meta-analyses for mSEPT9 and TuM2-PK included only those with a clear, classic single-gate design [11] to reduce heterogeneity and spectrum bias; consequently, both meta-analyses included a low number of studies, resulting in wide confidence intervals for the diagnostic odds ratios. The meta-analysis for mSEPT9 synthesised diagnostic performance data on 899 CRC cases. Cancer cases were mostly diagnosed in stages II or III and diagnostic performance data were also provided for adenomas and polyps in most cases. This is important to note, as the diagnostic performance results for early stage cancers are more likely to translate for use in early detection, and the ability for biomarkers also to detect high-risk adenomas, polyps or dysplasia could provide additional clinical utility. Across all studies, the test sensitivity was higher when detecting advanced CRC cases. Conversely, test sensitivity decreased when used to detect either adenomas or degrees of dysplasia. As previously mentioned, several studies evaluated mSEPT9 within diagnostic panels or in combination with other markers. Three studies in particular [66, 151, 153] showed the sensitivity of mSEPT9 to detect CRC increased when combined with more established markers such as FIT and CEA. The results from our review show a slightly higher sensitivity for mSEPT9 in comparison to a recent meta-analysis of 19 studies [165] though it should be noted the analysis from that review included a mixture of study designs and focused on high-risk populations. Our results are comparable to previous analyses which estimated the sensitivity of mSEPT9 as up to 88% [165, 166]. The meta-analysis of diagnostic accuracy data for TuM2-PK as a stool marker included 183 CRC cases. Similarly to mSEPT9, the sensitivity of TuM2-PK was higher for more advanced cancers (Dukes’ stage C and D; stages III and IV) and lower when it was used to detect adenomas, polyps or dysplasia. All three studies included in the meta-analysis compared the diagnostic performance of TuM2-PK to the established stool marker FIT, and demonstrated that FIT was preferable to TuM2-PK as a faecal biomarker for screening populations [94, 116, 117]. Three studies [50, 67, 115] evaluated TuM2-PK as a blood-based biomarker in combination with other markers or in panels, and all found sensitivity to be higher for TuM2-PK in combination with other markers. TuM2-PK may therefore be more promising in blood-based diagnostic panels than as a stand-alone stool marker. Two-gate and hybrid designs were used widely in the included studies. These types of study designs can lead to over-inflated measures of diagnostic performance due to an over-representation of individuals with advanced disease within the study population [11]. While many studies attempted single-gate designs and recruited participants through one route (usually screening populations where all participants attended for a colonoscopy), the low prevalence of CRC cases meant that extra cases were sourced from alternative routes. This study design issue highlights the importance of large-scale studies and trials that are adequately powered to evaluate diagnostic performance in truly low-prevalence populations. Several other methodological limitations were identified across the studies. These included the parallel analysis of large numbers of biomarkers during discovery studies; limited external, independent validation of test performance; and selective reporting for validation including alternative analyses and combinations or use of several cut-off points. Insufficient reporting regarding population characteristics and recruitment was also an issue in many studies, with information often provided as supplementary data and with little detail. As a result of the large amount of evidence on biomarker development and evaluation, we believe the field could benefit from a “living systematic review”; this refers to high-quality, up-to-date online summaries of evidence which can be constantly updated as new research becomes available [167]. Although our search was restricted to studies published in English, recent reviews indicate that this has minimal impact on review conclusions [168, 169]. Further limitations of this review include the exclusion of studies that evaluated biomarkers within risk assessment tools or risk prediction models. These studies have strong potential to be used in the community; however, we believe they should be investigated in a separate systematic review. The heterogeneity of the published literature meant we could only conduct meta-analyses on a limited subset of included studies. Nonetheless, we believe the narrative synthesis of additional studies provides a useful summary of the current state of the science in this area. There was insufficient homogeneous data on biomarker panels to report summary estimates of their diagnostic performance. A study from Fung et al. [48] describes ColoSTAT, a novel blood-based diagnostic panel for CRC that includes TuM2-PK with two other biomarkers (IL-8 and DKK-3) and is currently being trialled in Australia. The ColoSTAT panel has reported sensitivity and specificity of 73% and 95%, respectively, for CRC, which is comparable to reported values for FIT (64–73% and 92–95%, respectively [170-173]) for the detection of CRC in screening populations. Previous trials using this panel have been conducted in high-prevalence settings, with two-gate designs. Once further data are available on ColoSTAT and its performance to detect early stage CRC, it may have applicability in low-prevalence settings as an alternative to FIT, either for screening or in symptomatic populations.

Conclusion

There is a large body of evidence on novel biomarkers being developed to aid with the early detection of lower GI cancers. Few of these markers have yet demonstrated their validity or clinical utility, but two show promise for further evaluation, mSEPT9 and TuM2-PK, and could contribute towards the early detection of CRC as part of blood-based diagnostic panels. Further, large-scale studies in low-prevalence populations are required to evaluate their potential role to support diagnostic assessment in primary care and community settings. This review offers a comprehensive overview of the current state of evidence, situates it within a translational framework for diagnostic tests and makes recommendations in order to build the evidence base for the early detection of lower GI cancers in low-prevalence settings. Below is the link to the electronic supplementary material. Supplementary file1 (DOCX 67 KB) Supplementary file1 (DOCX 87 KB)
To our knowledge, this is the first systematic review to characterise the range of novel biomarkers being investigated for the early detection of lower GI cancers, with a focus on their readiness to progress to further evaluation in low-prevalence populations such as primary care.
We identified 378 unique biomarkers from the literature; a meta-analysis of diagnostic accuracy data indicated mSEPT9 and TuM2-PK have potential for further evaluation in low-prevalence populations.
We highlight the need for (1) further studies on mSEPT9 and TuM2-PK in low-prevalence populations; (2) better reporting to facilitate translation; (3) more consistency in the use of biomarkers. By doing so, we will be able to progress to a different step in the evaluation process of promising biomarkers, and ultimately ascertain clinical benefits for our intended population. This will require going beyond test performance, investigating implementation (including feasibility and acceptability), safety and cost-effectiveness.
  165 in total

1.  Free DNA and carcinoembryonic antigen serum levels: an important combination for diagnosis of colorectal cancer.

Authors:  Emanuela Flamini; Laura Mercatali; Oriana Nanni; Daniele Calistri; Roberta Nunziatini; Wainer Zoli; Paola Rosetti; Nice Gardini; Arturo Lattuneddu; Giorgio Maria Verdecchia; Dino Amadori
Journal:  Clin Cancer Res       Date:  2006-12-01       Impact factor: 12.531

2.  Matrix metalloproteinase 2 and tissue inhibitor of matrix metalloproteinases 2 in the diagnosis of colorectal adenoma and cancer patients.

Authors:  Magdalena Groblewska; Barbara Mroczko; Mariusz Gryko; Bogusław Kędra; Maciej Szmitkowski
Journal:  Folia Histochem Cytobiol       Date:  2010-12       Impact factor: 1.698

3.  Screening for colorectal neoplasms with new fecal occult blood tests: update on performance characteristics.

Authors:  James E Allison; Lori C Sakoda; Theodore R Levin; Jo P Tucker; Irene S Tekawa; Thomas Cuff; Mary Pat Pauly; Lyle Shlager; Albert M Palitz; Wei K Zhao; J Sanford Schwartz; David F Ransohoff; Joseph V Selby
Journal:  J Natl Cancer Inst       Date:  2007-09-25       Impact factor: 13.506

4.  Combination of preoperative NLR, PLR and CEA could increase the diagnostic efficacy for I-III stage CRC.

Authors:  Hong-Xin Peng; Lin Yang; Bang-Shun He; Yu-Qin Pan; Hou-Qun Ying; Hui-Ling Sun; Kang Lin; Xiu-Xiu Hu; Tao Xu; Shu-Kui Wang
Journal:  J Clin Lab Anal       Date:  2016-09-30       Impact factor: 2.352

5.  Identification of microRNA-135b in stool as a potential noninvasive biomarker for colorectal cancer and adenoma.

Authors:  Chung Wah Wu; Siew Chien Ng; Yujuan Dong; Linwei Tian; Simon Siu Man Ng; Wing Wa Leung; Wai Tak Law; Tung On Yau; Francis Ka Leung Chan; Joseph Jao Yiu Sung; Jun Yu
Journal:  Clin Cancer Res       Date:  2014-04-01       Impact factor: 12.531

6.  Identification and Validation of MicroRNA Profiles in Fecal Samples for Detection of Colorectal Cancer.

Authors:  Saray Duran-Sanchon; Lorena Moreno; Josep M Augé; Miquel Serra-Burriel; Míriam Cuatrecasas; Leticia Moreira; Agatha Martín; Anna Serradesanferm; Àngels Pozo; Rosa Costa; Antonio Lacy; Maria Pellisé; Juan José Lozano; Meritxell Gironella; Antoni Castells
Journal:  Gastroenterology       Date:  2019-10-14       Impact factor: 22.682

7.  The Diagnostic Accuracy of the M2 Pyruvate Kinase Quick Stool Test--A Rapid Office Based Assay Test for the Detection of Colorectal Cancer.

Authors:  Suresh Sithambaram; Ida Hilmi; Khean-Lee Goh
Journal:  PLoS One       Date:  2015-07-09       Impact factor: 3.240

8.  The role of mSEPT9 in screening, diagnosis, and recurrence monitoring of colorectal cancer.

Authors:  Jie Sun; Fei Fei; Mingqing Zhang; Yuwei Li; Xipeng Zhang; Siwei Zhu; Shiwu Zhang
Journal:  BMC Cancer       Date:  2019-05-14       Impact factor: 4.430

9.  Multiplex methylated DNA testing in plasma with high sensitivity and specificity for colorectal cancer screening.

Authors:  Guodong Zhao; Hui Li; Zixuan Yang; Zhenzhen Wang; Manqiu Xu; Shangmin Xiong; Shiming Li; XiaoTing Wu; Xiaoyu Liu; Ziwen Wang; Yun Zhu; Yong Ma; Sujuan Fei; Minxue Zheng
Journal:  Cancer Med       Date:  2019-08-12       Impact factor: 4.452

Review 10.  A Review of the Role of Carcinoembryonic Antigen in Clinical Practice.

Authors:  Claire Hall; Louise Clarke; Atanu Pal; Pamela Buchwald; Tim Eglinton; Chris Wakeman; Frank Frizelle
Journal:  Ann Coloproctol       Date:  2019-12-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.