Literature DB >> 27426280

Building the Evidence Base of Blood-Based Biomarkers for Early Detection of Cancer: A Rapid Systematic Mapping Review.

Lesley Uttley1, Becky L Whiteman2, Helen Buckley Woods1, Susan Harnan1, Sian Taylor Philips3, Ian A Cree4.   

Abstract

BACKGROUND: The Early Cancer Detection Consortium is developing a blood-test to screen the general population for early identification of cancer, and has therefore conducted a systematic mapping review to identify blood-based biomarkers that could be used for early identification of cancer.
METHODS: A mapping review with a systematic approach was performed to identify biomarkers and establish their state of development. Comprehensive searches of electronic databases Medline, Embase, CINAHL, the Cochrane library and Biosis were conducted in May 2014 to obtain relevant literature on blood-based biomarkers for cancer detection in humans. Screening of retrieved titles and abstracts was performed using an iterative sifting process known as "data mining". All blood based biomarkers, their relevant properties and characteristics, and their corresponding references were entered into an inclusive database for further scrutiny by the Consortium, and subsequent selection of biomarkers for rapid review. This systematic review is registered with PROSPERO (no. CRD42014010827).
FINDINGS: The searches retrieved 19,724 records after duplicate removal. The data mining approach retrieved 3990 records (i.e. 20% of the original 19,724), which were considered for inclusion. A list of 814 potential blood-based biomarkers was generated from included studies. Clinical experts scrutinised the list to identify miss-classified and duplicate markers, also volunteering the names of biomarkers that may have been missed: no new markers were identified as a result. This resulted in a final list of 788 biomarkers.
INTERPRETATION: This study is the first to systematically and comprehensively map blood biomarkers for early detection of cancer. Use of this rapid systematic mapping approach found a broad range of relevant biomarkers allowing an evidence-based approach to identification of promising biomarkers for development of a blood-based cancer screening test in the general population.
Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Assay; Biomarker; Blood; Cancer; Diagnosis; Early detection; Systematic review

Mesh:

Substances:

Year:  2016        PMID: 27426280      PMCID: PMC5006664          DOI: 10.1016/j.ebiom.2016.07.004

Source DB:  PubMed          Journal:  EBioMedicine        ISSN: 2352-3964            Impact factor:   8.143


Introduction

Early detection of cancer results in improved survival (Etzioni et al., 2003, Wolf et al., 2010, McPhail et al., 2015). Cancers detected early require less extensive treatment and are less likely to have spread to other organs. Cancer diagnosis requires histological examination of tissue abnormalities detected by radiological, clinical or endoscopic examination of patients. Detection, as opposed to diagnosis, relies on screening a largely asymptomatic population to identify people who may be at higher risk of having cancer than others. Screening tests for cancer, or any other condition need to fulfil strict criteria to prevent the implementation of inappropriate screening, ensuring screening is cost effective and benefits patients. The criteria applied within the UK are listed at http://www.screening.nhs.uk/criteria, based on those developed by Wilson and Jungner (Cochrane and Holland, 1971, Wilson and Jungner, 1968). For early cancer detection, a blood-based screening test would have to be cost effective and demonstrate a meaningful clinical benefit which outweighs the harms associated with false positive, indeterminate results and overtreatment. This is clearly a major undertaking, and needs a multidisciplinary approach. The Early Cancer Detection Consortium (ECDC) was established in 2012 in the United Kingdom and comprises 23 universities, their associated NHS hospitals, as well as other organisations and industry partners. The consortium was established to investigate whether a cost-effective screening test can be used in the general population to identify people with early cancers. Given the extensive literature on blood biomarkers for cancer, it is logical to explore the development of such a test using existing biomarkers that have the best evidence-base for cancer detection. A sensitive blood test for multiple tumour types could enable people with biomarker levels which are outside the typical range to receive further investigation and lead to earlier diagnosis of cancer at an asymptomatic stage when curative treatment is feasible. The next stage of the programme will involve analytical and clinical validation of these biomarkers in a case control study, from which a detection algorithm will be produced and validated for possible use as a generic cancer screen. Finally, a randomised controlled trial will be required to determine the clinical and cost-effectiveness of the resulting screening strategy. Previous reviews in this area have understandably been limited in scope, usually restricted to one biomarker or well-defined group of potential markers, due to the enormous number of publications in the field. The aim of this study was therefore to establish the full range of candidate blood-based biomarkers with potential for the early detection of cancer, and map key characteristics of the tests.

Methods

To identify all relevant biomarkers, comprehensive searches and innovative methods to perform the mapping review were employed to cope with the sizeable body of relevant literature to be assessed within a short time-frame. The mapping review comprised the following stages: comprehensive literature searches; data mining techniques for rapid screening of the search records and; development of a customizable database of evidence to optimise the output from the mapping review. It was not considered sufficient simply to list evidence by reference or to name the biomarker once in a spreadsheet and continue searching until another new biomarker was found. Instead it was more useful and time-efficient to maintain the corresponding citations for each biomarker and record the basic characteristics of the study at the time of screening. This enabled a basic informative profile to be built for each biomarker identified in the mapping review. This systematic review is registered with PROSPERO (no. CRD42014010827) and the methods have been structured around the PRISMA checklist (http://www.prisma-statement.org/).

Eligibility Criteria

Eligible studies included all English language studies from the past five years that investigated blood based biomarkers in more than 50 patients, see Table 1.
Table 1

Eligibility criteria for the systematic mapping review.

Inclusion criteriaExclusion criteria
English language studiesStudies published in non-English language
Studies within the last five years (2010–2014)Studies from 2009 or older
Controlled studiesNo healthy control group
Validation studiesDerivative studies from included papers
Cancer detection/diagnosisPrognosis or prediction (treatment response) associated markers
50 or more patientsLess than 50 patients
Biomarkers measured in bloodTissue or other bodily fluid samples
Abstracts of panels which do not state which biomarkers are studied
Citation titles without abstracts

Search Strategy

To identify a comprehensive body of literature from which a list of candidate biomarkers could be generated, a broad search using keywords and subject headings was undertaken. The terms reflected the concepts of ‘diagnosis’, ‘markers’, ‘blood’ and ‘screening’ (see supplementary material). The keywords and subject headings were developed using a variety of collaborative methods between Information Specialists and Systematic Reviewers at the University of Sheffield and researchers at the University of Warwick. A scoping search was performed and assessed for appropriateness. Additionally, key journal articles and abstracts in Medline were retrieved and assessed to obtain relevant subject headings and keywords. Clinical input was sought from members of the ECDC to verify and validate the chosen keywords. For the full search, relevant free-text, keyword and thesaurus terms were combined using Boolean operators and translated into database specific syntax. Full searches were limited to English language, humans and publication dated from 2010 to May 2014. The databases searched were Medline and Medline in Process, Embase, CINAHL, Cochrane Library (including Cochrane Database of Systematic Reviews, DARE, CENTRAL, HTA, NHS EED), Science Citation Index Expanded, Conference Proceedings Citation Index - Science, Book Citation Index – Science, and Biosis Previews. The initial search strategy was broad and inclusive. As a result, a large number of relevant records were obtained. Preliminary validation by consulting experts in the field indicated that the search was sensitive and no missing relevant literature was identified.

Sifting and Data Mining

The results of the initial searches were imported into a Reference Manager database. To identify an exhaustive list of biomarkers, retrieved records were searched iteratively within the Reference Manager database, using keywords to select potentially relevant titles. Titles and abstracts of this selection of citations were scrutinised for names and descriptions of biomarkers that met (or potentially met) the selection criteria (see Table 1). The citations were tagged to indicate that they had been viewed, to enable their exclusion from further searches. Relevant citations were exported to a Microsoft Access database which was customised to allow data extraction of relevant key information for each biomarker that was available from the corresponding study abstracts. The data mining process within the main database included the following restrictions (see Box 1): To ensure a comprehensive capture of all relevant biomarkers, a further validation stage was performed. Relevant reviews identified during the search were used to check for additional biomarkers not generated by the data mining process. ECDC members were invited to recommend papers that they believed to be relevant to the mapping review.

Data Collection

Each biomarker occupied a record with a unique identifier number in a customised Microsoft Access database which stored the number of associated papers, the abstract and reference details; associated synonyms and acronyms; types of cancers and study design; keywords used to retrieve the abstract during data mining; assays used to measure the biomarker, where reported; category to which the biomarker was assigned (e.g. auto-antibodies); and the sample types used, where reported (e.g. serum, plasma or whole blood).

Results

After duplicates were removed, 19,724 records were yielded from the comprehensive searches. Using data mining, 3990 titles and abstracts were retrieved from the 19,724 records for full scrutiny. Data mining is the process of pulling a subset of records from a large, unwieldy dataset. The subset of 3990 abstracts was reviewed in order to generate a list of biomarkers which are potentially relevant to early identification of cancer using blood. A full breakdown of the keywords used and the number of corresponding records retrieved can be seen in Fig. 1. During the validation process, three relevant reviews were consulted for the identification of any additional biomarkers. No further biomarkers were identified either from these reviews or from the consultation of ECDC members.
Fig. 1

Modified PRISMA 2009 flow diagram.

A total of 814 biomarkers were identified as potentially relevant to the review question and were subjected to further scrutiny, identifying duplicates and miss-classified biomarkers during a process of data cleaning and categorising the biomarkers into groups or families. These groups are currently arranged by molecular function in order to map the biomarkers by biological origin. Further research using this methodology and database into the empirical application and validation of each biomarker will allow the biomarkers to be grouped by clinical utility such as cancer type or platform. However, we have performed this analysis for colorectal cancer (Table 2) and lung cancer (Table 3) to illustrate how these data could be used to define cancer-specific biomarkers. This resulted in a final total of 788 biomarkers, grouped into 13 initial categories (see Supplementary Table 1, Supplementary Table 2, Supplementary Table 3, Supplementary Table 4, Supplementary Table 5, Supplementary Table 6, Supplementary Table 7, Supplementary Table 8, Supplementary Table 9, Supplementary Table 10, Supplementary Table 11, Supplementary Table 12, Supplementary Table 13) as follows:
Table 2

Colorectal cancer specific biomarkers from all 13 categories.

Biomarker categoriesID noBiomarkerAcronymCancer
Adhesion and matrix proteins7ClusterinCLIColorectal
12Ep cell adhesion module (GA733-2)EpCAM (GA733-2)Colorectal
22Metallopeptidase inhibitor 1TIMP1; TIMP-1Colorectal
Auto-antibodies & immunological markers2Anti-p53 antibodiesp53; serum p53 antibodies; p53-Abs; p-53-AAB; Anti-p53AbColorectal
19Anti-heat shock protein 60HSP60Colorectal
40IL2RBIL2RBColorectal
Classical tumour markers3Carcinoembryonic antigenCEAColorectal
8Carbohydrate antigen 19-9CA19-9; CA199Colorectal
Coagulation and angiogenesis molecules2Vascular endothelial growth factorVEGFColorectal
8Kininogen-1Kininogen-1Colorectal
23Endothelial cell-specific molecule-1ESM-1Colorectal
27ThrombomodulinTHBD-MColorectal
28Annexin A3ANXA3Colorectal
Cytokines, chemokines and insulin-like growth factors3Interleukin 8IL-8Colorectal
17Insulin-like growth factor-binding protein-2IGFBP-2Colorectal
26Brain-derived neurotrophic factorBDNFColorectal
28Interleukin-1raIL-1raColorectal
50TNFAIP6TNFAIP6Colorectal
Circulating-free DNA3Adenomatous polyposis coliAPCColorectal
9Septin 9Septin 9Colorectal
17Methylation of CYCD2CYCD2Colorectal
18Methylation of HIC1HIC1Colorectal
19Methylation of PAX 1PAX 1Colorectal
20Methylation of RB1RB1Colorectal
21Methylation of SRBCSRBCColorectal
34Line1 79 bpLine1 79 bpColorectal
35Line1 300 bpLine1 300 bpColorectal
36Alu 115 bpAlu 115 bpColorectal
37Alu 247 bpAlu 247 bpColorectal
HormonesNilNil
Metabolic markers1Plasma glucose levelsPlasma glucose levelsColorectal
53-Hydroxypropionic acid and pyruvic acid3-Hydroxypropionic acid and pyruvic acidColorectal
6Alaninel-Alanine, glucuronoic lactoneColorectal
7l-GlutamineGlutamineColorectal
8SarcosineSarcosineColorectal
11CholinePhosphatidylcholine; (PC) (34 : 1)Colorectal
12PhosphatidylinositolPhosphatidylinositolColorectal
17l-ValineValineColorectal
18l-ThreonineThreonineColorectal
191-Deoxyglucose1-DeoxyglucoseColorectal
20GlycineGlycineColorectal
21MACF1MACF1Colorectal
22Apolipoprotein HAPOH; beta-2-glycoproteinColorectal
23Alpha-2-macroglobulinA2MColorectal
24Immunoglobulin lambda locusIGL@Colorectal
25Vitamin D-binding proteinVDBColorectal
302-Hydroxyglutarate2-HydroxyglutarateColorectal
342-Hydroxybutyrate2-HydroxybutyrateColorectal
35Aspartic acidAspartic acidColorectal
36KynurenineKynurenineColorectal
37CystamineCystamineColorectal
50Tricarboxylic acidTCAColorectal
532-Aminoethanesulfonic acidTaurineColorectal
54LactateLactateColorectal
55PhosphocholinePhosphocholineColorectal
56ProlineProlineColorectal
57PhenylalaninePhenylalanineColorectal
102OleamideOleamideColorectal
111Leukocyte methylated cytosine 55-mCColorectal
116Plasma choline-containing phospholipidsPlasma phospholipidsColorectal
120Palmitic amidePalmitic amideColorectal
121Hexadecanedioic acidHexadecanedioic acidColorectal
122Octadecanoic acidOctadecanoic acidColorectal
123Eicosatrienoic acidEicosatrienoic acidColorectal
124Lysophosphatidylcholine 18:2LPC(18:2)Colorectal
125Lysophosphatidylcholine 16:0LPC(16:0)Colorectal
MicroRNA and other RNAs5let-7gColorectal
15miR-126miR-126Colorectal
32miR-135bmiR-135bColorectal
36miR-141miR-141Colorectal
38miR-143miR-143Colorectal
39miR-145miR-145Colorectal
57miR-17-3pmiR-17-3pColorectal
68miR-18amiR-18aColorectal
71miR-191-5pmiR-191-5pColorectal
94miR-20amiR-20aColorectal
95miR-21miR-21Colorectal
125miR-29amiR-29aColorectal
187miR-548as-3pmiR-548as-3pColorectal
195miR-601miR-601Colorectal
210mir-760mir-760Colorectal
214miR-885-5pmiR-885-5pColorectal
219miR-92amiR-92aColorectal
231U6 snRNA (U6)U6 snRNA (U6)Colorectal
Novel proteins15Microtubule-associated protein RP/EB family member 1MAPRE1Colorectal
16Leucine-rich alpha-2-glycoproteinLRG1Colorectal
56Alpha-enolaseAlpha-enolaseColorectal
62BetaineBetaineColorectal
72CACNAG1CACNAG1Colorectal
82Colon cancer specific antigen-2CCSA-2Colorectal
88C9orf50-MC9orf50-MColorectal
89CLEC4DCLEC4DColorectal
90LMNB1LMNB1Colorectal
91PRRG4PRRG4Colorectal
92VNN1VNN1Colorectal
103Dermokine-betaDK-betaColorectal
105SepraseSepraseColorectal
126Serum amyloid ASAAColorectal
132Lipocalin 2Lipocalin 2Colorectal
Nuclear proteins2k-rask-rasColorectal
Microbial proteinsNilNil
Volatile organic compounds1Phenyl methylcarbamatePhenyl methylcarbamateColorectal
2EthylhexanolEthylhexanolColorectal
36-t-Butyl-2,2,9,9-tetramethyl-3,5- decadien-7-yne6-t-Butyl-2,2,9,9-tetramethyl-3,5-decadien-7-yneColorectal
Table 3

Example for lung cancer and mesothelioma specific biomarkers from all 13 categories.

Biomarker categoriesID noBiomarkerAcronymCancer
Adhesion and matrix proteins2CalreticulinCRTLung
7ClusterinCLILung
8Cross-linked telopeptide of type I collageICTPLung
9E-cadherinE-cadherin; soluble E-cadherin (sE-cad)Lung
10E-cadherin gene CDH1CDH1Lung
11E-selectinE-selectin; sE-selectinLung
19Matrix metalloproteinase-2MMP2Lung
29Soluble L-selectinsL-selectinLung
31Surfactant protein-DSP-DLung
Auto-antibodies & immunological markers2Anti-p53 antibodiesp53; serum p53 antibodies; p53-Abs; p-53-AAB; Anti-p53AbLung
3Anti-survivin antibodiesSurvivin/anti-survivin antibodiesLung
6Inosine monophosphate dehydrogenaseIMPDHLung
8Immunoglobulin GIgGLung
12Anti-livinLivin/anti-livin antibodiesLung
22C-reactive proteinCRPLung
28Anti-Krebs von Lungren-6KL-6Lung
30Anti-ubiquillinUbiquillinLung
32Alpha-crystallin IgG antibodiesAlpha-crystallin antibodiesLung
37CD30CD30Lung
38CD63CD63Lung
43NY-ESO-1NY-ESO-1Lung
44CAGECAGELung
45GBU4-5GBU4-5Lung
46SOX2SOX2Lung
47HuDHuDLung
48IgM autoantibodiesIgM autoantibodiesLung
55Anti-hydroxysteroid-(17-alpha)-dehydrogenaseLung
56Anti-triosephosphate isomeraseLung
Classical tumour markers2Cancer antigen 15-3CA15-3; CA 15-3Lung
3Carcinoembryonic antigenCEALung
6Human epididymis protein 4HE4Lung
9Squamous cell carcinoma antigenSCCA; SCC-agLung
11Cytokeratin fragment 19CYFRA 21-1Lung
12Neuron Specific EnolaseNSELung
14Progastrin-releasing peptideproGRPLung
22HER2HER2; AB_HER2; 36 HER2 negative; erbb-2; soluble human epidermal growth factor receptor 2 (sHER2)Lung
Coagulation and angiogenesis molecules1Urokinase plasminogen activatoruPA/uPAR/suPARLung
2Vascular endothelial growth factorVEGFLung
10Endothelin-1ET-1Lung
13Angiopoietin-2Angiopoietin-2; Apo-2Lung
14Thrombospondin-1THBS1Lung
15Plasminogen activator inhibitorPlasminogen activator inhibitorLung
19EndostatinEndostatinLung
21Annexin A1ANXA1 mNRALung
24C4dC4dLung
25Annexin A2ANXA2Lung
Cytokines, chemokines and insulin-like growth factors7Tumour necrosis factor [alpha]TNF[alpha]; DcR3Lung
10Macrophage migration inhibitory factorMIFLung
18Hepatocyte growth factorHGFLung
19Insulin-like growth factor binding proteinIGFBP-3Lung
20Granulocyte-colony stimulating factorG-CSFLung
21Interleukin 3IL-3Lung
22Stem cell factorSCFLung
25C-C motif chemokine 5C-C motif chemokine 5Lung
28Interleukin-1raIL-1raLung
29Monocyte chemotactic protein-1MCP-1Lung
31MidkineMK; MDKLung
38IRF1IRF1Lung
51Macrophage inflammatory protein 4MIP-4Lung
52Megakaryocyte potentiating factorMPFMesothelioma
Circulating-free DNA1Microsatellite alterations at FHITFHITLung
2Microsatellite alterations at loci on chromosome 33p lociLung
3Adenomatous polyposis coliAPCLung
4CHD1CHD1Lung
5O(6)-Methyl-guanine-DNA methyltransferaseMGMTLung
6DCCDCCLung
7RASSF1ARASSF1ALung
8absent in melanoma 1AIM1; beta/gamma crystallin domain-containing protein 1Lung
Hormones9Progesterone receptor BPRBLung
13ProlactinProlactinLung
Metabolic markers6Alaninel-Alanine, glucuronoic lactoneLung
26LeucineLeucine; isoleucineLung
27HistidineHistidineLung
28TryptophanTryptophanLung
29OrnithineOrnithineLung
38Lactic acidLactic acidLung
39Glycelic acidGlycelic acidLung
40Glycolic acidGlycolic acidLung
87NG1A2FNG1A2FLung
89N-glycopeptidesGlycopeptidesMesothelioma
102OleamideOleamideLung
103Long chain acyl carnitinesLong chain acyl carnitinesLung
104Lysophosphatidylcholine 18:1LPC(18:1)Lung
105Lysophosphatidylcholine 20:4LPC(20:4)Lung
106Lysophosphatidylcholine 20:3LPC(20:3)Lung
107Lysophosphatidylcholine 22:6LPC(22:6)Lung
108Serum metabolite 16:0/1SM(16:0/1)Lung
115FerritinFTLLung
MicroRNA and other RNAs7miR-103miR-103Mesothelioma
14miR-1254miR-1254Lung
15miR-126miR-126Mesothelioma
20miR-128bmiR-128bLung
29miR-133amiR-133aLung
35miR-140miR-140Lung
38miR-143miR-143Lung
41miR-1468miR-1468Lung
43miR-146b-3pmiR-146b-3pLung
50miR-155miR-155Lung
53miR-15bmiR-15bLung
60miR-181cmiR-181cLung
61miR-182miR-182Lung
68miR-18amiR-18aLung
80miR-197miR-197Lung
95miR-21miR-21Lung
98miR-212miR212Lung
106miR-220miR-220Lung
108miR-221miR-221Lung
111miR-23amiR-23aLung
122miR-27bmiR-27bLung
135miR-30c-1*miR-30c-1*Lung
145miR-330miR-330Lung
147miR-331miR-331Lung
152miR-339-5pmiR-339-5pLung
157miR-345miR-345Lung
158miR-346miR-346Lung
172miR-377miR-377Lung
180miR-484miR-484Lung
188miR-548bmiR-548bLung
189miR-550miR-550Lung
190miR-566miR-566Lung
192miR-574–5pmiR-574–5pLung
197miR-616*miR-616*Lung
198miR-625*miR-625*Mesothelioma
203miR-656miR-656Lung
204miR-660miR-660Lung
213miR-876-3pmiR-876-3pLung
218miR-92miR-92Lung
221miR-939miR-939Lung
224miR-let-7let-7Lung
Novel proteins3HaptoglobinHPLung
21CD9CD9Lung
22CD81CD81Lung
39HMGA1HMGA1Lung
40TFDP1TFDP1Lung
41SUV39H1SUV39H1Lung
42RBL1RBL1Lung
43HNRPDHNRPDLung
58Anterior gradient 2AGR2Lung
63Pentraxin-3PTX3Lung
67Lysyl oxidaseLOXLung
75Death receptor 3DR3Lung
76Membrane-spanning 4 domain subfamily A from the multigene family of proteins involved in signal transduction of which CD20 is one memberMS4ALung
93Heat shock protein 90 alphaHSP90alphaLung
94Leucine-rich repeats and immunoglobulin-like domains 3LRIG3Lung
95PleiotrophinPleiotrophinLung
96Protein kinase C iota typePRKCILung
97Repulsive Guidance Molecule CRGM-CLung
98Stem Cell Factor soluble ReceptorSCF-sRLung
99YESYESLung
116HMGB1HMGB1Mesothelioma
119Carbohydrate antigen 50CA50Lung
125Cytokeratin fragment 21.1Cytokeratin fragment 21.1Lung
126Serum amyloid ASAALung
128Carbohydrate antigen 211CA211Lung
146Endoplasmic reticulum protein-29ERP29Lung
Nuclear proteins3Isocitrate dehydrogenase 1IDH1Lung
4p53 messenger RNAp53 mRNALung
10E2F6E2F6Lung
13Variant Ciz1Ciz1Lung
Microbial proteins6Epstein-Barr virus-induced gene 3EBI3Lung
Volatile organic compoundsNilNil
Adhesion and matrix proteins (n = 36). The expression of molecules involved in adhesion or in formation of the connective tissue matrix around cancer cells differ from non-neoplastic cells and appear in blood. Early work included collagen breakdown products, which are produced as a result of increased collagen turnover, but are not specific to particular tumour types (Paterson et al., 1991, Berruti et al., 1995). Collagens are metabolised by matrix metalloproteinase proteins (MMPs), these in turn are antagonised by tissue inhibitors of matrix metalloproteinases (TIMPs) (Roy et al., 2009). Both MMPs and TIMPs are represented in this group. Turnover of other matrix proteins is altered in cancer: vimentin (Ludwig et al., 2009), laminin (Schechter & Lopes, 1990) and tenascin are included in the list. Cancer cells have increased motility compared with non-neoplastic cells, and show altered expression of adhesion molecules. EpCAM, e-cadherin, and e-selectin are represented as blood biomarkers in the list (Beije et al., 2015, Hauselmann and Borsig, 2014, Gires and Stoecklein, 2014). Following review, a total of 18 were removed, including one duplicate entry. Auto-antibodies and immunological markers (n = 59). The majority of entries in this category relate to auto-antibodies. These have been described for a wide variety of proteins within cancer, notably nuclear proteins such as P53 and other nuclear proteins, and occur in many cancers (Middleton et al., 2014). Immunological markers of interest include CRP, usually regarded as a marker of inflammation. Classical Tumour Markers. A total of 23 markers were included in the ‘classical’ tumour marker group. This includes those used widely in practice, including CEA, CA125, CA15-3, CA19-9, AFP, and PSA. Markers of lesser utility, such as LDH and HE4 were also included. It should be noted that several of these (CA15-3 and CA19-9) refer to different epitopes of the same antigen, MUC1, which also came up in our searches. Coagulation & angiogenic proteins. Of the 29 proteins in this category, the majority had relatively little evidence for their utility in early cancer detection. The markers can be sub-categorised into those connected to angiogenesis (e.g. VEGF, PlGF, Angiopoietins) and coagulation (e.g. plasminogen activating proteins and kallikreins). Annexins were included in this group, though they are more often thought of as apoptosis associated proteins. Cytokines, chemokines and insulin-like growth factors. 52 biomarkers were included in this group. They include a wide range of cytokines and soluble receptors. Evidence for these is limited, but they represent an interesting group of proteins abnormal in cancer, measurement of which is likely to reflect the profound local immune suppression and systemic alteration of immunity present in cancers. Circulating-free DNA. This is usually abbreviated as cfDNA, though increasingly the term circulating tumour DNA (ctDNA) is used. While DNA is clearly a single biomarker, 39 individual biomarkers representing genes or alterations of most interest were identified in this group, though in essence any mutation of gene methylation marker identified would be part of this group. While the first descriptions of cfDNA used PCR (Lo, 2001a, Lo, 2001b), many recent papers apply multi-analyte methods, including next generation sequencing (Coco et al., 2015, Rothe et al., 2014, Couraud et al., 2014), to the study of cfDNA to detect mutations of potential diagnostic significance. Though as yet few have used this for early detection. Hormones. While 13 biomarkers were assigned to this category, only Corticosteroid-binding globulin survives more stringent searches (Wu et al., 2012). Hormone levels are not thought to be reliable markers of cancer. Metabolomics. A large number of metabolites are known to be altered in cancer, as the result of changes in energy, lipid, amino acid, and protein metabolism. We identified 126 individual markers, many of which were measured in concert by mass spectroscopy within several studies (Cross et al., 2014, Hasim et al., 2013). MicroRNA and other RNAs. There are now over 1000 human miRNA species known, a large number of these have been studied in cancer. While the majority have been looked at in tissue, there is considerable interest in their possible use as a liquid biopsy, our list of 232 biomarkers in this group reflects this. They are rarely measured alone: most use some form of array strategies for measurement, most studies concentrate on single cancer types (Fortunato et al., 2014, Clancy et al., 2014). Novel Proteins. A large number of protein biomarkers, often identified by mass spectroscopy or 2D gel electrophoresis, were hard to categorise. These were grouped as novel proteins and represent a diverse group of 148 biomarkers. Examples include alpha-2-heremans-schmid-glycoprotein (AHSG) (Dowling et al., 2012) and galectin (Gromov et al., 2010) in breast cancer. Nuclear proteins. A group of 13 nuclear protein biomarkers were assigned to this category, though some markers within the novel protein group are of nuclear origin. Circulating nucleosomes are included in this group as they are usually detected by ELISA (Holdenrieder et al., 2014). Microbial proteins (n = 15). A small number of Epstein-Barr Virus (EBV) and Human Papilloma Virus (HPV) proteins and their antibodies have been studied as early cancer biomarkers in blood, based on the detection of EBV DNA in cancer patients (Lo, 2001b). Helicobacter antibodies also fall into this group. Volatile Organic Compounds (VOC). Only three biomarkers, all small metabolites, were assigned to this category, which it could be argued forms part of the metabolite group. It is however measured differently.

Discussion

We systematically searched the literature from the last five years to identify potential blood biomarkers for cancer (Hanahan and Weinberg, 2011, Cree, 2011). The data mining process retrieved 3990 citations from the initial 19, 724 records, screening the abstracts of these citations identified 814 biomarkers that may be relevant. After data-cleaning, 788 biomarkers were fitted into 13 categories as described above as having potential for use as early cancer detection biomarkers present within blood samples. Biomarkers were grouped by molecular function. Further analysis such as grouping by cancer type may be possible only once the utility of each biomarker has been reviewed independently. As this is a mapping review, it is not possible to speculate the definitive clinical utility for each biomarker. Most studies reviewed tended to concentrate on single common cancers, and few papers show evidence of a systematic approach to biomarker discovery but were limited by the clinical samples and techniques of their laboratories. The conduct of large systematic reviews is challenging, yet not all biomedical questions can be reduced to the size where standard methodologies for systematic review are thought reasonable. We have therefore taken a data mining approach to map blood biomarkers that may be suitable for the early detection of cancer using the search tools available within the reference management software. As with any approach to reviewing literature that falls short of a full systematic review, there is a balance between rigour and expenditure of time and resources. In this case, the aim was not to identify all relevant literature (as would be the case in a systematic review of efficacy), but rather all relevant biomarkers. It should be noted that the database does not hold the full text of the articles referenced and is restricted to titles, abstracts and keywords. Full text searching using machine learning algorithms could eventually provide a better solution. In this instance, to allow a thorough search of the large dataset of biomarker literature and ensure an efficient approach to managing the data, we used data mining tools available within the reference management software. This allowed us to retrieve potentially relevant records, extract data relating to relevant biomarkers, and validate the process through adjunctive searches of reviews and through contact with an extensive network of experts. While the use of experts to validate the data may be regarded as subjective, it was a necessary step in validation of the searches and the multidisciplinary consortium involved in this work covers a large range of expertise. The limitation to studies published after 2009 could have skewed the data towards new technologies, and therefore reviews were included to mitigate the risk of ignoring older methodologies. Despite this limitation, it is notable that proteomic biomarkers, a more mature technology, formed a large proportion of the biomarkers found. Furthermore, it is possible that many of those biomarkers that have received less attention more recently did so because they were found to have limited utility in subsequent studies. We used conservative selection criteria that may have resulted in the inclusion of irrelevant biomarkers, but will have minimised the chance of relevant biomarkers being excluded. As such, we are confident that our methodology is fit for purpose and will have had high sensitivity for the identification of relevant biomarkers. Limiting the mapping review to abstracts may have excluded studies identifying multiple potential biomarkers if such biomarkers were only mentioned in the main text. This is unlikely to occur in the field of emerging and promising biomarkers where the aim is to highlight the biomarker and technology to the audience. However, the vagueness of the abstracts of many papers is a challenge, as is the generally poor quality of study design. Even some larger scale studies from major groups do not include controls and few studies were powered to examine multiple biomarkers in comparison with existing tumour markers. The majority of cases (when described) are from patients with advanced disease, and this is a major concern for those interested in early detection: there is no guarantee that biomarkers identified in patients with advanced disease are relevant to those with early disease. There is certainly a need to improve the quality of papers on early detection using tools such as those available from the EQUATOR network (http://www.equator-network.org). Our intention is to use the list of biomarkers identified by this review to generate a set of biomarkers that can be subjected to analytical validation within pathology blood science laboratories, then clinically validated within a large, prospective, multicentre clinical study to develop a generic cancer testing strategy for subsequent clinical trial. The primary aim is to produce a screening test strategy for cancer that does more good than harm at reasonable cost. Good includes decreased morbidity and mortality from early detection, diagnosis and treatment of cancers, while harm is usually regarded as significant risk of overdiagnosis, and consequent overtreatment. The entire strategy needs to be cost effective to achieve eventual approval from the UK National Screening Committee (NSC), which defines 22 criteria according to the condition, the test, the treatment and the screening programme (http://www.screening.nhs.uk/criteria) based on those developed by Wilson & Jungner (1968). Within the list, there are some interesting results. Firstly, it is clear that current tumour markers, which considered in isolation, few would regard as sensible diagnostic tests in patients with a possible diagnosis of cancer, are collectively quite good at detection if used concurrently. The bulk of the work on this comes from one group in Barcelona (Molina et al., 2012), with other important contributions from others (Barak et al., 2010). The validation of biomarkers needs a point of reference, for direct comparison and it is clear that tumour marker lists used by Molina et al. (2012) represent such a standard. We would encourage those active in the field to use this list as their comparator for future work to allow comparison between studies. The biomarkers can be grouped by the technology used for their detection. Taken to its logical conclusion, this results in a reduction of the thirteen groups above to seven groups as outlined in Box 3. The ability of protein measurement to be multiplexed by immunoassay arrays or mass spectroscopy means that all proteins, including auto-antibodies, can be measured simultaneously. Simple panels with few analyses tend to be less expensive and have greater potential for high throughput. DNA and RNA can be detected rapidly and inexpensively by polymerase chain reaction (PCR) technologies, and there is evidence from multiple studies that the level of cfDNA has potential as a generic cancer marker. However, PCR is limited in the number of targets that can be detected at one time, and by the small amount of material present in patients with small tumours, which does not permit large numbers of tests to be performed without recourse to sequencing or large panels. Sequencing has the potential to detect large numbers of mutations, adding specificity, and could have utility in reflex testing. It is currently an expensive option, but costs of sequencing are decreasing rapidly, while technologies available are improving their capability at almost the same pace. Metabolomics is of considerable interest, with a large literature to support it. While larger molecules require mass spectroscopy to measure their presence, smaller molecules can be detected in gas phase in the head space of blood samples using inexpensive sensor technologies. We believe that this relatively new option may have considerable potential to act as a generic test. There are a number of other tests that do not fit immediately into one of these seven categories: nucleosome assays are one such example, and are being used as potential screening tests. The concept of combining high sensitivity/low specificity tests with reflex low sensitivity/high specificity tests to detect cancers early (Cree, 2011), seems feasible from the results we have obtained. We need to combine biomarkers with high sensitivity for screening the general population with biomarkers of high specificity to determine the relevance of the screening results. The next task is clearly to try this in practice to determine its real potential for early cancer detection, and to determine the best analytical methods to process the data for individual patients. Our preferred strategy is to examine the biomarkers in each category in greater detail, and undertake direct comparison of these biomarkers in a large cohort of samples following independent analytical validation. In our view, the same caveats around retrospective studies apply to biomarker validation as they do to drug trials: the potential for bias from sample collections is high and large prospective studies are necessary. This review is therefore the first step in an ambitious programme of work which will inevitably require careful evaluation of clinical, cost and ethical implications at each stage. However, there is no doubt that if such an approach to early cancer detection proved successful, it could be invaluable.

Conclusion

This ground-breaking study is the first to systematically and comprehensively map blood biomarkers for early detection of cancer and will inform an innovative research project to identify, validate and implement new generic blood screening tests for early cancer detection in the general population. The following are the supplementary data related to this article.

Supplementary Table 1

Adhesion and matrix proteins.

Supplementary Table 2

Auto-antibodies & immunological markers.

Supplementary Table 3

Classical tumour markers.

Supplementary Table 4

Coagulation and angiogenesis molecules.

Supplementary Table 5

Cytokines, chemokines and insulin-like growth factors.

Supplementary Table 6

Circulating-free DNA.

Supplementary Table 7

Hormones.

Supplementary Table 8

Metabolic markers.

Supplementary Table 9

MicroRNA and other RNAs.

Supplementary Table 10

Novel proteins.

Supplementary Table 11

Nuclear proteins.

Supplementary Table 12

Microbial proteins.

Supplementary Table 13

Volatile organic compounds.

Authors Contributions

IC, SH, BW, and STP designed the study. Searches were performed HBW. LU performed the mapping review with input from the ECDC. The draft manuscript was prepared by LU, IC and BW. All authors agreed the final version.

Declaration of Competing Interests

The authors LU, IC, SH, STP, BW, HBW have no conflicts of interest to declare. The ECDC has grant funding for early cancer biomarker research from Cancer Research UK and involves the following companies GE Healthcare, Life Technologies, Abcodia, Nalia, and Perkin-Elmer. Individual ECDC members have declared their interests to the ECDC secretariat.

Funding

This work was conducted on behalf of the Early Cancer Detection Consortium, within the programme of work for work packages 1 & 2. The Early Cancer Detection Consortium is funded by Cancer Research UK under grant number: C50028/A18554.
RestrictionJustification

Searches limited to last five years.

To ensure that the biomarkers identified and their associated evidence is current and relevant, searches were restricted to records published in the last five years (from 2010 to May 2014).

Data mining technique employed, as opposed to screening all references

Data mining involved interrogation of search results using relevant keywords (Box 2) to search within the database of total records for batches of references. Keywords were identified through consultation with ECDC members for known technologies, and for other potentially relevant terms. Keywords for similar concepts (e.g. synonyms for a specific biomarker) were grouped and searched together. Keywords expected to retrieve citations of high relevance were prioritised over those with less obvious relevance. Further keywords were identified by the review team by consideration of indexing keywords and content of studies identified as relevant.

One reviewer performed the data mining.

One reviewer screened the references to generate the list of biomarkers using the data mining technique. A single reviewer screening approach was mitigated for by the examination review papers and consultation with ECDC membership during a later validation phase. An inclusive approach to inclusion was adopted to minimise inappropriate exclusions.

Pragmatic inclusion criteria

Titles without abstracts were not included. Equally abstracts of primary studies or reviews which did not name a biomarker were not included. Titles and abstracts retrieved from each batch of references associated with each keyword were assessed against the eligibility criteria in Table 1.
  31 in total

Review 1.  The case for early detection.

Authors:  Ruth Etzioni; Nicole Urban; Scott Ramsey; Martin McIntosh; Stephen Schwartz; Brian Reid; Jerald Radich; Garnet Anderson; Leland Hartwell
Journal:  Nat Rev Cancer       Date:  2003-04       Impact factor: 60.716

Review 2.  Validation of screening procedures.

Authors:  A L Cochrane; W W Holland
Journal:  Br Med Bull       Date:  1971-01       Impact factor: 4.291

3.  Two-site immunoassays for the measurement of serum laminin: correlation with breast cancer staging and presence of auto-antibodies.

Authors:  G B Schechter; J D Lopes
Journal:  Braz J Med Biol Res       Date:  1990       Impact factor: 2.590

Review 4.  Matrix metalloproteinases as novel biomarkers and potential therapeutic targets in human cancer.

Authors:  Roopali Roy; Jiang Yang; Marsha A Moses
Journal:  J Clin Oncol       Date:  2009-09-08       Impact factor: 44.544

Review 5.  Quantitative analysis of Epstein-Barr virus DNA in plasma and serum: applications to tumor detection and monitoring.

Authors:  Y M Lo
Journal:  Ann N Y Acad Sci       Date:  2001-09       Impact factor: 5.691

Review 6.  Circulating nucleic acids in plasma and serum: an overview.

Authors:  Y M Lo
Journal:  Ann N Y Acad Sci       Date:  2001-09       Impact factor: 5.691

7.  Next generation sequencing in non-small cell lung cancer: new avenues toward the personalized medicine.

Authors:  Simona Coco; Anna Truini; Irene Vanni; Maria Giovanna Dal Bello; Angela Alama; Erika Rijavec; Carlo Genova; Giulia Barletta; Claudio Sini; Giovanni Burrafato; Federica Biello; Francesco Boccardo; Francesco Grossi
Journal:  Curr Drug Targets       Date:  2015       Impact factor: 3.465

8.  Improved blood tests for cancer screening: general or specific?

Authors:  Ian A Cree
Journal:  BMC Cancer       Date:  2011-11-30       Impact factor: 4.430

9.  Stage at diagnosis and early mortality from cancer in England.

Authors:  S McPhail; S Johnson; D Greenberg; M Peake; B Rous
Journal:  Br J Cancer       Date:  2015-03-31       Impact factor: 7.640

Review 10.  Altered tumor-cell glycosylation promotes metastasis.

Authors:  Irina Häuselmann; Lubor Borsig
Journal:  Front Oncol       Date:  2014-02-13       Impact factor: 6.244

View more
  16 in total

1.  Multi-analyte assays and early detection of common cancers.

Authors:  Robert P Young; Timothy Christmas; Raewyn J Hopkins
Journal:  J Thorac Dis       Date:  2018-07       Impact factor: 2.895

2.  Networks Models of Actin Dynamics during Spermatozoa Postejaculatory Life: A Comparison among Human-Made and Text Mining-Based Models.

Authors:  Nicola Bernabò; Alessandra Ordinelli; Marina Ramal Sanchez; Mauro Mattioli; Barbara Barboni
Journal:  Biomed Res Int       Date:  2016-08-24       Impact factor: 3.411

3.  Inflammatory biomarkers and cancer: CRP and suPAR as markers of incident cancer in patients with serious nonspecific symptoms and signs of cancer.

Authors:  Line Jee Hartmann Rasmussen; Martin Schultz; Anne Gaardsting; Steen Ladelund; Peter Garred; Kasper Iversen; Jesper Eugen-Olsen; Morten Helms; Kim Peter David; Andreas Kjaer; Anne-Mette Lebech; Gitte Kronborg
Journal:  Int J Cancer       Date:  2017-04-24       Impact factor: 7.396

Review 4.  The evidence base for circulating tumour DNA blood-based biomarkers for the early detection of cancer: a systematic mapping review.

Authors:  Ian A Cree; Lesley Uttley; Helen Buckley Woods; Hugh Kikuchi; Anne Reiman; Susan Harnan; Becky L Whiteman; Sian Taylor Philips; Michael Messenger; Angela Cox; Dawn Teare; Orla Sheils; Jacqui Shaw
Journal:  BMC Cancer       Date:  2017-10-23       Impact factor: 4.430

5.  Combinational Biomarkers for Atrial Fibrillation Derived from Atrial Appendage and Plasma Metabolomics Analysis.

Authors:  Songqing Lai; Xiumeng Hua; Ran Gao; Liang Zeng; Jiangping Song; Jichun Liu; Jing Zhang
Journal:  Sci Rep       Date:  2018-11-16       Impact factor: 4.379

6.  Serum TNFRII: A promising biomarker for predicting the risk of subcentimetre lung adenocarcinoma.

Authors:  Minjuan Hu; Yanwei Zhang; Beibei Sun; Yuqing Lou; Xueyan Zhang; Huimin Wang; Chengya Huang; Wei Zhang; Tianqing Chu; Baohui Han
Journal:  J Cell Mol Med       Date:  2020-02-19       Impact factor: 5.310

7.  Human beings as islands of stability: Monitoring body states using breath profiles.

Authors:  Kiran Sankar Maiti; Michael Lewton; Ernst Fill; Alexander Apolonski
Journal:  Sci Rep       Date:  2019-11-07       Impact factor: 4.379

Review 8.  Identifying Novel Biomarkers Ready for Evaluation in Low-Prevalence Populations for the Early Detection of Upper Gastrointestinal Cancers: A Systematic Review.

Authors:  Natalia Calanzani; Paige E Druce; Claudia Snudden; Kristi M Milley; Rachel Boscott; Dawnya Behiyat; Smiji Saji; Javiera Martinez-Gutierrez; Jasmeen Oberoi; Garth Funston; Mike Messenger; Jon Emery; Fiona M Walter
Journal:  Adv Ther       Date:  2020-12-11       Impact factor: 3.845

9.  BBCancer: an expression atlas of blood-based biomarkers in the early diagnosis of cancers.

Authors:  Zhixiang Zuo; Huanjing Hu; Qingxian Xu; Xiaotong Luo; Di Peng; Kaiyu Zhu; Qi Zhao; Yubin Xie; Jian Ren
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

Review 10.  Ribosome Biogenesis Alterations in Colorectal Cancer.

Authors:  Sophie Nait Slimane; Virginie Marcel; Tanguy Fenouil; Frédéric Catez; Jean-Christophe Saurin; Philippe Bouvet; Jean-Jacques Diaz; Hichem C Mertani
Journal:  Cells       Date:  2020-10-27       Impact factor: 6.600

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.