Literature DB >> 19091087

Examining the significance of fingerprint-based classifiers.

Brian T Luke1, Jack R Collins.   

Abstract

BACKGROUND: Experimental examinations of biofluids to measure concentrations of proteins or their fragments or metabolites are being explored as a means of early disease detection, distinguishing diseases with similar symptoms, and drug treatment efficacy. Many studies have produced classifiers with a high sensitivity and specificity, and it has been argued that accurate results necessarily imply some underlying biology-based features in the classifier. The simplest test of this conjecture is to examine datasets designed to contain no information with classifiers used in many published studies.
RESULTS: The classification accuracy of two fingerprint-based classifiers, a decision tree (DT) algorithm and a medoid classification algorithm (MCA), are examined. These methods are used to examine 30 artificial datasets that contain random concentration levels for 300 biomolecules. Each dataset contains between 30 and 300 Cases and Controls, and since the 300 observed concentrations are randomly generated, these datasets are constructed to contain no biological information. A modest search of decision trees containing at most seven decision nodes finds a large number of unique decision trees with an average sensitivity and specificity above 85% for datasets containing 60 Cases and 60 Controls or less, and for datasets with 90 Cases and 90 Controls many DTs have an average sensitivity and specificity above 80%. For even the largest dataset (300 Cases and 300 Controls) the MCA procedure finds several unique classifiers that have an average sensitivity and specificity above 88% using only six or seven features.
CONCLUSION: While it has been argued that accurate classification results must imply some biological basis for the separation of Cases from Controls, our results show that this is not necessarily true. The DT and MCA classifiers are sufficiently flexible and can produce good results from datasets that are specifically constructed to contain no information. This means that a chance fitting to the data is possible. All datasets used in this investigation are available on the web.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 19091087      PMCID: PMC2628908          DOI: 10.1186/1471-2105-9-545

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  31 in total

1.  High-resolution serum proteomic features for ovarian cancer detection.

Authors:  T P Conrads; V A Fusaro; S Ross; D Johann; V Rajapakse; B A Hitt; S M Steinberg; E C Kohn; D A Fishman; G Whitely; J C Barrett; L A Liotta; E F Petricoin; T D Veenstra
Journal:  Endocr Relat Cancer       Date:  2004-06       Impact factor: 5.678

2.  Lessons from controversy: ovarian cancer screening and serum proteomics.

Authors:  David F Ransohoff
Journal:  J Natl Cancer Inst       Date:  2005-02-16       Impact factor: 13.506

Review 3.  Place of pattern in proteomic biomarker discovery.

Authors:  Michael A Gillette; D R Mani; Steven A Carr
Journal:  J Proteome Res       Date:  2005 Jul-Aug       Impact factor: 4.466

4.  Application of surface-enhanced laser desorption/ionization time-of-flight-based serum proteomic array technique for the early diagnosis of prostate cancer.

Authors:  Yu-Zhuo Pan; Xue-Yuan Xiao; Dan Zhao; Ling Zhang; Guo-Yi Ji; Yang Li; Bao-Xue Yang; Da-Cheng He; Xue-Jian Zhao
Journal:  Asian J Androl       Date:  2006-01       Impact factor: 3.285

5.  Accurate diagnosis of acute graft-versus-host disease using serum proteomic pattern analysis.

Authors:  Ramaprasad Srinivasan; Jasmine Daniels; Vincent Fusaro; Andreas Lundqvist; Jonathan K Killian; David Geho; Martha Quezado; David Kleiner; Sally Rucker; Virginia Espina; Gordon Whiteley; Lance Liotta; Emmanuel Petricoin; Stefania Pittaluga; Ben Hitt; A J Barrett; Kevin Rosenblatt; Richard W Childs
Journal:  Exp Hematol       Date:  2006-06       Impact factor: 3.084

6.  Using tree analysis pattern and SELDI-TOF-MS to discriminate transitional cell carcinoma of the bladder cancer from noncancer patients.

Authors:  Weiwei Liu; Ming Guan; Denglong Wu; Yuanfang Zhang; Zhong Wu; Ming Xu; Yuan Lu
Journal:  Eur Urol       Date:  2004-11-11       Impact factor: 20.096

7.  Increased serum levels of complement C3a anaphylatoxin indicate the presence of colorectal tumors.

Authors:  Jens K Habermann; Uwe J Roblick; Brian T Luke; Darue A Prieto; William J J Finlay; Vladimir N Podust; John M Roman; Elisabeth Oevermann; Thomas Schiedeck; Nils Homann; Michael Duchrow; Thomas P Conrads; Timothy D Veenstra; Stanley K Burt; Hans-Peter Bruch; Gert Auer; Thomas Ried
Journal:  Gastroenterology       Date:  2006-10       Impact factor: 22.682

8.  Cancer gene search with data-mining and genetic algorithms.

Authors:  Shital Shah; Andrew Kusiak
Journal:  Comput Biol Med       Date:  2006-04-17       Impact factor: 4.589

9.  Application of serum SELDI proteomic patterns in diagnosis of lung cancer.

Authors:  Shuan-ying Yang; Xue-yuan Xiao; Wang-gang Zhang; Li-juan Zhang; Wei Zhang; Bin Zhou; Guoan Chen; Da-cheng He
Journal:  BMC Cancer       Date:  2005-07-20       Impact factor: 4.430

10.  Identification of serum biomarkers for colon cancer by proteomic analysis.

Authors:  D G Ward; N Suggett; Y Cheng; W Wei; H Johnson; L J Billingham; T Ismail; M J O Wakelam; P J Johnson; A Martin
Journal:  Br J Cancer       Date:  2006-06-06       Impact factor: 7.640

View more
  2 in total

1.  Inside the mind of a medicinal chemist: the role of human bias in compound prioritization during drug discovery.

Authors:  Peter S Kutchukian; Nadya Y Vasilyeva; Jordan Xu; Mika K Lindvall; Michael P Dillon; Meir Glick; John D Coley; Natasja Brooijmans
Journal:  PLoS One       Date:  2012-11-21       Impact factor: 3.240

2.  Identification of gene signatures used to recognize biological characteristics of gastric cancer upon gene expression data.

Authors:  Zhi Yan; Brian T Luke; Shirley X Tsang; Rui Xing; Yuanming Pan; Yixuan Liu; Jinlian Wang; Tao Geng; Jiangeng Li; Youyong Lu
Journal:  Biomark Insights       Date:  2014-08-14
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.