Literature DB >> 22648677

Chapter 9: options for summarizing medical test performance in the absence of a "gold standard".

Thomas A Trikalinos1, Cynthia M Balion.   

Abstract

The classical paradigm for evaluating test performance compares the results of an index test with a reference test. When the reference test does not mirror the "truth" adequately well (e.g. is an "imperfect" reference standard), the typical ("naïve") estimates of sensitivity and specificity are biased. One has at least four options when performing a systematic review of test performance when the reference standard is "imperfect": (a) to forgo the classical paradigm and assess the index test's ability to predict patient relevant outcomes instead of test accuracy (i.e., treat the index test as a predictive instrument); (b) to assess whether the results of the two tests (index and reference) agree or disagree (i.e., treat them as two alternative measurement methods); (c) to calculate "naïve" estimates of the index test's sensitivity and specificity from each study included in the review and discuss in which direction they are biased; (d) mathematically adjust the "naïve" estimates of sensitivity and specificity of the index test to account for the imperfect reference standard. We discuss these options and illustrate some of them through examples.

Entities:  

Mesh:

Year:  2012        PMID: 22648677      PMCID: PMC3364362          DOI: 10.1007/s11606-012-2031-7

Source DB:  PubMed          Journal:  J Gen Intern Med        ISSN: 0884-8734            Impact factor:   5.128


  32 in total

1.  Bayesian approaches to modeling the conditional dependence between multiple diagnostic tests.

Authors:  N Dendukuri; L Joseph
Journal:  Biometrics       Date:  2001-03       Impact factor: 2.571

2.  Methods for evaluating the performance of diagnostic tests in the absence of a gold standard: a latent class model approach.

Authors:  Elizabeth S Garrett; William W Eaton; Scott Zeger
Journal:  Stat Med       Date:  2002-05-15       Impact factor: 2.373

Review 3.  Sources of variation and bias in studies of diagnostic accuracy: a systematic review.

Authors:  Penny Whiting; Anne W S Rutjes; Johannes B Reitsma; Afina S Glas; Patrick M M Bossuyt; Jos Kleijnen
Journal:  Ann Intern Med       Date:  2004-02-03       Impact factor: 25.391

Review 4.  Applying the right statistics: analyses of measurement studies.

Authors:  J M Bland; D G Altman
Journal:  Ultrasound Obstet Gynecol       Date:  2003-07       Impact factor: 7.299

5.  Estimating disease prevalence in the absence of a gold standard.

Authors:  Michael A Black; Bruce A Craig
Journal:  Stat Med       Date:  2002-09-30       Impact factor: 2.373

Review 6.  A review of solutions for diagnostic accuracy studies with an imperfect or missing reference standard.

Authors:  Johannes B Reitsma; Anne W S Rutjes; Khalid S Khan; Arri Coomarasamy; Patrick M Bossuyt
Journal:  J Clin Epidemiol       Date:  2009-05-17       Impact factor: 6.437

7.  Modeling conditional dependence between diagnostic tests: a multiple latent variable model.

Authors:  Nandini Dendukuri; Alula Hadgu; Liangliang Wang
Journal:  Stat Med       Date:  2009-02-01       Impact factor: 2.373

Review 8.  Towards complete and accurate reporting of studies of diagnostic accuracy: The STARD Initiative.

Authors:  Patrick M Bossuyt; Johannes B Reitsma; David E Bruns; Constantine A Gatsonis; Paul P Glasziou; Les M Irwig; Jeroen G Lijmer; David Moher; Drummond Rennie; Henrica C W de Vet
Journal:  Radiology       Date:  2003-01       Impact factor: 11.105

9.  Prevalence of prostate cancer among men with a prostate-specific antigen level < or =4.0 ng per milliliter.

Authors:  Ian M Thompson; Donna K Pauler; Phyllis J Goodman; Catherine M Tangen; M Scott Lucia; Howard L Parnes; Lori M Minasian; Leslie G Ford; Scott M Lippman; E David Crawford; John J Crowley; Charles A Coltman
Journal:  N Engl J Med       Date:  2004-05-27       Impact factor: 91.245

10.  Chapter 11: challenges in and principles for conducting systematic reviews of genetic tests used as predictive indicators.

Authors:  Daniel E Jonas; Timothy J Wilt; Brent C Taylor; Tania M Wilkins; David B Matchar
Journal:  J Gen Intern Med       Date:  2012-06       Impact factor: 5.128

View more
  16 in total

1.  The effect of a deep-learning tool on dentists' performances in detecting apical radiolucencies on periapical radiographs.

Authors:  Manal H Hamdan; Lyudmila Tuzova; André Mol; Peter Z Tawil; Dmitry Tuzoff; Donald A Tyndall
Journal:  Dentomaxillofac Radiol       Date:  2022-09-12       Impact factor: 3.525

Review 2.  Summary diagnostic validity of commonly used maternal major depression disorder case finding instruments in the United States: A meta-analysis.

Authors:  Arthur H Owora; Hélène Carabin; Jessica Reese; Tabitha Garwe
Journal:  J Affect Disord       Date:  2016-08-16       Impact factor: 4.839

3.  Using a web-based application to define the accuracy of diagnostic tests when the gold standard is imperfect.

Authors:  Cherry Lim; Prapass Wannapinij; Lisa White; Nicholas P J Day; Ben S Cooper; Sharon J Peacock; Direk Limmathurotsakul
Journal:  PLoS One       Date:  2013-11-12       Impact factor: 3.240

4.  Sputum Gram Stain for Bacterial Pathogen Diagnosis in Community-acquired Pneumonia: A Systematic Review and Bayesian Meta-analysis of Diagnostic Accuracy and Yield.

Authors:  Hiroaki Ogawa; Georgios D Kitsios; Mitsunaga Iwata; Teruhiko Terasawa
Journal:  Clin Infect Dis       Date:  2020-07-27       Impact factor: 9.079

5.  Chapter 8: meta-analysis of test performance when there is a "gold standard".

Authors:  Thomas A Trikalinos; Cynthia M Balion; Craig I Coleman; Lauren Griffith; Pasqualina L Santaguida; Ben Vandermeer; Rongwei Fu
Journal:  J Gen Intern Med       Date:  2012-06       Impact factor: 5.128

6.  Utility of investigation for suspected microbial keratitis: a diagnostic accuracy study.

Authors:  Stephen Tuft; Catey Bunce; Surjo De; John Thomas
Journal:  Eye (Lond)       Date:  2022-01-31       Impact factor: 4.456

7.  How to Determine the Accuracy of an Alternative Diagnostic Test when It Is Actually Better than the Reference Tests: A Re-Evaluation of Diagnostic Tests for Scrub Typhus Using Bayesian LCMs.

Authors:  Cherry Lim; Daniel H Paris; Stuart D Blacksell; Achara Laongnualpanich; Pacharee Kantipong; Wirongrong Chierakul; Vanaporn Wuthiekanun; Nicholas P J Day; Ben S Cooper; Direk Limmathurotsakul
Journal:  PLoS One       Date:  2015-05-29       Impact factor: 3.240

8.  Tuberculin skin test and Quantiferon test agreement and influencing factors in tuberculosis screening of healthcare workers: a systematic review and meta-analysis.

Authors:  Rossella Uccello; Maria Grazia Lourdes Monaco; Mariarosaria Muoio; Monica Lamberti; Daniela Feola; Nicola Sannolo; Albert Nienhaus; Paolo Chiodini
Journal:  J Occup Med Toxicol       Date:  2015-01-27       Impact factor: 2.646

9.  Melioidosis diagnostic workshop, 2013.

Authors:  Alex R Hoffmaster; David AuCoin; Prasith Baccam; Henry C Baggett; Rob Baird; Saithip Bhengsri; David D Blaney; Paul J Brett; Timothy J G Brooks; Katherine A Brown; Narisara Chantratita; Allen C Cheng; David A B Dance; Saskia Decuypere; Dawn Defenbaugh; Jay E Gee; Raymond Houghton; Possawat Jorakate; Ganjana Lertmemongkolchai; Direk Limmathurotsakul; Toby L Merlin; Chiranjay Mukhopadhyay; Robert Norton; Sharon J Peacock; Dionne B Rolim; Andrew J Simpson; Ivo Steinmetz; Robyn A Stoddard; Martha M Stokes; David Sue; Apichai Tuanyok; Toni Whistler; Vanaporn Wuthiekanun; Henry T Walke
Journal:  Emerg Infect Dis       Date:  2015-02       Impact factor: 6.883

10.  Multinomial tree models for assessing the status of the reference in studies of the accuracy of tools for binary classification.

Authors:  Juan Botella; Huiling Huang; Manuel Suero
Journal:  Front Psychol       Date:  2013-10-03
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.