Literature DB >> 30194472

Inter-observer variability of manual contour delineation of structures in CT.

Leo Joskowicz1, D Cohen2, N Caplan3, J Sosna3.   

Abstract

PURPOSE: To quantify the inter-observer variability of manual delineation of lesions and organ contours in CT to establish a reference standard for volumetric measurements for clinical decision making and for the evaluation of automatic segmentation algorithms.
MATERIALS AND METHODS: Eleven radiologists manually delineated 3193 contours of liver tumours (896), lung tumours (1085), kidney contours (434) and brain hematomas (497) on 490 slices of clinical CT scans. A comparative analysis of the delineations was then performed to quantify the inter-observer delineation variability with standard volume metrics and with new group-wise metrics for delineations produced by groups of observers.
RESULTS: The mean volume overlap variability values and ranges (in %) between the delineations of two observers were: liver tumours 17.8 [-5.8,+7.2]%, lung tumours 20.8 [-8.8,+10.2]%, kidney contours 8.8 [-0.8,+1.2]% and brain hematomas 18 [-6.0,+6.0] %. For any two randomly selected observers, the mean delineation volume overlap variability was 5-57%. The mean variability captured by groups of two, three and five observers was 37%, 53% and 72%; eight observers accounted for 75-94% of the total variability. For all cases, 38.5% of the delineation non-agreement was due to parts of the delineation of a single observer disagreeing with the others. No statistical difference was found for the delineation variability between the observers based on their expertise.
CONCLUSION: The variability in manual delineations for different structures and observers is large and spans a wide range across a variety of structures and pathologies. Two and even three observers may not be sufficient to establish the full range of inter-observer variability. KEY POINTS: • This study quantifies the inter-observer variability of manual delineation of lesions and organ contours in CT. • The variability of manual delineations between two observers can be significant. Two and even three observers capture only a fraction of the full range of inter-observer variability observed in common practice. • Inter-observer manual delineation variability is necessary to establish a reference standard for radiologist training and evaluation and for the evaluation of automatic segmentation algorithms.

Entities:  

Keywords:  Humans; Observer variation; Reproducibility of results

Mesh:

Year:  2018        PMID: 30194472     DOI: 10.1007/s00330-018-5695-5

Source DB:  PubMed          Journal:  Eur Radiol        ISSN: 0938-7994            Impact factor:   5.315


  14 in total

1.  Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation.

Authors:  Simon K Warfield; Kelly H Zou; William M Wells
Journal:  IEEE Trans Med Imaging       Date:  2004-07       Impact factor: 10.048

2.  Evaluation of lung MDCT nodule annotation across radiologists and methods.

Authors:  Charles R Meyer; Timothy D Johnson; Geoffrey McLennan; Denise R Aberle; Ella A Kazerooni; Heber Macmahon; Brian F Mullan; David F Yankelevitz; Edwin J R van Beek; Samuel G Armato; Michael F McNitt-Gray; Anthony P Reeves; David Gur; Claudia I Henschke; Eric A Hoffman; Peyton H Bland; Gary Laderach; Richie Pais; David Qing; Chris Piker; Junfeng Guo; Adam Starkey; Daniel Max; Barbara Y Croft; Laurence P Clarke
Journal:  Acad Radiol       Date:  2006-10       Impact factor: 3.173

Review 3.  SCCT guidelines for the performance and acquisition of coronary computed tomographic angiography: A report of the society of Cardiovascular Computed Tomography Guidelines Committee: Endorsed by the North American Society for Cardiovascular Imaging (NASCI).

Authors:  Suhny Abbara; Philipp Blanke; Christopher D Maroules; Michael Cheezum; Andrew D Choi; B Kelly Han; Mohamed Marwan; Chris Naoum; Bjarne L Norgaard; Ronen Rubinshtein; Paul Schoenhagen; Todd Villines; Jonathon Leipsic
Journal:  J Cardiovasc Comput Tomogr       Date:  2016-10-12

4.  Pretreatment tumor volume as a prognostic factor in metastatic colorectal cancer treated with selective internal radiation to the liver using yttrium-90 resin microspheres.

Authors:  Neha Bhooshan; Navesh K Sharma; Shahed Badiyan; Adeel Kaiser; Fred M Moeslein; Young Kwok; Pradip P Amin; Svetlana Kudryasheva; Michael D Chuong
Journal:  J Gastrointest Oncol       Date:  2016-12

5.  Intra-rater variability in low-grade glioma segmentation.

Authors:  Hans Kristian Bø; Ole Solheim; Asgeir Store Jakola; Kjell-Arne Kvistad; Ingerid Reinertsen; Erik Magnus Berntsen
Journal:  J Neurooncol       Date:  2016-11-11       Impact factor: 4.130

6.  Crowdsourcing image annotation for nucleus detection and segmentation in computational pathology: evaluating experts, automated methods, and the crowd.

Authors:  H Irshad; L Montaser-Kouhsari; G Waltz; O Bucur; J A Nowak; F Dong; N W Knoblauch; A H Beck
Journal:  Pac Symp Biocomput       Date:  2015

7.  Stratification of predictive factors to assess resectability and surgical outcome in clinoidal meningioma.

Authors:  Anil Nanda; Subhas K Konar; Tanmoy K Maiti; Shyamal C Bir; Bharat Guthikonda
Journal:  Clin Neurol Neurosurg       Date:  2016-01-11       Impact factor: 1.876

8.  Automated lung volumetry from routine thoracic CT scans: how reliable is the result?

Authors:  Matthias Haas; Bernd Hamm; Stefan M Niehues
Journal:  Acad Radiol       Date:  2014-05       Impact factor: 3.173

9.  Reverse Classification Accuracy: Predicting Segmentation Performance in the Absence of Ground Truth.

Authors:  Vanya V Valindria; Ioannis Lavdas; Wenjia Bai; Konstantinos Kamnitsas; Eric O Aboagye; Andrea G Rockall; Daniel Rueckert; Ben Glocker
Journal:  IEEE Trans Med Imaging       Date:  2017-04-17       Impact factor: 10.048

10.  Comparison of liver volumetry on contrast-enhanced CT images: one semiautomatic and two automatic approaches.

Authors:  Wei Cai; Baochun He; Yingfang Fan; Chihua Fang; Fucang Jia
Journal:  J Appl Clin Med Phys       Date:  2016-11-08       Impact factor: 2.102

View more
  20 in total

1.  GPU-based 3D iceball modeling for fast cryoablation simulation and planning.

Authors:  Ehsan Golkar; Pramod P Rao; Leo Joskowicz; Afshin Gangi; Caroline Essert
Journal:  Int J Comput Assist Radiol Surg       Date:  2019-08-12       Impact factor: 2.924

Review 2.  Artificial intelligence in assessment of hepatocellular carcinoma treatment response.

Authors:  Bradley Spieler; Carl Sabottke; Ahmed W Moawad; Ahmed M Gabr; Mustafa R Bashir; Richard Kinh Gian Do; Vahid Yaghmai; Radu Rozenberg; Marielia Gerena; Joseph Yacoub; Khaled M Elsayes
Journal:  Abdom Radiol (NY)       Date:  2021-03-31

3.  Atlas-based segmentation of cochlear microstructures in cone beam CT.

Authors:  Kimerly A Powell; Gregory J Wiet; Brad Hittle; Grace I Oswald; Jason P Keith; Don Stredney; Steven Arild Wuyts Andersen
Journal:  Int J Comput Assist Radiol Surg       Date:  2021-02-13       Impact factor: 2.924

4.  Pharyngeal flow simulations during sibilant sound in a patient-specific model with velopharyngeal insufficiency.

Authors:  Elias Sundström; Liran Oren
Journal:  J Acoust Soc Am       Date:  2019-05       Impact factor: 1.840

5.  Atlas-based liver segmentation for nonhuman primate research.

Authors:  Jeffrey Solomon; Nina Aiosa; Dara Bradley; Marcelo A Castro; Syed Reza; Christopher Bartos; Philip Sayre; Ji Hyun Lee; Jennifer Sword; Michael R Holbrook; Richard S Bennett; Dima A Hammoud; Reed F Johnson; Irwin Feuerstein
Journal:  Int J Comput Assist Radiol Surg       Date:  2020-07-09       Impact factor: 2.924

6.  Exploiting Shared Knowledge From Non-COVID Lesions for Annotation-Efficient COVID-19 CT Lung Infection Segmentation.

Authors:  Yichi Zhang; Qingcheng Liao; Lin Yuan; He Zhu; Jiezhen Xing; Jicong Zhang
Journal:  IEEE J Biomed Health Inform       Date:  2021-11-05       Impact factor: 5.772

7.  The Medical Segmentation Decathlon.

Authors:  Michela Antonelli; Annika Reinke; Spyridon Bakas; Keyvan Farahani; Annette Kopp-Schneider; Bennett A Landman; Geert Litjens; Bjoern Menze; Olaf Ronneberger; Ronald M Summers; Bram van Ginneken; Michel Bilello; Patrick Bilic; Patrick F Christ; Richard K G Do; Marc J Gollub; Stephan H Heckers; Henkjan Huisman; William R Jarnagin; Maureen K McHugo; Sandy Napel; Jennifer S Golia Pernicka; Kawal Rhode; Catalina Tobon-Gomez; Eugene Vorontsov; James A Meakin; Sebastien Ourselin; Manuel Wiesenfarth; Pablo Arbeláez; Byeonguk Bae; Sihong Chen; Laura Daza; Jianjiang Feng; Baochun He; Fabian Isensee; Yuanfeng Ji; Fucang Jia; Ildoo Kim; Klaus Maier-Hein; Dorit Merhof; Akshay Pai; Beomhee Park; Mathias Perslev; Ramin Rezaiifar; Oliver Rippel; Ignacio Sarasua; Wei Shen; Jaemin Son; Christian Wachinger; Liansheng Wang; Yan Wang; Yingda Xia; Daguang Xu; Zhanwei Xu; Yefeng Zheng; Amber L Simpson; Lena Maier-Hein; M Jorge Cardoso
Journal:  Nat Commun       Date:  2022-07-15       Impact factor: 17.694

8.  Segmentation evaluation with sparse ground truth data: Simulating true segmentations as perfect/imperfect as those generated by humans.

Authors:  Jieyu Li; Jayaram K Udupa; Yubing Tong; Lisheng Wang; Drew A Torigian
Journal:  Med Image Anal       Date:  2021-01-26       Impact factor: 8.545

Review 9.  Surgical data science - from concepts toward clinical translation.

Authors:  Lena Maier-Hein; Matthias Eisenmann; Duygu Sarikaya; Keno März; Toby Collins; Anand Malpani; Johannes Fallert; Hubertus Feussner; Stamatia Giannarou; Pietro Mascagni; Hirenkumar Nakawala; Adrian Park; Carla Pugh; Danail Stoyanov; Swaroop S Vedula; Kevin Cleary; Gabor Fichtinger; Germain Forestier; Bernard Gibaud; Teodor Grantcharov; Makoto Hashizume; Doreen Heckmann-Nötzel; Hannes G Kenngott; Ron Kikinis; Lars Mündermann; Nassir Navab; Sinan Onogur; Tobias Roß; Raphael Sznitman; Russell H Taylor; Minu D Tizabi; Martin Wagner; Gregory D Hager; Thomas Neumuth; Nicolas Padoy; Justin Collins; Ines Gockel; Jan Goedeke; Daniel A Hashimoto; Luc Joyeux; Kyle Lam; Daniel R Leff; Amin Madani; Hani J Marcus; Ozanan Meireles; Alexander Seitel; Dogu Teber; Frank Ückert; Beat P Müller-Stich; Pierre Jannin; Stefanie Speidel
Journal:  Med Image Anal       Date:  2021-11-18       Impact factor: 13.828

10.  An automatic multi-tissue human fetal brain segmentation benchmark using the Fetal Tissue Annotation Dataset.

Authors:  Kelly Payette; Priscille de Dumast; Hamza Kebiri; Ivan Ezhov; Johannes C Paetzold; Suprosanna Shit; Asim Iqbal; Romesa Khan; Raimund Kottke; Patrice Grehten; Hui Ji; Levente Lanczi; Marianna Nagy; Monika Beresova; Thi Dao Nguyen; Giancarlo Natalucci; Theofanis Karayannis; Bjoern Menze; Meritxell Bach Cuadra; Andras Jakab
Journal:  Sci Data       Date:  2021-07-06       Impact factor: 6.444

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.