Literature DB >> 12540815

Assessment of consistency in contouring of normal-tissue anatomic structures.

Dawn C Collier1, Stuart S C Burnett, Mayankkumar Amin, Stephen Bilton, Christopher Brooks, Amanda Ryan, Dominique Roniger, Danny Tran, George Starkschall.   

Abstract

The purpose of this work is to estimate the uncertainty in the manual contouring of normal anatomical structures. The heart, esophagus, and spinal cord were contoured manually on six sets of computed tomography images by six dosimetrists whose experience ranged from 1 year to over 15 years. To determine the differences between inter- and intraobserver variations, each data set was contoured by one of the dosimetrists five times and once each by the five other dosimetrists. The magnitude of the discrepancies in delineating the contours was assessed. Intradosimetrist contouring discrepancies were as follows: esophagus, average 0.3 cm and maximum 2.9 cm; heart, average 0.5 cm and maximum 7.6 cm; and spinal cord, average 0.1 cm and maximum 0.7 cm. Interdosimetrist contouring discrepancies were as follows: esophagus, average 0.4 cm and maximum 3.1 cm; heart, average 0.7 cm and maximum 8.1 cm; and spinal cord, average 0.2 cm and maximum 0.9 cm. Significant discrepancies can occur when normal anatomic structures are contoured manually. Interdosimetrist discrepancies are typically slightly greater than intradosimetrist discrepancies. The magnitude of the discrepancies does not appear to be correlated to the experience of the dosimetrist.

Entities:  

Mesh:

Year:  2003        PMID: 12540815      PMCID: PMC5724429          DOI: 10.1120/jacmp.v4i1.2538

Source DB:  PubMed          Journal:  J Appl Clin Med Phys        ISSN: 1526-9914            Impact factor:   2.102


INTRODUCTION

The need for manual delineation of normal‐tissue anatomic structures is a major stumbling block that impedes full automation of the radiation treatment planning process. Consequently, much effort has been put into the search for reliable ways to automate this task. – An important step in the development of any automatic contouring algorithm is assessing the accuracy of the algorithm. A reasonable expectation for the accuracy of the algorithm is that it be no worse than what is manually achievable. Knowledge of the inter‐ and intraobserver variations in manual contouring can help to quantify this expectation. Prior studies ‐ have examined the consistency in delineation of target volumes [gross tumor volume (GTV), clinical target volume (CTV), and planning target volume (PTV)] and normal tissue anatomic structures. Not surprisingly, intraobserver variation was found to be less than interobserver variation, – for example, a 5.5% volume variation as compared with a 17.5% variation. In these cases, however, delineation of the target volume was based on interpretation of the extent of tumor as displayed in an imaging study and not on the assessment of a particular anatomic structure. Prostate target volumes are notable exceptions, because the target volume is often the entire prostate. , , For these cases, delineation of the target volume is tantamount to delineation of the prostate, and variations in the delineated target volume are the result of different interpretations of the extent of the anatomic structure. The purpose of this study was to assess the variations in the manual delineation of normal anatomy. Unlike previous studies, , , , which focused on uncertainties in the total volume of an anatomic structure, the present study assesses the spatial uncertainties that arise in locating the boundaries of anatomic structures. Because beam geometries and treatment portals determined during treatment planning are based partly on the location of one or more critical anatomic structures, errors in locating the boundaries of these structures may have profound effects on the design of a treatment plan.

METHODS AND MATERIALS

A. Contour delineation

Six dosimetrists, with experience ranging from 1 year to over 15 years, participated in the study. The heart, esophagus, and spinal cord were outlined on computed tomography (CT) image data sets of six patients with thoracic cancer who were treated at The University of Texas M. D. Anderson Cancer Center. The CT images were acquired using a virtual simulator (AcQSim; Philips Radiation Oncology Systems, Highland Heights, OH) with a slice thickness of 0.3 cm, and between 64 and 160 transverse slices per patient data set, and then transferred to a treatment planning system (Pinnacle ; Philips Radiation Oncology Systems, Milpitas, CA). All patient‐identifying information was removed from the system. To determine the intraobserver variation, each dosimetrist was assigned one of the CT data sets and asked to contour the three structures five times. Intervals between contouring sessions varied from sequentially up to several days. To determine the interobserver variation, each CT data set was contoured once each by the remaining five dosimetrists. The dosimetrists were asked not to use automatic contouring tools provided by the treatment‐planning system, but rather to contour each structure manually. To facilitate the analysis, the contours were transferred from Pinnacle to a treatment planning system that was developed in‐house using the Radiation Therapy Oncology Group (RTOG) transfer utility provided by the Pinnacle system. The contours for each of the six patients were then separated and individual files created for each contour of the heart, esophagus, and spinal cord. A program was written to read in the contour data and carry out the analysis.

B. Data analysis

If the variation in a set of contours is not too great, the contours will typically divide the plane (i.e., a particular CT slice) into three regions. These regions are an inner region of points enclosed by all contours, an outer region of points that are outside all contours, and an annular region of points that are inside some contours and not others. Figure 1 illustrates this situation for five hypothetical contours (thin lines) where, for emphasis, the inner and outer boundaries of the annular region are drawn with a thick line.
Figure 1

Sketch of five hypothetical contours (thin lines) that divide the plane into an inner region of points contained within all contours, an outer region of points outside all contours, and an annular region of points inside some contours and not others. The numbers 0–5 indicate the number of enclosing contours. The inner and outer boundaries are drawn with a thick line for emphasis.

Sketch of five hypothetical contours (thin lines) that divide the plane into an inner region of points contained within all contours, an outer region of points outside all contours, and an annular region of points inside some contours and not others. The numbers 0–5 indicate the number of enclosing contours. The inner and outer boundaries are drawn with a thick line for emphasis. In our analysis, we examined the width of the annular region at points along its perimeter. These points corresponded to the points defining the anatomic regions on the treatment planning system and varied from around 10 to over 100 points per slice. The width can be determined from a map where each pixel is assigned a value equal to the number of contours that enclose it. A pixel is assigned a value of 5 (for a set of five contours) if it lies inside the inner region, a value of 0 if it lies in the outer region, or an intermediate value equal to the number of enclosing contours if it lies within the annular region. The map is then used to generate a histogram of the number of pixels, or area A(n), contained within n or more contours. Thus, A(5) is the number of pixels enclosed by the inner boundary, A(4) is A(5) plus the number of pixels with a value of 4, and so forth, with A(1) being the number pixels enclosed by the outer boundary. The quantity A(5)/A(1) equals the ratio of areas enclosed by the inner and outer boundaries. To determine the width of the annular region from the map, we use an edge‐detection algorithm to trace out the boundary of a region with a specified pixel value. We select a threshold pixel value of 5 to trace the inner boundary and a threshold pixel value of 1 to trace the outer boundary. For each point on the outer boundary, we define the width of the annular region at that point to be the distance to the nearest inner boundary point. From this set of width “measurements” for a given anatomical structure, CT slice, and inter/intradosimetrist data set, we extract the maximum values, values corresponding to the 95th percentile width, and mean values.

A. Comparison of intradosimetrist contours

Table I summarizes for each dosimetrist, the maximum, 95th percentile, and mean discrepancies for the heart, esophagus, and spinal cord. The experience of each dosimetrist is also noted. The esophagus had a mean discrepancy of 0.3 cm and a maximum of 2.9 cm. For the heart, the mean discrepancy was 0.5 cm, and the maximum was 7.6 cm. The mean discrepancy for the spinal cord was 0.1 cm, whereas the maximum was 0.7 cm.
Table I

Maximum and mean discrepancies in delineation of three anatomic structures for each of the six dosimetrists.

Dosimetrist (experience)Anatomic structureMaximum (cm)95th percentile (cm)Mean (cm)
1Heart2.62.40.4
(1yr)Esophagus2.01.90.3
Spinal cord0.70.40.1
2Heart2.01.70.4
(3yr)Esophagus2.41.70.6
Spinal cord0.50.40.1
3Heart2.62.40.4
(10 yr)Esophagus1.10.80.2
Spinal cord0.50.40.1
4Heart1.81.70.3
Esophagus1.71.00.2
Spinal cord0.60.40.2
5Heart6.76.50.7
(3yr)Esophagus2.92.40.3
Spinal cord0.60.40.1
6Heart7.65.10.5
(15 yr)Esophagus2.01.30.3
Spinal cord0.30.30.1
All dosimetristsMaximumMaximum of 95th percentileMean
Heart7.6 cm6.5 cm0.5 cm
Esophagus2.9 cm2.4 cm0.3 cm
Spinal cord0.7 cm0.4 cm0.1 cm
Figure 2 summarizes the information in Table I on graphs of maximum discrepancy versus years of experience [(Fig. 2a)] and mean discrepancy versus years of experience [(Fig. 2b)]. It does not appear that contouring discrepancy can be correlated with the experience of the dosimetrist.
Figure 2

(a) Maximum discrepancy in contouring the heart, esophagus, and spinal cord for six dosimetrists with varying numbers of years of experience. (b) Mean discrepancy in contouring the heart, esophagus, and spinal cord for six dosimetrists with varying numbers of years of experience.

(a) Maximum discrepancy in contouring the heart, esophagus, and spinal cord for six dosimetrists with varying numbers of years of experience. (b) Mean discrepancy in contouring the heart, esophagus, and spinal cord for six dosimetrists with varying numbers of years of experience. In addition to the discrepancies in locating the boundaries of structures on individual transverse slices, discrepancies also arose in locating the boundaries of structures in the superior‐inferior direction. In particular, the heart is a structure that is displayed in only a limited number of slices in the thorax. Significant inconsistency existed as to the extent of the heart, even when the same dosimetrist repeated contouring the heart on the same CT data set. Table II summarizes the intra‐dosimetrist discrepancy in contouring the heart, for which variations of over 2 cm were found in the superior‐inferior direction.
Table II

Discrepancy in superior‐inferior delineation of the heart for each of the six dosimetrists.

DosimetristYr experienceDiscrepancy
111.2 cm
230.6 cm
3101.8 cm
432.1 cm
530.6 cm
6152.4 cm
Maximum and mean discrepancies in delineation of three anatomic structures for each of the six dosimetrists.

B. Comparison of interdosimetrist contours

Generally, interdosimetrist discrepancies were slightly greater than intradosimetrist discrepancies. The distances between the innermost contour and outermost contour varied depending on the structure. Each average distance was calculated for every slice contained in the image set. Table III summarizes the maximum and mean discrepancies for the three anatomic structures for each patient. Perhaps more remarkable than the magnitude of the discrepancies is the observation that in some transverse planes for some patients there was significant disagreement as to the location of the esophagus. Figure 3 shows an instance in which two different dosimetrists placed the esophagus in two disjoint locations. In addition, variations of over 5 cm were observed in the extent of the delineated heart in the superior‐inferior direction.
Table III

Comparison of inter‐ and intradosimetrist discrepancies in delineation of three anatomic structures in transverse planes.

Heart:Max width (cm)Mean width (cm)
InterIntraInterIntra
Patient 12.72.60.60.4
Patient 23.62.00.50.4
Patient 35.32.60.90.4
Patient 48.11.81.20.3
Patient 53.06.70.30.7
Patient 64.87.60.50.5
Esophagus:Max width (cm)Mean width (cm)
InterIntraInterIntra
Patient 11.82.00.40.3
Patient 22.42.40.50.6
Patient 32.31.10.40.2
Patient 41.81.70.40.2
Patient 53.12.90.40.3
Patient 62.22.00.50.3
Spinal cord:Max width (cm)Mean width (cm)
InterIntraInterIntra
Patient 10.80.70.20.1
Patient 20.90.50.20.1
Patient 30.80.50.30.1
Patient 40.80.60.30.2
Patient 50.50.60.20.1
Patient 60.80.30.20.1
Figure 3

(Color) A transverse plane showing the purported esophagus outlined by two different dosimetrists. Observe that there is no overlap between the esophagus as delineated by the two dosimetrists.

(Color) A transverse plane showing the purported esophagus outlined by two different dosimetrists. Observe that there is no overlap between the esophagus as delineated by the two dosimetrists.

DISCUSSION

Previous studies have demonstrated that the delineation of target volumes by radiation oncologists is highly subjective and may be significantly affected by the availability of additional information, e.g., from imaging studies. By contrast, delineation of normal anatomic structures is usually based on CT data alone and, therefore, should be somewhat less subjective. This study has found dependence on the judgment of the dosimetrist in contouring normal anatomic structures. To improve on those inconsistencies would be very desirable. Thus, an automated method should have a consistency at least as good as that which dosimetrists can achieve manually. Agreement does not yet exist on a suitable metric for assessing consistency in contouring normal‐tissue anatomic structures. Most of the earlier studies, which, addressed contouring tumor volumes, used the ratio of volumes as a metric. , , This ratio may be a useful metric when applied to the evaluation of consistency in delineation of target volumes, but the relative volumes of contoured normal‐tissue anatomic structures may not accurately reflect the clinical consequences of inconsistently delineating these structures. Recently, Cardenas et al. have used a standard statistical value, ***k, that assesses the number of voxels contained within the intersections and complements between two or more structure volumes. The connection between a value of ***k and clinical consequences has not yet been established. We propose that a distance metric may be clinically relevant for assessing contouring uniformity. During the treatment‐planning process, beam geometries and treatment portals are often determined to avoid normal‐tissue anatomic structures for which the margins can be relatively small. If the boundary contour of a critical anatomic structure is displaced from its true location, it is possible that the margin between the portal edge and the structure would be insufficient to spare irradiation of the anatomic structure. This may have significant consequences for the potential morbidity of the treatment plan. Discrepancy in superior‐inferior delineation of the heart for each of the six dosimetrists. The level of disagreement among dosimetrists in contouring normal‐tissue anatomy was somewhat surprising and may have significant consequences, especially for the esophagus, where, in several instances, contours were placed in disjoint locations. It is possible that, given the differences in location of the esophagus, significantly different beam configurations may be designed, leading to significant differences in the delivered dose and correspondingly different risks of morbidity. Comparison of inter‐ and intradosimetrist discrepancies in delineation of three anatomic structures in transverse planes.

CONCLUSIONS

Discrepancies in the manual delineation of anatomic structures can vary substantially, up to several centimeters. Generally, the discrepancies do not appear to be correlated with the experience of the dosimetrist. An automated method could standardize the delineation of anatomical structures and thereby decrease the amount of variation from patient to patient. Automated delineation would need to be at least as accurate as that achievable manually. This study will help to set a guideline for the accuracy expected of an automated method.

ACKNOWLEDGMENT

This research was supported in part by a sponsored research agreement with Philips Radiation Oncology Systems (ADAC), Milpitas, CA.
  13 in total

1.  Inter-observer variation in delineation of bladder and rectum contours for brachytherapy of cervical cancer.

Authors:  A E Saarnak; M Boersma; B N van Bunningen; R Wolterink; M J Steggerda
Journal:  Radiother Oncol       Date:  2000-07       Impact factor: 6.280

2.  Segmentation of brain 3D MR images using level sets and dense registration.

Authors:  C Baillard; P Hellier; C Barillot
Journal:  Med Image Anal       Date:  2001-09       Impact factor: 8.545

3.  Automatic outlining of regions on CT scans.

Authors:  J M Keller; F M Edwards; R Rundle
Journal:  J Comput Assist Tomogr       Date:  1981-04       Impact factor: 1.826

4.  Differences in target outline delineation from CT scans of brain tumours using different methods and different observers.

Authors:  M Yamamoto; Y Nagata; K Okajima; T Ishigaki; R Murata; T Mizowaki; M Kokubo; M Hiraoka
Journal:  Radiother Oncol       Date:  1999-02       Impact factor: 6.280

5.  Variability in target volume delineation on CT scans of the breast.

Authors:  C W Hurkmans; J H Borger; B R Pieters; N S Russell; E P Jansen; B J Mijnheer
Journal:  Int J Radiat Oncol Biol Phys       Date:  2001-08-01       Impact factor: 7.038

6.  Rectum contouring variability in patients treated for prostate cancer: impact on rectum dose-volume histograms and normal tissue complication probability.

Authors:  Claudio Fiorino; Vittorio Vavassori; Giuseppe Sanguineti; Carla Bianchi; Giovanni Mauro Cattaneo; Anna Piazzolla; Cesare Cozzarini
Journal:  Radiother Oncol       Date:  2002-06       Impact factor: 6.280

7.  Variability of target volume delineation in cervical esophageal cancer.

Authors:  P Tai; J Van Dyk; E Yu; J Battista; L Stitt; T Coad
Journal:  Int J Radiat Oncol Biol Phys       Date:  1998-09-01       Impact factor: 7.038

8.  Clinical variability of target volume description in conformal radiotherapy planning.

Authors:  J P Logue; C L Sharrock; R A Cowan; G Read; J Marrs; D Mott
Journal:  Int J Radiat Oncol Biol Phys       Date:  1998-07-01       Impact factor: 7.038

9.  Measurement of lung tumor volumes using three-dimensional computer planning software.

Authors:  Patrick Bowden; Richard Fisher; Michael Mac Manus; Andrew Wirth; Gillian Duchesne; Michael Millward; Allan McKenzie; Judy Andrews; David Ball
Journal:  Int J Radiat Oncol Biol Phys       Date:  2002-07-01       Impact factor: 7.038

10.  Computed tomography determination of prostate volume and maximum dimensions: a study of interobserver variability.

Authors:  Eric Berthelet; Mitchell C C Liu; Alex Agranovich; Kelly Patterson; Tammy Currie
Journal:  Radiother Oncol       Date:  2002-04       Impact factor: 6.280

View more
  26 in total

1.  A geodesic deformable model for automatic segmentation of image sequences applied to radiation therapy.

Authors:  G Bueno; O Déniz; J Salido; C Carrascosa; J M Delgado
Journal:  Int J Comput Assist Radiol Surg       Date:  2010-07-20       Impact factor: 2.924

2.  Automatic multiorgan segmentation in CT images of the male pelvis using region-specific hierarchical appearance cluster models.

Authors:  Dengwang Li; Pengxiao Zang; Xiangfei Chai; Yi Cui; Ruijiang Li; Lei Xing
Journal:  Med Phys       Date:  2016-10       Impact factor: 4.071

3.  Segmentation of pelvic structures for planning CT using a geometrical shape model tuned by a multi-scale edge detector.

Authors:  Fabio Martínez; Eduardo Romero; Gaël Dréan; Antoine Simon; Pascal Haigron; Renaud de Crevoisier; Oscar Acosta
Journal:  Phys Med Biol       Date:  2014-03-05       Impact factor: 3.609

4.  Quantifying the dosimetric impact of organ-at-risk delineation variability in head and neck radiation therapy in the context of patient setup uncertainty.

Authors:  Eric Aliotta; Hamidreza Nourzadeh; Jeffrey Siebers
Journal:  Phys Med Biol       Date:  2019-07-05       Impact factor: 3.609

5.  Atlas ranking and selection for automatic segmentation of the esophagus from CT scans.

Authors:  Jinzhong Yang; Benjamin Haas; Raymond Fang; Beth M Beadle; Adam S Garden; Zhongxing Liao; Lifei Zhang; Peter Balter; Laurence Court
Journal:  Phys Med Biol       Date:  2017-11-14       Impact factor: 3.609

6.  Cardiac atlas development and validation for automatic segmentation of cardiac substructures.

Authors:  Rongrong Zhou; Zhongxing Liao; Tinsu Pan; Sarah A Milgrom; Chelsea C Pinnix; Anhui Shi; Linglong Tang; Ju Yang; Ying Liu; Daniel Gomez; Quynh-Nhu Nguyen; Bouthaina S Dabaja; Laurence Court; Jinzhong Yang
Journal:  Radiother Oncol       Date:  2016-12-08       Impact factor: 6.280

7.  Consideration of dose limits for organs at risk of thoracic radiotherapy: atlas for lung, proximal bronchial tree, esophagus, spinal cord, ribs, and brachial plexus.

Authors:  Feng-Ming Spring Kong; Timothy Ritter; Douglas J Quint; Suresh Senan; Laurie E Gaspar; Ritsuko U Komaki; Coen W Hurkmans; Robert Timmerman; Andrea Bezjak; Jeffrey D Bradley; Benjamin Movsas; Lon Marsh; Paul Okunieff; Hak Choy; Walter J Curran
Journal:  Int J Radiat Oncol Biol Phys       Date:  2010-10-08       Impact factor: 7.038

Review 8.  Magnetic resonance imaging in precision radiation therapy for lung cancer.

Authors:  Hannah Bainbridge; Ahmed Salem; Rob H N Tijssen; Michael Dubec; Andreas Wetscherek; Corinne Van Es; Jose Belderbos; Corinne Faivre-Finn; Fiona McDonald
Journal:  Transl Lung Cancer Res       Date:  2017-12

9.  Automatic contouring of brachial plexus using a multi-atlas approach for lung cancer radiotherapy.

Authors:  Jinzhong Yang; Arya Amini; Ryan Williamson; Lifei Zhang; Yongbin Zhang; Ritsuko Komaki; Zhongxing Liao; James Cox; James Welsh; Laurence Court; Lei Dong
Journal:  Pract Radiat Oncol       Date:  2013-10-01

10.  On the objectivity, reliability, and validity of deep learning enabled bioimage analyses.

Authors:  Dennis Segebarth; Matthias Griebel; Christoph M Flath; Robert Blum; Nikolai Stein; Cora R von Collenberg; Corinna Martin; Dominik Fiedler; Lucas B Comeras; Anupam Sah; Victoria Schoeffler; Teresa Lüffe; Alexander Dürr; Rohini Gupta; Manju Sasi; Christina Lillesaar; Maren D Lange; Ramon O Tasan; Nicolas Singewald; Hans-Christian Pape
Journal:  Elife       Date:  2020-10-19       Impact factor: 8.140

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.