Literature DB >> 26229648

Reflections on the current status of commercial automated segmentation systems in clinical practice.

Abstract

Entities: Disease Species

Year: 2014 PMID： 26229648 PMCID： PMC4175848 DOI： 10.1002/jmrs.65

Source DB: PubMed Journal: J Med Radiat Sci ISSN： 2051-3895

× No keyword cloud information.

Delineation of multiple anatomical structures, both target and organs at risk, is a requirement for the planning of modern radiotherapy techniques such as intensity modulated radiation therapy (IMRT) and volumetric modulated arc radiation therapy (VMAT) as well as for the reporting of dose to structures for correlation with treatment outcome. Manual delineation of the many anatomical structures, particularly target structures which require a radiologist or radiation oncologists input, is time-consuming and therefore expensive. As such, it is one of the major obstacles to meeting the increasing demand for IMRT and VMAT. Automatic segmentation has for a long time held the promise of decreasing the requirement for manual delineation but has been the reserve of research institutions using ‘in-house’ software solutions.1–3 With the recent availability of commercial systems for automatic image segmentation (at least 11 now available4), there has been a rapid increase in published articles evaluating the performance of these algorithms for the automatic segmentation of structures in various anatomical sites.5–9 In this issue of the Journal of Medical Radiation Sciences, Greenham et al.10 evaluate the performance of the ABAS system (Elekta AB, Stockholm, Sweden) for delineation of the prostate and pelvic organs. As with other automatic segmentation studies, they report varying degrees of success with some structures being clinically acceptable while others require considerable manual editing. Variation in performance across patients is also observed. Indeed, this is the finding of the author's own institution (not published) where the ABAS system has been implemented for head and neck and prostate patients since 2011. Typically, there is a net reduction in the amount of time taken to delineate the contours if they are first automatically created and then manually edited. In this editorial, the different methods of automatic segmentation are introduced briefly before discussing some of their fundamental limitations.

Methods of Auto-Segmentation

For a good review of automatic segmentation methods the reader should refer to the article by Sharp et al.4 To summarise, there are two methods employed in commercial systems. These are either atlas-based segmentation (ABS) alone or in combination with model-based segmentation (MBS). How these are implemented varies from one system to another. The ABAS system used in the study by Greenham et al.10 uses a combination of both ABS and MBS. The atlas data used by ABAS consist of a number of CT scans each with their own validated set of structures which can be developed for the local institution – a set of atlases. After initial alignment with rigid body registration, deformable image registration is used to deform the CT images of each of the atlas patients to the target patient. The deformation is further refined using a final deformation stage which incorporates some knowledge (or model) of the organs being segmented. The contours from each atlas are then deformed to the target patient using their respective deformation map. In the final stage, the STAPLE algorithm11 is used to combine the multiple structures (for each organ) into a single structure by maximisation of likelihood. This combined contour is, in effect, a consensus of opinion of the contours originating from each of the atlases in the atlas set.

Limitations of Automated Segmentation Methods

There are a number of limitations to this approach which are in common with other commercial systems. The deformable image registration is never perfect for at least the following reasons. (1) Deformable image registration algorithms tend to be constrained (regularised) to avoid locally severe deformations and ensure the global integrity of the image registration. However, this can mean that any locally severe deformation that does exist in the patient does not get matched well. (2) The contrast in the images between boundaries of organs may also be indistinct leading to errors in the deformation and this can vary from patient to patient, organ to organ and even across an organ. (3) Image registration algorithms have a ‘capture range’. If a deformation is over a large distance, for example, deformation between the superior wall of a full bladder and an empty bladder, the image registration algorithm is unlikely to correctly account for this deformation because it is beyond the capture range. In an effort to reduce the impact of a poor image registration between a single atlas and a target patient, multiple atlases can be employed. Selecting the best atlas would require human intervention and so algorithms such as the STAPLE algorithm11 or majority vote have been introduced. However, these algorithms make their decision based on the set of contours alone, without reference to the underlying image data. This has two effects. If there is a random error between the deformed atlases at a point on the surface of an organ to be segmented, the combined contour may not fall exactly on the edge of the underlying image data. The larger the random error is, then the greater the likelihood of a systematic error. In this case, a larger number of atlases should help minimise the error. However, having more atlases does not always mean better segmentation. Using fill state of the bladder as an example, consider the scenario where the majority of the atlases have a small or medium-sized bladder with just one atlas having a large bladder and with the target patient having a large bladder. For the small/medium bladder atlases the deformation is beyond the capture range of the deformable image registration algorithm, so only the atlas with the large bladder is likely to give a good estimate of the true deformation. A STAPLE or majority vote algorithm will assume the grossly incorrect deformations, the majority, are correct and therefore ignore or give low weighting to the outlier that has good deformation. This illustrates the need for alternative solutions such as intelligent pre-selection of atlases, for example, sub-dividing the atlases into small, medium and large bladder atlas sets. This was alluded to by Greenham et al.10 Another potential pitfall of the STAPLE and majority vote algorithms, depending on how they are implemented, is that the consensus structures for two abutting structures may be based on a different selection of atlases. This can lead to overlapping of the two structures when the atlases are combined. A final limitation, as described by Sharp et al.,4 is that the combined structures can lead to small islands disconnected from the main structure.

Evaluating Performance of Automated Segmentation

To evaluate the performance of automatic segmentation it is common to compare the automated segmentation with manually delineated contours. A description of metrics for comparing contours is given by Sharp et al. and also in the review by Jameson et al.4,12 The dice similarity coefficient (DSC)13 is used widely but is very difficult to interpret because (1) its value is dependent on the volume being compared and (2) it gives no indication of the distance between the two contours. One relatively easy to implement solution to this is to use the normalised dice similarity coefficient nDSC,14,15 which simultaneously removes the dependence on volume and also attaches a clinical significance to the discrepancy. Another common metric used to compare contours is the mean distance to agreement. This gives a better indication of the magnitude of the error in terms of distance but can also be misleading in terms of clinical impact. For instance, a contour may be within 1 mm of the reference contour for 90% of its surface area but for 10% of the surface there could be a clinically unacceptable 10 mm discrepancy. The overall mean distance would be 1.9 mm, which could be considered as clinically acceptable even when the 10 mm discrepancy has clinical significance. Defining whether a segmentation is acceptable is not trivial. Not all anatomical structures are equal and furthermore, not all regions on the surface of an anatomical structure are equal. For instance, the impact of a 2 mm error in the definition of the femoral heads will not have as much impact on the optimisation of a treatment plan for the prostate as it will for the rectum. Furthermore, an error in delineation of the anterior wall of the rectum, where the dose gradients are highest, will have more impact on the treatment plan than an error in delineation of the posterior wall. Defining acceptability on whether the contour is within inter-delineator variation is also problematic. Inter-delineator studies will also demonstrate that the random error can be spatially variant, and this may or may not coincide with where the greatest accuracy is required for dose planning.16,17 The method used in the article by Greenham et al.10 could be criticised for not using any of the more complex metrics for comparing structures. However, their method of subjectively scoring the discrepancy of the contours inherently compensates for the variation of clinical acceptability of the discrepancy both between organs and across the surface of an organ.

Clinical Implementation of Automated Segmentation Systems

It is worth spending some time validating the atlas contours. An ABS/MBS system will never work if local contouring practices differ from those used to define the atlas or model in the first place. Implementation can be further complicated if there are a variety of contouring practices within an institution. At the author's hospital, ABAS was primed with atlas head and neck contours closely following international consensus guidelines18; but none of the clinical oncologists were following these guidelines at the time. However, all agreed that they wanted to conform to the guidelines. Consequently, ABAS was used as a tool which facilitated a change in practice. ABS or MBS systems could also be used to facilitate conformance to trial-contouring protocols where there were no previous guidelines.

Automatic Image Segmentation Systems: Are They Worth It?

If an ABS or MBS system is to be effective, then it will need to reduce the time to delineate the required anatomy considerably. Any significant time saving is likely to have a high perceived value by the individual oncologists and radiation therapists who would normally undertake this task. However, in monetary terms, it would take many patients to offset the software and hardware costs of an ABS/MBS system and the cost of the considerable time and effort required to implement the system. For example, at the author's institute, there are ∼300 head and neck cases per year which is atypically large compared to most cancer care centres. If ABS/MBS is able to save 50% of the total contouring time (estimated to be 90 min) and the hourly salary cost is $80 (AUS), then the total saving will be ca. $18,000. So without accounting for the cost of the resources required to implement the system, it could take up to 2 years before the investment starts to pay dividends. If ABS/MBS can be implemented for multiple treatment sites, then its efficiency savings may be realised sooner.

The Future of Automatic Image Segmentation

There is no doubt that automatic image segmentation is going to play a critical role in future radiotherapy. Accurate and efficient auto-segmentation is going to be a requirement if adaptive techniques, where the plan is adapted to changing anatomy on one or more occasions during treatment or on a daily basis19 or if ‘plan of the day’ type techniques, are going to become mainstream.20 In the early days of IMRT, optimisation algorithms in treatment planning software were in their infancy. Consequently, early implementation of IMRT tended to be the domain of large research-oriented centres. Over the years, the software has improved considerably with a corresponding increase in utilisation. Expectations are that automatic segmentation will also improve as deformable image registration algorithms become more refined and tailored to particular anatomical sites. The increase in use of magnetic resonance imaging as a medium for target and normal tissue delineation and in future for image guidance may also increase the accuracy and reliability of auto-segmentation due to its increased soft tissue definition. With these improvements, a corresponding uptake of the use of auto-segmentation techniques is expected.

16 in total

1. CT-based delineation of lymph node levels and related CTVs in the node-negative neck: DAHANCA, EORTC, GORTEC, NCIC,RTOG consensus guidelines.

Authors: Vincent Grégoire; Peter Levendag; Kian K Ang; Jacques Bernier; Marijel Braaksma; Volker Budach; Cliff Chao; Emmanuel Coche; Jay S Cooper; Guy Cosnard; Avraham Eisbruch; Samy El-Sayed; Bahman Emami; Cai Grau; Marc Hamoir; Nancy Lee; Philippe Maingon; Karin Muller; Hervé Reychler
Journal: Radiother Oncol Date: 2003-12 Impact factor: 6.280

2. Automated model-based organ delineation for radiotherapy planning in prostatic region.

Authors: Vladimir Pekar; Todd R McNutt; Michael R Kaus
Journal: Int J Radiat Oncol Biol Phys Date: 2004-11-01 Impact factor: 7.038

Review 3. A review of methods of analysis in contouring studies for radiation oncology.

Authors: Michael G Jameson; Lois C Holloway; Philip J Vial; Shalini K Vinod; Peter E Metcalfe
Journal: J Med Imaging Radiat Oncol Date: 2010-10 Impact factor: 1.735

4. Comparison of rigid and adaptive methods of propagating gross tumor volume through respiratory phases of four-dimensional computed tomography image data set.

Authors: Muthuveni Ezhil; Bum Choi; George Starkschall; M Kara Bucci; Sastry Vedam; Peter Balter
Journal: Int J Radiat Oncol Biol Phys Date: 2008-05-01 Impact factor: 7.038

5. Online adaptive radiotherapy of the bladder: small bowel irradiated-volume reduction.

Authors: Nichola Burridge; Ali Amer; Tom Marchant; Jonathan Sykes; Julie Stratford; Ann Henry; Catherine McBain; Pat Price; Chris Moore
Journal: Int J Radiat Oncol Biol Phys Date: 2006-11-01 Impact factor: 7.038

6. Prospective randomized double-blind pilot study of site-specific consensus atlas implementation for rectal cancer target volume delineation in the cooperative group setting.

Authors: Clifton D Fuller; Jasper Nijkamp; Joop C Duppen; Coen R N Rasch; Charles R Thomas; Samuel J Wang; Paul Okunieff; William E Jones; Daniel Baseman; Shilpen Patel; Carlo G N Demandante; Anna M Harris; Benjamin D Smith; Alan W Katz; Camille McGann; Jennifer L Harper; Daniel T Chang; Stephen Smalley; David T Marshall; Karyn A Goodman; Niko Papanikolaou; Lisa A Kachnic
Journal: Int J Radiat Oncol Biol Phys Date: 2010-04-18 Impact factor: 7.038

7. Atlas-based delineation of lymph node levels in head and neck computed tomography images.

Authors: Olivier Commowick; Vincent Grégoire; Grégoire Malandain
Journal: Radiother Oncol Date: 2008-02-14 Impact factor: 6.280

8. Three-dimensional analysis of delineation errors, setup errors, and organ motion during radiotherapy of bladder cancer.

Authors: Gert J Meijer; Coen Rasch; Peter Remeijer; Joos V Lebesque
Journal: Int J Radiat Oncol Biol Phys Date: 2003-04-01 Impact factor: 7.038

9. Clinical evaluation of multi-atlas based segmentation of lymph node regions in head and neck and prostate cancer patients.

Authors: Carl Sjöberg; Martin Lundmark; Christoffer Granberg; Silvia Johansson; Anders Ahnesjö; Anders Montelius
Journal: Radiat Oncol Date: 2013-10-03 Impact factor: 3.481

10. Evaluation of atlas-based auto-segmentation software in prostate cancer patients.

Authors: Stuart Greenham; Jenna Dean; Cheuk Kuen Kenneth Fu; Joanne Goman; Jeremy Mulligan; Deanna Tune; David Sampson; Justin Westhuyzen; Michael McKay
Journal: J Med Radiat Sci Date: 2014-08-06

6 in total

1. Atlas-based auto-segmentation for postoperative radiotherapy planning in endometrial and cervical cancers.

Authors: Nalee Kim; Jee Suk Chang; Yong Bae Kim; Jin Sung Kim
Journal: Radiat Oncol Date: 2020-05-13 Impact factor: 3.481

2. The clinical evaluation of atlas-based auto-segmentation for automatic contouring during cervical cancer radiotherapy.

Authors: Yi Li; Wenjing Wu; Yuchen Sun; Dequan Yu; Yuemei Zhang; Long Wang; Yao Wang; Xiaozhi Zhang; Yongkai Lu
Journal: Front Oncol Date: 2022-08-02 Impact factor: 5.738

6. Evaluation of a Deep Learning Algorithm for Automated Spleen Segmentation in Patients with Conditions Directly or Indirectly Affecting the Spleen.

Authors: Aymen Meddeb; Tabea Kossen; Keno K Bressem; Bernd Hamm; Sebastian N Nagel
Journal: Tomography Date: 2021-12-13