Literature DB >> 24257288

A critical evaluation of the PTW 2D-ARRAY seven29 and OCTAVIUS II phantom for IMRT and VMAT verification.

Mohammad Hussein¹, Elizabeth J Adams, Thomas J Jordan, Catharine H Clark, Andrew Nisbet.

Abstract

Quality assurance (QA) for intensity- and volumetric-modulated radiotherapy (IMRT and VMAT) has evolved substantially. In recent years, various commercial 2D and 3D ionization chamber or diode detector arrays have become available, allowing for absolute verification with near real time results, allowing for streamlined QA. However, detector arrays are limited by their resolution, giving rise to concerns about their sensitivity to errors. Understanding the limitations of these devices is therefore critical. In this study, the sensitivity and resolution of the PTW 2D-ARRAY seven29 and OCTAVIUS II phantom combination was comprehensively characterized for use in dynamic sliding window IMRT and RapidArc verification. Measurement comparisons were made between single acquisition and a multiple merged acquisition techniques to improve the effective resolution of the 2D-ARRAY, as well as comparisons against GAFCHROMIC EBT2 film and electronic portal imaging dosimetry (EPID). The sensitivity and resolution of the 2D-ARRAY was tested using two gantry angle 0° modulated test fields. Deliberate multileaf collimator (MLC) errors of 1, 2, and 5 mm and collimator rotation errors were inserted into IMRT and RapidArc plans for pelvis and head & neck sites, to test sensitivity to errors. The radiobiological impact of these errors was assessed to determine the gamma index passing criteria to be used with the 2D-ARRAY to detect clinically relevant errors. For gamma index distributions, it was found that the 2D-ARRAY in single acquisition mode was comparable to multiple acquisition modes, as well as film and EPID. It was found that the commonly used gamma index criteria of 3% dose difference or 3 mm distance to agreement may potentially mask clinically relevant errors. Gamma index criteria of 3%/2 mm with a passing threshold of 98%, or 2%/2 mm with a passing threshold of 95%, were found to be more sensitive. We suggest that the gamma index passing thresholds may be used for guidance, but also should be combined with a visual inspection of the gamma index distribution and calculation of the dose difference to assess whether there may be a clinical impact in failed regions.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2013 PMID： 24257288 PMCID： PMC5714639 DOI： 10.1120/jacmp.v14i6.4460

Source DB: PubMed Journal: J Appl Clin Med Phys ISSN： 1526-9914 Impact factor: 2.102

I. INTRODUCTION

Intensity‐modulated radiotherapy, delivered using step‐and‐shoot or dynamic sliding window multileaf collimators (MLC), has facilitated complex treatments whereby the dose can be conformed to a target volume whilst minimizing the dose to surrounding normal tissue. Volumetric‐modulated arc therapy (VMAT), which combines dynamic MLC delivery along with gantry speed and dose rate modulation, has been a relatively new development.( , As with the treatment capabilities, quality assurance (QA) for IMRT and VMAT has evolved substantially to meet the increasing treatment complexity. In recent years, various commercial 2D and 3D ionization chamber or diode detector arrays have become available, allowing for verification of absolute dose with immediate results. Conventional methods, such as ionization chamber point dose measurements and film dosimetry, are gradually being replaced by detector arrays. These devices have allowed centers to streamline their QA and increase the number of patients treated with IMRT and VMAT. However, detector arrays are limited by their resolution, giving rise to concerns about their sensitivity to errors. Commercial detector arrays available include the (ScandiDos AB, Uppsala, Sweden), ArcCHECK/MapCHECK (Sun Nuclear Corp., Melbourne, FL), MatriXX (IBA Dosimetry GmbH, Schwarzenbruck, Germany), and 2D‐ARRAY seven29 (PTW, Freiburg, Germany). Various studies have previously been performed to assess the sensitivity to IMRT and/or VMAT simulated errors for the , , , ArcCHECK,( , and MapCHECK.( , For the OCTAVIUS II octagonal phantom and 2D‐ARRAY seven29 combination there is limited data available for composite plan verification for dynamic IMRT or Varian RapidArc (Varian Medical Systems, Palo Alto, CA). Masi et al. compared the 2D‐ARRAY seven29 with other dosimetry systems, including and MapCHECK, in the detectability of MLC errors in Elekta VMAT (Elekta, Stockholm, Sweden). Spezi et al. and Poppe et al. found that for step‐and‐shoot IMRT, 1 mm MLC errors could be detected for per‐beam planar verification using the 2D‐ARRAY. The purpose of this study was to systematically characterize the sensitivity and resolution of the PTW 2D‐ARRAY seven29 and OCTAVIUS II phantom combination, using simulated errors, for optimal use in composite clinical dynamic IMRT and Varian RapidArc verification, including a comparison between a multiple 2D‐ARRAY acquisition technique to improve the effective resolution of the 2D‐ARRAY, and with the single acquisition technique. Comparisons were also performed against GAFCHROMIC EBT2 film (International Specialty Products, Wayne NJ).

II. MATERIALS AND METHODS

A. PTW 2D‐ARRAY seven29 and OCTAVIUS II phantom

The PTW 2D‐ARRAY seven29 consists of a matrix of 729 cubic vented ionization chambers with cross section, spaced 1 cm center‐to‐center, giving a total area of . The upper electrode layer sits below a 0.5 cm polymethyl methacrylate (PMMA) build‐up layer, whereas the lower electrode layer lies on top of a 0.2 cm thick electrode plate which itself is mounted on a 1 cm PMMA base plate. The OCTAVIUS phantom has an octagonal shape in its cross section, and is designed to allow composite rotational IMRT plan verification. The phantom is made of polystyrene which has a physical density of 1.04g/cm3. Its dimensions are 32 cm wide, 32 cm long, and 32 cm in height, and has a central cavity for the 2D‐ARRAY. Throughout this study, the OCTAVIUS II phantom was used in the coronal (horizontal) orientation. The OCTAVIUS phantom was CT scanned twice, once with the 2D‐ARRAY in situ and once with a homogeneous insert, for comparison (Fig. 1). For composite field measurements, the base of the OCTAVIUS contained a semicircular air gap to correct for the inherent under‐response of the 2D‐ARRAY when the radiation field is incident posteriorly, as described by Van Esch et al. For planning, the phantom was scanned with a solid base.

Figure 1

CT scan of the OCTAVIUS phantom with 2D‐ARRAY (left) in situ and homogeneous insert (right).

A.1 Basic commissioning tests

Basic commissioning tests of the 2D‐ARRAY were performed on a Varian Clinac iX (Varian Medical Systems). The Clinac incorporates the Millennium 120‐leaf MLCs, with the central 80 MLCs covering , each having a 0.5 cm width at the isocenter; the remaining MLCs have 1 cm width. The methodology used was in keeping with previously published reports and is, therefore, not discussed in detail.( , , , ) The effective point of measurement (EPOM) was determined following the methodology of Poppe et al. and Van Esch et al. Dose linearity was determined between 5–2500 cGy. Dose rate linearity was checked for dose rates ranging from 100–600 MU/min. Output versus field size were checked for through to cm field sizes. Profiles for both plain and wedged field sizes of , and were measured by the 2D‐ARRAY and compared with diode data measured in a Scanditronix Wellhöfer RFA water tank (IBA Dosimetry GmbH, Schuarzenbruck, Germany) at the same depth. For all the tests, except EPOM, the setup was such that the effective depth of the 2D‐ARRAY was at 5 cm. The beam energy used throughout was 6 MV , focus‐to‐surface distance (FSD) was 100 cm, and 100 MU was used in all cases, except the dose linearity check. For the linearity measurements, a field size was used. Furthermore, the detectors within the 2D‐ARRAY have a relative calibration against the central detector and this was confirmed by setting an isocentric field with a field size to cover all the detectors within the 2D‐ARRAY and assessing uniformity by looking at a profile through each line of detectors, compared to a water tank profile.

A.2 Cross‐calibration procedure

The 2D‐ARRAY was calibrated using a cross‐calibration procedure. In this procedure a known dose was delivered and the response of the central detector was used to calculate a crosscalibration factor. This factor was applied to the entire matrix. For planar measurements, the 2D‐ARRAY was set up at an effective depth of 5 cm in solid water (Gammex Inc., Middleton, WI), and with 10 cm solid water backscatter. For composite measurements, the 2D‐ARRAY was setup within the OCTAVIUS phantom and the cross‐calibration procedure was performed in those conditions.

A.3 Comparisons of using the OCTAVIUS scan with 2D‐ARRAY in situ vs. homogeneous scan

Calculating on the scan of the OCTAVIUS with the 2D‐ARRAY in situ with an advanced calculation algorithm may result in perturbation of the predicted dose by the air‐filled ion chambers, which may add to uncertainties, particularly when using the gamma index analysis. Therefore, a dosimetric comparison was performed between using the OCTAVIUS scan with the 2D‐ARRAY in situ and a homogeneous insert. Firstly, the directional response of the OCTAVIUS phantom was assessed by delivering a field in 15° gantry angle increments at 6 MV with the phantom set up isocentrically. The dose to the central detector was recorded. To avoid irradiating through the couch, the sectors comprising the first 180° were measured with the OCTAVIUS phantom in the normal setup, and the remaining sectors were measured by inverting the phantom. The expected dose at the central detector was calculated in the Varian Eclipse v8.9 treatment planning system (Varian Medical Systems) using the analytical anisotropic algorithm (AAA) v8.9 algorithm for both scans. All clinical plan composite measurements, described below in the Materials & Methods section C.2, were evaluated using predicted doses calculated on both scans to compare the sensitivity of the gamma index analysis.

B. Multiple acquisition modes in the 2D‐ARRAY

In the PTW VeriSoft software, it is possible to merge multiple measurement acquisitions, as proposed by Spezi et al. The sequence of measurements is as follows: a measurement is performed at the central axis; the 2D‐ARRAY is moved 0.5 cm inferior; the 2D‐ARRAY is shifted 0.5 cm to the right; and the 2D‐ARRAY is shifted 0.5 cm superior. By performing the above sequence and merging the measurements, it was possible to effectively increase the total number of measurement points four‐fold from 729 to 2916, and improve the detector spacing from 1 cm to 0.5 cm center‐to‐center. For planar measurements, this can be easily achieved by automated couch movements. However, for composite measurements using the OCTAVIUS phantom, the 2D‐ARRAY must be shifted within the phantom and an insert is available to facilitate this. It may not be practical to perform this for every clinical plan verification. Therefore, the effect of different acquisition techniques was compared. All of the test fields and clinical plans described below were measured using the multiple acquisition technique. Comparisons were then performed between: single acquisition merging in the lateral direction only (by performing two acquisitions) merging in the longitudinal direction only full merge after four acquisitions The gap between each ion chamber in the 2D‐ARRAY is 5 mm wide. as can be seen in the schematic in Fig. 2. Suppose that only a single 5 mm MLC leaf was being sampled. In this case, three possibilities may occur for a collimator rotation of 0°: i) direct overlap between the leaf and a line of detectors, ii) partial overlap with a line of detectors, or iii) complete miss if the leaf aligns with the gap between the lines of detectors.

Figure 2

Schematic of the overlap between the 5 mm Varian Millenium MLCs and ion chambers within the 2D‐ARRAY for a collimator rotation of 0° (as typically used for static gantry IMRT) and 30° (as typically used for RapidArc). The dashed line indicates the central axis.

In normal situations, whereby the setup is such that the field's crosshairs align with the marks on the 2D‐ARRAY and OCTAVIUS phantom, the central axis will intersect the central detector. In this case, scenario ii will occur (illustrated in Fig. 2) for static gantry IMRT, where the collimator is typically set to 0°, and for RapidArc, where the collimator angle may be typically set to 30°. However, scenario i and iii above would occur if a superior‐inferior movement of 0.5 cm is performed. In this case, every other MLC leaf will directly overlap with a row of detectors for a collimator angle of 0°, and the remainder will be missed. This effect is minimized where there is a collimator rotation. Therefore in order to test the limits of the 2D‐ARRAY, comparisons were also performed using acquisition #2 in the measurement sequence described above. Schematic of the overlap between the 5 mm Varian Millenium MLCs and ion chambers within the 2D‐ARRAY for a collimator rotation of 0° (as typically used for static gantry IMRT) and 30° (as typically used for RapidArc). The dashed line indicates the central axis.

C. Deliberate errors

The resolution and sensitivity of the 2D‐ARRAY was tested by a number of methods. All plans described in the following subsections were created using Eclipse (Varian Medical Systems), and calculations were performed using the AAA with a 0.25 cm grid spacing. Measurements were performed on the same Varian Clinac iX over two sessions. Plans were generated to make optimal use of the 0.5 cm MLCs. The 2D‐ARRAY was cross‐calibrated in the morning and afternoon of each session to account for any output fluctuation. In all cases, the normal plan (i.e., with no errors) was measured for baseline. Measurements were also performed using GAFCHROMIC EBT2 film in the OCTAVIUS phantom in the same plane as the 2D‐ARRAY, using the film insert provided with the phantom. In the case of the film measurements, the solid OCTAVIUS base was used and plans were calculated on a homogeneous scan. GAFCHROMIC films were processed and analyzed 24 hours after exposure. Films were scanned using the Epson Espression 10000 XL flatbed colour scanner (US Epson, Long Beach, CA) at a resolution of 75 dpi, using the red channel. In all cases, the gamma index (y) method of evaluation was used with a 20% threshold. Various criteria for γ were analyzed, including the commonly used 3% dose difference and 3 mm distance‐to‐agreement (DTA) criteria. For the 2D‐ARRAY, analysis was performed using the PTW VeriSoft software version 4.1. For film, analysis was performed using the IBA OmniPro‐I'mRT software version 1.7. Both the film and plan data were normalized to 100% at a point in a high‐dose low‐gradient region, to perform a relative comparison. This procedure is commonly used for film analysis due to the known difficulty in performing an absolute dose calibration for film. In order to maintain a consistent comparison, the 2D‐ARRAY data was also rescaled in the same way as the film. In both cases, the normalization point for the gamma analysis was kept consistent for any particular set of measurement; for example, in the prostate IMRT plan with different errors introduced, the normalization value was always kept the same to avoid bias.

C.1 Gantry angle 0° test fields

As a starting point, it was necessary to understand the limitations of the 2D‐ARRAY in its basic IMRT measuring mode — that is setting the gantry to 0° and delivering a modulated field such that the 2D‐ARRAY is orthogonal to the beam. The aim was to investigate the two following questions: How does the detector spacing of the 2D‐ARRAY affect the measurement and visualization of a highly modulated field? How does the detector spacing affect the sensitivity of the gamma index in a modulated field? Therefore, two individual planar test fields were designed specifically with the aim of addressing these questions. The test fields described below were also measured using the Varian aS1000 Electronic Portal Imaging Device (EPID) and GAFCHROMIC film. The first test field was designed to test the sensitivity of the gamma index analysis calculated in the 2D‐ARRAY using a modulated field with regions ranging from subtle to significant. This test will be referred to as the “sensitivity test”. The test had 54 regions of varying width and dose difference introduced into an open field using the fluence dose painting tool in Eclipse, as shown in Fig. 3 (left). The minimum spot size that the fluence painting tool allowed was 3 mm width and 5 mm height. As such, the columns in the test varied between one to six adjacent fluence spots (i.e., the width varied from 3 mm in the first columns up to 15 mm in the last column). Each row had a height of 5 mm and corresponded to a MLC leaf, and the gap between each region was 5 mm. The difference in dose between each row and the high‐dose background ranged between 1% and 10%. The measurement of the field was compared to the open field predicted dose to determine the minimum detectable error by means of the gamma index analysis. Parameters for the analysis were varied from 1%‐10% dose difference, and 1–3 mm DTA.

Figure 3

Single gantry test fields for sensitivity (left) and resolution (right) assessment. In the sensitivity test, the regions vary in width from left to right between 3 mm and 15 mm, and the dose difference relative to the background varies from top to bottom by 1 % to 10%. In the resolution test, the values in the lower half represent difference in percent dose between the regions and the background (lime green) area. In both tests, each row represents a single MLC leaf.

In the second test, a highly modulated field was created by dose painting varying dose and spatial information into an initially uniform field, as shown in Fig. 3 (right). This field is more complex than a field encountered clinically and tests the limits of the 2D‐ARRAY The first two lines in the field were offset from each other by 5 mm. This meant that due to the design of the 2D‐ARRAY, the resolution in the lateral direction could be tested. The third line increased in size in the longitudinal direction and, therefore, the resolution could be tested in this direction. The remaining six lines were used as a combined spatial and dose resolution test. This test will be referred to as the “resolution test”. The field was delivered to the 2D‐ARRAY to test how well it performs in distinguishing the regions. Single gantry test fields for sensitivity (left) and resolution (right) assessment. In the sensitivity test, the regions vary in width from left to right between 3 mm and 15 mm, and the dose difference relative to the background varies from top to bottom by 1 % to 10%. In the resolution test, the values in the lower half represent difference in percent dose between the regions and the background (lime green) area. In both tests, each row represents a single MLC leaf.

C.2 Clinical IMRT and RapidArc plan errors

To assess the sensitivity and resolution of the 2D‐ARRAY in composite clinical plan verification using the OCTAVIUS phantom, a range of errors were simulated. Firstly, deliberate single MLC leaf positional errors of 1 mm, 2 mm, and 5 mm were introduced, consistently throughout the dynamic leaf motion, into pelvic and head & neck IMRT and RapidArc clinical plans to simulate subtle MLC mechanical errors. This was achieved by editing each control point within the RT Plan DICOM files. Measurements were evaluated against the TPS calculated dose distribution. Plans included five‐field prostate IMRT, five‐field head & neck IMRT, single arc prostate RapidArc, and two‐Arc head & neck RapidArc. All plan files were reloaded into the Eclipse TPS and calculated to ensure the fields were deliverable. The test fields were delivered using the same monitor units (MUs) as the original plan and compared against the original unedited plan. Deliberate collimator rotation errors of 1°, 2° and 5° were also introduced into a two‐arc prostate and pelvic nodes RapidArc plan. A further test was created using the fluence editing tool in Eclipse to manually paint hot and cold dose spots of varying dimensions into a field in a prostate and pelvic node IMRT plan. In this case a recalculation of the leaf sequence was required; however, the intention with this test was to introduce significant errors to test the sensitivity and the resolution of the 2D‐ARRAY. All clinical error plans were also calculated on the OCTAVIUS scan with homogeneous insert. This meant it was possible to simulate the expected gamma index pass rate in ‘ideal’ conditions (i.e., no output fluctuation and no inherent mechanical effects on the other unperturbed MLC leaves). This was done by exporting the normal plan predicted dose plane and the dose plane for each error plan. The expected gamma index was calculated in both VeriSoft and OmniPro to check how well they agreed. The expected passing rate for each plan was taken as the average of the VeriSoft and OmniPro calculation.

C.3 Effect of normalization point

The effect of choosing a point for the gamma index evaluation was investigated to assess whether there would be an influence on results. The analysis described above was repeated by deliberately choosing a dose point in a region where there was an MLC error, and by choosing the dose based on a mean value over the high‐dose region. In order to facilitate the latter, a custom spreadsheet was generated in Microsoft Excel 2007 (Redmond, WA). The spreadsheet was created such that it was possible to import the 2D‐ARRAY measurement and predicted dose. The local percent dose difference was calculated on a per detector basis, by comparing the measurement against the corresponding predicted dose. Customizable thresholds were also written into the spreadsheet such that the user may choose a lower and upper threshold for any value between 0% and 100%. The maximum dose point was taken as 100%. Assuming that the ICRU Report 83 conditions were met for the high‐dose region, the coverage would range between 95% and 107% of the prescribed dose (i.e., a range of 12%). Allowing for changes in the homogeneity when the plan was transferred to the OCTAVIUS phantom, a lower threshold of 85% was used to ensure complete sampling of the primary PTV region. This spreadsheet was generated because it was found that the commercial systems (VeriSoft and OmniPro) limited the lower threshold to a maximum of 30%, whether for dose difference or gamma analysis. It was then possible to acquire various statistics such as the mean of all the dose differences and standard deviation. The spreadsheet was also written in a way that a comparison may be performed between any predicted dose plane versus another.

D. Dosimetric and radiobiological evaluation of clinical plan errors

The dosimetric impact of the subtle MLC errors was assessed using the spreadsheet described above. The predicted dose due to a MLC error was exported to compare against the unperturbed predicted dose. The expected local dose difference caused by the deliberate MLC errors was calculated and compared to that found by the 2D‐ARRAY. Additionally, the mean dose difference over a high‐dose region was also calculated. In addition to the dosimetric impact, it was also possible to determine whether there is a theoretical radiobiological effect due to the errors introduced into the clinical plans, as described by Carver et al. Tumor control probability (TCP) calculations and normal tissue complication probabilities were performed in BIOPLAN. TCP was calculated using the mechanistic Poisson‐based TCP model. The following input parameters for the TCP models for prostate tumors were used: radiosensitivity parameter , interpatient variation in radiosensitivity parameter , clonogenic cell density , as well as an ratio of 10 Gy. For squamous cell carcinoma, parameters were chosen as . In the prostate cases, NTCP calculations were performed for the rectum using the Lyman‐Kutcher‐Bauman model, , , generalized uniform dose concept, and Quantative Analysis of Normal Tissue Effects in the Clinic recommended best parameter estimates of , volume effects parameter , slope parameter , and the dose for 50% complication probability . For the bladder, there is limited NTCP parameter data due to difficulties in fitting parameters to genitourinary toxicity. The general consensus is to use the parameters of , in conjunction with . In the head & neck plans, NTCP values were calculated for spinal cord and parotids. For spinal cord, parameters for myelopathy were taken as . For xerostomia, .

E. Data and statistical analysis

To perform a quantitative analysis between the different permutations described above, a range of gamma index passing criteria was recorded, including the commonly used 3%/3 mm. For each passing criteria, the percentage of detectors with was recorded. We propose an analysis technique, whereby cumulative histograms of the percentage of planes measured against the percentage of detectors/pixels passing with are plotted. This makes it possible to compare the sensitivity of different passing criteria and different measurement technique qualitatively. In addition to the above analysis method, agreement between passing rates in the different measurement permutations and the 2D‐ARRAY measurement in single acquisition mode was statistically assessed using the concordance correlation coefficient, . In the case of poor agreement, the statistical significance of any difference was assessed using the Wilcoxon signed‐rank test with as the threshold for significance.

III. RESULTS

A. Basic commissioning

Dose linearity between 5–2500 cGy had a Pearson correlation coefficient, r, of 1.0. Dose rate linearity was found to be within between 100–600 MU/minute. Profiles of field sizes ranging from to and wedged profiles measured with the 2D‐ARRAY agreed within measurement uncertainty with profiles measured using a diode in a water tank (the concordance correlation coefficient, , was for all). Basic commissioning data agreed well with previously reported data.( , , , ) The EPOM was measured to be 0.75 cm from the surface, in agreement with the operational manual and published data.

A.1 Directional response in the OCTAVIUS scan with 2D‐ARRAY in situ vs. homogeneous scan

The result of the directional response evaluation can be seen in Fig. 4. The graph shows the difference between the measured and expected dose in the central detector within the 2D‐ARRAY as a function of gantry angle. The difference between the total dose given to the central detector and expected was 0.3% when the 2D‐ARRAY was scanned in the OCTAVIUS phantom and was when a homogeneous insert was used. It can be seen that using a homogeneous scan results in a significant underresponse when the beam incidence is lateral or entering the ARRAY through an oblique direction. This is due to the lack of modeling of the inhomogeneities caused by the vented ion chambers within the 2D‐ARRAY. When the scan of the 2D‐ARRAY was used, this improved the response. It is worth noting that this comparison is reported for the AAA algorithm. Calculating on the 2D‐ARRAY scan using the pencil beam convolution algorithm with heterogeneity correction yielded, as expected, a similar result to that seen with the homogeneous insert calculated using AAA.

Figure 4

Angular response of the 2D‐ARRAY; comparison using predicted doses in the 2D‐ARRAY scan and in the homogeneous scan.

B. Gantry angle 0° test fields

Measurements of the resolution test field for the 2D‐ARRAY in single acquisition and full merge mode, GAFCHROMIC film, and EPID are shown in Fig. 5. The 2D‐ARRAY was able to distinguish dose differences, but there was a smoothing effect in the single acquisition which was less apparent when the effective resolution was improved to 5 mm.

Figure 5

(left to right) GAFCHROMIC film of resolution test, 2D‐ARRAY measurement (single acquisition), 2D‐ARRAY measurement (fully merged), EPID measurement.

Passing rates for the sensitivity test using various gamma index criteria are shown in Fig. 6. Data have been plotted for the 2D‐ARRAY in single and full merge acquisition modes, film, EPID, and expected passing rate. It can be seen that the DTA criteria had a minimal impact in this test field for all the permutations, except for the GAFCHROMIC film. The single acquisition 2D‐ARRAY data can be seen to be the least sensitive when compared to the expected passing rate; however, this improved when a full merge was performed. As the dose difference criterion was increased, the different systems began to converge. The EPID measurement was found to be the most sensitive for planar field measurements.

Figure 6

Gamma index passing rates for the sensitivity test field for different measurement permutations. Points have been linked to provide a visual guide.

(left to right) GAFCHROMIC film of resolution test, 2D‐ARRAY measurement (single acquisition), 2D‐ARRAY measurement (fully merged), EPID measurement. Gamma index passing rates for the sensitivity test field for different measurement permutations. Points have been linked to provide a visual guide.

C. Dosimetric and radiobiological impact of the deliberate composite clinical plan errors

The ability of the 2D‐ARRAY to detect local dose differences caused by the MLC errors is shown in Fig. 7. There was a statistically good agreement between the dose difference detected by the 2D‐ARRAY and the expected difference . A 1 mm MLC error caused up to a 1% local dose difference whereas, for a 2 mm error, this was between 1% and 3% and for a 5 mm MLC error, local dose difference of between 3% and 6% was observed.

Figure 7

Expected vs. measured local dose difference due to the MLC errors.

In the prostate IMRT plan, a 5 mm MLC error resulted in a 0.4% rectal NTCP increase, whereas in the RapidArc plan, this was 1.2%. For a 2 mm error, the increase in the rectal NTCP for IMRT and RapidArc plans was 0.3% and 0.9%, respectively. In the prostate and nodes plan with collimator rotation errors, a 1° and 2° error resulted in an increased rectal NTCP of 3.0% and 3.2%, respectively. In all prostate plans, bladder NTCPs were found to be 0%, although this may not be clinically relevant and is due to the difficulty of fitting parameters to genitourinary toxicity. In the head & neck plans, NTCP values for spinal cord did not increase and were 0.2% for all IMRT plans and 0.1% for all RapidArc plans; these values are in keeping with published data on the incidence of myelopathy at the 45 Gy level. Similarly for the parotids, the maximum increase was limited to 0.2%. As expected, TCP values increased in all plans due to the increase in local dose from the MLC errors. This was as high as an increase of 3% for a 5 mm MLC error and a collimator rotation error of 2°. Expected vs. measured local dose difference due to the MLC errors.

D. Composite verification of clinical plans

All the unperturbed plans had a passing rate of 100% using 3%/3 mm. When using 2%/2 mm, the passing rate for all the plans was greater than 97%. The 5 mm systematic MLC errors were detected using 3%/3 mm in the 2D‐ARRAY in the IMRT plans and in the prostate single‐arc RapidArc plans. However, the 2 mm systematic errors were difficult to detect using 3%/3 mm; the γ in the region where the errors occurred was increased in comparison to the surrounding area, but was still , and hence would not be detected as a fault. The error was detectable at 2%/2 mm. For the head & neck two‐arc RapidArc plan, none of the errors were visible in the measurement and were also found to have a low impact in the expected gamma index maps. This is due to the plan having opposing collimator rotations on each arc to minimize the tongue‐and‐groove effect and the errors may have been largely cancelled out. For the prostate and pelvic nodes RapidArc plan with collimator rotation errors, 3%/3 mm gave a passing rate of for deliberate 1° and 2° errors, and reduced to 92% in the presence of a 5° error. The 1° error would have still passed at 2%/2 mm with a passing rate of 99.3%. The 2° error, however, resulted in a passing rate of 94.1% and would have failed if a 95% threshold was used. At 2%/2 mm, the passing rate for the 5° error plan was 74.6%. In the cases where the deliberate errors were detectable using 2%/2 mm, a passing criteria of 3%/2 mm would have passed if a passing threshold of 95% was used; however, had a passing threshold of 98% been used, then these measurements would have failed. Table 1 gives a summary of the average and minimum percentage of detectors/pixels passing with in all the plans, for 3%/3 mm, 3%/2 mm, and 2%/2 mm passing criteria for the different acquisition permutations.

Table 1

Summary of mean and minimum gamma index passing criteria for all various measurement permutations. A lower number indicates greater sensitivity to error detection. The concordance correlation coefficient, , is also given, assessing agreement with single 2D‐ARRAY acquisition

		Percent Detectors/Pixels Passing with γ<1 and ρc
		3%/3 mm			3%/2 mm			2%/2 mm
Device	Acquisition	Mean	Min	ρc	Mean	Min	ρc	Mean	Min	Pcρc
2D‐ARRAY	Single	98.9	92.0	−	97.9	82.6	−	94.7	74.6	−
	Merged lateral	98.8	91.3	0.984	97.8	82.0	0.979	95.0	74.0	0.957
	Merged longitudinal	98.6	88.8	0.948	97.6	80.9	0.967	94.6	72.3	0.966
	Merged full	98.6	89.5	0.922	97.5	80.9	0.941	94.8	72.1	0.912
	Shift 5 mm longitudinal ^a	98.5	85.7	0.875	97.8	79.2	0.959	95.1	69.9	0.910
	Homogeneous scan	99.0	91.8	0.851	98.3	86.6	0.795	96.1	74.6	0.703
GAFCHROMIC film		98.5	95.5	0.204	95.4	87.8	0.060	92.1	81.3	0.062

Acquisition number 2 as described in the Material & Methods section B.

The analysis of the effect of choosing a normalization point found that there was no significant difference between choosing a point in an unperturbed region, a point in an error region, or mean dose within the 85% isodose at 3%/3 mm or 3%/2 mm. At 2%/2 mm, there was a reduction in the passing rate in the analysis based on mean dose by 0.5% compared to the other two normalization techniques. This reduction was small but statistically significant . Summary of mean and minimum gamma index passing criteria for all various measurement permutations. A lower number indicates greater sensitivity to error detection. The concordance correlation coefficient, , is also given, assessing agreement with single 2D‐ARRAY acquisition Acquisition number 2 as described in the Material & Methods section B.

E. Comparison between expected gamma index passing rates in VeriSoft and OmniPro

There was good statistical agreement between the expected gamma index passing rates calculated in VeriSoft and OmniPro as indicated by the concordance correlation coefficient ( for all passing criteria). The average difference between the passing rates calculated by VeriSoft and OmniPro was 0.5% and 1.1% for 3%/3 mm and 2%/2 mm criteria, respectively. The difference was found to be statistically not significant ( for all). It was therefore reasonable to use the average passing rate for the expected gamma index calculated by both software for each clinical plan to compare against that measured by 2D‐ARRAY and GAFCHROMIC film.

F. Comparison Between Evaluations using CT Scan with 2D‐Array in situ vs. Homogeneous Insert

As shown in Fig. 8(a) and Table 1, there was a small difference between passing rates using a criteria of 3%/3 mm. However, at 2%/2 mm, using the scan with the 2D‐ARRAY in situ appeared to be more sensitive to errors than comparing against the predicted dose calculated on the homogeneous scan; for criteria of 2%/2 mm, the average passing rate was 95.7% compared to 88.5% in the 2D‐ARRAY scan.

Figure 8

Cumulative histogram of gamma index analysis using a passing criteria of 3%/3 mm for: (a) comparison of using 2D‐ARRAY in situ scan vs. homogeneous scan, (b) comparison of different acquisition techniques, and (c) comparison of 2D‐ARRAY vs. GAFCHROMIC film measurements.

G. Single vs. multiple 2D‐ARRAY acquisition modes

There was no significant difference in the gamma index passing rate, at either 3%/3 mm or 2%/2 mm, between performing single or multiple acquisitions, as shown in the cumulative histogram in Table 1 and in Fig. 8(b) for (3%/3 mm). Figure 9 shows the gamma index maps and passing rates for the prostate IMRT plan with a 5 mm MLC error for single versus multiple acquisitions. It can be seen that performing a merged lateral acquisition is visually comparable to a single acquisition, whereas slightly improved resolution is achieved by either merging longitudinally or performing a full merge of four acquisitions. It can also be seen in Fig. 9 and Table 1 that the single acquisition was comparable to the acquisition shifted 5 mm on the longitudinal axis, demonstrating no significant reduction in sensitivity to errors. Overall the acquisition shifted 5 mm longitudinal was found to be the most sensitive acquisition position, based on the gamma index passing rates.

Figure 9

Gamma maps using 3%/3 mm criteria showing the effect of multiple acquisition modes for prostate IMRT plan with 5 mm MLC error: (a) single acquisition, passing rate 96.8%; (b) two merged acquisitions with array shifted lateral for second acquisition, 97.0%; (c) two merged acquisitions with array shifted longitudinal for second acquisition, 96.2%; (d) four merged acquisitions to give effective 5 mm resolution, 96.8%; (e) acquisition with 5 mm shift in the longitudinal direction, 96.1%; and (f) predicted gamma index distribution, 99.5%.

H. 2D‐ARRAY vs. GAFCHROMIC film

Figure 10 shows a comparison between the gamma index distribution (using 3%/3 mm) in the 2D‐ARRAY, GAFCHROMIC film, and expected gamma index distribution for the head & neck IMRT plan with a 5 mm MLC error and prostate and nodes with randomly distributed errors. Regions of failure were comparable between the 2D‐ARRAY and GAFCHROMIC film, with the 2D‐ARRAY exhibiting the blurred effect due to its resolution. Neither system picked up all the errors in the prostate and nodes plan with random errors.

Figure 10

Comparison between gamma index distribution at a passing criteria of 3%/3 mm for the head & neck IMRT plan with 5 mm MLC error: (a) predicted, (b) 2D‐ARRAY, (c) GAFCHROMIC film; and for the prostate & nodes plan with randomly distributed errors for: (d) predicted, (e) 2D‐ARRAY, (f) film.

Average and minimum gamma index passing rates using criteria of 3%/3 mm were comparable for the 2D‐ARRAY and film, as shown in Table 1 and the cumulative histogram in Fig. 8(c). At 2%/2 mm the 2D‐ARRAY resulted in a higher overall passing rate. For a passing rate of 85% or below, the 2D‐ARRAY and GAFCHROMIC film were comparable at 2%/2 mm. For passing criteria of 3%/3 mm, all film planes achieved 95% passing rate or above; for the 2D‐ARRAY, this was found to be 90.5% of measured planes. At 2%/2 mm 33.3% of film planes achieved a passing rate of 95% or above, whereas for the 2D‐ARRAY it was 66.7%. Statistically, there was a poor agreement between 2D‐ARRAY and film as given by for each passing criteria. The difference between 2D‐ARRAY and GAFCHROMIC film was statistically significant for passing criteria , however it was not significant for . When compared against the expected gamma passing rate, the 2D‐ARRAY result had a statistically more significant agreement than GAFCHROMIC film . Comparison between gamma index distribution at a passing criteria of 3%/3 mm for the head & neck IMRT plan with 5 mm MLC error: (a) predicted, (b) 2D‐ARRAY, (c) GAFCHROMIC film; and for the prostate & nodes plan with randomly distributed errors for: (d) predicted, (e) 2D‐ARRAY, (f) film.

IV. DISCUSSION

For planar measurements of IMRT fields, the 2D‐ARRAY in single acquisition mode performed the worst in measuring the sensitivity and resolution test fields. Spatial resolution was significantly affected; however, dose resolution was less affected. This was due to the sparse resolution of 1 cm. Improvements were found when a full merge acquisition was performed. In measuring individual IMRT fields with the 2D‐ARRAY orthogonal to the beam, the resolution may be more influenced by the modulated nature of the fields. This may have less significance in a prostate plan than in a head & neck cancer case. It appeared that the GAFCHROMIC film measurement, although very good spatially, was giving false negative results in the sensitivity test. This is due to intrinsic film heterogeneity causing minor artifacts combined with processing uncertainty, which were enough to disrupt the gamma index analysis passing rate, and are some of the known limitations of film dosimetry.( , The EPID was found to be the most effective of the different devices. In this regard, if using a 2D‐ARRAY for planar field measurement, it would be advisable to consider performing a full merge acquisition when measuring very complex planar fields. In composite plan verification, the 2D‐ARRAY demonstrated good sensitivity to subtle MLC errors. There was a reasonable comparison between the gamma index distributions generated by the 2D‐ARRAY and GAFCHROMIC film. At 3%/3 mm, passing rates were similar between the two systems. The 2D‐ARRAY did exhibit a higher passing rate at 2%/2 mm compared to film. However, it was also interesting to see that the passing rates from the 2D‐ARRAY agreed better with the expected passing rates than GAFCHROMIC film, consistent with the static gantry planar test fields. The effect of picking a normalization point was found to be minimal, but there was a statistically significant small difference between normalizing based on a mean dose and a point in the measured distribution. Performing a normalization based on a mean dose would provide more consistency. Performing a merged lateral acquisition was visually comparable to a single acquisition, whereas slightly improved resolution was achieved by either merging longitudinally or performing a full merge of four acquisitions. This is because, in the case of a lateral shift, resolution is only gained along the MLC leaf path, whereas merging in the longitudinal direction perpendicular to the MLCs allowed more sampling of the leaf bank. The single acquisition was also comparable to the acquisition shifted 5 mm on the longitudinal axis, demonstrating no significant reduction in sensitivity to errors. The lack of difference between the different acquisition modes can be explained by the fact that on Varian linear accelerators, MLCs are arranged either side of the central axis. However, the 2D‐ARRAY is set up such that the central detector is aligned directly with the central axis. Therefore each chamber is always sampling two 5 mm MLC leaves simultaneously. A 5 mm offset in the longitudinal direction would result in every other MLC potentially being missed. It would, therefore, be recommended that if a longitudinal shift is required (e.g., for a long IMRT field where it is necessary to avoid irradiating the electronics), that the shift be made in whole centimeters. It also appears that calculating the expected dose on a homogeneous scan may be less sensitive to errors than calculating on a scan with the 2D‐ARRAY in situ. This is due to underestimation of the dose from the lateral and oblique directions when using the homogeneous scan. Various studies have been performed on the impact of errors in different detector systems.( , , , , ) At the time of writing, this was the first study to perform such a comprehensive evaluation of the PTW 2D‐ARRAY seven29 and OCTAVIUS II phantom combination, particularly for dynamic IMRT and RapidArc. These errors were designed to test the sensitivity and resolution of the 2D‐ARRAY. However, an interesting question to pose is: How likely are these errors in clinical practice? It is possible that an MLC motor becomes less efficient due to wear and tear, leading to a leaf travelling slower than expected and therefore lag behind the other leaves, in a way that would be similar to the errors simulated in this study. However, the tolerance on the MLC control software (usually about 2 mm) means that there are generally two possible scenarios: if possible, all the other leaves are slowed down and the dose rate decreased to compensate for the slower leaf, or an interlock would be activated. Software errors may also possibly lead to a mistranslation of the MLC positions. This study has demonstrated that in the event of MLC errors occurring, that the 2D‐ARRAY seven29 combined with the OCTAVIUS II phantom was a sensitive and reliable method of absolute IMRT or RapidArc verification. The suitability of the gamma index in detecting errors that may be clinically significant has been previously criticized. Alternatives have been suggested. , , , It should be noted that the gamma index has been widely accepted and is implemented into most analysis software. It provides the means for an efficient analysis, particularly important within a busy clinical environment. It is a useful parameter when one performs an audit of IMRT QA over the past months or years in order to explore trends and attempt to improve or streamline QA. It is equally important to understand the limitations of the index combined with the equipment in use. There is a general trend to use 3%/3 mm with a 95% passing threshold. In this study, it was found that in terms of passing rate, the criteria of 3%/3 mm masked errors caused by deliberate collimator rotation errors of 1° and 2°, as well as 2 mm MLC errors. The collimator errors introduced in the prostate and nodes RapidArc plan caused the rectal NTCP to increase by about 3%, which may be clinically significant. The 2 mm errors increased the rectal NTCP up to 0.9% in the prostate plans. These errors were detectable using passing criteria of 2%/2 mm with a 95% threshold or using passing criteria of 3%/2 mm with a 98% passing threshold. For this system, these may be the recommend criteria to be used in order to detect errors that may cause a clinically significant increase in NTCP. The errors introduced all increased local dose difference and, therefore, the TCP was increased. One limitation of this study would be that none of the errors resulted in a reduction of TCP. All the MLC errors were designed to increase the leaf gap. We did not create errors with closed leaf gaps as there was a risk of causing MLC collisions. Errors with narrower leaf gaps would have been expected to cause dose reductions. It was shown in Fig. 7 that the 2D‐ARRAY was able to detect the dose differences caused by the MLC errors, and the strong linear relationship between the expected dose difference and the measured difference suggests that dose reductions may have been detected equally. We suggest that the gamma index passing thresholds be used for guidance, but also be combined with a visual inspection of the gamma index distribution and calculation of the dose difference to assess whether there may be a clinical impact in failed regions.

IV. CONCLUSIONS

Tests have been employed to characterize the sensitivity and resolution of the PTW 2D‐ARRAY seven29 and OCTAVIUS II phantom combination. The 2D‐ARRAY in single acquisition mode was comparable to multiple acquisition modes and GAFCHROMIC film for composite IMRT and RapidArc plan verification. A gamma index criterion of 3%/3 mm may potentially mask clinically relevant errors. A criterion of 3%/2 mm with a passing threshold of 98% or 2%/2 mm with a passing threshold of 95% was found to be more sensitive in conjunction with an evaluation of the gamma index distribution. These tests have resulted in an understanding of the 2D‐ARRAY's limitations and increased confidence in its use for clinical IMRT and RapidArc verification.

ACKNOWLEDGMENTS

The authors would like to thank Yatman Tsang (Mount Vernon Hospital, UK) for assistance with the processing of the GAFCHROMIC EBT2 film.

37 in total

1. An analytical approach to acceptance criteria for quality assurance of Intensity Modulated Radiotherapy.

Authors: Antony Carver; Martyn Gilmore; Stephen Riley; Julien Uzan; Philip Mayles
Journal: Radiother Oncol Date: 2011-09-28 Impact factor: 6.280

2. Histogram reduction method for calculating complication probabilities for three-dimensional treatment planning evaluations.

Authors: G J Kutcher; C Burman; L Brewster; M Goitein; R Mohan
Journal: Int J Radiat Oncol Biol Phys Date: 1991-05-15 Impact factor: 7.038

Review 3. Radiation dose-volume effects of the urinary bladder.

Authors: Akila N Viswanathan; Ellen D Yorke; Lawrence B Marks; Patricia J Eifel; William U Shipley
Journal: Int J Radiat Oncol Biol Phys Date: 2010-03-01 Impact factor: 7.038

4. Detection of IMRT delivery errors using a quantitative 2D dosimetric verification system.

Authors: Nathan L Childress; Charles Bloch; R Allen White; Mohammad Salehpour; Isaac I Rosen
Journal: Med Phys Date: 2005-01 Impact factor: 4.071

5. Volumetric modulated arc therapy: IMRT in a single gantry arc.

Authors: Karl Otto
Journal: Med Phys Date: 2008-01 Impact factor: 4.071

6. Quality assurance of volumetric modulated arc therapy: evaluation and comparison of different dosimetric systems.

Authors: L Masi; F Casamassima; R Doro; P Francescon
Journal: Med Phys Date: 2011-02 Impact factor: 4.071

7. A concordance correlation coefficient to evaluate reproducibility.

Authors: L I Lin
Journal: Biometrics Date: 1989-03 Impact factor: 2.571

8. A model for calculating tumour control probability in radiotherapy including the effects of inhomogeneous distributions of dose and clonogenic cell density.

Authors: S Webb; A E Nahum
Journal: Phys Med Biol Date: 1993-06 Impact factor: 3.609

9. Sensitivity of a helical diode array device to delivery errors in IMRT treatment and establishment of tolerance level for pretreatment QA.

Authors: Feliciano García-Vicente; Virginia Fernández; Rocío Bermúdez; Alberto Gómez; Leopoldo Pérez; Almudena Zapatero; Juan J Torres
Journal: J Appl Clin Med Phys Date: 2012-01-05 Impact factor: 2.102

10. Evaluation of a new VMAT QA device, or the "X" and "O" array geometries.

Authors: Vladimir Feygelman; Geoffrey Zhang; Craig Stevens; Benjamin E Nelms
Journal: J Appl Clin Med Phys Date: 2011-01-31 Impact factor: 2.102

14 in total

1. Evaluation of the ability of three commercially available dosimeters to detect systematic delivery errors in step-and-shoot IMRT plans.

Authors: Alison Gray; Omemh Bawazeer; Sankar Arumugam; Philip Vial; Joseph Descallar; David Thwaites; Lois Holloway
Journal: Rep Pract Oncol Radiother Date: 2021-09-30

2. Feasibility of using glass-bead thermoluminescent dosimeters for radiotherapy treatment plan verification.

Authors: Shakardokht M Jafari; Tom J Jordan; Gail Distefano; David A Bradley; Nicholas M Spyrou; Andrew Nisbet; Catharine H Clark
Journal: Br J Radiol Date: 2015-08-10 Impact factor: 3.039

3. Radiotherapy dosimetry audit: three decades of improving standards and accuracy in UK clinical practice and trials.

Authors: Catharine H Clark; Edwin G A Aird; Steve Bolton; Elizabeth A Miles; Andrew Nisbet; Julia A D Snaith; Russell A S Thomas; Karen Venables; David I Thwaites
Journal: Br J Radiol Date: 2015-09-02 Impact factor: 3.039

4. Dosimetric impact of statistical uncertainty on Monte Carlo dose calculation algorithm in volumetric modulated arc therapy using Monaco TPS for three different clinical cases.

Authors: Mohandass Palanisamy; Khanna David; Manigandan Durai; Narendra Bhalla; Abhishek Puri
Journal: Rep Pract Oncol Radiother Date: 2019-02-18

5. Comparison of Dosimetric Performance among Commercial Quality Assurance Systems for Verifying Pretreatment Plans of Stereotactic Body Radiotherapy Using Flattening-Filter-Free Beams.

Authors: Jin Beom Chung; Sang Won Kang; Keun Yong Eom; Changhoon Song; Kyoung Sik Choi; Tae Suk Suh
Journal: J Korean Med Sci Date: 2016-11 Impact factor: 2.153

6. Performance Evaluation of Algorithms in Lung IMRT: A comparison of Monte Carlo, Pencil Beam, Superposition, Fast Superposition and Convolution Algorithms.

Authors: T Verma; N K Painuly; S P Mishra; M Shajahan; N Singh; M L B Bhatt; N Jamal; M C Pant
Journal: J Biomed Phys Eng Date: 2016-09-01

7. Verification of high-dose-rate brachytherapy treatment planning dose distribution using liquid-filled ionization chamber array.

Authors: A B Mohamed Yoosuf; Prakash Jeevanandam; Glenn Whitten; Geraldine Workman; Conor K McGarry
Journal: J Contemp Brachytherapy Date: 2018-04-30

8. Volumetric-modulated Arc Therapy Lung Stereotactic Body Radiation Therapy Dosimetric Quality Assurance: A Comparison between Radiochromic Film and Chamber Array.

Authors: Juan Fernando Mata Colodro; Alfredo Serna Berná; Vicente Puchades Puchades; David Ramos Amores; Miguel Alcaraz Baños
Journal: J Med Phys Date: 2017 Jul-Sep

9. Robotic radiosurgery system patient-specific QA for extracranial treatments using the planar ion chamber array and the cylindrical diode array.

Authors: Mu-Han Lin; Iavor Veltchev; Sion Koren; Charlie Ma; Jinsgeng Li
Journal: J Appl Clin Med Phys Date: 2015-07-08 Impact factor: 2.102

10. Practical implications for the quality assurance of modulated radiation therapy techniques using point detector arrays.

Authors: Steffi Kantz; Almut Troeller McDermott; Matthias Söhn; Sabine Reinhardt; Claus Belka; Katia Parodi; Michael Reiner
Journal: J Appl Clin Med Phys Date: 2017-08-30 Impact factor: 2.102