Monitor Unit Verification for Small Fields

Monitor Unit Verification for Small Fields Patrick Higgins, Ph.D University of Minnesota Department of Radiation Oncology October 10, 2013

The Issues: How do we verify the monitor units calculated by the planning system are accurate when we have multiple issues of small fields and heterogeneous media? Do we have to run IMRT-like phantom tests (which avoid the problem of heterogeneity) to verify MU s? Can we do some kind of hand calculation to verify MU s? If so, how do we approximate heterogeneity effects? What level of check accuracy is reasonable to avoid catastrophe?

What do people do? As best as we have been able to tell, they do either: 1. Simple hand calcs for confirmation. But MuCheck and Rad Calc use deff for heterogeneous calcs. This is maybe the simplest of 1-d calculations and loses accuracy as the depth (and effective depth) increases. or, 2. Measure dose using Mapcheck or its equivalent (with or without point measurement) as is done for IMRT qa. But now we replace a hard problem with a much easier test.

What do we do? Historically, at Minnesota we have relied on hand calculations as being the most accurate, least biased method for monitor unit definition. The planning system MU s were considered a secondary check. Prior to Pinnacle (i.e. superposition/convolution) heterogeneity corrections were rarely done, except for lung. Even then we tried to be sure the central axis minimally overlapped lung so that simple hand calculations could still be done. Only in the last 5 or 6 years have we become comfortable with the predictive quality of Pinnacle for heterogeneous media and only after we convinced ourselves that we could do confirmative hand calculations for 6 MV to within about ±5%. Another great impacting factor on us was the introduction of SBRT which forced us to find a way to check doses in the presence of significant heterogeneity.

Planning System Accuracy The best we can hope for beam models is probably 2-5% accuracy when we include heterogeneities (AAPM report 85). For systems like Pinnacle that accuracy is further confounded by the subjectivity of the modeling process. Beam models (and modelers!) are further stretched in the limits of small fields where both the input data is more prone to error and heterogeneity modeling is not included. For IMRT plans we are forced to transfer heterogeneous plans to homogeneous phantoms in which we can do direct measurements to confirm monitor units and bypass the heterogeneity issue. For static field SBRT cases some places use the IMRT QA approach. We felt that, at least for static fields, we should be able to find a way to verify Pinnacle MU s without resorting to special (albeit flawed) measurements.

Planning system verification vs. Hand calculation As part of an M.S. project (Rishik Saxena) we measured the validity and range of application of several 1-d heterogeneity corrections that could easily be incorporated into a hand calculation for planning system MU verification. We built composite lung-soft tissue and bone-soft tissue phantoms to measure doses and compared against Pinnacle, Eclipse pencil beam and 3 1-d methods over a range of field sizes and energies: 1. Effective depth (used by MuCheck and other vendor supplied programs). 2. Effective attenuation (originally used in diagnostic applications). 3. Ratio of TMR s. All 3 methods use effective depth in some form or other. For this presentation we are going to concentrate on effective attenuation.

Effective Depth This is the simplest possible estimate of the effect of electron density on depth for the purpose of calculating dose in homogeneous media of density different from water. It is so primitive that it is not discussed in textbooks as a correction model. deff is the effective pathlength or depth in a medium based on the weighted electron densities of the elements in the medium. This has been used to adjust measurement depths over a small range to make up for the difference in density in water-replacement plastics for calibration purposes. It has been used in place of the physical depth in some vendor-supplied monitor unit calculation programs.

Effective attenuation An effective attenuation coefficient is used to estimate the correction to TERMA due to differences in physical and effective depths: ICF = exp( µ ( d eff d)) (Inhomogeneity Correction Factor) µ is the effective attenuation coefficient of water for a particular beam quality. (For this we have just used the linear attenuation coefficient for the peak photon energy.) d is the physical depth d eff is the effective depth In this method the homogeneously calculated pdd or TMR is corrected by the ICF.

Ratio of tissue-air ratios (RTAR) This method was one of the most commonly used methods in older commercial treatment planning systems and as a method of spot checking dose. Other names have been given to this method. The 1-d version is given by ICF = TAR( d eff, W ) TAR( d, W ) One might expect the inclusion of field size effects would be an advantage over simple attenuation differences, but as you will see we found no particular improvement in accuracy.

Model Evaluation In order to evaluate dose calculation accuracy for our planning systems we built a lung/soft tissue phantom. Measurements were made 1 at a number of depths (EDR2 film) and in interfaces for a range of field sizes and energies. These measurements have been used for some time for baseline testing of our planning systems. Using this data we could also evaluate the viability of different correction methods. 1 Saxena R. and Higgins PD, Measurement and Evaluation of Inhomogeneity Corrections and Monitor Unit Verification for Treatment Planning. Medical Dosimetry, 35:19-27, 2010.

Lung Phantom for Testing Solid Water 3 cm Cork Solid Water Cork Solid Water 3.0 cm 5.0 cm 7.0 cm 7.5 cm 8.5 cm 10.5 cm 11.5 cm 13.5 cm 15.5 cm 17.5 cm 4 cm 4.5 cm 4 cm 5 cm

4x4 25.0 20.0 15.0 D-effective %-difference 10.0 5.0 0.0-5.0-10.0 0 5 10 15 20 Depth (cm) Attenuation corr Eclipse Eq. TAR Pinnacle 6 MV Pdd calculation vs. 10x10 measurement 20.0 15.0 %-difference 10.0 5.0 0.0-5.0 0 5 10 15 20 D-effective Attenuation corr Eclipse Eq. TAR Pinnacle -10.0 Depth (cm)

4x4 20.0 15.0 %-difference 10.0 5.0 0.0 0 5 10 15 20 Attenuation corr Eclipse Eq. TAR Pinnacle 10 MV -5.0-10.0 Depth (cm) 10x10 Pdd calculation vs. measurement 20.0 15.0 %-difference 10.0 5.0 0.0 0 5 10 15 20 Attenuation corr Eclipse Eq. TAR Pinnacle -5.0-10.0 Depth (cm)

25.0 4x4 %-difference 20.0 15.0 10.0 5.0 0.0 0 5 10 15 20-5.0 Depth (cm) 15.0 10x10 Attenuation corr Eclipse Eq. TAR Pinnacle 18 MV Pdd calculation vs. measurement 10.0 %-difference 5.0 0.0-5.0 0 5 10 15 20 Attenuation corr Eclipse Eq. TAR Pinnacle -10.0 Depth (cm)

How good can Pinnacle be? Pdd vs. Depth, Varian 25 MV, field size 2x2 120 100 Pdd (%) 80 60 40 Measured Pinnacle 20 0 0 2 4 6 8 10 12 14 16 18 20 Depth (cm)

Measurement Summary 1. All of the 1-d correction errors are in the direction of overestimating doses. 2. Pinnacle appears accurate to within 5% for field sizes as small as 2x2 for energies of 6-25 MV. 3. For large field sizes (>10x10) all of the simple 1-d correction methods work fairly well (within 5%). 4. For small fields (4x4) and low energy (6 MV) the attenuation correction appeared promising, at least in a simple phantom setting.

Clinical Small Field MU Evaluation Phantom tests confirmed the accuracy of Pinnacle. The next step was to apply the simple 1-d corrections for real test cases. For our first 26 SBRT lung cases, we compared Pinnacle MU settings with hand monitor unit calculations, based on the physical depth, but corrected using the effective attenuation ICF factor. 2 2 Higgins PD, Adolfson T, Cho LC and Saxena R. Monitor Unit Checking in Heterogeneous SBRT Treatment Planning; Medical Dosimetry, Vol. 36 (3): 255-263, 2011

SBRT #19 Comparison of Pinnacle with ICFcorrected Point calc. Field d deff FS Eff FS ICF Pinnacle MU Point MU Point/ICF Delta a 4.7 4.2 4.5 3.9 1.014 121 120 118.3 2.2% b 19.1 13.7 4.4 3.9 1.161 161 188 161.9-0.5% c 23.2 12.8 5.1 4.4 1.334 189 273 204.7-8.3% d 16.5 5.8 4.6 4.1 1.345 183 259 192.6-5.2% e 22.4 20.9 4.9 4.4 1.042 238 241 231.2 2.9% f 14.8 10 4.3 3.8 1.142 204 234 204.9-0.4% g 8.8 7.9 4.8 4.1 1.025 148 149 145.3 1.8% h 6.7 5.4 4.7 4.2 1.037 135 136 131.2 2.8% I 3.9 3.5 5.3 4.5 1.011 123 121 119.7 2.7% j 3.7 3.4 4.6 4.1 1.008 111 109 108.1 2.6% ICF = exp{ µ w ( d eff d )} µ water ( 6 x ) = 0.0277 / cm

MU comparison with Pinnacle vs. ICF ICF Pinnacle Calculation Comparison: Attenuation Correction 1.7 1.6 1.5 1.4 1.3 1.2 1.1 1.0 Most outliers are correlated with very low lung density. 0.9-20.0% -15.0% -10.0% -5.0% 0.0% 5.0% 10.0% 15.0% 20.0% % Diff calc vs Pinnacle MU n Ave ICF St Dev Max Min 260 1.115 0.116 1.679 0.984 Ave St Dev Max Min 1.80% 4.10% 14.70% -12.90% Pass number % 15% 230 100.0% 10% 219 95.2% 5% 182 79.1%

MU comparison with Pinnacle vs. Field Size Pinnacle Calculation Comparison: Field Size 10.0 9.0 8.0 Effective Field Size 7.0 6.0 5.0 4.0 3.0 2.0 1.0 0.0-20.0% -15.0% -10.0% -5.0% 0.0% 5.0% 10.0% 15.0% 20.0% % Diff calc vs Pinnacle MU

Ratio of TMR Comparison of MU s with Pinnacle vs. ICF 1.8 1.7 1.6 1.5 Pinnacle Calculation Comparison: Ratio of TMR's ICF 1.4 1.3 1.2 1.1 1.0 0.9-10.0% -5.0% 0.0% 5.0% 10.0% 15.0% 20.0% 25.0% 30.0% 35.0% 40.0% % Diff calc vs Pinnacle MU

SBRT Test Summary 1. Check calculations using effective attenuation to approximate the effect of heterogeneities on lung plans appear to be accurate to ±5% for 6 MV and field sizes 3x3 and larger. 2. For those fields where the check is more than 10% different from Pinnacle, we review the planned fields for potential problems, increasing the field size, if needed, to obtain better agreement. Even though we have the highest confidence in Pinnacle, we may not know if the discrepancy is due to the lousy 1-d test, or to modeling error in small field extrapolation. So it is better to err on the cautious side.

SBRT Test Summary 3. We have found that conditions where the point calculation disagrees the most with Pinnacle are for the situation of very low density lung tissue (COPD) and in areas of bone-lung interfaces where Pinnacle is less accurate. This led us to do the following study of density effects, relative to this simple point calculation check.

Effects of Lung and Target Density The amount of heterogeneity in the scattering environment certainly contributes to the treatment planning problem. Low density lungs require larger PTV volumes or block margins to achieve the same coverage as higher density lungs. We set up a water phantom model in Pinnacle with a spherical lung insert and varied the size and density of a target as well as the lung density. We then exercised our simple correction by calculating MU s vs. Pinnacle MU s for a single field ranging in size to just matching the target to a few cm larger.

Effects of Lung and Target Density 6 MV Field Unit density 3.5 cm Lung 15 cm PTV

Effects of Lung and Target Density 1.15 2 cm Target Density 1.0 MuCalc MU/Pinnacle MU 1.10 1.05 1.00 0.95 Lung density 0.25 Lung density 0.2 Lung density 0.15 Lung density 0.1 Lung density 0.05 Lung density 0.0 0.90 2.0 2.5 3.0 3.5 4.0 4.5 5.0 Field Size ICF-corrected MU vs. Pinnacle MU Single Small Field of Different Widths: 5-10% overestimates.

Effects of Lung and Target Density 1.15 1.10 2 cm Target Density 0.6 MuCalc MU/Pinnacle MU 1.05 1.00 0.95 0.90 0.85 0.80 0.75 2.0 2.5 3.0 3.5 4.0 4.5 5.0 Field Size Lung density 0.25 Lung density 0.2 Lung density 0.15 Lung density 0.1 Lung density 0.05 Lung density 0.0 ICF-Corrected MU vs. Pinnacle for Typical Target Density: ±5% Range.

Effects of Lung and Target Density 1.15 1.10 2 cm Target Density 0.3 MuCalc MU/Pinnacle MU 1.05 1.00 0.95 0.90 0.85 0.80 0.75 0.70 2.0 2.5 3.0 3.5 4.0 4.5 5.0 Field Size Lung density 0.25 Lung density 0.2 Lung density 0.15 Lung density 0.1 Lung density 0.05 Lung density 0.0 ICF-Corrected MU vs. Pinnacle for Typical Target Density: 10+% underestimate depending on field size.

Effects of Lung and Target Density 1.15 1.10 4 cm Target Density 0.3 MuCalc MU/Pinnacle MU 1.05 1.00 0.95 0.90 0.85 0.80 0.75 0.70 4.0 4.5 5.0 5.5 6.0 Field Size Lung density 0.25 Lung density 0.2 Lung density 0.15 Lung density 0.1 Lung density 0.05 Lung density 0.0 ICF-Corrected MU vs. Pinnacle for Typical Target Density: Match improves with target and field size.

Conclusions 1. For low energy (6 MV) a 1-d attenuation-based correction can be used to verify the planned MU s for a well-tested planning system, avoiding the need for phantom test measurements and giving us some level of confidence that there is no catastrophic problem in the plan. 2. Small targets in SBRT lung cases pose additional problems for planning systems depending on how diffuse the targets are and on the surrounding lung density. It is better to increase the field margin to satisfy a simple 1-d test than use very tight margins and prescribe to low isodose lines. 3. Even though it may be argued that hand calculations are much inferior to a full, well-tested beam model, experience has shown that the risk of not performing some independent test is not worth taking.

Thank you!

1.6 cm Target Density 0.6 1.00 Ratio 0.8 cm Lat:to:Center Dose 0.95 0.90 0.85 0.80 0.75 0.70 0.65 0.60 0.55 0.50 1.5 2.0 2.5 3.0 3.5 4.0 4.5 Lung density 0.25 Lung density 0.2 Lung density 0.15 Lung density 0.1 Lung density 0.05 Lung density 0.0 Dose homogeneity: Ratio of dose at lateral edges of targets to the dose at the center. Field Size Ratio 1.5 cm Lat:to:Center Dose 3 cm Target Density 0.1 1.00 0.95 0.90 0.85 0.80 0.75 0.70 0.65 0.60 0.55 0.50 3.0 3.5 4.0 4.5 5.0 5.5 Lung density 0.25 Lung density 0.15 Lung density 0.1 Lung density 0.05 Lung density 0.0 Effect of field size and densities of target and lungs on dose coverage. Field Size

Estimates of Uncertainty (in terms of one standard deviation) in absolute dose in the patient for megavoltage photons (AAPM Report 85). Source of Uncertainties Uncertainty at Uncertainty in Present (%) Future (%) Dose at the calibration point in water 2.5 1.0 Additional uncertainty for other points 0.6 0.3 Beam Monitor stability 1.0 0.5 Beam flatness 1.5 0.5 Patient data 1.5 1.0 Patient set up and organ motion 2.5 2.0 Overall (excluding dose calculation) 4.3 2.5 Dose calculation algorithm 1.0 / 2.0 / 3.0 / 5.0 1.0 / 2.0 / 3.0 TOTAL 4.4 / 4.7 / 5.2 / 6.6 2.7 / 3.2 / 3.9

Table 7. Categorization of different inhomogeneity correction algorithms according to the level of anatomy sampled (1D or 3D) and the inclusion or exclusion of electron transport (AAPM Report #85) TERMA Local energy deposition (No electron transport) DOSE Non-local energy deposition(electron transport) 1d Category 1 1.1 Linear attenuation 1.2 Ratio of TAR (RTAR) (Equivalent path length, effective SSD, isodose shift) 1.3 Power law (Batho) 3d Category 1 2.1 Equivalent TAR (ETAR) 2.2 Differential SAR (DSAR) 2.3 Delta volume (DVOL) 2.5 3D Beam Subtraction Method Category 3 3.1 Convolution (pencil beam) 3.2 FFT techniques Category 4 4.1 Superposition/Convolution 4.2 Monte Carlo 2.4 Differential TAR (dtar)

1-d Methods for Evaluating Monitor Units 6 MV 2300cd pdd 100 90 80 70 60 50 40 30 pdd (uncorrected) pdd (deff) Measured ICF-corrected pdd Pinnacle 20 0 5 10 15 20 25 30 depth (cm)