PURPOSE: With emergence of clinical outcomes databases as tools utilized routinely within institutions, comes need for software tools to support automated statistical analysis of these large data sets and intrainstitutional exchange from independent federated databases to support data pooling. In this paper, the authors present a design approach and analysis methodology that addresses both issues. METHODS: A software application was constructed to automate analysis of patient outcomes data using a wide range of statistical metrics, by combining use of C#.Net and R code. The accuracy and speed of the code was evaluated using benchmark data sets. RESULTS: The approach provides data needed to evaluate combinations of statistical measurements for ability to identify patterns of interest in the data. Through application of the tools to a benchmark data set for dose-response threshold and to SBRT lung data sets, an algorithm was developed that uses receiver operator characteristic curves to identify a threshold value and combines use of contingency tables, Fisher exact tests, Welch t-tests, and Kolmogorov-Smirnov tests to filter the large data set to identify values demonstrating dose-response. Kullback-Leibler divergences were used to provide additional confirmation. CONCLUSIONS: The work demonstrates the viability of the design approach and the software tool for analysis of large data sets.
PURPOSE: With emergence of clinical outcomes databases as tools utilized routinely within institutions, comes need for software tools to support automated statistical analysis of these large data sets and intrainstitutional exchange from independent federated databases to support data pooling. In this paper, the authors present a design approach and analysis methodology that addresses both issues. METHODS: A software application was constructed to automate analysis of patient outcomes data using a wide range of statistical metrics, by combining use of C#.Net and R code. The accuracy and speed of the code was evaluated using benchmark data sets. RESULTS: The approach provides data needed to evaluate combinations of statistical measurements for ability to identify patterns of interest in the data. Through application of the tools to a benchmark data set for dose-response threshold and to SBRT lung data sets, an algorithm was developed that uses receiver operator characteristic curves to identify a threshold value and combines use of contingency tables, Fisher exact tests, Welch t-tests, and Kolmogorov-Smirnov tests to filter the large data set to identify values demonstrating dose-response. Kullback-Leibler divergences were used to provide additional confirmation. CONCLUSIONS: The work demonstrates the viability of the design approach and the software tool for analysis of large data sets.
Authors: Reshma Jagsi; Justin E Bekelman; Otis W Brawley; Joseph O Deasy; Quynh-Thu Le; Jeff M Michalski; Benjamin Movsas; Charles R Thomas; Colleen A Lawton; Theodore S Lawrence; Stephen M Hahn Journal: Int J Radiat Oncol Biol Phys Date: 2012-03-19 Impact factor: 7.038
Authors: Jatinder R Palta; Jason A Efstathiou; Justin E Bekelman; Sasa Mutic; Carl R Bogardus; Todd R McNutt; Peter E Gabriel; Colleen A Lawton; Anthony L Zietman; Christopher M Rose Journal: Pract Radiat Oncol Date: 2011-07-14
Authors: R Alfredo Siochi; Peter Balter; Charles D Bloch; Harry S Bushe; Charles S Mayo; Bruce H Curran; Wenzheng Feng; George C Kagadis; Thomas H Kirby; Robin L Stern Journal: J Appl Clin Med Phys Date: 2009-09-02 Impact factor: 2.102
Authors: Charles S Mayo; Marc L Kessler; Avraham Eisbruch; Grant Weyburne; Mary Feng; James A Hayman; Shruti Jolly; Issam El Naqa; Jean M Moran; Martha M Matuszak; Carlos J Anderson; Lynn P Holevinski; Daniel L McShan; Sue M Merkel; Sherry L Machnak; Theodore S Lawrence; Randall K Ten Haken Journal: Adv Radiat Oncol Date: 2016-10-13