William H Crown1. 1. OptumLabs,(®) Cambridge, MA, USA. Electronic address: william.crown@optum.com.
Abstract
The current focus on real world evidence (RWE) is occurring at a time when at least two major trends are converging. First, is the progress made in observational research design and methods over the past decade. Second, the development of numerous large observational healthcare databases around the world is creating repositories of improved data assets to support observational research. OBJECTIVE: This paper examines the implications of the improvements in observational methods and research design, as well as the growing availability of real world data for the quality of RWE. These developments have been very positive. On the other hand, unstructured data, such as medical notes, and the sparcity of data created by merging multiple data assets are not easily handled by traditional health services research statistical methods. In response, machine learning methods are gaining increased traction as potential tools for analyzing massive, complex datasets. CONCLUSIONS: Machine learning methods have traditionally been used for classification and prediction, rather than causal inference. The prediction capabilities of machine learning are valuable by themselves. However, using machine learning for causal inference is still evolving. Machine learning can be used for hypothesis generation, followed by the application of traditional causal methods. But relatively recent developments, such as targeted maximum likelihood methods, are directly integrating machine learning with causal inference.
The current focus on real world evidence (RWE) is occurring at a time when at least two major trends are converging. First, is the progress made in observational research design and methods over the past decade. Second, the development of numerous large observational healthcare databases around the world is creating repositories of improved data assets to support observational research. OBJECTIVE: This paper examines the implications of the improvements in observational methods and research design, as well as the growing availability of real world data for the quality of RWE. These developments have been very positive. On the other hand, unstructured data, such as medical notes, and the sparcity of data created by merging multiple data assets are not easily handled by traditional health services research statistical methods. In response, machine learning methods are gaining increased traction as potential tools for analyzing massive, complex datasets. CONCLUSIONS: Machine learning methods have traditionally been used for classification and prediction, rather than causal inference. The prediction capabilities of machine learning are valuable by themselves. However, using machine learning for causal inference is still evolving. Machine learning can be used for hypothesis generation, followed by the application of traditional causal methods. But relatively recent developments, such as targeted maximum likelihood methods, are directly integrating machine learning with causal inference.
Authors: Antoine Perpoil; Gael Grimandi; Stéphane Birklé; Jean-François Simonet; Anne Chiffoleau; François Bocquet Journal: Int J Environ Res Public Health Date: 2020-12-29 Impact factor: 3.390
Authors: Thomas Laurent; Jason Simeone; Ryohei Kuwatsuru; Takahiro Hirano; Sophie Graham; Ryozo Wakabayashi; Robert Phillips; Tatsuya Isomura Journal: Drugs Real World Outcomes Date: 2022-03-18
Authors: Alexander C Fanaroff; Robert M Califf; Robert A Harrington; Christopher B Granger; John J V McMurray; Manesh R Patel; Deepak L Bhatt; Stephan Windecker; Adrian F Hernandez; C Michael Gibson; John H Alexander; Renato D Lopes Journal: J Am Coll Cardiol Date: 2020-08-04 Impact factor: 24.094