BACKGROUND: Many schemes collect and aggregate data from primary care computer systems into large databases. These data are then used for market and academic research. How the data are aggregated, cleaned and processed is usually opaque. Making the method transparent allows researchers to compare methods, and allows users of the output to better understand the strengths and weaknesses of the data. OBJECTIVES: To define the stages of the process of aggregating, processing and cleaning clinical data from multiple data sources. METHODS: Identify errors in design, collection, staging, integration and analysis. RESULTS: An eight-step process was defined: (1) Design, (2) Data entry, (3) Extraction, (4) Migration, (5) Integration, (6) Cleaning, (7) Processing, and (8) Analysis. CONCLUSIONS: This eight-step method provides a taxonomy that enables researchers to compare their methods of data processing and aggregation.
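As an illustration only, the eight stages listed in the results could be written down as an ordered enumeration, so that a scheme can state which stages of its method it has described. This is a minimal sketch and not the authors' implementation: the names `Stage` and `undocumented_stages`, and the example scheme, are hypothetical.

```python
from enum import IntEnum


class Stage(IntEnum):
    """The eight stages named in the abstract, in the order given there."""
    DESIGN = 1
    DATA_ENTRY = 2
    EXTRACTION = 3
    MIGRATION = 4
    INTEGRATION = 5
    CLEANING = 6
    PROCESSING = 7
    ANALYSIS = 8


def undocumented_stages(documented):
    """Return the stages a scheme has not described, in process order."""
    described = set(documented)
    return [stage for stage in Stage if stage not in described]


# Hypothetical scheme that documents only its extraction and cleaning methods.
example = {Stage.EXTRACTION, Stage.CLEANING}
missing = undocumented_stages(example)
```

Listing the gaps stage by stage is one way two schemes' methods might be compared against the same taxonomy.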
Authors: Mohammad A Tahir; Olga Dmitrieva; Simon de Lusignan; Jeremy van Vlymen; Tom Chan; Ramez Golmohamad; Kevin Harris; Charles Tomson; Nicola Thomas; Hugh Gallagher Journal: BMC Fam Pract Date: 2011-08-05 Impact factor: 2.497
Authors: Simon de Lusignan; Rob Navarro; Tom Chan; Glenys Parry; Kim Dent-Brown; Tony Kendrick Journal: BMC Med Inform Decis Mak Date: 2011-10-13 Impact factor: 2.796
Authors: Simon de Lusignan; Hugh Gallagher; Tom Chan; Nicki Thomas; Jeremy van Vlymen; Michael Nation; Neerja Jain; Aumran Tahir; Elizabeth du Bois; Iain Crinson; Nigel Hague; Fiona Reid; Kevin Harris Journal: Implement Sci Date: 2009-07-14 Impact factor: 7.327
Authors: Olga Dmitrieva; Simon de Lusignan; Iain C Macdougall; Hugh Gallagher; Charles Tomson; Kevin Harris; Terry Desombre; David Goldsmith Journal: BMC Nephrol Date: 2013-01-25 Impact factor: 2.388
Authors: Imran Rafi; Susmita Chowdhury; Tom Chan; Ibrahim Jubber; Mohammad Tahir; Simon de Lusignan Journal: BMC Fam Pract Date: 2013-07-24 Impact factor: 2.497
Authors: Simon de Lusignan; Hugh Gallagher; Simon Jones; Tom Chan; Jeremy van Vlymen; Aumran Tahir; Nicola Thomas; Neerja Jain; Olga Dmitrieva; Imran Rafi; Andrew McGovern; Kevin Harris Journal: Kidney Int Date: 2013-03-27 Impact factor: 10.612