Literature DB >> 26637827

The European thoracic data quality project: An Aggregate Data Quality score to measure the quality of international multi-institutional databases.

Michele Salati1, Pierre-Emmanuel Falcoz2, Herbert Decaluwe3, Gaetano Rocco4, Dirk Van Raemdonck3, Gonzalo Varela5, Alessandro Brunelli6.   

Abstract

OBJECTIVES: To describe the methodology for the development of data quality metrics in multi-institutional databases, deriving a cumulative data quality score [Aggregate Data Quality score (ADQ)]. The ESTS database was used to create and apply the metrics. The Units contributing to the ESTS database were ranked for the quality of data uploaded using the ADQ.
METHODS: We analysed data obtained from 96 Units contributing with at least 100 major lung resections (January 2007 to December 2014). The Units were anonymized assigning a casual numeric code. The following metrics were developed for measuring the data quality of each Unit: (i) record Completeness (COM); rate of present variables on 16 expected variables for all the records uploaded [1 - ('null values'/total expected values for the Unit) × 100, the concept of 'null value' was defined for each variable]; (ii) record Reliability (REL); rate of consistent checks on 9 checks tested for all the records uploaded [1 - (valid controls/total possible controls for the Unit) × 100, specific reliability control queries were defined]. These two metrics were rescaled using the mean and standard deviation of the entire dataset and summed, obtaining: (iii) ADQ score: [COM rescaled + REL rescaled]; it measures the cumulative data quality of a given dataset. The ADQ was used to rank the contributors.
RESULTS: The COM of ESTS database contributors varied from 98.6 to 43% and the REL from 100 to 69%. Combining the rescaled metrics, the obtained ADQ ranged between 2.67 (highest data quality) and -7.85 (lowest data quality). Comparing the rating using just the COM value to the one obtained using the ADQ, 93% of Units changed their position. The major change was the drop of 66 positions considering the ADQ list.
CONCLUSIONS: We described a reproducible method for data quality assessment in clinical multi-institutional databases. The ADQ is a unique indicator able to describe data quality and to compare it among centres. It has the potential of objectively guiding projects of data quality management and improvement.
© The Author 2015. Published by Oxford University Press on behalf of the European Association for Cardio-Thoracic Surgery. All rights reserved.

Entities:  

Keywords:  Data quality; Database management systems; Quality indicators; Registry

Mesh:

Year:  2015        PMID: 26637827     DOI: 10.1093/ejcts/ezv385

Source DB:  PubMed          Journal:  Eur J Cardiothorac Surg        ISSN: 1010-7940            Impact factor:   4.191


  3 in total

Review 1.  Blazing new trails: initial efforts to create a joint Society of Thoracic Surgeons-European Society of Thoracic Surgeons (STS-ESTS) dataset.

Authors:  Christopher W Seder
Journal:  J Thorac Dis       Date:  2018-10       Impact factor: 2.895

2.  European Society of Thoracic Surgeons big data utilization-part 1: research interest for the thoracic community.

Authors:  Michele Salati
Journal:  J Thorac Dis       Date:  2018-10       Impact factor: 2.895

3.  Focus on specific disease-part 2: the European Society of Thoracic Surgery chest wall database.

Authors:  Benedetta Bedetti; Davide Patrini; Luca Bertolaccini; Roberto Crisci; Piergiorgio Solli; Joachim Schmidt; Marco Scarci
Journal:  J Thorac Dis       Date:  2018-10       Impact factor: 2.895

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.