| Literature DB >> 28320446 |
M Kourime1,2, J Bryce3, J Jiang3, R Nixon3, M Rodie3, S F Ahmed3.
Abstract
BACKGROUND: With the proliferation of rare disease registries, there is a need for registries to undergo an assessment of their quality against agreed standards to ensure their long-term sustainability and acceptability.This study was performed to evaluate the I-DSD and I-CAH Registries and identify their strengths and weaknesses.Entities:
Keywords: Congenital adrenal hyperplasia; Data quality; Disorders of sex development; Research quality; Validity
Mesh:
Year: 2017 PMID: 28320446 PMCID: PMC5360059 DOI: 10.1186/s13023-017-0603-7
Source DB: PubMed Journal: Orphanet J Rare Dis ISSN: 1750-1172 Impact factor: 4.123
Fig. 1The assessment model of the I-DSD and I-CAH Registries
Data quality indicators of Registry design that affect the quality of Research derived from the Registry according to the Agency for Healthcare Research and Quality
| Parameter | Criteria of good Quality practice for Research |
|---|---|
| Purpose of the Registry | -Research questions clearly defined |
| Study Design | Choice of study design which is more efficient for addressing the research questions: Cohort, Case-control or case-cohort. |
| Data elements | Relevance to the objectives of the registry |
| Data sources (good quality means the use of the appropriate data sources to collect relevant data) | Clinician:accurate and specific clinical data |
| Population definition | - Patient selection: inclusion and exclusion criteriarelevant to the purpose |
| Registry size and duration | Calculation of target sample size and definition of the duration of enrollment and follow-up should consider the aims of the registry, the desired precision of information sought, and the hypotheses to be tested. |
CDISC Clinical Data Interchange Standards Consortium
CDASH Clinical Data Acquisition Standards Harmonization
BRIDG Biomedical Research Integrated Domain Group
areasonably feasible to collect, minimal response burden
bdoes each data element truthfully measure what it is supposed to?
ccan the instrument yield replicate metrics or estimates?
Criteria of evaluation of quality of operational aspects [4]
| Quality indicators | Profile of High Quality Rare Disease Registries |
|---|---|
| Ethical and Legal issues | -Protocols are approved by ethics committee. |
| Access to data and security | -Data are made available anonymously to public institutions, public authorities, patient associations and private institutions/citizens, centres of expertise within the country and worldwide |
| Communication of activities | Communication to data providers, public health policy makers and patient associations through websites, newsletters, institutional bulletins, scientific meetings and journals resulting in peer review by scientific journals and scientific meetings |
| Governance | A main governing board composed by internal and external experts has a good oversight and governance mechanisms dealing with financial, administrative, ethical and legal issues, research objectives, database content, data access and use, communication and coordination of all stakeholders. |
| Sustainability | Established plans to ensure durable funding and long-term sustainability |
| Data quality assurance procedures | -Case definition for the Rare Disease of interest |
The 6 Data Quality Dimensions defined by DAMA UK Working Group for data quality assessment
| Data quality dimensions | Definition | Measure in the Registry |
|---|---|---|
| Completeness | The proportion of stored data against the potential of “100% complete”. | - Optional variables in Core data |
| Uniqueness | No thing will be recorded more than once based upon how that thing is identified. | Percentage of duplicated cases by measuring data item against itself. A case is presumed duplicated when there is 100% similarity in core data and more than 90% similarity in non-core data between duplicates. |
| Timeliness | The degree to which data represent reality from the required point in time or how current or up to date the data are at the time of release. | Timeframe between the age at first presentation and the upload date in the Registry |
| Validity | Data are valid if it conforms to the syntax (format, type, range) of data definition. | The percentage of data that are not conform to the syntax in the longitudinal module in the ICAH Registry (Blood pressure) |
| Accuracy | The degree to which data correctly describes the “real world” object or event being described. | The accuracy of data in PAISa cases in the Registry was verified against original data available in templates completed by centres |
| Consistency | The absence of difference, when comparing two or more representations of a thing against a definition. | Consistency between the number of adverse events episodesb and sick days in the longitudinal module of the I-CAH Registry |
(When the data provider enter a number of adverse events, a table with a number of rows corresponding to the number of adverse event episodes is displayed and in each row we need to complete the number of sick days in each adverse event episode. The total number of sick days is automatically calculated. Obviously, the number of adverse events should not exceed the number of sick days, otherwise, there is an inconsistency between the two variables)
a PAIS Partial androgen insensitivity syndrome
b Adverse events: are the number of separate episodes of illness requiring extra dose of steroid
Fig. 2Description of the populations in the I-DSD Registry
Completeness of data in the I-DSD and I-CAH Registries in core data, according to disorders and in optional data by centres
| Data | Scope | Variables | Completeness | Completeness | Completeness |
|---|---|---|---|---|---|
| Optional Core data | All cases in | n = 2 | - | - | 99% |
| All data | Data in each disorder in all cases in the | n = 61 | -Other (n1 = 144) | -Disorder of gonadal development (n1 = 418) | -Disorder of androgen action (n1 = 474) |
| Optional data | Data by centre in the I-DSD Registry | n = 55 | n3 = 17 | n3 = 9 | n3 = 20 |
| Optional data | Data by centre in the I-CAH Registry | n = 55 | n3 = 14 | n3 = 10 | n3 = 7 |
| Optional data | Longitudinal module in the I-CAH Registry | n = 47 | n3 = 2 | n3 = 2 | n3 = 11 |
-Total Number of variables in the non longitudinal module; which is a common module between the two Registries; is 65 with 4 identifiers so, we assessed 61 variables with 6 variables which are mandatory to complete and 55 which are optional
-The core data mean the data that need completion in order to consider a case for a study. It contains the 4 identifiers, 3 mandatory variables (year of birth, Original Sex Assigned, Karyotype, and disorder type) and 2 optional variables (age at presentation and actual diagnosis)
-Total number of variables in the longitudinal module in the I-CAH Registry is 47
n number of variables
n1 number of cases
n2 median number of cases per centre (range)
n3 number of centres
Fig. 3The rate of addition of new cases per year in the I-DSD and I-CAH Registries. n: Number of new cases added per year. N: Total number of cases in the Registry at the time of analysis
Fig. 4The rate of enrolment of new centres per year in the I-DSD and I-CAH Registries. n: Number of new centres enrolled per year
Fig. 5The rate of enrolment of European and non-European users in the I-DSD and I-CAH Registries. n: Number of new users. %: Percentage of new non European users
Fig. 6The evolution of the level of data sharing in the I-DSD and I-CAH Registries. n: Number of new cases per year shared at a certain level (international, national….). N: Total Number of cases in the Registry shared at that level at the time of analysis