| Literature DB >> 33851061 |
Brenda W Gillespie1, Louis-Philippe Laurin2, Dawn Zinsser3, Richard Lafayette4, Maddalena Marasa5, Scott E Wenderfer6, Suzanne Vento7, Caroline Poulton8, Laura Barisoni9, Jarcy Zee3, Margaret Helmuth3, Francesca Lugani10, Margret Kamel11, Peg Hill-Callahan3, Stephen M Hewitt12, Laura H Mariani13, William E Smoyer14, Larry A Greenbaum15, Debbie S Gipson16, Bruce M Robinson17, Ali G Gharavi5, Lisa M Guay-Woodford18, Howard Trachtman7.
Abstract
BACKGROUND: High data quality is of crucial importance to the integrity of research projects. In the conduct of multi-center observational cohort studies with increasing types and quantities of data, maintaining data quality is challenging, with few published guidelines.Entities:
Keywords: Data quality; Kidney disease; Observational studies; Quality metrics; Site performance
Year: 2021 PMID: 33851061 PMCID: PMC8039553 DOI: 10.1016/j.conctc.2021.100749
Source DB: PubMed Journal: Contemp Clin Trials Commun ISSN: 2451-8654
CureGN Committees with data quality as part of their mission.
| Committee | Mission |
|---|---|
| Data Quality | Develop and implement strategies to improve data quality, oversee development and modification of the study Site Report Card, provide data to the Central Review Committee on site metrics, and work with Data Coordinating Center staff on new data queries. |
| Recruitment and Retention | Monitor recruitment and loss to follow-up, identify challenges and mitigation strategies, generate newsletters and other patient support materials, and guide metric development for recruitment and retention reports, as well as relevant metrics for the site Report Cards. |
| Central Review | Work with Participating Clinical Center (PCC) coordinators who have sites with quality metrics that are below a defined level to: develop a remediation plan to improve site performance; use ideas from all sites to achieve the best possible outcomes; allow flexibility in approaches to sites that have differing problems or issues; and develop metrics that may indicate a need for formal review by the study leadership for retention versus removal of a site from the consortium. |
| Ancillary Studies | Review ancillary study proposals for feasibility and scientific merit, with input from the Data Coordinating Center regarding sufficient biospecimen sample size, data availability and quality to answer the research question. |
| Biospecimens | Provide expert input on collection, storage, and optimal use of the non-renewable biosample resource: meet study aims while shepherding this resource for the community; review the current biosample inventory; provide input for proposed ancillary studies requesting CureGN biosamples; review study biospecimen performance; and recommend protocol and procedure changes related to biospecimens. |
| Pathology | Provide morphology-based inclusion and exclusion criteria, set guidelines for monitoring completeness and de-identification of pathology reports, and develop a protocol defining core scoring elements and establishing a process for core scoring. |
| Digital Pathology Repository (DPR) | Provide monitoring and metrics on proper shipment of pathology material to and from the Image Coordinating Center at the National Institutes of Health (NIH); evaluate completeness and de-identification of the pathology material, image quality, and adequacy of metadata. |
CureGN Quarterly Report Cards. Designed to monitor site compliance for major study elements, with consortium averages for comparison. Cells are color-coded based on compliance, defined as excellent (≥90%, green), acceptable (70%–89%, yellow), and poor (<70%, red). Arrows indicate change in performance compared with the previous quarter (better = green up-arrow; worse = red down-arrow). For details, see Supplemental Table S1.
Fig. 1Data quality improvement over time with quarterly queries. Boxplots show the distribution over six query cycles (quarters) of data query volumes (average number of queries per subject), shown for six categories of variables queried over 66 study sites. The number of variables queried has increased during the study and was 382 at cycle 6. Most sites had joined the study by cycle 1. Values were capped at the maxima shown in each graph. Median values in each cycle are shown as a horizontal line across each box. Each box spans the 25th to 75th percentiles. Medication queries were sent every other quarter. *Other Clinical Information includes family and birth history, physical exam (e.g., blood pressure, height, weight), and disease course (proteinuria relapse or remission).
Fig. 2Variability across study sites (boxplots) for 10 Report Card metrics at cycle 5. Sites below the reference line at 70% have 'poor compliance’ for the given metric. Comparison across the metrics allows the Data Quality Committee (DQC) to assess needs and recommend priorities within the network. CRF=Case Report Form; PRO=Patient-Reported Outcome. Medians = horizontal line across each box; 25th to 75th percentiles = bottom to top of each box. Metrics are defined as follows: Visit Completion (#visits occurred/#visits expected where visit window has closed); Visit CRF (#visit CRFs completed/#visit CRFs expected); Patient or Proxy PRO (#PROs completed/#PROs expected); Any blood at visit (#visits with any blood collected/#in-person visits); All annual blood (#tubes annual expected blood collected/#patient-years in study); Annual RNA or DNA (#annual samples collected/#expected); Any DNA (ever) (#consenting patients with at least one DNA sample collected ever/# expected); Any urine at visit (#visits with any urine sample collected [spot, First Morning Void, timed]/#visits).
Fig. 3Percent of study sites with acceptable or better (>70%) achievement for 10 Report Card metrics over 6 query cycles, for sites with all six Report Card cycles (n = 62). Most metrics show increases in achievement over time. Network-wide efforts addressed metrics where larger numbers of sites underperformed. See Fig. 2 caption for metric definitions.
Fig. 4To query or not to query: Logic checking for internal consistency of proteinuria data, reported relapse/remission events, and medication changes for patients with FSGS, MCD, or MN. Patient-specific plots show lab values (in mg urine protein per mg urine creatinine, UPCR; or Dipstick levels), self-reports of relapse with medication change (red lines) or remission (blue dotted lines). Relapse is defined, when possible, as UPCR≥3.0 or Dipstick≥3+, and remission is defined as UPCR<0.3 or Dipstick = Negative. The plot above shows data for a patient with MCD who has several inconsistent or potentially missing data elements: Of the three self-reported relapses, none were confirmed by UPCR≥3.0 or Dipstick≥3+ in the data entered. The first two occurred while on immunosuppression treatment (although possibly during steroid tapering); the last relapse was associated with starting a beta-blocker but not any immunosuppression medication. Nonetheless, the patient achieved self-reported remission thereafter. There were two UPCR-confirmed relapses between days 365 and 730, after immunosuppression had been stopped, with no subsequent data on remission or further immunosuppression medication. Finally, there were three self-reported remissions, only one of which was confirmed as UPCR<0.3. Queries to the site coordinator may resolve some of these issues. FSGS = focal segmental glomerulosclerosis; MCD = minimal change disease; MN = membranous nephropathy; UPCR = urinary protein/creatinine ratio. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)