| Literature DB >> 29623192 |
Christian Ohmann1, Steve Canham2, Rita Banzi3, Wolfgang Kuchinke4, Serena Battaglia5.
Abstract
Background: In recent years, a cultural change in the handling of data from research has resulted in the strong promotion of a culture of openness and increased sharing of data. In the area of clinical trials, sharing of individual participant data involves a complex set of processes and the interaction of many actors and actions. Individual services/tools to support data sharing are available, but what is missing is a detailed, structured and comprehensive list of processes/subprocesses involved and tools/services needed.Entities:
Keywords: business process model; clinical trial; data sharing; generic framework; individual participant data (IPD); process
Year: 2018 PMID: 29623192 PMCID: PMC5861517 DOI: 10.12688/f1000research.13789.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Overview on the main processes in sharing of IPD.
Listing of processes, actors and possible services/tools in sharing of IPD from clinical trials.
| Process | Subprocess | Main Actors | Possible Services/Tools |
|---|---|---|---|
| 1. Preparation for data sharing,
| |||
| 1.1 Learn about individual
| 1.1.1 Learn about policies, requirements,
| Investigators, trials unit
[ | Education service (web pages, videos,
|
| 1.1.2 Become aware of repositories available for
| Investigators, trials unit heads, operational
| Web based information sources on
| |
| 1.2 Develop local SOPs and
| 1.2.1 Develop procedures governing data
| Trials unit heads, QA staff, operational managers,
| Example SOPs and proformas |
| 1.2.2 Develop procedures and libraries to
| Trials unit operational managers, statisticians,
| Links to standards and associated
| |
| 1.3 Clarify and integrate own
| 1.3.1 Clarify / agree with relevant university,
| Trials unit heads, QA staff, operational managers,
| |
| 1.3.2 Integrate any institutional requirements into
| Trials unit heads, QA staff, operational managers | ||
| 2. Plan for data sharing,
| |||
| 2.1 Decide the strategy for data
| 2.1.1 Explore options for data sharing
| Sponsors, with trial management team and
| Checklist of issues that need to be
|
| 2.1.2 Check funder/sponsor requirements for
| Trial management team, funder and / or sponsor
| Classification of legal responsibilities of
| |
| 2.1.3 Decide the strategy and specific actions
| Sponsors, with trial management team | Checklist of issues that need to be
| |
| 2.2 Document the strategy for
| 2.2.1 Incorporate data sharing summary in
| Trial management team | Example protocol sections |
| 2.2.2 Incorporate data sharing details within the
| Trial management team, data management /
| Example DMP sections with supporting
| |
| 2.2.3 Incorporate data sharing summary within
| Trial management team | Example registry data sections | |
| 2.3 Incorporate information
| 2.3.1 Summarise and explain data sharing plan
| Trial management team, patient groups and
| Guidance on legislation framework
|
| 2.3.2 Include request for broad consent for data
| Trial management team, patient groups and
| Demonstration material, templates,
| |
| 2.4 Check and align data
| 2.4.1 Ensure any plans to publish collaborators‘
| Trial management team | Examples of possible issues (e.g. with
|
| 2.4.2 Ensure all collaborators have contributed to
| Trial management team | Examples of possible processes, policies,
| |
| 2.5 Ensure that data and
| Trial management team | Links to standards and associated
| |
| 3. Preparation of data for sharing, after data collected | |||
| 3.1 Confirm strategy for
| 3.1.1 Decide if (further) pseudonymisation or
| Trial data management and IT staff | Guidance on interpretation of legal
|
| 3.1.2 Assess risk of re-identification with existing
| Trial data management and IT staff, specialist de-
| De-identification/anonymisation service for
| |
| 3.2 Carry out strategy for data
| 3.2.1 De-identify, and pseudo-anonymise or
| Trial data management and IT staff, specialist de-
| De-identification/anonymisation service for
|
| 3.2.2 Check that analyses still function for de-
| Statisticians and data managers | ||
| 3.2.3 Generate / transform, and check,
| Trial data management and IT staff | ||
| 3.2.4 Select file formats for long term storage of
| Trial data management and It staff | File formats recognized as standard | |
| 3.3 Document data preparation
| 3.3.1 Assess and document risk of re-
| Trial data management and IT staff, specialist de-
| De-identification/anonymisation service for
|
| 3.3.2 Incorporate record of data preparation and
| Trial data management and IT staff | Metadata scheme for describing de-
| |
| 4. Transferring data objects to external repository | |||
| 4.1 Select repository (or confirm
| 4.1.1 Explore repository features, management,
| Sponsors with trial management team | Data repository identification service
|
| 4.2 Transfer the datasets
| 4.2.1 Agree on access regime, data
| Sponsors with trial management team | Checklists to support data transfer
|
| 4.2.2 Agree on responsibilities for generating
| Sponsors with trial management team | Checklists to support data transfer
| |
| 4.2.3 Draw up and agree data transfer
| Sponsors with trial management team | Tools for generating data transfer
| |
| 4.2.4 Apply discoverability and provenance
| Trial data management and IT staff and/or
| Metadata schemas for data object
| |
| 5. Repository data and access management | |||
| 5.1 Maintain highly granular
| 5.1.1 Maintain access control that allows
| Repository managers | Authentication and authorisation tools,
|
| 5.1.2 Maintain 2-factor authentication, as
| Repository managers | Authentication and authorisation tools with
| |
| 5.2 Maintain mechanisms to set
| 5.2.1 Provide web based forms that allow users
| Repository managers | Authentication and authorisation tools,
|
| 5.2.2 Provide appropriate log-in pages, with
| Repository managers | Authentication and authorisation tools | |
| 5.3 Where there is a demand,
| 5.3.1 Allow datasets (including, possibly some
| Repository managers, analysis environment
| The analysis environment itself, including
|
| 5.3.2 Provide viewing, analysis and recording
| Repository managers, analysis environment
| Analysis tools and services, logging tools | |
| 5.3.3 Provide workflow recording tools and
| Repository managers, analysis environment
| Workflow recording tools, logging tools | |
| 5.4 Supply discovery data
| 5.4.1 Liaise with metadata repositories to agree
| Repository managers | Schema for discovery metadata |
| 5.4.2 Allow regular (e.g. nightly) harvesting of
| Repository managers | API for making metadata available from
| |
| 5.5 Facilitate data access
| 5.5.1 Establish a Data Access Committee, for
| Repository managers, Data Access Committee
| Guidelines for terms of reference /
|
| 5.5.2 For data that requires them, create and
| Repository managers | Template and example data request
| |
| 5.5.3 For data that requires them, create
| Repository managers | Template and example data use
| |
| 5.6 Provide usage and status
| 5.6.1 Provide regular (e.g. quarterly) reports
| Repository managers | Report services maintained by repository
|
| 5.6.2 Provide regular (e.g. annual) reports on
| Repository managers | Report services maintained by repository
| |
| 6 Managing access to individual participant data and associated data objects | |||
| 6.1 Manage direct responses
| 6.1.1 Decide upon the possibility, in legal terms,
| Sponsors and trial management team | Guidance on interpretation of legal
|
| 6.1.2 Assess the reasonableness of the request
| Sponsors and trial management team | ||
| 6.1.3 Assess the costs of de-identifying the data,
| Sponsors and trial management team | Data on costs in data preparation
| |
| 6.1.4 Make a final decision as to whether to
| Sponsors and trial management team | ||
| 6.1.5 Draw up a data use agreement and transfer
| Sponsors and trial management team | Example data use agreements | |
| 6.2 Manage access to data in
| 6.2.1 Repository makes appropriate request
| Repository managers | Available forms on line (see 5.5) |
| 6.2.2 Request forms completed and submitted
| Secondary users | ||
| 6.2.3 (If stipulated in data transfer agreement)
| Sponsors or advisory panel | ||
| 6.2.4 Decision to allow request made, by Data
| Sponsors or Data Access Committee, or
| Guidelines for terms of reference /
| |
| 6.2.5 If positive decision, data use agreement
| Sponsors or advisory panel, repository
| Example data use agreements | |
| 6.2.6 Data access arranged after liaison with
| Sponsors or advisory panel, repository managers | Pipeline for quick processing of access
| |
| 6.2.7 Access request and decision documented | Sponsors or advisory panel, repository managers | Recording systems for request and
| |
| 7. Discovering the data | |||
| 7.1 Agree a common discovery
| Repository managers, metadata repository
| The metadata scheme itself | |
| 7.2 Agree and implement an
| 7.2.1 Develop, and cost a mechanism for
| Repository managers, metadata repository
| Existing ID supply mechanisms, especially
|
| 7.2.2 Implement the process for generating
| Trial teams, repository and metadata repository
| The ID generation mechanism itself | |
| 7.3 Agree and implement an ID
| 7.3.1 Use existing (multiple) study identifiers as
| Repository managers, metadata repository
| Existing IDs, from registries, sponsors,
|
| 7.3.2 Attempt to develop an ID generation and /
| Repository, metadata repository and registry
| Existing ID supply mechanisms, especially
| |
| 7.4 Collect metadata together
| 7.4.1 Collect existing metadata samples and
| Metadata repository managers | The metadata scheme from 7.1, the ID
|
| 7.4.2 Maintain the metadata by arranging regular
| Metadata repository managers | The metadata scheme from 7.1, the
| |
| 7.4.3 Develop a single portal for searching
| Metadata repository managers | ||
| 7.4.4 Federate additional metadata sources
| Metadata repository managers | ||
| 7.5 Search for the data objects
| Secondary users | Search tools using study identifiers, name,
| |
| 8. Publishing results of re-use | |||
| 8.1 Carry out secondary use
| 8.1.1 Publish re-analysis, preferably open (e.g.
| Secondary users | |
| 8.1.2 If successful, ensure proper citation of data
| Secondary users | Agreed schemes for citation and credit
| |
| 8.1.3 Whether or not published in a journal,
| Secondary users, repository managers | ||
| 8.1.4 Apply metadata to new data objects,
| Secondary users | Metadata scheme for discoverability | |
| 9. Monitoring data sharing | |||
| 9.1 Monitor data sharing activity | 9.1.1 Gather and disseminate data on data
| Repository managers | Web site on which to display collected
|
| 9.1.2 Gather and disseminate data on reasons
| Repository managers | Web site on which to display collected
| |
| 9.1.3 Gather and disseminate data on data
| Repository managers | Web site on which to display collected
| |
| 9.2 Monitor output and
| 9.2.1 Attempt to monitor output of data sharing
| Publishers, funders | Web site on which to display collected
|
| 9.2.2 Attempt to monitor level and outcome of
| Publishers, funders, individual researchers | Published papers | |
| 9.2.3 Attempt to monitor changing attitudes
| Individual researchers | Published papers | |
1 Data objects: any discrete packages of data in an electronic form – whether that data is textual, numerical, a structured dataset, an image, film clip, (etc.) in form. They are each a file, as that term is used within computer systems, and are named, at least within their source file system. In the context of clinical research and data sharing, data objects can include electronic forms of protocols, journal papers, patient consent forms, analysis plans, and any other documents associated with the study, as well as datasets representing different portions and types of the data generated, and the metadata describing that data.
2 SOP: Standard Operating Procedure – A controlled document, explicitly versioned, reviewed and approved, that outlines the roles and responsibilities involved in a particular task and / or workflow, and the subtasks, deliverables and associated documentation required. SOPs may be supplemented by more detailed ‘work instructions’, that may relate to using one or more specific systems.
3 Authentication: The process of ensuring that a person or system that is trying to access a system is who they say (it says) they are. With a person, authentication is by provision of one or more of something only they should know (e.g. a password), or should have (e.g. a card or fob), or can show (e.g. fingerprint, iris pattern). With a system it is more often by provision of a secret token (in effect a machine password), often derived from public key cryptography.
4 Two factor authentication: The simultaneous use, by a person, of two of the three authentication methods described above.
5 Authorisation: The process of giving an authenticated entity the rights to access particular subsets of data and/or to carry out particular functions within a system. It is usually carried out by assigning user entities to roles and to groups that together define the access allowed.
Classification and description of possible tools/services to support processes in sharing IPD from clinical trials.
| Type of service/tool | Description/comments | Reference to
|
|---|---|---|
|
| ||
| Providing general
| Collection of relevant resources about data sharing
| 1.1.1 |
|
| ||
| List of general resources to
| Annotated links to web sites that provide (for example) …
| 1.1.2 |
| List of resources to support
| Annotated links to
| 1.2.2, 2.5 |
|
| ||
| Example documents
| • Example SOPs, (see
[ | 1.2.1, 2.1,
|
| Example data sharing
| Examples of possible
| 2.2.1, 2.2.2,
|
| Example data sharing
| Examples / templates of possible
| 4.2.1, 4.2.3 |
| Example data sharing
| Examples of
| 5.5.3, 6.1.5 |
|
| ||
| De-identification /
| There are four possible services here –
| 3.1.2, 3.2.1,
|
| Descriptive metadata
| To be useful (easily searchable, comparable etc.) the descriptive metadata of the data needs to be in a standard
| 3.2.3, 3.3.2 |
| Assessment / certification
| Provision of a set of standards, that can be used to assess the suitability of any repository as a location for data object
| 4.1.1, 4.2.1 |
| An ID assignment
| An ID (e.g. doi) generation service is required for all stored data objects. | 7.2.1, 7.2.2 |
| A common pipeline for
| With the possibility of many different data repositories emerging storing clinical datasets, there is potential advantage
| 6.2.6 |
| Recording and reporting
| Reports that could be provided by repositories include
| 5.6.1, 6.2.7, 9.1.1,
|
| Provision of a prototype
| A metadata repository, (or a
| 7.4.1, 7.4.2, 7.4.3,
|
| Service for provision of a
| Based on tools to provide an analysis environment for in-situ work (see below). | 5.3.1, 5.3.2, 5.3.3 |
|
| ||
| The development of a
| Agreement is needed on a common discovery metadata standard that can link data objects to studies and that can
| 4.2.4, 5.4.1, 5.4.2,
|
| The development of an
| There needs to be a universally recognised scheme that will allow fair credit for the re-use of data, in terms of
| 8.1.2 |
| Legal and regulatory
| As the legal and regulatory environment continues to evolve, there will be an ongoing need to clarify the legal
| 2.1.2, 2.3.1, 3.1.1,
|
| Checklist to decide the
| Checklist of issues to be considered of data sharing, with supporting material, option descriptions | 2.1.1, 2.1.2, 2.1.3 |
| Checklist to support
| Checklist to support development of data transfer agreement/data use agreement | 4.2.1, 4.2.2 |
| Manual to establish boards
| Manual for advisory panel/board | 5.5.1, 6.2.4 |
|
| ||
| Tools to support the
| A tool is required to allow the easy application of the metadata schema used to characterize data objects, ideally by
| 4.2.4, 8.1.4 |
| Tools for de-identification /
| See de-identification / anonymisation service for datasets above, | 3.1.2, 3.2.1,
|
| Authentication and
| Highly granular access is needed (at the level of individual users / individual data objects) to support the variety of
| 5.1.1, 5.1.2,
|
| Provide an analysis
| Interest has been expressed in a mechanism that allows data to be examined, re-analysed, aggregated etc. without
| 5.3.1, 5.3.2, 5.3.3 |
| APIs to access repository
| When discovery data is not (or has not been) directly transferred to a central repository using the tools described
| 5.4.1, 5.4.2 |
| Tools for generation of data
| Software tools supporting the development of data transfer/data use agreements. | 4.2.3, 6.1.5, 6.2.5 |