| Literature DB >> 26107811 |
Lisa M Federer1, Ya-Ling Lu1, Douglas J Joubert1, Judith Welsh1, Barbara Brandys1.
Abstract
BACKGROUND: Significant efforts are underway within the biomedical research community to encourage sharing and reuse of research data in order to enhance research reproducibility and enable scientific discovery. While some technological challenges do exist, many of the barriers to sharing and reuse are social in nature, arising from researchers' concerns about and attitudes toward sharing their data. In addition, clinical and basic science researchers face their own unique sets of challenges to sharing data within their communities. This study investigates these differences in experiences with and perceptions about sharing data, as well as barriers to sharing among clinical and basic science researchers.Entities:
Mesh:
Year: 2015 PMID: 26107811 PMCID: PMC4481309 DOI: 10.1371/journal.pone.0129506
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sample OR contingency table.
| Outcome 1 | Outcome 2 | |
|---|---|---|
|
| a | c |
|
| b | d |
Respondent demographics.
| Position Category | |||||
|---|---|---|---|---|---|
| Administrative | Clinical | Scientific | Total | ||
|
| Contractor | 5 (2.9%) | 0 (0.0%) | 13 (7.6%) | 18 (10.6%) |
| Fellowship Appointment | 1 (0.6%) | 5 (2.9%) | 25 (14.7%) | 31 (18.2%) | |
| Guest Researcher | 0 (0.0%) | 1 (0.6%) | 1 (0.6%) | 2 (1.2%) | |
| NIH Employee | 27 (15.9%) | 15 (8.8%) | 73 (42.9%) | 115 (67.6%) | |
| Summer Student | 1 (0.6%) | 0 (0.0%) | 0 (0.0%) | 1 (0.6%) | |
| Volunteer | 1 (0.6%) | 1 (0.6%) | 1 (0.6%) | 3 (1.8%) | |
|
| 35 (20.6%) | 22 (12.9%) | 113 (66.5%) | 170 (100.0%) | |
Responses to “Locate and obtain other researchers’ shared data to use in your research, and clean or process it to meet your research needs.”
| Relevance to work | Level of expertise | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Scientific (n = 113) | Clinical (n = 22) | Total (n = 135) | Scientific (n = 113) | Clinical (n = 22) | Total (n = 135) | |||||||
| f | % | f | % | f | % | f | % | f | % | f | % | |
| not sure (0) | 2 | 1.77% | 1 | 4.55% | 3 | 2.22% | 2 | 1.77% | 1 | 4.55% | 3 | 2.22% |
| very low (1) | 0 | 0.00% | 1 | 4.55% | 1 | 0.74% | 14 | 12.39% | 1 | 4.55% | 15 | 11.11% |
| low (2) | 14 | 12.39% | 7 | 31.82% | 21 | 15.56% | 31 | 27.43% | 14 | 63.66% | 45 | 33.33% |
| medium (3) | 24 | 21.23% | 5 | 22.73% | 29 | 21.48% | 35 | 30.97% | 4 | 18.18% | 39 | 28.89% |
| high (4) | 37 | 32.74% | 5 | 22.73% | 42 | 31.11% | 23 | 20.35% | 2 | 9.09% | 25 | 18.52% |
| very high (5) | 36 | 31.85% | 3 | 13.64% | 39 | 28.89% | 8 | 7.08% | 0 | 0.00% | 8 | 5.93% |
Fig 1Comparison of self-rated relevance and expertise regarding reusing data among clinical and scientific research staff.
Comparison of initial analyses with worst-case scenario analyses.
|
|
|
|
|
| "not sure" excluded (OR = 4.2637, 95% CI 1.5012 to 12.1101, p = 0.0065) | Scientific | 97 | 14 |
| Clinical | 13 | 8 | |
| "not sure" included—worst-case scenario (OR = 3.4643, 95% CI 1.2529 to 9.5784, p = 0.0166) | Scientific | 97 | 16 |
| Clinical | 14 | 8 | |
|
|
|
|
|
| "not sure" excluded (OR = 3.6667, 95% CI 1.3225 to 10.1661, p = 0.0125) | Scientific | 66 | 45 |
| Clinical | 6 | 15 | |
| "not sure" included—worst-case scenario (OR = 3.0091, 95% CI 1.1384 to 7.9540, p = 0.0263) | Scientific | 66 | 47 |
| Clinical | 7 | 15 |
Responses to “Publish and deposit data in a repository suited to your research field.”
| Relevance to work | Level of expertise | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Scientific (n = 113) | Clinical (n = 22) | Total (n = 135) | Scientific (n = 113) | Clinical (n = 22) | Total (n = 135) | |||||||
| f | % | f | % | f | % | f | % | f | % | f | % | |
| not sure (0) | 7 | 6.19% | 2 | 9.09% | 9 | 6.67% | 6 | 5.31% | 1 | 4.55% | 7 | 5.19% |
| very low (1) | 3 | 2.65% | 2 | 9.09% | 5 | 3.70% | 13 | 11.50% | 2 | 9.09% | 15 | 11.11% |
| low (2) | 8 | 7.08% | 6 | 27.27% | 14 | 10.37% | 35 | 30.97% | 11 | 50.00% | 46 | 34.07% |
| medium (3) | 31 | 27.43% | 6 | 27.27% | 37 | 27.41% | 30 | 26.55% | 3 | 13.64% | 33 | 24.44% |
| high (4) | 30 | 26.55% | 3 | 13.64% | 33 | 24.43% | 20 | 17.70% | 4 | 18.18% | 24 | 17.78% |
| very high (5) | 34 | 30.09% | 3 | 13.64% | 37 | 27.41% | 9 | 7.96% | 1 | 4.55% | 10 | 7.41% |
Fig 2Comparison of self-rated relevance and expertise regarding sharing data in a repository among clinical and scientific research staff.
Comparison of initial analyses with worst-case scenario analyses.
|
|
|
|
|
| "not sure" excluded (OR = 5.7576, 95% CI 1.9341 to 17.1396, p = 0.0017) | Scientific | 95 | 11 |
| Clinical | 12 | 8 | |
| "not sure" included—worst-case scenario (OR = 3.0159, 95% CI 1.1048 to 8.2327, p = 0.0312) | Scientific | 95 | 18 |
| Clinical | 14 | 8 | |
|
|
|
|
|
| "not sure" excluded (OR = 1.9974, 95% CI 0.7651 to 5.2146, p = 0.1570) | Scientific | 59 | 48 |
| Clinical | 8 | 13 | |
| "not sure" included—worst-case scenario (OR = 1.5782, 95% CI 0.6248 to 3.9864, p = 0.3340) | Scientific | 59 | 54 |
| Clinical | 9 | 13 |
Responses to “Have you ever uploaded your data to a public repository?”
| Scientific (n = 113) | Clinical (n = 22) | Total (n = 135) | ||||
|---|---|---|---|---|---|---|
| f | % | f | % | f | % | |
| Yes | 47 | 41.59% | 6 | 27.27% | 53 | 39.26% |
| No | 66 | 58.41% | 16 | 72.73% | 82 | 60.74% |
Responses to “Have you ever shared data with another researcher, either informally or through a formal agreement, such as a Material Transfer Agreement or Data Sharing Agreement?”
| Scientific (n = 113) | Clinical (n = 22) | Total (n = 135) | ||||
|---|---|---|---|---|---|---|
| f | % | f | % | f | % | |
| Yes | 82 | 72.57% | 14 | 63.64% | 96 | 71.11% |
| No | 31 | 27.43% | 8 | 36.36% | 39 | 28.89% |
Responses to “What was your motivation for sharing your data? Please check all that apply.”
| Scientific (n = 93) | Clinical (n = 13) | Total (n = 106) | ||||
|---|---|---|---|---|---|---|
| f | % | f | % | f | % | |
| To collaborate with a researcher who requested the data | 66 | 71.73% | 7 | 50% | 73 | 68.87% |
| To advance science in a particular area | 59 | 63.13% | 9 | 64.28% | 68 | 64.15% |
| To assist a known colleague | 49 | 53.26% | 3 | 21.42% | 52 | 49.06% |
| To comply with a requirement to share as a condition of my grant funding or employment | 23 | 25% | 3 | 21.42% | 26 | 24.53% |
| To assist a junior researcher | 23 | 25% | 1 | 7.14% | 24 | 22.64% |
| To enhance my professional standing | 16 | 17.39% | 2 | 14.28% | 18 | 16.98% |
Odds ratio results for differences between scientific and clinical researchers regarding reasons for sharing.
| Reason for data sharing | Position | Yes | No |
|---|---|---|---|
| To collaborate with a researcher who requested the data (OR = 2.0952, 95% CI 0.6446 to 6.8105, p = 0.2188) | Scientific | 66 | 27 |
| Clinical | 7 | 6 | |
| To advance science in a particular area (OR = 0.7712, 95% CI 0.2207 to 2.6950, p = 0.6841) | Scientific | 59 | 34 |
| Clinical | 9 | 4 | |
| To assist a known colleague (OR = 3.7121, 95% CI 0.9595 to 14.3612, p = 0.0574 (Fisher's exact test p = 0.0732)) | Scientific | 49 | 44 |
| Clinical | 3 | 10 | |
| To comply with a requirement to share as a condition of my grant funding or employment (OR = 1.0952, 95% CI 0.2773 to 4.3254, p = 0.8967 (Fisher's exact test p = 1)) | Scientific | 23 | 70 |
| Clinical | 3 | 10 | |
| To assist a junior researcher (OR = 3.9429, 95% CI 0.4859 to 31.9963, p = 0.1990, (Fisher's exact test p = 0.289)) | Scientific | 23 | 70 |
| Clinical | 1 | 12 | |
| To enhance my professional standing (OR = 1.1429, 95% CI 0.2307 to 5.6607, p = 0.8701 (Fisher's exact test p = 1)) | Scientific | 16 | 77 |
| Clinical | 2 | 11 |
Responses to “How much time did you or your staff spend preparing your data so it would be ready to share or upload?”
| Scientific (n = 93) | Clinical (n = 13) | Total (n = 106) | ||||
|---|---|---|---|---|---|---|
| f | % | f | % | f | % | |
| 1–2 hours | 15 | 16.13% | 5 | 38.46% | 20 | 18.87% |
| 3–5 hours | 14 | 15.05% | 1 | 7.69% | 15 | 14.15% |
| 6–10 hours | 8 | 8.60% | 2 | 15.38% | 10 | 9.43% |
| More than 10 hours | 25 | 26.88% | 5 | 38.46% | 30 | 28.30% |
| None—my data was already in a form that could be shared | 31 | 33.33% | 0 | 0.00% | 31 | 29.25% |
| None—my data was not in a form that another researcher would understand, but I made no changes | 0 | 0.00% | 0 | 0.00% | 0 | 0.00% |
Responses to “Did you provide any additional materials or information besides the dataset?”
| Scientific (n = 93) | Clinical (n = 13) | Total (n = 106) | ||||
|---|---|---|---|---|---|---|
| Contextualizing information about the data | 45 | 48.39% | 5 | 38.46% | 50 | 47.17% |
| Codebook explaining variables | 29 | 31.18% | 5 | 38.46% | 34 | 32.08% |
| Code used with the data, such as R code | 24 | 25.81% | 3 | 23.08% | 27 | 25.47% |
| Software or program required to access or analyze the data | 25 | 26.88% | 1 | 7.69% | 26 | 24.53% |
| Nothing—the data required no additional materials to be useful to the requester | 24 | 25.81% | 6 | 46.15% | 30 | 28.30% |
| Nothing—the data required additional materials to be useful to the requester, but I did not send them | 0 | 0.00% | 0 | 0.00% | 0 | 0.00% |
Scientific group vs. Clinical group: supplementary materials they provided in data sharing.
| Shared supplementary materials | Position | Yes | No |
|---|---|---|---|
| Contextualizing information about the data (Fisher’s exact p = 0.5646 | Scientific | 45 | 48 |
| Clinical | 5 | 8 | |
| Codebook explaining variables (Fisher’s exact p = 0.7520) | Scientific | 29 | 64 |
| Clinical | 5 | 8 | |
| Code used with the data, such as R code (Fisher’s exact p = 1.0000) | Scientific | 24 | 69 |
| Clinical | 3 | 10 | |
| Software or program required to access or analyze the data (Fisher’s exact p = 0.1793) | Scientific | 25 | 68 |
| Clinical | 1 | 12 | |
| Nothing—the data required no additional materials to be useful to the requester (Fisher’s exact p = 0.1857) | Scientific | 24 | 69 |
| Clinical | 6 | 7 | |
| Nothing—the data required additional materials to be useful to the requester, but I did not send them (Fisher’s exact p = 1.0000) | Scientific | 0 | 93 |
| Clinical | 0 | 13 |
Responses to “If another researcher published or presented on results from your shared data, how were you acknowledged?”
| Scientific (n = 91) | Clinical (n = 13) | Total (n = 104) | ||||
|---|---|---|---|---|---|---|
| Co-authorship | 46 | 50.55% | 7 | 53.85% | 53 | 50.96% |
| Recognition in the acknowledgement section of the publication | 34 | 37.36% | 2 | 15.38% | 36 | 34.62% |
| Citation in bibliography | 22 | 24.18% | 1 | 7.69% | 23 | 22.12% |
| I received no acknowledgement | 14 | 15.38% | 2 | 15.38% | 16 | 15.38% |
| No publication arose from sharing data | 27 | 29.67% | 5 | 38.46% | 32 | 30.77% |
Scientific group vs. Clinical group: type of acknowledgement of data sharing.
| Type of acknowledgement | Position | Yes | No |
|---|---|---|---|
| Co-authorship (Fisher’s exact p = 1.0000) | Scientific | 46 | 45 |
| Clinical | 7 | 6 | |
| Recognition in the acknowledgement section of the publication (Fisher’s exact p = 0.2108) | Scientific | 34 | 57 |
| Clinical | 2 | 11 | |
| Citation in bibliography (Fisher’s exact p = 0.2885) | Scientific | 22 | 69 |
| Clinical | 1 | 12 | |
| I received no acknowledgement (Fisher’s exact p = 1.0000) | Scientific | 14 | 77 |
| Clinical | 2 | 11 | |
| No publication arose from sharing data (Fisher’s exact p = 0.5324) | Scientific | 27 | 64 |
| Clinical | 5 | 8 |
Responses to “You have indicated that you have never shared your data nor uploaded to a repository. Please indicate the reason(s) for not sharing your data. Please check all that apply.”
| Scientific (n = 15) | Clinical (n = 5) | Total (n = 20) | ||||
|---|---|---|---|---|---|---|
| I would be willing to share my data, but I haven't had an opportunity to do so | 8 | 53% | 1 | 20% | 9 | 45% |
| My data contains personally identifiable information and sharing would compromise my subjects' privacy | 2 | 13% | 5 | 100% | 7 | 35% |
| I am prohibited from sharing my data for some reason other than subject privacy | 2 | 13% | 4 | 80% | 6 | 30% |
| I don't know any repositories that accept the kind of data I produce | 7 | 47% | 2 | 40% | 9 | 45% |
| It's too difficult to prepare my data and documentation for sharing with others | 0 | 0% | 0 | 0% | 0 | 0% |
| I don't know how to prepare my data and documentation for sharing with others | 6 | 40% | 0 | 0% | 6 | 30% |
| Repositories' requirements for format or description of data are too difficult to meet | 0 | 0% | 0 | 0% | 0 | 0% |
| I don't feel I would get credit for sharing my data | 1 | 7% | 0 | 0% | 1 | 5% |
| I put in a great deal of time and effort to gather my data, and I don't want to give it away | 0 | 0% | 1 | 20% | 1 | 5% |
| I'm concerned that another researcher could beat me to publication if I share my data | 1 | 7% | 0 | 0% | 1 | 5% |
| My data has commercial value, so I don't want to give it away for free | 0 | 0% | 0 | 0% | 0 | 0% |
| I don't think anyone else would have any reason to use my data | 4 | 27% | 0 | 0% | 4 | 20% |
| It isn't customary to share data in my research field | 4 | 27% | 3 | 60% | 7 | 35% |
| I'm concerned another researcher might find errors in my data | 0 | 0% | 0 | 0% | 0 | 0% |
| I'm concerned another researcher might misinterpret my data | 1 | 7% | 2 | 40% | 3 | 15% |