Carol Tenopir, Suzie Allard, Kimberly Douglass, Arsev Umur Aydinoglu, Lei Wu, Eleanor Read, Maribeth Manoff, Mike Frame.
Abstract
BACKGROUND: Scientific research in the 21st century is more data intensive and collaborative than in the past. It is important to study the data practices of researchers: data accessibility, discovery, re-use, preservation and, particularly, data sharing. Data sharing is a valuable part of the scientific method, allowing for verification of results and extending research from prior results.
Year: 2011 PMID: 21738610 PMCID: PMC3126798 DOI: 10.1371/journal.pone.0021101
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1. Joint Information Systems Committee (JISC), Stages of the research and data lifecycle.
Primary work sector.
| | Frequency | Percent |
| Academic | 1058 | 80.5 |
| Government | 167 | 12.7 |
| Commercial | 34 | 2.6 |
| Non-profit | 35 | 2.7 |
| Other | 21 | 1.6 |
Subject discipline.
| | Frequency | Percent |
| environmental sciences & ecology | 475 | 36.1 |
| social sciences | 204 | 15.5 |
| biology | 181 | 13.7 |
| physical sciences | 158 | 12.0 |
| computer science/engineering | 118 | 9.0 |
| other | 98 | 7.4 |
| atmospheric science | 52 | 3.9 |
| medicine | 31 | 2.4 |
Data access.
| | Frequency | Percent |
| An organization-specific system | 351 | 38.5% |
| Long-term Ecological Research Network | 292 | 32.1% |
| Other data access | 246 | 27.0% |
| A Distributed Active-Archive Center | 173 | 19.0% |
| A Global Biodiversity Information Facility | 73 | 8.0% |
| National Biological Information Infrastructure | 70 | 7.7% |
| National Ecological Observatory Network | 64 | 7.0% |
| International Long-term Ecological Research Network | 58 | 6.4% |
| Taiwan Ecological Research Network | 7 | 0.8% |
| South African Environmental Observation Network | 6 | 0.7% |
Data types.
| | Responses | Percent |
| Experimental | 711 | 54.6% |
| Observational | 632 | 48.5% |
| Data Models | 499 | 38.3% |
| Biotic Surveys | 446 | 34.3% |
| Abiotic Surveys | 442 | 33.9% |
| Remote-Sensed Abiotic | 358 | 27.5% |
| Remote-Sensed Biotic | 264 | 20.3% |
| Social Science Surveys | 251 | 19.3% |
| Interviews | 195 | 15.0% |
| Other | 80 | 6.1% |
Data issues.
| I am satisfied with the process for… | Agree Strongly | Agree Somewhat | Neither Agree Nor Disagree | Disagree Somewhat | Disagree Strongly |
| … collecting my research data. | 410 (31.6%) | 626 (48.2%) | 139 (10.7%) | 112 (8.6%) | 11 (0.8%) |
| … searching for my own data. | 298 (23.2%) | 600 (46.7%) | 230 (17.9%) | 141 (11%) | 16 (1.2%) |
| … cataloging/describing my data. | 226 (18%) | 526 (41.8%) | 273 (21.7%) | 194 (15.4%) | 40 (3.2%) |
| … storing my data during the life of the project (short-term). | 376 (29.2%) | 559 (43.5%) | 189 (14.7%) | 143 (11.1%) | 19 (1.5%) |
| … storing my data beyond the life of the project (long-term). | 206 (16%) | 369 (28.6%) | 271 (21%) | 334 (25.9%) | 111 (8.6%) |
| … analyzing my data. | 383 (29.7%) | 598 (46.4%) | 177 (13.7%) | 118 (9.1%) | 14 (1.1%) |
Data tools.
| | Agree Strongly | Agree Somewhat | Neither Agree Nor Disagree | Disagree Somewhat | Disagree Strongly |
| I am satisfied with the tools for preparing metadata. | 75 (6%) | 252 (20%) | 526 (41.7%) | 289 (22.9%) | 118 (9.4%) |
| I am satisfied with the tools for preparing my documentation. | 155 (12.1%) | 413 (32.3%) | 409 (32%) | 231 (18.1%) | 71 (5.6%) |
Organizational involvement in data issues.
| My organization or project… | Agree Strongly | Agree Somewhat | Neither Agree Nor Disagree | Disagree Somewhat | Disagree Strongly |
| … has a formal established process for managing data during the life of the project (short-term). | 221 (17.2%) | 330 (25.6%) | 183 (14.2%) | 257 (20%) | 297 (23.1%) |
| … has a formal established process for storing data beyond the life of the project (long-term). | 200 (15.6%) | 294 (22.9%) | 191 (14.9%) | 271 (21.1%) | 328 (25.5%) |
| … provides the necessary tools and technical support for data management during the life of the project (short-term). | 192 (15%) | 374 (29.2%) | 269 (21%) | 221 (17.3%) | 224 (17.5%) |
| … provides the necessary tools and technical support for data management beyond the life of the project (long-term). | 155 (12.1%) | 294 (22.9%) | 232 (18%) | 204 (23.6%) | 301 (23.4%) |
| … provides training on best practices for data management. | 75 (5.9%) | 199 (15.5%) | 253 (19.8%) | 339 (26.5%) | 414 (32.3%) |
| … provides the necessary funds to support data management during the life of a research project (short-term). | 115 (9%) | 275 (21.4%) | 273 (21.3%) | 296 (23.1%) | 325 (25.3%) |
| … provides the necessary funds to support data management beyond the life of the project (long-term). | 85 (6.6%) | 194 (15.1%) | 249 (19.4%) | 314 (24.4%) | 443 (34.5%) |
Data reuse.
| | Agree Strongly | Agree Somewhat | Neither Agree Nor Disagree | Disagree Somewhat | Disagree Strongly |
| Lack of access to data generated by other researchers or institutions is a major impediment to progress in science. | 353 (27.2%) | 520 (40%) | 230 (17.7%) | 149 (11.5%) | 48 (3.7%) |
| Lack of access to data generated by other researchers or institutions has restricted my ability to answer scientific questions. | 228 (17.6%) | 422 (32.5%) | 297 (22.9%) | 238 (18.4%) | 112 (8.6%) |
| Data may be misinterpreted due to complexity of the data. | 383 (29.6%) | 590 (45.6%) | 217 (16.8%) | 77 (6%) | 26 (2%) |
| Data may be misinterpreted due to poor quality of the data. | 379 (29.4%) | 540 (41.8%) | 232 (18%) | 107 (8.3%) | 33 (2.6%) |
| Data may be used in other ways than intended. | 410 (31.8%) | 539 (41.8%) | 249 (19.3%) | 68 (5.3%) | 23 (1.8%) |
Metadata standards.
| | Responses | Percent |
| No metadata standard | 676 | 56.1% |
| Metadata Standardized Within My Lab | 266 | 22.1% |
| International Standards Organization | 97 | 8.0% |
| Open GIS | 96 | 8.0% |
| Ecological Metadata Language | 95 | 7.9% |
| Federal Geographic Data Committee | 95 | 7.9% |
| Other Metadata | 82 | 6.8% |
| Dublin Core | 26 | 2.2% |
| Darwin Core | 21 | 1.7% |
| Directory Interchange Format | 12 | 1.0% |
Data sharing practices.
| | None | Some | Most | All | Total |
| On My Organization's Website | 495 (45.9%) | 378 (35.1%) | 143 (13.3%) | 62 (5.8%) | |
| On the Principal Investigator's Website | 553 (56.7%) | 303 (31.0%) | 87 (8.9%) | 33 (3.4%) | |
| Through a National Network | 470 (46.4%) | 331 (32.6%) | 153 (15.1%) | 60 (5.9%) | |
| Through a Regional Network | 579 (64.7%) | 238 (26.6%) | 58 (6.5%) | 20 (2.2%) | |
| Through a Global Network | 550 (57.6%) | 242 (25.3%) | 111 (11.6%) | 52 (5.4%) | |
| On My Personal Website | 668 (72.7%) | 173 (18.8%) | 49 (5.3%) | 29 (3.2%) | |
| Other | 370 (65.3%) | 94 (16.6%) | 47 (8.3%) | 56 (9.9%) | |
Data sharing.
| | Agree Strongly | Agree Somewhat | Neither Agree Nor Disagree | Disagree Somewhat | Disagree Strongly |
| I share my data with others. | 418 (32.3%) | 551 (42.6%) | 199 (15.4%) | 95 (7.3%) | 30 (2.3%) |
| Others can access my data easily. | 150 (11.6%) | 317 (24.6%) | 310 (24%) | 307 (23.8%) | 207 (16%) |
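A common way to read five-point agreement tables such as the one above is to combine the two "agree" categories into a single share. The following is a minimal sketch in Python that uses only the counts printed in the table. The helper name `agreement_share` is hypothetical, and treating each row sum as the number of respondents to that item is an assumption.

```python
def agreement_share(counts):
    """Share of respondents in the first two (agree) categories of a five-point item."""
    return (counts[0] + counts[1]) / sum(counts)


# Counts in table order: agree strongly, agree somewhat, neither,
# disagree somewhat, disagree strongly.
responses = {
    "I share my data with others.": [418, 551, 199, 95, 30],
    "Others can access my data easily.": [150, 317, 310, 307, 207],
}

# Combined agreement per item, as a fraction of that item's respondents.
shares = {item: agreement_share(counts) for item, counts in responses.items()}

for item, share in shares.items():
    print(f"{item} {share:.1%} agree strongly or somewhat")
```

Read this way, the two rows show the gap between the share of respondents who say they share data and the share who say others can access it easily.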
Reasons for not making data electronically available.
| | Responses | Percent |
| Insufficient Time | 603 | 53.6% |
| Lack of Funding | 445 | 39.6% |
| Do not Have Rights to Make Data Public | 271 | 24.1% |
| No Place to Put Data | 264 | 23.5% |
| Lack of Standards | 222 | 19.8% |
| Sponsor does not Require | 196 | 17.4% |
| Do not Need Data | 169 | 15.0% |
| Other Reasons For Data Not Available | 164 | 14.6% |
| Should not be Available | 162 | 14.4% |
Conditions for data sharing.
| | Agree Strongly | Agree Somewhat | Neither Agree Nor Disagree | Disagree Somewhat | Disagree Strongly |
| I would use other researchers' datasets if their datasets were easily accessible. | 561 (43.2%) | 524 (40.3%) | 136 (10.5%) | 62 (4.8%) | 16 (1.2%) |
| I would be willing to place at least some of my data into a central data repository with no restrictions. | 539 (41.6%) | 472 (36.4%) | 141 (10.9%) | 104 (8%) | 39 (3%) |
| I would be willing to place all of my data into a central data repository with no restrictions. | 191 (14.9%) | 338 (26.3%) | 234 (18.2%) | 318 (24.7%) | 205 (15.9%) |
| I would be more likely to make my data available if I could place conditions on access. | 317 (24.8%) | 506 (39.6%) | 279 (21.8%) | 107 (8.4%) | 68 (5.3%) |
| I am satisfied with my ability to integrate data from disparate sources to address research questions. | 156 (12.2%) | 419 (32.7%) | 363 (28.3%) | 275 (21.5%) | 69 (5.4%) |
| I would be willing to share data across a broad group of researchers who use data in different ways. | 476 (37%) | 565 (43.9%) | 185 (14.4%) | 48 (3.7%) | 13 (1%) |
| It is important that my data are cited when used by other researchers. | 885 (68.6%) | 298 (23.1%) | 87 (6.7%) | 14 (1.1%) | 7 (0.5%) |
| It is appropriate to create new datasets from shared data. | 505 (38.9%) | 475 (36.6%) | 261 (20.1%) | 36 (2.8%) | 20 (1.5%) |
Others using data & using others' data.
| | For others to use my data | | To use other people's data | |
| | Yes | No | Yes | No |
| Co-authorship on publications resulting from use of the data | 751 (59.7%) | 506 (40.3%) | 750 (61.2%) | 476 (38.8%) |
| Formal acknowledgement of the data providers and/or funding agencies in all disseminated work making use of the data | 1168 (93%) | 88 (7%) | 1147 (93.3%) | 83 (6.7%) |
| Formal citation of the data providers and/or funding agencies in all disseminated work making use of the data | 1166 (94.5%) | 68 (5.5%) | 1152 (95.1%) | 59 (4.9%) |
| The opportunity to collaborate on the project (including, for example, consultation on analytic methods, interpretation of results, dissemination of research results, etc.) | 991 (80.6%) | 239 (19.4%) | 980 (81.2%) | 227 (18.8%) |
| Results based (at least in part) on the data could not be disseminated in any format without the data provider's approval. | 585 (47.7%) | 642 (52.3%) | 594 (48.9%) | 620 (51.1%) |
| At least part of the costs of data acquisition, retrieval or provision must be recovered. | 364 (30%) | 851 (70%) | 374 (31.2%) | 826 (68.8%) |
| Results based (at least in part) on the data could not be disseminated without the data provider having the opportunity to review the results and make suggestions or comments, but approval not required. | 746 (61.7%) | 464 (38.3%) | 750 (62.7%) | 447 (37.3%) |
| Reprints of articles that make use of the data must be provided to the data provider. | 860 (70.1%) | 367 (29.9%) | 850 (70.4%) | 357 (29.6%) |
| The data provider is given a complete list of all products that make use of the data, including articles, presentations, educational materials, etc. | 846 (69.3%) | 375 (30.7%) | 831 (69.1%) | 372 (30.9%) |
| Legal permission for data use is obtained. | 545 (44.8%) | 672 (55.2%) | 552 (45.8%) | 652 (54.2%) |
| Mutual agreement on reciprocal sharing of data | 880 (72.2%) | 339 (27.8%) | 865 (71.9%) | 338 (28.1%) |
| The data provider is given and agrees to a statement of uses to which the data will be put. | 810 (66.8%) | 403 (33.2%) | 799 (67%) | 394 (33%) |
Conditions for data sharing by subject discipline.
| | I am satisfied with my ability to integrate data from disparate sources to address research questions | |
| | Agree strongly | Agree somewhat |
| social sciences | 24 (11.9%) | 56 (27.9%) |
| computer science/engineering | 14 (12.3%) | 35 (30.7%) |
| physical sciences | 25 (16.8%) | 45 (30.2%) |
| environmental sciences & ecology | 53 (11.4%) | 169 (36.4%) |
| atmospheric science | 8 (16.3%) | 20 (40.8%) |
| biology | 19 (10.5%) | 53 (29.3%) |
| medicine | 1 (3.3%) | 10 (33.3%) |
| other | 12 (12.9%) | 31 (33.3%) |
Data sharing by subject discipline.
| | Others can access my data easily | |
| | Agree strongly | Agree somewhat |
| social sciences | 11(5.4%) | 36(17.8%) |
| computer science/engineering | 12(10.3%) | 29(24.8%) |
| physical sciences | 17(11.3%) | 41(27.3%) |
| environmental sciences & ecology | 56(12.0%) | 124(26.5%) |
| atmospheric science | 12(23.5%) | 13(25.5%) |
| biology | 28(15.6%) | 50(27.9%) |
| medicine | 2(6.5%) | 2(6.5%) |
| other | 12(13.0%) | 21(22.8%) |
Satisfaction for data management by subject discipline.
| | I am satisfied with the process for collecting my research data | | I am satisfied with the tools for preparing metadata | | I am satisfied with the tools for preparing my documentation | |
| | Agree strongly | Agree somewhat | Agree strongly | Agree somewhat | Agree strongly | Agree somewhat |
| social sciences | 52(25.7%) | 105(52.0%) | 6(3.2%) | 29(15.3%) | 20(10.0%) | 67(33.5%) |
| computer science/engineering | 26(22.0%) | 58(49.2%) | 9(7.8%) | 26(22.6%) | 16(13.7%) | 42(35.9%) |
| physical sciences | 51(33.1%) | 71(46.1%) | 9(6.3%) | 35(24.5%) | 21(14.1%) | 49(32.9%) |
| environmental sciences & ecology | 148(31.6%) | 233(49.7%) | 27(5.8%) | 91(19.4%) | 45(9.6%) | 138(29.6%) |
| atmospheric science | 15(30.0%) | 25(50.0%) | 3(6.4%) | 16(34.0%) | 6(12.5%) | 8(16.7%) |
| biology | 73(40.6%) | 83(46.1%) | 12(6.7%) | 36(20.2%) | 26(14.8%) | 60(34.1%) |
| medicine | 9(29.0%) | 13(41.9%) | 4(12.9%) | 2(6.5%) | 6(19.4%) | 12(38.7%) |
| other | 35(37.6%) | 38(40.9%) | 5(5.7%) | 17(19.5%) | 15(16.7%) | 37(41.1%) |
Data reuse by subject discipline.
| | Lack of access to data generated by other researchers or institutions is a major impediment to progress in science | | Data may be used in other ways than intended | |
| | Agree strongly | Agree somewhat | Agree strongly | Agree somewhat |
| social sciences | 52(25.7%) | 82(40.6%) | 73(36.1%) | 89(44.1%) |
| computer science/engineering | 43(37.1%) | 36(31.0%) | 30(25.9%) | 44(37.9%) |
| physical sciences | 33(21.3%) | 68(43.9%) | 43(28.3%) | 63(41.4%) |
| environmental sciences & ecology | 128(27.1%) | 203(43.0%) | 158(33.7%) | 207(44.1%) |
| atmospheric science | 11(22.4%) | 19(38.8%) | 13(26.5%) | 18(36.7%) |
| biology | 50(27.8%) | 71(39.4%) | 56(31.6%) | 74(41.8%) |
| medicine | 5(16.1%) | 11(35.5%) | 8(25.8%) | 12(38.7%) |
| other | 30(31.9%) | 30(31.9%) | 29(31.5%) | 32(34.8%) |
Conditions for data sharing by subject discipline.
| | I would use other researchers' datasets if their datasets were easily accessible | | I would be willing to place at least some of my data into a central data repository with no restrictions | | I would be willing to place all of my data into a central data repository with no restrictions | | I would be more likely to make my data available if I could place conditions on access | |
| | Agree strongly | Agree somewhat | Agree strongly | Agree somewhat | Agree strongly | Agree somewhat | Agree strongly | Agree somewhat |
| social sciences | 83(40.7%) | 79(38.7%) | 70(34.5%) | 75(36.9%) | 23(11.4%) | 41(20.4%) | 46(23.1%) | 82(41.2%) |
| computer science/engineering | 51(44.0%) | 44(37.9%) | 48(41.4%) | 44(37.9%) | 16(13.9%) | 31(27.0%) | 21(18.4%) | 47(41.2%) |
| physical sciences | 66(43.1%) | 66(43.1%) | 61(40.1%) | 61(40.1%) | 22(14.7%) | 44(29.3%) | 26(17.6%) | 60(40.5%) |
| environmental sciences & ecology | 221(47.0%) | 195(41.5%) | 223(47.6%) | 159(34.0%) | 70(15.0%) | 138(29.6%) | 133(28.9%) | 187(40.6%) |
| atmospheric science | 22(44.0%) | 22(44.0%) | 21(42.0%) | 25(50.0%) | 9(18.0%) | 17(34.0%) | 6(12.0%) | 29(58.0%) |
| biology | 67(37.0%) | 74(40.9%) | 75(41.4%) | 68(37.6%) | 39(21.7%) | 39(21.7%) | 56(31.1%) | 53(29.4%) |
| medicine | 8(26.7%) | 10(33.3%) | 4(13.3%) | 10(33.3%) | 1(3.3%) | 4(13.3%) | 8(26.7%) | 11(36.7%) |
| other | 42(44.7%) | 34(36.2%) | 36(38.3%) | 30(31.9%) | 11(11.8%) | 23(24.7%) | 21(22.3%) | 37(39.4%) |
χ² = 46.693, p = .015;
χ² = 69.438, p < .001;
χ² = 56.836, p = .001;
χ² = 43.404, p = .032.
Conditions for data sharing for reuse by subject discipline.
| | I would be willing to share data across a broad group of researchers who use data in different ways | | It is important that my data are cited when used by other researchers | | It is appropriate to create new datasets from shared data | |
| | Agree strongly | Agree somewhat | Agree strongly | Agree somewhat | Agree strongly | Agree somewhat |
| social sciences | 60(29.7%) | 100(49.5%) | 119(59.5%) | 55(27.5%) | 77(37.9%) | 73(36.0%) |
| computer science/engineering | 36(31.9%) | 41(36.3%) | 68(58.6%) | 32(27.6%) | 40(34.5%) | 36(31.0%) |
| physical sciences | 57(38.0%) | 64(42.7%) | 112(74.7%) | 30(20.0%) | 53(34.9%) | 61(40.1%) |
| environmental sciences & ecology | 199(42.4%) | 210(44.8%) | 331(70.4%) | 109(23.2%) | 196(41.7%) | 192(40.9%) |
| atmospheric science | 18(36.7%) | 28(57.1%) | 40(80.0%) | 10(20.0%) | 17(34.0%) | 19(38.0%) |
| biology | 73(40.6%) | 65(36.1%) | 134(74.4%) | 31(17.2%) | 82(45.3%) | 53(29.3%) |
| medicine | 4(13.3%) | 17(56.7%) | 17(56.7%) | 9(30.0%) | 9(30.0%) | 8(26.7%) |
| Other | 28(30.1%) | 40(43.0%) | 63(67.0%) | 22(23.4%) | 30(31.9%) | 33(35.1%) |
χ² = 71.679, p < .001;
χ² = 41.985, p = .044;
χ² = 43.649, p = .030.
Using others' data by subject discipline.
| | Co-authorship on publications resulting from use of the data | The opportunity to collaborate on the project | Results based (at least in part) on the data could not be disseminated in any format without the data provider's approval | At least part of the costs of data acquisition, retrieval or provision must be recovered | Reprints of articles that make use of the data must be provided to the data provider | Legal permission for data use is obtained |
| social sciences | 104(52.8%) | 142(72.1%) | 83(42.8%) | 60(30.9%) | 125(64.4%) | 107(55.2%) |
| computer science/engineering | 59(52.2%) | 88(80.0%) | 59(54.6%) | 46(42.2%) | 66(60.0%) | 61(56.0%) |
| physical sciences | 84(55.3%) | 120(82.8%) | 64(43.8%) | 35(24.3%) | 107(73.8%) | 58(40.3%) |
| environmental sciences & ecology | 289(63.7%) | 360(80.7%) | 208(46.2%) | 122(27.5%) | 337(75.4%) | 173(38.7%) |
| atmospheric science | 31(63.3%) | 39(79.6%) | 20(42.6%) | 11(24.4%) | 32(66.7%) | 17(37.8%) |
| biology | 105(60.7%) | 143(84.6%) | 84(49.7%) | 42(25.3%) | 114(67.1%) | 65(38.9%) |
| medicine | 24(82.8%) | 27(93.1%) | 20(71.4%) | 15(53.6%) | 22(78.6%) | 19(70.4%) |
| other | 54(60.7%) | 71(84.5%) | 47(56.0%) | 33(38.8%) | 56(66.7%) | 45(54.2%) |
χ² = 17.514, p = .014;
χ² = 15.076, p = .035;
χ² = 14.610, p = .041;
χ² = 24.282, p = .001;
χ² = 17.680, p = .014;
χ² = 35.158, p < .001.
Using others' data by subject discipline.
| | Co-authorship on publications resulting from use of the data | The opportunity to collaborate on the project | At least part of the costs of data acquisition, retrieval or provision must be recovered | Legal permission for data use is obtained |
| social sciences | 103(54.2%) | 144(73.5%) | 60(30.9%) | 107(55.7%) |
| computer science/engineering | 56(51.4%) | 86(77.5%) | 48(44.0%) | 63(58.9%) |
| physical sciences | 83(55.7%) | 119(84.4%) | 40(27.8%) | 60(41.4%) |
| environmental sciences & ecology | 293(66.0%) | 355(81.4%) | 125(28.5%) | 171(38.9%) |
| atmospheric science | 30(63.8%) | 38(79.2%) | 11(25.6%) | 20(42.6%) |
| biology | 107(62.6%) | 144(87.3%) | 43(26.7%) | 67(41.1%) |
| medicine | 24(82.8%) | 26(89.7%) | 16(57.1%) | 19(70.4%) |
| other | 53(61.6%) | 67(83.8%) | 31(37.8%) | 45(54.9%) |
χ² = 20.469, p = .005;
χ² = 15.439, p = .031;
χ² = 23.199, p = .002;
χ² = 35.590, p < .001.
Organizational involvement in data issues by age group.
| | | Age 20–39 | Age 40–50 | Age over 50 |
| My organization or project has a formal established process for managing data during the life of the project | Agree Strongly | 68(15.5%) | 54(15.2%) | 71(18.5%) |
| | Agree somewhat | 124(28.2%) | 85(23.9%) | 97(25.3%) |
| My organization or project provides the necessary tools and technical support for data management beyond the life of the project | Agree Strongly | 65(14.8%) | 31(8.8%) | 45(11.7%) |
| | Agree somewhat | 102(23.2%) | 77(21.9%) | 91(23.6%) |
| My organization or project provides the necessary funds to support data management during the life of a research project | Agree Strongly | 46(10.5%) | 26(7.4%) | 29(7.6%) |
| | Agree somewhat | 106(24.1%) | 78(22.1%) | 68(17.8%) |
| My organization or project provides the necessary funds to support data management beyond the life of the project | Agree Strongly | 43(9.8%) | 13(3.7%) | 18(4.7%) |
| | Agree somewhat | 82(18.6%) | 49(14.0%) | 46(11.9%) |
χ² = 17.444, p = .026;
χ² = 21.800, p = .005;
χ² = 30.504, p < .001;
χ² = 45.763, p < .001.
Data reuse by age group.
| | | Age 20–39 | Age 40–50 | Age over 50 |
| Lack of access to data generated by other researchers or institutions is a major impediment to progress in science | Agree Strongly | 138(30.9%) | 105(29.3%) | 83(21.4%) |
| | Agree somewhat | 182(40.8%) | 138(38.5%) | 157(40.6%) |
| Lack of access to data generated by other researchers or institutions has restricted my ability to answer scientific questions | Agree Strongly | 100(22.4%) | 63(17.6%) | 46(11.9%) |
| | Agree somewhat | 154(34.5%) | 102(28.6%) | 131(34.0%) |
χ² = 19.082, p = .014;
χ² = 29.320, p < .001.
Data sharing by age group.
| | | Age 20–39 | Age 40–50 | Age over 50 |
| I would be willing to place all of my data into a central data repository with no restrictions | Agree Strongly | 50(11.3%) | 47(13.3%) | 69(18.0%) |
| | Agree somewhat | 124(28.1%) | 88(24.9%) | 106(27.6%) |
| I would be more likely to make my data available if I could place conditions on access | Agree Strongly | 119(27.0%) | 94(26.9%) | 82(21.6%) |
| | Agree somewhat | 187(42.4%) | 135(38.6%) | 143(37.7%) |
| It is appropriate to create new datasets from shared data | Agree Strongly | 175(39.1%) | 132(37.0%) | 161(41.7%) |
| | Agree somewhat | 159(35.6%) | 129(36.1%) | 145(37.6%) |
χ² = 16.072, p = .041;
χ² = 19.507, p = .012;
χ² = 15.620, p = .048.
Others using data by age group.
| | The data provider is given a complete list of all products that make use of the data | Legal permission for data use is obtained |
| Age 20–39 | 311(74.2%) | 217(51.1%) |
| Age 40–50 | 230(66.9%) | 160(47.5%) |
| Age over 50 | 239(66.4%) | 130(36.1%) |
χ² = 7.180, p = .028;
χ² = 18.603, p < .001.
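The chi-square statistics reported under these tables can be approximately re-derived from the printed counts. The sketch below does this in Python for the first column of the table above (the data provider is given a complete list of all products). The table prints only the "yes" counts, so the group sizes in `n_assumed` are assumptions back-calculated from the printed percentages (for example, 311 / 0.742 ≈ 419). The function name `pearson_chi_square` is hypothetical, and the source does not state whether a plain Pearson test is what the authors ran.

```python
import math


def pearson_chi_square(table):
    """Return the Pearson chi-square statistic and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of row and column.
            expected = row_totals[i] * col_totals[j] / grand_total
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(col_totals) - 1)
    return stat, dof


# "Yes" counts as printed in the table.
yes = {"Age 20-39": 311, "Age 40-50": 230, "Age over 50": 239}
# ASSUMPTION: group sizes reconstructed from the printed percentages.
n_assumed = {"Age 20-39": 419, "Age 40-50": 344, "Age over 50": 360}

# One row per age group: [yes, no].
table = [[yes[group], n_assumed[group] - yes[group]] for group in yes]

chi2, dof = pearson_chi_square(table)
# With exactly 2 degrees of freedom, the chi-square upper tail is exp(-x / 2).
p_value = math.exp(-chi2 / 2)

print(f"chi2 = {chi2:.3f}, df = {dof}, p = {p_value:.3f}")
```

Under these assumptions the result lands close to the reported χ² = 7.180, p = .028. The closed-form p-value holds only for two degrees of freedom; tables with more categories would need a general chi-square distribution function.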
Using others' data by age group.
| | The data provider is given a complete list of all products that make use of the data | Legal permission for data use is obtained | The data provider is given and agrees to a statement of uses to which the data will be put |
| Age 20–39 | 306(73.9%) | 218(52.2%) | 294(71.5%) |
| Age 40–50 | 219(65.2%) | 164(49.1%) | 221(66.4%) |
| Age over 50 | 241(67.5%) | 131(36.6%) | 224(63.3%) |
χ² = 7.344, p = .025;
χ² = 20.386, p < .001;
χ² = 6.082, p = .048.
Conditions for data sharing by activity.
| | | Teaching-intensive | Research-intensive |
| I would be willing to place at least some of my data into a central data repository with no restrictions | Agree Strongly | 64(37.6%) | 282(43.0%) |
| | Agree somewhat | 61(35.9%) | 233(35.5%) |
| I would be willing to share data across a broad group of researchers who use data in different ways | Agree Strongly | 54(32.0%) | 260(39.8%) |
| | Agree somewhat | 76(45.0%) | 282(43.1%) |
χ² = 11.479, p = .022;
χ² = 12.122, p = .016.
Data access by activity.
| | Others can access my data easily | |
| | Agree strongly | Agree somewhat |
| Teaching-intensive | 14(8.3%) | 30(17.9%) |
| Research-intensive | 73(11.2%) | 176(27.0%) |
χ² = 12.270, p = .015.
Organizational involvement by activity.
| | | Teaching-intensive | Research-intensive |
| My organization or project has a formal established process for managing data during the life of the project | Agree strongly | 15(8.9%) | 124(19.0%) |
| | Agree somewhat | 30(17.9%) | 179(27.5%) |
| My organization or project has a formal established process for storing data beyond the life of the project | Agree strongly | 8(4.7%) | 121(18.6%) |
| | Agree somewhat | 26(15.4%) | 159(24.5%) |
| My organization or project provides the necessary tools and technical support for data management during the life of the project | Agree strongly | 12(7.1%) | 115(17.8%) |
| | Agree somewhat | 46(27.1%) | 203(31.4%) |
| My organization or project provides the necessary tools and technical support for data management beyond the life of the project | Agree strongly | 9(5.3%) | 97(14.9%) |
| | Agree somewhat | 31(18.3%) | 162(24.9%) |
| My organization or project provides training on best practices for data management | Agree strongly | 3(1.8%) | 47(7.3%) |
| | Agree somewhat | 21(12.4%) | 108(16.7%) |
| My organization or project provides the necessary funds to support data management during the life of a research project | Agree strongly | 3(1.8%) | 68(10.4%) |
| | Agree somewhat | 27(16.0%) | 152(23.3%) |
| My organization or project provides the necessary funds to support data management beyond the life of the project | Agree strongly | 3(1.8%) | 51(7.8%) |
| | Agree somewhat | 16(9.4%) | 121(18.6%) |
χ² = 22.598, p < .001;
χ² = 33.678, p < .001;
χ² = 16.981, p = .002;
χ² = 18.068, p = .001;
χ² = 10.793, p = .029;
χ² = 21.447, p < .001;
χ² = 21.092, p < .001.
Satisfaction by geographic location.
| | | North America | Europe | Others |
| I am satisfied with the process for collecting my research data | Agree strongly | 311(34.0%) | 44(23.8%) | 47(30.1%) |
| | Agree somewhat | 435(47.5%) | 95(51.4%) | 72(46.2%) |
| I am satisfied with the process for storing my data during the life of the project | Agree strongly | 279(30.7%) | 47(25.7%) | 41(26.6%) |
| | Agree somewhat | 405(44.6%) | 65(35.5%) | 68(44.2%) |
χ² = 20.009, p = .010;
χ² = 18.201, p = .020.
Satisfaction with data management by geographic location.
| | | North America | Europe | Others |
| I am satisfied with the process for storing my data beyond the life of the project | Agree strongly | 141(15.4%) | 21(11.5%) | 39(25.2%) |
| | Agree somewhat | 251(27.5%) | 59(32.2%) | 45(29.0%) |
| I am satisfied with the tools for preparing metadata | Agree strongly | 39(4.4%) | 12(6.7%) | 21(13.9%) |
| | Agree somewhat | 167(18.8%) | 35(19.4%) | 43(28.5%) |
| I am satisfied with the tools for preparing my documentation | Agree strongly | 98(10.8%) | 15(8.2%) | 38(24.8%) |
| | Agree somewhat | 290(32.1%) | 59(32.4%) | 55(35.9%) |
χ² = 24.102, p = .002;
χ² = 34.898, p < .001;
χ² = 36.098, p < .001.
Organizational involvement in data issues by geographic location.
| | | North America | Europe | Others |
| My organization or project has a formal established process for managing data during the life of the project | Agree strongly | 164(17.9%) | 17(9.2%) | 34(22.5%) |
| | Agree somewhat | 229(25.1%) | 46(24.9%) | 43(28.5%) |
| My organization or project has a formal established process for storing data beyond the life of the project | Agree strongly | 151(16.6%) | 16(8.6%) | 28(18.7%) |
| | Agree somewhat | 201(22.1%) | 41(22.0%) | 43(28.7%) |
| My organization or project provides training on best practices for data management | Agree strongly | 48(5.3%) | 5(2.7%) | 19(12.8%) |
| | Agree somewhat | 142(15.6%) | 25(13.6%) | 27(18.2%) |
| My organization or project provides the necessary funds to support data management during the life of a research project | Agree strongly | 87(9.5%) | 9(4.8%) | 17(11.4%) |
| | Agree somewhat | 188(20.6%) | 36(19.4%) | 41(27.5%) |
| My organization or project provides the necessary funds to support data management beyond the life of the project | Agree strongly | 62(6.8%) | 6(3.2%) | 16(10.7%) |
| | Agree somewhat | 137(15.0%) | 18(9.7%) | 32(21.3%) |
χ² = 21.461, p = .006;
χ² = 21.562, p = .006;
χ² = 25.298, p = .001;
χ² = 17.585, p = .025;
χ² = 23.352, p = .003.
Data reuse by geographic location.
| | | North America | Europe | Others |
| Lack of access to data generated by other researchers or institutions is a major impediment to progress in science | Agree strongly | 207(22.6%) | 64(34.0%) | 75(47.8%) |
| | Agree somewhat | 376(41.0%) | 72(38.3%) | 49(31.2%) |
| Lack of access to data generated by other researchers or institutions has restricted my ability to answer scientific questions | Agree strongly | 127(13.9%) | 45(23.9%) | 46(29.3%) |
| | Agree somewhat | 298(32.6%) | 58(30.9%) | 53(33.8%) |
| Data may be misinterpreted due to complexity of the data | Agree strongly | 276(30.3%) | 57(30.5%) | 40(25.6%) |
| | Agree somewhat | 434(47.6%) | 79(42.2%) | 61(39.1%) |
| Data may be used in other ways than intended | Agree strongly | 312(34.3%) | 42(22.5%) | 46(29.5%) |
| | Agree somewhat | 375(41.3%) | 84(44.9%) | 60(38.5%) |
χ² = 52.125, p < .001;
χ² = 41.971, p < .001;
χ² = 41.022, p < .001;
χ² = 17.484, p = .025.
Conditions for data sharing by geographic location.
| | | North America | Europe | Others |
| I would use other researchers' datasets if their datasets were easily accessible | Agree strongly | 371(40.4%) | 83(44.4%) | 91(58.3%) |
| | Agree somewhat | 397(43.2%) | 74(39.6%) | 37(23.7%) |
| I would be willing to place all of my data into a central data repository with no restrictions | Agree strongly | 127(14.0%) | 23(12.4%) | 36(23.5%) |
| | Agree somewhat | 235(25.8%) | 43(23.2%) | 45(29.4%) |
| I would be more likely to make my data available if I could place conditions on access | Agree strongly | 201(22.3%) | 54(29.0%) | 52(33.8%) |
| | Agree somewhat | 357(39.7%) | 70(37.6%) | 60(39.0%) |
| I am satisfied with my ability to integrate data from disparate sources to address research questions | Agree strongly | 87(9.6%) | 27(14.6%) | 36(23.5%) |
| | Agree somewhat | 297(32.8%) | 54(29.2%) | 53(34.6%) |
χ² = 28.331, p < .001;
χ² = 24.507, p = .002;
χ² = 17.579, p = .025;
χ² = 42.956, p < .001.
Others using data by geographic location.
| | North America | Europe | Others |
| Co-authorship on publications resulting from use of the data | 504(56.9%) | 112(61.5%) | 113(73.4%) |
| Formal acknowledgement of the data providers and/or funding agencies in all disseminated work making use of the data | 837(93.8%) | 160(88.9%) | 142(94.7%) |
| The opportunity to collaborate on the project | 690(78.9%) | 150(84.7%) | 127(87%) |
| Results based (at least in part) on the data could not be disseminated in any format without the data provider's approval | 398(45.5%) | 85(48.3%) | 88(60.7%) |
| At least part of the costs of data acquisition, retrieval or provision must be recovered | 232(26.7%) | 48(27.9%) | 75(52.1%) |
| Results based (at least in part) on the data could not be disseminated without the data provider having the opportunity to review the results and make suggestions or comments, but approval not required | 535(61.7%) | 94(55%) | 98(69%) |
| Reprints of articles that make use of the data must be provided to the data provider | 593(67.8%) | 125(71.4%) | 118(81.4%) |
| The data provider is given a complete list of all products that make use of the data, including articles, presentations, educational materials, etc. | 593(68.2%) | 109(62.3%) | 121(82.9%) |
| Legal permission for data use is obtained | 347(39.9%) | 88(51.2%) | 93(64.1%) |
| Mutual agreement on reciprocal sharing of data | 605(69.5%) | 128(73.1%) | 128(89.1%) |
| The data provider is given and agrees to a statement of uses to which the data will be put | 561(64.4%) | 108(64.3%) | 121(84%) |
χ² = 15.141, p = .001;
χ² = 6.360, p = .042;
χ² = 7.307, p = .026;
χ² = 11.465, p = .003;
χ² = 38.343, p < .001;
χ² = 6.482, p = .039;
χ² = 11.170, p = .004;
χ² = 17.102, p < .001;
χ² = 33.238, p < .001;
χ² = 24.774, p < .001;
χ² = 21.989, p < .001.
Using others' data by geographic region.
| | North America | Europe | Others |
| Co-authorship on publications resulting from use of the data | 509(58.4%) | 114(64%) | 104(72.7%) |
| Formal citation of the data providers and/or funding agencies in all disseminated work making use of the data | 818(94.3%) | 166(97.1%) | 140(98.6%) |
| The opportunity to collaborate on the project | 690(79.7%) | 146(84.9%) | 121(87.7%) |
| Results based (at least in part) on the data could not be disseminated in any format without the data provider's approval | 402(46.3%) | 87(50%) | 92(64.8%) |
| At least part of the costs of data acquisition, retrieval or provision must be recovered | 246(28.5%) | 47(27.8%) | 73(52.5%) |
| Reprints of articles that make use of the data must be provided to the data provider | 590(68.1%) | 123(71.9%) | 115(82.7%) |
| The data provider is given a complete list of all products that make use of the data, including articles, presentations, educational materials, etc. | 589(68.3%) | 106(62%) | 114(81.4%) |
| Legal permission for data use is obtained | 356(41.1%) | 86(51.2%) | 94(66.7%) |
| Mutual agreement on reciprocal sharing of data | 600(69.4%) | 122(71.8%) | 125(89.9%) |
| The data provider is given and agrees to a statement of uses to which the data will be put | 560(65%) | 103(63.2%) | 117(84.2%) |
χ² = 11.484, p = .003;
χ² = 6.328, p = .042;
χ² = 6.667, p = .036;
χ² = 16.738, p < .001;
χ² = 33.218, p < .001;
χ² = 12.621, p = .002;
χ² = 14.213, p = .001;
χ² = 34.383, p < .001;
χ² = 25.215, p < .001;
χ² = 21.227, p < .001.