| Literature DB >> 23935830 |
Jillian C Wallis1, Elizabeth Rolando, Christine L Borgman.
Abstract
Research on practices to share and reuse data will inform the design of infrastructure to support data collection, management, and discovery in the long tail of science and technology. These are research domains in which data tend to be local in character, minimally structured, and minimally documented. We report on a ten-year study of the Center for Embedded Network Sensing (CENS), a National Science Foundation Science and Technology Center. We found that CENS researchers are willing to share their data, but few are asked to do so, and in only a few domain areas do their funders or journals require them to deposit data. Few repositories exist to accept data in CENS research areas.. Data sharing tends to occur only through interpersonal exchanges. CENS researchers obtain data from repositories, and occasionally from registries and individuals, to provide context, calibration, or other forms of background for their studies. Neither CENS researchers nor those who request access to CENS data appear to use external data for primary research questions or for replication of studies. CENS researchers are willing to share data if they receive credit and retain first rights to publish their results. Practices of releasing, sharing, and reusing of data in CENS reaffirm the gift culture of scholarship, in which goods are bartered between trusted colleagues rather than treated as commodities.Entities:
Mesh:
Year: 2013 PMID: 23935830 PMCID: PMC3720779 DOI: 10.1371/journal.pone.0067332
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1CENS data types organized by collection method and use (adapted from [47]).
Interview participants and their distribution.
| Research Area | Status | Round 1 | Round 2 | Totals |
|
| Faculty | 7 | 6 | 13 |
| Staff | 5 | 2 | 7 | |
| Student | 3 | 2 | 5 | |
|
| Faculty | 4 | 2 | 6 |
| Staff | 1 | 2 | 3 | |
| Student | 2 | 7 | 9 | |
|
| 22 | 21 | 43 |
Conditions for data sharing.
| “I will share my data if….” | Round 1 | Round 2 | Total |
|
|
|
|
|
|
|
|
|
|
| I have first rights to publish the results from the data | 15 | 5 | 20 |
| I will receive proper attribution as the data source | 5 | 2 | 7 |
| The requestor is known to me or my group | 2 | 4 | 6 |
| My research funder expects me to share | 2 | 4 | 6 |
| Minimal effort is required to share | 1 | 4 | 5 |
| Sharing was negotiated in advance of exchange | 1 | 3 | 4 |
| The data are appropriately sized (not too big or too small) | 1 | 3 | 4 |
| Research and/or data are developed and stable | 3 | 3 | |
| My community expects me to do so | 3 | 3 | |
| Minimal effort was required to collect data | 2 | 2 | |
| The data will be easily understood by others | 1 | 1 | 2 |
| The journal requires that the data be shared | 1 | 1 | 2 |
| Permission was granted by the PI on the project | 2 | 2 | |
| Standard methods exist for interoperability | 1 | 1 | |
| Shared data are not focus of participant's research | 1 | 1 | |
| Data collection is part of my job description | 1 | 1 | |
| I do not plan to commercialize the data or technology | 1 | 1 | |
| Shared data will be re-shared with others | 1 | 1 | |
| Data recipient and I address same research question | 1 | 1 | |
|
|
|
|
|
Methods for sharing data.
|
| Round 1 | Round 2 | Total |
|
|
|
|
|
|
|
|
|
|
| Fulfill personal requests | 10 | 12 | 22 |
| Post data to a website | 15 | 6 | 21 |
| Submit data to a repository | 2 | 10 | 12 |
| Data Publication | 2 | 4 | 6 |
| Supplement to published journal article | 2 | 1 | 3 |
| Submit data description to a registry | 3 | 1 | 4 |
|
|
|
|
|
Repositories used by participants to share data.
| Name of repository | Round 1 | Round 2 | Total | Participant Discipline |
|
|
|
|
| |
| Anopheles Database | 1 | 0 | 1 | Marine Biology |
| Code.Google | 0 | 1 | 1 | Computer Science |
| Crawdad | 0 | 1 | 1 | Computer Science |
| EDDMaps | 0 | 2 | 2 | Ecology |
| Free(Code) | 0 | 1 | 1 | Computer Science |
| GenBank | 1 | 1 | 2 | Marine Biology |
| IRIS | 0 | 2 | 2 | Seismology |
| Personally managed SVN | 0 | 1 | 1 | Computer Science |
| SensorBase | 0 | 1 | 1 | Environmental Engineering |
|
|
|
|
|
Where researchers find data for reuse and what data they use.
| Name of Data Source | Observatory or Repository? | Data Used | Round 1 | Round 2 | Total |
|
|
|
|
| ||
|
|
|
|
| ||
| CA Irrigation Management Information System | observatory | weather, solar radiation, soil temp | 1 | 1 | |
| California Data Exchange Center (CDEC) | observatory | river conditions, river scales, gauging | 1 | 1 | 2 |
| Central and Northern CA Ocean Observing System (CeNCOOS) | Observatory/repository | unspecified | 1 | 1 | |
| Crawdad | repository | 802.11 measures | 1 | 1 | |
| DARPA | observatory | photos | 1 | 1 | |
| Free(Code) | repository | Software code | 1 | 1 | |
| Heal the Bay | observatory | Malibu Watershed data, tidal charts | 1 | 1 | |
| James Reserve (JR) | observatory/repository | Weather, environmental data, photos, web cam/Visitors' data | 4 | 1 | 5 |
| Macaulay Library at Cornell | Recordings of bird sounds | 2 | 2 | ||
| NASA | observatory | unspecified coastal ocean data | 1 | 1 | |
| NASA's MODIS Satellite | observatory | spectral bands | 1 | 1 | |
| NOAA | observatory | tidal height | 1 | 1 | |
| NOAA's National Weather Service | observatory | point data | 1 | 1 | |
| Satellite (unspecified) | observatory | images | 1 | 1 | 2 |
| Southern CA Coastal Ocean Observing System (SCOOS) | observatory/repository | unspecified coastal ocean data | 1 | 1 | |
| TerraServer | observatory | remote sensing | 1 | 1 | |
| UIUC Face Database | observatory | facial images | 1 | 1 | |
| US Geologic Survey (USGS) | observatory | remote sensing, demographic data, gravitational data | 1 | 1 | 2 |
|
|
|
|
| ||