| Literature DB >> 26452388 |
Kim W Carter, Richard W Francis, K W Carter1, R W Francis1, M Bresnahan2, M Gissler3, T K Grønborg4, R Gross5, N Gunnes6, G Hammond1, M Hornig7, C M Hultman8, J Huttunen9, A Langridge1, H Leonard1, S Newman10, E T Parner4, G Petersson8, A Reichenberg11, S Sandin8, D E Schendel12, L Schalkwyk10, A Sourander13, C Steadman1, C Stoltenberg14, A Suominen15, P Surén6, E Susser2, A Sylvester Vethanayagam16, Z Yusof8.
Abstract
BACKGROUND: Research studies exploring the determinants of disease require sufficient statistical power to detect meaningful effects. Sample size is often increased through centralized pooling of disparately located datasets, though ethical, privacy and data ownership issues can often hamper this process. Methods that facilitate the sharing of research data that are sympathetic with these issues and which allow flexible and detailed statistical analyses are therefore in critical need. We have created a software platform for the Virtual Pooling and Analysis of Research data (ViPAR), which employs free and open source methods to provide researchers with a web-based platform to analyse datasets housed in disparate locations.Entities:
Keywords: ViPAR; data federation; data pooling; data sharing
Year: 2015 PMID: 26452388 PMCID: PMC4864874 DOI: 10.1093/ije/dyv193
Source DB: PubMed Journal: Int J Epidemiol ISSN: 0300-5771 Impact factor: 7.196
Figure 1.ViPAR topology. A typical multi-site ViPAR configuration where a ViPAR master server (VMS) is linked to a number of remote sites. Each remote site stores and maintains their research data. Users of the ViPAR system access the web-based analytical portal where they can initiate analyses. During an analysis, the federation component retrieves data from the remote sites into RAM on the VMS where they are analysed and removed without ever permanently being stored.
Summary of hardware and software requirements for the key ViPAR server components
| ViPAR Master Server (VMS) | Local ViPAR Database (LVD) Server | |
|---|---|---|
| Hardware | Physical or virtual server with at least two CPU cores, 8GB of RAM and 50GB of disk space | Physical or virtual server with at least 1 CPU core, 1GB of RAM and 5 GB of disk space |
| Software (pre-installed) | Perl 5.10 or greater, OpenSSH 5.4 or greater, MySQL 5.5 or greater, Apache webserver, R statistical software | OpenSSH 5.4 or greater, MySQL 5.5 or greater |
| Software (optional) | OpenSSL server-side certificates, SAS, STATA, denyhosts | denyhosts |
Figure 2.VWAP analysis interface. Screenshot of browsing the VWAP analysis interface. Here the analyst has provided some simple syntax in the R language to provide summary information for the single selected variable across all selected resources.
Figure 3.VWAP file manager. Screenshot of browsing the VWAP file manager. Here the output files resulting from a single analysis are displayed. Users can download files individually or all at once in the provided ZIP file. Optionally users can upload files to associate with an analysis. In addition there are options for deleting the results of an analysis and for sharing the results with other users of the system.