| Literature DB >> 33816800 |
Kyle Chard1,2, Eli Dart3, Ian Foster1,2, David Shifflett1,2, Steven Tuecke1,2, Jason Williams1,2.
Abstract
We describe best practices for providing convenient, high-speed, secure access to large data via research data portals. We capture these best practices in a new design pattern, the Modern Research Data Portal, that disaggregates the traditional monolithic web-based data portal to achieve orders-of-magnitude increases in data transfer performance, support new deployment architectures that decouple control logic from data storage, and reduce development and operations costs. We introduce the design pattern; explain how it leverages high-performance data enclaves and cloud-based data management services; review representative examples at research laboratories and universities, including both experimental facilities and supercomputer sites; describe how to leverage Python APIs for authentication, authorization, data transfer, and data sharing; and use coding examples to demonstrate how these APIs can be used to implement a range of research data portal capabilities. Sample code at a companion web site, https://docs.globus.org/mrdp, provides application skeletons that readers can adapt to realize their own research data portals. ©2018 Chard et al.Entities:
Keywords: Data transfer node; Globus; High-speed network; Portal; Science DMZ
Year: 2018 PMID: 33816800 PMCID: PMC7924693 DOI: 10.7717/peerj-cs.144
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992