| Literature DB >> 34791407 |
Mallory Ann Freeberg1, Lauren A Fromont2, Teresa D'Altri2, Anna Foix Romero1, Jorge Izquierdo Ciges1, Aina Jene2, Giselle Kerry1, Mauricio Moldes2, Roberto Ariosa2, Silvia Bahena1, Daniel Barrowdale1, Marcos Casado Barbero1, Dietmar Fernandez-Orth2, Carles Garcia-Linares1, Emilio Garcia-Rios1, Frédéric Haziza2, Bela Juhasz1, Oscar Martinez Llobet2, Gemma Milla2, Anand Mohan1, Manuel Rueda2, Aravind Sankar1, Dona Shaju1, Ashutosh Shimpi1, Babita Singh2, Coline Thomas1, Sabela de la Torre2, Umuthan Uyan2, Claudia Vasallo2, Paul Flicek1, Roderic Guigo2, Arcadi Navarro2, Helen Parkinson1, Thomas Keane1, Jordi Rambla2.
Abstract
The European Genome-phenome Archive (EGA - https://ega-archive.org/) is a resource for long term secure archiving of all types of potentially identifiable genetic, phenotypic, and clinical data resulting from biomedical research projects. Its mission is to foster hosted data reuse, enable reproducibility, and accelerate biomedical and translational research in line with the FAIR principles. Launched in 2008, the EGA has grown quickly, currently archiving over 4,500 studies from nearly one thousand institutions. The EGA operates a distributed data access model in which requests are made to the data controller, not to the EGA, therefore, the submitter keeps control on who has access to the data and under which conditions. Given the size and value of data hosted, the EGA is constantly improving its value chain, that is, how the EGA can contribute to enhancing the value of human health data by facilitating its submission, discovery, access, and distribution, as well as leading the design and implementation of standards and methods necessary to deliver the value chain. The EGA has become a key GA4GH Driver Project, leading multiple development efforts and implementing new standards and tools, and has been appointed as an ELIXIR Core Data Resource.Entities:
Mesh:
Year: 2022 PMID: 34791407 PMCID: PMC8728218 DOI: 10.1093/nar/gkab1059
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Data archived at EGA between 2013–2021. Cumulative size of data (A), number of studies and datasets (B), and number of files (C) archived and available for download from EGA per year. (D) Number of institutes per country that have archived data at the EGA.
Figure 2.EGA facilitates the submission, discovery, access, and distribution of sensitive human data. A researcher submits controlled access human genetic, phenotypic and clinical data to EGA after signing a Data Processing Agreement (1). EGA processes, archives, and releases the dataset to be findable. Another researcher discovers data of interest at the EGA (2). They contact the Data Access Committee for the data of interest and agree to the terms of data reuse by signing a Data Access Agreement (3). The Data Access Committee informs EGA that access is approved (4). The EGA grants access to the requesting researcher (5) who can then download and visualise the data (6). GDPR: General Data Protection Regulation.
Figure 3.EGA data distribution to approved researchers between 2011 and 2021. (A) Number of EGA data requester accounts created over time. (B) Amount of data distributed to approved researchers over time.
Figure 4.The EGA offers a variety of secure data access and download services to meet user needs, many of which implement GA4GH standards. FUSE: Filesystem in Userspace. AAI: Authentication and Authorization Infrastructure. OpenIDC: OpenID Connect, an open standard and decentralized authentication protocol.