| Literature DB >> 34220930 |
Peter W Harrison1, Alexey Sokolov1, Akshatha Nayak1, Jun Fan1, Daniel Zerbino1, Guy Cochrane1, Paul Flicek1.
Abstract
The Functional Annotation of ANimal Genomes (FAANG) project is a worldwide coordinated action creating high-quality functional annotation of farmed and companion animal genomes. The generation of a rich genome-to-phenome resource and supporting informatic infrastructure advances the scope of comparative genomics and furthers the understanding of functional elements. The project also provides terrestrial and aquatic animal agriculture community powerful resources for supporting improvements to farmed animal production, disease resistance, and genetic diversity. The FAANG Data Portal (https://data.faang.org) ensures Findable, Accessible, Interoperable and Reusable (FAIR) open access to the wealth of sample, sequencing, and analysis data produced by an ever-growing number of FAANG consortia. It is developed and maintained by the FAANG Data Coordination Centre (DCC) at the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI). FAANG projects produce a standardised set of multi-omic assays with resulting data placed into a range of specialised open data archives. To ensure this data is easily findable and accessible by the community, the portal automatically identifies and collates all submitted FAANG data into a single easily searchable resource. The Data Portal supports direct download from the multiple underlying archives to enable seamless access to all FAANG data from within the portal itself. The portal provides a range of predefined filters, powerful predictive search, and a catalogue of sampling and analysis protocols and automatically identifies publications associated with any dataset. To ensure all FAANG data submissions are high-quality, the portal includes powerful contextual metadata validation and data submissions brokering to the underlying EMBL-EBI archives. The portal will incorporate extensive new technical infrastructure to effectively deliver and standardise FAANG's shift to single-cellomics, cell atlases, pangenomes, and novel phenotypic prediction models. The Data Portal plays a key role for FAANG by supporting high-quality functional annotation of animal genomes, through open FAIR sharing of data, complete with standardised rich metadata. Future Data Portal features developed by the DCC will support new technological developments for continued improvement for FAANG projects.Entities:
Keywords: Data Portal; FAANG; FAIR data; agricultural genomics; functional annotation; metadata validation; open access; phenotype to genotype
Year: 2021 PMID: 34220930 PMCID: PMC8248360 DOI: 10.3389/fgene.2021.639238
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1FAANG Data Portal architecture with local Elasticsearch metadata database, python, and JSON-schema contextual validation and brokering of validated data to underlying public archives.
Figure 2FAANG Data Portal presenting rich ‘omic datasets to the community complete with preconfigured data filters, automated literature scraping, and direct links to data files in underlying archives (https://data.faang.org/dataset).
Figure 3FAANG Data Portal specimen table utilising filters to obtain specimens from Equus caballus females from the liver left lateral lobe (https://data.faang.org/specimen?standard=FAANG&sex=female&organism=Equus%20caballus&organismpart_celltype=liver%20left%20lateral%20lobe).
Figure 4Project-specific subportal views offer the full functionality of the FAANG site pre-filtered to data from a particular consortia (https://data.faang.org/projects/BovReg).
Figure 5Data validation and submission brokering service flag metadata errors and improvements for correction before submission can be made (https://data.faang.org/validation/samples).