| Literature DB >> 25897122 |
Damian Smedley1, Syed Haider2, Steffen Durinck3, Luca Pandini4, Paolo Provero5, James Allen6, Olivier Arnaiz7, Mohammad Hamza Awedh8, Richard Baldock9, Giulia Barbiera4, Philippe Bardou10, Tim Beck11, Andrew Blake12, Merideth Bonierbale13, Anthony J Brookes11, Gabriele Bucci4, Iwan Buetti4, Sarah Burge6, Cédric Cabau10, Joseph W Carlson14, Claude Chelala15, Charalambos Chrysostomou11, Davide Cittaro4, Olivier Collin16, Raul Cordova13, Rosalind J Cutts15, Erik Dassi17, Alex Di Genova18, Anis Djari19, Anthony Esposito20, Heather Estrella20, Eduardo Eyras21, Julio Fernandez-Banet20, Simon Forbes1, Robert C Free11, Takatomo Fujisawa22, Emanuela Gadaleta15, Jose M Garcia-Manteiga4, David Goodstein14, Kristian Gray23, José Afonso Guerra-Assunção15, Bernard Haggarty9, Dong-Jin Han24, Byung Woo Han25, Todd Harris26, Jayson Harshbarger27, Robert K Hastings11, Richard D Hayes14, Claire Hoede19, Shen Hu28, Zhi-Liang Hu29, Lucie Hutchins30, Zhengyan Kan20, Hideya Kawaji31, Aminah Keliet32, Arnaud Kerhornou6, Sunghoon Kim24, Rhoda Kinsella6, Christophe Klopp19, Lei Kong33, Daniel Lawson34, Dejan Lazarevic4, Ji-Hyun Lee35, Thomas Letellier32, Chuan-Yun Li36, Pietro Lio37, Chu-Jun Liu36, Jie Luo6, Alejandro Maass38, Jerome Mariette19, Thomas Maurel6, Stefania Merella4, Azza Mostafa Mohamed39, Francois Moreews10, Ibounyamine Nabihoudine19, Nelson Ndegwa40, Céline Noirot19, Cristian Perez-Llamas41, Michael Primig42, Alessandro Quattrone17, Hadi Quesneville32, Davide Rambaldi4, James Reecy29, Michela Riba4, Steven Rosanoff6, Amna Ali Saddiq43, Elisa Salas13, Olivier Sallou16, Rebecca Shepherd1, Reinhard Simon13, Linda Sperling7, William Spooner44, Daniel M Staines6, Delphine Steinbach32, Kevin Stone30, Elia Stupka4, Jon W Teague1, Abu Z Dayem Ullah15, Jun Wang33, Doreen Ware45, Marie Wong-Erasmus46, Ken Youens-Clark45, Amonida Zadissa6, Shi-Jian Zhang36, Arek Kasprzyk47.
Abstract
The BioMart Community Portal (www.biomart.org) is a community-driven effort to provide a unified interface to biomedical databases that are distributed worldwide. The portal provides access to numerous database projects supported by 30 scientific organizations. It includes over 800 different biological datasets spanning genomics, proteomics, model organisms, cancer data, ontology information and more. All resources available through the portal are independently administered and funded by their host organizations. The BioMart data federation technology provides a unified interface to all the available data. The latest version of the portal comes with many new databases that have been created by our ever-growing community. It also comes with better support and extensibility for data analysis and visualization tools. A new addition to our toolbox, the enrichment analysis tool is now accessible through graphical and web service interface. The BioMart community portal averages over one million requests per day. Building on this level of service and the wealth of information that has become available, the BioMart Community Portal has introduced a new, more scalable and cheaper alternative to the large data stores maintained by specialized organizations.Entities:
Mesh:
Year: 2015 PMID: 25897122 PMCID: PMC4489294 DOI: 10.1093/nar/gkv350
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.BioMart community databases and their host countries.
BioMart community databases and their host organizations
| Database | Description | Host | Reference |
|---|---|---|---|
| Animal Genome databasesa,b | Agriculturally important livestock genomes | Iowa State University, US | NA |
| Atlas of UTR Regulatory Activity (AURA)a | Meta-database centred on mapping post-transcriptional (PTR) interactions of trans-factors with human and mouse untranslated regions (UTRs) of mRNAs | University of Trento, Italy | ( |
| BCCTB Bioinformatics Portala | Portal for mining omics data on breast cancer from published literature and experimental datasets | Breast Cancer Campaign/Barts Cancer Institute UK | ( |
| Cildb | Database for eukaryotic cilia and centriolar structures, integrating orthology relationships for 44 species with high-throughput studies and OMIM | Centre National de la Recherche Scientifique (CNRS), France | ( |
| COSMIC | Somatic mutation information relating to human cancers | Wellcome Trust Sanger Institute (WTSI), UK | ( |
| DAPPERa | Mass spec identified protein interaction networks in | Department of Genetics, University of Cambridge, Cambridge, UK | NA |
| EMAGE | In situ gene expression data in the mouse embryo | Medical Research Council, Human Genetics Unit (MRC HGU), UK | ( |
| Ensembl | Genome databases for vertebrates and other eukaryotic species | Wellcome Trust Sanger Institute (WTSI), UK | ( |
| Ensembl Genomes | Ensembl Fungi, Metazoa, Plants and Protists | European Bioinformatics Institute (EBI), UK | ( |
| Euraexpress | Transcriptome atlas database for mouse embryo | Medical Research Council, Human Genetics Unit (MRC HGU), UK | ( |
| EuroPhenome | Mouse phenotyping data | Harwell Science and Innovation Campus (MRC Harwell), UK | ( |
| FANTOM5a | The FANTOM5 project mapped a promoter level expression atlas in human and mouse. The FANTOM5 BioMart instance provides the set of promoters along with annotation. | RIKEN Center for Life Science Technologies (CLST), Japan | ( |
| GermOnLine | Cross-species microarray expression database focusing on germline development, meiosis, and gametogenesis as well as the mitotic cell cycle | Institut national de la santé et de la recherche médicale (Inserm), France | ( |
| GnpISa | Genetic and Genomic Information System (GnpIS) | Institut Nationale de Recherche Agronomique (INRA), Unité de Recherche en Génomique-Info (URGI), France | ( |
| Gramene | Agriculturally important grass genomes | Cold Spring Harbor Laboratory (CSHL), US | ( |
| GWAS Centrala | GWAS Central provides a comprehensive curated collection of summary level findings from genetic association studies | University of Leicester, UK | ( |
| HapMap | Multi-country effort to identify and catalog genetic similarities and differences in human beings | National Center for Biotechnology Information (NCBI), US | ( |
| HGNC | Repository of human gene nomenclature and associated resources | European Bioinformatics Institute (EBI), UK | ( |
| i-Pharma | PharmDB-K is an integrated bio-pharmacological network databases for TKM (Traditional Korean Medicine) | Information Center for Bio-pharmacological Network (i-Pharm), South Korea | ( |
| InterPro | Integrated database of predictive protein ‘signatures’ used for the classification and automatic annotation of proteins and genomes | European Bioinformatics Institute (EBI), UK | ( |
| KazusaMart | Cyanobase, rhizobia, and plant genome databases | Kazusa DNA Research Institute (Kazusa), Japan | NA |
| MGI | Mouse genome features, locations, alleles, and orthologs | Jackson Laboratory, US | ( |
| Pancreatic Expression Database | Results from published literature | Barts Cancer Institute UK | ( |
| ParameciumDB | Paramecium genome database | Centre National de la Recherche Scientifique (CNRS), France | ( |
| Phytozome | Comparative genomics of green plants | Joint Genome Institute (JGI)/Center for Integrative Genomics (CIG), US | ( |
| Potato Database | Potato and sweetpotato phenotypic and genomic information | International Potato Center (CIP), Peru | NA |
| PRIDE | Repository for protein and peptide identifications | European Bioinformatics Institute (EBI), UK | ( |
| Regulatory Genomics Groupa | Predictive Models of Gene Regulation from High-Throughput Epigenomics Data | Universitat Pompeu Fabra (UPF), Spain | ( |
| Rfama | The Rfam database is a collection of RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models (CMs). | Wellcome Trust Sanger Institute (WTSI), UK | ( |
| RhesusBasea | A knowledgebase for the monkey research community | Peking University, China | ( |
| Rice-Map | Rice (japonica and indica) genome annotation database | Peking University, China | ( |
| SalmonDB | Genomic information for Atlantic salmon, rainbow trout, and related species | Center for Mathematical Modeling and Center for Genome Regulation (CMM), Chile | ( |
| sigReannot | Aquaculture and farm animal species microarray probes re-annotation | INRA - French National Institute of Agricultural Research, France | ( |
| UniProt | Protein sequence and functional information | European Bioinformatics Institute (EBI), UK | ( |
| VectorBase | Genome information for invertebrate vectors of human pathogens | University of Notre Dame, US | ( |
| VEGA | Manual annotation of vertebrate genome sequences | Wellcome Trust Sanger Institute (WTSI), UK | ( |
| WormBase | Cold Spring Harbor Laboratory (CSHL), US | ( |
aDenotes new databases that have become available since last publication (9).
bDenotes new databases that are not yet integrated into the portal.
Figure 2.The network graphic output of the BioMart enrichment tool. The Gene Ontology (GO) enrichment analysis was performed using BED file containing human data. This tool is also accessible through web services (Java version only). The programmatic access complies with a standard BioMart interface: dataset, filter and attribute.
Figure 3.The XML web service query (A) and the corresponding two types of output: tab delimited following setting a processor to ‘TSV’ (B) and JSON following setting processor to ‘JSON’.
Query to display phenotypic consequence for known, pathogenic variants in FGFR2
| Database and dataset | Filters | Attributes |
|---|---|---|
| Ensembl 78 Short Variations | Ensembl Gene ID(s): | Chromosome name |
| (WTSI, UK) | ENSG00000066468 | Chromosome position start (bp) |
| Homo sapiens Short Variation (SNPs and indels) (GRCh38) | SIFT Prediction: deleterious | Chromosome position end (bp) |
| PolyPhen Prediction: probably damaging | Strand | |
| Variant Alleles | ||
| Ensembl Gene ID | ||
| Consequence to transcript | ||
| Associated variation names | ||
| Study External Reference | ||
| Source name | ||
| Associated gene with phenotype | ||
| Phenotype description |