Literature DB >> 25897122

The BioMart community portal: an innovative alternative to large, centralized data repositories.

Damian Smedley1, Syed Haider2, Steffen Durinck3, Luca Pandini4, Paolo Provero5, James Allen6, Olivier Arnaiz7, Mohammad Hamza Awedh8, Richard Baldock9, Giulia Barbiera4, Philippe Bardou10, Tim Beck11, Andrew Blake12, Merideth Bonierbale13, Anthony J Brookes11, Gabriele Bucci4, Iwan Buetti4, Sarah Burge6, Cédric Cabau10, Joseph W Carlson14, Claude Chelala15, Charalambos Chrysostomou11, Davide Cittaro4, Olivier Collin16, Raul Cordova13, Rosalind J Cutts15, Erik Dassi17, Alex Di Genova18, Anis Djari19, Anthony Esposito20, Heather Estrella20, Eduardo Eyras21, Julio Fernandez-Banet20, Simon Forbes1, Robert C Free11, Takatomo Fujisawa22, Emanuela Gadaleta15, Jose M Garcia-Manteiga4, David Goodstein14, Kristian Gray23, José Afonso Guerra-Assunção15, Bernard Haggarty9, Dong-Jin Han24, Byung Woo Han25, Todd Harris26, Jayson Harshbarger27, Robert K Hastings11, Richard D Hayes14, Claire Hoede19, Shen Hu28, Zhi-Liang Hu29, Lucie Hutchins30, Zhengyan Kan20, Hideya Kawaji31, Aminah Keliet32, Arnaud Kerhornou6, Sunghoon Kim24, Rhoda Kinsella6, Christophe Klopp19, Lei Kong33, Daniel Lawson34, Dejan Lazarevic4, Ji-Hyun Lee35, Thomas Letellier32, Chuan-Yun Li36, Pietro Lio37, Chu-Jun Liu36, Jie Luo6, Alejandro Maass38, Jerome Mariette19, Thomas Maurel6, Stefania Merella4, Azza Mostafa Mohamed39, Francois Moreews10, Ibounyamine Nabihoudine19, Nelson Ndegwa40, Céline Noirot19, Cristian Perez-Llamas41, Michael Primig42, Alessandro Quattrone17, Hadi Quesneville32, Davide Rambaldi4, James Reecy29, Michela Riba4, Steven Rosanoff6, Amna Ali Saddiq43, Elisa Salas13, Olivier Sallou16, Rebecca Shepherd1, Reinhard Simon13, Linda Sperling7, William Spooner44, Daniel M Staines6, Delphine Steinbach32, Kevin Stone30, Elia Stupka4, Jon W Teague1, Abu Z Dayem Ullah15, Jun Wang33, Doreen Ware45, Marie Wong-Erasmus46, Ken Youens-Clark45, Amonida Zadissa6, Shi-Jian Zhang36, Arek Kasprzyk47.   

Abstract

The BioMart Community Portal (www.biomart.org) is a community-driven effort to provide a unified interface to biomedical databases that are distributed worldwide. The portal provides access to numerous database projects supported by 30 scientific organizations. It includes over 800 different biological datasets spanning genomics, proteomics, model organisms, cancer data, ontology information and more. All resources available through the portal are independently administered and funded by their host organizations. The BioMart data federation technology provides a unified interface to all the available data. The latest version of the portal comes with many new databases that have been created by our ever-growing community. It also comes with better support and extensibility for data analysis and visualization tools. A new addition to our toolbox, the enrichment analysis tool is now accessible through graphical and web service interface. The BioMart community portal averages over one million requests per day. Building on this level of service and the wealth of information that has become available, the BioMart Community Portal has introduced a new, more scalable and cheaper alternative to the large data stores maintained by specialized organizations.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2015        PMID: 25897122      PMCID: PMC4489294          DOI: 10.1093/nar/gkv350

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The methods of data generation and processing that are utilized in biomedical sciences have radically changed in recent years. With the advancement of new high-throughput technologies, data have grown in terms of quantity as well as complexity. However, the significance of the information that is hidden in the newly generated experimental data can only be deciphered by linking it to other types of biological data that have been accumulated previously. As a result there are already numerous bioinformatics resources and new ones are constantly being created. Typically, each resource comes with its own query interface. This poses a problem for the scientists who want to utilize such resources in their research. Even the simplest task such as compiling results from a few existing resources is challenging due to the lack of a complete, up to date catalogue of already existing resources and the necessity of constantly learning how to navigate new query interfaces. A different challenge is faced by collaborating groups of scientists who independently generate or maintain their own data. Such collaborations are seriously hampered by the lack of a simple data management solution that would make it possible to connect their disparate, geographically distributed data sources and present them in a uniform way to other scientists. The BioMart project has been set up to address these challenges.

SOFTWARE

BioMart is an open source data management system, which is based on a data federation model (1). Under this model, each data source is managed, updated and released independently by their host organization while the BioMart software provides a unified view of these sources that are distributed worldwide. The data sources are presented to the user through a unified set of graphical and programmatic interfaces so that they appear to be a single integrated database. To navigate this database and compile a query the user does not have to learn the underlying structure of each data source but instead use a set of simple abstractions: datasets, filters and attributes. Once a user's input is provided, the software distributes parts of the query to individual data sources, collects the data and presents the user with the unified result set. The BioMart software is data agnostic and its applications are not limited to biological data. It is cross-platform and supports many popular relational database managements systems, including MySQL, Oracle, PostgreSQL. It also supports many third party packages such as Taverna (2), Galaxy (3), Cytoscape (4) and biomaRt (5), which part of the Bioconductor (6) library. The BioMart project currently maintains two independent code bases: one written in Java and one written in Perl. For more information about the architecture and capabilities of each of the packages please refer to previous publications (1,7). The latest version of the Java based BioMart software has been significantly enhanced with new additions to the existing collection of graphical user interfaces (GUIs). It has also been re-engineered to provide better support and extensibility for data analysis and visualization tools. The first of the BioMart tools based on this new framework has already been implemented and is accessible from the BioMart Community Portal. The BioMart project adheres to the open source philosophy that promotes collaboration and code reuse. Two good examples of how this philosophy benefits the scientific community are provided by two independent research groups. The INRA group based in Toulouse, France has recently released a software package called RNAbrowse (RNA-Seq De Novo Assembly Results Browser) (8). The Pfizer group based in La Jolla, USA has just announced the release of OASIS: A Web-based Platform for Exploratory Analysis of Cancer Genome and Transcriptome data (www.oasis-genomics.org). Both of these software packages are based on the BioMart software.

DATA

The BioMart community consists of a wide spectrum of different research groups that use the BioMart technology to provide access to their databases. It currently comprises 30 scientific organizations supporting 38 database projects that contain over 800 different biological datasets spanning genomics, proteomics, model organisms, cancer data, ontology information and more. The BioMart community is constantly growing and since the last publication (9), 11 new database projects have become available. As new BioMart databases become available locally they also become gradually integrated into the BioMart Community Portal. The main function of the portal is to provide a convenient single point of access to all available data that is distributed worldwide (Figure 1). All BioMart databases that are included in the portal are independently administered and funded. Table 1 provides a detailed list of all BioMart community resources as of March 2015.
Figure 1.

BioMart community databases and their host countries.

Table 1.

BioMart community databases and their host organizations

DatabaseDescriptionHostReference
Animal Genome databasesa,bAgriculturally important livestock genomesIowa State University, USNA
Atlas of UTR Regulatory Activity (AURA)aMeta-database centred on mapping post-transcriptional (PTR) interactions of trans-factors with human and mouse untranslated regions (UTRs) of mRNAsUniversity of Trento, Italy(36)
BCCTB Bioinformatics PortalaPortal for mining omics data on breast cancer from published literature and experimental datasetsBreast Cancer Campaign/Barts Cancer Institute UK(37)
CildbDatabase for eukaryotic cilia and centriolar structures, integrating orthology relationships for 44 species with high-throughput studies and OMIMCentre National de la Recherche Scientifique (CNRS), France(38)
COSMICSomatic mutation information relating to human cancersWellcome Trust Sanger Institute (WTSI), UK(39)
DAPPERaMass spec identified protein interaction networks in Drosophila cell cycle regulationDepartment of Genetics, University of Cambridge, Cambridge, UKNA
EMAGEIn situ gene expression data in the mouse embryoMedical Research Council, Human Genetics Unit (MRC HGU), UK(40)
EnsemblGenome databases for vertebrates and other eukaryotic speciesWellcome Trust Sanger Institute (WTSI), UK(41)
Ensembl GenomesEnsembl Fungi, Metazoa, Plants and ProtistsEuropean Bioinformatics Institute (EBI), UK(41)
EuraexpressTranscriptome atlas database for mouse embryoMedical Research Council, Human Genetics Unit (MRC HGU), UK(42)
EuroPhenomeMouse phenotyping dataHarwell Science and Innovation Campus (MRC Harwell), UK(15)
FANTOM5aThe FANTOM5 project mapped a promoter level expression atlas in human and mouse. The FANTOM5 BioMart instance provides the set of promoters along with annotation.RIKEN Center for Life Science Technologies (CLST), Japan(16)
GermOnLineCross-species microarray expression database focusing on germline development, meiosis, and gametogenesis as well as the mitotic cell cycleInstitut national de la santé et de la recherche médicale (Inserm), France(17)
GnpISaGenetic and Genomic Information System (GnpIS)Institut Nationale de Recherche Agronomique (INRA), Unité de Recherche en Génomique-Info (URGI), France(18)
GrameneAgriculturally important grass genomesCold Spring Harbor Laboratory (CSHL), US(43)
GWAS CentralaGWAS Central provides a comprehensive curated collection of summary level findings from genetic association studiesUniversity of Leicester, UK(19)
HapMapMulti-country effort to identify and catalog genetic similarities and differences in human beingsNational Center for Biotechnology Information (NCBI), US(20)
HGNCRepository of human gene nomenclature and associated resourcesEuropean Bioinformatics Institute (EBI), UK(21)
i-PharmaPharmDB-K is an integrated bio-pharmacological network databases for TKM (Traditional Korean Medicine)Information Center for Bio-pharmacological Network (i-Pharm), South Korea(22)
InterProIntegrated database of predictive protein ‘signatures’ used for the classification and automatic annotation of proteins and genomesEuropean Bioinformatics Institute (EBI), UK(44)
KazusaMartCyanobase, rhizobia, and plant genome databasesKazusa DNA Research Institute (Kazusa), JapanNA
MGIMouse genome features, locations, alleles, and orthologsJackson Laboratory, US(23)
Pancreatic Expression DatabaseResults from published literatureBarts Cancer Institute UK(24)
ParameciumDBParamecium genome databaseCentre National de la Recherche Scientifique (CNRS), France(25)
PhytozomeComparative genomics of green plantsJoint Genome Institute (JGI)/Center for Integrative Genomics (CIG), US(26)
Potato DatabasePotato and sweetpotato phenotypic and genomic informationInternational Potato Center (CIP), PeruNA
PRIDERepository for protein and peptide identificationsEuropean Bioinformatics Institute (EBI), UK(45)
Regulatory Genomics GroupaPredictive Models of Gene Regulation from High-Throughput Epigenomics DataUniversitat Pompeu Fabra (UPF), Spain(27)
RfamaThe Rfam database is a collection of RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models (CMs).Wellcome Trust Sanger Institute (WTSI), UK(28)
RhesusBaseaA knowledgebase for the monkey research communityPeking University, China(29)
Rice-MapRice (japonica and indica) genome annotation databasePeking University, China(30)
SalmonDBGenomic information for Atlantic salmon, rainbow trout, and related speciesCenter for Mathematical Modeling and Center for Genome Regulation (CMM), Chile(31)
sigReannotAquaculture and farm animal species microarray probes re-annotationINRA - French National Institute of Agricultural Research, France(46)
UniProtProtein sequence and functional informationEuropean Bioinformatics Institute (EBI), UK(32)
VectorBaseGenome information for invertebrate vectors of human pathogensUniversity of Notre Dame, US(33)
VEGAManual annotation of vertebrate genome sequencesWellcome Trust Sanger Institute (WTSI), UK(34)
WormBaseC. elegans and related nematode genomic informationCold Spring Harbor Laboratory (CSHL), US(35)

aDenotes new databases that have become available since last publication (9).

bDenotes new databases that are not yet integrated into the portal.

BioMart community databases and their host countries. aDenotes new databases that have become available since last publication (9). bDenotes new databases that are not yet integrated into the portal.

PORTAL

The current version of the BioMart Community Portal operates two different instances of the web server: one implemented in Perl and the other in Java. Both servers support complex database searches and although they use different types of GUIs, they share the same navigation and query compilation logic based on selection of datasets, filters and attributes (9,10). The Java version of the portal also includes a section for specialized tools, which consists of the following: Sequence retrieval, ID Converter and Enrichment Analysis. Sequence retrieval allows easy querying of sequences while the ID Converter tool allows users to enter or upload a list of identifiers in any format (currently supported by Ensembl), and retrieve the same list converted to any other supported format. The enrichment tool supports enrichment analysis of genes in all species included in the current Ensembl release. For each of those species a broad range of gene identifiers is available. Furthermore, the tool supports cross species analysis using Ensembl homology data. For instance, it is possible to perform a one step enrichment analysis against a human disease dataset using experimental data from any of the species for which human homology data is available. Finally, the enrichment tool facilitates analysis of BED files containing genomic features such as Copy Number Variations or Differentially Methylated Regions. The output is provided in tabular and network graphic format (Figure 2).
Figure 2.

The network graphic output of the BioMart enrichment tool. The Gene Ontology (GO) enrichment analysis was performed using BED file containing human data. This tool is also accessible through web services (Java version only). The programmatic access complies with a standard BioMart interface: dataset, filter and attribute.

The network graphic output of the BioMart enrichment tool. The Gene Ontology (GO) enrichment analysis was performed using BED file containing human data. This tool is also accessible through web services (Java version only). The programmatic access complies with a standard BioMart interface: dataset, filter and attribute.

WEB SERVICE

The BioMart Community Portal handles queries from several interfaces such as: PERL API Java API Web interfaces URL based access RESTful web service SPARQL For more detailed description of all the interfaces please refer to earlier publications (1,7). In the section below we provide a description and compare the REST-based web service, which is implemented in Perl and its counterpart, which is implemented in Java. It is worth noting that the web service maintains the same query interface both in Perl and Java implementations. For example, the web service query (Figure 3A) can be run against java-based server as follows:
Figure 3.

The XML web service query (A) and the corresponding two types of output: tab delimited following setting a processor to ‘TSV’ (B) and JSON following setting processor to ‘JSON’.

curl –data-urlencode query@query.xml http://central.biomart.org/martservice/results or its Perl-based counter-part as below curl –data-urlencode query@query.xml http://www.biomart.org/biomart/martservice The XML web service query (A) and the corresponding two types of output: tab delimited following setting a processor to ‘TSV’ (B) and JSON following setting processor to ‘JSON’. By default, query sets the attribute processor to ‘TSV’ requesting tab-delimited results (Figure 3B). Alternatively, by setting processor to ‘JSON’, would return JSON formatted results (Figure 3C), which are readily consumable by third-party web-based clients saving overhead of parsing and format translations. Please note that JSON format is only available in the java version. A simple way to compile a web service query for later programmatic use is to use one of the web GUIs and generate the query XML using REST/SOAP button. After following the steps outlined by the GUI and clicking the ‘results’ button, the user needs to click the REST/SOAP button, save the query and run it as described above. Alternatively a user can take advantage of the programmatic access to all the metadata defining marts, datasets, filters and attributes. The access to the metadata served by the Java and Perl BioMart servers is provided using the following webservice requests: Java (central.biomart.org) registry information: http://central.biomart.org/martservice/portal available marts: http://central.biomart.org/martservice/marts datasets available for a config: http://central.biomart.org/martservice/datasets?config=snp_config attributes available for a dataset: http://central.biomart.org/martservice/attributes?datasets=btaurus_snp&config=snp_config filters available for a dataset: http://central.biomart.org/martservice/filters?datasets=btaurus_snp&config=snp_config Perl (www.biomart.org) registry information: http://www.biomart.org/biomart/martservice?type=registry datasets available for a mart: http://www.biomart.org/biomart/martservice?type=datasets&mart=ensembl attributes available for a dataset: http://www.biomart.org/biomart/martservice?type=attributes&dataset=oanatinus_gene_ensembl filters available for a dataset: http://www.biomart.org/biomart/martservice?type=filters&dataset=oanatinus_gene_ensembl configuration for a dataset: http://www.biomart.org/biomart/martservice?type=configuration&dataset=oanatinus_gene_ensembl Please note that the granularity between mart and dataset has been improved in the Java version through the introduction of multiple dataset configs. This facilitates the end-users to browse various views of the same dataset, which are presented through the portal either using a different GUI or subsets of data.

QUERY EXAMPLES

Given the coverage of the current BioMart datatsets, many relevant biological questions can be answered. For example, a researcher who has detected potentially pathogenic variants in FGFR2 (ENSG00000066468) from exome sequencing patients may be interested if the same variants have been previously described and if they were associated with the same or similar diseases. To answer this, integrated data from Ensembl can be queried as shown in Table 2 to display all known variants annotated within FGFR2 that are predicted as pathogenic by SIFT (11) and Polyphen (12). The genomic position outputs can be compared to the researcher's variants and the phenotype data used to assess candidacy for their cases. For example, the first batch of results shows a C->G variant at position 121520160 on chromosome 10 that is associated with Apert syndrome (OMIM:176943).
Table 2.

Query to display phenotypic consequence for known, pathogenic variants in FGFR2

Database and datasetFiltersAttributes
Ensembl 78 Short VariationsEnsembl Gene ID(s):Chromosome name
(WTSI, UK)ENSG00000066468Chromosome position start (bp)
Homo sapiens Short Variation (SNPs and indels) (GRCh38)SIFT Prediction: deleteriousChromosome position end (bp)
PolyPhen Prediction: probably damagingStrand
Variant Alleles
Ensembl Gene ID
Consequence to transcript
Associated variation names
Study External Reference
Source name
Associated gene with phenotype
Phenotype description
Another common use case that BioMart is used for is to analyse a list of genes to establish whether they are associated with particular protein functions, pathways or diseases more often than would be expected by chance (enrichment analysis). For example, a researcher may have discovered that AURKA, AURKB, AURKC, PLK1, CDK1 and CDK4 are differentially expressed in their experiment and used BioMart's enrichment tool with its default settings to analyse these genes. The results show that these genes are enriched for involvement in the cell cycle, kinase activity and mitotic nuclear division amongst others. Many other real usage examples are documented in our previous paper (10) and the BioMart special issue in Database: the journal of biological databases and biocuration (www.oxfordjournals.org/our_journals/databa/biomart_virtual_issue.html).

CONCLUSIONS

Since its conception as a data-mining interface for the Human Genome Project (13) BioMart has rapidly grown to become an international collaboration involving a large number of different groups and organizations both in academia and in industry (14). It has been successfully applied to many different types of data including genomics, proteomics, model organisms, cancer data, etc., proving that its generic data model is widely applicable (15–53). BioMart has also provided a first successful solution for the unprecedented data management needs of the International Cancer Genome Consortium proving that the federated model scales well with the amounts of data generated by Next Generation Sequencing (48). There are a number of important factors that contributed to the BioMart's success and its adoption by many different types of projects around the world as their data management platform. BioMart's ability to quickly deploy a website hosting any type of data, user-friendly GUI, several programmatic interfaces and support for third party tools has proved to be an attractive solution for data managers who were in need of a rapid and reliable solution for their user community. BioMart has also proven to be a platform of choice for many smaller organizations that lack the necessary resources to embark on the development of their own data management solution. As a result, more and more database projects have become accessible through the BioMart interface. The arrival of these new resources coupled with the data federation technology provided by the BioMart software has galvanized the creation of the BioMart Community Portal. The federated model has proven to be very cost-effective since all development and maintenance of individual databases is left to the individual data providers. It also has proven to be very scalable as the internet and database traffic is handled by the local BioMart servers. As a result the BioMart Community Portal service has grown impressively not only in terms of available data but also the level of service. The BioMart community portal now averages over million requests per our services per day. Building on this level of service and the wealth of information that has become accessible through the BioMart interface, the BioMart Community Portal has effectively introduced a new, more scalable and much more cost-effective alternative to the large data stores maintained by specialized organizations.
  53 in total

1.  The HUGO Gene Nomenclature Committee (HGNC).

Authors:  S Povey; R Lovering; E Bruford; M Wright; M Lush; H Wain
Journal:  Hum Genet       Date:  2001-10-24       Impact factor: 4.132

2.  The International HapMap Project.

Authors: 
Journal:  Nature       Date:  2003-12-18       Impact factor: 49.962

3.  EnsMart: a generic system for fast and flexible access to biological data.

Authors:  Arek Kasprzyk; Damian Keefe; Damian Smedley; Darin London; William Spooner; Craig Melsopp; Martin Hammond; Philippe Rocca-Serra; Tony Cox; Ewan Birney
Journal:  Genome Res       Date:  2004-01       Impact factor: 9.043

4.  SIFT: Predicting amino acid changes that affect protein function.

Authors:  Pauline C Ng; Steven Henikoff
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

5.  Galaxy: a platform for interactive large-scale genome analysis.

Authors:  Belinda Giardine; Cathy Riemer; Ross C Hardison; Richard Burhans; Laura Elnitski; Prachi Shah; Yi Zhang; Daniel Blankenberg; Istvan Albert; James Taylor; Webb Miller; W James Kent; Anton Nekrutenko
Journal:  Genome Res       Date:  2005-09-16       Impact factor: 9.043

6.  BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis.

Authors:  Steffen Durinck; Yves Moreau; Arek Kasprzyk; Sean Davis; Bart De Moor; Alvis Brazma; Wolfgang Huber
Journal:  Bioinformatics       Date:  2005-08-15       Impact factor: 6.937

Review 7.  Bioconductor: an open source framework for bioinformatics and computational biology.

Authors:  Mark Reimers; Vincent J Carey
Journal:  Methods Enzymol       Date:  2006       Impact factor: 1.600

8.  Integration of biological networks and gene expression data using Cytoscape.

Authors:  Melissa S Cline; Michael Smoot; Ethan Cerami; Allan Kuchinsky; Nerius Landys; Chris Workman; Rowan Christmas; Iliana Avila-Campilo; Michael Creech; Benjamin Gross; Kristina Hanspers; Ruth Isserlin; Ryan Kelley; Sarah Killcoyne; Samad Lotia; Steven Maere; John Morris; Keiichiro Ono; Vuk Pavlovic; Alexander R Pico; Aditya Vailaya; Peng-Liang Wang; Annette Adler; Bruce R Conklin; Leroy Hood; Martin Kuiper; Chris Sander; Ilya Schmulevich; Benno Schwikowski; Guy J Warner; Trey Ideker; Gary D Bader
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

9.  Taverna: a tool for building and running workflows of services.

Authors:  Duncan Hull; Katy Wolstencroft; Robert Stevens; Carole Goble; Mathew R Pocock; Peter Li; Tom Oinn
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

10.  BioMart--biological queries made easy.

Authors:  Damian Smedley; Syed Haider; Benoit Ballester; Richard Holland; Darin London; Gudmundur Thorisson; Arek Kasprzyk
Journal:  BMC Genomics       Date:  2009-01-14       Impact factor: 3.969

View more
  328 in total

Review 1.  Accelerating Adverse Outcome Pathway Development Using Publicly Available Data Sources.

Authors:  Noffisat O Oki; Mark D Nelms; Shannon M Bell; Holly M Mortensen; Stephen W Edwards
Journal:  Curr Environ Health Rep       Date:  2016-03

Review 2.  Protein Bioinformatics Databases and Resources.

Authors:  Chuming Chen; Hongzhan Huang; Cathy H Wu
Journal:  Methods Mol Biol       Date:  2017

3.  Trifunctional lipid probes for comprehensive studies of single lipid species in living cells.

Authors:  Doris Höglinger; André Nadler; Per Haberkant; Joanna Kirkpatrick; Martina Schifferer; Frank Stein; Sebastian Hauke; Forbes D Porter; Carsten Schultz
Journal:  Proc Natl Acad Sci U S A       Date:  2017-02-02       Impact factor: 11.205

4.  A characterization of cis- and trans-heritability of RNA-Seq-based gene expression.

Authors:  Klaasjan G Ouwens; Rick Jansen; Michel G Nivard; Jenny van Dongen; Maia J Frieser; Jouke-Jan Hottenga; Wibowo Arindrarto; Annique Claringbould; Maarten van Iterson; Hailiang Mei; Lude Franke; Bastiaan T Heijmans; Peter A C 't Hoen; Joyce van Meurs; Andrew I Brooks; Brenda W J H Penninx; Dorret I Boomsma
Journal:  Eur J Hum Genet       Date:  2019-09-26       Impact factor: 4.246

5.  BAYESIAN LARGE-SCALE MULTIPLE REGRESSION WITH SUMMARY STATISTICS FROM GENOME-WIDE ASSOCIATION STUDIES.

Authors:  Xiang Zhu; Matthew Stephens
Journal:  Ann Appl Stat       Date:  2017-10-05       Impact factor: 2.083

6.  Biological Interpretation of Complex Genomic Data.

Authors:  Kathleen M Fisch
Journal:  Methods Mol Biol       Date:  2019

7.  Rotator cuff tear state modulates self-renewal and differentiation capacity of human skeletal muscle progenitor cells.

Authors:  Kelsey A Thomas; Michael C Gibbons; John G Lane; Anshuman Singh; Samuel R Ward; Adam J Engler
Journal:  J Orthop Res       Date:  2016-10-16       Impact factor: 3.494

8.  A Genome-Wide Association Study to Identify Single-Nucleotide Polymorphisms for Acute Kidney Injury.

Authors:  Bixiao Zhao; Qiongshi Lu; Yuwei Cheng; Justin M Belcher; Edward D Siew; David E Leaf; Simon C Body; Amanda A Fox; Sushrut S Waikar; Charles D Collard; Heather Thiessen-Philbrook; T Alp Ikizler; Lorraine B Ware; Charles L Edelstein; Amit X Garg; Murim Choi; Jennifer A Schaub; Hongyu Zhao; Richard P Lifton; Chirag R Parikh
Journal:  Am J Respir Crit Care Med       Date:  2017-02-15       Impact factor: 21.405

9.  Phenotypic Screening of Chemical Libraries Enriched by Molecular Docking to Multiple Targets Selected from Glioblastoma Genomic Data.

Authors:  David Xu; Donghui Zhou; Khuchtumur Bum-Erdene; Barbara J Bailey; Kamakshi Sishtla; Sheng Liu; Jun Wan; Uma K Aryal; Jonathan A Lee; Clark D Wells; Melissa L Fishel; Timothy W Corson; Karen E Pollok; Samy O Meroueh
Journal:  ACS Chem Biol       Date:  2020-05-21       Impact factor: 5.100

10.  Cross-validation of existing signatures and derivation of a novel 29-gene transcriptomic signature predictive of progression to TB in a Brazilian cohort of household contacts of pulmonary TB.

Authors:  Samantha Leong; Yue Zhao; Rodrigo Ribeiro-Rodrigues; Edward C Jones-López; Carlos Acuña-Villaorduña; Patricia Marques Rodrigues; Moises Palaci; David Alland; Reynaldo Dietze; Jerrold J Ellner; W Evan Johnson; Padmini Salgame
Journal:  Tuberculosis (Edinb)       Date:  2020-01-07       Impact factor: 3.131

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.