Literature DB >> 22308096

xQTL workbench: a scalable web environment for multi-level QTL analysis.

Danny Arends1, K Joeri van der Velde, Pjotr Prins, Karl W Broman, Steffen Möller, Ritsert C Jansen, Morris A Swertz.   

Abstract

SUMMARY: xQTL workbench is a scalable web platform for the mapping of quantitative trait loci (QTLs) at multiple levels: for example gene expression (eQTL), protein abundance (pQTL), metabolite abundance (mQTL) and phenotype (phQTL) data. Popular QTL mapping methods for model organism and human populations are accessible via the web user interface. Large calculations scale easily on to multi-core computers, clusters and Cloud. All data involved can be uploaded and queried online: markers, genotypes, microarrays, NGS, LC-MS, GC-MS, NMR, etc. When new data types come available, xQTL workbench is quickly customized using the Molgenis software generator. AVAILABILITY: xQTL workbench runs on all common platforms, including Linux, Mac OS X and Windows. An online demo system, installation guide, tutorials, software and source code are available under the LGPL3 license from http://www.xqtl.org. CONTACT: m.a.swertz@rug.nl.

Entities:  

Mesh:

Year:  2012        PMID: 22308096      PMCID: PMC3315722          DOI: 10.1093/bioinformatics/bts049

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Modern high-throughput technologies generate large amounts of genomic, transcriptomic, proteomic and metabolomic data. However, existing open source web-based tools for QTL analysis, such as webQTL (Wang ) and QTLNetwork (Yang ), are not easily extendable to different settings and computationally scalable for whole genome analyses. xQTL workbench makes it easy to analyse large and complex datasets using state-of-the-art QTL mapping tools and to apply these methods to millions of phenotypes using parallelized ‘Big Data’ solutions (Trelles ). xQTL workbench also supports storing of raw, intermediate and final result data, and analysis protocols and history for reproducibility and data provenance. Use of Molgenis (Swertz ) helps to customize the software. All is conveniently accessible via standard Internet browsers on Windows, Linux or Mac (and using Java, R for the server).

2 FEATURES

xQTL workbench provides visualization of QTL profiles, single and multiple QTL mapping methods, easy addition of new QTL analyses, scalable data management and analysis tracking.

2.1 Explore QTL profiles

Through the web interface, users can explore mapped QTLs, and underlying information by viewing QTL plots in an interactive scrollable and zoomable window. xQTL workbench has support for other common image formats, such as PNG, JPG, SVG and embedded postscript; useful for publishing scientific results online, and on paper. From the output, main database identifiers, such as gene, protein and/or metabolite identifiers are automatically linked-out to matching external web pages of public database such as NCBI, KEGG and Wormbase.

2.2 Single and multiple QTL mapping

xQTL workbench wraps R/qtl (Arends ; Broman ) in a web-based analysis framework offering all important QTL mapping routines, such as the EM algorithm, imputation, Haley-Knott regression, the extended Haley-Knott method, marker regression and Multiple QTL mapping. In addition, xQTL workbench includes R/qtlbim, a library that provides a Bayesian model selection approach for mapping multiple interacting QTL (Yandell ) and Plink, a library for association QTL mapping on single nucleotide polymorphisms (SNP) in natural populations (Purcell ).

2.3 Add new analysis tools

xQTL workbench supports flexible adding of more QTL analysis software: any R-based, or command-line tool, can be plugged in. All analysis results are uploaded, stored and tracked in the xQTL workbench database through an R-API. When new tools are added, they can build on the high-level multi-core computer, cluster and Cloud management functions, based on TORQUE/OpenPBS and BioNode (Prins ). xQTL workbench can be made part of a larger analysis pipeline using interfaces to R, Excel, REST and SOAP web services and Galaxy (Goecks ). Screenshot of xQTL workbench with all features enabled; (1) import phenotype, genotype and genetic map data, examples are given per import type; (2) search through the whole database, explore and browse your data using molgenis generated web-interfaces; (3) run R/qtl QTL mapping, the general plugin allows users to perform not only QTL mapping but also other analyze; (4) use default (or custom) plugins to explore results (e.g. Heatmaps, QTL profiles); (5) add new tools to the workbench (for Bio informaticians); (6) user management and access control of the system (Only for admins); (7) expert settings can be altered in the admin tab (Only for admins); (8) connect/share data using generated API's to R statistics, REST/JSON, SOAP.

2.4 Track analysis and monitor performance

When a new analysis protocol or R script is defined, this protocol can easily be applied to new data. Also, xQTL workbench keeps track of history. Re-use of analysis protocols can be done in an automated fashion. Previous analyses can be rerun without resetting parameters. xQTL workbench provides an online overview of past analyses e.g. which analyses were performed, by who, when and display settings applied.

2.5 Scalable data management

xQTL workbench has a consistency checking database based on XGAP specification (Swertz ), user interfaces to manage and query genotype and phenotype datasets and support for various database back-ends including HSQL (standalone) and MySQL. Phenotype, genotype and genetic map data can be imported as text (TXT), comma separated (CSV) and Excel files. xQTL workbench handles and stores large data in a new and efficient binary edition of the XGAP format, named XGAPbin (extension .xbin), documented online. Such binary formats are essential when handling, storing and transporting multi-Gigabyte datasets.

2.6 Customizable to research needs

Additional modules for new data modalities can be added using Molgenis software generator (Swertz ). The ‘look and feel’ of xQTL workbench is adaptable to institute or consortium style by changing a simple template, which is described in the xQTL workbench documentation enabling seamless integration into an existing website or intranet site, such as recently for EU-PANACEA model organism project and LifeLines biobank.

3 IMPLEMENTATION

We built xQTL workbench on top of Molgenis (Swertz ), a Java-based software to generate tailored research infrastructure on-demand (Swertz and Jansen, 2007). From a single ‘blueprint’ describing the whole system, Molgenis auto-generates a full application including user interface, database infrastructure, application programming interfaces in R, REST and SOAP (APIs). Molgenis' flexibility and robustness is proven by the wide range of research projects, e.g. the Nordic GWAS Control database (Leu ), EB mutation database (van den Akker ) and the Animal observation database (Swertz ). For data storage, the eXtensible Genotype and Phenotype (XGAP) data model was adopted (Swertz ) and extended for big data. To support the increased demand for computational resources for included mapping routines, we added high-level cluster and cloud management functions for computation. The scalable QTL mapping routines of xQTL workbench are written in R and C. The choice of R ties in with the general practice of using R for QTL mapping. The user interface includes direct access to the R interpreter. Both xQTL workbench and Molgenis are open-source software, and source code is transparently stored and tracked in online source control repositories.

4 CONCLUSION

xQTL workbench provides a total solution for web-based analysis: major QTL mapping routines are integrated for use by experienced and inexperienced users. Researchers can upload raw data, run analyses, explore mapped QTL and underlying information, and link-out to important databases. New algorithms can be flexibly added, immediately available to all users. Large analyses can be easily executed on a cluster or in the Cloud. Future work include visualizations and search options to explore the results. We also had an EU-SYSGENET workshop that envisioned further integration of xQTL with analysis tools like HAPPY, databases like GeneNetwork, and the workflow manager TIQS (Durrant ).
  15 in total

1.  R/qtl: QTL mapping in experimental crosses.

Authors:  Karl W Broman; Hao Wu; Saunak Sen; Gary A Churchill
Journal:  Bioinformatics       Date:  2003-05-01       Impact factor: 6.937

2.  Molecular Genetics Information System (MOLGENIS): alternatives in developing local experimental genomics databases.

Authors:  Morris A Swertz; E O De Brock; Sacha A F T Van Hijum; Anne De Jong; Girbe Buist; Richard J S Baerends; Jan Kok; Oscar P Kuipers; Ritsert C Jansen
Journal:  Bioinformatics       Date:  2004-04-01       Impact factor: 6.937

3.  R/qtl: high-throughput multiple QTL mapping.

Authors:  Danny Arends; Pjotr Prins; Ritsert C Jansen; Karl W Broman
Journal:  Bioinformatics       Date:  2010-10-21       Impact factor: 6.937

4.  R/qtlbim: QTL with Bayesian Interval Mapping in experimental crosses.

Authors:  Brian S Yandell; Tapan Mehta; Samprit Banerjee; Daniel Shriner; Ramprasad Venkataraman; Jee Young Moon; W Whipple Neely; Hao Wu; Randy von Smith; Nengjun Yi
Journal:  Bioinformatics       Date:  2007-01-19       Impact factor: 6.937

Review 5.  Beyond standardization: dynamic software infrastructures for systems biology.

Authors:  Morris A Swertz; Ritsert C Jansen
Journal:  Nat Rev Genet       Date:  2007-02-13       Impact factor: 53.242

6.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

7.  QTLNetwork: mapping and visualizing genetic architecture of complex traits in experimental populations.

Authors:  Jian Yang; Chengcheng Hu; Han Hu; Rongdong Yu; Zhen Xia; Xiuzi Ye; Jun Zhu
Journal:  Bioinformatics       Date:  2008-01-17       Impact factor: 6.937

8.  XGAP: a uniform and extensible data model and software platform for genotype and phenotype experiments.

Authors:  Morris A Swertz; K Joeri van der Velde; Bruno M Tesson; Richard A Scheltema; Danny Arends; Gonzalo Vera; Rudi Alberts; Martijn Dijkstra; Paul Schofield; Klaus Schughart; John M Hancock; Damian Smedley; Katy Wolstencroft; Carole Goble; Engbert O de Brock; Andrew R Jones; Helen E Parkinson; Ritsert C Jansen
Journal:  Genome Biol       Date:  2010-03-09       Impact factor: 13.583

9.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences.

Authors:  Jeremy Goecks; Anton Nekrutenko; James Taylor
Journal:  Genome Biol       Date:  2010-08-25       Impact factor: 13.583

Review 10.  Bioinformatics tools and database resources for systems genetics analysis in mice--a short review and an evaluation of future needs.

Authors:  Caroline Durrant; Morris A Swertz; Rudi Alberts; Danny Arends; Steffen Möller; Richard Mott; Pjotr Prins; K Joeri van der Velde; Ritsert C Jansen; Klaus Schughart
Journal:  Brief Bioinform       Date:  2011-07-08       Impact factor: 11.622

View more
  9 in total

1.  HEFT: eQTL analysis of many thousands of expressed genes while simultaneously controlling for hidden factors.

Authors:  Chuan Gao; Nicole L Tignor; Jacqueline Salit; Yael Strulovici-Barel; Neil R Hackett; Ronald G Crystal; Jason G Mezey
Journal:  Bioinformatics       Date:  2013-12-04       Impact factor: 6.937

2.  A preliminary study on early postmortem submersion interval (PMSI) estimation and cause-of-death discrimination based on nontargeted metabolomics and machine learning algorithms.

Authors:  Fu-Yuan Zhang; Lin-Lin Wang; Wen-Wen Dong; Miao Zhang; Dilichati Tash; Xin-Jie Li; Shu-Kui Du; Hao-Miao Yuan; Rui Zhao; Da-Wei Guan
Journal:  Int J Legal Med       Date:  2022-01-31       Impact factor: 2.686

3.  Worm variation made accessible: Take your shopping cart to store, link, and investigate!

Authors:  L Basten Snoek; K Joeri van der Velde; Yang Li; Ritsert C Jansen; Morris A Swertz; Jan E Kammenga
Journal:  Worm       Date:  2014-03-06

4.  Pheno2Geno - High-throughput generation of genetic markers and maps from molecular phenotypes for crosses between inbred strains.

Authors:  Konrad Zych; Yang Li; Joeri K van der Velde; Ronny V L Joosen; Wilco Ligterink; Ritsert C Jansen; Danny Arends
Journal:  BMC Bioinformatics       Date:  2015-02-19       Impact factor: 3.169

5.  Proceedings of the 14th Annual UT-KBRIN Bioinformatics Summit 2015.

Authors: 
Journal:  BMC Bioinformatics       Date:  2015-10-23       Impact factor: 3.169

6.  Similarities between plant traits based on their connection to underlying gene functions.

Authors:  Jan-Peter Nap; Gabino F Sanchez-Perez; Aalt D J van Dijk
Journal:  PLoS One       Date:  2017-08-10       Impact factor: 3.240

7.  reGenotyper: Detecting mislabeled samples in genetic data.

Authors:  Konrad Zych; Basten L Snoek; Mark Elvin; Miriam Rodriguez; K Joeri Van der Velde; Danny Arends; Harm-Jan Westra; Morris A Swertz; Gino Poulin; Jan E Kammenga; Rainer Breitling; Ritsert C Jansen; Yang Li
Journal:  PLoS One       Date:  2017-02-13       Impact factor: 3.240

8.  WormQTL--public archive and analysis web portal for natural variation data in Caenorhabditis spp.

Authors:  L Basten Snoek; K Joeri Van der Velde; Danny Arends; Yang Li; Antje Beyer; Mark Elvin; Jasmin Fisher; Alex Hajnal; Michael O Hengartner; Gino B Poulin; Miriam Rodriguez; Tobias Schmid; Sabine Schrimpf; Feng Xue; Ritsert C Jansen; Jan E Kammenga; Morris A Swertz
Journal:  Nucleic Acids Res       Date:  2012-11-24       Impact factor: 16.971

9.  WormQTLHD--a web database for linking human disease to natural variation data in C. elegans.

Authors:  K Joeri van der Velde; Mark de Haan; Konrad Zych; Danny Arends; L Basten Snoek; Jan E Kammenga; Ritsert C Jansen; Morris A Swertz; Yang Li
Journal:  Nucleic Acids Res       Date:  2013-11-11       Impact factor: 16.971

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.