
gQTL: A Web Application for QTL Analysis Using the Collaborative Cross Mouse Genetic Reference Population.

Kranti Konganti1, Andre Ehrlich1, Ivan Rusyn2, David W Threadgill3,4.   

Abstract

Multi-parental recombinant inbred populations, such as the Collaborative Cross (CC) mouse genetic reference population, are increasingly being used for analysis of quantitative trait loci (QTL). However, specialized analytic software for these complex populations is typically built in R and runs only from the command line, which limits the utility of these powerful resources for many users. To overcome these analytic limitations, we developed gQTL, a web-accessible application with a simple graphical user interface, based on the DOQTL platform in R, to perform QTL mapping using data from CC mice.
Copyright © 2018 Konganti et al.

Keywords:  collaborative cross; qtl; software

Year:  2018        PMID: 29880557      PMCID: PMC6071593          DOI: 10.1534/g3.118.200230

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


The utility of model organisms for genetic analysis of biological systems has dramatically increased with the establishment of genetic reference populations. Modern, multi-parental populations specifically designed for quantitative trait locus (QTL) and systems genetics analyses originated with the Collaborative Cross (CC) mouse genetic reference population (Threadgill ; Threadgill and Churchill 2012). The CC population is derived from eight founder strains (A/J, C57BL/6J, 129S1/SvImJ, NOD/ShiLtJ, NZO/HlLtJ, CAST/EiJ, PWK/PhJ, and WSB/EiJ) that represent the three major Mus musculus subspecies (M. m. musculus, M. m. domesticus, and M. m. castaneus), and it captures 90% of the genetic variation in laboratory mice (Roberts ). Although the CC has an organized genetic structure (Churchill ) and is increasingly being used to identify genetic factors controlling a variety of phenotypes from infectious disease and cancer to molecular circuitry (Rasmussen ; Dorman ; Venkatratnam ), genetic analysis of phenotypes using the CC can be challenging due to the multi-allelic structure of the population and the complex analytic tools needed to perform analyses (Aylor ). Although not a replicable population like the CC, the Diversity Outbred (DO) population was derived from the CC population to increase the recombination load in order to improve mapping resolution for QTL analysis (Svenson ). To support genetic analysis using the DO population, DOQTL was developed (Gatti ), which is also increasingly being used for analysis of CC data. DOQTL is an R-based program developed to overcome several analytic challenges of multi-parental populations by implementing an integrated pipeline for haplotype reconstruction, regression modeling to account for kinship, significance thresholds through permutation analysis, and combined association mapping and parental allele-specific tests.
Although DOQTL has become the predominant analytic platform for analysis of CC data, it presents a substantial barrier for most biologists with limited computer programming background. Exploiting recent advancements in web framework technologies in R programming, we developed gQTL, a web application that simplifies genetic analyses of data collected from CC mice and will greatly extend the utility of the CC model to a much broader user base.

Methods

gQTL was implemented using the R Shiny framework (Chang ), which provides the necessary tools for rapid prototyping of interactive web applications. gQTL relies on functions from the DOQTL R package to perform QTL mapping (Gatti ). Since the CC population has a fixed genetic architecture, associated genotypes and haplotype probabilities for each CC line are stored and loaded into memory in the backend when gQTL is launched. The genotype probabilities for each CC and founder strain were obtained from the UNC Systems Genetics data repository (http://csbio.unc.edu/CCstatus/index.py), while the MegaMUGA and GigaMUGA marker sets from which the genotypes are determined in the CC were obtained from The Jackson Laboratory data repository (ftp://ftp.jax.org/MUGA/). The user can choose either of these marker sets when submitting an analysis.

Data availability

The authors affirm that all data necessary for confirming the conclusions of this article are represented fully within the article and its figures. Supplemental material available at Figshare: https://doi.org/10.25387/g3.6453092.

Results and Discussion

After creating a user account, data can be uploaded into a server-side deployment of gQTL, which accepts simple tab-delimited or comma-separated text files containing a sex identifier and multiple phenotype columns from individual or strain-pooled CC data (Figure 1A). At least 3 columns containing Strain (CC), Sex and Phenotype values are mandatory. The CC column can contain official or alias strain names (Supplemental Material, Table S1). Each row can represent a line mean or an individual mouse, the sex column should contain M or F, and multiple phenotype columns can be used. In a recent toxicology study, we used the CC population to evaluate the inter-strain variability in oxidative metabolism of trichloroethylene (TCE) and found several QTL controlling tissue TCE levels and expression of specific genes using DOQTL (Venkatratnam ); datasets from this project are used here to illustrate the simplicity of gQTL (Supplemental Material, Table S2). After uploading the data file, users can remove outliers, normalize the data and perform QTL mapping. Uploaded data are presented as a table, wherein specified phenotype columns can be selected for analysis (Figure 1B). Data from specific CC strains for each phenotype can be manually removed using simple check boxes, or automatic outlier removal can be selected. Trait outliers are detected using the standard boxplot outlier rule, 1.5 × interquartile range (IQR) (Tukey 1977). Multiple data transformation choices (log, sqrt, rankZ) are available for user selection, or an automated transformation selection feature can be specified that uses the Shapiro-Wilk test of normality to determine the optimal transformation between log and sqrt (Shapiro and Wilk 1965). For a selected phenotype column, data quality plots, including raw and normalized histograms and QQ plots, are displayed (Figure 2). Finally, individual or multiple phenotype data columns can be submitted to the server for QTL mapping.
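The outlier rule and the rankZ transformation described above can be illustrated with a short sketch. This is not gQTL's code (gQTL is written in R); it is a minimal Python illustration that assumes quartiles are computed by linear interpolation and that rankZ means a rank-based inverse normal transform using rank / (n + 1). The function names `tukey_outliers` and `rank_z` are hypothetical, and quartile conventions differ slightly between software packages, so borderline points may be classified differently elsewhere.

```python
from statistics import NormalDist, quantiles


def tukey_outliers(values, k=1.5):
    """Return indices of values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # Assumption: quartiles by linear interpolation over the observed data.
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [i for i, v in enumerate(values) if v < lower or v > upper]


def rank_z(values):
    """Rank-based inverse normal transform (assumed form of rankZ).

    Each value is replaced by the standard normal quantile of
    rank / (n + 1); tied values share their average rank.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    start = 0
    while start < n:
        end = start
        # Extend the run while the next sorted value is tied with this one.
        while end + 1 < n and values[order[end + 1]] == values[order[start]]:
            end += 1
        avg_rank = (start + end) / 2 + 1  # average of the 1-based ranks
        for m in range(start, end + 1):
            ranks[order[m]] = avg_rank
        start = end + 1
    normal = NormalDist()
    return [normal.inv_cdf(r / (n + 1)) for r in ranks]
```

For example, `tukey_outliers([4.1, 3.9, 4.3, 4.0, 4.2, 9.5])` flags only the last value, and `rank_z` maps the median of an odd-length sample to zero. Dividing by n + 1 rather than n keeps every quantile strictly inside (0, 1), so the transform never returns an infinite value.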
Significance thresholds are determined through permutation analysis using a user-specified number of permutations (Churchill and Doerge 1994). QTL mapping with 1000 permutations typically takes about 5 hr to finish because DOQTL runs on a single core; future implementations will transition to multiple cores. E-mail notifications keep the user informed of the current state of the job(s) running on the server. Each user account can store up to seven different analyses for later revisiting and re-submission of QTL mapping jobs with different parameters.
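The permutation approach to significance thresholds can be sketched as follows. This is a simplified illustration, not the DOQTL implementation: it assumes a single numeric genotype code per marker and a plain single-marker regression, whereas DOQTL regresses on eight-founder haplotype probabilities and accounts for kinship. The names `lod_score`, `max_lod` and `permutation_thresholds` are hypothetical.

```python
import math
import random


def lod_score(genotype, phenotype):
    """LOD for a single-marker linear regression: -(n/2) * log10(1 - r^2)."""
    n = len(phenotype)
    mean_g = sum(genotype) / n
    mean_p = sum(phenotype) / n
    sgg = sum((g - mean_g) ** 2 for g in genotype)
    spp = sum((p - mean_p) ** 2 for p in phenotype)
    sgp = sum((g - mean_g) * (p - mean_p) for g, p in zip(genotype, phenotype))
    if sgg == 0 or spp == 0:
        return 0.0  # no variation, so no evidence for a QTL
    r2 = min(sgp * sgp / (sgg * spp), 1 - 1e-12)  # clamp to avoid log10(0)
    return -(n / 2) * math.log10(1 - r2)


def max_lod(markers, phenotype):
    """Genome-wide maximum LOD across all markers."""
    return max(lod_score(g, phenotype) for g in markers)


def permutation_thresholds(markers, phenotype, n_perm=1000,
                           levels=(0.85, 0.90, 0.95), seed=0):
    """Empirical LOD thresholds from the null distribution of the maximum LOD.

    Shuffling the phenotype breaks any genotype-phenotype association while
    preserving the correlation structure among markers.
    """
    rng = random.Random(seed)
    shuffled = list(phenotype)
    null_max = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        null_max.append(max_lod(markers, shuffled))
    null_max.sort()
    thresholds = {}
    for level in levels:
        index = min(n_perm - 1, max(0, math.ceil(level * n_perm) - 1))
        thresholds[level] = null_max[index]
    return thresholds
```

Because every permutation repeats the full genome scan, run time grows linearly with the number of permutations, which is consistent with the long single-core run times reported for 1000 permutations.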
Figure 1

Screen shots of data entry and initial processing. (A) Data loading and file type selection. (B) Uploaded data visualization, outlier selection, and normalization options.

Figure 2

Screen shots of QTL analysis results. (A) Options for data visualization with normalized histogram. (B) QQ plot. (C) QTL plot with threshold levels and locations of significant markers. (D) Allele effect and genotype-phenotype plots. (E) A zoomed version of the significant QTL interval.

After the analyses are complete, QTL results can be explored using the web application (Figure 2; Gatti ). Linkage plots are displayed along with permutation-determined LOD scores for the 85, 90 and 95% significance threshold levels. Chromosome-wide, CC founder strain-specific allele effect plots are automatically generated for any locus reaching significance; these show the marker ID with the maximal LOD and its location in cM and Mb coordinates on Build 37 (mm9) or Build 38 (mm10), depending on the marker set selected, as well as the Mb coordinates of the confidence interval based on a 95% Bayesian credible interval (Sen and Churchill 2001). Higher resolution images of the 95% intervals can be selected that show underlying gene annotations. Other chromosomes that may contain regions of interest but do not reach at least 85% significance can be manually selected to generate additional chromosome-specific allele effect plots. For those loci reaching at least the 85% significance threshold, phenotypes for each CC sample are also plotted by genotype to visualize the genotypes driving the QTL signal. A comprehensive PDF report is automatically generated for archiving (Supplemental Material, Figure S1). Additionally, a ZIP archive containing the PDF report along with publication-quality PNG figures at 600 dpi can be downloaded.
gQTL v1.0 provides an easy-to-use graphical user interface for QTL mapping analyses of studies in CC mice; the only input required from users is an upload of quantitative phenotype data collected in CC mice. We plan to extend the application to include the ability to use phenotypes from CC Recombinant Inbred Intercrosses (CC-RIX) in subsequent version releases (Zou ).
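A 95% Bayesian credible interval of the kind mentioned above can be approximated from a LOD profile along one chromosome. The sketch below is an illustration under stated assumptions, not the routine used by gQTL or DOQTL: it assumes a uniform prior over position, treats 10^LOD as proportional to the posterior density, and weights each marker by half the distance to its neighbours. The function name `bayes_credible_interval` is hypothetical.

```python
def bayes_credible_interval(positions, lods, prob=0.95):
    """Approximate Bayesian credible interval for a QTL position.

    positions: marker positions along one chromosome (e.g. in Mb), ascending.
    lods: LOD score at each marker.
    Returns (start, end) spanning the highest-density markers whose
    combined posterior mass reaches `prob`.
    """
    n = len(positions)
    if n < 2:
        raise ValueError("need at least two markers")
    # Width represented by each marker: half the gap to each neighbour.
    widths = []
    for i in range(n):
        left = positions[i] - positions[i - 1] if i > 0 else 0.0
        right = positions[i + 1] - positions[i] if i < n - 1 else 0.0
        widths.append((left + right) / 2)
    peak = max(lods)
    # Subtract the peak before exponentiating to avoid overflow.
    density = [10 ** (lod - peak) for lod in lods]
    mass = [d * w for d, w in zip(density, widths)]
    total = sum(mass)
    # Add markers from highest to lowest density until `prob` is covered.
    order = sorted(range(n), key=lambda i: density[i], reverse=True)
    chosen, cumulative = [], 0.0
    for i in order:
        chosen.append(i)
        cumulative += mass[i] / total
        if cumulative >= prob:
            break
    return positions[min(chosen)], positions[max(chosen)]
```

With markers at 10 to 16 Mb and LOD scores of 1.0, 2.0, 3.5, 4.0, 3.6, 2.0 and 1.0, the three highest-density markers are needed to reach 95% of the posterior mass, so the interval runs from 12 to 14 Mb. Sharper LOD peaks give narrower intervals, since the posterior mass concentrates at fewer markers.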

Web Resources

The web application is freely available at: https://genomics.tamu.edu/gqtl. A built-in help menu exists on gQTL with instructions on setting up user accounts, uploading phenotype data files, inspecting phenotype data, running QTL analysis, viewing QTL analysis results and generating reports of QTL results. The source code, from the original developers (Gatti ), for the underlying DOQTL package is available at GitHub (https://github.com/dmgatti/DOQTL).
References

1.  Genetic dissection of complex and quantitative traits: from fantasy to reality via a community effort.

Authors:  David W Threadgill; Kent W Hunter; Robert W Williams
Journal:  Mamm Genome       Date:  2002-04       Impact factor: 2.957

2.  Ten years of the Collaborative Cross.

Authors:  David W Threadgill; Gary A Churchill
Journal:  Genetics       Date:  2012-02       Impact factor: 4.562

3.  The Collaborative Cross, a community resource for the genetic analysis of complex traits.

Authors:  Gary A Churchill; David C Airey; Hooman Allayee; Joe M Angel; Alan D Attie; Jackson Beatty; William D Beavis; John K Belknap; Beth Bennett; Wade Berrettini; Andre Bleich; Molly Bogue; Karl W Broman; Kari J Buck; Ed Buckler; Margit Burmeister; Elissa J Chesler; James M Cheverud; Steven Clapcote; Melloni N Cook; Roger D Cox; John C Crabbe; Wim E Crusio; Ariel Darvasi; Christian F Deschepper; R W Doerge; Charles R Farber; Jiri Forejt; Daniel Gaile; Steven J Garlow; Hartmut Geiger; Howard Gershenfeld; Terry Gordon; Jing Gu; Weikuan Gu; Gerald de Haan; Nancy L Hayes; Craig Heller; Heinz Himmelbauer; Robert Hitzemann; Kent Hunter; Hui-Chen Hsu; Fuad A Iraqi; Boris Ivandic; Howard J Jacob; Ritsert C Jansen; Karl J Jepsen; Dabney K Johnson; Thomas E Johnson; Gerd Kempermann; Christina Kendziorski; Malak Kotb; R Frank Kooy; Bastien Llamas; Frank Lammert; Jean-Michel Lassalle; Pedro R Lowenstein; Lu Lu; Aldons Lusis; Kenneth F Manly; Ralph Marcucio; Doug Matthews; Juan F Medrano; Darla R Miller; Guy Mittleman; Beverly A Mock; Jeffrey S Mogil; Xavier Montagutelli; Grant Morahan; David G Morris; Richard Mott; Joseph H Nadeau; Hiroki Nagase; Richard S Nowakowski; Bruce F O'Hara; Alexander V Osadchuk; Grier P Page; Beverly Paigen; Kenneth Paigen; Abraham A Palmer; Huei-Ju Pan; Leena Peltonen-Palotie; Jeremy Peirce; Daniel Pomp; Michal Pravenec; Daniel R Prows; Zhonghua Qi; Roger H Reeves; John Roder; Glenn D Rosen; Eric E Schadt; Leonard C Schalkwyk; Ze'ev Seltzer; Kazuhiro Shimomura; Siming Shou; Mikko J Sillanpää; Linda D Siracusa; Hans-Willem Snoeck; Jimmy L Spearow; Karen Svenson; Lisa M Tarantino; David Threadgill; Linda A Toth; William Valdar; Fernando Pardo-Manuel de Villena; Craig Warden; Steve Whatley; Robert W Williams; Tim Wiltshire; Nengjun Yi; Dabao Zhang; Min Zhang; Fei Zou
Journal:  Nat Genet       Date:  2004-11       Impact factor: 38.330

4.  High-resolution genetic mapping using the Mouse Diversity outbred population.

Authors:  Karen L Svenson; Daniel M Gatti; William Valdar; Catherine E Welsh; Riyan Cheng; Elissa J Chesler; Abraham A Palmer; Leonard McMillan; Gary A Churchill
Journal:  Genetics       Date:  2012-02       Impact factor: 4.562

5.  Empirical threshold values for quantitative trait mapping.

Authors:  G A Churchill; R W Doerge
Journal:  Genetics       Date:  1994-11       Impact factor: 4.562

6.  Population-based dose-response analysis of liver transcriptional response to trichloroethylene in mouse.

Authors:  Abhishek Venkatratnam; John S House; Kranti Konganti; Connor McKenney; David W Threadgill; Weihsueh A Chiu; David L Aylor; Fred A Wright; Ivan Rusyn
Journal:  Mamm Genome       Date:  2018-01-20       Impact factor: 2.957

7.  Genetic analysis of complex traits in the emerging Collaborative Cross.

Authors:  David L Aylor; William Valdar; Wendy Foulds-Mathes; Ryan J Buus; Ricardo A Verdugo; Ralph S Baric; Martin T Ferris; Jeff A Frelinger; Mark Heise; Matt B Frieman; Lisa E Gralinski; Timothy A Bell; John D Didion; Kunjie Hua; Derrick L Nehrenberg; Christine L Powell; Jill Steigerwalt; Yuying Xie; Samir N P Kelada; Francis S Collins; Ivana V Yang; David A Schwartz; Lisa A Branstetter; Elissa J Chesler; Darla R Miller; Jason Spence; Eric Yi Liu; Leonard McMillan; Abhishek Sarkar; Jeremy Wang; Wei Wang; Qi Zhang; Karl W Broman; Ron Korstanje; Caroline Durrant; Richard Mott; Fuad A Iraqi; Daniel Pomp; David Threadgill; Fernando Pardo-Manuel de Villena; Gary A Churchill
Journal:  Genome Res       Date:  2011-03-15       Impact factor: 9.043

8.  The polymorphism architecture of mouse genetic resources elucidated using genome-wide resequencing data: implications for QTL discovery and systems genetics.

Authors:  Adam Roberts; Fernando Pardo-Manuel de Villena; Wei Wang; Leonard McMillan; David W Threadgill
Journal:  Mamm Genome       Date:  2007-08-03       Impact factor: 2.957

9.  Quantitative trait locus mapping methods for diversity outbred mice.

Authors:  Daniel M Gatti; Karen L Svenson; Andrey Shabalin; Long-Yang Wu; William Valdar; Petr Simecek; Neal Goodwin; Riyan Cheng; Daniel Pomp; Abraham Palmer; Elissa J Chesler; Karl W Broman; Gary A Churchill
Journal:  G3 (Bethesda)       Date:  2014-09-18       Impact factor: 3.154

10.  Genetic analysis of intestinal polyp development in Collaborative Cross mice carrying the Apc (Min/+) mutation.

Authors:  Alexandra Dorman; Daria Baer; Ian Tomlinson; Richard Mott; Fuad A Iraqi
Journal:  BMC Genet       Date:  2016-02-19       Impact factor: 2.797
