Literature DB >> 24122053

MAPfastR: quantitative trait loci mapping in outbred line crosses.

Ronald M Nelson1, Carl Nettelblad, Mats E Pettersson, Xia Shen, Lucy Crooks, Francois Besnier, José M Alvarez-Castro, Lars Rönnegård, Weronica Ek, Zheya Sheng, Marcin Kierczak, Sverker Holmgren, Orjan Carlborg.   

Abstract

MAPfastR is a software package developed to analyze quantitative trait loci data from inbred and outbred line-crosses. The package includes a number of modules for fast and accurate quantitative trait loci analyses. It has been developed in the R language for fast and comprehensive analyses of large datasets. MAPfastR is freely available at: http://www.computationalgenetics.se/?page_id=7.

Entities:  

Keywords:  QTL mapping; inbred line cross; outbred line cross; quantitative genetics

Mesh:

Year:  2013        PMID: 24122053      PMCID: PMC3852377          DOI: 10.1534/g3.113.008623

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Quantitative trait loci (QTL) mapping is a valuable tool for unraveling the complex genetic architecture of phenotypic traits. A number of software packages currently are available for detecting QTL from marker data (for reviews, see Manly and Olson 1999; Durrant ; Zhou ). Most of the software were developed for analyses of various types of crosses between inbred lines, including backcrosses and F2 crosses (R/QTL, Broman 2003; Joehanes and Nelson 2008), multicross designs (heterogeneous stocks and collaborative crosses; Mott ; Jourjon ; Huang and George 2011), and advanced intercross lines (Peirce ). Although we have written an extension to R/QTL that enables data from outbred lines to be analyzed (Nelson ), the functionality is limited. There is some software designed for outbred populations and line crosses (e.g., QxPak, Pérez-Enciso and Misztal 2004 and GridQTL, Seaton ), but as they are several years old, and the algorithms they use are not able to handle the large amount of data produced by current SNP chip technology (Crooks ). MAPfastR is a fast and comprehensive software package for analyzing QTL data from outbred line-crosses that has been developed for flexible analyses of large datasets. MAPfastR is distinct from other packages in several ways. Notably, MAPfastR is based on a computationally efficient algorithm that uses all available data from dense SNP-chips (i.e., tens to hundreds of thousands of markers, similar to association studies) and pedigree information (Crooks ). MAPfastR provides functionality for F2 crosses and backcrosses under the assumption that different QTL alleles are fixed in the founder lines (Crooks ), line-cross analyses allowing for within-line segregation (flexible interclass analysis [FIA]; Rönnegård ), and tests for epistatic interactions (Carlborg and Andersson 2002). In addition to the standard functionality, the software comes with add-on packages that allow more experienced users to take advantage of modules for analyses of deep (Advanced Intercross Line) pedigrees. MAPfastR includes an online developer and community-based support system. MAPfastR is implemented in the R language (with optimization of the more computationally intensive algorithms in C++), accepts several standard input formats and is available for Windows, Unix, and Mac OS.

Implementation

MAPfastR integrates a number of published analytical tools by providing them within a comprehensive R package with a user-friendly interface and accompanying documentation. An outline of the analysis pipeline is shown in Figure 1. The main functions in the package are briefly described herein.
Figure 1

Outline of MAPfastR.

Outline of MAPfastR.

Data import

The first release (v 1.0) of MAPfastR supports two major input formats (CRI-MAP, Green and triM, Crooks ). All imported data are stored in a standardized R object, which consists of a list with two main components (phenotypic and pedigree data, and genotypic data) and several optional attributes (storing, for example, information on which is the heterogametic sex and which is the sex chromosome). A full description of the data object is provided in Supporting Information, File S1. Once the data import is completed, the internal data format can be used for outbred pedigree analyses by use of the main and supplementary analysis modules as well as additional custom analyses coded by the user. As output from the available functions is produced, it is appended to the data object, facilitating further analyses using the results. Support for more formats is in progress and will be provided in coming releases.

Least-squares QTL mapping

A module is provided to perform QTL mapping by least-squares regression (Haley and Knott 1992), where a user-selected phenotype is regressed onto genetic effect variables derived from genotype probabilities.

Calculation of QTL genotype probabilities using trim:

The probabilities of alleles in the mapping population originating from each founder line are calculated using the triM algorithm (Nettelblad ; Crooks ). The algorithm uses a hidden Markov model to trace allele transmission in the pedigree. The line origins are calculated at user-defined, regular intervals along the chromosome and returned to the data object for further analysis.

Regression analysis:

Estimates of QTL effects are provided together with a plot of the test statistic from the fitted model across the genome, which illustrates the QTL locations. Analyses can be done on both autosomes and the homogametic sex chromosome, and for backcrosses and F2 crosses. Permutation testing can be performed by creating appropriately permuted datasets to derive an empirical significance threshold (Churchill and Doerge 1994).

FIA

FIA (Rönnegård ) is an algorithm developed for analyses of outbred line-cross data where it is not reasonable to assume fixation of different QTL alleles in the founder lines. The analysis is performed by the following two steps.

IBD estimation using MCIBD:

Identity by descent (IBD) matrices are estimated from the QTL genotype probabilities calculated from triM using the Monte Carlo Identity-By-Descent Matrix Estimation (MCIBD) algorithm (Shen ). These matrices are used in the second step of the FIA analysis.

FIA:

The variance-component-based FIA analysis scans the genome and provides estimates of genetic effects at regular spaced, user-defined locations in the genome as well as estimates of the likelihood that the QTL is fixed or segregating within the founder lines. The significance testing is based on a score-statistic and empirical significance thresholds are derived by permutation (Rönnegård ).

Estimation of genetic effects using the Natural and Orthogonal InterAction model

The Natural and Orthogonal InterAction (NOIA) model is a unified model that ensures genetic effect estimates are orthogonal and enables effects to be translated from one population to another, aiding biological interpretation (Álvarez-Castro and Carlborg 2001). This allows users to estimate, for example, interaction effects that are comparable between populations and construct high-order genotype-phenotype maps for further analyses of interactions (e.g., Álvarez-Castro and Rouzic 2008; Le Rouzic ; Le Rouzic and Álvarez-Castro 2008).

Variance-component-based analysis of deep intercross pedigrees

An external module for performing analyses of deep pedigrees is provided as an unsupported add-on function for advanced users (Besnier ). When this module is used, individuals from Advanced Intercross Lines generated from outbred founders can be haplotyped and an IBD matrix created that can be used to screen the genome for QTL using the FIA module for variance-component-based analysis.

Results

Each of the functions has been extensively tested during development (Crooks ; Ek ; Shen ). The complete pipeline has also been thoroughly tested to ensure that the package performs as a whole. Sample code for a complete analysis of an outbred line-cross and the resulting output is available in the supplementary documentation and example files. In conclusion, MAPfastR is a comprehensive, fast, and accurate software that is able to perform various methods for QTL mapping in outbred line-cross data. It can also be used for analyzing data from inbred line crosses, where the computational efficiency may be a benefit. Add-on functions for the analysis of deeper pedigrees are also provided for advanced users. MAPfastR is under ongoing development to extend and improve its functionality and is extensively documented, with support available through an online forum for community and developers alike (https://groups.google.com/d/forum/mapfastr).
  24 in total

1.  R/qtl: QTL mapping in experimental crosses.

Authors:  Karl W Broman; Hao Wu; Saunak Sen; Gary A Churchill
Journal:  Bioinformatics       Date:  2003-05-01       Impact factor: 6.937

2.  R/mpMap: a computational platform for the genetic analysis of multiparent recombinant inbred lines.

Authors:  B Emma Huang; Andrew W George
Journal:  Bioinformatics       Date:  2011-01-08       Impact factor: 6.937

3.  A simple regression method for mapping quantitative trait loci in line crosses using flanking markers.

Authors:  C S Haley; S A Knott
Journal:  Heredity (Edinb)       Date:  1992-10       Impact factor: 3.821

4.  QGene 4.0, an extensible Java QTL-analysis platform.

Authors:  Roby Joehanes; James C Nelson
Journal:  Bioinformatics       Date:  2008-10-21       Impact factor: 6.937

5.  How to deal with genotype uncertainty in variance component quantitative trait loci analyses.

Authors:  Xia Shen; Lars Rönnegård; Orjan Carlborg
Journal:  Genet Res (Camb)       Date:  2011-07-18       Impact factor: 1.588

6.  Dissection of the genetic architecture of body weight in chicken reveals the impact of epistasis on domestication traits.

Authors:  Arnaud Le Rouzic; José M Alvarez-Castro; Orjan Carlborg
Journal:  Genetics       Date:  2008-07-13       Impact factor: 4.562

7.  Inheritance beyond plain heritability: variance-controlling genes in Arabidopsis thaliana.

Authors:  Xia Shen; Mats Pettersson; Lars Rönnegård; Örjan Carlborg
Journal:  PLoS Genet       Date:  2012-08-02       Impact factor: 5.917

Review 8.  Bioinformatics tools and database resources for systems genetics analysis in mice--a short review and an evaluation of future needs.

Authors:  Caroline Durrant; Morris A Swertz; Rudi Alberts; Danny Arends; Steffen Möller; Richard Mott; Pjotr Prins; K Joeri van der Velde; Ritsert C Jansen; Klaus Schughart
Journal:  Brief Bioinform       Date:  2011-07-08       Impact factor: 11.622

9.  Estimation of genetic effects and genotype-phenotype maps.

Authors:  Arnaud Le Rouzic; José M Alvarez-Castro
Journal:  Evol Bioinform Online       Date:  2008-06-28       Impact factor: 1.625

10.  How to perform meaningful estimates of genetic effects.

Authors:  José M Alvarez-Castro; Arnaud Le Rouzic; Orjan Carlborg
Journal:  PLoS Genet       Date:  2008-05-02       Impact factor: 5.917

View more
  2 in total

1.  Composition of the ileum microbiota is a mediator between the host genome and phosphorus utilization and other efficiency traits in Japanese quail (Coturnix japonica).

Authors:  Valentin Haas; Solveig Vollmar; Siegfried Preuß; Markus Rodehutscord; Amélia Camarinha-Silva; Jörn Bennewitz
Journal:  Genet Sel Evol       Date:  2022-03-08       Impact factor: 4.297

2.  A Complex Structural Variation on Chromosome 27 Leads to the Ectopic Expression of HOXB8 and the Muffs and Beard Phenotype in Chickens.

Authors:  Ying Guo; Xiaorong Gu; Zheya Sheng; Yanqiang Wang; Chenglong Luo; Ranran Liu; Hao Qu; Dingming Shu; Jie Wen; Richard P M A Crooijmans; Örjan Carlborg; Yiqiang Zhao; Xiaoxiang Hu; Ning Li
Journal:  PLoS Genet       Date:  2016-06-02       Impact factor: 5.917

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.