Literature DB >> 28981643

MareyMap Online: A User-Friendly Web Application and Database Service for Estimating Recombination Rates Using Physical and Genetic Maps.

Aurélie Siberchicot1, Adrien Bessy1,2, Laurent Guéguen1, Gabriel A B Marais1.   

Abstract

Given the importance of meiotic recombination in biology, there is a need to develop robust methods to estimate meiotic recombination rates. A popular approach, called the Marey map approach, relies on comparing genetic and physical maps of a chromosome to estimate local recombination rates. In the past, we have implemented this approach in an R package called MareyMap, which includes many functionalities useful to get reliable recombination rate estimates in a semi-automated way. MareyMap has been used repeatedly in studies looking at the effect of recombination on genome evolution. Here, we propose a simpler user-friendly web service version of MareyMap, called MareyMap Online, which allows a user to get recombination rates from her/his own data or from a publicly available database that we offer in a few clicks. When the analysis is done, the user is asked whether her/his curated data can be placed in the database and shared with other users, which we hope will make meta-analysis on recombination rates including many species easy in the future.
© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  Marey map approach; R package; Shiny; crossover rates; web applications

Mesh:

Year:  2017        PMID: 28981643      PMCID: PMC5737613          DOI: 10.1093/gbe/evx178

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Meiotic recombination rates can vary quite a lot along chromosomes (Nachman 2002) and a good description of this variation in model organisms (humans, Drosophila, Caenorhabditis elegans, Arabidopsis) and in nonmodel organisms is important to understand both the process of meiosis and its evolutionary implications. In particular, recombination is considered one of the main factors influencing genome architecture (e.g., Gaut etal. 2007; Lynch 2007; Charlesworth and Campos 2014; Ellegren 2014; Melamed-Bessudo etal. 2016) and studying the influence of recombination on genome evolution is a hot topics in molecular evolution and genomics. A key methodological point is of course the estimation of the local meiotic recombination rates. Several approaches have been developed to obtain such estimates. Sperm-typing gives very fine-scale and reliable estimates but this approach is costly, difficult to apply on a large-scale and only provides estimates in males (Kauppi etal. 2004, 2009). Studying SNPs and linkage disequilibrium from natural population data can give population-genetic-based estimates of recombination (Stumpf and McVean 2003; Coop and Przeworski 2007). If dense and genome-wide SNP data are available in a species, as for example in humans, this approach will return fine-scale estimates for the entire genome, and it has been very successful in identifying hot spots in humans and in other well-studied organisms (e.g., Ptak etal. 2004; Myers etal. 2005). Even more recently, genome-wide SNP data have been obtained for pedigrees in humans, yeast, Drosophila and Arabidopsis, which have allowed the direct and very precise study of recombination events that occurred in those pedigrees and have resulted in arguably the best estimates so far (Coop etal. 2008; Mancera etal. 2008; Comeron etal. 2012; Yang etal. 2012). With next-generation sequencing, obtaining such genome-wide SNP data is becoming easier, but at present this kind of data are not available for many organisms. Another approach, called the Marey map approach, relies on comparing physical and genetic maps of a given chromosome to estimate local recombination rates, which are given by the local slope of the curve adjusted to the datapoints (Chakravarti 1991). Historically, many studies using local recombination rate estimates have used this approach. Also, all methods have pros and cons (e.g., Buard and de Massy 2007) and the Marey map approach is still very popular as there are many more species for which both physical and genetic maps are available (the raw material for the Marey map approach) compared with species for which genome-wide SNP data are available. Several tools implementing the Marey map approach have been developed but they focus on particular organisms: Drosophila (Fiston-Lavier etal. 2010) or mice and rat (Voigt etal. 2004) for example. In these tools, the users cannot work on their own data. The only tool for inferring local recombination rates in any organisms in a semiautomated way is a tool called MareyMap that we developed a few years ago (Rezvoy etal. 2007) and have updated recently (Siberchicot etal. 2015). MareyMap requires the user to have files for physical and genetic maps for each chromosome in a given species. The files are then uploaded and a number of functionalities allow the user to build a Marey map plot, to clean aberrant points from the map, to infer local estimates using different methods and to export estimates in different formats (Rezvoy etal. 2007). MareyMap is an R package with a tcl/tk graphical interface, which works well but requires the user to have R (R Core Team 2016) installed and to be familiar with installing R packages. Despite R being free and open-source, we have noticed that a significant number of users have experienced difficulties in installing the MareyMap R package when they are not frequent R users. Moreover, the many options proposed by MareyMap are useful for experienced users but can be confusing for beginners with the package and the Marey map approach. Here, we describe a web service called MareyMap Online that we have developed as a user-friendly version of MareyMap. MareyMap Online proposes a simplified version of MareyMap with five main steps: data uploading, data cleaning, recombination rate estimation, export and data sharing (see menu bar in fig. 1), which are described below. The user is guided through these steps with only essential options proposed and clear guidelines when there are choices to make. An important feature offered by MareyMap Online is to be able to pick some Marey map data from a publicly accessible database that we provide. This database currently includes 8 organisms: Arabidopsis thaliana, Caenorhabditis elegans, Danio rerio (zebrafish), Drosophila melanogaster, Homo sapiens, Oryza sativa (rice), Oryzias latipes (medaka), and Saccharomyces cerevisiae. MareyMap Online will ask each user the permission to keep her/his curated Marey map data once the analysis is done, which will then be stored in our database. We thus expect the database to include many more species in the future, which can be useful for meta-analysis on recombination and genome evolution for instance. Such studies are currently difficult as this kind of database does not exist and one needs to collect the Marey map data through literature search.
. 1.

—Screenshot of a Marey map plot (step 2) in the MareyMap Online web service. The five steps are shown in the menu bar.

MareyMap Online has been implemented from the MareyMap R package with shiny, an R package which offers a framework to develop web applications in R (Chang etal. 2016). As explained above, the user follows a pipeline step by step and is asked first to pick a Marey map dataset from the database or to upload her/his own dataset. The format required is quite simple and an example is provided. In step 2, a plot comparing physical (X axis) and genetic (Y axis) map can be seen for each chromosome (fig. 1). Maps often contain errors. These can be easily spotted on a Marey map as they will clearly disrupt the expected overall monotonously increasing relationship between physical and genetic positions. Such erroneous datapoints need to be removed before further analysis and an interactive cleaning tool is proposed to the user in order to do it. Step 3 consists in selecting an estimation method to get local recombination rates. This is a very sensitive step. Users are proposed three different methods already available in the MareyMap R package: sliding windows, loess and cubic splines (Rezvoy etal. 2007; Siberchicot etal. 2015). Cubic splines are the best method to capture relatively fine-scale variations whereas sliding large windows will return large-scale estimates. Which is the right method for an analysis will depend on the features of the data. Key points are the density of the markers and the noise in the data. We offer some guidelines to help the users in picking up the best method for her/his data. In particular, a good criterion to check if a method worked well is to see whether no negative estimates have been obtained; negative estimates are indeed clear signs that another method with more “smoothing” effect is required. At step 4, the user will view the estimates and automatically generated warnings will tell her/him when negative values are present among the estimates. Back-and-forth adjustments between steps 3 and 4 may be needed to pick the estimation method most adapted to the user data. Then, export options are proposed. A useful one is the possibility to upload a list of physical positions and retrieve the estimates for all these positions, which is very useful for a downstream analysis of all the genes in a genome for example. The user can also query some positions to get estimates. At step 5, the user will be asked whether she/he is willing to authorise storage of her/his curated Marey map data in our public database. —Screenshot of a Marey map plot (step 2) in the MareyMap Online web service. The five steps are shown in the menu bar. In conclusion, MareyMap Online provides a simplified user-friendly pipeline for getting local recombination rate estimates. It is available online and requires no installation. We also offer a database with curated Marey map data, which we hope will grow in the future thanks to the MareyMap Online users. Importantly, for well-experienced users the R package will still be maintained and will offer the full set of options included in MareyMap.
  20 in total

1.  Drosophila melanogaster recombination rate calculator.

Authors:  Anna-Sophie Fiston-Lavier; Nadia D Singh; Mikhail Lipatov; Dmitri A Petrov
Journal:  Gene       Date:  2010-05-07       Impact factor: 3.688

Review 2.  Estimating recombination rates from population-genetic data.

Authors:  Michael P H Stumpf; Gilean A T McVean
Journal:  Nat Rev Genet       Date:  2003-12       Impact factor: 53.242

Review 3.  Where the crossovers are: recombination distributions in mammals.

Authors:  Liisa Kauppi; Alec J Jeffreys; Scott Keeney
Journal:  Nat Rev Genet       Date:  2004-06       Impact factor: 53.242

4.  A graphical representation of genetic and physical maps: the Marey map.

Authors:  A Chakravarti
Journal:  Genomics       Date:  1991-09       Impact factor: 5.736

5.  Great majority of recombination events in Arabidopsis are gene conversion events.

Authors:  Sihai Yang; Yang Yuan; Long Wang; Jing Li; Wen Wang; Haoxuan Liu; Jian-Qun Chen; Laurence D Hurst; Dacheng Tian
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-03       Impact factor: 11.205

Review 6.  The relations between recombination rate and patterns of molecular variation and evolution in Drosophila.

Authors:  Brian Charlesworth; José L Campos
Journal:  Annu Rev Genet       Date:  2014-09-10       Impact factor: 16.830

Review 7.  Meiotic recombination and genome evolution in plants.

Authors:  Cathy Melamed-Bessudo; Shay Shilo; Avraham A Levy
Journal:  Curr Opin Plant Biol       Date:  2016-03-01       Impact factor: 7.834

Review 8.  Playing hide and seek with mammalian meiotic crossover hotspots.

Authors:  Jérôme Buard; Bernard de Massy
Journal:  Trends Genet       Date:  2007-04-16       Impact factor: 11.639

9.  High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans.

Authors:  Graham Coop; Xiaoquan Wen; Carole Ober; Jonathan K Pritchard; Molly Przeworski
Journal:  Science       Date:  2008-01-31       Impact factor: 47.728

10.  Absence of the TAP2 human recombination hotspot in chimpanzees.

Authors:  Susan E Ptak; Amy D Roeder; Matthew Stephens; Yoav Gilad; Svante Pääbo; Molly Przeworski
Journal:  PLoS Biol       Date:  2004-06-15       Impact factor: 8.029

View more
  6 in total

1.  Genome properties of key oil palm (Elaeis guineensis Jacq.) breeding populations.

Authors:  Essubalew Getachew Seyum; Ngalle Hermine Bille; Wosene Gebreselassie Abtew; Pasi Rastas; Deni Arifianto; Hubert Domonhédo; Benoît Cochard; Florence Jacob; Virginie Riou; Virginie Pomiès; David Lopez; Joseph Martin Bell; David Cros
Journal:  J Appl Genet       Date:  2022-06-13       Impact factor: 3.240

2.  The Transposable Element Environment of Human Genes Differs According to Their Duplication Status and Essentiality.

Authors:  Margot Correa; Emmanuelle Lerat; Etienne Birmelé; Franck Samson; Bérengère Bouillon; Kévin Normand; Carène Rizzon
Journal:  Genome Biol Evol       Date:  2021-05-07       Impact factor: 3.416

3.  Chromosome-length genome assembly and linkage map of a critically endangered Australian bird: the helmeted honeyeater.

Authors:  Diana A Robledo-Ruiz; Han Ming Gan; Parwinder Kaur; Olga Dudchenko; David Weisz; Ruqayya Khan; Erez Lieberman Aiden; Ekaterina Osipova; Michael Hiller; Hernán E Morales; Michael J L Magrath; Rohan H Clarke; Paul Sunnucks; Alexandra Pavlova
Journal:  Gigascience       Date:  2022-03-29       Impact factor: 6.524

4.  Extreme variation in recombination rate and genetic diversity along the Sylvioidea neo-sex chromosome.

Authors:  Suvi Ponnikas; Hanna Sigeman; Max Lundberg; Bengt Hansson
Journal:  Mol Ecol       Date:  2022-06-06       Impact factor: 6.622

5.  Construction of Genetic Linkage Maps From a Hybrid Family of Large Yellow Croaker (Larimichthys crocea).

Authors:  Xinxiu Yu; Rajesh Joshi; Hans Magnus Gjøen; Zhenming Lv; Matthew Kent
Journal:  Front Genet       Date:  2022-01-03       Impact factor: 4.599

6.  A chromosome-anchored genome assembly for Lake Trout (Salvelinus namaycush).

Authors:  Seth R Smith; Eric Normandeau; Haig Djambazian; Pubudu M Nawarathna; Pierre Berube; Andrew M Muir; Jiannis Ragoussis; Chantelle M Penney; Kim T Scribner; Gordon Luikart; Chris C Wilson; Louis Bernatchez
Journal:  Mol Ecol Resour       Date:  2021-08-14       Impact factor: 8.678

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.