Literature DB >> 21372085

Comparative visualization of genetic and physical maps with Strudel.

Micha Bayer1, Iain Milne, Gordon Stephen, Paul Shaw, Linda Cardle, Frank Wright, David Marshall.   

Abstract

UNLABELLED: Data visualization can play a key role in comparative genomics, for example, underpinning the investigation of conserved synteny patterns. Strudel is a desktop application that allows users to easily compare both genetic and physical maps interactively and efficiently. It can handle large datasets from several genomes simultaneously, and allows all-by-all comparisons between these.
AVAILABILITY AND IMPLEMENTATION: Installers for Strudel are available for Windows, Linux, Solaris and Mac OS X at http://bioinf.scri.ac.uk/strudel/.

Entities:  

Mesh:

Year:  2011        PMID: 21372085      PMCID: PMC3077070          DOI: 10.1093/bioinformatics/btr111

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Crop genetics is still dominated by species for which fully sequenced and well-annotated genomes are unavailable. Comparative genomics is an important means of annotating unfinished genomes, and requires powerful visualization tools that elucidate the relationships with already annotated genomes. There are a number of tools in this area, which range from web-based applications with database back-ends to standalone desktop applications (Fang ; Lewis ; Meyer ; Mueller ; Pan ; Sawkins ). The challenges faced by any comparative visualization application are the increasing volume of data, fast delivery of these to users, efficient on-screen rendering of a large amount of information and layout constraints. Here, we present Strudel, a standalone Java desktop application that aims to combine ease of installation with ease of use, and allows the simultaneous multi-way comparison of several genomes. Usability has been a major design criterion for Strudel, and in early acceptance testing users were able to start generating insights into their data within minutes of downloading the application, without having to first consult the manual. Strudel's graphical interface has been designed to reduce visual clutter as much as possible, and a critical condition for this is that homologies between two chromosomes are never drawn across other genomes.

2 IMPLEMENTATION

Strudel ships in easy-to-use installers bundled with Java Runtime Environments (JREs), so there is no requirement to install additional software, and no Java version issues. It is available for Windows, Linux, Solaris and Apple Mac OS X at http://bioinf.scri.ac.uk/strudel/. The installers feature an auto-update facility which alerts users to new releases and provides the option of upgrading the software. In its current implementation, data input into Strudel is by means of flat text files only. This provides the advantage of users being able to generate their own datasets easily, for example, in spreadsheet software, without relying on complex database back-ends. Strudel uses its own simple data format as standard formats for comparative data that have not been developed so far. The Strudel file format is row based, with both features and homologs in a single plain-text file. The format is documented in the online manual, where an example file is also provided. Example datasets are provided both on the Strudel web site and with the application itself. The example dataset that is distributed with the application consists of three cereal genomes with a high degree of conserved synteny: barley (Hordeum vulgare), rice (Oryza sativa) and the model grass species Brachypodium distachyon. The latter two species have complete physical maps, while barley is supplied as a consensus single nucleotide polymorphism (SNP) map (Close ). A worked example is provided (http://bioinf.scri.ac.uk/strudel/#useCases) of how Strudel was used to investigate the barley Int-C mutant (Ramsay ). This involved using the high density barley SNP map, and exploring syntenous regions of the rice and Brachypodium genome in order to identify potential candidate genes through the links to the rice and Brachypodium genome browsers. The graphical interface of Strudel is shown in Figure 1. Genomes are arranged in columns, with chromosomes represented by vertical bars. Features on chromosomes—for example, SNPs or genes—are rendered as horizontal lines, and pairs of homologs are represented by lines between the features involved. Rendering features and homology lines is the computationally most costly part of the canvas drawing operation, but at lower zoom levels this is accelerated by avoiding duplicate drawing operations for features and links that occupy the same on-screen (pixel) coordinates when zoomed out only. This allows Strudel to display feature-dense genomes with tens of thousands of features without noticeable impact on rendering speed.
Fig. 1.

Strudel's graphical interface, showing the example dataset provided with the application. Chromosome 4H of the barley genome has been expanded to fill the screen, showing homologies (gray lines) with the B.distachyon genome (left) and chromosome 3 of rice (right).

Strudel's graphical interface, showing the example dataset provided with the application. Chromosome 4H of the barley genome has been expanded to fill the screen, showing homologies (gray lines) with the B.distachyon genome (left) and chromosome 3 of rice (right). It is assumed that homology data for Strudel datasets are generated by BLAST (Altschul ; not part of Strudel's functionality) or similar tools, and therefore a facility is provided that allows the user to filter the visible homologies (links) by, for example, their BLAST e-value to generate a more stringent view of the data as required. Other numerical variables can be used for this instead of e-values if required. The number of genomes that can be compared is theoretically unlimited and only constrained by the available screen space. Additional graphical instances of genomes can be added without duplicating data (hence, conserving memory), to allow multi-way comparisons. Users are able to choose the number and position of the additional genome instances. Quantitative trait loci (QTL) or any other regions of interest can be explored by defining an interval on a given chromosome. A table with the features contained within that region is then displayed. The table contains names of features and their homologs, along with their positions and any annotation information available. Feature names are clickable and the associated links point at user-defined URLs that provide annotation for the feature in question. Searching for a feature by name is also possible and results in a table being displayed such as that described above. Zooming individual genomes is possible by means of individual zoom sliders at the bottom of each genome column, or by a click-and-drag motion that allows for a region to be highlighted and then zoomed into when the mouse button is released. High-level (zoomed out) views allow users to establish patterns of conserved synteny between genomes. Chromosomes can be inverted to help to disentangle crossed-over links to regions that have undergone chromosomal inversion events. Users can customize numerous display features, such as color schemes, the shape of links (straight, curved or angled), enabling/disabling antialiased drawing and displaying distance labels. A separate overview window shows all maps as laid out on the main canvas and allows easy orientation when the user has zoomed into one or more genomes. It also allows quick navigation by means of a highlighted area that shows the region currently visible on the main canvas in the context of the whole genome. We have also developed close integration with the Germinate 2 (http://bioinf.scri.ac.uk/public/?page_id=159) database warehouse system to allow additional information on markers and genetic maps to be displayed to the user. This is performed by allowing the seamless movement between Strudel and the Germinate 2 web application and vice versa. In addition with the integration of Germinate 2 with our graphical genotype visualization tool, Flapjack (Milne ), we have created an interactive and extendable software environment. Genetic maps held in Germinate 2 can be easily exported in Strudel format. Similarly, any other data source could in theory be adapted to interact with Strudel in this way, both in terms of data export to Strudel, and in terms of providing annotation URLs that can be accessed through the application (see above). An online help manual is available, which includes a quickstart tutorial. There is also a hint panel built into the application that provides context-dependent advice on what actions are available in a given situation. This allows users to start using the application without constant referral to the manual.
  10 in total

1.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

2.  Comparative map and trait viewer (CMTV): an integrated bioinformatic tool to construct consensus maps and compare QTL and functional genomics data across genomes and experiments.

Authors:  M C Sawkins; A D Farmer; D Hoisington; J Sullivan; A Tolopko; Z Jiang; J-M Ribaut
Journal:  Plant Mol Biol       Date:  2004-10       Impact factor: 4.076

3.  SynBrowse: a synteny browser for comparative sequence analysis.

Authors:  Xiaokang Pan; Lincoln Stein; Volker Brendel
Journal:  Bioinformatics       Date:  2005-06-30       Impact factor: 6.937

4.  The SGN comparative map viewer.

Authors:  Lukas A Mueller; Adri A Mills; Beth Skwarecki; Robert M Buels; Naama Menda; Steven D Tanksley
Journal:  Bioinformatics       Date:  2008-01-17       Impact factor: 6.937

5.  MizBee: a multiscale synteny browser.

Authors:  Miriah Meyer; Tamara Munzner; Hanspeter Pfister
Journal:  IEEE Trans Vis Comput Graph       Date:  2009 Nov-Dec       Impact factor: 4.579

6.  cMap: the comparative genetic map viewer.

Authors:  Z Fang; M Polacco; S Chen; S Schroeder; D Hancock; H Sanchez; E Coe
Journal:  Bioinformatics       Date:  2003-02-12       Impact factor: 6.937

7.  INTERMEDIUM-C, a modifier of lateral spikelet fertility in barley, is an ortholog of the maize domestication gene TEOSINTE BRANCHED 1.

Authors:  Luke Ramsay; Jordi Comadran; Arnis Druka; David F Marshall; William T B Thomas; Malcolm Macaulay; Katrin MacKenzie; Craig Simpson; John Fuller; Nicola Bonar; Patrick M Hayes; Udda Lundqvist; Jerome D Franckowiak; Timothy J Close; Gary J Muehlbauer; Robbie Waugh
Journal:  Nat Genet       Date:  2011-01-09       Impact factor: 38.330

8.  Flapjack--graphical genotype visualization.

Authors:  Iain Milne; Paul Shaw; Gordon Stephen; Micha Bayer; Linda Cardle; William T B Thomas; Andrew J Flavell; David Marshall
Journal:  Bioinformatics       Date:  2010-10-18       Impact factor: 6.937

9.  Development and implementation of high-throughput SNP genotyping in barley.

Authors:  Timothy J Close; Prasanna R Bhat; Stefano Lonardi; Yonghui Wu; Nils Rostoks; Luke Ramsay; Arnis Druka; Nils Stein; Jan T Svensson; Steve Wanamaker; Serdar Bozdag; Mikeal L Roose; Matthew J Moscou; Shiaoman Chao; Rajeev K Varshney; Péter Szucs; Kazuhiro Sato; Patrick M Hayes; David E Matthews; Andris Kleinhofs; Gary J Muehlbauer; Joseph DeYoung; David F Marshall; Kavitha Madishetty; Raymond D Fenton; Pascal Condamine; Andreas Graner; Robbie Waugh
Journal:  BMC Genomics       Date:  2009-12-04       Impact factor: 3.969

Review 10.  Apollo: a sequence annotation editor.

Authors:  S E Lewis; S M J Searle; N Harris; M Gibson; V Lyer; J Richter; C Wiel; L Bayraktaroglu; E Birney; M A Crosby; J S Kaminker; B B Matthews; S E Prochnik; C D Smithy; J L Tupy; G M Rubin; S Misra; C J Mungall; M E Clamp
Journal:  Genome Biol       Date:  2002-12-23       Impact factor: 13.583

  10 in total
  21 in total

1.  New evidence of ancestral polyploidy in the Genistoid legume Lupinus angustifolius L. (narrow-leafed lupin).

Authors:  Magdalena Kroc; Grzegorz Koczyk; Wojciech Święcicki; Andrzej Kilian; Matthew N Nelson
Journal:  Theor Appl Genet       Date:  2014-03-15       Impact factor: 5.699

2.  A genome-wide identification of chromosomal regions determining nitrogen use efficiency components in wheat (Triticum aestivum L.).

Authors:  Fabien Cormier; Jacques Le Gouis; Pierre Dubreuil; Stéphane Lafarge; Sébastien Praud
Journal:  Theor Appl Genet       Date:  2014-10-19       Impact factor: 5.699

3.  Evolutionary Dynamics of the Cellulose Synthase Gene Superfamily in Grasses.

Authors:  Julian G Schwerdt; Katrin MacKenzie; Frank Wright; Daniel Oehme; John M Wagner; Andrew J Harvey; Neil J Shirley; Rachel A Burton; Miriam Schreiber; Claire Halpin; Jochen Zimmer; David F Marshall; Robbie Waugh; Geoffrey B Fincher
Journal:  Plant Physiol       Date:  2015-05-21       Impact factor: 8.340

4.  Genetic mapping of legume orthologs reveals high conservation of synteny between lentil species and the sequenced genomes of Medicago and chickpea.

Authors:  Neha Gujaria-Verma; Sally L Vail; Noelia Carrasquilla-Garcia; R Varma Penmetsa; Douglas R Cook; Andrew D Farmer; Albert Vandenberg; Kirstin E Bett
Journal:  Front Plant Sci       Date:  2014-12-05       Impact factor: 5.753

5.  Development and bin mapping of gene-associated interspecific SNPs for cotton (Gossypium hirsutum L.) introgression breeding efforts.

Authors:  Amanda M Hulse-Kemp; Hamid Ashrafi; Xiuting Zheng; Fei Wang; Kevin A Hoegenauer; Andrea B V Maeda; S Samuel Yang; Kevin Stoffel; Marta Matvienko; Kimberly Clemons; Joshua A Udall; Allen Van Deynze; Don C Jones; David M Stelly
Journal:  BMC Genomics       Date:  2014-10-30       Impact factor: 3.969

6.  Powerful regulatory systems and post-transcriptional gene silencing resist increases in cellulose content in cell walls of barley.

Authors:  Hwei-Ting Tan; Neil J Shirley; Rohan R Singh; Marilyn Henderson; Kanwarpal S Dhugga; Gwenda M Mayo; Geoffrey B Fincher; Rachel A Burton
Journal:  BMC Plant Biol       Date:  2015-02-21       Impact factor: 4.215

7.  Prioritization of candidate genes in "QTL-hotspot" region for drought tolerance in chickpea (Cicer arietinum L.).

Authors:  Sandip M Kale; Deepa Jaganathan; Pradeep Ruperao; Charles Chen; Ramu Punna; Himabindu Kudapa; Mahendar Thudi; Manish Roorkiwal; Mohan Avsk Katta; Dadakhalandar Doddamani; Vanika Garg; P B Kavi Kishor; Pooran M Gaur; Henry T Nguyen; Jacqueline Batley; David Edwards; Tim Sutton; Rajeev K Varshney
Journal:  Sci Rep       Date:  2015-10-19       Impact factor: 4.379

8.  A SNP-based consensus genetic map for synteny-based trait targeting in faba bean (Vicia faba L.).

Authors:  Anne Webb; Amanda Cottage; Thomas Wood; Khalil Khamassi; Douglas Hobbs; Krystyna Gostkiewicz; Mark White; Hamid Khazaei; Mohamed Ali; Daniel Street; Gérard Duc; Fred L Stoddard; Fouad Maalouf; Francis C Ogbonnaya; Wolfgang Link; Jane Thomas; Donal M O'Sullivan
Journal:  Plant Biotechnol J       Date:  2015-04-10       Impact factor: 9.803

9.  Genetic characterization of a reciprocal translocation present in a widely grown barley variety.

Authors:  A Farré; A Cuadrado; I Lacasa-Benito; L Cistué; I Schubert; J Comadran; J Jansen; I Romagosa
Journal:  Mol Breed       Date:  2012-01-28       Impact factor: 2.589

10.  Next-generation sequencing of flow-sorted wheat chromosome 5D reveals lineage-specific translocations and widespread gene duplications.

Authors:  Stuart J Lucas; Bala Anı Akpınar; Hana Šimková; Marie Kubaláková; Jaroslav Doležel; Hikmet Budak
Journal:  BMC Genomics       Date:  2014-12-09       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.