Literature DB >> 27154270

Synteny Portal: a web-based application portal for synteny block analysis.

Jongin Lee1, Woon-Young Hong1, Minah Cho1, Mikang Sim1, Daehwan Lee1, Younhee Ko2, Jaebum Kim3.   

Abstract

Recent advances in next-generation sequencing technologies and genome assembly algorithms have enabled the accumulation of a huge volume of genome sequences from various species. This has provided new opportunities for large-scale comparative genomics studies. Identifying and utilizing synteny blocks, which are genomic regions conserved among multiple species, is key to understanding genomic architecture and the evolutionary history of genomes. However, the construction and visualization of such synteny blocks from multiple species are very challenging, especially for biologists with a lack of computational skills. Here, we present Synteny Portal, a versatile web-based application portal for constructing, visualizing and browsing synteny blocks. With Synteny Portal, users can easily (i) construct synteny blocks among multiple species by using prebuilt alignments in the UCSC genome browser database, (ii) visualize and download syntenic relationships as high-quality images, (iii) browse synteny blocks with genetic information and (iv) download the details of synteny blocks to be used as input for downstream synteny-based analyses, all in an intuitive and easy-to-use web-based interface. We believe that Synteny Portal will serve as a highly valuable tool that will enable biologists to easily perform comparative genomics studies by compensating limitations of existing tools. Synteny Portal is freely available at http://bioinfo.konkuk.ac.kr/synteny_portal.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2016        PMID: 27154270      PMCID: PMC4987893          DOI: 10.1093/nar/gkw310

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Recent advances in next-generation sequencing technologies and genome assembly algorithms, together with the results of various large-scale genome projects, such as the Genome 10K Project (1), the Bird 10K Project (2) and i5K (3), have enabled the accumulation of a huge volume of genomic sequences from various species. These genome sequences are usually stored in public databases, such as the NCBI Reference Sequence database (4), the Genome OnLine Database (5), the UCSC genome browser database (6) and Ensembl Genomes (7); they have been used to uncover genomic similarities and differences between different species and the functional consequences of such similarities and differences. Comparative genomics studies have contributed to a better understanding of the molecular-level mechanisms of species diversity and genome evolution (8–12). Synteny blocks, which are genomic regions that are conserved among multiple species, play a pivotal role in comparative genomics. Various algorithms and tools have been introduced for the construction and utilization of synteny blocks, such as CYNTENATOR (13), DRIMM-Synteny (14), i-ADHoRe 3.0 (15), inferCars (16), OSfinder (17) and Sibelia (18). They are stand-alone and command-line tools, and users need to download and install them on a users’ machine. Therefore, the construction of synteny blocks using such tools requires computer resources, and advanced bioinformatics skills to properly prepare input data and running command-line programs. To alleviate these difficulties, more user-friendly tools for utilizing and visualizing synteny blocks have been developed. For example, GenomeMatcher (19), Mauve (20), MizBee (21) and SyMap (22) are stand-alone synteny browsers that support graphical user interface. Gbrowse_syn (23) and Sybil (24) are web-based software packages for comparative genomics that need to be installed and configured in users’ own machines. GSV (25) and mGSV (26) are web-based synteny viewers that help to visualize user-provided synteny information. Cinteny (27) is a web server for identifying synteny blocks and analyzing genome rearrangements. CoGe (28), VISTA (29) and Ensembl SyntenyView (30) are web-based platforms that perform various analyses for comparative genomic studies. Genomicus (31) is a web browser that displays the homologous genomic contexts of different species. C-Sibelia (32) is a web server for comparing assemblies to a reference genome together with the annotation of variants in genes. GRIMM-Synteny (33) is a web server for analyzing genome rearrangements by using the list of genomic markers represented by numbers with the annotation of variants in genes. However, different tools still have different weaknesses, such as only supporting pairwise comparison, requiring the generation of synteny information by users, limited features for visualization and not supporting the saving or downloading of high-quality images (Supplementary Data for tool comparison). Here, we present Synteny Portal, a versatile web-based application portal that allows for construction, visualization and browsing of synteny blocks by compensating limitations of existing tools. With Synteny Portal, users can easily (i) construct synteny blocks among multiple species by using prebuilt alignments in the UCSC genome browser database (6), (ii) visualize and download syntenic relationships as high-quality images, (iii) browse synteny blocks together with genetic information and (iv) download the details of synteny blocks that can be used as input for downstream synteny-based analyses, all in an intuitive and easy-to-use web interface.

MATERIALS AND METHODS

Data preparation

Data collection

We collected genomic information on well-studied reference species, including human, mouse, cow, dog, horse, pig, rat, rhesus, chicken and zebra fish, from the UCSC genome browser database (6), which includes genome assembly sequences, gene annotation, whole-genome pairwise alignments of the reference species in the chain and net format (34), cytobands and assembly sizes of the reference species. In total, 458 whole-genome pairwise alignments for 67 different species, including 124 different assembly versions and 33 assembly sequences for the ten reference species with different assembly versions are available on our web server. Gene/protein identifiers annotated with seven different criteria, such as Wiki Genes (35), RefSeq genes and proteins (4), Entrez genes (36), Ensembl genes, transcripts and proteins (7), allow for searching and browsing user-provided gene/protein identifiers. The mapping information using different identifiers was obtained from the Uniprot website (http://www.uniprot.org/) and the biomaRt package from Bioconductor (37).

Synteny block construction and visualization

Synteny blocks were built using the inferCars program package (16) with four different resolutions (150, 300, 400 and 500 Kbp). Specifically, pairwise collinear genome segments (PCGSs) of two species (i.e. a reference species and another species) are first collected from pairwise whole-genome alignments. Then, to create multiple collinear genome segments (MCGSs), two PCGSs are merged by splitting the PCGSs of one species at positions corresponding to the boundaries of the PCGSs of the other species based on the coordinates of the reference genome. This merging process repeats until the PCGSs of all species are merged, and the final MCGSs, which are larger than the given resolution, are reported as synteny blocks. If outgroup species are provided, which is optional, genomic regions of the outgroup species matched to each of the calculated synteny blocks are added. Syntenic relationships are visualized through the Circos program (38) and converted to an interactive version on the website with JavaScript. The BLAST program (39) is used to search for a top-matched region in the reference genome based on the user-provided DNA or protein sequence with default parameter values (Supplementary Data).

Implementations

The Synteny Portal website is implemented with the HTML5 and JavaScript libraries, including jQuery (http://jquery.com), d3.js (http://d3js.org/) and GenomeD3Plot (40) for the client-side user interface, and by PHP scripts (5.3.3) for interactive data processing. The server-side processes are generated by Perl scripts (v5.22.0).

RESULTS

Synteny Portal overview

Synteny Portal provides four main web applications: SynCircos, SynBrowser, SynSearcher and SynBuilder. SynCircos visualizes synteny blocks against a user-selected reference and target species in an interactive Circos plot. SynBrowser displays more details of synteny blocks with annotated reference genes. SynSearcher finds syntenic regions in other species based on query sequences. Finally, SynBuilder constructs synteny blocks among multiple chosen species. These four applications generate the high-quality Circos plots or records of synteny blocks that can be downloaded in various formats, including SVG, PNG, JPEG and PDF.

SynCircos

SynCircos depicts synteny blocks in an interactive Circos plot. Users can select a reference species, which is the reference of pairwise whole-genome alignment, multiple target species, target chromosomes and a resolution for the minimum size of a homologous reference block. The syntenic relationships among chosen species are visualized as an interactive Circos plot (Figure 1A), which can be downloaded in various formats. Ribbons connect homologous blocks in different species that belong to the same synteny block and users can highlight a specific syntenic relationship by placing a mouse pointer on a specific ribbon.
Figure 1.

Examples of SynCircos and SynBrowser outputs. (A) A representative Circos plot generated by SynCircos, which compared three species and their specific chromosomes (Human: chr1, chr3, chr5, chr6, chr8, chr12, chr15, chr19; Mouse: chr1, chr3, chr7, chr17, chr18; and Cow: chr3, chr9, chr10, chr14, chr17, chr22). The highlighted ribbons represent syntenic relationships between human chromosome 1 and the chromosomes of the other species. (B) A representative image generated by SynBrowser which shows the relationships between human chromosome 3 and the related mouse chromosomes. The coordinates of homologous blocks in each synteny block can be seen when a specific genomic region is selected (the khaki-colored box in the center of the image). The red triangle indicates the position of the user-queried gene. (C) A representative gene browser produced by SynBrowser which shows the genes of the reference species (human in this image) in synteny blocks. Details of the annotated genes are provided via links to the UCSC genome browser.

Examples of SynCircos and SynBrowser outputs. (A) A representative Circos plot generated by SynCircos, which compared three species and their specific chromosomes (Human: chr1, chr3, chr5, chr6, chr8, chr12, chr15, chr19; Mouse: chr1, chr3, chr7, chr17, chr18; and Cow: chr3, chr9, chr10, chr14, chr17, chr22). The highlighted ribbons represent syntenic relationships between human chromosome 1 and the chromosomes of the other species. (B) A representative image generated by SynBrowser which shows the relationships between human chromosome 3 and the related mouse chromosomes. The coordinates of homologous blocks in each synteny block can be seen when a specific genomic region is selected (the khaki-colored box in the center of the image). The red triangle indicates the position of the user-queried gene. (C) A representative gene browser produced by SynBrowser which shows the genes of the reference species (human in this image) in synteny blocks. Details of the annotated genes are provided via links to the UCSC genome browser.

Inputs to SynCircos

SynCircos requires user-selected reference and target species, genome assembly version of the selected species, chromosome number, a resolution for the synteny blocks and an image format for visualization in the Circos plot.

Outputs of SynCircos

The outputs of SynCircos is an interactive Circos plot. Tracks and ribbons in the Circos plot represent the chromosomes and cytobands (if any) of the chosen species and the syntenic relationships among different species, respectively. Figure 1A shows a highlighted synteny block in human, mouse and cow.

SynBrowser

SynBrowser displays more details of syntenic relationships between two species (i.e. a reference and a target species), whereas SynCircos shows a global view of syntenic relationships among multiple species. SynBrowser presents homologous blocks between the two species in an interactive graphical form (Figure 1B and C). Users can easily highlight specific homologous blocks via exact genomic coordinates in a floating window. SynBrowser also provides two ways of navigating synteny blocks: (i) chromosome-level navigation, by clicking on a specific reference chromosome in a chromosome legend on the left of a webpage, and (ii) region-level navigation, by specifying a coordinate of a reference chromosome. Genes within a synteny block can also be navigated using a gene browser through a direct search for a specific gene with a variety of gene identifiers, such as RefSeq, Wiki Gene and Entrez Gene. The images in SynBrowser can also be downloaded in multiple formats.

Inputs to SynBrowser

To visualize the pairwise synteny blocks, SynBrowser needs only four user-selected parameters: a reference species, a reference chromosome, a target species and a synteny block resolution. Synteny blocks containing a specific gene or protein can be found by searching for a gene or protein identifier. Users can directly move to a chosen position in the reference genome by providing specific coordinates.

Outputs of SynBrowser

The outputs of SynBrowser are a plot showing connections between homologous blocks between two species and a gene browser displaying reference genes within synteny blocks. Figure 1B and C show highlighted homologous blocks between human and mouse in the same synteny block, along with their coordinates and reference genes within the synteny block. The red triangle in Figure 1B indicates the location of a gene searched for by users.

SynSearcher

SynSearcher searches for a reference genome sequence and finds the best-matched region based on a user-provided DNA or protein sequence by BLAST query; synteny blocks encompassing the top-matched reference region are retrieved (Figure 2A). The results are provided to users in two different formats: (i) the details of BLAST search results and the coordinates of synteny blocks as a text file, and (ii) an interactive Circos plot highlighting the retrieved syntenic relationships (Figure 2B). The Circos plot can be downloaded in various formats, and the text file can also be downloaded for downstream analyses with minor modifications.
Figure 2.

Flowchart and outputs of SynSearcher. (A) Given a DNA or protein sequence in the FASTA format, SynSearcher finds matching regions in a reference species using a BLAST search and returns pairwise synteny blocks containing the matched regions of the reference species. (B) The outputs of SynSearcher (white box, BLAST search results and the coordinates of pairwise synteny blocks; below, the Circos plot) for seven species (human, chimpanzee, gorilla, mouse, cat, dog and chicken) given a user-provided sequence.

Flowchart and outputs of SynSearcher. (A) Given a DNA or protein sequence in the FASTA format, SynSearcher finds matching regions in a reference species using a BLAST search and returns pairwise synteny blocks containing the matched regions of the reference species. (B) The outputs of SynSearcher (white box, BLAST search results and the coordinates of pairwise synteny blocks; below, the Circos plot) for seven species (human, chimpanzee, gorilla, mouse, cat, dog and chicken) given a user-provided sequence.

Inputs to SynSearcher

A user-provided DNA or protein sequence can be directly inserted in a text box or uploaded as a file in a FASTA format. Users can also select a reference and target species with a chosen resolution.

Outputs of SynSearcher

The results consist of (i) the details of the BLAST output for the top-matched alignment result, (ii) the coordinates of identified pairwise synteny blocks and (iii) a Circos plot visualizing the synteny blocks. Figure 2B shows synteny blocks obtained from seven species (human, chimpanzee, mouse, gorilla, chicken, cat and dog) given a user-provided sequence.

SynBuilder

SynBuilder creates synteny blocks for user-selected species (a reference, target and optional outgroup species) with a chosen resolution using the inferCars program (16). Specifically, pairwise whole-genome alignments between the reference and the target species are first collected and then homologous genomic regions are identified, and, finally, co-linear genomic regions among the chosen species are grouped into synteny blocks (Figure 3A). If outgroup species are selected, matched genomic regions between the outgroup species and a reference species are added to the synteny blocks. The results of SynBuilder consist of (i) the coordinates of the synteny blocks as a text file, and (ii) an interactive Circos plot depicting syntenic relationships (Figure 3B). The Circos plot can be downloaded in various formats, and the text file can also be downloaded and used for downstream analyses with minor modifications.
Figure 3.

Flowchart and outputs of SynBuilder. (A) Given options (a reference species, target species, and optional outgroup species, and a resolution) selected by users, SynBuilder creates synteny blocks using pairwise whole-genome alignments between the reference and the other species. (B) The output of SynBuilder (white box, coordinates of synteny blocks; below, the Circos plot) for four species (human, chimpanzee, gorilla and mouse) with three outgroup species (cat, dog and chicken).

Flowchart and outputs of SynBuilder. (A) Given options (a reference species, target species, and optional outgroup species, and a resolution) selected by users, SynBuilder creates synteny blocks using pairwise whole-genome alignments between the reference and the other species. (B) The output of SynBuilder (white box, coordinates of synteny blocks; below, the Circos plot) for four species (human, chimpanzee, gorilla and mouse) with three outgroup species (cat, dog and chicken).

Inputs to SynBuilder

SynBuilder requires the selection of a reference, target and optional outgroup species, and a resolution for the synteny blocks.

Outputs of SynBuilder

The results of SynBuilder are (i) the coordinates of synteny blocks and (ii) an interactive Circos plot depicting syntenic relationships among the chosen species.

CONCLUSION

Synteny Portal facilitates studies on comparative genomics through easy construction of synteny blocks, intuitive graphical representation with a high-quality image format and easy-to-use querying and browsing functionalities. Synteny Portal enables a user with a lack of computational skills to perform comparative genomic analyses. The current version only works with prebuilt whole-genome alignments because of long computational time and high computational resource requirements for online creation of whole-genome sequence alignments. However, whole-genome alignment data will be periodically updated via mirroring of data in the UCSC genome browser database. In addition, our website will be updated to support the whole-genome alignment of user-provided genome sequences with additional reference species in future. We believe that Synteny Portal will serve as a highly valuable tool that will enable biologists to easily perform comparative genomics studies.

AVAILABILITY

The Synteny Portal web server is available at http://bioinfo.konkuk.ac.kr/synteny_portal/.
  39 in total

1.  Sybil: methods and software for multiple genome comparison and visualization.

Authors:  Jonathan Crabtree; Samuel V Angiuoli; Jennifer R Wortman; Owen R White
Journal:  Methods Mol Biol       Date:  2007

2.  Accurate identification of orthologous segments among multiple genomes.

Authors:  Tsuyoshi Hachiya; Yasunori Osana; Kris Popendorf; Yasubumi Sakakibara
Journal:  Bioinformatics       Date:  2009-02-02       Impact factor: 6.937

3.  A wiki for the life sciences where authorship matters.

Authors:  Robert Hoffmann
Journal:  Nat Genet       Date:  2008-09       Impact factor: 38.330

4.  Circos: an information aesthetic for comparative genomics.

Authors:  Martin Krzywinski; Jacqueline Schein; Inanç Birol; Joseph Connors; Randy Gascoyne; Doug Horsman; Steven J Jones; Marco A Marra
Journal:  Genome Res       Date:  2009-06-18       Impact factor: 9.043

5.  The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

Authors:  T B K Reddy; Alex D Thomas; Dimitri Stamatis; Jon Bertsch; Michelle Isbandi; Jakob Jansson; Jyothi Mallajosyula; Ioanna Pagani; Elizabeth A Lobos; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

6.  GSV: a web-based genome synteny viewer for customized data.

Authors:  Kashi V Revanna; Chi-Chen Chiu; Ezekiel Bierschank; Qunfeng Dong
Journal:  BMC Bioinformatics       Date:  2011-08-02       Impact factor: 3.307

Review 7.  Comparative genomics as a tool to understand evolution and disease.

Authors:  Jessica Alföldi; Kerstin Lindblad-Toh
Journal:  Genome Res       Date:  2013-07       Impact factor: 9.043

8.  i-ADHoRe 3.0--fast and sensitive detection of genomic homology in extremely large data sets.

Authors:  Sebastian Proost; Jan Fostier; Dieter De Witte; Bart Dhoedt; Piet Demeester; Yves Van de Peer; Klaas Vandepoele
Journal:  Nucleic Acids Res       Date:  2011-11-18       Impact factor: 16.971

9.  A web-based multi-genome synteny viewer for customized data.

Authors:  Kashi V Revanna; Daniel Munro; Alvin Gao; Chi-Chen Chiu; Anil Pathak; Qunfeng Dong
Journal:  BMC Bioinformatics       Date:  2012-08-02       Impact factor: 3.169

10.  Ensembl Genomes 2016: more genomes, more complexity.

Authors:  Paul Julian Kersey; James E Allen; Irina Armean; Sanjay Boddu; Bruce J Bolt; Denise Carvalho-Silva; Mikkel Christensen; Paul Davis; Lee J Falin; Christoph Grabmueller; Jay Humphrey; Arnaud Kerhornou; Julia Khobova; Naveen K Aranganathan; Nicholas Langridge; Ernesto Lowy; Mark D McDowall; Uma Maheswari; Michael Nuhn; Chuang Kee Ong; Bert Overduin; Michael Paulini; Helder Pedro; Emily Perry; Giulietta Spudich; Electra Tapanari; Brandon Walts; Gareth Williams; Marcela Tello-Ruiz; Joshua Stein; Sharon Wei; Doreen Ware; Daniel M Bolser; Kevin L Howe; Eugene Kulesha; Daniel Lawson; Gareth Maslen; Daniel M Staines
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

View more
  15 in total

1.  Smash++: an alignment-free and memory-efficient tool to find genomic rearrangements.

Authors:  Morteza Hosseini; Diogo Pratas; Burkhard Morgenstern; Armando J Pinho
Journal:  Gigascience       Date:  2020-05-01       Impact factor: 6.524

2.  Something Fishy about Siamese Fighting Fish (Betta splendens) Sex: Polygenic Sex Determination or a Newly Emerged Sex-Determining Region?

Authors:  Thitipong Panthum; Kitipong Jaisamut; Worapong Singchat; Syed Farhan Ahmad; Lalida Kongkaew; Wongsathit Wongloet; Sahabhop Dokkaew; Ekaphan Kraichak; Narongrit Muangmai; Prateep Duengkae; Kornsorn Srikulnath
Journal:  Cells       Date:  2022-05-27       Impact factor: 7.666

3.  mySyntenyPortal: an application package to construct websites for synteny block analysis.

Authors:  Jongin Lee; Daehwan Lee; Mikang Sim; Daehong Kwon; Juyeon Kim; Younhee Ko; Jaebum Kim
Journal:  BMC Bioinformatics       Date:  2018-06-05       Impact factor: 3.169

4.  Divergent genome evolution caused by regional variation in DNA gain and loss between human and mouse.

Authors:  Reuben M Buckley; R Daniel Kortschak; David L Adelson
Journal:  PLoS Comput Biol       Date:  2018-04-20       Impact factor: 4.475

5.  TarSynFlow, a workflow for bacterial genome comparisons that revealed genes putatively involved in the probiotic character of Shewanella putrefaciens strain Pdp11.

Authors:  Pedro Seoane; Silvana T Tapia-Paniagua; Rocío Bautista; Elena Alcaide; Consuelo Esteve; Eduardo Martínez-Manzanares; M Carmen Balebona; M Gonzalo Claros; Miguel A Moriñigo
Journal:  PeerJ       Date:  2019-02-28       Impact factor: 2.984

6.  Reorganization of 3D genome structure may contribute to gene regulatory evolution in primates.

Authors:  Ittai E Eres; Kaixuan Luo; Chiaowen Joyce Hsiao; Lauren E Blake; Yoav Gilad
Journal:  PLoS Genet       Date:  2019-07-19       Impact factor: 6.020

7.  The JAX Synteny Browser for mouse-human comparative genomics.

Authors:  Georgi Kolishovski; Anna Lamoureux; Paul Hale; Joel E Richardson; Jill M Recla; Omoluyi Adesanya; Al Simons; Govindarajan Kunde-Ramamoorthy; Carol J Bult
Journal:  Mamm Genome       Date:  2019-11-27       Impact factor: 2.957

8.  Collaborative Cross Mice Yield Genetic Modifiers for Pseudomonas aeruginosa Infection in Human Lung Disease.

Authors:  Nicola Ivan Lorè; Barbara Sipione; Gengming He; Lisa J Strug; Hanifa J Atamni; Alexandra Dorman; Richard Mott; Fuad A Iraqi; Alessandra Bragonzi
Journal:  mBio       Date:  2020-03-03       Impact factor: 7.867

9.  Comprehensive analysis of long noncoding RNA expression in dorsal root ganglion reveals cell-type specificity and dysregulation after nerve injury.

Authors:  Georgios Baskozos; John M Dawes; Jean S Austin; Ana Antunes-Martins; Lucy McDermott; Alex J Clark; Teodora Trendafilova; Jon G Lees; Stephen B McMahon; Jeffrey S Mogil; Christine Orengo; David L Bennett
Journal:  Pain       Date:  2019-02       Impact factor: 7.926

10.  CHESS enables quantitative comparison of chromatin contact data and automatic feature extraction.

Authors:  Silvia Galan; Nick Machnik; Kai Kruse; Noelia Díaz; Marc A Marti-Renom; Juan M Vaquerizas
Journal:  Nat Genet       Date:  2020-10-19       Impact factor: 41.307

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.