| Literature DB >> 31539030 |
Masahiko Kumagai1, Daiki Nishikawa2, Yoshihiro Kawahara1,3,4, Hironobu Wakimoto5, Ryutaro Itoh2, Norio Tabei2, Tsuyoshi Tanaka1,3,4, Takeshi Itoh1,4.
Abstract
Recent revolutionary advancements in sequencing technologies have made it possible to obtain mass quantities of genome-scale sequence data in a cost-effective manner and have drastically altered molecular biological studies. To utilize these sequence data, genome-wide association studies (GWASs) have become increasingly important. Hence, there is an urgent need to develop a visualization tool that enables efficient data retrieval, integration of GWAS results with diverse information and rapid public release of such large-scale genotypic and phenotypic data. We developed a web-based genome browser TASUKE+ (https://tasuke.dna.affrc.go.jp/), which is equipped with the following functions: (i) interactive GWAS results visualization with genome resequencing data and annotation information, (ii) PCR primer design, (iii) phylogenetic tree reconstruction and (iv) data sharing via the web. GWAS results can be displayed in parallel with polymorphism data, read depths and annotation information in an interactive and scalable manner. Users can design PCR primers for polymorphic sites of interest. In addition, a molecular phylogenetic tree of any region can be reconstructed so that the overall relationship among the examined genomes can be understood intuitively at a glance. All functions are implemented through user-friendly web-based interfaces so that researchers can easily share data with collaborators in remote places without extensive bioinformatics knowledge.Entities:
Keywords: GWAS; data sharing; genome browser; phylogenetic analysis; resequencing data
Mesh:
Year: 2019 PMID: 31539030 PMCID: PMC7207078 DOI: 10.1093/dnares/dsz022
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Schematic view of the TASUKE+ system. Data from genome resequencing (fasta, VCF and BAM files) and a GWAS (qqman format and phenotype data files) are added to a MySQL database with annotation information (GFF) on a backend Linux server running Apache and PHP programs. The server environments and default values for parameters of visualization can be specified in a configuration file (Conf). Genomic views and analytical functions are provided through general web browsers. The web storage feature and the canvas element of HTML5 are used for rapidly displaying massive information from a large genomic region.
Figure 2The main window of TASUKE+ showing GWAS and variant information for resequencing data obtained from 176 rice varieties. (A) Positions on the reference genome and annotation information are shown in the uppermost pane. (B) Annotation information tracks show features provided by a GFF format file (e.g. gene structures and LD blocks). By clicking on a gene, a pop-up window containing detailed information appears. (C) GWAS Manhattan plots are displayed below the annotation pane. (D)The variant frequency (blue) and read depth (grey) for each accession are shown with a colour gradient. Unmapped sites are coloured light yellow. Upon clicking on a block, detailed information for the variants pops up. (E) SnpEff mode shows the variant effect levels, namely, MODIFIER (sky blue), LOW (yellow), MODERATE (orange) and HIGH (red), inferred by SnpEff. (F) Variants can also be shown by their precise genomic positions with nucleotides coloured orange for A, pink for T, green for G and blue for C.
Figure 3Exploration of GWAS data. (A) GWAS results are displayed in an interactive Manhattan plot, and a table of summary statistics and variant effects is displayed in a subwindow. (B) Upon clicking on a position in the plot or table, the main window of TASUKE+ moves to the corresponding position. The GWAS results, variants and annotation information can be viewed interactively. (C) Detailed information of variants and possible functional effects inferred by snpEff. Primers for selected sites or regions of interest can be designed quickly. (D) Gene structure of Os01g0846450/LOC_Os01g62780 and a sub-window for the gene information, external link, phylogenetic analysis and export functions. (E) Download list of genotypes and annotation information for a user-selected region.
Figure 4Phylogenetic tree reconstruction. (A) Trees of three styles are illustrated by treelib-js. (B) The order of accessions in the main window can be sorted by the order of names that appears in the inferred phylogeny.