| Literature DB >> 31762961 |
Wei Zhou1, Lei Wang2, Wenming Zheng3, Wen Yao3.
Abstract
With the rapid decreasing of sequencing cost, large volume of genotype data has been generated in many organisms based on high-throughput sequencing, which was utilized in various fields of biological studies in the post-genome era. The raw sequencing data were usually deposited in the NCBI SRA database. Construction of the database to store and analyze the processed genotype data is an essential step for the utilization of the genotype data by the community. Up to now, a comprehensive genotype database is still missing from maize, which is an important crop of the world. We report the construction of the MaizeSNPDB database using genotype data of 1210 maize line across 35,370,939 SNP sites refined from a large set of genomic variations reported by the maize HapMap 3 project. We further implemented several genetic analysis programs as graphical interfaces in the MaizeSNPDB database. SNPs in user-specified genomic regions could be easily extracted and analyzed in MaizeSNPDB. The whole dataset and code of MaizeSNPDB is available at https://github.com/venyao/MaizeSNPDB. MaizeSNPDB is deployed at http://150.109.59.144:3838/MaizeSNPDB/ for online use. The MaizeSNPDB database is of great value to future maize functional genomic studies, which can also facilitate marker-assisted breeding in maize.Entities:
Keywords: Database; Genomic variation; Genotype; Graphical interface; Maize; SNP
Year: 2019 PMID: 31762961 PMCID: PMC6861670 DOI: 10.1016/j.csbj.2019.10.003
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1The graphical interface of the MaizeSNPDB database. A total of 8 menus for different functionalities are implemented in the database. The “Browser” menu is shown to illustrate the various widgets, which are designed to receive with user input. A text-input widget is provided for users to specify the genomic region to be visualized. Inverted triangles with differing colors in the main plot region represent SNP sites in user-specified genomic region. Three download widgets on top of the main plot region is designed for downloading of data in the specified genomic region. Three widgets below the main plot region is provided for users to screen the SNP sites displayed in the genome browser. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2An example linkage disequilibrium heatmap created using the functionality of MaizeSNPDB. Linkage disequilibrium measured in r2 is represented as an inverted-triangle heatmap. The color of different cells of the heatmap denotes the value of r2 as indicated in the color key. The structure of annotated gene models is displayed on top of the heatmap. The positions of SNP sites are represented as black ticks along the gene models. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3Sequence diversity in wild and modern maize lines around the genomic region harboring the tb1 gene. The top panel shows the nucleotide diversity among wild maize lines (Z. mays ssp. parviglumis) and modern maize lines (improved) indicated by different colors. The bottom panel demonstrate the division of nucleotide diversity in the two populations. X-axis, coordinates of the genomic region around tb1 (Zm00001d033673). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4NJ tree constructed based on SNPs around the tb1 gene using the phylogenetic functionality of MaizeSNPDB. The NJ tree is shown in circular format using the ggtree package. Each tip of the tree represents a maize line. Colors in the inner circle denote the cultivated maize lines while colors in the outer circle indicate the wild maize lines. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)