| Literature DB >> 29688387 |
David A Schott1, Abhinav G Vinnakota1, John L Portwood2, Carson M Andorf1,2, Taner Z Sen3,4.
Abstract
Database URL: https://www.maizegdb.org/snpversity.Entities:
Mesh:
Year: 2018 PMID: 29688387 PMCID: PMC5912076 DOI: 10.1093/database/bay037
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Technologies used in the creation of the SNPversity tool and their descriptions
| Name | Link | Description | Used by |
|---|---|---|---|
| Chosen | A jQuery plugin that make select boxes more user friendly | Homepage | |
| JSON-simple | Java library used to convert Java Objects into their JSON representation | Tassel wrapper | |
| PostgreSQL | Object-relational database management system | Server Back end (PHP) | |
| BlockUI | jQuery plugin that blocks user interface (loading screen) | Homepage, Query Results page | |
| Toolbar.js | A jQuery plugin that creates tooltip style toolbars | Query Results page | |
| FontAwesome | CSS file including a wide selection of different icons | Query Results page | |
| JQuery | Javascript library designed to simplify client-side scripting | Homepage, Query Results page | |
| Tassel API | Java platform designed for the optimized analysis of crop genomic diversity | Tassel wrapper |
Figure 1.High level workflow and technical overview of SNPversity components. Users interact with the drop-down menus and enter their choices through the SNPversity home page. The necessary data, such as stocks information are pulled and feed into a TASSEL-based wrapper. The wrapper first queries diversity data stored in the HDF5 representation. The wrapper then creates appropriate JSON files for the PHP layer to visualize the data online.
Figure 2.The SNPversity homepage form at MaizeGDB. The homepage contains many data and visualization options that allow the user to reach the right data sets and display the desired information in the desired formats. Please see text for additional details.
Figure 3.An example SNPversity output page. The output page is designed with contrasting colors to facilitate an intuitive understanding of SNP characteristics and differences. The output shows when ‘Medium’ quick example is selected. The colors represent and differentiate major and minor allelic differences: green—major allele, orange—minor allele, red—insertion, aqua—deletion, and grey—unknown value. Please see text for further details.
SNPversity’s query time for representative cases using k-nearest neighbor algorithm (KNN) and multivariable regressions
| Data set | # Datapoints | Mean query times | Standard deviation of query times | KNN error | Multivariable error |
|---|---|---|---|---|---|
| Maize ZeaGBS v2.7 (raw) | 14 | 81 | 68 | 23 | 31 |
| Maize ZeaGBS v2.7 (imputed) | 244 | 46 | 59 | 19 | 28 |
Actual mean and standard deviation of query times for specific data sets are provided in the first four columns. The last two rows results show the mean absolute error between predicted and actual times and are calculated based on the leave-one-out cross-validated (LOOCV) results using KNN and multivariable regression algorithms respectively. Times are expressed in rounded values in seconds. Please see text for further details.
The statistics of the current diversity data sets available at SNPversity for maize from Panzea (10–12)
| Name | Assembly | # Taxa | # SNPs | File size(GB) |
|---|---|---|---|---|
| Maize ZeaGBS v2.7 (imputed) | B73 RefGen_v2 | 17 280 | 955 690 | 5.5 |
| Maize ZeaGBS v2.7 (raw) | B73 RefGen_v2 | 18 013 | 955 690 | 3.8 |
| Maize HapMap v3.2.1 (imputed) | B73 RefGen_v3 | 1210 | 83 153 144 | 34.2 |
| Maize HapMap v3.2.1 (raw) | B73 RefGen_v3 | 1210 | 83 153 144 | 24.7 |