| Literature DB >> 27899667 |
Locedie Mansueto1, Roven Rommel Fuentes1, Frances Nikki Borja1, Jeffery Detras1, Juan Miguel Abriol-Santos1, Dmytro Chebotarov1, Millicent Sanciangco1, Kevin Palis1,2, Dario Copetti3, Alexandre Poliakov4,5, Inna Dubchak4,5, Victor Solovyev6, Rod A Wing1,3, Ruaraidh Sackville Hamilton1, Ramil Mauleon1, Kenneth L McNally1, Nickolai Alexandrov7.
Abstract
We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org.Entities:
Mesh:
Year: 2016 PMID: 27899667 PMCID: PMC5210592 DOI: 10.1093/nar/gkw1135
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Genotype query options (A) and results table (B) with multiple reference genomes alleles. The selected reference genome (Nipponbare) is displayed at the top header row just below the SNP positions. The alleles for the other genomes (Kasalath, DJ 123, IR 64, 93-11) are shown below in the table header. The corresponding positions in the other genomes are displayed in the message box.
Figure 2.Genotype matrix with short indels. The table displays deletions (positions in blue) at anchor positions (region): 27698 (27699), 27791 (27792–27794), 27836 (27837–27841). Insertion regions (positions in green) are at 27722.01 27722.04 and 27797.01. For deletion regions, the reference is copied from the reference genome, while for insertions the reference genome is set to gaps.
Gene loci names we used for rice reference genomes we are using
| Reference genome | Reference | Gene loci names |
|---|---|---|
| 93-11 | ( | Os9311_{YY}g{NNNNNN}, Os9311_{XXXXX}g{NNNNNN} |
| IR 64 | ( | OsIR64_{XXXXX}g{NNNNNN} |
| DJ 123 | ( | OsDJ123{XXXXX}g{NNNNNN} |
| Kasalath | ( | OsKasal{YY}g{NNNNNN} |
| Nipponbare | ( | OsNippo{YY}g{NNNNNN} |
Data sources for genomics data
| Data | Source | URL | Reference |
|---|---|---|---|
| Gene model | MSU v7 | ( | |
| Gene model | RAP | ( | |
| Gene names/symbols | Oryzabase | ( | |
| Gene ontology | MSU v7 | ( | |
| Trait genes | OGRO | ( | |
| Trait ontology-genes | Oryzabase | ( | |
| Plant ontology-genes | Oryzabase | ( | |
| QTL | Q-TARO | ( | |
| Sequence | MSU v7 | ( |
Figure 3.Allele frequency chart with major/minor allele/genotype frequency/count at each SNP position in the queried region for all or each subpopulation.
JBrowse tracks for Nipponbare
| Category | Track names (count) | Reference |
|---|---|---|
| Gene model | MSU7 RAP representative RAP predicted FGenesh++ Merged MSU7, RAP, FGenesh++ | ( |
| Trait Genes | 28 OGRO trait track OGRO all traits genes Oryzabase all trait genes | ( |
| QTL | 28 QTARO QTL tracks QTARO all QTL | ( |
| BAM | 3024 varieties | |
| BAM Coverage | 3024 varieties | |
| VCF | 3024 varieties | |
| Alignment | Nipponbare versus 9311 Nipponbare versus IR64-21 Nipponbare versus DJ123 Nipponbare versus Kasalath | This project |
| Variants | SNPs v2 INDELs v2 SNPs v1 | This project |
Figure 4.Query capabilities of SNP-Seek. The blocks at the left (rounded) are possible query constraints to the query modules (rectangles). The query results may be stored as a list (parallelograms), and used as constraints in further queries. Lists may also be created by the user as initial constraints. The marker annotator accepts a list of SNP positions, which may be the result from experiments or GWAS studies, to generate constraints for further queries or loop back to the initial constraints, increasing the confidence of the association.