| Literature DB >> 18953038 |
Makoto K Shimada1, Ryuzou Matsumoto, Yosuke Hayakawa, Ryoko Sanbonmatsu, Craig Gough, Yumi Yamaguchi-Kabata, Chisato Yamasaki, Tadashi Imanishi, Takashi Gojobori.
Abstract
Creation of a vast variety of proteins is accomplished by genetic variation and a variety of alternative splicing transcripts. Currently, however, the abundant available data on genetic variation and the transcriptome are stored independently and in a dispersed fashion. In order to provide a research resource regarding the effects of human genetic polymorphism on various transcripts, we developed VarySysDB, a genetic polymorphism database based on 187,156 extensively annotated matured mRNA transcripts from 36,073 loci provided by H-InvDB. VarySysDB offers information encompassing published human genetic polymorphisms for each of these transcripts separately. This allows comparisons of effects derived from a polymorphism on different transcripts. The published information we analyzed includes single nucleotide polymorphisms and deletion-insertion polymorphisms from dbSNP, copy number variations from Database of Genomic Variants, short tandem repeats and single amino acid repeats from H-InvDB and linkage disequilibrium regions from D-HaploDB. The information can be searched and retrieved by features, functions and effects of polymorphisms, as well as by keywords. VarySysDB combines two kinds of viewers, GBrowse and Sequence View, to facilitate understanding of the positional relationship among polymorphisms, genome, transcripts, loci and functional domains. We expect that VarySysDB will yield useful information on polymorphisms affecting gene expression and phenotypes. VarySysDB is available at http://h-invitational.jp/varygene/.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18953038 PMCID: PMC2686441 DOI: 10.1093/nar/gkn798
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Data used in VarySysDB
| Number of data available in VarySysDB | Database: name and version (or date of download) | Provider | URL | References | |
|---|---|---|---|---|---|
| H-Inv Transcripts (HITs) | 187 156 | H-InvDB 5.0 | BIRC | (8,9) | |
| SNPs & DIPs | 11 817 893 | dbSNP build 128 | NCBI | (3,14) | |
| STRs | 18 637 | H-InvDB 5.0 | BIRC | (8,9) | |
| SARs | 33 007 | H-InvDB 5.0 | BIRC | (8,9) | |
| CNVs | 11 966 | DGV (hg18.v3) | TCAGf | (4) | |
| LD-bins | 99 921 | D-HaploDB | Kyushu University | (13) | |
| OMIM allelic variants | 950 | OMIM (1.28. 2008) | NCBI | (14,15) | |
| Functional domain | – | InterPro 15.1 | EBI | (16,17) | |
| Structural domain (SCOP) | – | GTOP (2.4. 2008) | NIG | (18) |
aH-Invitational Database Release 5.0 (used human genome sequence version: hg18, NCBI Build36.2).
bBiomedicinal Information Research Center.
cThe number of SNP & DIP data downloaded from dbSNP and analyzed for VarySysDB. The numbers of annotated SNP and HIT pairs are as follows: 568 982 for non-synonymous, 431 433 for synonymous, 747 for synonymous at stop-codon, 7227 for termination, 1510 for stop codon to amino acid, 8945 for NMD.
dNational Center for Biotechnology Information.
eDatabase of Genomic Variants.
fThe Centre for Applied Genomics.
gDatabase of Definitive Haplotypes.
hOnline Mendelian Inheritance in Man™.
iEuropean Bioinformatics Institute.
jPDB 2007-Apr-6, Swissprot 52.1, SCOP 1.69, Pfam 21.0, ProSite 20.0, Wormpep 174, HUGE 2003-11-6(kiaa2038).
kNational Institute of Genetics.
Figure 1.View of polymorphism search page, which is one of the search pages contained in VarySysDB. In the polymorphism search page, users can search the polymorphism data by features, classification and our analysis results such as effects on functional domains and protein 3D structures. Four boxes (‘Search by postion’, ‘Polymorphism features’, ‘Polymorphism classification’, ‘Search for analysis result’) organize the search criteria by subject. When multiple search criteria are specified ‘over’ these boxes, an ‘and’ search is conducted, offering polymorphisms matching all the specified criteria.
System and web-interface design of VarySysDB
| Subsystem | Web-interface | Function |
|---|---|---|
| Varygene 2 | Polymorphism search | Retrieving and displaying genetic polymorphisms. |
| Polymorphism table | Displaying detailed information on polymorphisms. | |
| Transcript search | Retrieving and displaying transcript information. | |
| Transcript table | Displaying detailed information on transcripts. | |
| Sequence view | Displaying cDNA sequence with information on polymorphisms and functional domains. | |
| STR/SAR search | Retrieving and displaying STRs and SARs. | |
| CNV search | Retrieving and displaying CNVs. | |
| CNV table | Displaying detailed information on CNVs. | |
| Keyword search | Retrieving and displaying by ID, gene name or definition. | |
| System information | Displaying summary table showing total numbers of transcripts and polymorphisms in VaryGene2. | |
| LD-Search | – | Retrieving and displaying LD-bins within the specified region. |
| GBrowse | – | Displaying genomic region specified with HITs, HIXs and polymorphisms. |
Figure 2.Transcript table containing information on HIT, polymorphism mapped on the HIT (SNP classification, STRs and SARs), and functional domain included in the HIT.
Figure 3.Two graphical viewers in VarySysDB. Left: Sequence View containing polymorphism, domain, sequence of HIT and amino acid sequence. Right: GBrowse showing position of SNP, CNV, HIT and HIX.