| Literature DB >> 19073702 |
Guoqing Li1, Lijia Ma, Chao Song, Zhentao Yang, Xiulan Wang, Hui Huang, Yingrui Li, Ruiqiang Li, Xiuqing Zhang, Huanming Yang, Jian Wang, Jun Wang.
Abstract
The YH database is a server that allows the user to easily browse and download data from the first Asian diploid genome. The aim of this platform is to facilitate the study of this Asian genome and to enable improved organization and presentation large-scale personal genome data. Powered by GBrowse, we illustrate here the genome sequences, SNPs, and sequencing reads in the MapView. The relationships between phenotype and genotype can be searched by location, dbSNP ID, HGMD ID, gene symbol and disease name. A BLAST web service is also provided for the purpose of aligning query sequence against YH genome consensus. The YH database is currently one of the three personal genome database, organizing the original data and analysis results in a user-friendly interface, which is an endeavor to achieve fundamental goals for establishing personal medicine. The database is available at http://yh.genomics.org.cn.Entities:
Mesh:
Year: 2009 PMID: 19073702 PMCID: PMC2686535 DOI: 10.1093/nar/gkn966
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Data summary of the YH genome sequence
| Nucleotide | |
| Total | 117.7 Gb |
| Map to genome | 102.9 Gb |
| Coverage of genome | 99.97% |
| Polymorphism | |
| SNP | 3.07 million |
| Indel | 1 35 262 |
| Structural variation | 2682 |
aThe fraction of reference genome which was covered by sequencing reads.
Figure 1.Data generation process.The sequencing reads were mapped to the NCBI reference genome to obtain the YH consensus genome and identify YH variants. Both types of data, combined with related information from dbSNP and HapMap, are presented in MapView. All variants were scanned by the disease alleles in HGMD to generate phenotypic information for the YH genome. Three colors of blocks were used to distinguish (i) data we generated (green); (ii) data from public database (blue); (iii) web pages we provided to show our genome (red).
Figure 2.YH database Screenshot. We provide MapView, phenotype, and BLAST web service as the three main functions of the YH database. Users can easily begin searching or browsing this database for a variety of things such as a phenotype of interest or a risk allele. Access to the MapView can be obtained by clicking its chromosomal location. A BLAST web service for aligning query sequences against the YH consensus sequence is also available. All detailed information of the YH consensus sequence and identified SNPs are presented in the MapView. All sequencing reads mapped to an area in the display window are shown and are overlapped individually to facilitate users in examining each detected SNPs.