| Literature DB >> 25178365 |
Joon-Ho Lee1, Taeheon Lee2, Hak-Kyo Lee1, Byung-Wook Cho3, Dong-Hyun Shin2, Kyoung-Tag Do4, Samsun Sung5, Woori Kwak6, Hyeon Jeong Kim5, Heebal Kim7, Seoae Cho5, Kyung-Do Park1.
Abstract
Genetics is important for breeding and selection of horses but there is a lack of well-established horse-related browsers or databases. In order to better understand horses, more variants and other integrated information are needed. Thus, we construct a horse genomic variants database including expression and other information. Horse Single Nucleotide Polymorphism and Expression Database (HSDB) (http://snugenome2.snu.ac.kr/HSDB) provides the number of unexplored genomic variants still remaining to be identified in the horse genome including rare variants by using population genome sequences of eighteen horses and RNA-seq of four horses. The identified single nucleotide polymorphisms (SNPs) were confirmed by comparing them with SNP chip data and variants of RNA-seq, which showed a concordance level of 99.02% and 96.6%, respectively. Moreover, the database provides the genomic variants with their corresponding transcriptional profiles from the same individuals to help understand the functional aspects of these variants. The database will contribute to genetic improvement and breeding strategies of Thoroughbreds.Entities:
Keywords: Database; Expression; Horse; Thoroughbred; Variants
Year: 2014 PMID: 25178365 PMCID: PMC4150188 DOI: 10.5713/ajas.2013.13694
Source DB: PubMed Journal: Asian-Australas J Anim Sci ISSN: 1011-2367 Impact factor: 2.509
Figure 1Horse SNP database web page. Horse single nucleotide polymorphism and expression database (HSDB) is available at http://snugenome2.snu.ac.kr/HSDB. (A) Users can search SNP or corresponding gene information using SNP position or RS ID or gene symbol. After searching the SNP, user can see the SNPs within the gene, there are several colors depending their effect. User will get more detail information including reference and alternative allele information, SNP position, SNP quality, depth, and effect impact (high, moderate, low, modifier) which definition follows snpEff. HSDB also enables the user to obtain gene annotation information including transcript ID and gene description, gene ontology term and orthology ID with human and mouse. Users can see the linkage disequilibrium map of interest gene. (B) Advanced Search provides similar results of SNP Search, but user can search the SNPs with many filtering options. (C) HSDB provides expression information of muscle and blood from 4 horses both before and after exercise, especially DEG information. User can search the expression information according to their interest using the Advanced Tools of Expression Search. SNP, single nucleotide polymorphism.
Identified variants from re-sequencing and RNA-seq
| Region | Seq | RNA-seq | ||
|---|---|---|---|---|
|
|
| |||
| Count | Percent | Count | Percent | |
| Downstream (5kb) | 532,802 | 12.09 | 110,047 | 17.66 |
| Exon | 94,502 | 2.15 | 40,856 | 6.56 |
| Intron | 3,213,091 | 72.92 | 385,484 | 61.88 |
| Splice site acceptor | 362 | 0.01 | 4,912 | 0.79 |
| Splice site donor | 485 | 0.01 | 10,360 | 1.66 |
| Upstream (5kb) | 555,335 | 12.60 | 65,621 | 10.53 |
| UTR 3′ | 6,198 | 0.14 | 3,641 | 0.58 |
| UTR 5′ | 3,552 | 0.08 | 2,069 | 0.33 |
UTR, untranslated region.
Figure 2Genotype concordance. Genotype concordance rate which is the proportion of concordant calls having a consistent same genotype between the SNP chip and re-sequencing datasets of 11 samples. Loci of SNP chip were filtered with Hardy-Weinberg equilibrium p-value <0.001, SNP call rate <99%. Type of genotypes are showed as 0/0, 0/1, 1/1 and “./.”. 0 and 1 represents reference allele, alternative allele and “.” means missing allele. SNP, single nucleotide polymorphism.
The number of variants
| Chrom | Chom length | Genic region length | DNA-Seq | RNA-seq | ||
|---|---|---|---|---|---|---|
|
|
| |||||
| Number of SNPs | Average interval of SNP | Number of SNPs | Average interval of SNP | |||
| 1 | 185,838,109 | 85,058,767 | 265,638 | 320 | 32,909 | 2,585 |
| 2 | 120,857,687 | 54,020,259 | 181,513 | 298 | 25,093 | 2,153 |
| 3 | 119,479,920 | 47,558,748 | 168,436 | 282 | 19,786 | 2,404 |
| 4 | 108,569,075 | 48,153,912 | 157,729 | 305 | 15,086 | 3,192 |
| 5 | 99,680,356 | 49,305,787 | 156,993 | 314 | 21,656 | 2,277 |
| 6 | 84,719,076 | 40,835,311 | 140,519 | 291 | 17,360 | 2,352 |
| 7 | 98,542,428 | 46,347,586 | 154,512 | 300 | 19,013 | 2,438 |
| 8 | 94,057,673 | 37,008,741 | 133,993 | 276 | 18,050 | 2,050 |
| 9 | 83,561,422 | 29,609,146 | 102,049 | 290 | 11,771 | 2,515 |
| 10 | 83,980,604 | 38,905,681 | 132,805 | 293 | 21,489 | 1,810 |
| 11 | 61,308,211 | 42,313,527 | 121,290 | 349 | 22,887 | 1,849 |
| 12 | 33,091,231 | 19,338,230 | 89,790 | 215 | 11,944 | 1,619 |
| 13 | 42,578,167 | 25,459,957 | 91,239 | 279 | 16,823 | 1,513 |
| 14 | 93,904,894 | 36,769,770 | 117,602 | 313 | 15,874 | 2,316 |
| 15 | 91,571,448 | 35,044,790 | 114,827 | 305 | 16,148 | 2,170 |
| 16 | 87,365,405 | 41,746,323 | 143,468 | 291 | 19,294 | 2,164 |
| 17 | 80,757,907 | 24,276,765 | 80,866 | 300 | 8,364 | 2,903 |
| 18 | 82,527,541 | 31,749,417 | 101,681 | 312 | 12,322 | 2,577 |
| 19 | 59,975,221 | 24,180,234 | 87,671 | 276 | 9,236 | 2,618 |
| 20 | 64,166,202 | 27,844,102 | 127,436 | 218 | 18,589 | 1,498 |
| 21 | 57,723,302 | 20,506,447 | 75,458 | 272 | 9,486 | 2,162 |
| 22 | 49,946,797 | 23,385,284 | 74,281 | 315 | 11,390 | 2,053 |
| 23 | 55,726,280 | 19,324,601 | 69,413 | 278 | 9,026 | 2,141 |
| 24 | 46,749,900 | 21,845,462 | 74,277 | 294 | 10,803 | 2,022 |
| 25 | 39,536,964 | 22,453,941 | 71,426 | 314 | 11,890 | 1,888 |
| 26 | 41,866,177 | 11,026,255 | 46,915 | 235 | 5,391 | 2,045 |
| 27 | 39,960,074 | 12,495,321 | 52,320 | 239 | 5,080 | 2,460 |
| 28 | 46,177,339 | 22,368,313 | 78,472 | 285 | 10,698 | 2,091 |
| 29 | 33,672,925 | 12,545,609 | 48,395 | 259 | 5,476 | 2,291 |
| 30 | 30,062,385 | 12,311,357 | 51,249 | 240 | 5,401 | 2,279 |
| 31 | 24,984,650 | 10,541,353 | 44,371 | 238 | 5,390 | 1,956 |
| Total | 2,242,939,370 | 974,330,996 | 3,356,634 | 290 | 443,725 | 2,196 |
SNP, single nucleotide polymorphism.
Genic region length (upstream, downstream 5k).
Minor allele frequency of data
| DNA-Seq | RNA-seq | |
|---|---|---|
| MAF | 0.055556 | 0.125 |
MAF, minor allele frequency.