| Literature DB >> 31584087 |
Wenqian Yang1, Yanbo Yang1, Cecheng Zhao1, Kun Yang1, Dongyang Wang1, Jiajun Yang1, Xiaohui Niu1, Jing Gong1,2.
Abstract
Animal-ImputeDB (http://gong_lab.hzau.edu.cn/Animal_ImputeDB/) is a public database with genomic reference panels of 13 animal species for online genotype imputation, genetic variant search, and free download. Genotype imputation is a process of estimating missing genotypes in terms of the haplotypes and genotypes in a reference panel. It can effectively increase the density of single nucleotide polymorphisms (SNPs) and thus can be widely used in large-scale genome-wide association studies (GWASs) using relatively inexpensive and low-density SNP arrays. However, most animals except humans lack high-quality reference panels, which greatly limits the application of genotype imputation in animals. To overcome this limitation, we developed Animal-ImputeDB, which is dedicated to collecting genotype data and whole-genome resequencing data of nonhuman animals from various studies and databases. A computational pipeline was developed to process different types of raw data to construct reference panels. Finally, 13 high-quality reference panels including ∼400 million SNPs from 2265 samples were constructed. In Animal-ImputeDB, an easy-to-use online tool consisting of two popular imputation tools was designed for the purpose of genotype imputation. Collectively, Animal-ImputeDB serves as an important resource for animal genotype imputation and will greatly facilitate research on animal genomic selection and genetic improvement.Entities:
Mesh:
Year: 2020 PMID: 31584087 PMCID: PMC6943029 DOI: 10.1093/nar/gkz854
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Construction of animal reference panels in Animal-ImputeDB. (A) Data collection. (B) Data processing. (C) Database content and web interface.
Data summary in Animal-ImputeDB
| Reference panel | |||
|---|---|---|---|
| Species | No. of chromosome | No. of sample | No. of SNPs |
| Ailuropoda melanoleuca (Giant panda) | 28 354 scaffolds | 34 | 4 671 936 |
| Anas platyrhynchos (Duck) | 30 | 106 | 12 682 400 |
| Bos taurus (Cattle) | 30 | 93 | 41 808 907 |
| Bubalus bubalis (Swamp buffalo) | 24 | 206 | 33 245 917 |
| Canis familiaris (Dog) | 39 | 658 | 61 065 811 |
| Capra hircus (Goat) | 30 | 233 | 29 889 815 |
| Equus caballus (Horse) | 32 | 53 | 19 257 635 |
| Equus ferus (Tarpan) | 32 | 19 | 7 809 754 |
| Gallus gallus (Chicken) | 35 | 103 | 26 864 273 |
| Ovis aries (Sheep) | 27 | 450 | 29 889 815 |
| Sus scrofa (Pig) | 19 | 233 | 40 323 709 |
| Macaca mulatta (Monkey) | 21 | 30 | 47 332 297 |
| Oryctolagus cuniculus (Rabbit) | 22 | 46 | 40 420 337 |
Figure 2.Overview of the Animal-ImputeDB database. (A) The main functions in Animal-ImputeDB, including ‘Imputation’, ‘Reference Panel’ and ‘Download’ modules. (B) The species included in Animal-ImputeDB. (C) The search box of SNP in Animal-ImputeDB. (D) An example of search results after inputting ‘Chr1:192–420’ in the ‘SNP search’ section of ‘cattle’.
The imputation accuracy using reference panels in Animal-ImputeDB
| Beagle imputation results | Minimac3 imputation results | |||||||
|---|---|---|---|---|---|---|---|---|
| No. of imputed SNPs (mean±SD) | Increased fold | CR (mean±SD) |
| No. of imputed SNPs (mean±SD) | Increased fold | CR (mean±SD) |
| |
| Buffalo | 1 618 065±51 924 | 32.4 | 0.835±0.010 | 0.756±0.010 | 333 402±11 424 | 6.7 | 0.900±0.006 | 0.843±0.006 |
| Chicken | 1 637 061±218 238 | 32.7 | 0.939±0.031 | 0.772±0.052 | 519 892±100 062 | 10.4 | 0.946±0.031 | 0.824±0.036 |
| Dog | 449 768±11 343 | 9 | 0.871±0.006 | 0.733±0.012 | 221 222±8 932 | 4.4 | 0.905±0.006 | 0.799±0.014 |
| Duck | 750 920±14 269 | 15 | 0.813±0.015 | 0.679±0.023 | 293 485±9 285 | 5.9 | 0.865±0.012 | 0.751±0.021 |
| Goat | 797 748±20 260 | 16 | 0.888±0.009 | 0.807±0.018 | 320 904±10 751 | 6.4 | 0.920±0.010 | 0.856±0.018 |
| Pig | 4 792 133±390 227 | 95.8 | 0.929±0.031 | 0.751±0.033 | 2 072 512±327 546 | 41.5 | 0.950±0.022 | 0.818±0.030 |
| Sheep | 1 239 606±11 604 | 24.8 | 0.859±0.003 | 0.812±0.002 | 399 671±14 665 | 8.0 | 0.905±0.002 | 0.856±0.003 |
CR: concordance rate between true and imputed genotypes.
R 2: squared correlation between true and imputed genotypes.
Figure 3.Online imputation tool in the Animal-ImputeDB database. (A) Input data through typing genotypes or uploading a VCF file by clicking the ‘Choose File’ button. (B) Select an imputation tool (Beagle or Minimac3) and enter the chromosome region of interest. (C) Submit the imputation task to Animal-ImputeDB. (D) An example of an imputation result.