Literature DB >> 31584087

Animal-ImputeDB: a comprehensive database with multiple animal reference panels for genotype imputation.

Wenqian Yang1, Yanbo Yang1, Cecheng Zhao1, Kun Yang1, Dongyang Wang1, Jiajun Yang1, Xiaohui Niu1, Jing Gong1,2.   

Abstract

Animal-ImputeDB (http://gong_lab.hzau.edu.cn/Animal_ImputeDB/) is a public database with genomic reference panels of 13 animal species for online genotype imputation, genetic variant search, and free download. Genotype imputation is a process of estimating missing genotypes in terms of the haplotypes and genotypes in a reference panel. It can effectively increase the density of single nucleotide polymorphisms (SNPs) and thus can be widely used in large-scale genome-wide association studies (GWASs) using relatively inexpensive and low-density SNP arrays. However, most animals except humans lack high-quality reference panels, which greatly limits the application of genotype imputation in animals. To overcome this limitation, we developed Animal-ImputeDB, which is dedicated to collecting genotype data and whole-genome resequencing data of nonhuman animals from various studies and databases. A computational pipeline was developed to process different types of raw data to construct reference panels. Finally, 13 high-quality reference panels including ∼400 million SNPs from 2265 samples were constructed. In Animal-ImputeDB, an easy-to-use online tool consisting of two popular imputation tools was designed for the purpose of genotype imputation. Collectively, Animal-ImputeDB serves as an important resource for animal genotype imputation and will greatly facilitate research on animal genomic selection and genetic improvement.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2020        PMID: 31584087      PMCID: PMC6943029          DOI: 10.1093/nar/gkz854

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Genotype imputation is a process to predict and impute missing genotypes in terms of the haplotypes and genotypes in a reference panel (1), which plays essential roles in genome-wide association studies (GWASs) or fine mapping studies of a specific region (2,3). Genotype imputation is based on the assumption that two individuals, even if obviously unrelated, share short panels from a distant common ancestor in their genomes. Thus, it is possible to infer unobserved genotypes in one sample via the reference panel, which includes a large set of markers. Most contemporary imputation tools employ a hidden Markov model (HMM) framework to infer the genotype from the estimated haplotypes in a reference panel (4,5). Imputing genotypes at ungenotyped loci could dramatically boost the density of SNPs, increase the power of association studies, improve the ability to fine-map causal variations, facilitate the combination of different studies, and promote meta-analysis (6). Therefore, genotype imputation has been widely used in all kinds of genetic research, especially in humans (7–10). In animals, numerous GWASs have been performed for genomic selection and genetic improvement. Although advances in high-throughput sequencing technologies have reduced the cost of whole-genome sequencing, GWASs usually require thousands of genotyped animals or more, resulting in high genotyping costs (11). Considering the high genotyping cost, most genetic studies still use low-density SNP panels. Some studies have adopted genotype imputation to increase SNP density after the fact (12) and have confirmed the accuracy and necessity of genotype imputation in animals (13–15). In Brangus beef cattle, genotype imputation has not only integrated different samples using different 40k SNP chips but also increased the density of SNPs (14). Additionally, genotype imputation substantially increased the genomic prediction accuracies of the estimated breeding values (EBVs) of ten traits (14). For example, the genomic prediction accuracy of the EBV of calving ease direct (CED) was increased from 0.52 to 0.68 using leave-one-out cross-validation (LOOCV) (14). In a GWAS of lumbar number in Sutai pigs using the original 60K SNP array panel, no significant association between genotypes and lumbar number was observed in 418 Sutai pigs. However, after imputation, a quantitative trait locus (QTL) in SSC1 was identified with a P-value of 9.01 × 10−18, which was close to the location of the potential causative gene NR6A1 (15). A high-quality reference panel is usually a prerequisite for an effective and accurate genome imputation (16). For example, a HapMap 2 CEU reference panel of 60 individuals with 2.1 million markers was applied for genome-wide imputation in humans (17–19). With the rapid development of high-throughput sequencing, the 1000 Genomes Project accumulated low-coverage whole-genome sequencing data of 2504 individuals from 26 populations world-wide (16). Based on this dataset, a reference panel was constructed, which included 5008 haplotypes with over 88 million variants. Recently, the Haplotype Reference Consortium has generated a human reference panel of 64 976 haplotypes with 39 million SNPs by combining 20 studies (20). These high-quality human reference panels make it possible to accurately impute millions of genetic variations for human studies using low-density SNP array panels (21). Recently, some animal reference panels, such as pig and sheep, have been constructed (22,23). However, most animals lack a corresponding high-quality reference panel, which greatly limits the wide application of genotype imputation in animal genetic studies. In addition, the formats of the reference panels vary with the imputation tools, and current genotype imputation tools require researchers to have a certain background knowledge of computer language and bioinformatics, which makes it challenging for general geneticists and biologists to perform genotype imputation. Therefore, it is essential to develop a convenient database to provide these reference panels and imputation tools for animal genetic research. To address this need, we developed the Animal Imputation Database (Animal-ImputeDB, http://gong_lab.hzau.edu.cn/Animal_ImputeDB/), which is dedicated to collecting publicly available genomic sequencing data of 13 animal species, constructing high-quality reference panels, and providing online genotype imputation tools.

DATA COLLECTION AND PROCESSING

Data collection

To collect as many samples as possible, several steps were taken in the data collection process. We first performed a systematic literature search in the PubMed, ISI Web of Science with the language restricted to English using the following main keywords: ‘pig, dog, monkey, duck, chicken, horse, sheep, cattle, buffalo, rabbit, tarpan, panda, or goat’ and ‘genome sequencing, DNA sequencing, resequencing, genome-wide association study, genomic prediction, or GWAS’. The abstracts of these published studies were downloaded and manually checked by three researchers to select eligible studies. The full texts of all eligible articles were downloaded. Then, all references listed in these articles were also examined to identify more relevant literature. Next, researchers manually checked whether the raw genotypes or sequencing data could be downloaded. In addition, several sequencing deposit databases, such as the Sequence Read Archive (SRA, https://www.ncbi.nlm.nih.gov/sra) (24) of the National Center for Biotechnology Information (NCBI, https://www.ncbi.nlm.nih.gov/sra/) (25) and the BIG Data Center (https://bigd.big.ac.cn/) (26), were also explored to find eligible samples. Finally, genotype or whole-genome sequencing (WGS) data of 13 species were collected from different sources to construct the animal genetic reference panels. Collection of genotype data. Genotype data of pig, horse, cattle, goat, buffalo, chicken, tarpan, and panda in variant call format (VCF) were collected from the Genome Variation Map (GVM, https://bigd.big.ac.cn/gvm/home) (27) of BIG Data Center (BIGD, http://bigd.big.ac.cn/). The genotype data of dog (28), sheep (29) and duck (30) were gathered from the National Human Genome Research Institute (NHGRI, https://www.genome.gov/) (28), European Institute of Bioinformatics (EBI, https://www.ebi.ac.uk/) (31) and DUCKbase (http://duckbase.org/home) (30), respectively. Collection of WGS data. The WGS data of rabbit (32,33) and monkey (34–36) were obtained from the NCBI SRA. SNP annotation file. The dbSNP IDs of dog, horse and tarpan were collected from the dbSNP of the NCBI database (ftp://ftp.ncbi.nih.gov/snp/organisms/archive/) (37), and the dbSNP ID of sheep was collected from the Ensembl database (ftp://ftp.ensembl.org/pub/release-75/variation/vcf/ovis_aries/) (38). We downloaded the known variant files and ensured that the reference genome versions were consistent with ours.

Data processing

The raw WGS reads were subjected to quality control using FastQC (version: 0.11.5-Java-1.8.0_92) and cleaned with Trimmomatic (version: 0.36) (39). Subsequently, the clean reads were aligned to the reference genome using Burrows-Wheeler Aligner mem (BWA, version: 0.7.17-r1188) with default parameters (40). The aligned data were merged into a single BAM file, and the processed data were marked for duplicates by using Picard in Genome Analysis Toolkit (GATK, version: 4.0.12.0) (41). The duplicate reads were removed. We further performed variant calling by running HaplotypeCaller and variant refining by variant quality score recalibration (VQSR). Then, running HaplotypeCaller, an intermediate genomic GVCF file for each sample was produced by using GVCF mode, and GenotypeGVCFs in GATK was applied to pool all GVCF files together to create a VCF file of the raw variants. These raw variants identified by GATK were further filtered by using VariantFiltration (35). Default parameters of tools were used in the variant calling methodology. All genotype data were filtered through the following two steps with GATK and Perl scripts: (i) SNP filtration. SNPs were first selected based on the following criteria: QualByDepth < 5.0, FisherStrand > 15.0, RMSMappingQuality < 50.0, ReadPosRankSumTest < −8.0, MappingQualityRankSumTest < −12.5, StrandOddsRatio > 3.0 (42). Then, the SNPs with a call rate < 0.9 or a minor allele frequency (MAF) < 0.01 were removed. (ii) Sample filtration. The animal samples with a genotype call rate < 0.9 were removed (Figure 1).
Figure 1.

Construction of animal reference panels in Animal-ImputeDB. (A) Data collection. (B) Data processing. (C) Database content and web interface.

Construction of animal reference panels in Animal-ImputeDB. (A) Data collection. (B) Data processing. (C) Database content and web interface. The detailed statistics of the genetic variants and sample data of each species in the final dataset are listed in Table 1.
Table 1.

Data summary in Animal-ImputeDB

Reference panel
SpeciesNo. of chromosomeNo. of sampleNo. of SNPs
Ailuropoda melanoleuca (Giant panda)28 354 scaffolds344 671 936
Anas platyrhynchos (Duck)3010612 682 400
Bos taurus (Cattle)309341 808 907
Bubalus bubalis (Swamp buffalo)2420633 245 917
Canis familiaris (Dog)3965861 065 811
Capra hircus (Goat)3023329 889 815
Equus caballus (Horse)325319 257 635
Equus ferus (Tarpan)32197 809 754
Gallus gallus (Chicken)3510326 864 273
Ovis aries (Sheep)2745029 889 815
Sus scrofa (Pig)1923340 323 709
Macaca mulatta (Monkey)213047 332 297
Oryctolagus cuniculus (Rabbit)224640 420 337
Data summary in Animal-ImputeDB

Reference panel construction

Haplotypes of each species were constructed by Beagle (v5.0) (43) using clean SNP data with the default parameters. The reference panels were converted from VCF to M3VCF format by Minimac3 (2). Beagle (v5.0), Impute2, and Minimac3 are the most frequently used tools for genotype imputation. All these tools are similar in accuracy, but differ in memory requirements and computation time (44). Beagle is computationally fast and highly efficient in memory. Minimac3 is also superior to Impute2 in these two aspects (44). Furthermore, Beagle and Minimac3 are widely applied in animal genotype imputation. Therefore, we provided the reference panels in VCF and M3VCF formats corresponding to Beagle and Minimac3 in our database.

SNP annotation

For each species with a SNP annotation file, we mapped the dbSNP ID to the SNPs in our reference panels according to chromosome position. Furthermore, the allele frequency of each SNP was calculated. The above steps were performed with in-house scripts.

IMPLEMENTATION

Animal-ImputeDB (http://gong_lab.hzau.edu.cn/Animal_ImputeDB/) was built based on the Flask (version 1.0.3) framework with AngularJS (version 1.6.1) as the JavaScript library, running on the Apache 2 web server (version 2.4.18) with MongoDB (version 3.4.2) as its database engine. Animal-ImputeDB is available online without registration and optimized for Chrome (recommended), Internet Explorer, Opera, Firefox, Windows Edge and macOS Safari.

DATABASE CONTENT AND USAGE

Samples of 13 species in Animal-ImputeDB

In total, ∼400 million SNPs of 2265 samples from 13 species were deposited in Animal-ImputeDB. The detailed information, including the number of samples per species, the number of chromosomes, genome version, and the number of SNPs, is shown in Table 1 and displayed on the ‘Home’ page (Figure 2A). The species information, including the basic animal introduction, genome size, and the number of chromosomes, is presented in the ‘Species information’ module, which can be accessed by clicking the animal photos on the ‘Home’ page (Figure 2B). The detailed sample information of each species is described in the ‘Sample information’ module. The information including PubMed ID, journal, publication year of article, sample number, material, technology, platform, data type, and sequencing coverage of the project is provided. Users could obtain more information at NCBI PubMed Central (PMC) by clicking the ‘PubMed ID’ hyperlink on the list.
Figure 2.

Overview of the Animal-ImputeDB database. (A) The main functions in Animal-ImputeDB, including ‘Imputation’, ‘Reference Panel’ and ‘Download’ modules. (B) The species included in Animal-ImputeDB. (C) The search box of SNP in Animal-ImputeDB. (D) An example of search results after inputting ‘Chr1:192–420’ in the ‘SNP search’ section of ‘cattle’.

Overview of the Animal-ImputeDB database. (A) The main functions in Animal-ImputeDB, including ‘Imputation’, ‘Reference Panel’ and ‘Download’ modules. (B) The species included in Animal-ImputeDB. (C) The search box of SNP in Animal-ImputeDB. (D) An example of search results after inputting ‘Chr1:192–420’ in the ‘SNP search’ section of ‘cattle’.

The imputation accuracy using reference panels in Animal-ImputeDB

To validate the performance of reference panels and imputation process, we calculated imputation accuracy for seven species with sample size larger than 100 using 5-fold cross-validation strategy. For each species, individuals were randomly divided into five folds. Each time, one-fold was selected as the study population, and the remaining individuals were used as the reference panel. Since most commercial SNP arrays of animals contain about 50k probes (14,15), we randomly selected 50 000 SNPs on autosomes of the study populations and masked other SNPs. Then we used Beagle and Minimac3 to impute the genotypes with default parameters. In this way, we have both the true and imputed genotypes. The imputed SNPs with MAF ≥0.01 and estimated squared correlation ≥0.3 were remained as properly imputed variants and used for the following evaluation. Two values were used to evaluate the accuracy of imputation. One is the concordance rate (CR), which is calculated as the number of genotypes imputed correctly divided by total imputed genotypes per species. The other value is the squared correlation (R2) between true and imputed genotypes. The accuracy of imputation was the mean CR or R2 across five folds for each species. The results of imputation are summarized in Table 2. After imputation, the number of SNPs increased by 8.0–95.8 folds when using 50k markers in the study population. The average CRs for all test species were greater than 0.8. The average R2 of Beagle ranged from 0.679 for duck to 0.812 for sheep, and the average R2 of Minimac3 ranged from 0.751 for duck to 0.856 for sheep. These results indicate that our reference panels and used imputation tools have good performances, which can greatly increase the number of SNPs with relatively high accuracy.
Table 2.

The imputation accuracy using reference panels in Animal-ImputeDB

Beagle imputation resultsMinimac3 imputation results
No. of imputed SNPs (mean±SD)Increased foldCR (mean±SD) R 2 (mean±SD)No. of imputed SNPs (mean±SD)Increased foldCR (mean±SD) R2(mean±SD)
Buffalo1 618 065±51 92432.40.835±0.0100.756±0.010333 402±11 4246.70.900±0.0060.843±0.006
Chicken1 637 061±218 23832.70.939±0.0310.772±0.052519 892±100 06210.40.946±0.0310.824±0.036
Dog449 768±11 34390.871±0.0060.733±0.012221 222±8 9324.40.905±0.0060.799±0.014
Duck750 920±14 269150.813±0.0150.679±0.023293 485±9 2855.90.865±0.0120.751±0.021
Goat797 748±20 260160.888±0.0090.807±0.018320 904±10 7516.40.920±0.0100.856±0.018
Pig4 792 133±390 22795.80.929±0.0310.751±0.0332 072 512±327 54641.50.950±0.0220.818±0.030
Sheep1 239 606±11 60424.80.859±0.0030.812±0.002399 671±14 6658.00.905±0.0020.856±0.003

CR: concordance rate between true and imputed genotypes.

R 2: squared correlation between true and imputed genotypes.

The imputation accuracy using reference panels in Animal-ImputeDB CR: concordance rate between true and imputed genotypes. R 2: squared correlation between true and imputed genotypes.

Web interface

The Animal-ImputeDB database provides a user-friendly interface. It contains three main modules, namely, ‘Imputation’ for online genotype imputation, ‘Reference Panel’ for SNP search, and ‘Download’ for reference panel download. Users can access the ‘Imputation/Reference Panel/Download’ modules by clicking the corresponding buttons on the ‘Home’ page (Figure 2A) or by clicking on the hyperlink embedded in the corresponding animal photo (Figure 2B) in the ‘Module’ section on the ‘Home’ page. These ‘Imputation/Reference Panel/Download’ modules provide the functions ‘Species Information/Online Imputation/SNP search/Sample information’ (Figure 2C). Animal-ImputeDB provides detailed supporting documentation on the ‘Help’ page, and it is open to any feedback with email address provided on the ‘Contact’ page.

SNPs of 13 curated species for searching and browsing in Animal-ImputeDB

To support SNP search and browse, the ‘Reference Panel’ page provides an advanced search box for different species. SNPs can be browsed by inputting the specific chromosomal region (e.g. Chr1: 192–340), SNP ID (e.g. bta10), dbSNP ID (e.g. rs42801761), or MAF (e.g. >0.05). Fuzzy queries are used in the search, and the query results are displayed in a table containing the basic SNP information, including SNP ID, chromosome, position, allele, minor allele frequency, and dbSNP ID. For example, when users select ‘Cattle’ and enter ‘Chr1: 192–340’ in the ‘Region’ box, the query results will be returned as shown in Figure 2D. The returned tables can be sorted by clicking on a specific column header. In addition, the query results can be exported to a tab-separated file and saved by clicking the ‘Download’ button (Figure 2D). To help users find more detailed SNP information, the dbSNP IDs in the query results are linked to the dbSNP database.

Online imputation for 13 curated species in Animal-ImputeDB

On the ‘Imputation’ page, Animal-ImputeDB provides an easy-to-use online tool consisting of two free and popular tools, namely, Beagle and Minimac3, for genotype imputation. There are two ways to navigate to the ‘imputation’ module: (i) by clicking on ‘Imputation’ in the ‘Home’ page browser bar and (ii) by clicking on the hyperlink in the corresponding species photo on the ‘Home’ page. Users can enter or copy pending processed genotype data into the text box (Figure 3A) or upload the genotype data directly via the ‘Choose File’ button. The genotype data should be input in VCF format with annotation information. An example of genotype data in the VCF format can be obtained by clicking the ‘Example’ button above the input box. After uploading the candidate genotype data, users should select one of the two tools (Figure 3B), enter the chromosome region, and click the ‘Submit’ button to submit the inquiry (Figure 3C). Then, the imputation results will be returned as a VCF format file and can be downloaded freely (Figure 3D).
Figure 3.

Online imputation tool in the Animal-ImputeDB database. (A) Input data through typing genotypes or uploading a VCF file by clicking the ‘Choose File’ button. (B) Select an imputation tool (Beagle or Minimac3) and enter the chromosome region of interest. (C) Submit the imputation task to Animal-ImputeDB. (D) An example of an imputation result.

Online imputation tool in the Animal-ImputeDB database. (A) Input data through typing genotypes or uploading a VCF file by clicking the ‘Choose File’ button. (B) Select an imputation tool (Beagle or Minimac3) and enter the chromosome region of interest. (C) Submit the imputation task to Animal-ImputeDB. (D) An example of an imputation result.

Reference panels of 13 curated species for download in Animal-ImputeDB

Reference panels for 13 species are publicly available on the ‘Download’ page of Animal-ImputeDB. These 13 reference panels support both VCF and M3VCF file formats (text and binary), so users can download a reference panel in either VCF format or M3VCF format according to their own tool requirements. The M3VCF file is usable by only the Minimac3 tool stores a large reference panel in a compact manner, whereas the VCF file can be widely applied by most popular imputation tools. Our database provides a total of ∼400G data for users to download.

SUMMARY AND FUTURE DIRECTIONS

Rapid progress has been seen in animal genome research in recent decades. Several animal-related databases have been widely used by animal researchers, such as AnimalQTLdb (45,46) and AnimalTFDB (47). However, no convenient database is available for animal genotype imputation. In this study, we developed the Animal-ImputeDB database by collecting publicly available data, constructing reference panels of 13 curated species, and designing an easy-to-use online genotype imputation tool. Reference panels of 13 animal species could be downloaded and used for the corresponding animal studies to increase the power of GWAS and to aid fine mapping of causative variants. All the SNPs of reference panels could be browsed and downloaded in our database. For user convenience, we linked the SNPs in Animal-ImputeDB to the NCBI SRA, BIGD, EBI, NCBI dbSNP, and NCBI PMC. With this easy-to-use online tool, researchers without coding experience can also perform genotype imputation easily. We believe that Animal-ImputeDB, with multiple animal reference panels and online imputation tools, will be a valuable resource for the field of animal breeding and genetic improvement. Recent next-generation sequencing technology and imputation algorithm advances provide us with unprecedented opportunities to construct animal reference panels for genotype imputation. In the future, we will annually perform a systematic literature search to integrate more samples and species into Animal-ImputeDB and continue to update the database. Future database development will focus on response to community needs and functional annotations to improve the efficiency and comprehensiveness of the database. Collectively, we will maintain Animal-ImputeDB as an informative and valuable resource for animal genetic research.
  47 in total

1.  High imputation accuracy from informative low-to-medium density single nucleotide polymorphism genotypes is achievable in sheep1.

Authors:  Aine C O'Brien; Michelle M Judge; Sean Fair; Donagh P Berry
Journal:  J Anim Sci       Date:  2019-04-03       Impact factor: 3.159

Review 2.  Genotype Imputation from Large Reference Panels.

Authors:  Sayantan Das; Gonçalo R Abecasis; Brian L Browning
Journal:  Annu Rev Genomics Hum Genet       Date:  2018-05-23       Impact factor: 8.929

Review 3.  Genotype imputation.

Authors:  Yun Li; Cristen Willer; Serena Sanna; Gonçalo Abecasis
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

4.  Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle.

Authors:  Hans D Daetwyler; Aurélien Capitan; Hubert Pausch; Paul Stothard; Rianne van Binsbergen; Rasmus F Brøndum; Xiaoping Liao; Anis Djari; Sabrina C Rodriguez; Cécile Grohs; Diane Esquerré; Olivier Bouchez; Marie-Noëlle Rossignol; Christophe Klopp; Dominique Rocha; Sébastien Fritz; André Eggen; Phil J Bowman; David Coote; Amanda J Chamberlain; Charlotte Anderson; Curt P VanTassell; Ina Hulsegge; Mike E Goddard; Bernt Guldbrandtsen; Mogens S Lund; Roel F Veerkamp; Didier A Boichard; Ruedi Fries; Ben J Hayes
Journal:  Nat Genet       Date:  2014-07-13       Impact factor: 38.330

5.  AnimalQTLdb: a livestock QTL database tool set for positional QTL information mining and beyond.

Authors:  Zhi-Liang Hu; Eric Ryan Fritz; James M Reecy
Journal:  Nucleic Acids Res       Date:  2006-11-29       Impact factor: 16.971

6.  Hyperlipidemia-associated gene variations and expression patterns revealed by whole-genome and transcriptome sequencing of rabbit models.

Authors:  Zhen Wang; Jifeng Zhang; Hong Li; Junyi Li; Manabu Niimi; Guohui Ding; Haifeng Chen; Jie Xu; Hongjiu Zhang; Ze Xu; Yulin Dai; Tuantuan Gui; Shengdi Li; Zhi Liu; Sujuan Wu; Mushui Cao; Lu Zhou; Xingyu Lu; Junxia Wang; Jing Yang; Yunhe Fu; Dongshan Yang; Jun Song; Tianqing Zhu; Shen Li; Bo Ning; Ziyun Wang; Tomonari Koike; Masashi Shiomi; Enqi Liu; Luonan Chen; Jianglin Fan; Y Eugene Chen; Yixue Li
Journal:  Sci Rep       Date:  2016-06-01       Impact factor: 4.379

7.  Whole-genome characterization in pedigreed non-human primates using genotyping-by-sequencing (GBS) and imputation.

Authors:  Benjamin N Bimber; Michael J Raboin; John Letaw; Kimberly A Nevonen; Jennifer E Spindel; Susan R McCouch; Rita Cervera-Juanes; Eliot Spindel; Lucia Carbone; Betsy Ferguson; Amanda Vinson
Journal:  BMC Genomics       Date:  2016-08-24       Impact factor: 3.969

8.  Imputation-Based Whole-Genome Sequence Association Study Rediscovered the Missing QTL for Lumbar Number in Sutai Pigs.

Authors:  Guorong Yan; Ruimin Qiao; Feng Zhang; Wenshui Xin; Shijun Xiao; Tao Huang; Zhiyan Zhang; Lusheng Huang
Journal:  Sci Rep       Date:  2017-04-04       Impact factor: 4.379

9.  The European Bioinformatics Institute in 2018: tools, infrastructure and training.

Authors:  Charles E Cook; Rodrigo Lopez; Oana Stroe; Guy Cochrane; Cath Brooksbank; Ewan Birney; Rolf Apweiler
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies.

Authors:  Sanne van den Berg; Jérémie Vandenplas; Fred A van Eeuwijk; Aniek C Bouwman; Marcos S Lopes; Roel F Veerkamp
Journal:  Genet Sel Evol       Date:  2019-01-24       Impact factor: 4.297

View more
  5 in total

1.  RefRGim: an intelligent reference panel reconstruction method for genotype imputation with convolutional neural networks.

Authors:  Shuo Shi; Qiheng Qian; Shuhuan Yu; Qi Wang; Jinyue Wang; Jingyao Zeng; Zhenglin Du; Jingfa Xiao
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

2.  Aquaculture Molecular Breeding Platform (AMBP): a comprehensive web server for genotype imputation and genetic analysis in aquaculture.

Authors:  Qifan Zeng; Baojun Zhao; Hao Wang; Mengqiu Wang; Mingxuan Teng; Jingjie Hu; Zhenmin Bao; Yangfan Wang
Journal:  Nucleic Acids Res       Date:  2022-05-25       Impact factor: 19.160

3.  Plant-ImputeDB: an integrated multiple plant reference panel database for genotype imputation.

Authors:  Yingjie Gao; Zhiquan Yang; Wenqian Yang; Yanbo Yang; Jing Gong; Qing-Yong Yang; Xiaohui Niu
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

4.  Animal-eRNAdb: a comprehensive animal enhancer RNA database.

Authors:  Weiwei Jin; Guanghui Jiang; Yanbo Yang; Jianye Yang; Wenqian Yang; Dongyang Wang; Xiaohui Niu; Rong Zhong; Zhao Zhang; Jing Gong
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

5.  PHARP: a pig haplotype reference panel for genotype imputation.

Authors:  Zhen Wang; Zhenyang Zhang; Zitao Chen; Jiabao Sun; Caiyun Cao; Fen Wu; Zhong Xu; Wei Zhao; Hao Sun; Longyu Guo; Zhe Zhang; Qishan Wang; Yuchun Pan
Journal:  Sci Rep       Date:  2022-07-25       Impact factor: 4.996

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.