Literature DB >> 31850063

LD-annot: A Bioinformatics Tool to Automatically Provide Candidate SNPs With Annotations for Genetically Linked Genes.

Julien Prunier1,2, Audrey Lemaçon1, Alexandre Bastien3, Mohsen Jafarikia4,5, Ilga Porth2, Claude Robert2, Arnaud Droit1.   

Abstract

A multitude of model and non-model species studies have now taken full advantage of powerful high-throughput genotyping advances such as SNP arrays and genotyping-by-sequencing (GBS) technology to investigate the genetic basis of trait variation. However, due to incomplete genome coverage by these technologies, the identified SNPs are likely in linkage disequilibrium (LD) with the causal polymorphisms, rather than be causal themselves. In addition, researchers could benefit from annotations for the identified candidate SNPs and, simultaneously, for all neighboring genes in genetic linkage. In such case, LD extent estimation surrounding the candidate SNPs is required to determine the regions encompassing genes of interest. We describe here an automated pipeline, "LD-annot," designed to delineate specific regions of interest for a given experiment and candidate polymorphisms on the basis of LD extent, and furthermore, provide annotations for all genes within such regions. LD-annot uses standard file formats, bioinformatics tools, and languages to provide identifiers, coordinates, and annotations for genes in genetic linkage with each candidate polymorphism. Although the focus lies upon SNP arrays and GBS data as they are being routinely deployed, this pipeline can be applied to a variety of datasets as long as genotypic data are available for a high number of polymorphisms and formatted into a vcf file. A checkpoint procedure in the pipeline allows to test several threshold values for linkage without having to rerun the entire pipeline, thus saving the user computational time and resources. We applied this new pipeline to four different sample sets: two breeding populations GBS datasets, one within-pedigree SNP set coming from whole genome sequencing (WGS), and a very large multi-varieties SNP dataset obtained from WGS, representing variable sample sizes, and numbers of polymorphisms. LD-annot performed within minutes, even when very high numbers of polymorphisms are investigated and thus will efficiently assist research efforts aimed at identifying biologically meaningful genetic polymorphisms underlying phenotypic variation. LD-annot tool is available under a GPL license from https://github.com/ArnaudDroitLab/LD-annot.
Copyright © 2019 Prunier, Lemaçon, Bastien, Jafarikia, Porth, Robert and Droit.

Entities:  

Keywords:  SNP annotation; SNP chip analyses; bioinformatics tool; candidate SNP; linkage disequilibrium; variant call format (VCF)

Year:  2019        PMID: 31850063      PMCID: PMC6889475          DOI: 10.3389/fgene.2019.01192

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


  26 in total

1.  Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data.

Authors:  Benilton Carvalho; Henrik Bengtsson; Terence P Speed; Rafael A Irizarry
Journal:  Biostatistics       Date:  2006-12-22       Impact factor: 5.899

2.  SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap.

Authors:  Andrew D Johnson; Robert E Handsaker; Sara L Pulit; Marcia M Nizzari; Christopher J O'Donnell; Paul I W de Bakker
Journal:  Bioinformatics       Date:  2008-10-30       Impact factor: 6.937

3.  Population genomic signatures of divergent adaptation, gene flow and hybrid speciation in the rapid radiation of Lake Victoria cichlid fishes.

Authors:  I Keller; C E Wagner; L Greuter; S Mwaiko; O M Selz; A Sivasundar; S Wittwer; O Seehausen
Journal:  Mol Ecol       Date:  2012-11-05       Impact factor: 6.185

Review 4.  After GWAS: mice to the rescue?

Authors:  Joerg Ermann; Laurie H Glimcher
Journal:  Curr Opin Immunol       Date:  2012-09-29       Impact factor: 7.486

5.  Adaptive evolution during an ongoing range expansion: the invasive bank vole (Myodes glareolus) in Ireland.

Authors:  Thomas A White; Sarah E Perkins; Gerald Heckel; Jeremy B Searle
Journal:  Mol Ecol       Date:  2013-05-24       Impact factor: 6.185

6.  Identification of loci governing eight agronomic traits using a GBS-GWAS approach and validation by QTL mapping in soya bean.

Authors:  Humira Sonah; Louise O'Donoughue; Elroy Cober; Istvan Rajcan; François Belzile
Journal:  Plant Biotechnol J       Date:  2014-09-12       Impact factor: 9.803

Review 7.  Efficient genome-wide genotyping strategies and data integration in crop plants.

Authors:  Davoud Torkamaneh; Brian Boyle; François Belzile
Journal:  Theor Appl Genet       Date:  2018-01-19       Impact factor: 5.699

8.  A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species.

Authors:  Robert J Elshire; Jeffrey C Glaubitz; Qi Sun; Jesse A Poland; Ken Kawamoto; Edward S Buckler; Sharon E Mitchell
Journal:  PLoS One       Date:  2011-05-04       Impact factor: 3.240

9.  Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications.

Authors:  Andy Rimmer; Hang Phan; Iain Mathieson; Zamin Iqbal; Stephen R F Twigg; Andrew O M Wilkie; Gil McVean; Gerton Lunter
Journal:  Nat Genet       Date:  2014-07-13       Impact factor: 38.330

10.  Developing genome-wide SNPs and constructing an ultrahigh-density linkage map in oil palm.

Authors:  Bin Bai; Le Wang; Ying Jun Zhang; May Lee; Rahmadsyah Rahmadsyah; Yuzer Alfiko; Bao Qing Ye; Sigit Purwantomo; Antonius Suwanto; Nam-Hai Chua; Gen Hua Yue
Journal:  Sci Rep       Date:  2018-01-12       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.