Chandler Zuo1, Sunyoung Shin1, Sündüz Keleş1. 1. Department of Statistics and Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA.
Abstract
MOTIVATION: Genome-wide association studies revealed that most disease-associated single nucleotide polymorphisms (SNPs) are located in regulatory regions within introns or in regions between genes. Regulatory SNPs (rSNPs) are such SNPs that affect gene regulation by changing transcription factor (TF) binding affinities to genomic sequences. Identifying potential rSNPs is crucial for understanding disease mechanisms. In silico methods that evaluate the impact of SNPs on TF binding affinities are not scalable for large-scale analysis. RESULTS: We describe A: ffinity T: esting for regulatory SNP: s (atSNP), a computationally efficient R package for identifying rSNPs in silico. atSNP implements an importance sampling algorithm coupled with a first-order Markov model for the background nucleotide sequences to test the significance of affinity scores and SNP-driven changes in these scores. Application of atSNP with >20 K SNPs indicates that atSNP is the only available tool for such a large-scale task. atSNP provides user-friendly output in the form of both tables and composite logo plots for visualizing SNP-motif interactions. Evaluations of atSNP with known rSNP-TF interactions indicate that SNP is able to prioritize motifs for a given set of SNPs with high accuracy. AVAILABILITY AND IMPLEMENTATION: https://github.com/keleslab/atSNP. CONTACT: keles@stat.wisc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Genome-wide association studies revealed that most disease-associated single nucleotide polymorphisms (SNPs) are located in regulatory regions within introns or in regions between genes. Regulatory SNPs (rSNPs) are such SNPs that affect gene regulation by changing transcription factor (TF) binding affinities to genomic sequences. Identifying potential rSNPs is crucial for understanding disease mechanisms. In silico methods that evaluate the impact of SNPs on TF binding affinities are not scalable for large-scale analysis. RESULTS: We describe A: ffinity T: esting for regulatory SNP: s (atSNP), a computationally efficient R package for identifying rSNPs in silico. atSNP implements an importance sampling algorithm coupled with a first-order Markov model for the background nucleotide sequences to test the significance of affinity scores and SNP-driven changes in these scores. Application of atSNP with >20 K SNPs indicates that atSNP is the only available tool for such a large-scale task. atSNP provides user-friendly output in the form of both tables and composite logo plots for visualizing SNP-motif interactions. Evaluations of atSNP with known rSNP-TF interactions indicate that SNP is able to prioritize motifs for a given set of SNPs with high accuracy. AVAILABILITY AND IMPLEMENTATION: https://github.com/keleslab/atSNP. CONTACT: keles@stat.wisc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Morgane Thomas-Chollier; Andrew Hufton; Matthias Heinig; Sean O'Keeffe; Nassim El Masri; Helge G Roider; Thomas Manke; Martin Vingron Journal: Nat Protoc Date: 2011-11-03 Impact factor: 13.491
Authors: Matthew T Maurano; Richard Humbert; Eric Rynes; Robert E Thurman; Eric Haugen; Hao Wang; Alex P Reynolds; Richard Sandstrom; Hongzhu Qu; Jennifer Brody; Anthony Shafer; Fidencio Neri; Kristen Lee; Tanya Kutyavin; Sandra Stehling-Sun; Audra K Johnson; Theresa K Canfield; Erika Giste; Morgan Diegel; Daniel Bates; R Scott Hansen; Shane Neph; Peter J Sabo; Shelly Heimfeld; Antony Raubitschek; Steven Ziegler; Chris Cotsapas; Nona Sotoodehnia; Ian Glass; Shamil R Sunyaev; Rajinder Kaul; John A Stamatoyannopoulos Journal: Science Date: 2012-09-05 Impact factor: 47.728
Authors: Malin C Andersen; Pär G Engström; Stuart Lithwick; David Arenillas; Per Eriksson; Boris Lenhard; Wyeth W Wasserman; Jacob Odeberg Journal: PLoS Comput Biol Date: 2007-11-27 Impact factor: 4.475
Authors: Riccardo Dainese; Vincent Gardeux; Gerard Llimos; Daniel Alpern; Jia Yuan Jiang; Antonio Carlos Alves Meireles-Filho; Bart Deplancke Journal: Proc Natl Acad Sci U S A Date: 2020-05-27 Impact factor: 11.205
Authors: Matthew Dapas; Ryan Sisk; Richard S Legro; Margrit Urbanek; Andrea Dunaif; M Geoffrey Hayes Journal: J Clin Endocrinol Metab Date: 2019-04-30 Impact factor: 5.958