| Literature DB >> 25807536 |
Fuli Yu1, Jian Lu2, Xiaoming Liu3, Elodie Gazave4, Diana Chang4, Srilakshmi Raj4, Haley Hunter-Zinck4, Ran Blekhman4, Leonardo Arbiza4, Cris Van Hout4, Alanna Morrison3, Andrew D Johnson5, Joshua Bis6, L Adrienne Cupples5, Bruce M Psaty6, Donna Muzny7, Jin Yu7, Richard A Gibbs7, Alon Keinan4, Andrew G Clark4, Eric Boerwinkle8.
Abstract
Whole genome analysis in large samples from a single population is needed to provide adequate power to assess relative strengths of natural selection across different functional components of the genome. In this study, we analyzed next-generation sequencing data from 962 European Americans, and found that as expected approximately 60% of the top 1% of positive selection signals lie in intergenic regions, 33% in intronic regions, and slightly over 1% in coding regions. Several detailed functional annotation categories in intergenic regions showed statistically significant enrichment in positively selected loci when compared to the null distribution of the genomic span of ENCODE categories. There was a significant enrichment of purifying selection signals detected in enhancers, transcription factor binding sites, microRNAs and target sites, but not on lincRNA or piRNAs, suggesting different evolutionary constraints for these domains. Loci in "repressed or low activity regions" and loci near or overlapping the transcription start site were the most significantly over-represented annotations among the top 1% of signals for positive selection.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25807536 PMCID: PMC4373932 DOI: 10.1371/journal.pone.0121644
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240