Literature DB >> 26776200

BIOFILTER AS A FUNCTIONAL ANNOTATION PIPELINE FOR COMMON AND RARE COPY NUMBER BURDEN.

Dokyoon Kim1, Anastasia Lucas, Joseph Glessner, Shefali S Verma, Yuki Bradford, Ruowang Li, Alex T Frase, Hakon Hakonarson, Peggy Peissig, Murray Brilliant, Marylyn D Ritchie.   

Abstract

Recent studies on copy number variation (CNV) have suggested that an increasing burden of CNVs is associated with susceptibility or resistance to disease. A large number of genes or genomic loci contribute to complex diseases such as autism. Thus, total genomic copy number burden, as an accumulation of copy number change, is a meaningful measure of genomic instability to identify the association between global genetic effects and phenotypes of interest. However, no systematic annotation pipeline has been developed to interpret biological meaning based on the accumulation of copy number change across the genome associated with a phenotype of interest. In this study, we develop a comprehensive and systematic pipeline for annotating copy number variants into genes/genomic regions and subsequently pathways and other gene groups using Biofilter - a bioinformatics tool that aggregates over a dozen publicly available databases of prior biological knowledge. Next we conduct enrichment tests of biologically defined groupings of CNVs including genes, pathways, Gene Ontology, or protein families. We applied the proposed pipeline to a CNV dataset from the Marshfield Clinic Personalized Medicine Research Project (PMRP) in a quantitative trait phenotype derived from the electronic health record - total cholesterol. We identified several significant pathways such as toll-like receptor signaling pathway and hepatitis C pathway, gene ontologies (GOs) of nucleoside triphosphatase activity (NTPase) and response to virus, and protein families such as cell morphogenesis that are associated with the total cholesterol phenotype based on CNV profiles (permutation p-value < 0.01). Based on the copy number burden analysis, it follows that the more and larger the copy number changes, the more likely that one or more target genes that influence disease risk and phenotypic severity will be affected. Thus, our study suggests the proposed enrichment pipeline could improve the interpretability of copy number burden analysis where hundreds of loci or genes contribute toward disease susceptibility via biological knowledge groups such as pathways. This CNV annotation pipeline with Biofilter can be used for CNV data from any genotyping or sequencing platform and to explore CNV enrichment for any traits or phenotypes. Biofilter continues to be a powerful bioinformatics tool for annotating, filtering, and constructing biologically informed models for association analysis - now including copy number variants.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 26776200      PMCID: PMC4722964     

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  26 in total

1.  PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.

Authors:  Kai Wang; Mingyao Li; Dexter Hadley; Rui Liu; Joseph Glessner; Struan F A Grant; Hakon Hakonarson; Maja Bucan
Journal:  Genome Res       Date:  2007-10-05       Impact factor: 9.043

2.  A new initiative on precision medicine.

Authors:  Francis S Collins; Harold Varmus
Journal:  N Engl J Med       Date:  2015-01-30       Impact factor: 91.245

3.  Quality control procedures for genome-wide association studies.

Authors:  Stephen Turner; Loren L Armstrong; Yuki Bradford; Christopher S Carlson; Dana C Crawford; Andrew T Crenshaw; Mariza de Andrade; Kimberly F Doheny; Jonathan L Haines; Geoffrey Hayes; Gail Jarvik; Lan Jiang; Iftikhar J Kullo; Rongling Li; Hua Ling; Teri A Manolio; Martha Matsumoto; Catherine A McCarty; Andrew N McDavid; Daniel B Mirel; Justin E Paschall; Elizabeth W Pugh; Luke V Rasmussen; Russell A Wilke; Rebecca L Zuvich; Marylyn D Ritchie
Journal:  Curr Protoc Hum Genet       Date:  2011-01

Review 4.  Emerging role of Toll-like receptors in atherosclerosis.

Authors:  Linda K Curtiss; Peter S Tobias
Journal:  J Lipid Res       Date:  2008-11-01       Impact factor: 5.922

5.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data.

Authors:  Joshua C Denny; Lisa Bastarache; Marylyn D Ritchie; Robert J Carroll; Raquel Zink; Jonathan D Mosley; Julie R Field; Jill M Pulley; Andrea H Ramirez; Erica Bowton; Melissa A Basford; David S Carrell; Peggy L Peissig; Abel N Kho; Jennifer A Pacheco; Luke V Rasmussen; David R Crosslin; Paul K Crane; Jyotishman Pathak; Suzette J Bielinski; Sarah A Pendergrass; Hua Xu; Lucia A Hindorff; Rongling Li; Teri A Manolio; Christopher G Chute; Rex L Chisholm; Eric B Larson; Gail P Jarvik; Murray H Brilliant; Catherine A McCarty; Iftikhar J Kullo; Jonathan L Haines; Dana C Crawford; Daniel R Masys; Dan M Roden
Journal:  Nat Biotechnol       Date:  2013-12       Impact factor: 54.908

6.  Genetic copy number variants in myocardial infarction patients with hyperlipidemia.

Authors:  Wei-Chung Shia; Tien-Hsiung Ku; Yu-Ming Tsao; Chien-Hsun Hsia; Yung-Ming Chang; Ching-Hui Huang; Yeh-Ching Chung; Shih-Lan Hsu; Kae-Woei Liang; Fang-Rong Hsu
Journal:  BMC Genomics       Date:  2011-11-30       Impact factor: 3.969

Review 7.  The Electronic Medical Records and Genomics (eMERGE) Network: past, present, and future.

Authors:  Omri Gottesman; Helena Kuivaniemi; Gerard Tromp; W Andrew Faucett; Rongling Li; Teri A Manolio; Saskia C Sanderson; Joseph Kannry; Randi Zinberg; Melissa A Basford; Murray Brilliant; David J Carey; Rex L Chisholm; Christopher G Chute; John J Connolly; David Crosslin; Joshua C Denny; Carlos J Gallego; Jonathan L Haines; Hakon Hakonarson; John Harley; Gail P Jarvik; Isaac Kohane; Iftikhar J Kullo; Eric B Larson; Catherine McCarty; Marylyn D Ritchie; Dan M Roden; Maureen E Smith; Erwin P Böttinger; Marc S Williams
Journal:  Genet Med       Date:  2013-06-06       Impact factor: 8.822

Review 8.  Copy number variation analysis in the context of electronic medical records and large-scale genomics consortium efforts.

Authors:  John J Connolly; Joseph T Glessner; Berta Almoguera; David R Crosslin; Gail P Jarvik; Patrick M Sleiman; Hakon Hakonarson
Journal:  Front Genet       Date:  2014-03-18       Impact factor: 4.599

9.  Global increases in both common and rare copy number load associated with autism.

Authors:  Santhosh Girirajan; Rebecca L Johnson; Flora Tassone; Jorune Balciuniene; Neerja Katiyar; Keolu Fox; Carl Baker; Abhinaya Srikanth; Kian Hui Yeoh; Su Jen Khoo; Therese B Nauth; Robin Hansen; Marylyn Ritchie; Irva Hertz-Picciotto; Evan E Eichler; Isaac N Pessah; Scott B Selleck
Journal:  Hum Mol Genet       Date:  2013-03-27       Impact factor: 6.150

10.  Genomic analyses with biofilter 2.0: knowledge driven filtering, annotation, and model development.

Authors:  Sarah A Pendergrass; Alex Frase; John Wallace; Daniel Wolfe; Neerja Katiyar; Carrie Moore; Marylyn D Ritchie
Journal:  BioData Min       Date:  2013-12-30       Impact factor: 2.522

View more
  1 in total

1.  PLATO software provides analytic framework for investigating complexity beyond genome-wide association studies.

Authors:  Molly A Hall; John Wallace; Anastasia Lucas; Dokyoon Kim; Anna O Basile; Shefali S Verma; Cathy A McCarty; Murray H Brilliant; Peggy L Peissig; Terrie E Kitchner; Anurag Verma; Sarah A Pendergrass; Scott M Dudek; Jason H Moore; Marylyn D Ritchie
Journal:  Nat Commun       Date:  2017-10-27       Impact factor: 14.919

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.