Literature DB >> 21767363

A strategy analysis for genetic association studies with known inbreeding.

Stefano Cabras1, Maria Eugenia Castellanos, Ginevra Biino, Ivana Persico, Alessandro Sassu, Laura Casula, Stefano Del Giacco, Francesco Bertolino, Mario Pirastu, Nicola Pirastu.   

Abstract

BACKGROUND: Association studies consist in identifying the genetic variants which are related to a specific disease through the use of statistical multiple hypothesis testing or segregation analysis in pedigrees. This type of studies has been very successful in the case of Mendelian monogenic disorders while it has been less successful in identifying genetic variants related to complex diseases where the insurgence depends on the interactions between different genes and the environment. The current technology allows to genotype more than a million of markers and this number has been rapidly increasing in the last years with the imputation based on templates sets and whole genome sequencing. This type of data introduces a great amount of noise in the statistical analysis and usually requires a great number of samples. Current methods seldom take into account gene-gene and gene-environment interactions which are fundamental especially in complex diseases. In this paper we propose to use a non-parametric additive model to detect the genetic variants related to diseases which accounts for interactions of unknown order. Although this is not new to the current literature, we show that in an isolated population, where the most related subjects share also most of their genetic code, the use of additive models may be improved if the available genealogical tree is taken into account. Specifically, we form a sample of cases and controls with the highest inbreeding by means of the Hungarian method, and estimate the set of genes/environmental variables, associated with the disease, by means of Random Forest.
RESULTS: We have evidence, from statistical theory, simulations and two applications, that we build a suitable procedure to eliminate stratification between cases and controls and that it also has enough precision in identifying genetic variants responsible for a disease. This procedure has been successfully used for the beta-thalassemia, which is a well known Mendelian disease, and also to the common asthma where we have identified candidate genes that underlie to the susceptibility of the asthma. Some of such candidate genes have been also found related to common asthma in the current literature.
CONCLUSIONS: The data analysis approach, based on selecting the most related cases and controls along with the Random Forest model, is a powerful tool for detecting genetic variants associated to a disease in isolated populations. Moreover, this method provides also a prediction model that has accuracy in estimating the unknown disease status and that can be generally used to build kit tests for a wide class of Mendelian diseases.

Entities:  

Mesh:

Year:  2011        PMID: 21767363      PMCID: PMC3155486          DOI: 10.1186/1471-2156-12-63

Source DB:  PubMed          Journal:  BMC Genet        ISSN: 1471-2156            Impact factor:   2.797


  21 in total

1.  Genomewide rapid association using mixed model and regression: a fast and simple method for genomewide pedigree-based quantitative trait loci association analysis.

Authors:  Yurii S Aulchenko; Dirk-Jan de Koning; Chris Haley
Journal:  Genetics       Date:  2007-07-29       Impact factor: 4.562

2.  Molecular characterization of beta-thalassemia in the Sardinian population.

Authors:  M C Rosatelli; A Dozy; V Faà; A Meloni; R Sardu; L Saba; Y W Kan; A Cao
Journal:  Am J Hum Genet       Date:  1992-02       Impact factor: 11.025

Review 3.  A review of modern multiple hypothesis testing, with particular attention to the false discovery proportion.

Authors:  Alessio Farcomeni
Journal:  Stat Methods Med Res       Date:  2007-08-14       Impact factor: 3.021

4.  Circadian variations in rat liver gene expression: relationships to drug actions.

Authors:  Richard R Almon; Eric Yang; William Lai; Ioannis P Androulakis; Debra C DuBois; William J Jusko
Journal:  J Pharmacol Exp Ther       Date:  2008-06-18       Impact factor: 4.030

5.  Bayesian mixture models for the incorporation of prior knowledge to inform genetic association studies.

Authors:  Brooke L Fridley; Daniel Serie; Gregory Jenkins; Kristin White; William Bamlet; John D Potter; Ellen L Goode
Journal:  Genet Epidemiol       Date:  2010-07       Impact factor: 2.135

6.  HIGH DIMENSIONAL VARIABLE SELECTION.

Authors:  Larry Wasserman; Kathryn Roeder
Journal:  Ann Stat       Date:  2009-01-01       Impact factor: 4.028

Review 7.  Asthma genetics and genomics 2009.

Authors:  Scott T Weiss; Benjamin A Raby; Angela Rogers
Journal:  Curr Opin Genet Dev       Date:  2009-05-28       Impact factor: 5.578

8.  Thalassaemia and glucose-6-phosphate dehydrogenase screening in 13- to 14-year-old students of the Sardinian population: preliminary findings.

Authors:  A Cao; R Congiu; M C Sollaino; M F Desogus; F R Demartis; D Loi; M Cau; R Galanello
Journal:  Community Genet       Date:  2008-03-26

9.  beta zero thalassemia in Sardinia is caused by a nonsense mutation.

Authors:  R F Trecartin; S A Liebhaber; J C Chang; K Y Lee; Y W Kan; M Furbetta; A Angius; A Cao
Journal:  J Clin Invest       Date:  1981-10       Impact factor: 14.808

10.  Bias in random forest variable importance measures: illustrations, sources and a solution.

Authors:  Carolin Strobl; Anne-Laure Boulesteix; Achim Zeileis; Torsten Hothorn
Journal:  BMC Bioinformatics       Date:  2007-01-25       Impact factor: 3.169

View more
  2 in total

1.  Supervised learning methods in modeling of CD4+ T cell heterogeneity.

Authors:  Pinyi Lu; Vida Abedi; Yongguo Mei; Raquel Hontecillas; Stefan Hoops; Adria Carbo; Josep Bassaganya-Riera
Journal:  BioData Min       Date:  2015-09-04       Impact factor: 2.522

2.  Genome-wide prediction using Bayesian additive regression trees.

Authors:  Patrik Waldmann
Journal:  Genet Sel Evol       Date:  2016-06-10       Impact factor: 4.297

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.