Literature DB >> 21067556

Shedding new light on genetic dark matter.

Nadine Melhem1, Bernie Devlin.   

Abstract

Discoveries from genome-wide association studies have contributed to our knowledge of the genetic etiology of many complex diseases. However, these account for only a small fraction of each disease's heritability. Here, we comment on approaches currently available to uncover more of the genetic 'dark matter,' including an approach introduced recently by Naukkarinen and colleagues. These authors propose a method for distinguishing between gene expression driven by genetic variation and that driven by non-genetic factors. This dichotomy allows investigators to focus statistical tests and further molecular analyses on a smaller set of genes, thereby discovering new genetic variation affecting risk for disease. We need more methods like this one if we are to shed a powerful light on dark matter. By enhancing our understanding of molecular genetic etiology, such methods will help us to understand disease processes better and will advance the promise of personalized medicine.

Entities:  

Year:  2010        PMID: 21067556      PMCID: PMC3092108          DOI: 10.1186/gm200

Source DB:  PubMed          Journal:  Genome Med        ISSN: 1756-994X            Impact factor:   11.117


Background

The past three decades of studies have unveiled some of the genetic underpinnings of human disease. For complex diseases, those with obscure genetic roots, discoveries have accelerated recently owing to a bloom of genome-wide association studies (GWASs) [1]. Nevertheless, even for the most successful cases (such as inflammatory and ulcerative bowel disease [2,3]), discoveries account for only a fraction, often small, of the disease's heritability. These yet to be discovered genetic variants comprise the 'missing heritability' or the genetic 'dark matter' for disease.

State of dark matter

Heritability, the proportion of trait variability explained by genetic factors, has two somewhat different meanings. Narrow-sense heritability involves only the additive effects of genes. Broad-sense heritability involves both additive and non-additive effects. The difference between the two makes a difference when hunting for dark matter. If genetic variation were all to act additively, the best predictor of an offspring's trait value would be the average of his/her parents' values. Human height is an excellent example, after adjusting for gender. Hunting for dark matter for a trait such as human height will be more straightforward than for a disease such as schizophrenia, for which the evidence for substantial gene-gene interaction is compelling [4]. Yet when researchers refer to heritability of human height, they implicitly mean narrow-sense heritability; for schizophrenia, it is heritability in a much broader sense. Why should we care about the genetic basis of disease? Greater understanding of the genetics equals greater understanding of molecular etiology and, with it, eventually more cogent treatments. However, the origins of some human diseases, especially those of the mind, can be mysterious. For diseases of the mind, few environmental or genetic risk factors are understood; instead the hope is that identified genetic factors will lead to a subtler understanding of why diseases such as schizophrenia arise and how they can be treated effectively. Even for cardiovascular disease, for which environmental risk factors are well characterized, new insights into its genetics could produce more targeted treatment. This leads to the other expectation - that greater genetic knowledge will pave the way for 'personalized' medicine. The rapid technological advances in genomics will soon make it feasible to sequence whole genomes at relatively low cost. The idea that each individual will have meaningful sequence variation in their medical records and will have interventions tailored to their risk profile and likely treatment response is quite appealing. The goal of personalized medicine, however, is hindered because so much molecular etiology remains in the dark. One way to explain more of the dark matter is to develop more efficient ways to use existing data. Naukkarinen et al. [5] develop an innovative approach that integrates gene expression and genotype data. They apply these ideas to a GWAS of obesity, as measured by body mass index (BMI). Studies estimate BMI's heritability at 45 to 85%, but identified genetic variants explain about 1% of the total variance [6]. To discover more variants, the authors [5] examined gene expression of adipose tissue in a sample of monozygotic (MZ) twins discordant for BMI and in a sample of unrelated individuals. Because MZ twins are genetically identical, or nearly so, the authors reasoned that genes showing expression differences between twins are 'reactive' genes with differences that are due to regulatory or epigenetic changes in response to environmental factors. By contrast, genes uncovered in unrelated individuals are a combination of reactive and genetically 'causal' genes. By contrasting results from the unrelated sample and discordant MZ twins, the authors identified 27 causal genes that were differentially regulated. They then tested 197 single nucleotide polymorphisms (SNPs) falling in and around these genes in a sample of 21,000 subjects. They discovered a significant excess of small P-values in this set of SNPs. Neither the set of SNPs defined by reactive genes nor the individual SNPs in the reactive set were associated with BMI. Notably, this work identifies a new gene, F13A1, which encodes the coagulation factor XIII A chain, with variation that affects BMI. This gene has also been identified by meta-analysis of 12 studies of venous thromboembolism [7]. Obesity is well known to predispose to vein thromboses; however, the study of Naukkarinen et al. [5] reveals a potential biological pathway for the relationship between obesity, thrombosis and cardiovascular risk. The methods advanced by Naukkarinen and colleagues [5] require discordant MZ twins, which were available for BMI. This experimental design could prove highly informative for similar quantitative traits, for which extremes are easily identified and by which the pathology or phenotype of interest is defined. For some diseases, especially diseases of the brain, quantitative traits that map precisely onto risk are not yet available. In addition, because reactive genes are environment-dependent, successful implementation of this design might require a sample exposed to a homogeneous environment, limiting its generality. Regardless, this study shows how innovative research can cast more light on dark matter. Moreover, the study design could also inform us about pathways of correlated gene expression and how much these correlations are influenced by genetic and environmental variation. Many other methods and designs are available to illuminate dark matter [8-15]. One appealing approach teams gene-expression results with genome-wide association data to produce targeted hypothesis tests [8]. One possibility is to organize tests by expression quantitative trait loci affecting genes in pathways meaningful for the disease. Statistical methods for targeted testing are available, whether on the basis of prior information of the likelihood of an association between a SNP and the phenotype or on the basis of plausible disease pathways [9,10]. Genetic variants with parental origin effects, or whose effects depend on the parent from whom they were inherited, could be part of the dark matter; methods are now available to determine the parental origin of alleles and haplotypes even in the absence of genotyped parents [13]. Studies of copy number variants and their inheritance in families could also reveal insight into plausible biological pathways for disease [14,15]. It is also safe to say that rare variants account for some of the dark matter [16], possibly the majority of it in some cases. Next-generation sequencing promises to fill some of our void in knowledge by identifying more penetrant but rarer variants. Other approaches are less illuminating. Let's reconsider human height. We know numerous rare variants and about 50 common variants that have an impact on height. Thus far, known genetics account for roughly 5% of the variance. Using many SNPs from GWAS analysis that are not significantly associated with height, Yang et al. [17] estimated the proportion of variance in height explained by SNPs as 0.45 and even got close to the heritability estimate of 0.84 after correcting for incomplete linkage disequilibrium between SNPs genotyped and causal variants. In spirit, this approach [17] is similar to the allele score method [18], which seeks a predictive model for disease status on the basis of thousands of SNPs with modest evidence for association. If their results are correct, both studies [17,18] suggest that the effects of SNPs are small and will be difficult or impossible to detect from simple analyses of GWASs, at least for current sample sizes [19]. These intriguing approaches have some drawbacks: they shed no new light on the molecular etiology of phenotype; and inherent in the calculations are assumptions that could prove difficult to validate. We all recognize the hidden biases that inflate estimates of heritability. There are other complex pathways for the transmission of a phenotype across generations without the transmission of a specific common or rare variant, namely through epigenetic factors that can result in the inheritance of gene expression patterns without an alteration of the DNA sequence [20]. Gene-environment interactions could also affect the estimates of heritability and when they are in play, they can explain as much of the variance in the phenotype as genetic factors [21].

Conclusions

Concerted effort will almost surely be required to understand the genetic architecture of most complex diseases. Naukkarinen et al.'s [5] novel study design illustrates the impact that concerted effort can have in advancing our knowledge of the genetic etiology of such diseases. There remains ample room for novel analytic methods and study designs to shed light on the genetic dark matter of disease. It is entirely possible, 10 years hence, that we will realize that much of the missing heritability was hiding in plain sight in common variants.

Abbreviations

GWAS: genome-wide association study; MZ: monozygotic; SNP: single nucleotide polymorphism

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

The authors contributed equally to the writing and preparation of this commentary.

Authors' information

BD is Associate Professor of Psychiatry and Human Genetics, University of Pittsburgh School of Medicine, Pittsburgh. His background and area of expertise is statistical genetics. NM is Assistant Professor of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh. Her background and areas of expertise are psychiatric epidemiology and statistical genetics.
  21 in total

1.  Pathway-based approaches for analysis of genomewide association studies.

Authors:  Kai Wang; Mingyao Li; Maja Bucan
Journal:  Am J Hum Genet       Date:  2007-12       Impact factor: 11.025

Review 2.  Transgenerational genetic effects on phenotypic variation and disease risk.

Authors:  Joseph H Nadeau
Journal:  Hum Mol Genet       Date:  2009-10-15       Impact factor: 6.150

3.  Common SNPs explain a large proportion of the heritability for human height.

Authors:  Jian Yang; Beben Benyamin; Brian P McEvoy; Scott Gordon; Anjali K Henders; Dale R Nyholt; Pamela A Madden; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael E Goddard; Peter M Visscher
Journal:  Nat Genet       Date:  2010-06-20       Impact factor: 38.330

4.  Genome-wide association identifies multiple ulcerative colitis susceptibility loci.

Authors:  Dermot P B McGovern; Agnès Gardet; Leif Törkvist; Philippe Goyette; Jonah Essers; Kent D Taylor; Benjamin M Neale; Rick T H Ong; Caroline Lagacé; Chun Li; Todd Green; Christine R Stevens; Claudine Beauchamp; Phillip R Fleshner; Marie Carlson; Mauro D'Amato; Jonas Halfvarson; Martin L Hibberd; Mikael Lördal; Leonid Padyukov; Angelo Andriulli; Elisabetta Colombo; Anna Latiano; Orazio Palmieri; Edmond-Jean Bernard; Colette Deslandres; Daan W Hommes; Dirk J de Jong; Pieter C Stokkers; Rinse K Weersma; Yashoda Sharma; Mark S Silverberg; Judy H Cho; Jing Wu; Kathryn Roeder; Steven R Brant; L Phillip Schumm; Richard H Duerr; Marla C Dubinsky; Nicole L Glazer; Talin Haritunians; Andy Ippoliti; Gil Y Melmed; David S Siscovick; Eric A Vasiliauskas; Stephan R Targan; Vito Annese; Cisca Wijmenga; Sven Pettersson; Jerome I Rotter; Ramnik J Xavier; Mark J Daly; John D Rioux; Mark Seielstad
Journal:  Nat Genet       Date:  2010-03-14       Impact factor: 38.330

5.  Factor XIII Val34Leu variant is protective against venous thromboembolism: a HuGE review and meta-analysis.

Authors:  Philip S Wells; Josdalyne L Anderson; Dimitrios K Scarvelis; Steve P Doucette; France Gagnon
Journal:  Am J Epidemiol       Date:  2006-06-01       Impact factor: 4.897

6.  Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS.

Authors:  Dan L Nicolae; Eric Gamazon; Wei Zhang; Shiwei Duan; M Eileen Dolan; Nancy J Cox
Journal:  PLoS Genet       Date:  2010-04-01       Impact factor: 5.917

7.  Use of genome-wide expression data to mine the "Gray Zone" of GWA studies leads to novel candidate obesity genes.

Authors:  Jussi Naukkarinen; Ida Surakka; Kirsi H Pietiläinen; Aila Rissanen; Veikko Salomaa; Samuli Ripatti; Hannele Yki-Järvinen; Cornelia M van Duijn; H-Erich Wichmann; Jaakko Kaprio; Marja-Riitta Taskinen; Leena Peltonen
Journal:  PLoS Genet       Date:  2010-06-03       Impact factor: 5.917

8.  Autism genome-wide copy number variation reveals ubiquitin and neuronal genes.

Authors:  Joseph T Glessner; Kai Wang; Guiqing Cai; Olena Korvatska; Cecilia E Kim; Shawn Wood; Haitao Zhang; Annette Estes; Camille W Brune; Jonathan P Bradfield; Marcin Imielinski; Edward C Frackelton; Jennifer Reichert; Emily L Crawford; Jeffrey Munson; Patrick M A Sleiman; Rosetta Chiavacci; Kiran Annaiah; Kelly Thomas; Cuiping Hou; Wendy Glaberson; James Flory; Frederick Otieno; Maria Garris; Latha Soorya; Lambertus Klei; Joseph Piven; Kacie J Meyer; Evdokia Anagnostou; Takeshi Sakurai; Rachel M Game; Danielle S Rudd; Danielle Zurawiecki; Christopher J McDougle; Lea K Davis; Judith Miller; David J Posey; Shana Michaels; Alexander Kolevzon; Jeremy M Silverman; Raphael Bernier; Susan E Levy; Robert T Schultz; Geraldine Dawson; Thomas Owley; William M McMahon; Thomas H Wassink; John A Sweeney; John I Nurnberger; Hilary Coon; James S Sutcliffe; Nancy J Minshew; Struan F A Grant; Maja Bucan; Edwin H Cook; Joseph D Buxbaum; Bernie Devlin; Gerard D Schellenberg; Hakon Hakonarson
Journal:  Nature       Date:  2009-04-28       Impact factor: 49.962

9.  A common variant in the FTO gene is associated with body mass index and predisposes to childhood and adult obesity.

Authors:  Timothy M Frayling; Nicholas J Timpson; Michael N Weedon; Eleftheria Zeggini; Rachel M Freathy; Cecilia M Lindgren; John R B Perry; Katherine S Elliott; Hana Lango; Nigel W Rayner; Beverley Shields; Lorna W Harries; Jeffrey C Barrett; Sian Ellard; Christopher J Groves; Bridget Knight; Ann-Marie Patch; Andrew R Ness; Shah Ebrahim; Debbie A Lawlor; Susan M Ring; Yoav Ben-Shlomo; Marjo-Riitta Jarvelin; Ulla Sovio; Amanda J Bennett; David Melzer; Luigi Ferrucci; Ruth J F Loos; Inês Barroso; Nicholas J Wareham; Fredrik Karpe; Katharine R Owen; Lon R Cardon; Mark Walker; Graham A Hitman; Colin N A Palmer; Alex S F Doney; Andrew D Morris; George Davey Smith; Andrew T Hattersley; Mark I McCarthy
Journal:  Science       Date:  2007-04-12       Impact factor: 47.728

10.  Common variants at five new loci associated with early-onset inflammatory bowel disease.

Authors:  Marcin Imielinski; Robert N Baldassano; Anne Griffiths; Richard K Russell; Vito Annese; Marla Dubinsky; Subra Kugathasan; Jonathan P Bradfield; Thomas D Walters; Patrick Sleiman; Cecilia E Kim; Aleixo Muise; Kai Wang; Joseph T Glessner; Shehzad Saeed; Haitao Zhang; Edward C Frackelton; Cuiping Hou; James H Flory; George Otieno; Rosetta M Chiavacci; Robert Grundmeier; Massimo Castro; Anna Latiano; Bruno Dallapiccola; Joanne Stempak; Debra J Abrams; Kent Taylor; Dermot McGovern; Gary Silber; Iwona Wrobel; Antonio Quiros; Jeffrey C Barrett; Sarah Hansoul; Dan L Nicolae; Judy H Cho; Richard H Duerr; John D Rioux; Steven R Brant; Mark S Silverberg; Kent D Taylor; M Michael Barmuda; Alain Bitton; Themistocles Dassopoulos; Lisa Wu Datta; Todd Green; Anne M Griffiths; Emily O Kistner; Michael T Murtha; Miguel D Regueiro; Jerome I Rotter; L Philip Schumm; A Hillary Steinhart; Stephen R Targan; Ramnik J Xavier; Cécile Libioulle; Cynthia Sandor; Mark Lathrop; Jacques Belaiche; Olivier Dewit; Ivo Gut; Simon Heath; Debby Laukens; Myriam Mni; Paul Rutgeerts; André Van Gossum; Diana Zelenika; Denis Franchimont; J P Hugot; Martine de Vos; Severine Vermeire; Edouard Louis; Lon R Cardon; Carl A Anderson; Hazel Drummond; Elaine Nimmo; Tariq Ahmad; Natalie J Prescott; Clive M Onnie; Sheila A Fisher; Jonathan Marchini; Jilur Ghori; Suzannah Bumpstead; Rhian Gwillam; Mark Tremelling; Panos Delukas; John Mansfield; Derek Jewell; Jack Satsangi; Christopher G Mathew; Miles Parkes; Michel Georges; Mark J Daly; Melvin B Heyman; George D Ferry; Barbara Kirschner; Jessica Lee; Jonah Essers; Richard Grand; Michael Stephens; Arie Levine; David Piccoli; John Van Limbergen; Salvatore Cucchiara; Dimitri S Monos; Stephen L Guthery; Lee Denson; David C Wilson; Straun F A Grant; Mark Daly; Mark S Silverberg; Jack Satsangi; Hakon Hakonarson
Journal:  Nat Genet       Date:  2009-11-15       Impact factor: 38.330

View more
  3 in total

1.  The blood exposome and its role in discovering causes of disease.

Authors:  Stephen M Rappaport; Dinesh K Barupal; David Wishart; Paolo Vineis; Augustin Scalbert
Journal:  Environ Health Perspect       Date:  2014-03-21       Impact factor: 9.031

2.  Common genetic variants, acting additively, are a major source of risk for autism.

Authors:  Lambertus Klei; Stephan J Sanders; Michael T Murtha; Vanessa Hus; Jennifer K Lowe; A Jeremy Willsey; Daniel Moreno-De-Luca; Timothy W Yu; Eric Fombonne; Daniel Geschwind; Dorothy E Grice; David H Ledbetter; Catherine Lord; Shrikant M Mane; Christa Lese Martin; Donna M Martin; Eric M Morrow; Christopher A Walsh; Nadine M Melhem; Pauline Chaste; James S Sutcliffe; Matthew W State; Edwin H Cook; Kathryn Roeder; Bernie Devlin
Journal:  Mol Autism       Date:  2012-10-15       Impact factor: 7.509

3.  Deep landscape update of dispersed and tandem repeats in the genome model of the red jungle fowl, Gallus gallus, using a series of de novo investigating tools.

Authors:  Sébastien Guizard; Benoît Piégu; Peter Arensburger; Florian Guillou; Yves Bigot
Journal:  BMC Genomics       Date:  2016-08-19       Impact factor: 3.969

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.