Hongbing Shen1, Guangfu Jin. 1. Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 210029, China; ; Section of Clinical Epidemiology, Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Cancer Center, Nanjing Medical University, Nanjing, Jiangsu 210029, China.
Human genome epidemiology (HuGE) uses systematic applications of epidemiologic methods to assess the impact of human genetic variation on health and disease. In the past ten years, human genome epidemiology has made great progresses along with advances in genomics technologies, which make it possible for the examination of genetic variants in a large sample size at a sufficiently low cost. Genetic association study in population provides a powerful approach to identify variants or genes associated with disease of interest by comparing distributions of genetic variants between affected and unaffected individuals. A critical question remains that these findings from genetic association studies are not consistently reproducible, possibly due to false positive, false negative or true variability in association among different populations. Before 2006, only few low penetrance variants outside the HLA locus had been discovered to be reproducibly associated with disease susceptibility based on the candidate gene approach[1]. With the application of high throughput genotyping technology, genome-wide association studies (GWAS) have emerged as a powerful tool for investigating the genetic architecture of complex diseases without a prior hypothesis about a particular gene or locus. In the GWAS approach, several hundred thousand to more than a million single nucleotide polymorphisms (SNPs) are assayed across the whole genome in a large sample size of thousands of individuals[2]. To date, GWAS have led to the discovery of thousands of loci that are associated with different kinds of human diseases or traits. These findings have advanced our current knowledge of genetics of complex diseases, and provided new insights into identification of therapeutic targets and developing targeted interventions[3]. Given the advance in association study of complex diseases, especially for cancer, in the current issue, two reviews have been invited to comment on the progress of HuGE on head and neck cancer[4] and bladder cancer[5].The SNP chips used for GWAS are designed to provide better coverage of common SNPs, which, however, may be incompetent to interrogate all common variants for certain regions and have very limited potential to capture rare and low frequency variants[6]. Such variants may hardly be focused on using available genome-wide genotyping technologies. To date, only a small fraction of disease hereditability could be interpreted by known susceptibility loci. For example, more than 40 prostate cancer susceptibility loci have been reported; however, these only account for approximately 25% of the familial risk of the disease[7]. In this regard, genetic association studies should direct more efforts at the variants around genes/regions already implicated in disease pathogenesis. Functional variants in such genes/regions represent particularly impressive candidates. Several reports in this issue demonstrate the necessity and importance of association studies based on functionally candidate genes for variants associated with disease susceptibility or prognosis[8]–[10]. Of course, these findings, in the absence of replication, need to be validated in other studies with independent samples. Nevertheless, we suggest that more efforts should be taken on regions identified as relevant to diseases through GWAS using a targeted region approach, which has been shown as an efficient and cost-effective screening for rare and low-frequency polymorphisms to expand the disease variance explained[11].In addition to the issues discussed above, rare variants, epistasis, epigenetics and genotype-environment interactions may also contribute to the missing heritability[12]. As compared with the identified common variants by GWAS (in general, minor allele frequency is more than 5%), rarer variants that are poorly detected by available genotyping arrays. Structural variation, including copy number variants (CNVs), inversions, translocations, microsatellite, repeat expansions, insertions of new sequence, complex rearrangements, and short insertions or deletions (indels), may also account for some of the unexplained heritability and are poorly captured by existing arrays[13]. In the current issue, Wang et al[14] explored the landscape and impact of a new form of genomic variation, run of homozygosity (ROH), on lung cancer. ROH is a continuous or uninterrupted stretch of a genomic sequence without heterozygosity in the diploid state, which is poorly investigated to date. Using an existing GWAS dataset including 1473 lung cancer cases and 1962 controls, they identified a new region at 14q23.1 that was consistently associated with lung cancer risk in Chinese population[14], suggesting that ROHs may be also responsible for the unexplained familial risk of diseases. Nevertheless, the epistasis or gene-environment interaction may contribute to a large percentage of disease variance. However, it is still difficult to detect them in the current status of epidemiological study design, exposure assessment, and methods of analysis. Long term environmental exposure assessment will be more important in the future epidemiological study design.Beyond doubt, genetic association studies, especially GWAS, have made a great progress in elucidating genetic factors underling complex diseases. However, there are still several barriers to overcome. GWAS-identified SNPs may point to functional variants but are unlikely themselves to be the causative variants, since there will often be several variants in strong linkage disequilibrium (LD) that show more or less equivalent evidence of association for any given signal of association. Extensive sequencing of the identified region followed by a well-designed fine-mapping study in multiple populations may be helpful to narrow an association signal to potentially causative variants[15]. Furthermore, much additional work is needed to determine the functional basis for the observed associations.Particularly, the potential for variants identified in GWAS to predict the risk of complex diseases has been anticipated, but the usefulness of translating these fundamental genetic findings into the bedside remains debatable[16]. Nevertheless, there are already a number of benefits of such genetic prediction over classical non-genetic models. For instance, genetic risk prediction is more stable over time than traditional risk factors, as a person's genetic sequence is absolutely constant throughout their life. Recently, Sun et al.[17] reported that genetic score calculated by genetic variants discovered through association study is an objective and better measurement of inherited risk of prostate cancer than family history, which can be obtained without a laboratory test but influenced by family size, age and survival status of male relatives, recall ability, family communication, and prevalence of the disease in populations. With increasing numbers of discovered genetic variants that can be used as biomarkers in future genetic risk prediction, we believe that identification of a proportion of a high risk population may be feasible for target diagnostic, and preventive and therapeutic interventions for complex disorders.
Authors: Eleftheria Zeggini; William Rayner; Andrew P Morris; Andrew T Hattersley; Mark Walker; Graham A Hitman; Panos Deloukas; Lon R Cardon; Mark I McCarthy Journal: Nat Genet Date: 2005-10-30 Impact factor: 38.330
Authors: Matthew L Freedman; Alvaro N A Monteiro; Simon A Gayther; Gerhard A Coetzee; Angela Risch; Christoph Plass; Graham Casey; Mariella De Biasi; Chris Carlson; David Duggan; Michael James; Pengyuan Liu; Jay W Tichelaar; Haris G Vikis; Ming You; Ian G Mills Journal: Nat Genet Date: 2011-06 Impact factor: 38.330
Authors: Zsofia Kote-Jarai; Ali Amin Al Olama; Graham G Giles; Gianluca Severi; Johanna Schleutker; Maren Weischer; Daniele Campa; Elio Riboli; Tim Key; Henrik Gronberg; David J Hunter; Peter Kraft; Michael J Thun; Sue Ingles; Stephen Chanock; Demetrius Albanes; Richard B Hayes; David E Neal; Freddie C Hamdy; Jenny L Donovan; Paul Pharoah; Fredrick Schumacher; Brian E Henderson; Janet L Stanford; Elaine A Ostrander; Karina Dalsgaard Sorensen; Thilo Dörk; Gerald Andriole; Joanne L Dickinson; Cezary Cybulski; Jan Lubinski; Amanda Spurdle; Judith A Clements; Suzanne Chambers; Joanne Aitken; R A Frank Gardiner; Stephen N Thibodeau; Dan Schaid; Esther M John; Christiane Maier; Walther Vogel; Kathleen A Cooney; Jong Y Park; Lisa Cannon-Albright; Hermann Brenner; Tomonori Habuchi; Hong-Wei Zhang; Yong-Jie Lu; Radka Kaneva; Ken Muir; Sara Benlloch; Daniel A Leongamornlert; Edward J Saunders; Malgorzata Tymrakiewicz; Nadiya Mahmud; Michelle Guy; Lynne T O'Brien; Rosemary A Wilkinson; Amanda L Hall; Emma J Sawyer; Tokhir Dadaev; Jonathan Morrison; David P Dearnaley; Alan Horwich; Robert A Huddart; Vincent S Khoo; Christopher C Parker; Nicholas Van As; Christopher J Woodhouse; Alan Thompson; Tim Christmas; Chris Ogden; Colin S Cooper; Aritaya Lophatonanon; Melissa C Southey; John L Hopper; Dallas R English; Tiina Wahlfors; Teuvo L J Tammela; Peter Klarskov; Børge G Nordestgaard; M Andreas Røder; Anne Tybjærg-Hansen; Stig E Bojesen; Ruth Travis; Federico Canzian; Rudolf Kaaks; Fredrik Wiklund; Markus Aly; Sara Lindstrom; W Ryan Diver; Susan Gapstur; Mariana C Stern; Roman Corral; Jarmo Virtamo; Angela Cox; Christopher A Haiman; Loic Le Marchand; Liesel Fitzgerald; Suzanne Kolb; Erika M Kwon; Danielle M Karyadi; Torben Falck Orntoft; Michael Borre; Andreas Meyer; Jürgen Serth; Meredith Yeager; Sonja I Berndt; James R Marthick; Briony Patterson; Dominika Wokolorczyk; Jyotsna Batra; Felicity Lose; Shannon K McDonnell; Amit D Joshi; Ahva Shahabi; Antje E Rinckleb; Ana Ray; Thomas A Sellers; Hui-Yi Lin; Robert A Stephenson; James Farnham; Heiko Muller; Dietrich Rothenbacher; Norihiko Tsuchiya; Shintaro Narita; Guang-Wen Cao; Chavdar Slavov; Vanio Mitev; Douglas F Easton; Rosalind A Eeles Journal: Nat Genet Date: 2011-07-10 Impact factor: 38.330
Authors: Teri A Manolio; Francis S Collins; Nancy J Cox; David B Goldstein; Lucia A Hindorff; David J Hunter; Mark I McCarthy; Erin M Ramos; Lon R Cardon; Aravinda Chakravarti; Judy H Cho; Alan E Guttmacher; Augustine Kong; Leonid Kruglyak; Elaine Mardis; Charles N Rotimi; Montgomery Slatkin; David Valle; Alice S Whittemore; Michael Boehnke; Andrew G Clark; Evan E Eichler; Greg Gibson; Jonathan L Haines; Trudy F C Mackay; Steven A McCarroll; Peter M Visscher Journal: Nature Date: 2009-10-08 Impact factor: 49.962
Authors: Manuel A Rivas; Mélissa Beaudoin; Agnes Gardet; Christine Stevens; Yashoda Sharma; Clarence K Zhang; Gabrielle Boucher; Stephan Ripke; David Ellinghaus; Noel Burtt; Tim Fennell; Andrew Kirby; Anna Latiano; Philippe Goyette; Todd Green; Jonas Halfvarson; Talin Haritunians; Joshua M Korn; Finny Kuruvilla; Caroline Lagacé; Benjamin Neale; Ken Sin Lo; Phil Schumm; Leif Törkvist; Marla C Dubinsky; Steven R Brant; Mark S Silverberg; Richard H Duerr; David Altshuler; Stacey Gabriel; Guillaume Lettre; Andre Franke; Mauro D'Amato; Dermot P B McGovern; Judy H Cho; John D Rioux; Ramnik J Xavier; Mark J Daly Journal: Nat Genet Date: 2011-10-09 Impact factor: 38.330