We want to discuss how recent dramatic progress in whole-genome association analysis (WGA) applied to human case-control studies will affect complex trait analysis in the mouse. At first sight it might now seem that one can identify the genetic determinants of human complex disease directly. However, the picture is less straightforward, for although a significant number of genes have been identified and replicated across different human WGA studies, in most diseases the genetic variation segregating at these genes explains only a small fraction of cases that should be accounted for by genetic causes. The question remains whether the missing genetic signal is from common variants in other genes but with very small phenotypic effect, or is caused by rare variants in the same genes, or a combination of both. In either case, to continue making progress directly with the human WGA methodology it will be necessary to increase sample sizes significantly (Zeggini et al. 2008).Where does this leave mouse complex genetics? Of course, as the readers of Mammalian Genome will appreciate, working with mice does have certain advantages in that it is possible to design and control experiments and take detailed measurements in a way that is impossible with humans. Nevertheless, the mouse complex genetics community must address three key issues in order to contribute to our understanding of human biology and disease.The first of these is mapping resolution. We need to establish suitable mouse populations in which high-resolution mapping is the norm. The small extent of linkage disequilibrium (LD) in most human populations means that a positive signal in a human WGA is localized to a few tens of kilobases, usually the span a single gene. Assuming that the functional genetic variant acts on the nearest gene (which is not always the case), then human WGA delivers single-gene resolution. In contrast, a detected QTL in an F2 intercross between two inbred laboratory mouse strains that explains 5% of the phenotypic variance will be mapped into a 95% confidence interval of approximately 30 Mb, containing approximately 300 genes. In some cases, combining information from multiple F2 crosses with the haplotype map of the mouse genome can refine QTL localization (Hitzemann et al. 2002; Li et al. 2005). Several other strategies have been proposed to solve this problem, by using populations of mice with more recombinants, and consequently steeper LD decay profiles. Heterogeneous stocks (HS) are descended from eight known inbred strain of mice that are outcrossed using a rotational breeding scheme for many generations until the genomes are relatively fine-grained mosaics of the founder haplotypes; mapping resolution of 2–3Mb is obtainable (Mott et al. 2000; Talbot et al. 1999; Valdar et al.
2006). However, despite being much more accurate than an F2 cross, this is still too crude for single-gene mapping for which we require mapping resolution of about 100 kb.Commercial mouse breeders maintain very large genetically heterogeneous outbred populations, some of which are suitable for complex trait analysis. As proof of principle, Yalcin et al. (2004) showed how one such population, MF1, could be used to fine-map a QTL for behavior down to the gene Rgs2. Our group is now in the process of evaluating the genetic variability and LD profile of several commercially available outbred populations. Preliminary data suggest that there are some populations that have suitable properties but that others are not useful, having been recently rederived from a small number of animals and therefore containing extensive LD. There is also one tantalizing study of outbred wild mice that suggests they have an LD structure similar to that of humans (Laurie et al. 2007).The disadvantage of working with outbreds is that each animal is unique so that it is impossible to perform repeat experiments on the same genetic background, which may be necessary; for example, in a study to measure gene expression changes during development. Furthermore, the cost of high-density genotyping required limits the size of experiments. In contrast, inbred strains of mice permit these types of studies, need only be genotyped once, and there is a synergy in accumulating data from different experiments on standardized genetic backgrounds. There has been considerable debate over the direct use of the standard laboratory inbred strains for WGA. The main point of contention is that the number of independent inbred strains available is limited (for example, the mouse phenome database http://www.phenome.jax.org/pub-cgi/phenome/mpdcgi uses less than 40 priority strains, and even these strains share haplotypes to a considerable degree). Although the method may work for major QTLs explaining over 50% of the phenotypic variance, results are mixed for complex traits (Payseur and Place 2007) where QTLs of small effect are lost among false-positive signals in a genome scan. On the other hand, the sharp LD decay profile of the inbred strains is very attractive, so that if one can be sure that a QTL is segregating in a region (for example, from an F2 cross), then an analysis of inbred strains across the region may identify the gene in some cases. In addition, there is a considerable saving in genotyping costs.This discussion suggests there is a strong case for designing and constructing a large population of inbred lines that contains a high density of recombinants and where the lines are independent, in the sense that they do not share any recombination events. This was the motivation behind the Collaborative Cross (CC) (Churchill et al. 2004; Threadgill et al. 2002). Currently, over 400 CC recombinant inbred lines are being bred at Oak Ridge National Laboratary, USA, and Tel Aviv University, Israel, in a collaboration funded by the U.S. Department of Energy (DOE), The Ellison Foundation, National Institutes of Health (NIH), and The Wellcome Trust. The lines are descended from eight genetically diverse founder strains (including three wild-derived strains) and are currently between generation 6 and 10 of inbreeding. They will be fully inbred in about 4 years but are already sufficiently advanced that a pilot project is planned to assess their use for QTL mapping. The CC is almost ideal, except that the expected QTL mapping resolution is about 1 Mb (Valdar et al. 2005)—not quite single gene.The second issue is that the haplotype structure of the classical laboratory strains of mice is not ideal for complex trait analysis. To make best use of the mouse, we need complete genome sequences of the common laboratory strains. Already we know from partial resequencing of 16 strains that the so-called classical strains share only a fraction of the genetic variation segregating in wild-derived strains (Frazer et al. 2007; Yang et al. 2007) so there should be fewer QTLs in a study solely using classical strains. Moreover, their genomes are not independent; the pattern of haplotype sharing is not random across the genome, causing the problem of false-positive QTLs alluded to above. By contrast, the haplotype structure of the CC is necessarily random, very rare variants should not exist, and the founding strains contain more genetic variation than is present in some human populations (Roberts et al. 2007). We do not yet know much about the haplotype structure or origins of commercial outbreds.The third issue is how we should relate discoveries made in the mouse to human disease. It should be reiterated that many studies in mice are not feasible in humans, such as the elucidation of gene networks from most tissues and developmental stages. As it is extremely unlikely that identical causative polymorphisms will be segregating in both species, we should not necessarily expect the same genes to be identified in WGA, although there are examples where this is the case, such as cancer modifiers common to mice and humans (Ruivenkamp et al. 2002). Nevertheless, we should expect the same pathways to be implicated (Emilsson et al. 2008). However, at present the functional annotation of both species is incomplete. Therefore, alongside the development of suitable mapping populations, we need comprehensive annotation of the gene networks in the mouse, and how they vary during development, between tissues and between genetic backgrounds. There is not space here to say more except that it will require a concerted international effort, integrating and extending existing online resources.
Authors: Binnaz Yalcin; Saffron A G Willis-Owen; Jan Fullerton; Anjela Meesaq; Robert M Deacon; J Nicholas P Rawlins; Richard R Copley; Andrew P Morris; Jonathan Flint; Richard Mott Journal: Nat Genet Date: 2004-10-17 Impact factor: 38.330
Authors: Gary A Churchill; David C Airey; Hooman Allayee; Joe M Angel; Alan D Attie; Jackson Beatty; William D Beavis; John K Belknap; Beth Bennett; Wade Berrettini; Andre Bleich; Molly Bogue; Karl W Broman; Kari J Buck; Ed Buckler; Margit Burmeister; Elissa J Chesler; James M Cheverud; Steven Clapcote; Melloni N Cook; Roger D Cox; John C Crabbe; Wim E Crusio; Ariel Darvasi; Christian F Deschepper; R W Doerge; Charles R Farber; Jiri Forejt; Daniel Gaile; Steven J Garlow; Hartmut Geiger; Howard Gershenfeld; Terry Gordon; Jing Gu; Weikuan Gu; Gerald de Haan; Nancy L Hayes; Craig Heller; Heinz Himmelbauer; Robert Hitzemann; Kent Hunter; Hui-Chen Hsu; Fuad A Iraqi; Boris Ivandic; Howard J Jacob; Ritsert C Jansen; Karl J Jepsen; Dabney K Johnson; Thomas E Johnson; Gerd Kempermann; Christina Kendziorski; Malak Kotb; R Frank Kooy; Bastien Llamas; Frank Lammert; Jean-Michel Lassalle; Pedro R Lowenstein; Lu Lu; Aldons Lusis; Kenneth F Manly; Ralph Marcucio; Doug Matthews; Juan F Medrano; Darla R Miller; Guy Mittleman; Beverly A Mock; Jeffrey S Mogil; Xavier Montagutelli; Grant Morahan; David G Morris; Richard Mott; Joseph H Nadeau; Hiroki Nagase; Richard S Nowakowski; Bruce F O'Hara; Alexander V Osadchuk; Grier P Page; Beverly Paigen; Kenneth Paigen; Abraham A Palmer; Huei-Ju Pan; Leena Peltonen-Palotie; Jeremy Peirce; Daniel Pomp; Michal Pravenec; Daniel R Prows; Zhonghua Qi; Roger H Reeves; John Roder; Glenn D Rosen; Eric E Schadt; Leonard C Schalkwyk; Ze'ev Seltzer; Kazuhiro Shimomura; Siming Shou; Mikko J Sillanpää; Linda D Siracusa; Hans-Willem Snoeck; Jimmy L Spearow; Karen Svenson; Lisa M Tarantino; David Threadgill; Linda A Toth; William Valdar; Fernando Pardo-Manuel de Villena; Craig Warden; Steve Whatley; Robert W Williams; Tim Wiltshire; Nengjun Yi; Dabao Zhang; Min Zhang; Fei Zou Journal: Nat Genet Date: 2004-11 Impact factor: 38.330
Authors: William Valdar; Leah C Solberg; Dominique Gauguier; Stephanie Burnett; Paul Klenerman; William O Cookson; Martin S Taylor; J Nicholas P Rawlins; Richard Mott; Jonathan Flint Journal: Nat Genet Date: 2006-07-09 Impact factor: 38.330
Authors: R Hitzemann; B Malmanger; S Cooper; S Coulombe; C Reed; K Demarest; J Koyner; L Cipp; J Flint; C Talbot; B Rademacher; K Buck; J McCaughran Journal: Genes Brain Behav Date: 2002-11 Impact factor: 3.449
Authors: Claudia A L Ruivenkamp; Tom van Wezel; Carlo Zanon; Alphons P M Stassen; Cestmir Vlcek; Tamás Csikós; Anita M Klous; Nikos Tripodis; Anastassis Perrakis; Lucie Boerrigter; Peter C Groot; Jan Lindeman; Wolter J Mooi; Gerrit A Meijjer; Gert Scholten; Hans Dauwerse; Vaclav Paces; Nico van Zandwijk; Gert Jan B van Ommen; Peter Demant Journal: Nat Genet Date: 2002-06-24 Impact factor: 38.330
Authors: Eleftheria Zeggini; Laura J Scott; Richa Saxena; Benjamin F Voight; Jonathan L Marchini; Tianle Hu; Paul I W de Bakker; Gonçalo R Abecasis; Peter Almgren; Gitte Andersen; Kristin Ardlie; Kristina Bengtsson Boström; Richard N Bergman; Lori L Bonnycastle; Knut Borch-Johnsen; Noël P Burtt; Hong Chen; Peter S Chines; Mark J Daly; Parimal Deodhar; Chia-Jen Ding; Alex S F Doney; William L Duren; Katherine S Elliott; Michael R Erdos; Timothy M Frayling; Rachel M Freathy; Lauren Gianniny; Harald Grallert; Niels Grarup; Christopher J Groves; Candace Guiducci; Torben Hansen; Christian Herder; Graham A Hitman; Thomas E Hughes; Bo Isomaa; Anne U Jackson; Torben Jørgensen; Augustine Kong; Kari Kubalanza; Finny G Kuruvilla; Johanna Kuusisto; Claudia Langenberg; Hana Lango; Torsten Lauritzen; Yun Li; Cecilia M Lindgren; Valeriya Lyssenko; Amanda F Marvelle; Christa Meisinger; Kristian Midthjell; Karen L Mohlke; Mario A Morken; Andrew D Morris; Narisu Narisu; Peter Nilsson; Katharine R Owen; Colin N A Palmer; Felicity Payne; John R B Perry; Elin Pettersen; Carl Platou; Inga Prokopenko; Lu Qi; Li Qin; Nigel W Rayner; Matthew Rees; Jeffrey J Roix; Anelli Sandbaek; Beverley Shields; Marketa Sjögren; Valgerdur Steinthorsdottir; Heather M Stringham; Amy J Swift; Gudmar Thorleifsson; Unnur Thorsteinsdottir; Nicholas J Timpson; Tiinamaija Tuomi; Jaakko Tuomilehto; Mark Walker; Richard M Watanabe; Michael N Weedon; Cristen J Willer; Thomas Illig; Kristian Hveem; Frank B Hu; Markku Laakso; Kari Stefansson; Oluf Pedersen; Nicholas J Wareham; Inês Barroso; Andrew T Hattersley; Francis S Collins; Leif Groop; Mark I McCarthy; Michael Boehnke; David Altshuler Journal: Nat Genet Date: 2008-03-30 Impact factor: 38.330
Authors: Soha N Yazbek; David A Buchner; Jonathan M Geisinger; Lindsay C Burrage; Sabrina H Spiezio; Gabriel E Zentner; Chang-Wen Hsieh; Peter C Scacheri; Colleen M Croniger; Joseph H Nadeau Journal: Genome Res Date: 2011-04-19 Impact factor: 9.043
Authors: David L Aylor; William Valdar; Wendy Foulds-Mathes; Ryan J Buus; Ricardo A Verdugo; Ralph S Baric; Martin T Ferris; Jeff A Frelinger; Mark Heise; Matt B Frieman; Lisa E Gralinski; Timothy A Bell; John D Didion; Kunjie Hua; Derrick L Nehrenberg; Christine L Powell; Jill Steigerwalt; Yuying Xie; Samir N P Kelada; Francis S Collins; Ivana V Yang; David A Schwartz; Lisa A Branstetter; Elissa J Chesler; Darla R Miller; Jason Spence; Eric Yi Liu; Leonard McMillan; Abhishek Sarkar; Jeremy Wang; Wei Wang; Qi Zhang; Karl W Broman; Ron Korstanje; Caroline Durrant; Richard Mott; Fuad A Iraqi; Daniel Pomp; David Threadgill; Fernando Pardo-Manuel de Villena; Gary A Churchill Journal: Genome Res Date: 2011-03-15 Impact factor: 9.043
Authors: Patrick T McGrath; Matthew V Rockman; Manuel Zimmer; Heeun Jang; Evan Z Macosko; Leonid Kruglyak; Cornelia I Bargmann Journal: Neuron Date: 2009-03-12 Impact factor: 17.173