| Literature DB >> 26214591 |
Martin Hrabě de Angelis1,2,3, George Nicholson4, Mohammed Selloum5,6,7,8, Jacqui White9, Hugh Morgan10, Ramiro Ramirez-Solis9, Tania Sorg5,6,7,8, Sara Wells10, Helmut Fuchs1, Martin Fray10, Chris Holmes4, Karen P Steel9, Yann Herault5,11,6,7,8, Valérie Gailus-Durner1, Ann-Marie Mallon10, Steve Dm Brown10, David J Adams9, Niels C Adams9, Thure Adler1,12, Antonio Aguilar-Pimentel1,13, Dalila Ali-Hadji5,6,7,8, Gregory Amann5,6,7,8, Philippe André5,6,7,8, Sarah Atkins10, Aurelie Auburtin5,6,7,8, Abdel Ayadi5,6,7,8, Julien Becker5,6,7,8, Lore Becker1,14, Elodie Bedu5,6,7,8, Raffi Bekeredjian1,15, Marie-Christine Birling5,6,7,8, Andrew Blake10, Joanna Bottomley9, Mike Bowl10, Véronique Brault11,6,7,8, Dirk H Busch12, James N Bussell9, Julia Calzada-Wack16, Heather Cater10, Marie-France Champy5,6,7,8, Philippe Charles5,6,7,8, Claire Chevalier11,6,7,8, Francesco Chiani17, Gemma F Codner10, Roy Combe5,6,7,8, Roger Cox10, Emilie Dalloneau11,6,7,8, André Dierich5,6,7,8, Armida Di Fenza10, Brendan Doe17, Arnaud Duchon11,6,7,8, Oliver Eickelberg18, Chris T Esapa10, Lahcen El Fertak5,6,7,8, Tanja Feigel10, Irina Emelyanova10, Jeanne Estabel9, Jack Favor19, Ann Flenniken20, Alessia Gambadoro17, Lilian Garrett21, Hilary Gates10, Anna-Karin Gerdin9, George Gkoutos22, Simon Greenaway10, Lisa Glasl21, Patrice Goetz5,6,7,8, Isabelle Goncalves Da Cruz5,6,7,8, Alexander Götz18, Jochen Graw21, Alain Guimond5,6,7,8, Wolfgang Hans1, Geoff Hicks23, Sabine M Hölter21, Heinz Höfler14, John M Hancock10, Robert Hoehndorf24, Tertius Hough10, Richard Houghton9, Anja Hurt1, Boris Ivandic1,15, Hughes Jacobs5,6,7,8, Sylvie Jacquot5,6,7,8, Nora Jones20, Natasha A Karp9, Hugo A Katus1,15, Sharon Kitchen10, Tanja Klein-Rodewald16, Martin Klingenspor1,25, Thomas Klopstock1,14, Valerie Lalanne5,6,7,8, Sophie Leblanc5,6,7,8, Christoph Lengger1, Elise le Marchand5,6,7,8, Tonia Ludwig1, Aline Lux5,6,7,8, Colin McKerlie26,27, Holger Maier1, Jean-Louis Mandel5,11,6,7,8, Susan Marschall1, Manuel Mark5,11,6,7,8, David G Melvin9, Hamid Meziane5,6,7,8, Kateryna Micklich1, Christophe Mittelhauser5,6,7,8, Laurent Monassier5,6,7,8, David Moulaert5,6,7,8, Stéphanie Muller5,6,7,8, Beatrix Naton1, Frauke Neff16, Patrick M Nolan10, Lauryl Mj Nutter27, Markus Ollert1,13, Guillaume Pavlovic5,6,7,8, Natalia S Pellegata16, Emilie Peter5,6,7,8, Benoit Petit-Demoulière5,6,7,8, Amanda Pickard10, Christine Podrini9, Paul Potter10, Laurent Pouilly5,6,7,8, Oliver Puk21, David Richardson9, Stephane Rousseau5,6,7,8, Leticia Quintanilla-Fend16, Mohamed M Quwailid10, Ildiko Racz1,28, Birgit Rathkolb1,29, Fabrice Riet5,6,7,8, Janet Rossant27, Michel Roux5,11,6,7,8, Jan Rozman1,25, Ed Ryder9, Jennifer Salisbury9, Luis Santos10, Karl-Heinz Schäble1, Evelyn Schiller1, Anja Schrewe1, Holger Schulz18, Ralf Steinkamp1, Michelle Simon10, Michelle Stewart10, Claudia Stöger1, Tobias Stöger18, Minxuan Sun21, David Sunter9, Lydia Teboul10, Isabelle Tilly5,6,7,8, Glauco P Tocchini-Valentini17, Monica Tost16, Irina Treise1, Laurent Vasseur5,6,7,8, Emilie Velot11,6,7,8, Daniela Vogt-Weisenhorn21, Christelle Wagner5,11,6,7,8, Alison Walling10, Bruno Weber5,6,7,8, Olivia Wendling5,11,6,7,8, Henrik Westerberg10, Monja Willershäuser1, Eckhard Wolf29,1, Anne Wolter5,6,7,8, Joe Wood10, Wolfgang Wurst21,2,30,31, Ali Önder Yildirim18, Ramona Zeh1, Andreas Zimmer1,28, Annemarie Zimprich21.
Abstract
The function of the majority of genes in the mouse and human genomes remains unknown. The mouse embryonic stem cell knockout resource provides a basis for the characterization of relationships between genes and phenotypes. The EUMODIC consortium developed and validated robust methodologies for the broad-based phenotyping of knockouts through a pipeline comprising 20 disease-oriented platforms. We developed new statistical methods for pipeline design and data analysis aimed at detecting reproducible phenotypes with high power. We acquired phenotype data from 449 mutant alleles, representing 320 unique genes, of which half had no previous functional annotation. We captured data from over 27,000 mice, finding that 83% of the mutant lines are phenodeviant, with 65% demonstrating pleiotropy. Surprisingly, we found significant differences in phenotype annotation according to zygosity. New phenotypes were uncovered for many genes with previously unknown function, providing a powerful basis for hypothesis generation and further investigation in diverse systems.Entities:
Mesh:
Year: 2015 PMID: 26214591 PMCID: PMC4564951 DOI: 10.1038/ng.3360
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1aEffect size versus sample size
Detectable standardized effect size, d, as a function of sample size, under a variety of experimental workflows and analysis approaches (identified in legend). The two qualitative design choices under consideration were: whether mutant animals were phenotyped across multiple days with four animals per day, or all on a single day; and whether baseline animals were phenotyped on the same day(s) as mutants (i.e. whether the mutants were accompanied). Two analytical approaches were compared: analysis of all baseline data (all data); versus analysis restricted to baseline data from animals phenotyped on the same day(s) as mutants (accompanying data only). Calculations were based on attaining 80% power while controlling the FDR at 5%. The variance components used in the power calculations were taken as the average estimates across all parameters and procedures: the variance proportion for day effect was 0.18, for the litter effect 0.12 and for the residual effect 0.69 (Supplementary Figure 2 shows similar plots for procedure-specific variance components).
Figure 3Heatmap of annotations of reference lines
Reference line comparison of annotations across centres. Colours represent scaled genotype effect (posterior median / SD), with blue/red indicating a decreased/increased mutant phenotype relative to baseline animals. Significant annotations (FDR < 5%) are indicated by a black outline around the corresponding rectangle.
Figure 4Heatmap of annotations of complete dataset
Heatmap of annotations. Colours represent scaled genotype effect (posterior median / SD), with blue/red indicating a decreased/increased mutant phenotype relative to baseline animals. Significant annotations (FDR < 5%) are indicated by a black outline around the corresponding rectangle. Labels for non-EUCOMM lines are in red. For legibility, the heatmap only displays a subset of parameters for those lines with at least three annotations.
Figure 1bHistogram of number of annotations per line
Histogram of number of annotations per line, with each bar split by colour into counts arising from homozygous and heterozygous lines.
Figure 1cHistogram of number of annotation in each top level MP term
Histogram of number of annotations within each top-level MP ontology term, with each bar split by colour into numbers arising from mutant lines with or without annotations in MGI.
Figure 2Phenotyping variance
Comparison of estimated variance components across centres. Posterior median (with error bars indicating 95% credible intervals) of total phenotypic SD (top panel), and proportions of variance (bottom three panels), are shown for each quantitative parameter, labelled top, within each test, labelled bottom. For visual comparison the total phenotypic SDs at each test were scaled multiplicatively to a mean of 1.
Figure 5Phenotyping similarity
Classification of EUMODIC-MGI gene pairs into matched or unmatched on the basis of phenotype similarity. The Receiver Operating Characteristic (ROC) curve plots the proportion of (EUMODIC-MGI) matched gene pairs correctly classified as matched against the proportion of unmatched gene pairs incorrectly classified as matched, as the phenotype-similarity threshold is varied (ROC area under curve 0.674).
Figure 6Analysis of genes with no prior annotations
The Venn diagrams illustrate the distribution of genes with relevant phenotype hits in three disease areas – (a) bone and skeleton; (b) metabolism; (c) neurological and behaviour. For each area, we identified combinations of tests, where a phenotype hit would be indicative of the relevant disease correlate and assigned genes accordingly. A total of 94 genes were identified across the three disease areas.