EuroPhenome (http://www.europhenome.org) and EMPReSS (http://empress.har.mrc.ac.uk/) form an integrated resource to provide access to data and procedures for mouse phenotyping. EMPReSS describes 96 Standard Operating Procedures for mouse phenotyping. EuroPhenome contains data resulting from carrying out EMPReSS protocols on four inbred laboratory mouse strains. As well as web interfaces, both resources support web services to enable integration with other mouse phenotyping and functional genetics resources, and are committed to initiatives to improve integration of mouse phenotype databases. EuroPhenome will be the repository for a recently initiated effort to carry out large-scale phenotyping on a large number of knockout mouse lines (EUMODIC).
EuroPhenome (http://www.europhenome.org) and EMPReSS (http://empress.har.mrc.ac.uk/) form an integrated resource to provide access to data and procedures for mouse phenotyping. EMPReSS describes 96 Standard Operating Procedures for mouse phenotyping. EuroPhenome contains data resulting from carrying out EMPReSS protocols on four inbred laboratory mouse strains. As well as web interfaces, both resources support web services to enable integration with other mouse phenotyping and functional genetics resources, and are committed to initiatives to improve integration of mouse phenotype databases. EuroPhenome will be the repository for a recently initiated effort to carry out large-scale phenotyping on a large number of knockout mouse lines (EUMODIC).
With an extensive knowledge of the gene content of mammalian genomes becoming a reality through the completion of a number of genome sequences including mouse (1), focus has shifted to the study of gene function in these organisms. The aim in this postgenomic era is to link genomic and phenotype information systematically to allow a deeper understanding of the processes leading from genomic changes to altered phenotype and disease. Mouse mutants represent one of the most powerful tools in this endeavour (2). To facilitate these aims, a number of projects are underway ranging from the production of large collections of mouse point and knockout mutants (3,4) through to the establishment of large-scale phenotype characterization projects that aim to provide a phenotype assessment of these lines (5) (EUMODIC—http://www.eumodic.eu). The ultimate challenge will be to interpret this data using computational methods.A number of critical steps must be achieved for the comprehensive analysis and interpretation of this data to be possible. First, phenotype data on both normal inbred strains and mutant strains must be collected in community databases with open access. Second, the phenotype data must be generated using comprehensive phenotyping platforms that provide standardized methods, so that the results can be compared between laboratories and across time (5). Third, structured descriptions of the phenotypes must be used to allow the data to be interpreted in a consistent manner (6–8).EMPReSS (European Mouse Phenotyping Resource for Standardized Screens) (5,9) is the product of Eumorphia (http://www.eumorphia.org), the largest programme to date to develop standardized phenotyping protocols. EMPReSS is a comprehensive database of validated Standard Operating Procedures (SOPs) for screens to determine the phenotype of a mouse. It incorporates 96 SOPs that cover all of the main body systems including: clinical chemistry, hormonal and metabolic systems, cardiovascular, allergy and infection, sensory function, neurological and behavioural function, cancer, and bone and cartilage systems. In addition, there are generic SOPs for histology, necropsy, pathology and gene expression. EMPReSS is a platform of individual tests, but these can also be grouped together into phenotyping pipelines.EuroPhenome (http://www.europhenome.org) was instigated as an online mouse phenotyping resource to store baseline data generated from the application of EMPReSS SOPs. Data was collected by individual work packages in the Eumorphia project for purposes of SOP validation and to provide baseline information against which phenotypes of experimental animals could be compared. It currently includes data from 24 EMPReSS SOPs, representing the measurement of 132 parameters across four inbred mouse strains in up to four different laboratories.
EUROPHENOME
EuroPhenome is a mySQL relational database (http://www.mysql.com/) providing baseline data on four inbred mouse strains (C57BL/6J, C3H/HeBFeJ, BALB/cByJ and 129/SvPas). Data were obtained by researchers working in the Eumorphia project. The data are subdivided into five phenotyping domains corresponding to body systems (see Table 1).
Table 1.
Summary of datasets included in Europhenome—August 2007
Phenotyping domain
SOP name
Number of parameters measured
Behaviour and cognition
Open field
7
Grip strength
3
Y-maze
4
Acoustic startle and pre-pulse inhibition specific to sensory systems
3
Tail flick
1
Clinical and haematology
Differential blood count
2
Clinical chemistry parameters
27
Haematology tests
7
Hormonal and metabolic systems
Metabolic cages
1
Dexa scan analysis
5
IPGTT (intra-peritoneal glucose tolerance test)
2
Cold test
2
Meal tolerance test
4
Oral glucose tolerance test
2
Cardiovascular
Non-invasive blood pressure and heart rate
2
Invasive blood pressure
3
Invasive left-ventricular haemodynamics
5
Surface electrocardiography
5
Echocardiography
20
Allergy and infectious diseases
FACS analysis of peripheral blood cells
9
Determination of immunoglobulin concentration in serum
6
Quantification of TNFa production by PAMP stimulated macrophages
7
Infection of proteose peptone macrophages with Listeria monocytogenes
1
Quantitative measurement of iNOS activity after macrophage stimulation
4
Summary of datasets included in Europhenome—August 2007
Navigating EuroPhenome
The EuroPhenome data browser utilizes PHP and AJAX and gives the user a variety of ways to visualize and search the data, allowing assessment of inter-strain, -gender and -laboratory variation for the various SOPs. The web-accessible data browser (http://www.europhenome.org/browser) allows users to browse through the data via the SOP. SOPs are presented in a tree structure on the left-hand side of the browser. SOPs are clustered under five phenotyping domains, corresponding to the classification in EMPReSS. Opening up a folder corresponding to one of these domains reveals a list of SOP names. Clicking on an SOP name opens a summary page in the main frame of the browser that lists the parameters measured under the selected SOP and mean, median and variation measures over the entire dataset for each parameter (Figure 1). Users can also link directly to the description of the SOP in EMPReSS, to similar measurements in the Mouse Phenome Database (10), and download the data to Excel format, allowing them to carry out their own analyses of the data and to combine it with their own in-house data. Clicking on an individual parameter reveals a more detailed breakdown of parameter means and SDs broken down by age and sex. These values are also represented graphically for easy visual comparison (Figure 2). The left-hand panel also provides a link to Phenostat, which can be used for statistical analysis of datasets (11).
Figure 1.
View of the summary data display for EuroPhenome. The left-hand panel contains the tree-like browser that allows users to search for data derived from individual SOPs in a structured manner. The main panel contains a table displaying Parameter Name, Parameter Measurement Unit, Mean, Median, SD, Standard Error, Minimum and Maximum values, and number of mice measured. Links are also provided to the EMPReSS SOP, download the data in Excel format, and link to the corresponding MPD dataset. Clicking on the name of an individual SOP in the left-hand panel takes the user to a more detailed view (Figure 2). The example shown is for the SOP ‘Clinical chemistry parameters’.
Figure 2.
View of the detailed data display for an individual parameter in EuroPhenome. The graphical display at the top of the main panel illustrates mean values of the parameter for each mouse strain tested, separated into male and female. Error bars represent SDs. The table below gives Mean, SD, Standard Error and mouse numbers for each strain subdivided into sex and, where appropriate, age groups. Clicking on the link ‘To see this data split across centres click here’ expands this table to show results obtained in the different Eumorphia laboratories that carried out the test. The pull-down menu at the top right allows navigation between the different parameters measured by a given SOP, where appropriate.
View of the summary data display for EuroPhenome. The left-hand panel contains the tree-like browser that allows users to search for data derived from individual SOPs in a structured manner. The main panel contains a table displaying Parameter Name, Parameter Measurement Unit, Mean, Median, SD, Standard Error, Minimum and Maximum values, and number of mice measured. Links are also provided to the EMPReSS SOP, download the data in Excel format, and link to the corresponding MPD dataset. Clicking on the name of an individual SOP in the left-hand panel takes the user to a more detailed view (Figure 2). The example shown is for the SOP ‘Clinical chemistry parameters’.View of the detailed data display for an individual parameter in EuroPhenome. The graphical display at the top of the main panel illustrates mean values of the parameter for each mouse strain tested, separated into male and female. Error bars represent SDs. The table below gives Mean, SD, Standard Error and mouse numbers for each strain subdivided into sex and, where appropriate, age groups. Clicking on the link ‘To see this data split across centres click here’ expands this table to show results obtained in the different Eumorphia laboratories that carried out the test. The pull-down menu at the top right allows navigation between the different parameters measured by a given SOP, where appropriate.
EuroPhenomeMart
In addition to this browsing interface, we have also implemented a MartView/BioMart (http://www.biomart.org/) interface to the database, which can be accessed from a link on the left-hand panel of the browser. This allows export of selected fields to HTML pages, and in comma or tab separated formats.
Implementation of web services in EuroPhenome
Open standard web services technology such as Simple Object Access Protocol (SOAP, http://www.w3.org/TR/soap) and Web Service Definition Language (WSDL, http://www.w3.org/TR/wsdl) enable programmers to build complex applications without the need to install and maintain the database, and facilitates integration and interoperability between bioinformatics applications and the data they require.Web services have been implemented in EuroPhenome to expose its data in a programmatically accessible manner. We are currently using RPC/Encoding style WSDL and will be providing Document/Literal style soon, as recommended by the Web Services Interoperability Organization (WS-I, http://www.ws-i.org). The URL for accessing the EuroPhenome WSDL file is http://www.europhenome.org/europhenome.wsdl.
EMPRESS—RECENT DEVELOPMENTS
The original implementation of the EMPReSS SOP site is described in Green et al. (9) and its scientific genesis in (5). The implementation of EMPReSS has subsequently been redeveloped and the browser is now a web 2.0 resource combining JAVA, XML, AJAX and PHP for the visualization and searching of the SOPs. This functionality now enables bioinformaticians to access the data in the eXist XML database (http://exist.sourceforge.net/) via web services. The EMPReSS browser has been extended to allow users to select data generated from SOPs in the two EMPReSSslim phenotyping pipelines (see below). The pipelines are comprehensive sequences of SOPs that are being utilized for high-throughput phenotyping in the EUMODIC project. The interface also allows direct access to available data generated using a particular SOP in EuroPhenome (Figure 3).
Figure 3.
View of the SOP display in the EMPReSS browser. The tree-like display in the left-hand panel provides for easy navigation to individual SOPs, which are classified under the body system they investigate. The right hand panel displays the SOP itself. The SOP document is subdivided into 11 sections as described previously (9) plus an additional section providing a link to EuroPhenome. The view also provides status information about the SOP including a unique document ID, revision number (previous versions of SOPs are retained as appendices to the current version; a new version is only recorded is there has been a substantive change), review status and the validation level of the SOP [see (9) for details].
View of the SOP display in the EMPReSS browser. The tree-like display in the left-hand panel provides for easy navigation to individual SOPs, which are classified under the body system they investigate. The right hand panel displays the SOP itself. The SOP document is subdivided into 11 sections as described previously (9) plus an additional section providing a link to EuroPhenome. The view also provides status information about the SOP including a unique document ID, revision number (previous versions of SOPs are retained as appendices to the current version; a new version is only recorded is there has been a substantive change), review status and the validation level of the SOP [see (9) for details].The EMPReSS SOPs have also been annotated with high level Mammalian Phenotype (MP) ontology terms (8) and an ontology tree browser interface developed to facilitate the identification of the annotated SOPs. This annotation currently uses top-level MP terms, but we aim to develop it further in depth and diversity.The ontology tree browser utilizes the Ontology Lookup Service (OLS) (http://www.ebi.ac.uk/ontology-lookup/) (12) at the EBI to provide the ontology data to dynamically generate an AJAX-driven JavaScript and DHTML ontology tree menu. This allows the user to search through the SOPs by specific ontology term. This system works in combination with Memcached (http://www.danga.com/memcached/), a high-performance, distributed memory object caching system, to negate any potential slow networking issues inherent in web service access. Implementing the OLS system allows the flexibility of selecting any of the 48 currently stored ontologies to dynamically generate an ontology tree and thus search through the SOPs using those terms. It also facilitates an AJAX-driven autocompletion search functionality, providing an easy way to hunt for a specific term within an ontology and use it to search within the SOPs. The results of a search are provided as a list, enabling the user to load the SOP of interest, and it also provides a link out to the MGI database of mutant lines annotated with the corresponding MP term.
FUTURE DIRECTIONS
The major future objective for EuroPhenome is to move from the collection of phenotyping data on inbred strains to mutant strains. EuroPhenome will be the primary repository and query tool for data generated from the EUMODIC project, which will phenotype up to 650 knockout lines generated by the EUCOMM project (http://www.eucomm.org). EUMODIC has developed a phenotyping screen called EMPReSSslim, which is structured for comprehensive, primary, high-throughput phenotyping of large numbers of mice. The primary phenotype assessment using EMPReSSslim will be undertaken in four large-scale phenotyping centres at the GSF, Germany; ICS, France; MRC Harwell, UK and the Sanger Institute, UK. Data generated in the centres will be submitted to EuroPhenome as XML files. It is envisaged that the EuroPhenomeXML Schema could be used more generally for the exchange of phenotyping data.EuroPhenome is a core member of the Mouse Phenotype Database Integration Consortium (MPDIC) (13) and CASIMIR (Coordination and Sustainability of International Mouse Informatics Resources; http://www.casimir.org.uk/). As such, one of its primary aims over the next 1–2 years is to integrate with many other data sources including Ensembl (http://www.ensembl.org), MGI (http://www.informatics.jax.org/), RIKEN (http://www.gsc.riken.go.jp/Mouse/), EUCOMM (http://www.eucomm.org), EURExpress (http://www.eurexpress.org) and ArrayExpress (http://www.ebi.ac.uk/arrayexpress/) to enable intelligent querying of the data. We already support some integration with the Mouse Phenome Database (see above) and have shared our SOPML with RIKEN. To develop this integration, we will in part utilize existing technologies such as BioMart and DAS, but will also investigate alternative methods. Of particular importance in this effort will be the development of new standards, as described elsewhere (13). Particular areas for development include:Standards for phenotype data transfer. Building on the XML schema being developed for data capture within EUMODIC, and on other initiative to describe phenotypes in XML, such as PhenoXML (http://www.fruitfly.org/~cjm/obd/pheno-xml-xsd.html) a standard format will be developed to facilitate the exchange of phenotype data between sites, enabling integrated analysis and searching of phenotype data across different sites.Standards for describing phenotyping procedures. In collaboration with members of the MPDIC we are currently developing an enhanced version of SOPML, which is currently used to store phenotyping SOPs in EMPReSS, to make it more descriptive and include ontological mark-up. This is intended to act as an international standard for the description, storage and interchange of Phenotyping Procedures.Using ontologies to describe phenotyping data. Data from phenotyping experiments require to be described in a value-neutral way, i.e. they should not be annotated as ‘abnormal’ until proper controls and data filtering methods have been applied. They also need to be described at a more detailed level than is available using MP (8). While taking note of future developments in MP, we will also develop the use of combinatorial approaches to phenotype description that include descriptions of the underlying experimental protocol (6,7) combined with recent developments in the PATO ontology (http://www.bioontology.org/wiki/index.php/PATO:Main_Page) and integrate these into the description frameworks mentioned above to allow for transfer of semantic information about experimental data held in EuroPhenome.A longer-term aim is to open up EuroPhenome-EMPReSS to the broad community as a repository for phenotype data via a linked submission procedure that would acquire information about phenotyping procedure, data, and ontological markup.
Authors: Robert H Waterston; Kerstin Lindblad-Toh; Ewan Birney; Jane Rogers; Josep F Abril; Pankaj Agarwal; Richa Agarwala; Rachel Ainscough; Marina Alexandersson; Peter An; Stylianos E Antonarakis; John Attwood; Robert Baertsch; Jonathon Bailey; Karen Barlow; Stephan Beck; Eric Berry; Bruce Birren; Toby Bloom; Peer Bork; Marc Botcherby; Nicolas Bray; Michael R Brent; Daniel G Brown; Stephen D Brown; Carol Bult; John Burton; Jonathan Butler; Robert D Campbell; Piero Carninci; Simon Cawley; Francesca Chiaromonte; Asif T Chinwalla; Deanna M Church; Michele Clamp; Christopher Clee; Francis S Collins; Lisa L Cook; Richard R Copley; Alan Coulson; Olivier Couronne; James Cuff; Val Curwen; Tim Cutts; Mark Daly; Robert David; Joy Davies; Kimberly D Delehaunty; Justin Deri; Emmanouil T Dermitzakis; Colin Dewey; Nicholas J Dickens; Mark Diekhans; Sheila Dodge; Inna Dubchak; Diane M Dunn; Sean R Eddy; Laura Elnitski; Richard D Emes; Pallavi Eswara; Eduardo Eyras; Adam Felsenfeld; Ginger A Fewell; Paul Flicek; Karen Foley; Wayne N Frankel; Lucinda A Fulton; Robert S Fulton; Terrence S Furey; Diane Gage; Richard A Gibbs; Gustavo Glusman; Sante Gnerre; Nick Goldman; Leo Goodstadt; Darren Grafham; Tina A Graves; Eric D Green; Simon Gregory; Roderic Guigó; Mark Guyer; Ross C Hardison; David Haussler; Yoshihide Hayashizaki; LaDeana W Hillier; Angela Hinrichs; Wratko Hlavina; Timothy Holzer; Fan Hsu; Axin Hua; Tim Hubbard; Adrienne Hunt; Ian Jackson; David B Jaffe; L Steven Johnson; Matthew Jones; Thomas A Jones; Ann Joy; Michael Kamal; Elinor K Karlsson; Donna Karolchik; Arkadiusz Kasprzyk; Jun Kawai; Evan Keibler; Cristyn Kells; W James Kent; Andrew Kirby; Diana L Kolbe; Ian Korf; Raju S Kucherlapati; Edward J Kulbokas; David Kulp; Tom Landers; J P Leger; Steven Leonard; Ivica Letunic; Rosie Levine; Jia Li; Ming Li; Christine Lloyd; Susan Lucas; Bin Ma; Donna R Maglott; Elaine R Mardis; Lucy Matthews; Evan Mauceli; John H Mayer; Megan McCarthy; W Richard McCombie; Stuart McLaren; Kirsten McLay; John D McPherson; Jim Meldrim; Beverley Meredith; Jill P Mesirov; Webb Miller; Tracie L Miner; Emmanuel Mongin; Kate T Montgomery; Michael Morgan; Richard Mott; James C Mullikin; Donna M Muzny; William E Nash; Joanne O Nelson; Michael N Nhan; Robert Nicol; Zemin Ning; Chad Nusbaum; Michael J O'Connor; Yasushi Okazaki; Karen Oliver; Emma Overton-Larty; Lior Pachter; Genís Parra; Kymberlie H Pepin; Jane Peterson; Pavel Pevzner; Robert Plumb; Craig S Pohl; Alex Poliakov; Tracy C Ponce; Chris P Ponting; Simon Potter; Michael Quail; Alexandre Reymond; Bruce A Roe; Krishna M Roskin; Edward M Rubin; Alistair G Rust; Ralph Santos; Victor Sapojnikov; Brian Schultz; Jörg Schultz; Matthias S Schwartz; Scott Schwartz; Carol Scott; Steven Seaman; Steve Searle; Ted Sharpe; Andrew Sheridan; Ratna Shownkeen; Sarah Sims; Jonathan B Singer; Guy Slater; Arian Smit; Douglas R Smith; Brian Spencer; Arne Stabenau; Nicole Stange-Thomann; Charles Sugnet; Mikita Suyama; Glenn Tesler; Johanna Thompson; David Torrents; Evanne Trevaskis; John Tromp; Catherine Ucla; Abel Ureta-Vidal; Jade P Vinson; Andrew C Von Niederhausern; Claire M Wade; Melanie Wall; Ryan J Weber; Robert B Weiss; Michael C Wendl; Anthony P West; Kris Wetterstrand; Raymond Wheeler; Simon Whelan; Jamey Wierzbowski; David Willey; Sophie Williams; Richard K Wilson; Eitan Winter; Kim C Worley; Dudley Wyman; Shan Yang; Shiaw-Pyng Yang; Evgeny M Zdobnov; Michael C Zody; Eric S Lander Journal: Nature Date: 2002-12-05 Impact factor: 49.962
Authors: Eli Reuveni; Valeria Carola; Mumna Al Banchaabouchi; Nadia Rosenthal; John M Hancock; Cornelius Gross Journal: Mamm Genome Date: 2007-08-03 Impact factor: 2.957
Authors: Eain C J Green; Georgios V Gkoutos; Heena V Lad; Andrew Blake; Joseph Weekes; John M Hancock Journal: Bioinformatics Date: 2005-04-12 Impact factor: 6.937
Authors: Warren Ladiges; Yuji Ikeno; Laura Niedernhofer; Richard A McIndoe; Marcia A Ciol; Jerry Ritchey; Denny Liggitt Journal: J Gerontol A Biol Sci Med Sci Date: 2015-08-04 Impact factor: 6.053
Authors: Gaynor Miller; Monica Neilan; Ruth Chia; Nabeia Gheryani; Natalie Holt; Annabelle Charbit; Sara Wells; Valter Tucci; Zuzanne Lalanne; Paul Denny; Elizabeth M C Fisher; Michael Cheeseman; Graham N Askew; T Neil Dear Journal: PLoS One Date: 2010-02-09 Impact factor: 3.240
Authors: John M Hancock; Ann-Marie Mallon; Tim Beck; Georgios V Gkoutos; Chris Mungall; Paul N Schofield Journal: Mamm Genome Date: 2009-08-02 Impact factor: 2.957
Authors: Hugh Morgan; Tim Beck; Andrew Blake; Hilary Gates; Niels Adams; Guillaume Debouzy; Sophie Leblanc; Christoph Lengger; Holger Maier; David Melvin; Hamid Meziane; Dave Richardson; Sara Wells; Jacqui White; Joe Wood; Martin Hrabé de Angelis; Steve D M Brown; John M Hancock; Ann-Marie Mallon Journal: Nucleic Acids Res Date: 2009-11-23 Impact factor: 16.971