Literature DB >> 35561194

OMAMO: orthology-based alternative model organism selection.

Alina Nicheperovich1, Adrian M Altenhoff2,3, Christophe Dessimoz3,4,5,6, Sina Majidian3,4.   

Abstract

SUMMARY: The conservation of pathways and genes across species has allowed scientists to use non-human model organisms to gain a deeper understanding of human biology. However, the use of traditional model systems such as mice, rats and zebrafish is costly, time-consuming and increasingly raises ethical concerns, which highlights the need to search for less complex model organisms. Existing tools only focus on the few well-studied model systems, most of which are complex animals. To address these issues, we have developed Orthologous Matrix and Alternative Model Organism (OMAMO), a software and a web service that provides the user with the best non-complex organism for research into a biological process of interest based on orthologous relationships between human and the species. The outputs provided by OMAMO were supported by a systematic literature review.
AVAILABILITY AND IMPLEMENTATION: https://omabrowser.org/omamo/, https://github.com/DessimozLab/omamo. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2022        PMID: 35561194      PMCID: PMC9113245          DOI: 10.1093/bioinformatics/btac163

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


1 Introduction

Model organisms are non-human species used in human biomedical research to study development, gene regulation and other cellular processes because they are relatively fast-growing, inexpensive and easy to manipulate. Most importantly, their use has been possible due to the evolutionary conservation of biological processes (Wangler ). Fast-moving progress in comparative genomics has allowed scientists to identify these evolutionary relationships by inferring human orthologs, genes that have diverged due to speciation (Fitch, 1970). Since orthologous genes tend to be functionally conserved and have common gene expression patterns, they are a better basis for model organism selection than other subtypes of homologs, which tend to functionally diverge faster (Altenhoff ; Zheng-Bradley ). Currently used model organisms range from bacteria to complex mammals. The scientific community, however, aims to reduce the use of animals in research due to ethical implications, opting to use less complex organisms where possible. Currently available databases include MARRVEL (Wang ), the Alliance of Genome Resources portal (Alliance of Genome Resources Consortium, 2020), the Monarch Initiative (McMurry ) and MORPHIN (Hwang ). They focus on five to nine ‘traditional’ model organisms, most of which are complex organisms like mouse, rat and zebrafish. Moreover, their scope is restricted to human disease-related research. The only unicellular organisms considered in these databases are fission and budding yeast, whilst abundance of unicellular species in nature and their unique features make it difficult to find other non-complex model organisms for a biological process of interest. To address the challenges above, we created an orthology-based Orthologous Matrix and Alternative Model Organism (OMAMO) database alongside a user-friendly web service. This database helps to select the best non-complex model organism for a biological process. Because the majority of species in the database have not been considered as model systems in the past, OMAMO has the potential to extend the set of organisms used in human biomedical research.

2 Materials and methods

The OMAMO database is created using the OMAMO software that takes advantage of the OMA database of orthologous genes. For a given biological process, the output presents a list of potential model organisms ranked based on their orthologous relationships with human. For each species, pyOMA library was used to extract human orthologs (Altenhoff ) (Supplementary Section S1.1). For each ortholog, pyOMA was used to retrieve Gene Ontology (GO) terms, which provide information about the gene product and can represent one of the following three aspects: molecular function, cellular component and biological process (Gene Ontology Consortium, 2021). Some GO terms are general (e.g. ‘cell division’), whilst others are more specific (‘G2/M transition of mitotic cycle’). To quantify the specificity of a GO term, we used information content (IC) calculated as −log(p), where p is its empirical frequency in the UniProt database (Pesquita, 2017). Thus, more specific GO terms have a higher IC value. The IC values were used to calculate GO-based functional similarity for each orthologous pair (Supplementary Section S1.2). Orthologous pairs with GO-based functional similarity of <0.05 were discarded. This aims to reduce the number of orthologs that only share general GO terms in the output. Consequently, gene pairs from a given species were grouped according to the biological process GO term they share. To maintain sufficient specificity in functional similarity considered, only GO terms with IC ≥ 5 were kept. Finally, for each biological process GO term, species were ranked based on a scoring system, which takes into account the number of orthologs and average GO-based functional similarity across the genes relevant to the biological process. The higher the score, the more suitable an organism is for studying a process of interest. We developed a freely accessible web service for OMAMO (Fig. 1), with the source code publically available. Out of the 50 species currently present in OMAMO, 31 are unicellular eukaryotes and the rest are bacteria. We suggest at least one model organism for 4620 out of 28 923 available biological process GO terms. Since OMAMO web service is integrated into the OMA website, OMAMO will be updated alongside OMA, meaning that the set of organisms will continue to grow and the OMAMO database will include the latest GO annotations.
Fig. 1.

The web service interface. (A) The main browser page of OMAMO. The user can search a GO term (‘DNA repair’) or a GO ID (0006281). (B) The output page gives a list of species ranked based on the score, but the user has the option to sort the output based on the total number of orthologs or the average GO-based functional similarity by clicking on the up-down sorting icon. The user can view orthologs by clicking on the ‘+’ button.

The web service interface. (A) The main browser page of OMAMO. The user can search a GO term (‘DNA repair’) or a GO ID (0006281). (B) The output page gives a list of species ranked based on the score, but the user has the option to sort the output based on the total number of orthologs or the average GO-based functional similarity by clicking on the up-down sorting icon. The user can view orthologs by clicking on the ‘+’ button. To validate our results, we referred to experimental evidence through a systematic literature search on PubMed (Supplementary Section S2). The top five review articles on three of the well-studied organisms in OMA (Dictyostelium discoideum, Neurospora crassa and Schizosaccharomyces pombe) published in 2010–2021 were selected from the search output. Out of all biological processes which have been studied in one of the three organisms, the species of interest was in the top five model organism candidates in 42.6% of respective searches in OMAMO. This indicates that our algorithm is well supported by experimental data found in the literature.

3 Discussion

OMAMO is a freely available database and web service which aims to help scientists exploit alternative model species for human biomedical research. With the limited number of presently used model systems, the scientific community can now benefit from using other organisms, some of which could become model systems for processes that have previously only been studied in animals, leading to a reduction in their use in experimental research. Moreover, this is the first database that provides such a wide range of potential model organisms. Due to the lack of literature on using species presented in OMAMO, the validation of results proved to be challenging. The following step for output validation would be to utilize proposed model species as model systems in wet-lab experiments. In the future, we plan to greatly expand the species set and improve the scoring system by considering sequence similarity, conservation of protein structure and reproduction time. Additionally, we hope to provide unicellular model organisms based on their similarity to traditional model organisms like mouse and fruit fly. Besides, the search query could be extended to include gene names. Click here for additional data file.
  11 in total

1.  Large scale comparison of global gene expression patterns in human and mouse.

Authors:  Xiangqun Zheng-Bradley; Johan Rung; Helen Parkinson; Alvis Brazma
Journal:  Genome Biol       Date:  2010-12-23       Impact factor: 13.583

2.  Semantic Similarity in the Gene Ontology.

Authors:  Catia Pesquita
Journal:  Methods Mol Biol       Date:  2017

3.  Navigating the Phenotype Frontier: The Monarch Initiative.

Authors:  Julie A McMurry; Sebastian Köhler; Nicole L Washington; James P Balhoff; Charles Borromeo; Matthew Brush; Seth Carbon; Tom Conlin; Nathan Dunn; Mark Engelstad; Erin Foster; Jean-Philippe Gourdine; Julius O B Jacobsen; Daniel Keith; Bryan Laraway; Jeremy Nguyen Xuan; Kent Shefchek; Nicole A Vasilevsky; Zhou Yuan; Suzanna E Lewis; Harry Hochheiser; Tudor Groza; Damian Smedley; Peter N Robinson; Christopher J Mungall; Melissa A Haendel
Journal:  Genetics       Date:  2016-08       Impact factor: 4.562

4.  Distinguishing homologous from analogous proteins.

Authors:  W M Fitch
Journal:  Syst Zool       Date:  1970-06

5.  Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information.

Authors:  Julia Wang; Zhandong Liu; Hugo J Bellen; Shinya Yamamoto
Journal:  J Vis Exp       Date:  2019-08-15       Impact factor: 1.355

6.  Resolving the ortholog conjecture: orthologs tend to be weakly, but significantly, more similar in function than paralogs.

Authors:  Adrian M Altenhoff; Romain A Studer; Marc Robinson-Rechavi; Christophe Dessimoz
Journal:  PLoS Comput Biol       Date:  2012-05-17       Impact factor: 4.475

7.  MORPHIN: a web tool for human disease research by projecting model organism biology onto a human integrated gene network.

Authors:  Sohyun Hwang; Eiru Kim; Sunmo Yang; Edward M Marcotte; Insuk Lee
Journal:  Nucleic Acids Res       Date:  2014-05-26       Impact factor: 16.971

Review 8.  Model Organisms Facilitate Rare Disease Diagnosis and Therapeutic Research.

Authors:  Michael F Wangler; Shinya Yamamoto; Hsiao-Tuan Chao; Jennifer E Posey; Monte Westerfield; John Postlethwait; Philip Hieter; Kym M Boycott; Philippe M Campeau; Hugo J Bellen
Journal:  Genetics       Date:  2017-09       Impact factor: 4.562

9.  The Gene Ontology resource: enriching a GOld mine.

Authors: 
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

10.  OMA orthology in 2021: website overhaul, conserved isoforms, ancestral gene order and more.

Authors:  Adrian M Altenhoff; Clément-Marie Train; Kimberly J Gilbert; Ishita Mediratta; Tarcisio Mendes de Farias; David Moi; Yannis Nevers; Hale-Seda Radoykova; Victor Rossier; Alex Warwick Vesztrocy; Natasha M Glover; Christophe Dessimoz
Journal:  Nucleic Acids Res       Date:  2020-11-11       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.