| Literature DB >> 35552456 |
Yannis Nevers1,2, Tamsin E M Jones3, Dushyanth Jyothi4, Bethan Yates3, Meritxell Ferret5, Laura Portell-Silva5, Laia Codo5, Salvatore Cosentino6, Marina Marcet-Houben5,7, Anna Vlasova5,7, Laetitia Poidevin8,9, Arnaud Kress8,9, Mark Hickman10, Emma Persson11, Ivana Piližota12, Cristina Guijarro-Clarke12, Wataru Iwasaki6,13, Odile Lecompte8, Erik Sonnhammer11, David S Roos10, Toni Gabaldón5,7,14,15, David Thybert12, Paul D Thomas16, Yanhui Hu17, David M Emms18, Elspeth Bruford3,19, Salvador Capella-Gutierrez5, Maria J Martin4, Christophe Dessimoz1,2,20,21, Adrian Altenhoff2,22.
Abstract
The Orthology Benchmark Service (https://orthology.benchmarkservice.org) is the gold standard for orthology inference evaluation, supported and maintained by the Quest for Orthologs consortium. It is an essential resource to compare existing and new methods of orthology inference (the bedrock for many comparative genomics and phylogenetic analysis) over a standard dataset and through common procedures. The Quest for Orthologs Consortium is dedicated to maintaining the resource up to date, through regular updates of the Reference Proteomes and increasingly accessible data through the OpenEBench platform. For this update, we have added a new benchmark based on curated orthology assertion from the Vertebrate Gene Nomenclature Committee, and provided an example meta-analysis of the public predictions present on the platform.Entities:
Year: 2022 PMID: 35552456 PMCID: PMC9252809 DOI: 10.1093/nar/gkac330
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Public methods in the QfO Benchmark 2020. Every method is based on different methodological premises (Described in (3) and detailed on the Benchmarking Service: https://orthology.benchmarkservice.org/proxy/projects/2020/) and provides one or multiple kinds of orthologous prediction. One-to-one orthologous pairs (orthologous relationships between single genes), co-orthologous pairs (Pairs of orthologs between species, may involve one or multiple genes in the same species depending on duplication), Orthologous clusters (a group of orthologs or paralogs defined at one taxonomic level and computed by a graph clustering method), Hierarchical orthologous groups (nested groups of orthologs at different taxonomic levels) and Gene Trees. The performance column is an indicator based on the usual performance across all benchmarks. Line coloring is tied to the ‘Performance’ column
|
|
Figure 1.VGNC symbol Benchmark. Results of each method with True Positive Rate (x-axis) as a recall measure and Positive Predictive Value (y-axis) as a precision measure. Note: The axes have been truncated to maximize the spread of the methods on the figure. As a result, the y-axis shows a significantly larger variation in values than the x-axis
Figure 2.Orthologous pairs inferred by individual methods. Number of pairs inferred by public methods included in the benchmarking platform. Subsections of the bars represent the number of methods that share the same pairs. Methods are ranked by the number of pairs they share with other methods (non-green part of the stacked bars).
Figure 3.Pairwise comparisons between predictions of public methods. The heatmap shows the proportion of the pairs inferred by methods on the right side that are recapitulated by methods on the bottom. The heatmap is hierarchically clustered on rows and columns by similarity with the corresponding trees shown.