| Literature DB >> 33959747 |
Bethan Yates1, Kristian A Gray1, Tamsin E M Jones1, Elspeth A Bruford1,2.
Abstract
Multiple resources currently exist that predict orthologous relationships between genes. These resources differ both in the methodologies used and in the species they make predictions for. The HGNC Comparison of Orthology Predictions (HCOP) search tool integrates and displays data from multiple ortholog prediction resources for a specified human gene or set of genes. An indication of the reliability of a prediction is provided by the number of resources that support it. HCOP was originally designed to show orthology predictions between human and mouse but has been expanded to include data from a current total of 20 selected vertebrate and model organism species. The HCOP pipeline used to fetch and integrate the information from the disparate ortholog and nomenclature data resources has recently been rewritten, both to enable the inclusion of new data and to take advantage of modern web technologies. Data from HCOP are used extensively in our work naming genes as the Vertebrate Gene Nomenclature Committee (https://vertebrate.genenames.org).Entities:
Keywords: aggregation; meta-database; nomenclature; orthologs; vertebrates; website
Mesh:
Year: 2021 PMID: 33959747 PMCID: PMC8574622 DOI: 10.1093/bib/bbab155
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Orthology sources in HCOP
| Orthology source | Version | Species data applies to |
|---|---|---|
| eggNOG | Version 5.0 | All species except horse |
| Ensembl | Release 102 | All species |
| HGNC | N/A | Human and mouse |
| HomoloGene | Release 68 | Human, chimp, macaque, mouse, rat, dog, cow, chicken, xenopus, zebrafish, |
| InParanoid | Version 8.0 | All species |
| OMA | Release January 2020 | All species |
| OrthoDB | Version 10.1 | Human, chimp, macaque, mouse, rat, dog, cat, horse, cow, pig, opossum, platypus, chicken, anole lizard, xenopus, zebrafish, |
| OrthoMCL | Version 5 | Human, mouse, dog, chicken, zebrafish |
| NCBI Gene orthologs | N/A | All species except |
| Panther | Version 15 | All species |
| PhylomeDB | Version 4, data are taken from phylome 514 | Human, chimp, macaque, mouse, rat, dog, cow, opossum, platypus, chicken, xenopus, zebrafish, |
| PomBase | N/A | Human and |
| TreeFam | Release 9.0 | All species except cat, |
| ZFIN | N/A | Human and zebrafish |
Figure 1
The HCOP data production pipeline. The pipeline is broken up into several stages: nomenclature/gene resource data update, orthology data update, generation of an ID mapping table which links together the MOD ID, NCBI Gene ID, Ensembl stable gene ID and UniProt identifiers for a specific gene, conversion of the raw orthology data to HCOP orthology assertions which include the appropriate gene ID information from the ID mapping table and finally combination of assertions that share the same NCBI Gene and Ensembl gene IDs to produce a single combined ortholog for each ortholog pair. Each combined ortholog is added to the HCOP orthologs table in the HCOP MySQL database, which is then used to update the public database and FTP site files.
Figure 2
The HCOP web tool interface. The ‘Input’ section shows the updated HCOP search form. The users select a primary species and one or more species that they wish to identify orthologs in. They then select the ortholog resources they wish to include in the search, and the type of search term, e.g. approved symbol, or database identifier, that they are providing. A single search term or list of search terms may be pasted into a text box or uploaded as a file to be used to run the search. The ‘Results’ section shows an example result panel. Information about the query gene appears in the dark blue section at the top of the panel, with each ortholog identified having its own section below this. Basic information about both the query and ortholog genes, as well as links to these genes in other resources, is displayed. Each ortholog section has an additional column labeled ‘Assertion derived from’ that contains a set of icons that represent the orthology sources that support this assignment.