| Literature DB >> 35892968 |
Joshua D Breidenbach1, E Francis Begue Iii1, David J Kennedy1, Steven T Haller1.
Abstract
The increasing incorporation of omics technologies into biomedical research and translational medicine presents challenges to end users of the large and complex datasets that are generated by these methods. A particular challenge in genomics is that the nomenclature for genes is not uniform between large genomic databases or between commonly used genetic analysis tools. Furthermore, outdated genomic nomenclature can still be found amongst scientific communications, including peer-reviewed manuscripts. Therefore, a web application (GeneToList) was developed to assist in gene ID conversion and alias matching, with a specific focus on achieving a user-friendly interface for the non-bioinformatics-savvy scientist. It currently includes gene information for over 38,000 different taxa retrieved from the National Center for Biotechnology and Information (NCBI) Gene resource. Supported databases of gene IDs include NCBI Gene Symbols, NCBI Gene IDs (Entrez IDs), OMIM IDs, HGNC IDs, Ensembl IDs, and 28 other taxa-specific identifiers. GeneToList is available at genetolist.com. The tool is a web application that is compatible with many standard browsers. The gene ID conversion feature of this application was found to outcompete the common gene ID conversion tools. Specifically, it was able to successfully convert all tested IDs, whereas the others were not able to recognize the gene aliases. Therefore, the gene ID disambiguation provided by this application should be beneficial for many scientists dealing with gene data when the uniformity of gene nomenclature is important for downstream analysis.Entities:
Keywords: gene ID; gene nomenclature; web application
Year: 2022 PMID: 35892968 PMCID: PMC9332626 DOI: 10.3390/biology11081113
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Figure 1Overview of the gene information currently available to the GeneToList application.
Results of an example search of inflammation-related genes with GeneToList, demonstrating the disambiguation of gene IDs. Color corresponds to different match types where Green were exact matches between the searched term and the NCBI official symbol, Blue were auto-accepted suggestions where the searched term was similar to an official symbol and Orange were search terms which required alias disambiguation.
| Searched Term | Match Type | Matched Symbol |
|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Results of the gene ID conversion from GeneToList and other common conversion tools.
| Searched Term | GeneToList | g:Convert | DAVID | bioDBnet |
|---|---|---|---|---|
|
| 7040 | - | - | - |
|
| 3576 | - | - | - |
|
| 6347 | - | - | - |
|
| 1401 | 1401 | 1401 | 1401 |
|
| 7124 | - | - | - |
|
| 3577 | 3577 | 3577 | 3577 |
|
| 3579 | 3579 | 3579 | 3579 |
|
| 729,230 | 729,230 | 729,230 | 729,230 |
|
| 4659 | - | - | - |
|
| 7040 | - | - | - |