| Literature DB >> 31984130 |
Abstract
Due to their nature, bioinformatics datasets are often closely related to each other. For this reason, search, mapping and visualization of these relations are often performed manually or programmatically via identifiers or special keywords such as gene symbols. Although various tools exist for these situations, the growing volume of bioinformatics datasets, emerging new software tools and approaches motivates new solutions. To provide a new tool for these current cases, I present the Biobtree bioinformatics tool. Biobtree effectively fetches and indexes identifiers and special keywords with their related identifiers from supported datasets, optionally with user pre-defined datasets and provides a web interface, web services and direct B+ tree data structure based single uniform database output. Biobtree can handle billions of identifiers and runs via a single executable file with no installation and dependency required. It also aims to provide a relatively small codebase for easy maintenance, addition of new features and extension to larger datasets. Biobtree is available to download from GitHub. Copyright:Entities:
Keywords: bioinformatics; identifiers; mapping; search; visualization
Mesh:
Year: 2019 PMID: 31984130 PMCID: PMC6964641 DOI: 10.12688/f1000research.17927.4
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
List of datasets.
| Dataset | Description | Location | Format |
|---|---|---|---|
| ChEBI | ChEBI reference accession data |
| TSV |
| HGNC | Human gene nomenclature |
| JSON |
| HMDB | Human metabolome database |
| XML |
| InterPro | Protein Families |
| XML |
| Literature mappings | Literature pmid, pmcid and doi mappings |
| CSV |
| Taxonomy | NCBI Taxonomy |
| XML |
| Uniparc | UniProt Sequence Archive |
| XML |
| UniProt reviewed | UniProt Knowledgebase reviewed |
| XML |
| UniProt unreviewed | UniProt Knowledgebase unreviewed |
| XML |
| Uniref50 | UniProt sequence clusters |
| XML |
| Uniref90 | UniProt sequence clusters |
| XML |
| Uniref100 | UniProt sequence clusters |
| XML |
| GO | Gene Ontology |
| RDF/XML |
| ECO | The Evidence & Conclusion Ontology |
| RDF/XML |
| EFO | Experimental Factor Ontology |
| RDF/XML |
| ChEMBL | Chemical database of bioactive molecules |
| RDF/XML |
| Ensembl | Ensembl |
| JSON,CSV,
|
| Ensembl Genomes
| Ensembl Genomes Metazoa |
| JSON,CSV,
|
| Ensembl Genomes Plants | Ensembl Genomes Plants |
| JSON,CSV,
|
| Ensembl Genomes Fungi | Ensembl Genomes Fungi |
| JSON,CSV,
|
| Ensembl Genomes Protists | Ensembl Genomes Protists |
| JSON,CSV,
|
| Ensembl Genomes
| Ensembl Genomes Bacteria |
| JSON,GFF3 |